-
marktheworst
reason why I'm asking is this page alleges that the owner of divested.dev somehow had its history wiped from Wayback
opinionplatform.org/divestos/changes-happen.html
-
h2ibot
JAABot edited List of websites excluded from the Wayback Machine (+0):
wiki.archiveteam.org/?diff=51784&oldid=51783
-
pokechu22
(e.g. trying to save web.archive.org/save gives the message "This URL is in the Save Page Now service block list and cannot be captured. Please email us at "info⊙ao" if you would like to discuss this more." but there are functional captures from elsewhere e.g.
web.archive.org/web/20231210231626/https://web.archive.org/save)
-
JAA
It's not impossible. I've heard of 'please remove snapshots within this time window from the WBM' before. The data will still be somewhere, but not publicly accessible.
-
JAA
pokechu22: I've been wondering whether we should create a list of those on the wiki, too.
-
JAA
And perhaps we should record this kind of time window exclusion somewhere, too, although it's much harder to pin down than the other ones.
-
pokechu22
I guess we should do an archivebot job of divested.dev too
-
marktheworst
So is this a time window exclusion + server configured to error when WBM crawls/SPN?
-
JAA
I have no idea how to confirm what it is. But it feels similar to those time window exclusions I've seen before.
-
JAA
I think SPN got through based on its output.
-
JAA
The 'delay in registering' error might be *because* of the playback exclusion-ish thing.
-
marktheworst
yup I don't have high hopes of my snapshot appearing on WBM
-
pokechu22
I saw that same "unavailable for archiving" on a URL on
bn-ent.net earlier today which I accidentally loaded (along the lines of
web.archive.org/web/*/http://www.bn-ent.net/something where there were no prior captures) but now I'm just getting normal "not archived that URL" messages. (
bn-ent.net is completely offline, so it's not going to show
-
pokechu22
the "available on the web" or "unavailable" messages)
-
pokechu22
anyways
divested.dev and
opinionplatform.org will run in archivebot (and I also put
spotco.us in there but the site is dead so that won't save anything other than a note in a meta-warc that the site was dead at the time, which won't appear on web.archive.org)
-
marktheworst
sounds good
-
h2ibot
JustAnotherArchivist edited List of websites excluded from the Wayback Machine (+45, Clarify irrelevance of www prefix and remove…):
wiki.archiveteam.org/?diff=51785&oldid=51784
-
h2ibot
-
h2ibot
Zs3o8jtr29 edited Mobile Phone Applications (+24, /* Android Applications */ APKPure is not so…):
wiki.archiveteam.org/?diff=51787&oldid=51781
-
h2ibot
-
pabs
FTR: I'm going through vesz' list, doing the ones shutting down in Feb, or supposed to have been shutdown already
-
pabs
if someone else could put the rest in Deathwatch, that would be good
-
marktheworst
it's been a couple hours now and that snapshot I supposedly made of divested.dev still isn't showing up in WBM
-
marktheworst
I think it should count as an excluded website
-
marktheworst
could be added with a note explaining the weird exclusion that doesn't give an error but there's no snapshots
-
pokechu22
-
JAA
Yeah, waiting for the AB data is probably the most reliable way to tell.
-
marktheworst
Oh I didn't know AB will show up in WBM
-
JAA
If all WARCs in the same item have been derived and the last ones show up in the WBM but divested.dev doesn't, we'll know.
-
marktheworst
Time to wait then
-
nulldata
VICE has removed the rouge episode of CYBER
-
Vokun
Has anyone else had issues with Hetzner locking their ip for "The IP address(es) was/were used to perform scans on other servers."?
-
Vokun
All I do on it is AT docker files
-
nicolas17
nulldata: wew
-
thuban
too late :3
-
nicolas17
hmm doesn't seem to work on WBM/youtube
-
nicolas17
there's multiple captures, maybe one works
-
nicolas17
(I see some SPN captures)
-
pokechu22
-
nicolas17
I get "The Wayback Machine does not have this video archived"
-
nicolas17
I thought we had archived the whole channel? :/
-
nicolas17
pokechu22: that one also seems to work on WBM
-
nulldata
I know at the very least I threw the specific YT video into down the tube
-
nicolas17
I know the episode was archived but manual saves can have bad discoverability... it's good to know at least one of its original URLs work on WBM
-
nicolas17
nulldata: oh, maybe it wasn't indexed yet? it's all quite recent
-
pokechu22
Yeah, I did some testing with
shows.acast.com/cyber/episodes/the-end-of-vice in the WBM at the time to make sure it did play back properly
-
nicolas17
MVP
-
fireonlive
pokechu22++
-
eggdrop
[karma] 'pokechu22' now has 7 karma!
-
pokechu22
specifically there was a MP3 URL it tried to play that hadn't been saved (or hadn't gone into the WBM yet) so I saved it via SPN and things worked
-
thuban
i think the ab asset jobs haven't been indexed yet
-
thuban
(they're on ia but when i check urls from the log in the wbm it only shows spn captures)
-
nicolas17
also, anyone here who has background on "what is Vice" and "what exactly is being shut down"? we really need a
wiki.archiveteam.org/index.php/Vice
-
thuban
pabs: i'll add vesz's list to deathwatch in a few hours if no one gets to it first
-
thuban
(in the meantime, if anyone wants to help find references for the shutdown dates, that would be helpful)
-
marktheworst
pokechu22: Looks like the AB jobs finished and files are up at those links, none of them show up in WBM yet
-
pokechu22
Yeah, it generally takes a while for things to show up on the WBM even after files are on archive.org
-
h2ibot
JustAnotherArchivist edited Imgur (+329, Add new behaviour on accessing image URLs directly):
wiki.archiveteam.org/?diff=51789&oldid=50521
-
JAA
^ If anyone didn't get the memo yet: stop using Imgur for sharing images. :-)
-
pokechu22
They've definitely been doing that beforehand inconsistently
-
JAA
I've never seen it for requests without a Referer header.
-
JAA
Maybe they've been A/B testing at some point?
-
JAA
Anyway, #imgone for that.
-
h2ibot
JustAnotherArchivist edited Imgur (+201, Redirect is based on Accept header):
wiki.archiveteam.org/?diff=51790&oldid=51789
-
missaustraliana
why do some users not have the warrior icon next to them in the latest uploads
-
JAA
Because they run the project images, not the warrior.
-
missaustraliana
ohh the onces on gh
-
JAA
Well, the images are on our container registry, but yeah.
-
missaustraliana
also, where does the warrior grab its urls from?
-
missaustraliana
like im curious whats the url?
-
pokechu22
some API on the tracker I think
-
JAA
It gets items from the tracker, yes.
-
pokechu22
probably not one that's a good idea to use if you're not the warrior script that will actually process the URLs
-
JAA
Correct, please don't mess with that.
-
missaustraliana
yeah im not
-
missaustraliana
was just curious
-
JAA
The warrior retrieves a project list, you select one of the projects. On the project (warrior or otherwise), the worker contacts the tracker and requests items, then reports back once they're done and uploaded.
-
missaustraliana
wait, another question. if a batch of urls isnt done, say power outage or loss of connection, does the batch go back into the pool or does it get marked done and wont ever be done
-
JAA
-
JAA
It'll eventually get released and reclaimed by another worker, yes.
-
JAA
Details depend a bit on the project.
-
JAA
But usually, any item that hasn't been completed after some predefined amount of time (the TTL, time to live) becomes eligible for reclaiming.
-
missaustraliana
URLs. thats what im running. the reason im asking about it is because i have the docker running on my laptop.
-
JAA
Don't know the exact config there, but it definitely does reclaiming.
-
JAA
Sometimes, we also manually release claims, which is separate. But that's rare these days.
-
missaustraliana
like, personal laptop. so im moving it around, closing the lid.
-
missaustraliana
etc
-
JAA
Are you running the container directly on your OS or using a VM?
-
missaustraliana
through docker so on os
-
JAA
Ok, then clock issues should be minimised, I guess.
-
missaustraliana
okay last question. how can i overcome the cors issue with socket.io? im trying to make a widget on my website that updates my upload stats in realtime
-
missaustraliana
when using the tracker socket i get a cors error then 200ok
-
JAA
Oh brilliant, I get to reuse this from a few days ago:
transfer.archivete.am/inline/hymYo/cors.png
-
missaustraliana
HELP
-
missaustraliana
so theres no workaround with your socket?
-
JAA
I don't know.
-
JAA
All I know is CORS is a pain.
-
missaustraliana
righto. thanks!
-
aninternettroll
CORS is a security feature. If the socket owner doesn't want you to use it then you can't do anything with just a browser
-
fireonlive
the real answer is to proxy the socket.io from your own origin
-
fireonlive
:p
-
fireonlive
but yeah
-
h2ibot
Pokechu22 edited Jira (+2263, archived projects):
wiki.archiveteam.org/?diff=51791&oldid=51759
-
immibis
aaronmbushnell.com should probably get archived, yeah?
-
immibis
this is likely the same guy who killed himself in front of the israel embassy
-
thuban
deathwatch edit has been delayed for reasons, hoping to still get to it later today
-
arkiver
hi all, i may have missed messages on feb 24/25/26. or at least my logs indicate some stuff may be missing. if i was pinged on those dates, please do ping me again
-
arkiver
so, what is going on with subscene?
-
arkiver
-
arkiver
JAA: do you know if we got the
honeycodecommunity.aws forums?
-
immibis
the guy from
aaronmbushnell.com says he is NOT the one who set himself on fire
-
h2ibot
OrIdow6 uploaded
File:Poptropica shutdown hoax announcement.jpeg (Apparently faked shutdown announcement for the…):
wiki.archiveteam.org/?title=File%3A…shutdown%20hoax%20announcement.jpeg
-
h2ibot
OrIdow6 created Shutdown rumors, hoaxes, and scares (+1034, I could swear there was another of these, but I…):
wiki.archiveteam.org/?title=Shutdow…umors%2C%20hoaxes%2C%20and%20scares
-
OrIdow6
-
OrIdow6
been finalized
-
OrIdow6
Just searching, not heard of this site before
-
arkiver
OrIdow6: thank you!
-
JAA
arkiver: Looks like there was an AB job for
honeycodecommunity.aws in August, but that's all I know.
-
Darken
Could someone archive
seriouswebdesign.co.uk with archivebot please (limited coverage)
-
pokechu22
Darken: done
-
Darkine
Is Gitorious mirror dead? I need it for obscure Toshiba G900 Android kernel sources that as I know only existed there.
-
pokechu22
seems like
gitorious.org is dead for me but I'd assume the data is also on archive.org somewhere...
-
Darkine
it is not
-
Darkine
and seems like archive died just in moment when i needed it
-
pokechu22
hmm, and
wiki.archiveteam.org/index.php/Gitorious doesn't seem to indicate any other place where it's kept...
-
pokechu22
-
pokechu22
-
pokechu22
-
Darkine
yes, thank you, it is here
-
h2ibot
Petchea edited Tumblr (+65, /* History */):
wiki.archiveteam.org/?diff=51794&oldid=51767
-
h2ibot
Boofdev edited Pomf.se/Clones (+521, Moved almost everything to the list of dead…):
wiki.archiveteam.org/?diff=51795&oldid=51788
-
Darken
pokechu22 thank you
-
fireonlive
Darken = fireisbro?
-
Darken
Yeah I had to use a temp name because I couldn't sign into this one (has no relation to your name lol)
-
fireonlive
ah :3
-
Darken
Should have got access to this account sooner so I get more known here instead of using a different name/alias haha
-
Darken
hopefully I didn't mess up #telegrab lol, its still scanning all the new messages sent
-
fireonlive
nah, that's ok
-
fireonlive
it's a known issue and looks like it'll get fixed now :p
-
fireonlive
plus bot can process other stuff in the mean time
-
JAA
Gitorious has been dead or broken for at least a couple years by now.
-
JAA
And yeah, would be nice to get it all onto IA.
-
fireonlive
gitorious.. thats a name i haven't heard in a while..
-
missaustraliana
3 million =)
-
missaustraliana
-
Darken
what are these stats for?
-
missaustraliana
my website. its just for fun
-
Darken
ah
-
missaustraliana
no real point. its so people can go "wow"
-
JAA
Number go up!
-
imer
we do like numbers that go up here
-
missaustraliana
i know its nothing like the billions others have, but for someone who has a passion in archival, its massive.
-
nulldata
-
missaustraliana
isnt yuzu open source?
-
nulldata
Probably should start archiving anything related to Switch emulation and Yuzu
-
missaustraliana
once a project is made, ill get all my machines to work on it
-
JAA
Anything unofficial vaguely related to Nintendo is always at risk of spontaneous combustion.
-
nulldata
missaustraliana - it's not something a separate project would be created for. This is just throwing related sites into ArchiveBot and gitgud/swh
-
missaustraliana
ahh
-
missaustraliana
but do you think yuzu will collapse due to nintendo?
-
nulldata
No idea - haven't read the complaint yet. Interesting the repo is still up. Either Nintendo hasn't sent a DMCA complaint for the repo itself or GitHub hasn't processed it yet.
-
missaustraliana
ill download the latest commit on main branch
-
balrog
do we have a way of backing up github users? someone wants to run github-backup on the user/org?
-
JAA
#gitgud (already submitted yuzu-emu and related users)
-
pokechu22
flatpak.yuzu-emu.org and
flatpak.citra-emu.org - is there something we can do with these?
-
fireonlive
ah fuck
-
balrog
worth grabbing ryujinx too
-
fireonlive
indeed; gitgud has it coming up and it's running in AB now
-
pokechu22
youtube.ryujinx.org redirects to a youtube channel - there's probably a channel for yuzu too that should be saved
-
pokechu22
and citra