-
h2ibot
Bear edited List of websites excluded from the Wayback Machine (+240, MikeMehlman.net not excluded as of 2023. Fun…):
wiki.archiveteam.org/?diff=52194&oldid=52193
-
h2ibot
JAABot edited List of websites excluded from the Wayback Machine (+0):
wiki.archiveteam.org/?diff=52195&oldid=52194
-
h2ibot
Bear uploaded
File:Abload with statistics - January 2024.png (Front page of [[Abload]] as of January 2024…):
wiki.archiveteam.org/?title=File%3A…statistics%20-%20January%202024.png
-
h2ibot
Bear edited Abload (+149, + [[:
File:Abload with statistics - January…):
wiki.archiveteam.org/?diff=52197&oldid=52191
-
h2ibot
Bear uploaded
File:Abload with statistics - January 2024.png (corrected font size to prevent overflowing text):
wiki.archiveteam.org/?title=File%3A…statistics%20-%20January%202024.png
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine (+13, linked "[[Abload]]"):
wiki.archiveteam.org/?diff=52199&oldid=52195
-
h2ibot
Bear uploaded
File:Abload shutdown announcement - April 2024.png (Shutdown notice shown on [[Abload]] since April…):
wiki.archiveteam.org/?title=File%3A…announcement%20-%20April%202024.png
-
h2ibot
Bear uploaded
File:Abload shutdown announcement - April 2024.png (language switched to English):
wiki.archiveteam.org/?title=File%3A…announcement%20-%20April%202024.png
-
h2ibot
Bear edited Abload (+177, /* Status */ +…):
wiki.archiveteam.org/?diff=52202&oldid=52197
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine (+100, "EHE.com" and 2 more - also from TRS.com, as…):
wiki.archiveteam.org/?diff=52203&oldid=52199
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine (+188, able2know.com not excluded as of 2013-10-27):
wiki.archiveteam.org/?diff=52204&oldid=52203
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+55, from the Japanese deep web:…):
wiki.archiveteam.org/?diff=52205&oldid=52021
-
HP_Archivist
JAA: I looked at several open source extension for Chrome. Tried downloadthemall, but it forces the save-as dialog to be saved by the user.
-
HP_Archivist
Also, give that page a moment to load
-
HP_Archivist
Site is slow
-
JAA
From -ot: > Hey JAA: I want to grab all the reference files from this page and put them into an item, perhaps sorted by folders. Any suggestions how to download them all enmasse? I have all the hardlinks in a txt, Just need a way to grab them all at once.
-
JAA
-
JAA
The Lounge--
-
eggdrop
[karma] 'The Lounge' now has -12 karma!
-
HP_Archivist
Site isn't at risk. But it would be nice to mirror all that into an item
-
JAA
I'd do that with wget, probably. Needs a few options to get decent filenames though.
-
HP_Archivist
-
eggdrop
-
fireonlive
hmmmm
-
JAA
Something something --content-disposition --trust-server-names, not sure what else.
-
HP_Archivist
I don't have much experience with wget
-
JAA
HP_Archivist: Huh, your list doesn't have the 'click=1' param I'm getting on that page.
-
HP_Archivist
JAA: I removed that. It's not part of the hardlin, so to speak
-
HP_Archivist
hardlink*
-
HP_Archivist
Removed on all of them
-
JAA
I suppose, but it's important for AB archival because the links on the calibration page won't work in the WBM otherwise.
-
JAA
Anyway, looks like the options above are the only ones needed.
-
HP_Archivist
Oh, that's a good point. I actually wasn't thinking about that earlier. I was thinking from the angle of capturing just the direct links, excluding the page.
-
HP_Archivist
Oops.
-
HP_Archivist
Thanks for pointing that out though
-
JAA
So trim your list down to just those download links, then `wget -i list --content-disposition --trust-server-names`, and you'll get E011007.txt, E011023.txt, etc.
-
JAA
We can just run the entire site through recursively anyway.
-
HP_Archivist
Okay thanks for the tip. And yeah, if you want. I did the site not that long ago, but wasn't sure of all of these files were grabbed, too.
-
JAA
Looks like it was run before, but that was two years ago.
-
HP_Archivist
Hm. Thought I last did that much more recently, maybe not. Can't remember. Either way, cool. I'll have a look at wget and see how far I get with your args
-
JAA
Some other subdomains, but not www.
-
OrIdow6
!tell tourist any idea of whether the booru admin might be amenable to helping to archive it, if nothing else by clarifying what that message means?
-
eggdrop
[tell] ok, I'll tell tourist when they join next
-
fireonlive
-
fireonlive
"We’re thrilled to announce we’re partnering with @OpenAI to bring best in class technical knowledge and the world’s most popular LLM models for AI development together! This groundbreaking partnership with OpenAI will drive our mission to empower the world to develop technology through collective knowledge."
-
fireonlive
-
fireonlive
dumps should be good enough if people get upsetti right?
-
fireonlive
(images?)
-
masterx244|m
for images a AB !ao < onto a urllist should do the job to catch them. any essential image is on i.stack.imgur.com (site rule to use the builtin image upload which ends on a paid(!) imgur contract) so the URLs are easy to catch
-
c3manu
fireonlive: say good bye to useful stackoverflow answers :)
-
c3manu
gosh the internet really is going downhill rn
-
masterx244|m
another domain that matters: sstatic.net (caught a answer on travel SE where a answer pic is hosted there)
-
c3manu
-
JAA
c3manu: There are useful Stack Overflow answers‽
-
JAA
But yeah, we should probably archive them anyhow. → #stackunderflow
-
c3manu
JAA: every few months or so i find one :)
-
c3manu
..and yes, i'm immediately archiving them ;)
-
HP_Archivist
-
JAA
:-)
-
HP_Archivist
:)
-
fireonlive
<JAA> Closed. This question is off-topic. It is not currently accepting answers.
-
JAA
fireonlive: No no, that's on the second question. The one that it's marked as a duplicate of even though they're unrelated.
-
fireonlive
ahh yes x3
-
JoBot__
Does any part of the TestFlight Crashland exist still? I am desperate to find an important (at least to me) piece of history in the form of a beta version of a game I played long ago. Any help would be appreciated.
-
JoBot__
The associated package names are "com.blocksworld.editor", "com.blocksworld.player", and "com.boldai.blocksworld.dev".
-
nicolas17
JoBot__: the torrents of the WARCs are still alive
-
JoBot__
I have never used torrents before. Can you tell me a bit more about what I need to do, and provide links to any necessary resources?
-
nicolas17
I mean that would let you download the raw 1TB of data
-
nicolas17
getting data from them is still gonna be a problem :p
-
h2ibot
-
nicolas17
let's see if I can find those apps in my index at least...
-
JoBot__
Hmm, alright. Thank you.
-
h2ibot
-
nicolas17
looks like the testflight pages for "com.blocksworld.player|1.0 (1.0) #2" and "com.blocksworld.editor|1.0.3 (1.0.3) #22" were in the archive but not the actual .ipa files
-
JoBot__
Oh, so they wouldn't have even been retrievable even when the TestFlight Crashland was active?
-
nicolas17
that's what it seems
-
JoBot__
And "com.boldai.blocksworld.dev" was not present at all?
-
nicolas17
oh I didn't check boldai
-
JoBot__
Oh, okay.
-
nicolas17
"com.boldai.raze|1.1 #3|Raze" but also no .ipa file
-
JoBot__
Raze is separate from what I'm searching for but it's also a piece of lost media.
-
nicolas17
only *boldai* in the db
-
JoBot__
Pretty much everything Boldai ever made is lost media.
-
JoBot__
-
JoBot__
Well, alright. Thanks for trying to help.
-
JoBot__
It unfortunately does appear that Boldai's games will remain lost media at the moment.
-
JoBot__
Unless someone comes forth with an iPad they haven't used since 2012, it may stay like that forever.
-
JoBot__
Bye.
-
thuban
ok, sbnation podcasts as listed on
transfer.archivete.am/XmM2V/sbnation_podcasts.txt should all be done, with the following caveats:
-
eggdrop
-
thuban
-
eggdrop
-
thuban
-
eggdrop
-
thuban
-
eggdrop
-
thuban
- at least one podcast has released new episodes (in part concerning the loss of sbnation support) since i scraped it; i have not attempted to handle this
-
thuban
- i got the list of affected podcasts off
podbean.com/podcast-network/sb-nation-podcasts and don't actually know whether it's complete or accurate
-
thuban
!tell icedice i went through the scanlation discord scrape that Vokun did; have requested it in #//, submitted relevant urls to projects, and checked for custom blogspots (there were none)
-
eggdrop
[tell] ok, I'll tell icedice when they join next
-
h2ibot
JustAnotherArchivist edited Deathwatch (+241, /* 2024 */ Add PlanetSquires):
wiki.archiveteam.org/?diff=52209&oldid=52186
-
h2ibot
Myusernameisanything edited Sploder (+0, Raven (Flashpoint admin) says: "Sploder in FP…):
wiki.archiveteam.org/?diff=52210&oldid=52187
-
h2ibot
-
h2ibot
MrScottyPieey edited Me at the zoo (+141, added capation,):
wiki.archiveteam.org/?diff=52212&oldid=52189
-
h2ibot
-
h2ibot
JustAnotherArchivist edited Sploder (-6, Revert shutdown notice change, which should be…):
wiki.archiveteam.org/?diff=52214&oldid=52213
-
Notrealname1234
Look i'm not connecting to a bunch of channels anymore
-
fireonlive
why
-
Notrealname1234
Because yes
-
Notrealname1234
Why is the ctcp command not working
-
Notrealname1234
fireonlive: ^
-
fireonlive
?
-
fireonlive
/ctcp fireonlive VERSION
-
Notrealname1234
Doesn't work
-
Notrealname1234
What's the ctcp command fireonlive
-
fireonlive
check status window
-
Notrealname1234
Status on user?
-
fireonlive
server/your nick
-
JAA
Not supported, apparently:
MCMrARM/revolution-irc #309
-
Notrealname1234
Bruh moment
-
Notrealname1234
Imma switch to WebIRC so i can ses it
-
Notrealname1234
see it
-
Notrealname1234
Done
-
JAA
Also, this is -ot material.
-
Notrealname1234
Gonna move it there