-
fireonlive
weird cultural thing, the delete everything
-
Pedrosso
Yeah
-
Pedrosso
But it's a cultural thing?
-
project10
seemingly lots of japanese folks doing it recently, is all
-
Pedrosso
I suppose you as an artist could see it like "The point wasn't the product but the fun of the journey" But like...
-
Pedrosso
No
-
Pedrosso
Would this require a sign-in cookie for archival? I can't find any sitemap. Also new logins were disabled a few days ago
-
JAA
No, there are definitely public parts.
-
Pedrosso
Awesome.
-
Pedrosso
Should it be fed to AB then?
-
JAA
Discovery is still a problem.
-
JAA
Here's a random example that was run through AB before:
tinyletter.com/feministlib
-
pabs
WBM wildcards listings should be enough?
-
JAA
It'd certainly be a start, but I doubt it'd be complete.
-
pabs
oh you mean discovery pre-archiving, I was thinking afterwards
-
JAA
Ah, yeah
-
pabs
hmm, bing scraping perhaps?
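
A rough sketch of the Wayback Machine wildcard-listing idea mentioned above, using the public CDX endpoint with a prefix match. The function names are made up for illustration, and treating the first path segment as the newsletter name is an assumption based only on the tinyletter.com/feministlib example. As noted in the discussion, this only finds what was already captured, so it is unlikely to be complete.

```python
from urllib.parse import urlencode, urlsplit

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def cdx_prefix_query(prefix):
    """Build a CDX query URL listing captured URLs that start with `prefix`."""
    params = {
        "url": prefix,
        "matchType": "prefix",   # same effect as a trailing wildcard
        "fl": "original",        # only return the original URL column
        "collapse": "urlkey",    # one row per distinct URL
        "output": "json",
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def first_path_segments(original_urls):
    """Collect the first path segment of each URL (assumed to be the newsletter name)."""
    names = set()
    for url in original_urls:
        if "://" not in url:
            url = "http://" + url  # let urlsplit see a host and a path
        segments = [s for s in urlsplit(url).path.split("/") if s]
        if segments:
            names.add(segments[0])
    return sorted(names)
```

The query URL can be fetched with any HTTP client; the resulting list of names would then still need to be merged with other discovery sources before feeding anything to AB.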
-
DogsRNice
Pedrosso: I wonder if there are any other temporary accounts like this for the remake?
-
Pedrosso
?
-
DogsRNice
oh the super mario rpg remake
-
DogsRNice
the one you linked
-
Pedrosso
Oh
-
Pedrosso
I linked a lot of things recently so... Where exactly?
-
DogsRNice
i mean in general
-
Pedrosso
I don't recall linking any super mario rpg remake things. Either I have dementia /j, you've got the wrong person, or it's in one of the URL-lists from other sites
-
Pedrosso
could you send the link though? I'm curious now
-
DogsRNice
oops
-
DogsRNice
yeah wrong person lol
-
Pedrosso
DogsRNice: This might answer the question, might not
nitter.vloup.ch/search?q=%23SuperMarioRPG
-
unsimply99
is it possible to look through the google sites archives for a specific google site
-
arkiver
unsimply99: it's in the Wayback Machine, so if you have the URL you can look it up there
-
nulldata
Can someone please throw pbdwest.com and www.pbdgolf.com into AB? Related to the Open Hand Foundation scandal
-
h2ibot
JustAnotherArchivist edited Blogger (+3, Fix regex to catch blogger.com URLs that aren't…):
wiki.archiveteam.org/?diff=51239&oldid=51212
-
h2ibot
JustAnotherArchivist edited Template:CTA URL lists (+146, Clarify regex type and add comment on…):
wiki.archiveteam.org/?diff=51240&oldid=50376
-
JAA
Sanqui: The Webzdarma processing finished some hours ago, and I see that uloz.to is still up and serving files for now, so here's the full output (not deduped against the previous list since I didn't keep track of which WARCs had been done by that point):
transfer.archivete.am/FjICx/webzdarma-uloz.zst
-
JAA
As mentioned, there were some errors, I could hunt those down, but this should keep you busy if you ran out of stuff. :-)
-
JAA
Approximately 116 of the 930 files had some sort of issue.
-
JAA
Which doesn't mean they're missing entirely from that output, but it might be incomplete.
-
Sanqui
JAA: Thank you! They didn't kill downloads yet so we'll process them
-
lindowsME
the google account wipe means older videos will lose almost all their comments, right? 90-9-1 rule and all. I just realized, too late... ?
-
flashfire42|m
Oh fuck
-
lindowsME
it's basically the social-networking era of the site disappearing. i know a lot of people don't care for yt comments but to me the videos feel 'dead' without their 'cultural context'
-
lindowsME
homepages, video threads and annotations,
-
Sanqui
uloz.to is dead
-
Sanqui
thanks to those who contributed links
-
imer
RIP, hopefully you managed to save a good bunch
-
Sanqui
imer: I hear it's north of 30 TB.
-
immibis
"Image links will likely insta-rot" - Kyle Pollard (staff)
-
fireonlive
“Imgur no longer provides an enterprise image hosting product” surprise.
-
pokechu22
Did anyone other than stackexchange even use that?
-
fireonlive
i guess they did it in 2010, but surprised they didn’t use a cname or something of a domain they themselves owned
-
fireonlive
did it -> started using imgur
-
immibis
someone needs to make a service that redirects obsolete URLs to archive URLs
-
immibis
in the browser
-
DogsRNice
the wayback machine extension already does this
-
fireonlive
oh right, it replaces 404s doesn't it?
-
fireonlive
that's like a blast from the past1
-
fireonlive
s/1/!/
-
immibis
the wayback machine isn't always the best archive. it could just as well be something like nitter or invidious
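
A minimal sketch of the "redirect obsolete URLs to archive URLs" idea, assuming a simple lookup table decides where a dead URL should go. The `web.archive.org/web/<timestamp>/<url>` form is the Wayback Machine's normal replay address; the `nitter.example` and `invidious.example` hosts and the function name are placeholders, not real instances or an existing tool.

```python
from urllib.parse import urlsplit

WAYBACK_PREFIX = "https://web.archive.org/web/"

# Hypothetical per-host alternatives; the instance names are placeholders.
ALTERNATIVE_FRONTENDS = {
    "twitter.com": "nitter.example",
    "www.youtube.com": "invidious.example",
}


def rewrite_dead_url(url, timestamp=""):
    """Return an alternative-frontend URL if one is configured, else a Wayback URL."""
    parts = urlsplit(url)
    alt_host = ALTERNATIVE_FRONTENDS.get(parts.hostname)
    if alt_host is not None:
        # Keep scheme, path and query; only swap the host.
        return parts._replace(netloc=alt_host).geturl()
    # Without a timestamp the Wayback Machine picks a capture itself.
    prefix = WAYBACK_PREFIX + (timestamp + "/" if timestamp else "")
    return prefix + url
```

For example, `rewrite_dead_url("http://underhanded-c.org/")` gives a Wayback address, while a twitter.com link is pointed at the placeholder nitter host instead.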
-
fireonlive
i used to use archivebox.io - but it uses webrecorder/warcio and therefore is bad :(
-
fireonlive
i'd like my warcs standards-compliant please
-
fireonlive
(by people who care about being standards-compliant)
-
fireonlive
oh! that's a viewer
-
fireonlive
i thought it was a creator lol
-
nicolas17
oh crap @ stackexchange
-
JAA
On the plus side, there's only one dataset we need to go through to find all of the images.
-
nicolas17
they're also 5-char IDs right?
-
nicolas17
should I prepare for another round of bruteforcing?
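
To put a number on what bruteforcing 5-character IDs would mean, here is a small sketch. The alphabet (digits plus upper- and lower-case ASCII letters) is an assumption, not something confirmed in the discussion, and the function names are illustrative only.

```python
import itertools
import string

# Assumption: IDs are case-sensitive alphanumerics.
ALPHABET = string.digits + string.ascii_letters
ID_LENGTH = 5


def keyspace_size(alphabet=ALPHABET, length=ID_LENGTH):
    """Number of possible IDs of the given length over the alphabet."""
    return len(alphabet) ** length


def candidate_ids(alphabet=ALPHABET, length=ID_LENGTH):
    """Lazily yield every candidate ID in alphabet order."""
    for combo in itertools.product(alphabet, repeat=length):
        yield "".join(combo)
```

Under that assumption the keyspace is 62^5 = 916,132,832 candidates, which is why working from a single existing dataset of image links, as JAA points out, is far cheaper than enumerating.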
-
imer
ew > Image links will likely insta-rot, I don't expect we'll get redirects from Imgur but I'll ask them
-
nicolas17
there goes matrix
-
fireonlive
nicolas17: yeah, hit a limit and it had to be raised
-
fireonlive
imer: you'd think they'd offer a cname at least after 14y lol
-
imer
10£ vps with a redirect set up is breaking the bank
-
fireonlive
xD yep
-
imer
$ €
-
imer
pick your poison^
-
fireonlive
both have a sad conv. rate to CAD atm :(
-
JAA
10 money units
-
imer
isn't CAD always a bit sad? not as sad as aussie bucks but still
-
fireonlive
close to 1:1 when?
-
fireonlive
ye
-
fireonlive
it was very briefly almost 1:1 or maybe it hit it with USD? but never too amazing
-
fireonlive
feels kinda lower lately though
-
thuban
success story: underhanded-c.org is tragically defunct, but archivebot got everything <3
-
thuban
(even though the last post post-dates the last targeted job--it must have been an outlink somewhere)
-
Pedrosso
Awesome
-
fireonlive
:D