-
JAA
Probably ~2 hours for the qwarc grab of He-Man.org.
-
project10
🤞
-
JAA
He-Man.org thread page archival is done.
-
JAA
I got almost 113k threads and 19.5k login pages, which adds up to the number on the homepage. So the AB job missed a fair bit, somehow.
-
JAA
-
JAA
So maybe that's why AB didn't get everything.
-
fireonlive
ahh :(
-
JAA
I've fetched the groups and discussions, too. Group photos are only accessible to group members.
-
JAA
Unless someone has an account to use for attachments, that should be everything that's public.
-
JAA
He-Man.org is down now, serving a shutdown message with a self-signed cert.
-
JAA
Happened between 05:29 and 05:33, and I have all posts up to the former.
-
JAA
(The self-signed cert thing only started at around 05:44 it seems.)
-
» thuban golf claps
-
h2ibot
JustAnotherArchivist edited Deathwatch (+126, /* 2023 */ Add Ohio History Central):
wiki.archiveteam.org/?diff=51147&oldid=51131
-
h2ibot
JustAnotherArchivist edited Blogger (+2, Endangered and upcoming):
wiki.archiveteam.org/?diff=51148&oldid=51142
-
imer
good job JAA :)
-
pabs
"Placemark is going open source and shutting down"
macwright.com/2023/11/13/placemark
-
A_
!help
-
vokunal|m
Welcome. Need help with something?
-
A_
Hi is there FAQ how to search the archive?
-
vokunal|m
Nearly everything here goes to The WayBack Machine at Archive.org
-
A_
OK. thank you for answering
-
vokunal|m
The wiki has a list of all the projects if you really want to try and sift through it manually, but no one would reccomend it. It's all in WARC files
-
vokunal|m
-
magmaus3
-
Scen
transfer.archivete.am/7LvI8/static.…re.com-paths-postcards-2016.txt.zst - Spore.com postcards links (not the files themselves, but the directories). 3 numbers must be guessed to get actual IDs of them
-
joepie91|m
magmaus3: wow, every time opensubtitles finds a way to be even shittier
-
joepie91|m
for any archival attempts, note that they insert advertising into the subtitle files themselves
-
fireonlive
ah right, you'd want a VIP account.. though I guess they may rate limit that
-
fireonlive
i like how 'open'ai and 'open'subtitles aren't very open
-
fireonlive
:p
-
joepie91|m
yes, they do
-
fireonlive
ah ok
-
joepie91|m
unfortunately they also have a shitton of subtitles that are not available anywhere else
-
joepie91|m
they have been so entrenched as The Subtitle Source in the scene for so long that it was the default place where people uploaded their subs
-
fireonlive
:\
-
sepro
-
sepro
Looking for this post, I found another one where it seems like someone made an update
reddit.com/r/DataHoarder/comments/1…sorg_dump_1_million_subtitles_23_gb
-
Pedrosso
Can steamcommunity.com communities be archived by the AB? I see more than a few cases run but I wanna make sure. I'm thinking this could be run? (Spore)
steamcommunity.com/app/17390
-
Pedrosso_
(also I give up on using the desktop irc for now- wow)
-
JAA
I'm pretty sure it can't archive them completely. There's a lot of scripting going on.
-
JAA
It does grab something though IIRC.
-
Pedrosso_
What would it be grabbing?
-
JAA
¯\_(ツ)_/¯
-
Pedrosso
Then, could it be run through the AB?
-
Pedrosso
checking over the other steam ones, might as well not
-
Pedrosso
If I were to be able to give a list of urls for all sporepedia items, that is excluding all missing URLS; would it be better to ask some service like AB or even the warriors to grab it, or could it be better to do on one's own?
-
thuban
Pedrosso: can you give an example of a sporepedia item url?
-
Pedrosso
It's been done here earlier, but yes. Now I specifically mean the png statics.
spore.com/static/thumb/500/226/147/500226147573.png
-
Pedrosso