-
Doranwen
OrIdow6: Pretty much all of ff.n has been backed up by various people in the fic archival community. I have entire fandoms myself, but only the ones I'm interested in (and for my fandoms, only the English langauge - I'm not interested in being the repository of everything ever, lol). But I know of someone who's already dumped quite a bit to IA.
-
Doranwen
But it's anyone's guess how the ff.n situation will progress, lol. Their latest update was here:
x.com/FictionPress/status/1812241025928409122
-
eggdrop
-
Doranwen
Someone on r/fanfiction already posted a DNS workaround. I haven't tested it as I don't visit the site all that often. But if it's not back up tomorrow (unlikely given it's a weekend), I'll probably try editing my hosts file and see if I can make it work that way, so I can get the latest links to run through fichub-cli.
-
Doranwen
Since my next update of some fandoms is due tomorrow.
-
yarrow
fanfiction.net is back up for me
-
Ryz
Mmm, has there been attempts on archiving FanFiction.net? Though the Cloudflare stuff is...a problem, and remains a problem
-
Flashfire42
remind me again are we doing something about
abload.de
-
Doranwen
Ryz: No idea, though it'd be nice if someone archived the reviews. At this point if the site went down, I'd be able to read just about all the fics easily enough, but that'd be it. (Fortunately, it's now back to its usual clunky normal. Complete with Cloudflare being a pita when trying to go read a fic on there, which just reminded me to go read it from my offline stash instead, haha.)
-
yarrow
To illustrate the sorry state of podcast preservation: Barack Obama and Michelle Obama have both had podcasts and neither of them are on archive.org. (I'm archiving them right now.)
-
arkiver
yarrow: podcasts are not being archived into the Wayback Machine
-
yarrow
they're not?
-
yarrow
<yarrow> Is anyone aware of any large-scale projects for automatically archiving podcasts with public RSS feeds?
-
yarrow
<pabs> yarrow: if the feeds are in wikidata, then urls-sources & IA will download the podcasts /cc arkiver
-
yarrow
I guess that's not correct?
-
nimaje
does urls-sources put the stuff it archives into wayback machine?
-
arkiver
nimaje: yes
-
arkiver
yarrow: yeah it's not correct, those feeds are there just to archive the feeds themselves
-
arkiver
there is a podcast archiving project though yes, but it does not store the podcasts in the Wayback Machine
-
arkiver
Flashfire42: no on abload.de
-
OrIdow6
Doranwen: As for "Pretty much all of ff.n has been backed up by various people in the fic archival community" - do you havn links? Especially for recent stuff (seemingly they started with the aggressive anti-bot measures in the late 2010s?)
-
yarrow
Do the podcasts end up on archive.org? As I noted above, I've been surprised by how many podcasts are NOT on archive.org. I've archived about 180 manually so far myself and I've seen very few examples of something already being archived when I check to see if I should do it.
-
OrIdow6
Doranwen: As I'd like to document taht on the ArchiveTeam wiki
-
h2ibot
OrIdow6 edited Shutdown rumors, hoaxes, and scares (+228, Fanfiction.net):
wiki.archiveteam.org/?diff=52745&oldid=51793
-
c3manu
yarrow: how do you archive podcasts btw? do you just yt-dlp the feed and yeet all the episodes into one item?
-
yarrow
I use a tiny, obscure program from GitHub called PodcastBulkDownloader to download the feed. Then, yeah, I put all episodes in one item with cover art and good metadata.
-
yarrow
-
yarrow
The 5-year mortality rate on podcasts with over 100,000 listens has to be something bananas like 30% (and then for podcasts with under 10,000 listens it's gotta be like 80%).
-
Harzilein
oh
-
arkiver
yarrow: yes but archiving has been less intense the last year due to changes in the way the podcasts are being stored
-
arkiver
(and PBs had to be switched over to the different way of storing)
-
arkiver
that is almost done and archiving will start fully again
-
yarrow
How podcasts are being stored internally at the IA?
-
arkiver
yeah with the 'items' system, not wayback
-
h2ibot
Manu created Discourse/archived (+6, start list with archived discourse instances):
wiki.archiveteam.org/?title=Discourse/archived
-
h2ibot
Manu edited Discourse/archived (+79, saved forums.lutris.net):
wiki.archiveteam.org/?diff=52747&oldid=52746
-
h2ibot
Manu edited Discourse (+55, include list of archived instances):
wiki.archiveteam.org/?diff=52748&oldid=52228
-
h2ibot
Manu edited Discourse/uncategorized (-45, forums.lutris.net => /archived):
wiki.archiveteam.org/?diff=52749&oldid=51202
-
c3manu
yarrow: do you keep them updated over time? that's a thing i'd probably struggle with, hence most of the ones i archived are stale ones.
-
yarrow
I only started this 2-3 weeks ago, so I haven't done any updating of shows are are still ongoing (a lot have permanently ended). Unfortunately, I don't have any automation set up nor do I have the know-how to do that.
-
yarrow
*shows that are still ongoing
-
yarrow
arkiver: any idea why Barack Obama's podcast wasn't caught by the automated system?
-
yarrow
-
h2ibot
-
h2ibot
-
that_lurker
-
katia
that_lurker, <3
-
katia
that_lurker++
-
eggdrop
[karma] 'that_lurker' now has 14 karma!
-
yarrow2
that_lurker why is their no downloadable or playable/streamable video in that item? confused
-
yarrow2
*why is there
-
that_lurker
currently uploading it
-
that_lurker
live chat is the in json format
-
yarrow2
ahhhh
-
that_lurker
uploading BREAKING: Secret Service rushes Trump off stage | LiveNOW from FOX [-C2RyORyX0U].mkv: 2%|█▏ | 203/10310 [24:21<19:18:35, 6.88s/MiB]
-
yarrow2
I see
-
yarrow2
Nice work :)
-
h2ibot
Manu edited Bandcamp (+1203, take note of the Vapor Vault):
wiki.archiveteam.org/?diff=52752&oldid=50959
-
h2ibot
-
h2ibot
-
h2ibot
Exorcism edited Bugzilla (+34, /* Not yet archived */):
wiki.archiveteam.org/?diff=52755&oldid=52754
-
h2ibot
-
h2ibot
OrIdow6 edited Shutdown rumors, hoaxes, and scares (-24):
wiki.archiveteam.org/?diff=52757&oldid=52745
-
thuban
yarrow2: just fyi, podcastbulkdownloader works from the podcast rss feed and not a catalog such as apple's, meaning that it will not find back episodes if the rss feed is truncated (as some of them are)
-
» Harzilein wished the archiving approach would go via synthesizing archive rss feeds where rss feeds are truncated.
-
Harzilein
podcasts are a mess. i have a grudge against my country's public broadcasters for: a) always shuffling things around with their web platforms b) muddling terminology (i.e. a podcast episode might be called "an audio" internally, but it will inevitably become "a podcast", even if it's not part of a feed) and c) sometimes having "entire news broadcast" feeds of length one.
-
Harzilein
(of course the segmentized feeds have longer length but it becomes hard to tell then when it really got broadcasted first)
-
h2ibot
Exorcism edited Bugzilla (+38, /* Not yet archived */):
wiki.archiveteam.org/?diff=52758&oldid=52756
-
h2ibot
-
h2ibot