-
Naruyokodocs.google.com/spreadsheets/d/1BtU…ClYi8FGvaYmYyc1p4SkfpNty-U/htmlview A list of Yandex Disk links; the spreadsheet is linked from siivagunner.fandom.com/wiki/Ripping . I don't know the archive status or the legality of these song stem files.
-
nicolas17there was a terrible storm in Bahia Blanca, Buenos Aires, lanueva.com has news and photos about it
-
nicolas17should I make a list of relevant articles to do a non-recursive AB?
-
pokechu22That sounds reasonable to me
-
nicolas17(since a recursive AB on a news site seems like a bad idea)
-
nicolas17has <amp-img src="pxcdn.lanueva.com/122023/1702770456117.jpeg">
-
nicolas17I guess AB can't follow that?
-
nicolas17(on SPN I guess it would execute the Javascript that turns it into an <img>)
-
pokechu22AB should extract that normally, since it likes extracting stuff from data attributes too
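A minimal sketch of the kind of attribute-based link extraction described here, using only Python's standard library. This is not ArchiveBot's actual code; the `AttrUrlCollector` class and `extract_urls` helper are hypothetical names, and collecting every `data-*` attribute value is an assumption about what "extracting stuff from data attributes" could look like.

```python
from html.parser import HTMLParser


class AttrUrlCollector(HTMLParser):
    """Collects values of src and data-* attributes from any tag,
    including custom elements such as <amp-img>."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        for name, value in attrs:
            if value and (name == "src" or name.startswith("data-")):
                self.urls.append(value)


def extract_urls(html):
    collector = AttrUrlCollector()
    collector.feed(html)
    collector.close()
    return collector.urls


# The tag quoted in the chat, closed here so the sample is well-formed.
sample = '<amp-img src="pxcdn.lanueva.com/122023/1702770456117.jpeg"></amp-img>'
print(extract_urls(sample))
```

Because the parser never checks the tag name, no JavaScript has to run for the `src` of an `<amp-img>` to be picked up.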
-
nicolas17oh good
-
eggdropinline (for browser viewing): transfer.archivete.am/inline/QFklN/lanueva.com-20231216-storm.txt
-
pokechu22Yeah, it extracted that and also pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=83&ch=46 and pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=168&ch=94
-
h2ibotExorcism edited Moegirlpedia (+38): wiki.archiveteam.org/?diff=51369&oldid=51335
-
h2ibotExorcism edited WikiApiary (+44): wiki.archiveteam.org/?diff=51372&oldid=51328
-
h2ibotExorcism edited Xeno-canto (+36): wiki.archiveteam.org/?diff=51374&oldid=51330
-
Jojo111Hello everyone. I am looking for an old deleted coursera course called "ECEN 5017 Power Electronics For Electric Drive Vehicles". I would be immensely grateful if someone can help me find it. Thanks.
-
nicolas17welp
-
nicolas17the storm reached me
-
nicolas17that was the strongest wind I've ever seen irl
-
JAAnicolas17: Stay safe, mate!
-
nicolas17do we have anything functional for twitter archival?
-
Bartonicolas17: twitter archival? I run my own nitter instance at nitter.vloup.ch and it works quite well with twitterminator tokens :-) It's not advertised in their public github wiki, and that's the spirit of it. Just be careful about the 429 ;-)
-
nicolas17Barto: well we could throw those nitter.net links into AB
-
nicolas17I walked around the neighborhood and took 1000+ photos for Mapillary, there wasn't much destruction in this area tho
-
fireonliveraw dog nitter.net doesn’t work with AB
-
fireonlivebut Barto’s does
-
nicolas17phrasing
-
fireonlive:p
-
Bartomuahaha
-
PedrossoWould gamebanana.com be worth saving? It hosts lots of mods (e.g. for Portal 2 and HL2). It has very limited coverage but seems quite big. sitemap.gamebanana.com/index.xml (each sitemap in the index seems to refer to only 1 URL). If it's up to date that's about 19k pages
-
pokechu2219k seems kinda small for something like that, hmm
-
PedrossoIt does
-
PedrossoAlthough the site isn't in danger afaik, the coverage is limited and mods can be deleted at any time
-
pokechu22Index (2,103,542) for members - that's a lot of users
-
PedrossoAh. The 1 page per sitemap applies to categories but not necessarily to the others
-
Pedrossomod categories. Yeah my main guess was way off. Must be much bigger
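A rough sketch of how the page estimate could be checked: walk the sitemap index and count the URLs in each child sitemap instead of assuming one URL per sitemap. The XML below is made-up sample data on a placeholder domain, not fetched from sitemap.gamebanana.com, and it assumes the site uses the standard sitemaps.org namespace; `extract_locs` and `count_pages` are hypothetical helper names.

```python
import xml.etree.ElementTree as ET

# Standard sitemap protocol namespace (assumed to be what the site uses).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_locs(xml_text):
    """Return every <loc> value in a sitemap or sitemap index."""
    root = ET.fromstring(xml_text)
    return [el.text.strip() for el in root.iterfind(".//sm:loc", SITEMAP_NS) if el.text]


def count_pages(index_xml, fetch):
    """Sum the URL counts of all child sitemaps listed in the index.
    `fetch` maps a sitemap URL to its XML text (a download in real use)."""
    return sum(len(extract_locs(fetch(url))) for url in extract_locs(index_xml))


# Hypothetical sample data: one child with 1 URL, one with 3.
SAMPLE_INDEX = """<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap><loc>https://example.invalid/categories.xml</loc></sitemap>
  <sitemap><loc>https://example.invalid/members.xml</loc></sitemap>
</sitemapindex>"""

SAMPLE_CHILDREN = {
    "https://example.invalid/categories.xml": """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.invalid/cat/1</loc></url>
</urlset>""",
    "https://example.invalid/members.xml": """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.invalid/m/1</loc></url>
  <url><loc>https://example.invalid/m/2</loc></url>
  <url><loc>https://example.invalid/m/3</loc></url>
</urlset>""",
}

total = count_pages(SAMPLE_INDEX, SAMPLE_CHILDREN.__getitem__)
print(total)
```

Counting per child sitemap avoids the "number of sitemaps equals number of pages" shortcut that only holds when every sitemap lists a single URL.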
-
aninternettrollWow, I've never seen that many sitemap files
-
pokechu22Also looks like things are pretty JS-based: view-source:https://gamebanana.com/games doesn't have any games in it
-
PedrossoWhat about pages that aren't search pages?
-
pokechu22
-
Pedrossooh wow. So AB is a no-go? Or do the .js files have the necessary info?
-
pokechu22AB almost certainly won't work I think
-
Pedrosso
-
Pedrosso0.7 million, hah.
-
PedrossoThe mod downloads appear to work off of a gamebanana.com/dl{id} system, where the ID can be taken from the mod page URL (e.g. gamebanana.com/mods/233183), so building a full list of those is easy.
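A small sketch of turning mod page URLs into download URLs under the scheme described in this message. The template string is copied literally from the chat; whether the real URL needs a separator between `dl` and the ID is not confirmed here, so `DL_TEMPLATE` is an assumption to adjust. The function names are hypothetical.

```python
from urllib.parse import urlparse

# Copied verbatim from the chat message; the exact URL shape is unverified.
DL_TEMPLATE = "gamebanana.com/dl{id}"


def mod_id_from_url(url):
    """Extract the numeric ID from a mod page URL like gamebanana.com/mods/233183."""
    # urlparse only fills in .path correctly when a scheme is present.
    if "://" not in url:
        url = "https://" + url
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) == 2 and parts[0] == "mods" and parts[1].isdigit():
        return int(parts[1])
    raise ValueError(f"not a mod page URL: {url!r}")


def download_url(mod_id):
    """Fill the (assumed) download template with a mod ID."""
    return DL_TEMPLATE.format(id=mod_id)


mod_pages = ["gamebanana.com/mods/233183"]
for page in mod_pages:
    print(download_url(mod_id_from_url(page)))
```

Given a list of mod page URLs (for example from the sitemaps), the loop at the end produces the matching list of download URLs.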
-
OrIdow6Is this shutting down?
-
PedrossoI said earlier that it isn't; it just has low coverage
-
phuzionCan someone take a look at the archivebot job for forums.questionablecontent.net and see if it's worth upping the speed on the job? I think they managed to get the server migrated, it seems to be pretty stable.
-
phuzionAlso might be worth looking into increasing the ignores on that job, it seems to go off on full-ass tangents of archiving some random other websites.
-
pokechu22Yeah, archivebot saves outlinks and embedded images by default, and unfortunately a lot of old forum image hosts are dead :/
-
pokechu22I've bumped up the speed to 1s-2s, let's see if it's stable like that