-
DogsRNice
-
pokechu22started a #wikibot job for that
-
DogsRNicethanks
-
project10motor-talk.de is on the deathwatch; looks like it got another lease on life (ref motor-talk.de/blogs/motor-talk/bist…alk-und-gutefrage-net-t7534115.html)
-
flashfire42|m
-
rktk^^^^^^^
-
rktkthrow it into the bot!!!
-
masterX244Annoying thing is: the good stuff aka the tools is behind a login wall (post attachments). those needs to be grabbed, too in some form (even if its just a regular IA item to find them)
-
h2ibotSwitchnode edited Deathwatch (+188, /* 2023 */ add xentax): wiki.archiveteam.org/?diff=50944&oldid=50935
-
thubananyone up for contacting the codingforum admins to ask them to turn off cloudflare protection before the 22nd?
-
thuban
-
thuban(...one of these guys has a site with misconfigured ssl and the other one sells ytdl wrappers)
-
audrooku|mI'm scraping soundcloud track/user/playlist metadata for archival purposes (mostly track for now) and I'm looking at a variety of options for having broad coverage of the site, my schema supports multiple copies of the same record and I want to be able to view the updates to a track temporally, i already grab track ids as... (full message at <matrix.hackint.org/_matrix/media/v3…ackint.org/uegPHzYwdYqaymiANnyaszUw>)
-
audrooku|mshould've read the wiki, the telegram outlinks are added to urlteam2 and the urlteam2 url lists are uploaded in chunks as they are scraped to an IA collection, I'm still curious about the commoncrawl outlinks and whether ungrabbed urlteam2 urls are accessible
-
project10fwiw, urlteam project is not the same as the urls project. urlteam is the link shortener crawler
-
audrooku|mOh, that's right
-
nulldataSomeone posted an export of the XeNTaX Wiki that became inaccessible earlier this year. forum.xentax.com/viewtopic.php?p=194854#p194854
-
nulldataDirect link to export on Google Drive -> drive.google.com/file/d/16amHfPjfL7QagQfnUOGB2mEQQ2qGbywY/view
-
Maakuth|mpelaaja.fi/forum this Finnish forum is going to be closed in the near future. No specific date mentioned by quick looks, but they're talking about website refresh that's that should happen in the next weeks
-
Maakuth|mPerhaps a job fit for archivebot?
-
thubanMaakuth|m: yep, someone's already started one archivebot.com/?initialFilter=pelaaja
-
thubanthanks for the report!
-
Maakuth|mSweet, thanks!
-
h2ibot
-
h2ibotExorcism edited Pages Perso Orange (+42): wiki.archiveteam.org/?diff=50946&oldid=50939
-
h2ibotFireonLive edited Current Projects (-703, move PPO to finished, remove 'expired' finished…): wiki.archiveteam.org/?diff=50947&oldid=50922