-
taposNope, I can't code
-
deadorbitwhat
-
taposI'd have to get the domains manually if I was to do it on my own
-
taposI was responding to thuban
-
deadorbitoh i cant see the message lol
-
taposYeah, you joined after he sent it
-
c3manumurb: i just realized i conflated the cases against the journalist and the one against linksunten (sometimes i just need a day or two for that :D). hence the paragraph mixup
-
cpreciosoHi, I think guiasnintendo.com might be in danger of getting turned off. It is an official Nintendo Spain website with detailed game guides for almost all first-party (and some third-party) Nintendo games since the GameBoy. However, it has not been updated since 2022 (as can be seen in the footer), nor has it added any new guides since.
-
cpreciosoI am running grab-site on it right now, as I don't think it is an overly large website; but I am unsure of how to proceed if I do want to send this to the internet archive, or share the archiving load with the rest of the Archive Team. How should I proceed?
-
thubancprecioso: i have started an archivebot job for www.guiasnintendo.com which you can monitor at archivebot.com; the results will be uploaded to the internet archive and indexed by the wayback machine a few days after the job completes. sound good?
-
cpreciosothuban amazing, thanks!
-
thubanyou're welcome; thanks for the tip!
-
katiaa/G aita
-
katiaops
-
thuban* archivebot.com (mea culpa...)
-
c3manu:D
-
c3manuthey'll probably come back when it doesn't work :)
-
CheesyAny list where I can dump government sites to be eventually get crawled?
-
kiskaPerhaps #//
-
CheesyJust gonna do a massive dump there
-
CheesyHopefully not against any rule
-
c3manuCheesy: like a list of urls?
-
CheesyYeah
-
c3manuyou could also list them in a text file and upload it to transfer.archivete.am
-
c3manuand then just post the link here :)
-
c3manuon another note: did anyone here get tripadvisor-urls to work with AB, or has another reliable method of archiving them? if so, i'd be intrigued :)
-
taposIs anyone here willing to write a scraper that extracts Google Sites and Blogspot links from Kemono? Google seems to be about to do a NSFW purge on at least Google Sites and this would probably be the best way of backing up as many NSFW artist sites as possible
-
taposYou could probably use a lot of code from github.com/SatyamSSJ10/Kemono-youtube-fetch
-
tapos
-
tapos
-
tapos^ The webpages that need to be scraped