-
pabsEvanBoehs|m: for the forum itself, just link to the forum from here and someone will run it through archivebot. also tell us any URL patterns you had to ignore during your crawl.
-
pabsand if you have a list of other individual URLs you want saved, put them in a text file, one per line and upload it to transfer.archivete.am
-
EvanBoehs|m<pabs> "Evan Boehs: for the forum itself..." <- Thanks. I've started a grab-site job
-
EvanBoehs|mI will upload the WARC when it's done
-
pabsthat won't get into web.archive.org though
-
pabsif you tell us the forum, then we can put it through archivebot, which will get into web.archive.org
-
EvanBoehs|mpabs: oh, #archivebot?
-
pabsyes
-
EvanBoehs|mSorry it's been a long day :), thanks for the patience
-
pabsthe parameters are: the forum URL, plus any URL patterns we should ignore, and how much delay to add between each request, and how many parallel requests should be run
-
EvanBoehs|m<pabs> "the parameters are: the forum..." <- could I be voiced? I put in a request but it's in use by others so I think it's already obscured
-
pabsrepeat what you said on #archivebot here and someone will get to it later
-
EvanBoehs|mOh. "!a letterstocrushes.com" "Logic is the admin's been inactive for years, I tried to reach him various ways about security issues, it's been 2 years and no response. Site's been around for a decade, I posted many as a teenager years ago and I'd be sad to see them inevitably gone"
-
pabsand its a forum right?
-
EvanBoehs|mpabs: Yes, custom programmed so forum endpoints won't be an issue, only outlinks
-
pabskatocala started it for you, please watch the job on archivebot.com and poke them about any issues
-
EvanBoehs|mpabs: sounds good
-
pabsEvanBoehs|m: also you may waht to check if there are subdomains wiki.archiveteam.org/index.php/Finding_subdomains
-
EvanBoehs|mpabs: What's considered an issue? I saw it briefly touch eff and wikipedia files but it moved on
-
pabslike if the bot gets banned the site breaks, we need to slow the job down a lot until it recovers
-
EvanBoehs|mpabs: Oooh ok more severe stuff got it
-
pabsor if the bot gets into a loop of URLs that don't exist or there are too many of them, then we should ignore those
-
EvanBoehs|mMakes sense. Thanks for everything again. This community seems wonderful
-
pabsno probs. welcome :)
-
h2ibotArkiver uploaded File:Docker (container engine) logo.png (From…): wiki.archiveteam.org/?title=File%3A…%28container%20engine%29%20logo.png
-
h2ibotArkiver edited Dev/Infrastructure (-220, incorrect): wiki.archiveteam.org/?diff=49885&oldid=49411