01:37:46 Flashfire42 edited URLTeam/Warrior (+297): https://wiki.archiveteam.org/?diff=50268&oldid=50266
01:47:47 Flashfire42 edited URLTeam/Warrior (+37, /* Warrior projects */): https://wiki.archiveteam.org/?diff=50269&oldid=50268
01:48:47 Flashfire42 edited URLTeam/Warrior (+179, /* Warrior projects */): https://wiki.archiveteam.org/?diff=50270&oldid=50269
04:16:23 Could someone grab https://www.mitnicksecurity.com now that Kevin Mitnick has died?
05:11:28 that_lurker: done
09:50:09 Reece2oo9 edited Internet Archive (+348, /* Mirrors */): https://wiki.archiveteam.org/?diff=50271&oldid=49710
09:50:10 Usernam edited List of websites excluded from the Wayback Machine (+23): https://wiki.archiveteam.org/?diff=50272&oldid=50267
14:15:29 madpro|m: What happened to the Twitter Developers Forum? Is it safe to assume it's back around?
17:35:41 Afternoon
17:36:08 Proposing that at some point https://www.legislation.gov.uk/browse/eu and related pages are placed on a list for archiving.
17:36:23 It's not an immediate priority however.
17:46:16 With how archivebot works, it'd probably require doing all of https://www.legislation.gov.uk/ instead - which might be kinda big?
17:48:34 But worth it
17:49:00 Not an immediate priority though.
17:49:42 My other suggestions for archiving are potentially NSFW sites, and I wasn't sure you archived those.
17:50:38 ugh, and incomplete coverage: https://www.legislation.gov.uk/eudn/2020/1809/contents from https://www.legislation.gov.uk/eu-origin?page=15 hasn't been saved yet - definitely worth doing then
17:51:46 If it's an NSFW site with unique user-generated content (e.g. a booru) that's closing, or a site with a lot of content and only some is NSFW, it can be saved
18:07:08 https://www.legislation.gov.uk/sitemap-ukcm.xml - 1920-12-23T00:00:00 - I guess they're not exactly wrong about that, but still funny
18:09:22 hahaha
18:23:47 pokechu22: Having legislation.gov.uk as a dump might also be feasible...
18:24:11 I'm not sure if there's a way to FOIA an entire UK gov website though.
18:24:30 I'm currently running it via archivebot, and so far it seems OK, but there is still a lot of law
18:25:00 was https://foiathedead.org ever archivebotted?
18:25:06 unfortunately, foiathedead is also dead
18:25:25 in terms of updates anyways
18:25:56 does that mean that a FOIA request should be made to the FBI about their documents on FOIA The Dead? :p
18:26:05 ooh yes :p
18:26:06 If you are interested in UK laws... - https://statutes.org.uk/site/collections/ this has links to a LOT of items on Hathi/Google that should probably also be on Archive.org
18:27:29 And if you are discussing FOIA (I was referencing the UK one) - https://www.whatdotheyknow.com/ has an archive of responses to UK FOIA requests...
18:28:16 The UI can't find stuff going back indefinitely though, but a clever crawler might be able to find earlier requests linked from later ones.
18:28:45 I can't run a crawler bot locally, due to bandwidth/port restrictions on the PC I use.
18:29:36 Another non-immediate priority, archiving Activision before the merger...
18:30:26 I assume archiveteam people know about Wikisource?
18:31:08 Archival of old UK laws assists that project because pulling down a DJVU/PDF from IA is far easier than it is from some other sites.
18:31:36 Oh and I'll note something about certain Google Books URLs..
18:31:58 the great thing about archivebot/DPoS is it's available in the 'wayback machine' in addition to collections
18:32:06 so easier for some to find
18:32:13 Certain University of California originated scan links have sequential IDs
18:33:41 https://books.google.co.uk/books?vid=UCAL:B4958531 for example can be iterated to find related publications.
18:34:45 It would need a clever programmer to develop a special crawler, but there isn't anything unfeasible about writing a crawler to go through all potential IDs and grabbing PDF/metadata to put on IA.
18:35:20 (Even better if there is a way to just give it a Hathi ID and the bot does the rest, pulling from Google Books if needed... :)
18:35:31 Not my area of expertise, but thought I'd mention it.
18:36:06 fireonlive: DPoS?
18:37:35 distributed preservation of service / aka the warrior projects
18:38:31 The two NSFW sites I had in mind for archiving were https://www.fictionmania.tv/ and https://bigclosetr.us/topshelf/ - they should not be an immediate priority as the sites are not under threat right now.
18:39:44 I'd also suggest someone looks into archiving UK NSFW sites at some point, before new rules might cause some of them to close down.
18:39:51 (There is an ongoing debate about forcing sites to age gate... and bear the cost of doing so.)
18:48:16 Anyway thanks for listening :)
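
The sequential-ID idea from 18:32-18:34 could be sketched roughly as below. This is only an illustration of generating candidate URLs, not a working crawler: it assumes the `vid` values really are a fixed prefix plus a zero-padded counter (as the single `UCAL:B4958531` example suggests), the function names `neighbouring_vids` and `candidate_urls` are made up for this sketch, and nothing here fetches anything or checks whether a given ID exists.

```python
import re
from typing import Iterator, List

# URL pattern taken from the example in the log; whether every generated
# ID resolves to a real volume is NOT known and would need checking.
BASE_URL = "https://books.google.co.uk/books?vid="


def neighbouring_vids(vid: str, count: int) -> Iterator[str]:
    """Yield the `count` IDs that follow `vid` numerically.

    Assumes the ID is a non-numeric prefix followed by a run of digits,
    e.g. "UCAL:B" + "4958531". The digit width is preserved.
    """
    match = re.fullmatch(r"(.*?)(\d+)", vid)
    if match is None:
        raise ValueError(f"no trailing number in ID: {vid!r}")
    prefix, digits = match.groups()
    start = int(digits)
    for offset in range(1, count + 1):
        yield f"{prefix}{start + offset:0{len(digits)}d}"


def candidate_urls(vid: str, count: int) -> List[str]:
    """Build candidate Google Books URLs for the IDs after `vid`."""
    return [BASE_URL + v for v in neighbouring_vids(vid, count)]


if __name__ == "__main__":
    for url in candidate_urls("UCAL:B4958531", 3):
        print(url)
```

A list like this could be handed to whatever tool actually does the fetching, which keeps the ID guessing separate from any rate limiting or politeness rules the crawl would need.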