00:19:07 Can someone throw https://www.torus.com.au/ into AB? Studio laid off employees and is basically shutting down. Some missing YouTube videos on the main page - maybe starting to remove them too? https://www.gamedeveloper.com/business/report-australian-studio-torus-games-effectively-shuts-down-after-30-years 00:55:46 Inti83 edited Argentina (+72, /* Public Media and Communication */): https://wiki.archiveteam.org/?diff=51824&oldid=51633 00:56:49 (Done by pokechu22 earlier) 01:40:38 welp, Yuzu was a fun ride 08:31:39 can someone save https://github.com/merryhime/dynarmic while it is still in Google cache? it was a dependency used by Yuzu/Citra and also went down 09:02:14 it got saved in #gitgud and SWH already 09:10:52 thanks 09:30:03 do we know how complete the vice.com crawl is? is/was AB able to fully reach and archive all public parts of the website? 12:03:30 in the past we had an ArchiveBot pipeline behind Tor 12:03:46 it does not exist at the moment i believe, but perhaps we can bring it back 12:03:56 is there anyone who would like to take something like that on? 12:04:24 CC JAA who manages much of AB 16:49:31 Hey, I have a small question. For some days I spend some time using an massive-ping-tool (AngryIPScan) and just poked around clusters. I found some websited not yet Archived but with an DNS entry so I fed them to the waybackmachine. Now I found some Websites that dont have any DNS-Entry anymore. How do I archive these properly and feed them? 17:06:50 Please? 17:12:33 i thought the germans were patient 18:25:23 arkiver: The VICE crawl from last year should've been fairly complete, and we have covered all newer articles listed in the sitemaps. The recrawl is still running. 18:26:18 Correct, no Tor AB pipeline currently, and some details to be figured out, but we could certainly set one up again. 18:36:57 ... "anonjobs" 18:37:04 perfect pipeline name 18:46:34 I'd have interest in helping with a Tor setup if I could be useful 18:48:14 fireonlive: That sounds like a pipeline for 4chan. 18:48:27 haha perhaps 18:55:50 o.o 19:32:15 i mean, getting through tor technically only needs something with the correct socks proxy available 19:32:25 * Barto is definitely not running a tor relay at home 19:33:32 We have the tech for it, the details to figure out is uploads to IA (data needs to go in a different collection) and who hosts it (with the usual longevity requirements). 19:38:12 we can hijack our client's infra at work, i heard they had good disks :-) 19:40:14 i have that nfs storage of the client that is 80TB right here 19:44:28 correction, 87TB 20:03:04 i'm sure they';d love that :p 20:04:02 TOR is still unproblematic as long as you're not running an exit node, right? 20:04:16 or has that gotten worse? 20:04:46 it has gotten worse, i see legitimate services blacklisting my ip 20:04:58 ugh 20:05:08 car insurance company? Gotta use 4G! 20:05:12 swiss post: 4G! 20:05:16 they wouldnt know your IP as a pure client anyway 20:05:25 german space agency DLR, 4G! 20:06:45 also, china and russia? Blocked :p 20:07:23 china more often, russia is like 50-50 20:21:44 oh lame, i guess "security" companies are choosing the wrong IP lists to blocklist again 20:21:53 "relays also bad" 20:29:35 yeah, that's my guess 20:29:56 also my isp wrongly assumes i have some compromized device, due to the high network load to other tor relays 20:30:15 sheesh 20:30:28 Barto: What if you ran #// on your network :D 20:30:59 'hey you accessed .well-known/something-thats-public, and we got an abuse report. your internet is now deleted' 20:31:02 Perhaps we would nee to archive your ISP, cause they would have a stroke 20:31:38 kiska: did that, once. never again 20:32:00 What was response :D 20:32:29 'i'm sorry, we don't accept "$site's operator has head up faecal tube" as an excuse" 20:33:24 when i ran #//, ofc i got some abuse letters. Probably they ignored a couple of them, but at some point they did temporarily suspend my router config (redirecting all non https traffic to their captive portal with a button+captcha for reactivation of the network). 20:33:49 that's the law being applied here, could be worse 20:34:12 they just have to ask you kindly to "make sure your compromized machine is cleaned". 20:34:35 as there are absolutely no machine compromised, you see the pattern :D 20:35:10 i think there was maybe one asshole that did send an abuse to all tor relays, thought it was #telegrab for a second here 20:40:28 'to all@. subject: tor bad, please take down' isp: 'oh ok' 20:40:37 weird lol 20:44:32 I thought the idea was to run archivebot via Tor, what would the relay be used for? 20:45:03 ye it'd just be a client in that case 20:47:51 Right, and is it to archive .onion sites? or are there many clearnet sites that don't block exit nodes, but are somehow problematic for pointing AT's existing IPs at? 20:48:17 All of the above. 21:07:03 i just shared my experience with tor :-) 21:07:27 9 days of uptime, 2TB transferred 21:07:49 I agree, I wouldn't run a relay at home either :) 21:11:13 eh eh eh :-) 21:24:58 wiki describes installing archivebot as tricky but it looks straightforward, I'll give it a try and see what I'm overlooking 21:25:35 why not a #warrior ? 21:25:39 There are several quirks to it. 21:26:33 aninternettroll: The two serve completely different purposes. 21:26:53 I already have a grab-site setup with Tor, FWIW. 21:27:32 But that's containerised, and past attempts at doing that with AB went somewhere between badly and meh. 21:29:19 AB instructions install youtube-dl via both apt and pip? 21:29:33 AB doesn't even use youtube-dl anymore. 21:29:48 The installation notes are probably outdated by years at this point. 21:30:21 last commit nov.2022 yes 21:30:48 are they even worth trying? 21:33:21 No mention of tcp-closer either. I'm surprised I added the OPENSSL_CONF apparently. 21:33:58 I'm happy to step through them and keep notes if that's helpful 21:34:32 I have complete notes somewhere, just not committed or anything. 21:35:17 equally happy to try installing from those 23:34:52 https://twitter.com/owendeery/status/1765032553147245010 23:34:52 nitter: https://farside.link/nitter/owendeery/status/1765032553147245010 23:36:50 Pokechu22 edited Jira (+395, /* Status */ track.hpccsystems.com done): https://wiki.archiveteam.org/?diff=51825&oldid=51821 23:42:51 Nyght edited Coub (+291): https://wiki.archiveteam.org/?diff=51826&oldid=50862 23:42:52 Bear edited List of websites excluded from the Wayback Machine (+138, cia-on-campus.org - Daniel Brandt, linked…): https://wiki.archiveteam.org/?diff=51827&oldid=51820 23:42:53 Bear created Gifer (+527, Created page with "'''Gifer''' is a repository…): https://wiki.archiveteam.org/?title=Gifer 23:42:54 Bear edited List of websites excluded from the Wayback Machine/Former exclusions (+469, turnoffthelights.com and snopes.com cut off at…): https://wiki.archiveteam.org/?diff=51829&oldid=51782 23:42:55 JustAnotherArchivist changed the user rights of User:Bear 23:49:00 I wonder if other Adult Swim published games are getting removed too 23:58:46 would be a shame if glittermitten grove got pulled