-
nulldata
Can someone throw
torus.com.au into AB? Studio laid off employees and is basically shutting down. Some missing YouTube videos on the main page - maybe starting to remove them too?
gamedeveloper.com/business/report-a…fectively-shuts-down-after-30-years
-
h2ibot
Inti83 edited Argentina (+72, /* Public Media and Communication */):
wiki.archiveteam.org/?diff=51824&oldid=51633
-
JAA
(Done by pokechu22 earlier)
-
Terbium
welp, Yuzu was a fun ride
-
tech234a
can someone save
github.com/merryhime/dynarmic while it is still in Google cache? it was a dependency used by Yuzu/Citra and also went down
-
pabs
it got saved in #gitgud and SWH already
-
tech234a
thanks
-
arkiver
do we know how complete the vice.com crawl is? is/was AB able to fully reach and archive all public parts of the website?
-
arkiver
in the past we had an ArchiveBot pipeline behind Tor
-
arkiver
it does not exist at the moment i believe, but perhaps we can bring it back
-
arkiver
is there anyone who would like to take something like that on?
-
arkiver
CC JAA who manages much of AB
-
DoomDog
Hey, I have a small question. For some days I spend some time using an massive-ping-tool (AngryIPScan) and just poked around clusters. I found some websited not yet Archived but with an DNS entry so I fed them to the waybackmachine. Now I found some Websites that dont have any DNS-Entry anymore. How do I archive these properly and feed them?
-
DoomDog
Please?
-
fireonlive
i thought the germans were patient
-
JAA
arkiver: The VICE crawl from last year should've been fairly complete, and we have covered all newer articles listed in the sitemaps. The recrawl is still running.
-
JAA
Correct, no Tor AB pipeline currently, and some details to be figured out, but we could certainly set one up again.
-
fireonlive
... "anonjobs"
-
fireonlive
perfect pipeline name
-
SootBector
I'd have interest in helping with a Tor setup if I could be useful
-
JAA
fireonlive: That sounds like a pipeline for 4chan.
-
fireonlive
haha perhaps
-
fireonlive
o.o
-
Barto
i mean, getting through tor technically only needs something with the correct socks proxy available
-
» Barto is definitely not running a tor relay at home
-
JAA
We have the tech for it, the details to figure out is uploads to IA (data needs to go in a different collection) and who hosts it (with the usual longevity requirements).
-
Barto
we can hijack our client's infra at work, i heard they had good disks :-)
-
Barto
i have that nfs storage of the client that is 80TB right here
-
Barto
correction, 87TB
-
fireonlive
i'm sure they';d love that :p
-
imer
TOR is still unproblematic as long as you're not running an exit node, right?
-
imer
or has that gotten worse?
-
Barto
it has gotten worse, i see legitimate services blacklisting my ip
-
imer
ugh
-
Barto
car insurance company? Gotta use 4G!
-
Barto
swiss post: 4G!
-
steering
they wouldnt know your IP as a pure client anyway
-
Barto
german space agency DLR, 4G!
-
Barto
also, china and russia? Blocked :p
-
Barto
china more often, russia is like 50-50
-
fireonlive
oh lame, i guess "security" companies are choosing the wrong IP lists to blocklist again
-
fireonlive
"relays also bad"
-
Barto
yeah, that's my guess
-
Barto
also my isp wrongly assumes i have some compromized device, due to the high network load to other tor relays
-
fireonlive
sheesh
-
kiska
Barto: What if you ran #// on your network :D
-
fireonlive
'hey you accessed .well-known/something-thats-public, and we got an abuse report. your internet is now deleted'
-
kiska
Perhaps we would nee to archive your ISP, cause they would have a stroke
-
Barto
kiska: did that, once. never again
-
kiska
What was response :D
-
fireonlive
'i'm sorry, we don't accept "$site's operator has head up faecal tube" as an excuse"
-
Barto
when i ran #//, ofc i got some abuse letters. Probably they ignored a couple of them, but at some point they did temporarily suspend my router config (redirecting all non https traffic to their captive portal with a button+captcha for reactivation of the network).
-
Barto
that's the law being applied here, could be worse
-
Barto
they just have to ask you kindly to "make sure your compromized machine is cleaned".
-
Barto
as there are absolutely no machine compromised, you see the pattern :D
-
Barto
i think there was maybe one asshole that did send an abuse to all tor relays, thought it was #telegrab for a second here
-
fireonlive
'to all@. subject: tor bad, please take down' isp: 'oh ok'
-
fireonlive
weird lol
-
SootBector
I thought the idea was to run archivebot via Tor, what would the relay be used for?
-
fireonlive
ye it'd just be a client in that case
-
SootBector
Right, and is it to archive .onion sites? or are there many clearnet sites that don't block exit nodes, but are somehow problematic for pointing AT's existing IPs at?
-
JAA
All of the above.
-
Barto
i just shared my experience with tor :-)
-
Barto
9 days of uptime, 2TB transferred
-
SootBector
I agree, I wouldn't run a relay at home either :)
-
Barto
eh eh eh :-)
-
SootBector
wiki describes installing archivebot as tricky but it looks straightforward, I'll give it a try and see what I'm overlooking
-
aninternettroll
why not a #warrior ?
-
JAA
There are several quirks to it.
-
JAA
aninternettroll: The two serve completely different purposes.
-
JAA
I already have a grab-site setup with Tor, FWIW.
-
JAA
But that's containerised, and past attempts at doing that with AB went somewhere between badly and meh.
-
SootBector
AB instructions install youtube-dl via both apt and pip?
-
JAA
AB doesn't even use youtube-dl anymore.
-
JAA
The installation notes are probably outdated by years at this point.
-
SootBector
last commit nov.2022 yes
-
SootBector
are they even worth trying?
-
JAA
No mention of tcp-closer either. I'm surprised I added the OPENSSL_CONF apparently.
-
SootBector
I'm happy to step through them and keep notes if that's helpful
-
JAA
I have complete notes somewhere, just not committed or anything.
-
SootBector
equally happy to try installing from those
-
nulldata
-
eggdrop
-
h2ibot
Pokechu22 edited Jira (+395, /* Status */ track.hpccsystems.com done):
wiki.archiveteam.org/?diff=51825&oldid=51821
-
h2ibot
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine (+138, cia-on-campus.org - Daniel Brandt, linked…):
wiki.archiveteam.org/?diff=51827&oldid=51820
-
h2ibot
Bear created Gifer (+527, Created page with "'''Gifer''' is a repository…):
wiki.archiveteam.org/?title=Gifer
-
h2ibot
Bear edited List of websites excluded from the Wayback Machine/Former exclusions (+469, turnoffthelights.com and snopes.com cut off at…):
wiki.archiveteam.org/?diff=51829&oldid=51782
-
h2ibot
JustAnotherArchivist changed the user rights of User:Bear
-
nulldata
I wonder if other Adult Swim published games are getting removed too
-
SootBector
would be a shame if glittermitten grove got pulled