-
ctag
What is argenteam?
-
ctag
I torrent stuff that's in the clear, but that thing mentions subtitles, which gives me pause.
-
nicolas17
ctag: argenteam was a fansub website
-
ctag
I don't know what fansub means, one sec
-
nicolas17
they downloaded movies and TV show episodes in their original language and made their own translated subtitles
-
FireFly
ctag: basically community-made subtitles for media, so the subtitles should be just fine afaik
-
ctag
Hmm
-
ctag
OK, I'll toss it in the grinder, thank you both for the explanations.
-
nicolas17
the website had magnet and e2dk links to the videos, and downloading/seeding those is plain old piracy, but the subtitles were made by the community
-
nicolas17
and now it was shut down
-
nicolas17
and they made a single 1.8GB torrent with *all* the subtitles they had
-
ctag
Ah
-
ctag
It's got plenty of seeders, I'm guessing archival involves more than just keeping the torrent active?
-
nicolas17
yeah the shutdown was today so there's probably a shitload of people downloading it right now
-
nicolas17
ctag: I already archivebot'd the website and subtitles and forum last month, I only posted it here now to get the shutdown notice archived
-
ctag
Ah, OK thanks
-
ctag
Does IA host piracy agacent material?
-
ctag
Or is this to get it saved but blackholed
-
h2ibot
Pokechu22 edited List of website hosts (+32, /* B */ bplaced also uses square7.ch):
wiki.archiveteam.org/?diff=51453&oldid=47760
-
manu|m
learned about a mastodon instance that will likely shut down soon, no idea what to do about it..
woof.group/@aphyr/111683303140271139
-
pabs
manu|m: can you ask them if the instance should be archived? and add it to
wiki.archiveteam.org/index.php/Mastodon
-
pabs
not sure if AT does do fediverse archiving, there was a bit if a backlash before IIRC
-
manu|m
well, it was offline because the instance admin isn't reachable. there's probably not going to be a consensus among its userbase either..
-
pabs
for the logs, the instance is
bear.community
-
» nicolas17 covers fireonlive's eyes
-
fireonlive
:O
-
fireonlive
:D
-
fireonlive
archivebot can't mastodon vlatest anymore, so we'd have to something else
-
h2ibot
YetAnotherArchiver edited The WARC Ecosystem (+592, /* Tools */ Add more tools):
wiki.archiveteam.org/?diff=51454&oldid=51015
-
h2ibot
Igloos edited Deathwatch (+240, Added arcalive/b/genshin):
wiki.archiveteam.org/?diff=51455&oldid=51449
-
brandan
hey i'd like to archive an entire school's website for wiki-en reasons, i already tried to use archivebot but then it dawned on me that i need a second hand to authorize the usage of archivebot in the first place
-
brandan
sorry if i misspell anything im on a newer keyboard
-
Barto
on it
-
litech
Hello everyone, has anyone archived the comments to this deleted livejournal post?
web.archive.org/web/20191114140929/https://varlamov.ru/3267082.html
-
litech
Hello everyone, has anyone archived the comments to this LiveJournal post?
web.archive.org/web/20191114140929/https://varlamov.ru/3267082.html Thank you for looking into this.
-
that_lurker
Anyone know any good way to archive a podcast to IA?
-
that_lurker
s/way/tool
-
nulldata
that_lurker - You can grab the RSS feed of a podcast if it's on iTunes using the details here
superuser.com/a/782413 and then throwing that into something like
github.com/lightpohl/podcast-dl or
codeberg.org/janw/podcast-archiver
-
nulldata
or if it's on SoundCloud or YouTube you could use yt-dlp to grab the entire user/channel
-
that_lurker
ahh podcast-archiver was the one I was looking for. Thanks
-
nulldata
Though as for what switches to use and the kosher way to package it for IA I'm not sure. Probably could use some guidance from someone on formatting, tags, etc. The question has come up a few times I've thought about making a Wiki article.
-
h2ibot
FireonLive edited URLs (-19, remove CTA for now):
wiki.archiveteam.org/?diff=51456&oldid=51406
-
Pedrosso
JAA: Is the steam workshop downloads grab planned to be done at some point?
-
mgrandi
@Pedrosso: steam workshop grab? Like the websites or files?
-
Pedrosso
mgrandi: In this instance I mean files but getting the comments may also be good.
hackint.logs.kiska.pw/archiveteam-bs/20231219#c396229
-
mgrandi
I actually have code to get files
-
mgrandi
I downloaded the entirety of CSGO's maps before cs2 came out
-
mgrandi
I admittedly was lazy and just wrapped SteamCmd since I don't know of a way that you can get another games workshop items otherwise since it requires a license and other stuff
-
JAA
Pedrosso: No specific plans, but it's one of those 'I'd like to do this someday' things. If someone beats me to it, all the better.
-
Pedrosso
I didn't exactly understand how you got the download links from the api
-
Pedrosso
mgrandi: Did you ever find a way to get all the ids? My way has been very clunky of simply asking steam to search through all the dates and iterating through its search pages.
-
mgrandi
Good old fashion iterating the steam workshop pages
-
Pedrosso
Ah yes, iteration. how many 404s do you get?
-
mgrandi
I didn't get any really
-
Pedrosso
wow
-
mgrandi
Just on the workshop gallery pages , not each individual ones
-
mgrandi
-
Pedrosso
ohh
-
Pedrosso
Would you be able to download and upload the p2 workshop items to IA? I have a list of portal 2 workshop item ids which is up-to-date up til the upload date here
archive.org/details/portal2_workshopIDs_20231212
-
mgrandi
I can get that started , it requires windows so I can't easily do it on my server
-
Pedrosso
Awesome. How will you upload it? Like, in item fragments?
-
mgrandi
But right now it's storing everything as rows in a database since it was easiest to get working fast, 7z compressed since there wasn't really a good reason to use warc since I'm not the one downloading it (SteamCmd is)
-
mgrandi
It can be changed to do whatever, or I can upload the code and you can run it as well
-
Pedrosso
I am concerned about getting banned & rate limiting. Regardless, I would indeed like to see the code
-
mgrandi
You don't seem to get banned
-
mgrandi
I did run into a few rate limits but it seems like it correctly handles it
-
Pedrosso
that's good
-
mgrandi
Since it's a command line program and they don't have good exit codes I have to parse the stdout which is fun
-
mgrandi
Let me look at the code tonight and clean it up and publish it since I've been meaning to do that
-
mgrandi
It also might need some adjustments since steam workshop is basically hashtag yolo , even within the Counter strike global offensive workshop, I found several different formats of "maps"
-
Pedrosso
Oh wait, I do have a download script, I had just believed that you all had had better ways of doing it.
-
Pedrosso
Only for the portal maps though
-
mgrandi
I mean the python code just automates it
-
mgrandi
But it also compressed it, creates a database of the files, is resumable , handles errors or if we get rate limited, dtc
-
Pedrosso
That's better than my code at least
-
Pedrosso
Mine just stuffs the files in a .tar file and is done with it