-
Lord_Nightmare
old.reddit.com/r/programming/commen…kedown_from_riaa/g9sphpg/?context=3 is scary as hell, they went after the MAINTAINERS and random contributors first?
-
Lord_Nightmare
i guess the RIAA/MPAA was trying to do another popcorn time
-
Lord_Nightmare
but unlike that case, youtube-dl has lots of significant legal use, while popcorn time was basically built from day 1 to stream content illegally
-
Lord_Nightmare
JAA: should we archive that thread?
-
JAA
Lord_Nightmare: I threw it into AB yesterday, but there are a few more replies now.
-
JAA
Done
-
Lord_Nightmare
phihag as also edited a bunch of posts
-
Lord_Nightmare
... you just told me how to archive a reddit user, its a bunch of scripts...
-
Lord_Nightmare
...but i don't have any functional host that a .txt file could be pulled from for archivebot anymore
-
JAA
-
Lord_Nightmare
only the first page of
old.reddit.com/user/phihag is relevant to recent events, so should i just !ao that?
-
Lord_Nightmare
I'll do that for now, at least...
-
JAA
Hold on, I'll do the whole thing.
-
JAA
Done
-
wickedplayer494
Meanwhile on github:
github/dmca #8142
-
wickedplayer494
bee bee space rollback space username
-
JAA
Yeah, mentioned that in -ot. I dumped the repo earlier.
-
JAA
There was also at least one PR that has been deleted since (8128) and contained a download link. It's still in the repo (for now).
-
JAA
On a completely unrelated note: reminder that Docker Hub will begin to delete 'inactive' images a week from now.
-
OrIdow6
Oh, thought that was in February for some reason
-
OrIdow6
Maybe time to make a channel then
-
JAA
Apparently account holders will be notified of the pending deletion (that wasn't in the original FAQ). No clear mention of when that will happen exactly. 'Account owners will also be notified by email of “inactive” images that are scheduled for deletion.'
-
JAA
Oh, it was in the original FAQ, nevermind.
-
JAA
#failwhale and #dick were suggestions from a discussion a while ago, but I don't think we properly decided. There are a couple users in each.
-
mgrandi
maybe make it mobydick instead
-
mgrandi
failwhale is indeed kinda twitter themed
-
mgrandi
also what is that github dcma pull url supposed to be ?
-
JAA
mobydick isn't really a pun though. :-/
-
mgrandi
i didn't realize it had to be a pun, the two suggested aren't puns either lol
-
Lord_Nightmare
github/dmca bccf7d0#r43532899 the 'claiming copyright' on the code makes me wonder if they actually forced phihag or others to sign over their own copyright to their contributions, so that RIAA could go back and DMCA code they now officially own
-
Lord_Nightmare
i don't think the UNLICENSE lets you DO that though
-
Lord_Nightmare
or does it? has it ever been tested in court?
-
JAA
-ot for that discussion please.
-
JAA
mgrandi: Well yeah, not necessarily puns, and I agree that #dick isn't going to end well probably. And yeah, 'failwhale' is definitely too strongly associated with Twitter in my opinion.
-
mgrandi
i dunno, someone pick something whale themed or dock themed since those are the two primary themes
-
kiska
I guess I submit #undock :D
-
kiska
Or #depier
-
mgrandi
undock sounds good
-
mgrandi
on the playstation online store, it seems some of the links are now broken but for now that all games page is still working
-
mgrandi
i'm not too worried about it because this is all probably public information anyway, but getting the internal IDs of the games is what was requested and will be useful for scraping off of googlecache or similar if people want everything
-
kiska
Anyway I'll let you guys get the name of the chat, then you can invite me into it :D
-
purplebot
Deathwatch edited by JustAnotherArchivist (+398, /* 2020 */ Add FurNation.ru) just now --
archiveteam.org/?diff=45706&oldid=45692
-
kiska
Last one from me #shipwrecked
-
JAA
#texascity1947 :-P
-
mgrandi
the last one is obscure and tragic, lets use it
-
jodizzle
JAA: Do you have a particular strategy for uploading youtube channel comments grabbed by the youtube-comments qwarc project to IA? I have some that I'd like to upload, but there are a lot of individual files, so I'm wondering if I should merge them or something
-
JAA
jodizzle: None yet. I have 15.4k from the Joe Rogan scrape, and the plan was to megawarc them. Cf.
hackint.logs.kiska.pw/archiveteam-bs/20200928#c11409
-
jodizzle
Hm okay. My use-case is much smaller, 102 .warc.gz files (half of them the logs). But maybe the same concept applies?
-
jodizzle
Then again, 102 in a single item doesn't sound that bad.
-
JAA
Yeah, 102 is probably fine.
-
jodizzle
Okay, I'll probably do that then.
-
mgrandi
so, who here knows about blaseball
-
mgrandi
I'm considering setting up a as-it-happens scraper for blaseball, there are other sites that have this but they don't seem to have a nice format for the data
-
mgrandi
this is my face when i wrote a html scraper and then i noticed that it has JSON data in the webpage itself :| :| :|
-
nico_32
haha
-
thuban
JAA: est ~2 TB compressed, ~15 uncompressed
-
wickedplayer494
been having trouble trying to put up new files on the wiki, I'm getting thrown "Internal error: Server failed to store temporary file." when attempting to upload a (newer) screenshot to stick on the Amazon page
-
wickedplayer494
anyone else?
-
mgrandi
Maybe aws s3 is having issues?
-
wickedplayer494
been that way for me for at least a week, tried it once or twice some days back and got the same too
-
Jean-Fred
mgrandi: I /think/ several folks from VGPC also scrapped the internal IDs (many/most of them are probably in
serialstation.com although I’m not sure how best/easy it is to query them) and the data JSON blobs that are on each page ; so I’m not too worried about these either. I was worrying about the individual webpages (for the
-
Jean-Fred
Wiki*edia perspective) but you folks hre are the real experts of web archiving so I’ll defer to your level of worriness ^_^
-
mgrandi
yeah, i will be trying to get those, a few of them already seem gone though, but since they are so old its possible that google cache or WBM already has them
-
Jean-Fred
Cool :)
-
Jean-Fred
Also, i compiled a list of all URLs currently used on Wikidata at
bin.privacytools.io/?44aa8d7fc2c777…OPTs4fCw+QeJ9tF1l0dFkw2ZNdUz0NTM0U= − that gs 25K URLs if using all the regional stores URLs − it’s not a lot but hope that helps
-
mgrandi
do you happen to know all the available regions? (aka their url code)
-
mgrandi
@Jean-Fred , that is the only thing i'm missing atm
-
mgrandi
i'm running a wget-at on the enUS urls at least now
-
Jean-Fred
mgrandi This is a list of 75 regional URLs:
justpaste.it/93kgd ; I heard from someone from VGPC that the complete figure is 94 URLs, trying to get that list from them.
-
mgrandi
geez
-
mgrandi
yeah if you can get all those language codes i can generate the URLs for those and get what i can
-
mgrandi
i am noticing that some of the urls don't work with various language codes but work for enUS , hmm
-
Jean-Fred
So a Japanese ID will not work with the French store URL, but a European ID should work with the French store, the German store etc. Is that what you mean or am I misunderstanding ?
-
Jean-Fred
-
mgrandi
Thanks, I'll give those a go when I wake uoz the enUs urls are going now
-
Jean-Fred
Cheers :)
-
Jean-Fred
Hopefully the stores are still around :'(
-
mgrandi
they are somewhere, cause they are still around if you are on the device
-
mgrandi
and theoretically they all should be the same with just translation differences
-
Jean-Fred
I heard the devices talk to a legacy API, not really the website ; and there are some differences between regional stores (typically, the German USK rating will only be on the German-language store, whereas other EU websites will give he PEGI)
-
Jean-Fred
But I don’t want to sound ungrateful − I’m super grateful for your help & work :)
-
mgrandi
yeah, the urls might be in google cache too
-
mgrandi
also shoutouts to the SKU for untitled goose game being `"UP3971-CUSA23079_00-HONKHONKHONKHONK"`
-
Jean-Fred
:D
-
mgrandi
it is something that probably should be kept track of, since there are so few games honestly
-
mgrandi
cause they do remove stuff, obvious examples are P.T. , and smaller stuff like the original Nier:Automata SKU (
ps4database.io/view/CUSA04551_00/NP, `UP0082-CUSA04551_00-ANDROIDS20030612`,) but is replaced by Nier:Automata game of the yorha edition (`UP0082-CUSA04551_00-GOTYORHADIGITAL0`)
-
mgrandi
but yeah, trying various urls, its in various states of broken, i dunno what is new and what is existing broken-ness, but at least we have full urls and the metadata about the games for further search
-
mgrandi
some games go to a blank grey page , others redirect to store.playstation.com, and others work, and i can't seem to find a common thread , their codebase must be a mess lol
-
JAA
thuban: Ah, tiny. Yes, let's grab that.
-
mgrandi
Anyway, bedtime, I'll ping you when I wake up and look at the data and attempt the other store regions
-
Jake
happy to see some work getting done on the PS4 stuff! :)
-
thuban
JAA: home page documents url format; you got it or want me to generate a list?
-
mgrandi
Glad to be of service jake
-
thuban
(i have the space to download / upload directly to ia, but not really the bandwidth)
-
JAA
thuban: I don't have time to look into it at the moment. Maybe in early November.
-
thuban
JAA: ok. (fwiw, with dateutils: `dateseq 2011-02-12T00:00:00 1h now -f '
data.gharchive.org/%Y-%m-%d-%-H.json.gz'`)
-
JAA
Thanks :-)
-
purplebot
Ultraweb.hu edited by Bzc6p (-245, recovered) just now --
archiveteam.org/?diff=45707&oldid=45640
-
OrIdow6
I'm going to suggest that #failwhale be used for DockerHub - already more people there (8) than any other of the suggested channels
-
OrIdow6
Since there's a lot to do there in a short time, and it's best that that starts being worked on
-
JAA
Since most of the key people for a DPoS project aren't in any of the channels yet, the number of users doesn't matter all that much.
-
JAA
Suggestions that have been made: #failwhale #dick #mobydick #undock #depier #shipwrecked #texascity1947
-
JAA
The term 'failwhale' is strongly associated with Twitter, and '#dick' is probably not a great idea.
-
ivan
what's DPoS?
-
JAA
ivan: Distributed Preservation of Service, aka distributed project with tracker and workers/pipelines.
-
ivan
#dpos? :-)
-
JAA
Sometimes also called 'warrior project', but I don't like that term since the large majority of the work doesn't even involve the warrior (VM).
-
JAA
Er no, you misunderstand. We're looking for a name for a DPoS project for Docker Hub.
-
ivan
ah
-
JAA
(General discussions about the backend, code, etc. is normally in -dev.)
-
ivan
#shipwrecked is ok
-
ivan
#overboard
-
ivan
-
ivan
-
purplebot
Coronavirus edited by Wessel1512 (+149, /* Global */) just now --
archiveteam.org/?diff=45708&oldid=45695
-
wessel1512
i like #overboard (as the contaners a trown overboard)
-
wessel1512
-
nico_32
JAA: #leakymess ?
-
nico_32
for dockerhub :)
-
nico_32
framapic.org a image hoster will close
-
nico_32
2021-01-12
-
nico_32
-
nico_32
mid-2021: framasite, framawiki, framabin
-
nico_32
frama.site <= 3872 sites , 2356 wikis et 4731 pages
-
OrIdow6
-
OrIdow6
Someone should add them to Deathwatch, preferably someone with better French than me
-
OrIdow6
(That's the link that nico 32's image came from)
-
OrIdow6
Looks like both the 2020 things have been shut down already, actually
-
arkiver
nico_32: thanks
-
arkiver
I like #failwhale JAA :P
-
arkiver
or did we already have one