-
vokunal|m
I saw a post on reddit about Romhacking.net removing the homebrew section in their next update. I assume the mod who replied is one of us, if not Ark!ver, which case, not doing a project for it, but could we throw some of it into AB?
-
vokunal|m
At the very least, here's a list of links. their site seems to be /homebrew/[number], and the highest i see that leads to anything is 184, so there they are.
-
vokunal|m
-
vokunal|m
also is u/-Archivist = Ark!ver?
-
JAA
No, different person, though they used to show up here occasionally.
-
vokunal|m
ah cool
-
JAA
FWIW, I archived romhacking.net's downloads in 2018.
-
JAA
Er, 2017
-
h2ibot
Wickedplayer494 edited Twitch.tv (+368, /* Known exceptions */ + Blizzard, Brawlhalla,…):
wiki.archiveteam.org/?diff=51092&oldid=50572
-
JAA
The downloads require special trickery because of the 'human verification' interstitial page.
-
vokunal|m
Ok that's great
-
JAA
Looks like I got homebrews up to 91.
-
vokunal|m
I downloaded one file, realized the captcha didn't show up for the next three, then all of a sudden all the download urls lead in a loop to the page I was already on
-
joepie91|m
can jdownloader not deal with it?
-
JAA
I had to monkey-patch wpull at the time.
-
JAA
joepie91|m: With WARCs?
-
joepie91|m
hm, good point. that'd probably require weird warc proxy hackery
-
JAA
And even then, it might not work in the WBM later depending on what JDownloader requests exactly.
-
joepie91|m
mainly jdownloader seems to have gotten scarily good at dealing with anything that contains a captcha
-
JAA
vokunal|m: Link to the Reddit post?
-
vokunal|m
-
JAA
Of course, my code from 2018 no longer works because RHDN changed their captchas.
-
JAA
(2017 and 2018 are both correct; I grabbed the downloads twice apparently.)
-
nicolas17
the distribution of iTunes content IDs is weird
-
nicolas17
I'm putting 100k scraped IDs into each tarball, most IDs return 404 and I don't write a file at all, this is how many actual items are in each 100k batch:
data.nicolas17.xyz/items-per-file.txt
-
nicolas17
(some gaps are because I didn't scrape that range of IDs yet, some are because zero items were found and my code was failing to create an empty tarball)
-
nicolas17
I tried some between 17150xxxxx and 64400xxxxx... some return no results, others return 400 Bad Request
-
JAA
I have a WARC here that fails to parse correctly, but when I parse it in overlapping chunks, everything works fine. (╯°□°)╯︵ ┻━┻
-
JAA
`head -c 4000082837 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → fine
-
JAA
`head -c 5000020550 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → crash
-
JAA
`tail -c+4000082838 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → fine
-
JAA
> Copying 18446744073709551613 bytes to stdout
-
JAA
oh no
-
h2ibot
JustAnotherArchivist created Elections/2023 Swiss federal election (+1684, Created page with "== Data == Information on…):
wiki.archiveteam.org/?title=Electio…s/2023%20Swiss%20federal%20election
-
h2ibot
JustAnotherArchivist edited Deathwatch (+305, Dead sites are dead, CodingForum got a new…):
wiki.archiveteam.org/?diff=51094&oldid=51091
-
thuban
the codingforum admins never responded to my inquiry about whitelisting us for cloudflare, fwiw
-
fireonlive
:(
-
h2ibot
Manu edited Recurring Events/Hacker Conferences (+467, Add ShmooCon):
wiki.archiveteam.org/?diff=51095&oldid=51079
-
h2ibot
Manu edited Recurring Events/Hacker Conferences (+92):
wiki.archiveteam.org/?diff=51096&oldid=51095
-
Naruyoko
twitter.com/nwata1122/status/1720643565687296096 note.com/nwata1122/n/n41cf1c4df298 Do archivists want to archive a person's online activities? I'm not sure how some may think archiving could be some kind of privacy-violation (To be clear, I'm not arguing for it, but I just don't know how individuals are handled in archives). Enuwata, a Ring Fit Adventures speedrunner, has said they will delete all online
-
Naruyoko
accounts on 2023-11-30. This includes YouTube, Twitch, Nicovideo, X, Note, and their Discord servers. Their final stream will be on 2023-11-23.
-
eggdrop
-
lilirose
can someone help me find an archive of this video ? it has been terminated from youtube
youtube.com/watch?v=COxz8hvl14Y
-
JAA
lilirose: You asked in the right place the first time. It might take a while until someone has time to retrieve it from storage.
-
lilirose
sorry i thought they were different
-
JAA
Naruyoko: Yes, we do. Thank you!
-
masterX244
forum.brickset.com/discussions closing down later today, managed to pull a full mirror though. got to juggle some files around since there are a few "update crawls" and those need to go to the archive, too
-
angelnumber1111
can anyone help find me an archive of this video ? thanks
youtube.com/watch?v=-s2rZYshumw
-
nicolas17
not if you leave immediately
-
h2ibot
JustAnotherArchivist edited Deathwatch (+401, /* 2023 */ Add Brickset Forums):
wiki.archiveteam.org/?diff=51097&oldid=51094
-
moocha
hey ! can someone help me find the archive of this video, thanks in advance
youtube.com/watch?v=V3gbrP2U10A
-
nicolas17
what's going on
-
nicolas17
where was this channel linked? third person asking for youtube archives :D
-
JAA
Single person, three nick names
-
nicolas17
oh I didn't notice the IPs, I thought the first one was different
-
nicolas17
wonder what the videos are
-
project10
"moocha" aptly named
-
Barto
definitely the same ip
-
fireonlive
we're being invaded!