00:26:23 I saw a post on reddit about Romhacking.net removing the homebrew section in their next update. I assume the mod who replied is one of us, if not Ark!ver, which case, not doing a project for it, but could we throw some of it into AB? 00:26:23 At the very least, here's a list of links. their site seems to be /homebrew/[number], and the highest i see that leads to anything is 184, so there they are. 00:26:23 https://transfer.archivete.am/73Opw/romhackingdotcom-homebrew-stuff.txt 00:27:30 also is u/-Archivist = Ark!ver? 00:27:54 No, different person, though they used to show up here occasionally. 00:28:08 ah cool 00:30:02 FWIW, I archived romhacking.net's downloads in 2018. 00:30:20 Er, 2017 00:30:33 Wickedplayer494 edited Twitch.tv (+368, /* Known exceptions */ + Blizzard, Brawlhalla,…): https://wiki.archiveteam.org/?diff=51092&oldid=50572 00:31:03 The downloads require special trickery because of the 'human verification' interstitial page. 00:31:46 Ok that's great 00:32:30 Looks like I got homebrews up to 91. 00:32:45 I downloaded one file, realized the captcha didn't show up for the next three, then all of a sudden all the download urls lead in a loop to the page I was already on 00:33:18 can jdownloader not deal with it? 00:33:42 I had to monkey-patch wpull at the time. 00:33:55 joepie91|m: With WARCs? 00:34:08 hm, good point. that'd probably require weird warc proxy hackery 00:34:30 And even then, it might not work in the WBM later depending on what JDownloader requests exactly. 00:34:30 mainly jdownloader seems to have gotten scarily good at dealing with anything that contains a captcha 00:37:07 vokunal|m: Link to the Reddit post? 00:37:41 https://www.reddit.com/r/DataHoarder/comments/17mlb0v/romhackingdotnet_says_they_will_likely_be/ 00:52:49 Of course, my code from 2018 no longer works because RHDN changed their captchas. 00:53:10 (2017 and 2018 are both correct; I grabbed the downloads twice apparently.) 01:49:00 the distribution of iTunes content IDs is weird 01:50:13 I'm putting 100k scraped IDs into each tarball, most IDs return 404 and I don't write a file at all, this is how many actual items are in each 100k batch: https://data.nicolas17.xyz/items-per-file.txt 01:51:19 (some gaps are because I didn't scrape that range of IDs yet, some are because zero items were found and my code was failing to create an empty tarball) 02:03:24 I tried some between 17150xxxxx and 64400xxxxx... some return no results, others return 400 Bad Request 02:34:58 I have a WARC here that fails to parse correctly, but when I parse it in overlapping chunks, everything works fine. (╯°□°)╯︵ ┻━┻ 02:39:39 `head -c 4000082837 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → fine 02:39:46 `head -c 5000020550 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → crash 02:39:54 `tail -c+4000082838 3dtotal.com-inf-20231027-213047-ek2hw-00018.warc.gz | ...` → fine 03:36:13 > Copying 18446744073709551613 bytes to stdout 03:36:20 oh no 05:09:33 JustAnotherArchivist created Elections/2023 Swiss federal election (+1684, Created page with "== Data == Information on…): https://wiki.archiveteam.org/?title=Elections/2023%20Swiss%20federal%20election 06:16:48 JustAnotherArchivist edited Deathwatch (+305, Dead sites are dead, CodingForum got a new…): https://wiki.archiveteam.org/?diff=51094&oldid=51091 07:00:55 the codingforum admins never responded to my inquiry about whitelisting us for cloudflare, fwiw 07:01:33 :( 09:21:28 Manu edited Recurring Events/Hacker Conferences (+467, Add ShmooCon): https://wiki.archiveteam.org/?diff=51095&oldid=51079 09:32:31 Manu edited Recurring Events/Hacker Conferences (+92): https://wiki.archiveteam.org/?diff=51096&oldid=51095 21:43:21 https://twitter.com/nwata1122/status/1720643565687296096 https://note.com/nwata1122/n/n41cf1c4df298 Do archivists want to archive a person's online activities? I'm not sure how some may think archiving could be some kind of privacy-violation (To be clear, I'm not arguing for it, but I just don't know how individuals are handled in archives). Enuwata, a Ring Fit Adventures speedrunner, has said they will delete all online 21:43:22 accounts on 2023-11-30. This includes YouTube, Twitch, Nicovideo, X, Note, and their Discord servers. Their final stream will be on 2023-11-23. 21:43:22 nitter: https://nitter.net/nwata1122/status/1720643565687296096 22:01:05 can someone help me find an archive of this video ? it has been terminated from youtube https://www.youtube.com/watch?v=COxz8hvl14Y 22:01:47 lilirose: You asked in the right place the first time. It might take a while until someone has time to retrieve it from storage. 22:02:30 sorry i thought they were different 22:06:40 Naruyoko: Yes, we do. Thank you! 22:26:16 https://forum.brickset.com/discussions closing down later today, managed to pull a full mirror though. got to juggle some files around since there are a few "update crawls" and those need to go to the archive, too 22:34:42 can anyone help find me an archive of this video ? thanks https://www.youtube.com/watch?v=-s2rZYshumw 22:35:47 not if you leave immediately 22:37:37 JustAnotherArchivist edited Deathwatch (+401, /* 2023 */ Add Brickset Forums): https://wiki.archiveteam.org/?diff=51097&oldid=51094 22:49:01 hey ! can someone help me find the archive of this video, thanks in advance https://www.youtube.com/watch?v=V3gbrP2U10A 22:51:44 what's going on 22:52:28 where was this channel linked? third person asking for youtube archives :D 22:52:58 Single person, three nick names 22:53:36 oh I didn't notice the IPs, I thought the first one was different 22:53:46 wonder what the videos are 23:04:56 "moocha" aptly named 23:13:02 definitely the same ip 23:30:49 we're being invaded!