-
pabsarkiver: ping re the urls-sources PRs. apart from mine (blog aggregators, deletionists, lobste.rs), there is one for Ukranian sites and one for crypto news sites
-
datechnomanI think I did the cryptosites one back months ago
-
pabshmm, datechnoman's PR isn't merged though
-
sdomiyo! there's a rumor that Mellanox drivers, firmware and tools are gonna go down behind a paywall soon. I can mirror all of them myself, but I'd like for them to be on the web archive; some of the downloads are supposedly behind an EULA accept, some are just plain URLs
-
sdomi
-
sdomiI can write a script that'd just call the save API on the wayback machine, but I think it'd be better if it was coordinated :) does anyone wanna help?
-
imersdomi: probably more a job for AB, eula accept wall might be an issue depending on how that works?
-
imerwhat i've done previously (for driver sites) was run the main page through SPN and then collect the download links manually for running through #archivebot since SPN didn't catch most of them
-
imerlooks like it's JS hell though, with post requests, for network.nvidia.com/support/firmware/connectx4en -> downloaders.azurewebsites.net/downl…ectx4en_downloader/downloader3.html for example
-
imerso it won't replay in the WBM currently anyways :(
-
imermight be better to move this discussion to #archiveteam-bs as well
-
sdomiimer: should I paste my messages there?
-
sdomieh, i guess that everyone from here is also there, nvm
-
fireonlive
-
h2ibotfireonlive: Deduplicating and queuing 2229 items. (for 'transfer.archivete.am/JJov0/discord…205500330631168-discordapp-urls.txt')
-
h2ibotfireonlive: Deduplicated and queued 2229 items. (for 'transfer.archivete.am/JJov0/discord…205500330631168-discordapp-urls.txt')