01:26:35 ah yes the wayback machine the thing that google doesn't link to in search results
02:12:46 how to archive a github repo?
02:13:25 nicolas17: ask in #gitgud
03:23:27 nicolas17: and save it to Software Heritage using their Save Code Now form, or API
03:24:38 seems SH has it and it's up to date
03:27:32 curl -D /dev/tty -X POST "https://archive.softwareheritage.org/api/1/origin/save/git/url/$url/" | jq
03:28:08 ah good, they auto-update all of GitHub and other things on https://archive.softwareheritage.org/coverage/
08:14:36 Hi archiveteam, I trying to decompress warc.zst file which I have downloaded to my local. May I know where can I find the DICT for it? Thanks
08:15:01 I wish softwareheritage had a version of snapshot.debian.org that actually works
08:17:21 they imported all of snapshot.debian.org already
08:18:39 how do I access it? see eg https://lists.debian.org/debian-devel/2023/08/msg00014.html
08:20:49 Girish: it's in a skippable frame at the beginning of the file http://iipc.github.io/warc-specifications/specifications/warc-zstd/
08:21:15 you will probably find https://gitea.arpa.li/JustAnotherArchivist/little-things/src/branch/master/zstdwarccat helpful
08:25:08 kpcyrd: I think via the usual archive.softwareheritage.org site
08:25:33 it of course doesn't contain any binary packages
08:27:11 Thank thuban I tried the zstdwarccat. It does stdout. I was looking for organized folders of each url ...
08:28:35 The actual archive file is a megawarc.warc.zst
08:30:03 the output of zstdwarccat is the uncompressed warc file. you can then use another tool to extract the contents
08:32:28 (https://wiki.archiveteam.org/index.php/The_WARC_Ecosystem)
08:38:02 thuban: Does zstdwarccat create any warc files or is it just stdout?
08:39:34 it's just stdout, so you can use a shell redirect: `zstdwarccat input.warc.zst > output.warc`
09:43:20 pabs: idk, seems incomplete 🤷 https://archive.softwareheritage.org/browse/search/?q=gcc_12.2.0-3_amd64.deb&with_visit=true&with_content=true&search_metadata=true
09:43:46 it of course doesn't contain any binary packages
09:44:08 what did they import then?
09:44:19 source packages
09:49:01 not a snapshot.debian.org replacement then. There's also https://snapshot-cloudflare.debian.org/ but unless you're lucky enough to get a cache hit you're still stuck with the 504 prone snapshot service
09:49:36 essentially if you can't pull the file from snapshot.debian.org yourself, cloudflare won't be able to either
09:57:28 right
10:05:30 IIRC the service needs these:
10:05:35 1) people to care about improving it instead of working around its current limitations
10:05:41 2) the Debian sysadmins to have time to complete the in-progress migration of the primary replica to newer hardware
10:05:46 3) a Debian team for the service, so the Debian sysadmins can just do hardware/OS
10:05:51 4) more replicas (152TB + growth) to meet the demand on the service
10:05:53 5) probably architecture and hardware upgrades
10:11:10 oh, and 6) the Debian sysadmins need to fix the failing proprietary backup system the primary replica uses
10:53:43 Yts98 edited Xuite (+524, Add smallpaint): https://wiki.archiveteam.org/?diff=50452&oldid=50431
12:41:35 "The Future of the Vim Project" https://groups.google.com/g/vim_dev/c/dq9Wu5jqVTw https://news.ycombinator.com/item?id=37074452
13:04:06 +1 for transparency
16:37:33 Could maybe be a good idea to grab all the vim mail lists if possible.
16:44:40 https://www.vim.org/maillist.php
16:46:09 already done
16:46:17 the google based ones at least
16:46:21 well thats awesome
18:00:21 rewby: since I only see rewby|backup in #deadcat i'll post this here
18:00:26 can we please have a target for gfycat?
18:00:33 this would be
18:00:37 archiveteam_gfycat_
18:00:40 gfycat_
18:00:45 Archive Team Gfycat:
19:36:23 TheTechRobo edited Periscope (-3, There are still items, but "Tracker rate…): https://wiki.archiveteam.org/?diff=50453&oldid=50291
19:44:24 TheTechRobo edited ArchiveTeam Warrior (-130, /* Warrior architecture and alternatives */…): https://wiki.archiveteam.org/?diff=50454&oldid=50367
19:45:25 TheTechRobo edited ArchiveTeam Warrior (-591, /* Can I use whatever internet access for the…): https://wiki.archiveteam.org/?diff=50455&oldid=50454
19:45:42 * fireonlive edits the TheTechRobo
19:46:28 maybe we should also note that if your ISP blocks custom DNS, the projects won't work, but I'm not sure what the error message is so idk what to add to FAQ
19:46:57 in my fuzzy memory it isn't to descript
19:47:07 too*
23:52:17 any chance of a warrior project for Webs? or is there too many stuff for a warrior project for that atm
23:53:08 flashfire42: https://wiki.archiveteam.org/index.php/Webs
23:53:11 we already did a bunch
23:53:14 i'm not sure how much
23:53:35 I mean its definite again
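
Note on the 08:14:36 to 08:39:34 exchange about `.warc.zst` files: the answer given there is that the dictionary sits in a skippable frame at the beginning of the file. A minimal Python sketch of locating that frame follows. It is not taken from zstdwarccat. The function name `read_dictionary_frame` is hypothetical, and the magic numbers are assumptions based on a reading of the linked warc-zstd specification and the Zstandard format, so check them against the spec before relying on them.

```python
import struct

# Assumed constants (see the warc-zstd spec linked in the log):
SKIPPABLE_MAGIC = 0x184D2A5D             # skippable frame holding the dictionary
ZSTD_FRAME_MAGIC = b"\x28\xb5\x2f\xfd"   # payload is itself a zstd frame
ZSTD_DICT_MAGIC = b"\x37\xa4\x30\xec"    # payload is a raw zstd dictionary


def read_dictionary_frame(data: bytes):
    """Return (payload, kind) for a leading dictionary frame.

    Returns (None, None) when the data does not start with the
    expected skippable frame.
    """
    if len(data) < 8:
        return None, None
    # Frame header: 4-byte magic, then 4-byte payload size, both little-endian.
    magic, size = struct.unpack_from("<II", data, 0)
    if magic != SKIPPABLE_MAGIC:
        return None, None
    payload = data[8:8 + size]
    if len(payload) != size:
        raise ValueError("truncated dictionary frame")
    if payload.startswith(ZSTD_FRAME_MAGIC):
        kind = "zstd-compressed"
    elif payload.startswith(ZSTD_DICT_MAGIC):
        kind = "raw"
    else:
        kind = "unknown"
    return payload, kind
```

On a real file, only the first 8 bytes plus the declared payload size need to be read. A payload reported as "zstd-compressed" would have to be decompressed before it can serve as a dictionary, and the rest of the file is then decoded with that dictionary (for example with `zstd -D`). As noted at 08:39:34, zstdwarccat already does all of this and writes the uncompressed WARC to stdout.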