-
xarph_
ah yes the wayback machine the thing that google doesn't link to in search results
-
nicolas17
how to archive a github repo?
-
thuban
nicolas17: ask in #gitgud
-
pabs
nicolas17: and save it to Software Heritage using their Save Code Now form, or API
-
nicolas17
seems SH has it and it's up to date
-
pabs
ah good, they auto-update all of GitHub and other things on
archive.softwareheritage.org/coverage
-
Girish
Hi archiveteam, I'm trying to decompress a warc.zst file that I have downloaded to my local machine. Where can I find the dictionary (DICT) for it? Thanks
-
kpcyrd
I wish softwareheritage had a version of snapshot.debian.org that actually works
-
pabs
they imported all of snapshot.debian.org already
-
thuban
Girish: it's in a skippable frame at the beginning of the file
iipc.github.io/warc-specifications/specifications/warc-zstd
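A minimal sketch of what thuban describes, assuming the layout in the linked warc-zstd specification: the file starts with a skippable frame (magic 0x184D2A5D followed by a 4-byte little-endian length) whose payload is the dictionary, and that payload may itself be Zstandard-compressed. The function name is hypothetical. Only the standard library is used, so a compressed dictionary is returned as-is for a zstd library to decompress.

```python
import struct

# Assumed from the warc-zstd spec: skippable-frame magic used for the dictionary.
DICT_FRAME_MAGIC = 0x184D2A5D
# Magic number of an ordinary Zstandard frame.
ZSTD_FRAME_MAGIC = 0xFD2FB528


def read_embedded_dictionary(data):
    """Return (payload, is_compressed) for the dictionary frame at the start of data.

    Hypothetical helper: `data` is the leading bytes of a .warc.zst file.
    """
    if len(data) < 8:
        raise ValueError("too short to contain a skippable frame header")
    # Header is two little-endian 32-bit integers: magic, then payload size.
    magic, size = struct.unpack_from("<II", data, 0)
    if magic != DICT_FRAME_MAGIC:
        raise ValueError("file does not start with a dictionary frame")
    payload = data[8:8 + size]
    if len(payload) != size:
        raise ValueError("truncated dictionary frame")
    # If the payload starts with the zstd frame magic, it still needs decompressing.
    is_compressed = (
        len(payload) >= 4
        and struct.unpack_from("<I", payload, 0)[0] == ZSTD_FRAME_MAGIC
    )
    return payload, is_compressed
```

For a large megawarc it should be enough to pass only the first few megabytes of the file, as long as that covers the whole dictionary frame.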
-
pabs
kpcyrd: I think via the usual archive.softwareheritage.org site
-
pabs
it of course doesn't contain any binary packages
-
Girish
Thanks thuban, I tried zstdwarccat. It writes to stdout. I was looking for organized folders for each URL ...
-
Girish
The actual archive file is a megawarc.warc.zst
-
thuban
the output of zstdwarccat is the uncompressed warc file. you can then use another tool to extract the contents
-
Girish
thuban: Does zstdwarccat create any warc files or is it just stdout?
-
thuban
it's just stdout, so you can use a shell redirect: `zstdwarccat input.warc.zst > output.warc`
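Once `output.warc` exists, its records can be walked one by one. The sketch below is not the tool thuban had in mind; it is a hand-rolled reader, assuming the standard WARC layout (a `WARC/` version line, `Name: value` headers, a blank line, then `Content-Length` bytes of block). The function names are hypothetical, folded header lines are not handled, and a dedicated library such as warcio is likely a better choice for real use.

```python
import io


def iter_warc_records(stream):
    """Yield (headers, block) for each record in an uncompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if line in (b"\r\n", b"\n"):
            continue  # blank lines that separate records
        if not line.startswith(b"WARC/"):
            raise ValueError("expected a WARC version line, got %r" % line[:20])
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the header section
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        # The block is exactly Content-Length bytes; for a response record
        # it holds the HTTP status line, HTTP headers, and body.
        block = stream.read(int(headers["content-length"]))
        yield headers, block


def response_urls(stream):
    """Return the target URI of every response record in the stream."""
    return [
        headers["warc-target-uri"]
        for headers, _ in iter_warc_records(stream)
        if headers.get("warc-type") == "response"
    ]
```

Opening the file with `open("output.warc", "rb")` and passing it to `response_urls` lists what was captured. Writing each block into a per-URL folder would build on `iter_warc_records`, with the URL-to-path mapping left to the reader.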
-
pabs
<pabs> it of course doesn't contain any binary packages
-
kpcyrd
what did they import then?
-
pabs
source packages
-
kpcyrd
not a snapshot.debian.org replacement then. There's also
snapshot-cloudflare.debian.org, but unless you're lucky enough to get a cache hit you're still stuck with the 504-prone snapshot service
-
kpcyrd
essentially if you can't pull the file from snapshot.debian.org yourself, cloudflare won't be able to either
-
pabs
right
-
pabs
IIRC the service needs these:
-
pabs
1) people to care about improving it instead of working around its current limitations
-
pabs
2) the Debian sysadmins to have time to complete the in-progress migration of the primary replica to newer hardware
-
pabs
3) a Debian team for the service, so the Debian sysadmins can just do hardware/OS
-
pabs
4) more replicas (152TB + growth) to meet the demand on the service
-
pabs
5) probably architecture and hardware upgrades
-
pabs
oh, and 6) the Debian sysadmins need to fix the failing proprietary backup system the primary replica uses
-
Barto
+1 for transparency
-
that_lurker
It might be a good idea to grab all the Vim mailing lists if possible.
-
pabs
already done
-
pabs
the google based ones at least
-
that_lurker
well that's awesome
-
arkiver
rewby: since I only see rewby|backup in #deadcat i'll post this here
-
arkiver
can we please have a target for gfycat?
-
arkiver
this would be
-
arkiver
archiveteam_gfycat_
-
arkiver
gfycat_
-
arkiver
Archive Team Gfycat:
-
h2ibot
TheTechRobo edited Periscope (-3, There are still items, but "Tracker rate…):
wiki.archiveteam.org/?diff=50453&oldid=50291
-
h2ibot
TheTechRobo edited ArchiveTeam Warrior (-130, /* Warrior architecture and alternatives */…):
wiki.archiveteam.org/?diff=50454&oldid=50367
-
h2ibot
TheTechRobo edited ArchiveTeam Warrior (-591, /* Can I use whatever internet access for the…):
wiki.archiveteam.org/?diff=50455&oldid=50454
-
» fireonlive edits the TheTechRobo
-
TheTechRobo
maybe we should also note that if your ISP blocks custom DNS, the projects won't work, but I'm not sure what the error message is so idk what to add to FAQ
-
fireonlive
in my fuzzy memory it isn't too descriptive
-
flashfire42
any chance of a warrior project for Webs? or is there too much stuff for a warrior project for that atm
-
TheTechRobo
we already did a bunch
-
TheTechRobo
i'm not sure how much
-
flashfire42
I mean it's definite again