-
ivan
bleb: I've been doing that with single-file-cli
-
ivan
I haven't integrated my
github.com/ludios/expand-everything with it yet but that's next for me
-
ivan
bleb: the major issue with single-file-cli is that either it removes <audio> or <video> by default, or if you leave them in, the files can be pretty large. maybe it's worth immediately deriving files with just text
-
ivan
so you end up with a thing that works pretty well for maybe 98% of HTML pages but then you start wanting some of WARC again when non-HTML or large embeds come into play
-
ivan
btw: --browser-executable-path google-chrome-unstable --browser-width 1920 --browser-height 32767 --save-original-urls --block-videos false --block-audios false
-
ivan
the nice parts about these single file DOM captures is that they load quickly and linearly, they can be validated by a human immediately for completeness (assuming you didn't neglect to consider some behavior e.g. Twitter link previews being broken), and you can have high confidence that they'll load properly in the future
-
fireonlive
interesting...
-
fireonlive
:)
-
» fireonlive takes notes
-
ivan
bleb: this is how it might be integrated into your own thing if you need ad blockers and stuff single-file-cli doesn't give you
github.com/gildas-lormeau/SingleFil…ode-in-%22custom%22-environments%3F
-
h3ndr1k
-
fireonlive
ooh, thaks :D
-
JAA
I was wondering how long it'd take until someone mentions it. :-P
-
fireonlive
adding it to the wiki now >_>
-
fireonlive
JAA: my alerts go off later in the day >:(
-
h3ndr1k
It appeared in my RSS
-
fireonlive
congrats y'all on being interviewed :)
-
JAA
-
h2ibot
FireonLive edited In The Media (+185, Add Netzpolitik interview :)):
wiki.archiveteam.org/?diff=50295&oldid=49782
-
h2ibot
Switchnode edited Telegram (+791, update bot documentation):
wiki.archiveteam.org/?diff=50296&oldid=50097
-
fireonlive
hmm not sure how to represent two versions there
-
fireonlive
JAA: so did this interview take place over IRC
-
JAA
Just two entries is fine I think.
-
fireonlive
kk
-
fireonlive
i'll add it under
-
fireonlive
the eng one :p
-
JAA
It was technically published a few seconds earlier anyway. :-P
-
fireonlive
>:3
-
fireonlive
i guess it'll be a good test of my alert which should happen in about... 34 mins
-
JAA
Yes, the interview was on IRC.
-
h2ibot
FireonLive edited In The Media (+206, Add German version):
wiki.archiveteam.org/?diff=50297&oldid=50295
-
fireonlive
JAA: good to hear!
-
h2ibot
Switchnode edited Telegram (+126, /* Notable channels */ link etherpad):
wiki.archiveteam.org/?diff=50298&oldid=50296
-
fireonlive
"Or, as our little tagline goes, „We will rescue more of your shit“."
-
fireonlive
=]
-
tzt
-
tzt
"Vistaprint has made the decision to shut down our Webs.com operation and we will no longer be hosting your WEBS site and it will be shut down on August 31, 2023."
-
tzt
i know there was a project for this 2 years ago but i don't know what exactly happened there
-
thuban
wiki.archiveteam.org/index.php/Webs this was when they were bought by vistaprint
-
h2ibot
JAABot edited Main Page/In The Media (-41):
wiki.archiveteam.org/?diff=50299&oldid=49787
-
fireonlive
ohhhh
-
JAA
It's been brought up before, but it looks like it didn't make it to the wiki.
-
fireonlive
i thought it was a cache thing.. i even tried the purge cache thing. shoulda checked the source lol
-
JAA
In any case, #webbed still exists. :-)
-
h2ibot
Switchnode edited Deathwatch (+110, update webs):
wiki.archiveteam.org/?diff=50300&oldid=50257
-
thuban
what's the status on the august 1 deadlines, by the way?
-
thuban
-
thuban
- silph road appears to have taken down website content already but has some kind of inarticulate plans for backup; we might want to reach out to them on reddit
old.reddit.com/r/TheSilphRoad/comme…raordinary_years/jjwh93k/?context=1
-
thuban
- what is happening with wysp? project has been paused with some issue since the 24th. OrIdow6? (#wyspedaway)
-
flashfire42
thuban I ran mudrunners and was gonna do some more jobs for it. Silphroad I did some runs on AB as well
-
thuban
ty. silphroad jobs were 15 july, right? i think a lot of articles/resources went down on 12 may (same day as the announcement), so probably still good to talk to them