-
pokechu22
Doesn't look too big, so I'll throw it in, though the custom software could cause issues
-
mikolaj|m
thanks
-
pabs
fireonlive: yeah, those are mirrors of various lists of lots of different kinds. there are a few such mirrors, for eg: mail-archive.com marc.info and formerly gmane.org/gmane.io
-
mikolaj|m
pokechu22: if I may ask, what issues could these be?
-
» pabs wonders if forum-dl supports those sorts of archives :)
-
pokechu22
We have a forums ignore set that prevents AB from trying to save stuff like the link to submit a new reply or to log in for various forum software
-
pokechu22
For custom stuff we'd need to come up with a list of those (though some forum software doesn't need it)
-
dave
DPReview is off death row, apparently! Acquired by some other site and will continue.
dpreview.com/site-news/8298318614/d…d-to-a-new-chapter-with-gear-patrol
-
fireonlive
pabs: ah ok :)
-
pabs
arkiver: re Album Archive, not sure as some folks on the HN threads were saying Album Archive also contains things like images uploaded to Blogspot
-
mikolaj|m
pabs: I can't support custom forums without some involved fuzzy logic / AI, the plan for now is to provide a quick way for the user to pass CSS selectors
-
pabs
these are mailing list archives mikolaj|m, but ack
-
mikolaj|m
pabs: ooh, you meant about:blank
-
mikolaj|m
dang!
-
mikolaj|m
-
flashfire42
Well then Dave finally an excuse for them to fix the grabs XD
-
pabs
yeah and mail-archive.com marc.info and formerly gmane.org/gmane.io
-
pabs
those are just mirrors though, the source data is elsewhere
-
fireonlive
theres a list of mailman2 on
wiki.archiveteam.org/index.php/Mailman2 but idk if pabs has tooling for that already
-
mikolaj|m
pabs: spinics is MHonArc, which I haven't implemented yet :( . The rest appears custom
-
fireonlive
not sure whats en vogue these days
-
fireonlive
mailman3?
-
JAA
HyperKitty is the modern one.
-
mikolaj|m
Hyperkitty, which is JS but provides Mbox dumps (though Wikipedia mailing lists have these disabled)
-
JAA
Ft. modern shitty design patterns like requiring JS for trivial things etc.
-
mikolaj|m
yeah I don't like Hyperkitty, Pipermail is clunky but less overengineered
-
pabs
fireonlive: those I am just doing using ArchiveBot, mainly to preserve URL structure in the WBM
-
mikolaj|m
oh and last time I checked, the Hyperkitty Mbox dumps usually aren't uploaded to the Wayback Machine, so they might not be crawled or something
-
fireonlive
ah :)
-
JAA
I'm slightly exaggerating as it is fairly usable even without JS, but yeah, I don't get why they built it like that rather than having a functional interface that's enhanced when you have JS enabled.
-
mikolaj|m
JAA: I disagree, I haven't found it usable without JS at all
-
pabs
I'm surprised it works at all without JS, the modern way to do web stuff is JS-only :)
-
JAA
mikolaj|m: I just tested it on
lists.mailman3.org/archives/list/mailman-users⊙mo and it works okay.
-
JAA
'All Threads' on the right gives a list of recent messages, and each message page has a 'Visit here for a non-javascript version of this page.' link at the bottom.
-
dave
JS enhancement is sadly dead because it's way harder than plugging json blobs into the latest hot react/vue/whatever, and those are happy to fail closed
-
nicolas17
-
dave
everyone broadly agrees that progressive enhancement is a good idea, but in practice the only place I see it actually done is to compensate for screen size, because if your stuff is unusable on mobile you lose money
-
dave
sadly nobody loses (enough) money from people wanting to browse without js, or archive stuff :(
-
JAA
... which should be done with CSS media queries instead. :-P
-
fireonlive
nicolas17: T_T
-
JAA
nicolas17: Yup, that sounds about right. :-/
-
mikolaj|m
JAA: not all Hyperkitty instances I've found have this "Visit here for a non-javascript version of this page."
-
JAA
Ah
-
pabs
pseudorizer: is anyone aware of mailing list Message-Id lookup services other than these?
transfer.archivete.am/14Iekp/msgid-search-services.txt
-
pabs
er, woops, sorry pseudorizer. that was meant to be a PS: :)
-
mikolaj|m
-
mikolaj|m
might be because of an older version or something
-
JAA
mikolaj|m: Indeed, looks like they're running an older version of HyperKitty and it was only recently added. :-|
-
JAA
-
JAA
> Add the ability to view a thread without Javascript enabled. This uses the same mechanism we use with bot-detection and rendering of the entire page at once, which will be slow to load but allow reading.
-
JAA
> slow to load
-
fireonlive
no one check the security fixes between versions
-
fireonlive
:3
-
fireonlive
js-hell is 1000% faster than some html dont cha know
-
JAA
Rather, let's load jQuery and jQuery UI to render some text. That'll be faster!
-
fireonlive
:D totally
-
JAA
A quick test on mail.python.org loads ~570 kB of JS. Lovely.
-
dave
that's kinda light by today's standards!
-
» JAA now playing: Metallica - Sad But True
-
dave
it didn't even download an entire wasm runtime for a completely different language's VM
-
mikolaj|m
I wish HTML had some semantic tags for pagination. I think that could resolve a lot of cases where JS is used
-
JAA
You mean like <link rel="next" ...>?
-
JAA
Can also be used on <a> but is rarer there.
-
mikolaj|m
JAA: oh yeah, I forgot about that
-
mikolaj|m
except that it's not very often used, and I don't know if it's even standardized?
-
mikolaj|m
and moreover, sometimes it's used oddly
-
JAA
It is standardised, and it is used a fair bit. Certainly not ubiquitous though.
-
mikolaj|m
-
nicolas17
I recently wrote
data.nicolas17.xyz/pallas-explorer.html and it very much requires JS but I'm proud of how many third-party libraries I used
-
mikolaj|m
it points to the next thread, not to the next thread page
-
JAA
Also, HTML5 allows custom elements, so you could have <pagination><a rel="prev" href="?page=2">Prevous page</a> ...</pagination.
-
JAA
But yeah, that wouldn't be standardised.
-
dave
modern HTML, especially with additions like ARIA, is surprisingly expressive by itself these days
-
fireonlive
-
fireonlive
5.3MB transferred for new reddit; 12.3MB uncompressed?
-
nicolas17
-
imer
thats pretty funny
-
nicolas17
imer: it's the framework I used in the above pallas-explorer page :3
-
fireonlive
ahh vanilla js
-
fireonlive
:3
-
h2ibot
Leo60228 edited Reddit (+326, Mention possible mass-deletion by admins):
wiki.archiveteam.org/?diff=49986&oldid=49950
-
h2ibot
MasterX244 edited Reddit (+55, Formatting fix, addition was past the…):
wiki.archiveteam.org/?diff=49987&oldid=49986
-
masterx244|m
(+55 since i added a more obvious "stopper" comment over the easy to miss strike tag)
-
pabs
-
rktk
:(
-
rktk
It looks like there's a huge amount of new spam Lemmy accounts and instances being created, pushing the total user count past 1 million
-
rktk
i wish linkedin would burn to the ground
-
h2ibot
Nemo bis edited Miraheze (+149, more like 3k; not sure what happens to the…):
wiki.archiveteam.org/?diff=49988&oldid=49977
-
nicolas17
how's the capacity now that reddit and lineblog are doing retries?
-
nicolas17
should we unpause imgur?
-
nicolas17
hm although reddit is sustaining 10GiB/min anyway
-
arkiver
yes i'm unpausing imgur
-
fireonlive
have all the lists been dumped on reddit?
-
arkiver
still one list with image URLs from masterx244|m but need to look better into that
-
fireonlive
ah ok
-
fireonlive
nice we got through the huge backlog though!
-
flashfire42
Time to switch to archiveteam choice and head to work
-
h2ibot
-
fireonlive
good bot!