-
h2ibot
Pedrosso edited Noob.hu (-8, Site finished closing):
wiki.archiveteam.org/?diff=51673&oldid=48145
-
Ryz
Pedrosso, nice, regarding
wiki.archiveteam.org/?diff=51660&oldid=51657 - there is the other iffy thing that there is user made content produced in games on Steam that doesn't use the Steam Workshop; I do recall one game that had a ton of levels, but because the server shut down, all those usermade levels are gone :C
-
h2ibot
Pokechu22 edited Jira (+7150, /* Status */ convert to table):
wiki.archiveteam.org/?diff=51674&oldid=51672
-
rooter
I just came across this ancient free web host. no AT project for it exists. the page i found is archived on IA but idk how extensive the archival is.
wiki.archiveteam.org/index.php/RootsWeb support.rootsweb.com/s/article/Reti…-and-Migrating-Portions-of-RootsWeb
-
rooter
-
h2ibot
JustAnotherArchivist edited The WARC Ecosystem (+119, /* Tools */ ArchiveWeb.page has fatal flaws):
wiki.archiveteam.org/?diff=51675&oldid=51591
-
JAA
... as expected
-
fireonlive
yeah...
-
JAA
rooter: Hmm, I'm pretty sure we did some RootsWeb stuff before.
-
fireonlive
JAA: is like making a archivebot warc.gz into a warc.zst as simple as like gunzip -> zstd?
-
JAA
...
-
fireonlive
damn
-
fireonlive
that timing
-
fireonlive
(for personal use that is)
-
fireonlive
archivebot's labryinth is another thing
-
JAA
fireonlive: No, WARCs need to be compressed per record to provide random access.
-
fireonlive
ahh ok
-
» fireonlive does not touch
-
JAA
Technically, a .warc.gz that is compressed whole rather than per record is valid per the spec, sadly.
-
JAA
Not a .warc.zst though.
-
fireonlive
>_<
-
fireonlive
got you :)
-
» fireonlive jumps into the fatal flaws :3
-
JAA
.warc.gz predates our messing with this and defining standards. :-P
-
fireonlive
ah, the chromebot thing
-
fireonlive
ah :P
-
JAA
Yeah, crocoite had the same fundamental problem. And also did some other awful things.
-
fireonlive
the AT stresstest
-
JAA
Bad enough that the WARCs were entirely pulled from the WBM.
-
fireonlive
oh wow
-
fireonlive
forgot/missed that bit
-
fireonlive
sadly ArchiveWeb.page shall soldier on as is 🥲
-
fireonlive
'we make bad archives, PRs welcome'
-
JAA
Yeah, ignoring the fact that their entire approach cannot work.
-
fireonlive
yeah... 😬
-
fireonlive
i guess accepting you need to start at 0 is something in of itself
-
fireonlive
but ya know, you're supposed to be archivists not fingerpainters
-
h2ibot
JustAnotherArchivist edited Deathwatch (+286, /* 2024 */ Add Anon.cafe and Oldschooldaw.com):
wiki.archiveteam.org/?diff=51676&oldid=51669
-
h2ibot
Monika edited URLs (+150, Add info about access restriction):
wiki.archiveteam.org/?diff=51677&oldid=51462
-
monika
JAA: should we have a field in the infobox about access-restricted collections?
-
h2ibot
JustAnotherArchivist edited Deathwatch (+389, /* 2024 */ Datetimeify):
wiki.archiveteam.org/?diff=51678&oldid=51676
-
JAA
monika: Not sure a separate field makes sense, but we could add it to the data row.
-
monika
yeah i didn't know what it was called lol
-
monika
not familiar with mediawiki at all
-
JAA
Right, so the infobox is a template with a bunch of fields which gets rendered into those rows (and also causes some category additions etc.). We *could* add a new field to the template and make that produce an extra row or a change to the existing 'data' row, just not sure that's worth the effort.
-
» fireonlive suggests lock emoji, if implemented
-
fireonlive
oh earlier i was looking up a imgur thing that had died (surprisingly not porn), and it had exactly two captures: #// and #imgone
-
fireonlive
grats AT
-
JAA
:-)
-
fireonlive
:)
-
Pedrosso
Ryz: D; You're right, the games will certainly need more investigating rather than solely the workshop
-
Ryz
Yeah, that be the much more difficult part, since it would likely vary game by game :C
-
Pedrosso
Mhm. Also, have the free games been saved?
-
Ryz
I don't exactly know what you mean oo;
-
Pedrosso
like the game files of the free games
-
Pedrosso
(Would there be any problems with having saved those, considering they're free?)
-
Ryz
I don't exactly know how that would be possible to save Steam games like that
-
Pedrosso
Hm? Why?
-
Ryz
I would also like to figure out if we can save itch.io games, or games in other video gaming portals and such~
-
Pedrosso
oh, yes
-
Ryz
I'm not deep into the technical details on how stuff goes, I more or less heavily specialize in finding stuff to archive, and helping existing jobs finish faster in ArchiveBot
-
Pedrosso
Which is a great specialization
-
Ryz
And occasionally poke people in status updates on various projects in Archive Team
-
Ryz
Yeah, I find that more and more that websites and internet content tends to disappear faster than what you would assume, since it's ephemeral and such
-
Pedrosso
I mean
tracker.archiveteam.org/how-to-help/warrior-logo.png is a pretty good representation of what goes on here
-
aquaaaa
Hi, I'm currently trying to save a game forum that will be deleted in couple of months (forum.worldoftanks.eu) using ArchiveBot on a Wayback Machine (archive.org/details/archivebot). The problem I'm facing is that I don't have much of an idea of what I'm doing, I've been told that I can try asking there. Can someone guide me through or maybe there’s some kind of guide or documentation on how to use the bot, I couldn’t f
-
aquaaaa
ind any useful information myself, you guys are my only hope
-
katia
aquaaaa, /join #archivebot and ask there maybe
-
aquaaaa
oh sorry, sure i'll ask there
-
DigitalDragons
oh, I think that one was being waited on until after the read-only date?
-
JAA
Pedrosso: I don't think Steam publicly exposes the game data, even for free games. Might be wrong though.
-
JAA
Yeah, we can safe the World of Tanks forums once it's read-only in two weeks.
-
JAA
save*
-
JAA
Looks like they employ a (custom?) JS challenge. It'll need something custom.
-
Pedrosso
-
Pedrosso
The original list does have a long unsorted list but it has some groups
-
JAA
It does now, but it didn't when the bot still existed.
-
Pedrosso
Ahh. What happened to the bot?
-
JAA
It fell off a cliff or something.
-
Pedrosso
So the whole "Do not edit this table, it is automatically updated by bot. There is a raw list of URLs that you can edit." is just wrong now?
-
JAA
The 'is automatically updated' bit, yes.
-
Pedrosso
So "Do not edit this table" is still accurate?
-
JAA
Yes
-
JAA
The plan is to eventually get rid of the tables, move the lists to appropriate places, and have an external thing that directly integrates with up-to-date AB data. Because a bot updating a wiki page is always going to be fiddly and annoying.
-
h2ibot
-
h2ibot
JustAnotherArchivist changed the user rights of User:Yzqzss
-
fireonlive
mediawiki tables: one of those things they torture you with in hell
-
fireonlive
see also: markdown tables
-
JAA
HTML tables aren't exactly wonderful either, but yeah.
-
fireonlive
ye
-
fireonlive
<csv> element when? :3
-
JAA
But which CSV? ;-)
-
fireonlive
welp
-
fireonlive
:p
-
JAA
Actually, it'd be an opportunity to make a formal specification for CSV.
-
h2ibot
Pokechu22 edited Jira (+871, /* Status */ update list):
wiki.archiveteam.org/?diff=51680&oldid=51674
-
TheTechRobo
There are multiple versions of CSV?
-
TheTechRobo
Whatever happened to "a bunch of values separated by commas"?
-
pokechu22
TheTechRobo: values containing commas happened :)
-
h2ibot
-
nicolas17
what if a value has a comma? surround with quotes? how do you escape quotes? how do you escape the escape character?
-
nicolas17
the answers to those questions are highly inconsistent on different implementations
-
JAA
Or values with line breaks.
-
TheTechRobo
Ahh, that makes sense
-
TheTechRobo
See, most of the CSVs I've used weren't sanitised at all. They were a pain to use.
-
TheTechRobo
:P
-
h2ibot
Pedrosso edited ArchiveBot/Educational institutions/list (+1282, Moved unsorted .se sites to Sweden category):
wiki.archiveteam.org/?diff=51682&oldid=51044