-
JAA
So I thought a bit more about this irc_abandoned thing. I did that in the infobox template so far, but it's probably better to integrate it directly into the Template:IRC so it can also be used on other pages, e.g.
wiki.archiveteam.org/index.php/Warrior_projects . The most sensible approach is probably to have an inline mode which puts the former channel name in the title/mouseover text
-
JAA
(possibly with a <sup>[note]</sup> or similar) and a <br>+<small> mode like in the screenshots from last night. Any thoughts on this?
-
OrIdow6
JAA: I suppose because it could make it sounds like the de facto DDoS that occasionally happens is intentional
-
OrIdow6
But it's minor and apparently it's already in use anyway
-
OrIdow6
Ryz: What has been done on 4players so far?
-
OrIdow6
I see that the downloads seem to be simple and numeric, could that work with an !ao? E.g.
4players.de/4players.php/download_info/Downloads/Download/58312.html
-
Ryz
I tossed hopefully all of the domains into ArchiveBot, OrIdow6
-
Ryz
Problem is, unsure if it'll finish at the end since it's not going as fast as I thought
-
OrIdow6
Would it help to split off sections of the site into their own jobs?
-
OrIdow6
I don't know that I'd have time to write a script or something like that - once I get something like a schedule established again my first thing here is fastSWF
-
OrIdow6
Also, here's an idea: split the forums into separate jobs, there are 10 jobs and each only gets topic IDs with a certain last digit, by ignoring everything else
-
OrIdow6
So for fastSWF, I am thinking of writing a script for wget-lua, but maybe trying to run it myself (or having someone else run it, if there are infrastructure etc. concerns) since the site isn't that big
-
OrIdow6
And if not making it a warrior project
-
JAA
OrIdow6: We could link it to
wiki.archiveteam.org/index.php/DPoS and make that page even clearer if needed, I suppose.
-
SketchTheCow
Ha ha HA
-
SketchTheCow
OK, so, power event at IA yesterday, so FOS was not pushin' the data. It is as of an hour ago.
-
SketchTheCow
Disaster should be avoided
-
JAA
:-) Thanks
-
Ryz
OrIdow6, not too sure since the forum and the other subdomains are pretty slow at times~
-
sazki
Hello! I've downloaded a Chinese dialectology forum that closed registration and logins earlier this year, based on its sitemap. What steps can I take now? I know that one potential action is to scan the dump for links not in the sitemap, like PDF attachments. Can it be integrated into the Wayback Machine?
-
OrIdow6
sa
-
OrIdow6
Whoops
-
OrIdow6
Yeah, that'd work JAA
-
JAA
Alright, will add that later.
-
Ryz
Hello sazki, are the forums still up or have they been shut down?
-
TheTechRobo
sazki: Depends on how it was grabbed: was it grabbed into WARC format? If so, yes it likely can be injested into the WBM
-
OrIdow6
Thanks JAA
-
sazki
ispeakmin.com, WARC via Wget with --page-requisites and not --mirror
-
OrIdow6
TheTechRobo: Even if these are warcs, I do not think that outsider warcs are (any longer?) usually put into the Wayback Machine
-
sazki
still up but the closure of logins is not promising
-
TheTechRobo
OrIdow6: really? Did not know that
-
sazki
ic
-
OrIdow6
If the site is still up, ArchiveTeam may be able to make a capture itself
-
TheTechRobo
OrIdow6: should I just stop uploading them then? I don't want to waste my bandwidth, and nobody is going to download WARCs directly
-
OrIdow6
sazki: When you say the site "closed registration and logins", does that mean it's basically frozen?
-
sazki
OrIdow6: yes; 公告:自 2021 年起,海墘閩語論壇不再提供註冊、登錄、回覆等功能,僅以只讀模式呈現。海墘閩語論壇 QQ 群:22827077 == Announcement: From 2021, The Coastal Min Forums will no longer provide registration, login, or reply functionality, and will be read-only. Coastal Min Forum QQ group: 22827077
-
JAA
Yeah, let's run this through AB.
-
OrIdow6
Up to you TheTechRobo
-
TheTechRobo
OrIdow6: I'll probably continue
-
TheTechRobo
That just came as a shock to me