-
h2ibot
Exorcism edited Bugzilla (-49, /* Status */):
wiki.archiveteam.org/?diff=52870&oldid=52869
-
yzqzss
I'm trying to list a urls.txt of all blog posts on cnblogs.com
-
h2ibot
Exorcism edited Bugzilla (+46, /* Status */):
wiki.archiveteam.org/?diff=52871&oldid=52870
-
h2ibot
Exorcism edited Bugzilla (+29, /* Status */):
wiki.archiveteam.org/?diff=52872&oldid=52871
-
h2ibot
Exorcism edited Bugzilla (+23, /* Status */):
wiki.archiveteam.org/?diff=52873&oldid=52872
-
h2ibot
Exorcism edited Bugzilla (+22, /* Status */):
wiki.archiveteam.org/?diff=52874&oldid=52873
-
h2ibot
Exorcism edited Bugzilla (+36, /* Status */):
wiki.archiveteam.org/?diff=52875&oldid=52874
-
h2ibot
Exorcism edited Bugzilla (+34, /* Status */):
wiki.archiveteam.org/?diff=52876&oldid=52875
-
h2ibot
Exorcism edited Bugzilla (+32, /* Status */):
wiki.archiveteam.org/?diff=52877&oldid=52876
-
yzqzss
Bugzilla++
-
eggdrop
[karma] 'Bugzilla' now has 1 karma!
-
h2ibot
Exorcism edited Bugzilla (+79, /* Status */):
wiki.archiveteam.org/?diff=52878&oldid=52877
-
h2ibot
Exorcism edited Bugzilla (+47, /* Status */):
wiki.archiveteam.org/?diff=52879&oldid=52878
-
h2ibot
Exorcism edited Bugzilla (+47, /* Status */):
wiki.archiveteam.org/?diff=52880&oldid=52879
-
h2ibot
Exorcism edited Bugzilla (+26, /* Status */):
wiki.archiveteam.org/?diff=52881&oldid=52880
-
h2ibot
Exorcism edited Bugzilla (+33, /* Status */):
wiki.archiveteam.org/?diff=52882&oldid=52881
-
h2ibot
Exorcism edited Bugzilla (+48, /* Status */):
wiki.archiveteam.org/?diff=52883&oldid=52882
-
h2ibot
Exorcism edited Bugzilla (+24, /* Status */):
wiki.archiveteam.org/?diff=52884&oldid=52883
-
h2ibot
Exorcism edited Bugzilla (+35, /* Status */):
wiki.archiveteam.org/?diff=52885&oldid=52884
-
nyany
Can we get a "thanks, we know about"... whatever the heck that Japanese thing is
-
yarrow2
kpcyrd makes a good point in #archiveteam. If the Nazi propaganda YouTube videos are archived, maybe whoever uploads them should request the Internet Archive restrict access so that the IA doesn’t become NaziTube for Germans looking to get around the government ban.
-
yarrow2
I’m referring to this: <c3manu> in germany a right wing media outlet has been banned today; their websites are already timing out:
apnews.com/article/germany-far-righ…ed-c284d76eb1a83f7c651299606d31337a
-
c3manu
i think they usually go through stuff like that anyways, even if not immediately. i know other videos i've uploaded that are still publicly available elsewhere have been restricted ("may contain harmful content") and put into the de-emphasized Fringe collection
-
c3manu
yzqzss: what for, if i may ask?
-
h2ibot
Exorcism edited Bugzilla (+32, /* Status */):
wiki.archiveteam.org/?diff=52886&oldid=52885
-
arkiver
18 million or so blog posts on cnblogs.com
-
arkiver
any more info other than
cnblogs.com/cmt/p/18302049 ?
-
arkiver
so big chance of going away
-
c3manu
just asking because there's already a job running (1lbcky9haf2j84w3j3vyb0lv3)
-
arkiver
c3manu: is AB enough?
-
c3manu
in what sense?
-
c3manu
(answer is probably "i've got no idea" either way)
-
h2ibot
Exorcism edited Mailman/2 (+37, /* Status */):
wiki.archiveteam.org/?diff=52887&oldid=52740
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52889&oldid=52886
-
asie
About hp.vector.co.jp...
-
asie
web.archive.org/web/20161012170825/…or.co.jp/vpack/author/listpage.html is the latest version of the public homepage list, before they took it down
-
asie
(but I'd hazard a guess that no/few new ones were made since then)
-
asie
I don't know if it's a complete list, but the sum of all the hp.vector.co.jp links on that page is a good starting seed for archiving hp.vector.co.jp
-
asie
I can try and dump them into a more machine-readable format, if that'd help
-
asie
(since all the authors got shutdown notification emails today, I expect some of them will replace their sites with redirects sooner rather than later)
-
asie
Other than that, all the URLs follow the format
hp.vector.co.jp/authors/VAnnnnnn (where each n is a digit 0-9), so checking for the existence of any unlisted pages could be done too, at a slower pace
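-
(editor's note)
The URL pattern described above could be enumerated roughly as follows. This is only a sketch: the `https://` scheme, the `BASE` constant, and the `candidate_urls` helper are assumptions for illustration, not something anyone in the channel posted, and it only builds the candidate list without fetching anything.

```python
from typing import Iterator

# Assumption: the scheme is not given in the chat; only the host and path are.
BASE = "https://hp.vector.co.jp/authors/"


def candidate_urls(start: int = 0, stop: int = 1_000_000) -> Iterator[str]:
    """Yield author page URLs of the form VAnnnnnn (six digits, zero-padded)."""
    for n in range(start, stop):
        yield f"{BASE}VA{n:06d}"


if __name__ == "__main__":
    # Print the first few candidates; a real crawl would check these slowly.
    for url in candidate_urls(0, 3):
        print(url)
```

The full range is one million candidates, so any existence check over it would need to be rate-limited, as suggested above.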
-
asie
It's essentially Geocities for Japanese hobbyist software developers, and the pages have a small size limit, so I think AB is sufficient (just a fairly large job due to the sheer number of pages)
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52890&oldid=52889
-
h2ibot
Exorcism edited Bugzilla (-49, /* Status */):
wiki.archiveteam.org/?diff=52891&oldid=52890
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52892&oldid=52891
-
thuban
asie: i think that would be helpful, yeah. i'm brute-forcing authors but it's pretty slow on their end
-
h2ibot
Bzc6p edited Kepfeltoltes.eu (+873, /* Site reconaissance */ list URL types of images):
wiki.archiveteam.org/?diff=52893&oldid=49819
-
h2ibot
Bzc6p edited Kepfeltoltes.eu (+108, /* Archiving */ update with 2023 data):
wiki.archiveteam.org/?diff=52894&oldid=52893
-
c3manu
asie: oh sweet, nice they had a list at all :)
-
Webuser864
Hi, on 2024/12/20, a free web hosting service called hp.vector.co.jp will be shut down. This service is operated by vector.co.jp, a Japanese software distribution service, and is mainly used by old software authors for their websites. hp.vector contains a lot of information about old software, and its disappearance reminds me of Geocities.
-
Webuser864
I couldn't find an official announcement. Vector seems to have emailed the closure notice only to site registrants. Here is a link to a screenshot of the email posted on Twitter.
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52895&oldid=52892
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52896&oldid=52895
-
lennart
Webuser864: this shutdown was a topic previously in this channel, so people should be looking into it
-
yzqzss
HELP us list all article URLs of cnblogs.com:
git.saveweb.org/saveweb/cnblogs
-
Webuser864
lennart: I appreciate you letting me know!
-
lennart
glad to help
-
yzqzss
thank you :)
-
yzqzss
<arkiver> "18 million or so blog posts on..." <- We expect it to be less than 18 million
-
yzqzss
A few years ago they were fined and asked to implement stricter censorship. They hid a lot of posts at that time. Some of them are still hidden now (we don't know what percentage yet)
-
fireonlive
ah :(
-
fireonlive
i assume they didn't make a mistake and we can still see them somehow?
-
yzqzss
<fireonlive> "i assume they didn't make a..." <- I haven't dug into it
-
fireonlive
ah ok
-
yzqzss
git.saveweb.org/saveweb/cnblogs uploading a docker image now, it's safe to run :)
-
yzqzss
uploaded
-
fireonlive
yzqzss: started a container :)
-
yzqzss
docker++
-
eggdrop
[karma] 'docker' now has 1 karma!
-
DigitalDragons
docker++
-
eggdrop
[karma] 'docker' now has 2 karma!
-
fireonlive
docker++
-
eggdrop
[karma] 'docker' now has 3 karma!
-
fireonlive
oop, it crashed; but it came back
-
h2ibot
Exorcism edited Bugzilla (+36, /* Status */):
wiki.archiveteam.org/?diff=52897&oldid=52896
-
thuban
asie: thanks! my brute-force output looks ~identical to that list so far; we'll see whether there's more at the 'end'
-
thuban
know anything about those two differently-formatted slugs?
-
asie
no
-
asie
I've stumbled on hp.vector.co.jp occasionally, but I've never been like, an avid user
-
TheTechRobo
also just saw fireonlive's panic
-
phaeton
I can't get it to run for much more than 2 minutes without seeing a panic
-
yzqzss
<fireonlive> "oop, it crashed; but it came..." <- "EnsureHomepageOK failed for poweredby" These failed tasks will requeue, don't worry
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52898&oldid=52897
-
fireonlive
sounds good :)
-
yzqzss
released a fix (v0.2.1)
-
fireonlive
containrrr/watchtower -Rv gogo
-
fireonlive
hmm needs a new docker image maybe
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52899&oldid=52898
-
yzqzss
builds are triggered hourly :)
-
yzqzss
now at 200 posts/s, ETA: <20h
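-
(editor's note)
As a rough sanity check on the rate and ETA quoted above, the arithmetic can be sketched like this. It assumes the rate stays constant and that the ETA counts posts; `max_items_remaining` is a hypothetical helper, not part of the saveweb tool.

```python
def max_items_remaining(rate_per_s: int, eta_hours: int) -> int:
    """Upper bound on items left, given a steady rate and an ETA ceiling."""
    seconds = eta_hours * 3600
    return rate_per_s * seconds


if __name__ == "__main__":
    # 200 posts/s for 20 hours
    print(max_items_remaining(200, 20))
```

Under those assumptions, an ETA below 20 hours at 200 posts/s corresponds to fewer than 14.4 million posts remaining, which fits the earlier expectation of fewer than 18 million posts in total.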
-
fireonlive
ah :)
-
DigitalDragons
woo!
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52900&oldid=52899
-
DigitalDragons
yzqzss++
-
eggdrop
[karma] 'yzqzss' now has 6 karma!
-
fireonlive
yzqzss++
-
eggdrop
[karma] 'yzqzss' now has 7 karma!
-
Exorcism
yzqzss++
-
eggdrop
[karma] 'yzqzss' now has 8 karma!
-
h2ibot
Exorcism edited Bugzilla (-49, /* Status */):
wiki.archiveteam.org/?diff=52901&oldid=52900
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52902&oldid=52901
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52903&oldid=52902
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52904&oldid=52903
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */ aborted):
wiki.archiveteam.org/?diff=52905&oldid=52904