-
h2ibot
-
Nick
Hi?
-
JAA
Bye.
-
schwarzkatz|m
<NickS|m> "Ugh, glad I never tried out Atom..." <- I used atom for a long time, I liked it a lot (still do) but with ceased development, you just can’t compete with vscode when you need to actually code stuff :/
-
schwarzkatz|m
Yes, I mean tianya. Looks like my matrix was a bit wonky, the messages above the one I sent were not there when I sent my original message...
-
schwarzkatz|m
should I gather a list of thread urls from tianya or is this already being worked on?
-
arkiver
schwarzkatz|m: feel free to start gathering
-
schwarzkatz|m
Ok
-
arkiver
Maakuth|m: do you know where those WARCs are on IA?
-
arkiver
of koti.mbnet.fi
-
Maakuth|m
arkiver: I don't know if Orldow6^2 uploaded them, but at least they seem to be in the transfer site by quick looks, linked here:
pad.notkiska.pw/p/mbnet
-
Maakuth|m
It doesn't seem like they mentioned an IA URL in #webroasting
-
arkiver
schwarzkatz|m: but we will also have discovery through a project
-
arkiver
Maakuth|m: ah, is this the stuff for which transfer.archivete.am was used, but it should not be used for that?
-
Maakuth|m
I'm afraid so,
-
arkiver
alright now I know what this is about
-
Maakuth|m
ok. let me know if I can be of help
-
Maakuth|m
it seems that I have a full set of those tars on my machine too if some have gone missing from the transfer site for some reason
-
benjins
-
upintheairsheep
Hello, I got some resources about BuzzVideo archival over here
yt-dlp/yt-dlp #5330
-
upintheairsheep
-
upintheairsheep
Question: was buzzvideo put in archivebot yet?
-
upintheairsheep
The above issue contains a buzzvideo extractor, which could be pulled into the main yt-dlp and then we could tubeup the whole site's videos.
-
arkiver
upintheairsheep: we're not going to tubeup BuzzVideo to IA
-
upintheairsheep
Just asking, why?
-
arkiver
IA is already being spammed with a ton of tubeup stuff
-
upintheairsheep
But you could refer to the extractor to WARC the site's videos
-
upintheairsheep
OK, I understand.
-
arkiver
buzzvideo will be archived into WARCs in likely a warrior project
-
arkiver
you can join #buzzoff for the upcoming buzzvideo project
-
upintheairsheep
It would be nice for someone to merge the extractor, as I have been banned for OCD-induced issue spamming without verbose logs
-
arkiver
archiveteam does not maintain yt-dlp
-
ChrisWsrn
Is there a way to archive a Facebook profile as a logged in user? I have a friend who unexpectedly died about a week ago and I am trying to preserve his legacy. I am friends with him on facebook so I see more than the public profile shows. Is it possible for me to craw his profile as me and then add it to a archive?
-
ChrisWsrn
Ryz told me to ask about this here.
-
thuban
ChrisWsrn: warcprox?
-
ChrisWsrn
I do not have any archiving skills yet. Is there a wiki page I can take a look at on warcprox?
-
schwarzkatz|m
ChrisWsrn: you could try
github.com/gildas-lormeau/SingleFile until someone comes up with a better idea for facebook.
-
schwarzkatz|m
It generates a static html page of the site.
-
thuban
not specifically, but there's a readme here:
github.com/internetarchive/warcprox
-
ChrisWsrn
Thanks.
-
ChrisWsrn
What should I do with these files I collect?
-
thuban
(that said, if you're not comfortable with e.g. the command line, you may not find it user-friendly)
-
ChrisWsrn
Command line i am fine with.
-
upintheairsheep
Alternatively, you could manually save each page or the entire feed as a single .mhtml file and upload it to the internet archive if that's all you need, and use tubeup with authentication for videos.
-
upintheairsheep
Does your friend have any other social media accounts
-
thuban
you could upload them to the internet archive. that won't put them in the wayback machine, but it will make them available for download
-
upintheairsheep
I'm sorry for your loss, I never suffered through the pain at this moment.
-
upintheairsheep
Nobody major died.
-
upintheairsheep
You could also download each image by right clicking them and uploading them manually if that's all you need.
-
ChrisWsrn
Youtube which Ryz put in #down-the-tube. He had a blog which I added manually with
web.archive.org/save and I think was added to be crawled by #archivebot. He has a presence on the doomworld fourms and some other fourms. I am still looking for other things.
-
arkiver
ChrisWsrn: which blog? we might want to run in through #archivebot to ensure a complete copy is made
-
upintheairsheep
#down-the-tube seems to only have metadata and comments, did you archive the videos themselves yet?
-
arkiver
#down-the-tube archives the video
-
ChrisWsrn
-
arkiver
videos
-
arkiver
and the videos will become playable in the Wayback Machine
-
ChrisWsrn
It was just added less than a hour ago
-
arkiver
ah good i see Ryz covered it
-
schwarzkatz|m
arkiver, I just checked the the thread count for bbs.tianya. we are looking at roughly 122,358,356 threads...
-
schwarzkatz|m
221 subforums
-
arkiver
yeah, and roughly an equal number of accounts
-
arkiver
(planning on going after all tianya.cn)
-
schwarzkatz|m
I have not started crawling anything, but I wrote some documentation on what I gathered so far, should I upload that to the transfer?
-
arkiver
yes please!~
-
arkiver
yes please!
-
arkiver
let's make a channel for tianya.cn!
-
arkiver
anyone have ideas for a tianya channel?
-
schwarzkatz|m
byenya
-
thuban
endoftheworldclub
-
thuban
-
thuban
(endoftheendoftheworldclub?)
-
arkiver
yeah literally named that in english
en.wikipedia.org/wiki/Tianya_Club
-
schwarzkatz|m
-
arkiver
looks good
-
arkiver
the additional -1 is interesting
-
thuban
(hm... can we get /inline/ for .md?)
-
schwarzkatz|m
if they exist, the -1 threads are suffixed with a small symbol next to it (cannot ocr the text) and are always the first thread in a subforum.
-
schwarzkatz|m
without the additional -1, they redirect to it though so that's good.
-
schwarzkatz|m
-
qw
?
-
mgrandi
Dragalia lost has ended, might be a good idea to do a final run of the website and a comic which doesn't seem to be linked from the main page
dragalialost.com/sp/en comic.dragalialost.com/dragalialife/en
-
mgrandi
twitter.com/DragaliaLostApp (if there is room in the probably overloaded socialbot space)
-
JAA
thuban: /inline/ doesn't do anything special, it just omits the Content-Disposition header to not force a download (so the browser can display the file directly if supported). I'm guessing you mean rendering the Markdown as HTML.
-
JAA
That'd be a bit more complicated.
-
JAA
atom.io is still horribly slow and throwing 500s all the time. I hope it'll get better soon. Can't really archive the packages at the moment.
-
schwarzkatz|m
just in case, the people over at
pulsar-edit.dev saved all the packages a while ago.
-
JAA
But did they do it as WARC so that installing packages inside Atom will still be possible via the WBM after the shutdown?
-
JAA
:-)
-
JAA
Where can I find more details about what they did?
-
schwarzkatz|m
of course not, but at least it's better than nothing.
-
schwarzkatz|m
most likely on the discord server: 7aEbB9dGRT
-
JAA
:-/
-
michaelblob
*sad J_AA noises*
-
JAA
Yeah, an imperfect mirror is certainly better than nothing. Also, most packages are public GitHub repos, so they could probably still be installed from there as well.
-
schwarzkatz|m
most of the packages are on github, yes. also note that they received a huge of amount of spam after announcing that atom will be deprecated.
-
schwarzkatz|m
...I just checked, it's at 415k now... the amount of actual packages is about 12k. On 2022-08-08 it was 15k.
-
schwarzkatz|m
-
schwarzkatz|m
github.com/confused-Techie/atom-package-collection (used to migrate the packages to the pulsar site)
-
schwarzkatz|m
sorry for not thinking of this earlier, I'm not checking in here enough
-
JAA
Yeah, I saw the spam. It's actually way more than 415k, no idea where that number comes from. The API returns almost 36k pages with 30 packages each, which also matches the number in the footer: '1,078,592 packages & themes'
-
schwarzkatz|m
that's incredible. interesting that they have not preemptively killed the site up until now
-
JAA
Yeah, or made it read-only or whatever.
-
mgrandi
How did they even handle uploading a theme, just anyone can do it with no review process?
-
mgrandi
Or package rather
-
ChrisWsrn
I found Ghastlys twitter (He died last week). What should I do to archive this?
twitter.com/ghastly310
-
ChrisWsrn
I also found his bandcamp account, his imgur account, his reddit account, his twitch account (empty), plus "some accounts" that he would not want archived. What tools should be used to archive these accounts?