-
h2ibot
FireonLive edited Archiveteam:IRC (+8, comment out "down for several years" kiska's…):
wiki.archiveteam.org/?diff=50557&oldid=49678
-
fireonlive
.....
-
h2ibot
FireonLive edited Archiveteam:IRC (-1, lmao):
wiki.archiveteam.org/?diff=50558&oldid=50557
-
pabs
should the YouTube wiki page be updated to mention yt-dlp instead of youtube-dl?
wiki.archiveteam.org/index.php/YouTube
-
h2ibot
FireonLive edited Archiveteam:IRC (+425, topic guidelines scream test :p):
wiki.archiveteam.org/?diff=50559&oldid=50558
-
fireonlive
i'd say yes to yt-dlp
-
h2ibot
-
upintheairsheep
If you haven't noticed, Google is starting an "inactive account policy" which deletes accounts automatically when a user is inactive, but THANKFULLY YouTube accounts are spared. Account deletions will begin new years eve of 2023.
cnbc.com/2023/08/19/google-faces-cr…lan-to-purge-inactive-accounts.html
-
upintheairsheep
arstechnica.com/google/2023/05/goog…wont-delete-years-of-youtube-videos (probably applies to accounts with youtube videos as a whole, or would mean to create skeleton youtube accounts with no google account assigned to it in the future)
-
h2ibot
PaulWise edited Me at the zoo (-20, yt-dlp):
wiki.archiveteam.org/?diff=50561&oldid=50091
-
h2ibot
PaulWise edited Floraverse (-41, yt-dlp is reliable these days):
wiki.archiveteam.org/?diff=50562&oldid=43115
-
h2ibot
PaulWise edited MS Paint Fan Adventures (+27, yt-dlp):
wiki.archiveteam.org/?diff=50563&oldid=30407
-
fireonlive
-
h2ibot
-
fireonlive
wikiscope is very wide
-
fireonlive
or was
-
h2ibot
-
upintheairsheep
Permanent projects for Google Drive, Google Photos, Google Sites, Google Maps (reviews and street view contributions) and whatever other sites of Google come with a form of public user generated content should be considered.
-
h2ibot
-
h2ibot
-
fireonlive
upintheairsheep: yeah it's like someone woke up at google and was like 'wait how much data are we storing' or something
-
fireonlive
rather unfortunate
-
fireonlive
OrIdow6 was working on reviving #nearlylostmygoogles for google drive i believe
-
h2ibot
-
h2ibot
-
h2ibot
PaulWise edited UC Berkeley Course Captures (-32, yt-dlp):
wiki.archiveteam.org/?diff=50570&oldid=47526
-
h2ibot
PaulWise edited Twitch.tv/Vinesauce (-4, yt-dlp):
wiki.archiveteam.org/?diff=50571&oldid=48684
-
h2ibot
-
h2ibot
PaulWise edited Google Video Warroom (-97, yt-dlp):
wiki.archiveteam.org/?diff=50573&oldid=47665
-
pabs
fireonlive: yeah, lots of youtube downloader references
-
fireonlive
indeed!
-
pabs
fixed them all, except for 1 page that gave me a 403 error when editing, and a few that were historical references to youtube-dl that should stay
-
fireonlive
oh weird you'd get a 403
-
fireonlive
but sweet, thanks :)
-
pabs
-
fireonlive
hm yeah even a no changes edit just pops a 403
-
fireonlive
or edit → show preview/show changes
-
fireonlive
if there's a typo in that i'm going to be so mad
-
Video
<pabs> "was for
wiki.archiveteam..." <- according to that article, 3 TB costed $6,000?!?!?! when the fuck???
-
imer
Video: "Dollar figures shown to illustrate cost of permanent archives. These are not actual values but are meant to represent simplified values and act as a sane budget. Dollars in USD at $2000 per TB estimate (not per TB of disk space alone). "
-
imer
IA has to store stuff *forever*
-
imer
that includes disk cost, servers, power, labour, replacement disks when the first ones fail etc
-
Video
Oh I didn't see that part my bad
-
imer
no worries :)
-
h2ibot
Yts98 edited Zhihu (+122, Clean up formatting):
wiki.archiveteam.org/?diff=50574&oldid=50089
-
LeGoupil
Could someone with archivebot permission save these small websites. Orange ISP hosting is shutting down early september and there are not maintained anymore.
f6bva.pagesperso-orange.fr f6crp.pagesperso-orange.fr f5zv.pagesperso-orange.fr f6fco.pagesperso-orange.fr f1olj.pagesperso-orange.fr f8kfh.pagesperso-orange.fr
-
AntoninDelFabbro|m
No worries, someone said all Orange pages will be archived. I'm waiting the project is ready to start archiving too.
-
erkinalp
wowturkey still going full blast since sunday evening
-
erkinalp
arkiver: if we manage to finish the first round, is there a way to do an incremental archive _over_ the one that's going on now?
-
erkinalp
to get the last week's posts, edits, images and ratings i mean
-
JAA
erkinalp: Incremental archives are hard, and there is no generic tooling for that. So no, not really.
-
JAA
By the way, is there a public source for the shutdown date?
-
erkinalp
JAA: in wowturkey's case, we know the edit window is 24h and object IDs are sequential, which means exclude everything older than one day before the first archival's start; and yes, mods actually published that date
-
JAA
Do you have a link?
-
erkinalp
(in case of this archival, the incremental archive would need to cover anything posted on and after august 19 afternoon, considering the edit window limits)
-
JAA
Yes, but 'every posted on X or later' isn't as easy as it might seem. Sure, we could grab all the post ID links, but we'd also want to refetch some of the topic pages (which may have old topic IDs).
-
JAA
Hence no generic tooling, needs something custom.
-
erkinalp
one of the mods, Uğuray, actually did announce (owning admin wasn't content with him publicly announcing the shutdown and doesn't want to transfer it to any other mod:
webcache.googleusercontent.com/sear…%3D9334812&cd=4&hl=tr&ct=clnk&gl=tr )
-
erkinalp
JAA:in this website, those directory pages are index.phb, ttum.php and viewforum.php
-
erkinalp
and profile page is profile.php
-
erkinalp
*index.php, not index.phb
-
erkinalp
as it's phpbb, we could try a phpbb-specific tool, with the caveat wowturkey.com not exposing some endpoints
-
JAA
I think you misunderstand. We don't have a phpBB-specific tool that can do that. So we'd have to write something. Which, yes, is possible, but someone has to have the time to do it.
-
erkinalp
oh i see
-
erkinalp
in fact i could go backwards and warc-to-phpbb-mysql-db scraper
-
erkinalp
which will indeed take a time to implement though
-
h2ibot
JustAnotherArchivist edited Deathwatch (+324, /* 2023 */ Add wowTURKEY):
wiki.archiveteam.org/?diff=50575&oldid=50553
-
erkinalp
it's actually worse: some of the mods of wowturkey actually tried to convince Burç, the owner, to back from this decision, to no avail
-
fireonlive
youtube.com/watch?v=JVBnJtzEuI0 - "This video is set to unlisted pending an evaluation of its editorial accuracy - LS"
-
fireonlive
it's begun
-
fireonlive
(via DYA discord)
-
fireonlive
also channel has been dumped in #down-the-tube already, just a heads up :)
-
Shinzodragon
I’m looking for a Shinzo fanfic call “Glowing tears Glowing Blood”.
-
pokechu22
-
pokechu22
... oh, those are probably your comments on the items looking for that, which isn't helpful for you :P
-
erkinalp
wowturkey archival still going full blast
-
fireonlive
brrrrrrrrrrr
-
erkinalp
haha
-
erkinalp
i'll miss wowturkey tho
-
erkinalp
it was announcing all those iett bus route changes faster than iett itself
-
fireonlive
:(
-
flashfire42
AntoninDelFabbro|m I have been working slowly on getting the orange stuff done but using archivebot and working on multiple things means unless we get a warrior project I wont get it all
-
flashfire42
also those page perso ones dont actually resolve for me so I cant tell which is alive which isnt
-
flashfire42
-
AntoninDelFabbro|m
flashfire42:
f6bva.pagesperso-orange.fr works for me
-
AntoninDelFabbro|m
flashfire42: Idk, I want to help (non-technical, I just use the warrior in a VM) but… how ?
-
flashfire42
and it works fine in archivebot but strangely not for me on my home connection. its an australian ISP. they arbitrarily choose what to block and what not to block. I have been working on saving the monsite stuff tho if thats of help
-
flashfire42
It would require scripts to be written and we are probably cutting it too fine for another warrior project this month. If you can make a large list of the pagesperso-orange.fr sites tho we could probably do a !a <
-
flashfire42
and yeah I should probably change from my default DNS but I dont like messing with things I dont fully understand
-
AntoninDelFabbro|m
flashfire42: Have you tried configuring quad9.net DNS? (Won't change anything, despite contouring censorship)
-
flashfire42
203.0.178.191 hooray for Australian ISPs I dont honestly know how to change my dns settings and what else that will affect
-
AntoninDelFabbro|m
<flashfire42> "and yeah I should probably..." <- Oh, ok
-
AntoninDelFabbro|m
Well, as you wish. It's not fully easy to understand, but it have less impact than uBlock IMO haha
-
flashfire42
-
AntoninDelFabbro|m
<flashfire42> "It would require scripts to be..." <- I have no idea of how to proceed haha
-
AntoninDelFabbro|m
I have started a small regional list here
antonin.one/pages-perso to warn and help website owners
-
AntoninDelFabbro|m
flashfire42: Nice!
-
AntoninDelFabbro|m
* In reply to @flashfire42:hackint.org
-
AntoninDelFabbro|m
-
AntoninDelFabbro|m
Nice website
-
flashfire42
Not nice XD. it means the website is blocked likely with DNS tampering. and yeah ooni probe is fascinating
-
AntoninDelFabbro|m
Nice website* yes 😅 But not nice, AU gov'. :(
-
thuban
arkiver (and/or other tracker admins): can we unpause telegram long enough to cover the death of yevgeny prigozhin?
cnn.com/europe/live-news/russia-ukr…/h_0d0b1ba296d42856826f83cda491e9fa
-
nicolas17
huh telegram has 0 todo?
-
nstrom|m
Backlog got stashed when we put projects on hold for space reasons
-
nicolas17
ahh
-
pokechu22
flashfire42: FYI, changing your DNS is pretty easy - take a look at
1.1.1.1/dns/#setup-instructions (though you'd want to use different IPs probably). It's also easy to revert; you can just change it back to automatic (or enter the old DNS servers)
-
fireonlive
fwiw, archiveteam uses quad9 which appears to be a swis-based non-profit:
quad9.net/service/service-addresses-and-features#unsec
-
fireonlive
9.9.9.10 and 149.112.112.10
-
fireonlive
(and 2620:fe::10 2620:fe::fe:10)
-
pokechu22
I personally have 8.8.8.8 (google) and 1.1.1.1 (cloudflare) as my primary and alternative DNS... which is probably a silly configuration, but it's caused me less problems than the ISP DNS did
-
fireonlive
two providers is good for redundancy aiui
-
flashfire42
So which ones should I use?
-
AntoninDelFabbro|m
I agree with Quad9.
-
AntoninDelFabbro|m
Cloudflare and Google are not my cup of tea, I love privacy
-
flashfire42
and for DNS over HTTPS?
-
flashfire42
-
fireonlive
that can be on
-
fireonlive
i think auto template is ok?
-
fireonlive
you can visit
on.quad9.net and it'll tell you if you're using it
-
fireonlive
-
fireonlive
but the on.* should be easy enough
-
flashfire42
server8.kiska.pw/uploads/cec6690bc9a2c188/image.png I set it like this but its still saying I am not using it?
-
fireonlive
could be chrome/your PC caching it maybe
-
fireonlive
did you turn on dns/https?
-
flashfire42
yes
-
flashfire42
Ah well will figure it out later
-
fireonlive
:)
-
fireonlive
try the on.* url again in a little bit
-
fireonlive
see if it changes
-
fireonlive
(saves you having to flush every little nook and cranny)
-
fireonlive
for me the 'no answer' was cached from anywhere from 10-20 mins at the lowest level
-
imer
thuban: telegram isn't paused afaik
-
thuban
it's not now, but i was given to understand that it was earlier
-
nicolas17
imer: I'm seeing 1 item/minute
-
nicolas17
looks pretty paused to me :P
-
imer
Think just backlog got stashed, but anything new is still running?
-
JAA
Correct
-
JAA
(As far as I know, anyway.)
-
nicolas17
then there's very little "new"