-
Rootliam
nicolas17: What yv archives have the mixed up friendster data?
-
nicolas17
YV-003100001-003199999.tar YV-003200000-003213887.tar YV-003900000-003999999.tar YV-009900000-009999999.tar
-
fireonlive
those are pure friendster eh?
-
nicolas17
I think it's just those 4
-
Rootliam
Ok thanks
-
nicolas17
the file listings are disproportionately larger than the rest
-
nicolas17
since the files inside are small (no large .flv's)
-
mikolaj|m
Is there a CLI tool that takes a single CSS selector and just crawls into urls extracted with it while recursively outputting the crawled pages to stdout? Possibly with multiple connections at once
-
mikolaj|m
Would be convenient for writing quick and dirty crawlers and scrapers
-
h2ibot
PaulWise edited ArchiveBot (+245, better info for folks w/o AB perms):
wiki.archiveteam.org/?diff=50876&oldid=50251
-
pabs
with AB, is there no way to extract links from 404 pages?
-
pokechu22
There isn't to my understanding
-
JAA
Nope
-
JAA
There's this related ancient open issue:
ArchiveTeam/wpull #202
-
arkiver
nicolas17: i'll get those moved to the friendster collection
-
arkiver
won't change the identifiers though - they may be referenced in various places
-
fireonlive
thanks arkiver
-
pabs
this is what I was seeing
ArchiveTeam/wpull #444
-
pabs
something like that, there was no sitemap though
-
DogsRNice_
has there ever been an attempt to archive the steam workshop?
-
JAA
Not in the past few years at least, as far as I'm aware. I'd like to see it happen though.
-
DogsRNice_
would also be cool to get the steam community
-
DogsRNice_
though theres no pressing need for that
-
pabs
is it just me or does subdomain enumeration no longer work in bing?
-
masterx244|m
-
kpcyrd
pep.foundation is bankrupt apparently
-
pabs
link kpcyrd ?
-
kpcyrd
pabs: oftc/#sequoia
-
kpcyrd
not sure if the channel has public logs
-
murb
ah the folks who were reimplementing pgp?
-
pabs
hmm, their gitea died already?
-
pabs
-
flashfire42|m
Get archiving then
-
pabs
pep.security also gone
-
pabs
did archiving on what remains
-
murb
pabs:
pep.community is live now.
-
murb
-
pabs
and gitea too, weird
-
pabs
all the other domains too, I'll start again I guess :)
-
murb
also might be worth joining #pEp and asking.
-
pabs
from hackint #pEp <andy> Where did you read that? It is unfortunately true though.
-
kpcyrd
tech seems to be going downhill lately
-
nulldata
-
eggdrop
-
cookie
Hi, I'm trying to archive the whole of an older recipe site (
homemadebymary.blogspot.com), but I can't use the !archive command myself
-
pabs
kpcyrd, murb: more details from #pEp
paste.debian.net/hidden/6e9a8956
-
cookie
Thank you
-
kpcyrd
pabs: thanks for looking into it
-
murb
pabs: ta for the link.
-
JAA
masterx244|m: AB job for it is running.
-
nicolas17
ETA 42 hours to get the last yahoovideos file listing ugh
-
nicolas17
43*
-
fireonlive
-
erkinalp
any updates on ia uploads?
-
JAA
Not really, still limited.
-
erkinalp
btw last four days for wowturkey could revive in its original hosting provider - its hosting expires on the 23th
-
erkinalp
JAA: same in the downloads sadly
-
rktk
This is kind of annoying but it seems ANY BlueSky post needs a fking account to see it
-
rktk
bsky.app I mean
-
rktk
i don't know if blueskyapp.xyz is the same
-
rktk
seems like archiving it will require an account...
-
fireonlive
there was a way to see one post w/o context/replies/etc but I seem to have lost the URL
-
fireonlive
(also without embedded media IIRC)
-
fireonlive
ah no, embedded media works
-
fireonlive
-
fireonlive
(credit to JAA)
-
rktk
yeah embedded media works
-
rktk
psky.app is another url I've observed that people have on shared links
-
fireonlive
ahh ok
-
rktk
skeeet lmao
-
fireonlive
xD
-
fireonlive
i do remember something about staff asking their users to please not call them skeets
-
nicolas17
why that name
-
rktk
well that's a workable archival solution
-
nicolas17
SKy + tweet?
-
rktk
skeeet
-
rktk
.xyz that link
-
rktk
works to archive single links at least which is normally what I do for twitter... although snscrape has been busted for a while now (thought they claim that nitter works, i can't get it going)
-
nicolas17
I meant in response to fireonlive, why were users calling them skeets
-
rktk
Bibliogram is kind of starting to fall apart as well btw, it's unmaintained but still mildly usable
-
fireonlive
nicolas17: looks like it's a portmanteau of 'sky' and 'tweet'
-
fireonlive
(and it's also funny, because cum)
-
rktk
:thinking: where is cum
-
fireonlive
skeet: to ejaculate, cum, get there.
-
fireonlive
(or skeets)
-
rktk
uhhh
-
rktk
wow
-
rktk
TIL lmao!
-
fireonlive
indeed!
-
fireonlive
x3
-
thuban
there are a few nitter workarounds, but between the api being a moving target right now and public instances that do get something up being hammered, things aren't super stable
-
thuban
if they shake out, might be worth running an archiveteam nitter just to point archivebot at
-
h2ibot
Ljcool2006 edited YouTube (+34, /* Music Comment Removal */):
wiki.archiveteam.org/?diff=50877&oldid=50615
-
h2ibot
Adrmcr edited URLTeam (+216, note about capital letters for the other…):
wiki.archiveteam.org/?diff=50878&oldid=50719
-
rktk
and soon X will be a pay-to-use service!
-
rktk
-
fireonlive
rktk: download your favourite cat pictures now!
-
icedice
<rktk> and soon X will be a pay-to-use service!
-
icedice
About time someone puts it out of its misery
-
fireonlive
I see no other future than active users falling off a cliff
-
icedice
That's phase 1
-
icedice
Phase 2 it turns into Parler 2.0
-
icedice
Assuming it survives long enough to reach that phase
-
fireonlive
ah yes :(
-
fireonlive
better back up my favourite creators while i still can