-
fireonlive
kiska: #wuciyuan just started
-
h2ibot
Yts98 edited Current Projects (+12, Move Banciyuan to current. Move ЯRUS to…):
wiki.archiveteam.org/?diff=50114&oldid=50106
-
fireonlive
thanks yts98 :3
-
fireonlive
i had it in a background tab but never hit save lol
-
yts98
fireonlive: lol
-
h2ibot
FireonLive edited Banciyuan (+2, let's goooooooooo):
wiki.archiveteam.org/?diff=50115&oldid=49979
-
h2ibot
JustAnotherArchivist moved Banciyuan to 半次元 (Official name is in Chinese):
wiki.archiveteam.org/?title=%E5%8D%8A%E6%AC%A1%E5%85%83
-
h2ibot
Yts98 edited Current Projects (-12, Adjust the link for 半次元):
wiki.archiveteam.org/?diff=50119&oldid=50114
-
fireonlive
ooh a move :)
-
fireonlive
hahahahahahahahahaha
-
fireonlive
oh
-
fireonlive
on meta threads: their apple app store privacy 'nutrition card':
pbs.twimg.com/media/F0JrcJJaMAEzzfw?format=jpg&name=orig
-
nulldata
Types of Data Collected: Yes
-
fireonlive
at least onlyfans has a competent paywall
-
Barto
you da real mvp fireonlive
-
fireonlive
:D
-
fireonlive
i got very confused; but it's a *different* threads
-
fireonlive
i guess they're reusing the branding
-
JTL
just what I'd expect SV ghouls to do
-
h2ibot
Yts98 edited Skyblog (+169, Add another short URL example):
wiki.archiveteam.org/?diff=50120&oldid=50111
-
Twisty
Starting CheckIP for Item
Failed CheckIP for Item
Traceback (most recent call last):
  File "/usr/local/lib/python3.9/site-packages/seesaw/task.py", line 88, in enqueue
    self.process(item)
  File "<string>", line 122, in process
AssertionError: Your time 1688470941.1319983 is more than 180 seconds off of 1688475727.383.
Waiting 10 seconds...
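
The comparison behind that assertion can be sketched roughly as below. This is only an illustration built from the numbers in the error message, not the project's actual pipeline code; the names `clock_offset`, `check_clock` and `MAX_OFFSET_SECONDS` are made up for the example.

```python
# Hypothetical sketch of a clock-skew check; not the real pipeline code.
MAX_OFFSET_SECONDS = 180  # threshold quoted in the error message


def clock_offset(local_time: float, reference_time: float) -> float:
    """Return the absolute difference between two Unix timestamps, in seconds."""
    return abs(local_time - reference_time)


def check_clock(local_time: float, reference_time: float,
                max_offset: float = MAX_OFFSET_SECONDS) -> float:
    """Raise AssertionError if the local clock is too far from the reference."""
    offset = clock_offset(local_time, reference_time)
    if offset > max_offset:
        raise AssertionError(
            f"Your time {local_time} is more than {max_offset} seconds "
            f"off of {reference_time}."
        )
    return offset


# The two timestamps from the traceback are about 4786 seconds apart.
print(round(clock_offset(1688470941.1319983, 1688475727.383)))
```

With the values from the traceback the gap is roughly 80 minutes, far beyond the 180 second limit, so the check keeps failing until the system clock itself is corrected.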
-
Twisty
I just synced my pc clock but this is still showing up
-
nstrom|m
Is your time zone correct?
-
myself
if you're running a VM, do you have the VM host set to "hardware clock in UTC time"?
-
murb
if you're running something unixish then yeah.
-
holographicleah
Hello everyone, I’m new to IRC so please do forgive me if this is the wrong place or if my etiquette isn’t perfect. I’m in need of some help - www.world.kano.me has an estimated 1 million+ user-generated artworks but no plan around the idea of archiving it. Full transparency, I work at Kano and we are looking to sunset the site and our apps
-
holographicleah
soon. How would you go about this mammoth task? Any and all advice is much appreciated. Thanks - holographicleah
-
threedeeitguy
Hi holographicleah thanks for getting in touch! Please stick around as it may take a moment for someone to get back to you. JAA arkiver ^^
-
arkiver
holographicleah: thank you very much for letting us know!
-
arkiver
holographicleah: when is the deadline for this?
-
arkiver
not sure how well kano will playback in the Wayback Machine
-
arkiver
nice i see it's all relatively easy to archive with the API as well, but again not sure about playback yet
-
arkiver
holographicleah: do you perhaps have a list of all creations on the site?
-
holographicleah
I wish I had a concrete date. Kano World was actually spun off into a 'sister company' and has its own AWS S3 bucket with the creations in. we might transfer them over to our other AWS account somehow which would keep them around for longer, possibly
-
holographicleah
To put it mildly there are some unpaid bills.
-
threedeeitguy
holographicleah how large is the bucket?
-
holographicleah
About 690GB apparently in the bucket threedeeitguy
-
holographicleah
I think it's a case of - we're probably not going to have the site around for much longer (a few months, tops) and we have all of the creations, along with their code, but we want to find a way to make it accessible after the site closes.
-
arkiver
holographicleah: are you planning to stick around on IRC? if not, feel free to contact me at arkiver⊙pc
-
holographicleah
arkiver: I don't know too much about using IRC so I may end up emailing you, thanks!! I'm seriously such a noob, i'm just here thru the web interface haha. I'm also wary of spamming the chat with too much info!!
-
threedeeitguy
Ah, I was too slow! arkiver feels like there may be two parallel approaches here? 1. Standard scrape to go over to IA for the wayback machine 2. dump raw resources (and maybe source code?) so it leaves the door open for this to live on in some more interactive form. I guess 2 depends on how much access they are comfortable handing out/what IP they
-
threedeeitguy
cannot give away.
-
icedice
holographicleah: Don't worry about that, #archiveteam-bs was made for spamming walls of text
-
holographicleah
I'm going to pop back in here in a couple days, hopefully with some more info, after I have a chance to chat more with engineers on the team (and hopefully the CEO) about the open-source future of user-generated content of world.kano.me. What I can say is that it's safe to put it on deathwatch, we just haven't set an official date.
-
h2ibot
JustAnotherArchivist edited Deathwatch (+60, /* 2023 */ Add Kano World Studio):
wiki.archiveteam.org/?diff=50121&oldid=50107
-
JAA
holographicleah: Thanks for reaching out! I added it.
-
holographicleah
JAA: thanks so much!
-
Skylion
Hey so I downloaded a bunch of CDX files generated by the archive team, and one of the columns appears to be a non-standard CDX column. "S" is not in the spec
archive.org/web/researcher/cdx_file_format.php so I am unsure what it's supposed to be. I wonder if the IA backend is throwing an error about it too
-
masterX244
what item?
-
JAA
-
nicolas17
and both are crap
-
nicolas17
what is a "canonized URL"?
-
JAA
Yeah, neither is detailed, but at least that one lists all the fields actually in use, unlike IA's.
-
fireonlive
link rel=canonical?
-
JAA
No
-
fireonlive
oh
-
JAA
It's a mangled URL after it was run through surt.
-
fireonlive
that’s good it's not the rel lol
-
JAA
Stripping protocol, auth, leading www, and port, lowercasing everything, etc.
-
fireonlive
ahh
-
fireonlive
interesting
-
JAA
That is one way to put it, yes...
-
fireonlive
>_> yeeeaah….
-
fireonlive
see: imgur and lots of other stuff
-
fireonlive
lol
-
JAA
Especially the case collapsing causes issues all the time.
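
A rough sketch of that kind of normalization, to show why case collapsing bites. This is a simplified illustration, not the actual surt library: the function name `simple_surt` and the example URLs are made up, and the real canonicalization has more rules than are shown here.

```python
from urllib.parse import urlsplit


def simple_surt(url: str) -> str:
    """Simplified, hypothetical SURT-like key: drop scheme, auth, port and a
    leading "www.", reverse the host labels, and lowercase everything."""
    parts = urlsplit(url)
    # .hostname already excludes userinfo and port and is lowercased.
    host = parts.hostname or ""
    if host.startswith("www."):
        host = host[len("www."):]
    reversed_host = ",".join(reversed(host.split(".")))
    path = parts.path.lower() or "/"
    key = f"{reversed_host}){path}"
    if parts.query:
        key += "?" + parts.query.lower()
    return key


# Scheme, credentials, "www." and port all disappear from the key.
print(simple_surt("https://user:pw@WWW.Example.com:8080/Some/Path?Q=1"))

# Two URLs that differ only by case collapse to the same key.
print(simple_surt("http://i.imgur.com/AbCdE.jpg") ==
      simple_surt("http://i.imgur.com/abcde.jpg"))
```

On a site with case-sensitive paths, those last two URLs can be different resources, yet they end up under one index key.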
-
fireonlive
yeah :/
-
JAA
Everyone should start using Unicode homoglyphs since those don't get collapsed!!1!
-
fireonlive
😁
-
Skylion
Oh sorry missed it
-
Skylion
Some of the 2017 flickr snapshots have this issue
-
Skylion
Ah nvm, I see. Thanks!
-
Twisty
nstrom|m myself Yes, I have both checked the timezone as well as enabled hardware clock in UTC time by default
-
Barto
-
jamesp
you can still connect via regular IRC
-
JAA
Yeah, very unsurprising, I've seen lots of complaints about how the Matrix bridge operates over there.
-
jamesp
I don't know if there's anything to archive in this case
-
JAA
There isn't.
-
JAA
Anything on the Matrix side is behind a login wall anyway, and IRC is IRC.
-
vokunal|m
What's a good matrix alternative? Unless someone here wants to change careers from IT to plumbing
-
JAA
To be clear, this is about another IRC network (Libera) and does not directly affect us here.
-
vokunal|m
Oh great
-
icedice
<vokunal|m> What's a good matrix alternative? Unless someone here wants to change careers from IT to plumbing
-
icedice
Which part of Matrix?
-
icedice
The federated part, the group chat part, or the end-to-end encrypted part?
-
fireonlive
i'm thinking like the irc part
-
fireonlive
bridge
-
fireonlive
if that's the case 'the lounge' has treated me pretty ok
-
icedice
If you want federation, I think the only chat option other than IRC and Matrix is XMPP / Jabber
-
icedice
If you just want self-hosted Discord and federation does not matter there's Rocket.Chat, Revolt, and Fosscord
-
icedice
There's also Mattermost, but that's a Slack clone
-
fireonlive
mattermost was interesting; but i had problems getting push notifications to work :/
-
fireonlive
maybe that was just something on my end
-
fireonlive
(w/ the iOS app)
-
fireonlive
element's E2EE seems kinda buggy and pisses me off a lot lately?
-
fireonlive
i should really get around to trying fluffychat
-
arkiver
should this be in #archiveteam-ot ?
-
icedice
Probably
-
fireonlive
ah yes
-
upintheairsheep
I apologize for asking again, but may you please archive this MEGA folder (34 GB):
mega.nz/folder/sol2UZoK#oMACjgVHPcAv1hPGLX_PoA
-
fireonlive
mega is very hard to archive
-
upintheairsheep
The data in the folder is irreplaceable. The original creator of the folder quit the community due to his choice to spend time with his new girlfriend, and may stop paying for MEGA bills. In no way is this intended as a form of harassment.
-
upintheairsheep
This is intended to be a manual download of the MEGA folder via MegaBasterd
-
fireonlive
looks like a lot of apple stuff... nicolas17 ?
-
fireonlive
are you 'in' with the mega?
-
upintheairsheep
No I am not affiliated at all internally with the folder
-
icedice
I have an online friend that has Mega Pro
-
icedice
Let me know if Mega bandwidth cucks you and you can't bypass it
-
upintheairsheep
The folder contains many rare Apple apps, which are most likely of interest to many Apple historians
-
icedice
MegaBasterd has a proxy switcher feature
-
icedice
So if you find a list of usable proxies online that might work
-
upintheairsheep
Additionally, MEGA and other cloud services, especially OneDrive, will remove AppleInternal data if they find out
-
newsjunkie
I hope Showbuzzdaily
showbuzzdaily.com/articles/some-unfortunate-news.html is being archived (already listed on Deathwatch). A lot of historical TV ratings data there. Probably already covered by Web Archive, but would probably be good to do a full archive nonetheless.
-
pokechu22
-
upintheairsheep
I’ve heard
unknowntags.netlify.app/internalui also contains working mega links to apps not in the big mega folder
-
newsjunkie
Thanks pokechu2
-
newsjunkie
Thanks pokechu22 (sorry, don't know how to mention people).
-
fireonlive
if you mention the nick(name) it's more than enough :)
-
fireonlive
e.g. fireonlive abcd
-
fireonlive
or fireonlive: you're a fucking twat
-
fireonlive
both work
-
imer
upintheairsheep: grabbing the mega folder data unless someone else has already
-
imer
got the ones from
unknowntags.netlify.app/internalui as well (folder download is still going)
-
imer
upintheairsheep: is this folder from unknowntags.netlify.app too? just thinking of what metadata to put so people can find it if they're looking for it
-
nicolas17
I tried saving the files in my own MEGA account to preserve them, and download them later
-
nicolas17
but something is broken and even trying to save a single 100KB folder is saying "can't complete this action because it would put you above your storage quota"
-
fireonlive
are you out of space :p
-
nicolas17
I'm using 28GB of 50GB, and a 100KB folder "doesn't fit"
-
imer
nicolas17: if you want I can hand over the files to you once it's downloaded, you'd probably do a better job of labeling it properly when uploading and such
-
upintheairsheep
-
nicolas17
okay I figured it out
-
upintheairsheep
I also have some other AppleInternal links that do not belong to Unknown_Tags but are at risk of Apple taking them down (happened many times before)
-
upintheairsheep
-
nicolas17
I copied as much as I could fit in my acct... so I'm missing the 14 biggest folders
-
fireonlive
upintheairsheep: are there any associated youtube channels for this person as well?
-
upintheairsheep
-
upintheairsheep
-
fireonlive
nicolas17 is very apple happy so the more the merrier i’m sure :)
-
icedice
imer: I have a volunteer with a Mega Pro account
-
icedice
Do you want him to rip it?
-
imer
icedice: I should be good on the free download quota part (enough ips to cycle), speed isn't amazing though (9-20 MiB/s) so going to take me a few hours
-
imer
can do though, certainly won't hurt to have two copies :)
-
nicolas17
I'm downloading 28GB of personal data that I had in MEGA and I wanted to move elsewhere but I kept putting it off :P I had to do it anyway, and it will free up space in my acct
-
imer
brb
-
icedice
imer: He's downloading it
-
icedice
Send me a screencap of his Mega download speeds before I sent him the links
-
icedice
He almost hit 78 MiB/s
-
icedice
* sent
-
icedice
So should be done pretty quickly
-
icedice
He's not willing to post copyrighted content publicly though, but he's willing to hand it over privately to us which can then be uploaded to Internet Archive or elsewhere
-
nicolas17
icedice: afaik the "demo apps" in the last mega link here are "copyrighted content"
-
nicolas17
the 37GB folder is worse, they're Apple-internal employee-only apps
-
icedice
You're telling me I'm having my friend download leaked material on an account that can almost certainly be traced back to him via the payment method?
-
nicolas17
ask upintheairsheep :p
-
icedice
jfc
-
nicolas17
otoh I believe this folder has been up for more than a year
-
fireonlive
ooh apple internal apps
-
icedice
In case Internet Archive has to yeet it at some point I do know about a certain Russian VPS provider that openly ignores copyright and accepts Monero
-
jamesp
that's interesting
-
icedice
As for domains, .st and .li from Njalla are pretty bulletproof when it comes to copyright
-
nicolas17
getting 4.3MB/s from mega
-
nicolas17
I'm surprised I didn't hit a daily cap or anything yet
-
icedice
There are some other TLDs as well, but they're harder to register
-
icedice
And would have to be registered via other registrars that don't fill in their own WHOIS info for the user
-
icedice
"51% done
-
icedice
Got all other links as well downloaded besides the 37 GB one"
-
icedice
^ My friend sent me this 9 minutes ago
-
imer
i've got all the mega stuff downloaded I believe