-
nicolas17
looks like uploading it to IA will be painful
-
nicolas17
getting 7.58s/MiB on the other two
-
fuzzy8021
nicolas17 if i have anything pending, please release it as i dont have any files pending upload
-
nicolas17
as of yesterday there was nothing pending, today I added 3 files, 2 were already returned and the third is 4GB so it wouldn't surprise me if it's still downloading (or failed)
-
nstrom|m
it's still going, has ~2hrs left estimated.
-
nicolas17
hah
-
nicolas17
stupid samsung
-
nstrom|m
it's probably the korean govt's rules that make peering w korean ISPs super expensive so nobody wants to
-
nicolas17
yeah
-
nstrom|m
though hard to say since all that stuff is behind incapsula. but I'd guess it's incapsula nodes talking slowly back to korea
-
nicolas17
and that could also be why samsung added captchas and shit... paying korean prices for my scraping
-
fireonlive
-
fireonlive
hm
-
arkiver
the queuing bot is back up!
-
arkiver
(yes i saw various alternatives mentioned for a name :P )
-
fireonlive
:P
-
immibis
cue the queue
-
arkiver
perhaps we should organize a vote some time soon on a name :P
-
fireonlive
🤔 modify eggdrop votingbot
-
fireonlive
s/modify //
-
h2ibot
Rexma created Deviantart (+24, redirect all lowercase):
wiki.archiveteam.org/?title=Deviantart
-
imer
nicolas17: hah. got a response this morning (7:48 utc) saying "We are pleased to provide you the source code of SM-A107F. You can download the source code from the site below:"
-
imer
lets see if its fixed
-
imer
-
imer
guessing the first one (higher id) is the new one
-
imer
nope, thats broken, trying the other one
-
that_lurker
Could someone do an update grab of
witha.name/data
-
that_lurker
Got a few days worth of new data there
-
that_lurker
small json and csv files
-
pabs
that_lurker: done
-
that_lurker
Thanks <3
-
imer
(dm'd nicolas17 the working zip)
-
oddline
forgive me if you're already aware (I don't see any mention on the wiki), but despite the frontend being gone, a good part of ustream's API (*.ums.ustream.tv) appears to still be online and serving up somewhat sensible responses
-
oddline
I'm trying to unpick the steps required to ultimately get it to give me an HLS stream - hopefully I can either get a definite failure of some kind (e.g. "not found", or an m3u8 that points to nonexistent files), or hopefully a _working_ stream
-
oddline
if I ever get to the latter I'll be sure to let you know, because that would be well worth grabbing, I think
-
pabs
got an example API URL?
-
imer
also if you can provide a summary of what happened vs the current
wiki.archiveteam.org/index.php/Ustream someone can update it
-
oddline
-
oddline
that first tries to grab
r50658865-1-61443262-recorded-lp-li…media=61443262&application=recorded, which as of now is still online and returns a connection id and a reference to a _different_ server to talk to
-
oddline
(successive requests will give different server addresses and connection IDs
-
oddline
)
-
oddline
I think, from looking at what URLs wayback has saved, that if you follow whatever process the client follows, you'll eventually receive a HLS m3u8 like this one (archived, for a different video):
-
oddline
-
oddline
just fyi, I'll be busy with other stuff for a while, so don't expect immediate progress on this. very happy for anyone else to look into it if interested
-
oddline
-
nimaje
hm, "An author lost access to her explicit writing when she shared it with alpha and beta readers. […] This is the usual first warning that a purge is coming."
fandom.ink/@Rozzychan/112161902225538242
-
katia
oh uh
-
c3manu
here are the links from the screenshots posted in the instagram post linked in the fandom.ink mastodon toot (although i can’t find an explicit statement about what changed):
-
c3manu
-
c3manu
-
nimaje
probably they enforce "Do not distribute content that contains sexually explicit material, such as nudity, graphic sex acts, and pornographic material." stricter now
-
imer
So, would've been fine had they not shared it?
-
c3manu
the problem is some *only* share it on google drive (for beta readers etc.) and have no offline copy themselves
-
c3manu
and since it’s probably done highly automated (as it is with youtube demonetisations for example) they will never reach an actual human at Google to sort it out
-
imer
google has actual humans doing support? :p
-
c3manu
i heard the top 5% of youtube earners have a contact person there
-
c3manu
or sth like that
-
h2ibot
Pokechu22 edited Deathwatch (+213, /* 2024 */ community.ingress.com):
wiki.archiveteam.org/?diff=51950&oldid=51923
-
nicolas17
imer: so they left 11148 broken and uploaded a new one? huh
-
imer
apparently
-
imer
at least they fixed it *shrug*
-
icedice
Do we think that will apply to Blogspot as well?
-
icedice
Because there's a bunch of NSFW stuff there
-
icedice
Scanlation groups of all kinds use Blogspot all the time, for example
-
icedice
Might be worth extracting Blogspot domains from
mangaupdates.com/groups.html?perpage=100 and
mangadex.org/groups and archiving them
-
c3manu
icedice: depending on what they mean by "new sites"
-
c3manu
> The program policies below apply to Drive, Docs, Sheets, Slides, Forms, and new Sites.
-
c3manu
but "new sites" to "all sites" is not a huge jump
-
icedice
Sites might also just mean Google Sites
-
fireonlive
i believe it does yeah
-
fireonlive
there was a very long transition that i think recently completed for classic sites?
-
icedice
Still worth being proactive though
-
icedice
Google can't be trusted not to kill 99% of what it touches
-
fireonlive
rip google domains
-
icedice
Maybe we can archive all scanlation group sites, first batch being Blogger and second batch being everything else
-
rewby
I'd like that. I do quite enjoy my obscure manga at times.
-
h2ibot
Pokechu22 edited Deathwatch (+1381, /* 2024 */ other niantic forums):
wiki.archiveteam.org/?diff=51951&oldid=51950
-
h2ibot
Pokechu22 edited Deathwatch (+39, /* 2024 */):
wiki.archiveteam.org/?diff=51952&oldid=51951
-
icedice
vatoto.com/group is also worth scraping. Batoto is the pre-decessor of MangaDex. Batoto sold their domain when they went from a manga reading site to just a forum and group index and rebranded to Vatoto instead.
-
c3manu
fireonlive, icedice: i'm still a little bitter about google news
-
fireonlive
rip google reader too
-
icedice
I miss "unlimited" Google Drives
-
icedice
And semi-unrelated, I miss LeapDroid
-
icedice
It was a great Android emulator
-
icedice
Then the devs got hired by Google, stopped developing it, and shut down the site
-
c3manu
ah, that's what i meant. google news still exists ofc
-
c3manu
but having to ignore google plus links in some old weblogs still makes me chuckle on the other hand :)
-
icedice
Have a guy who doesn't like social media in charge of a social media platform, what could go wrong?
-
fireonlive
rip the unlimited google drive i totally didn't have or use
-
icedice
The community I'm in had 5 PB on there
-
fireonlive
that's a lot of gay porn
-
icedice
Gotta have it in 4K so we can see all the veins
-
steering
Thu⊙05 < imer> google has actual humans doing support? :p
-
steering
as a paying customer... no, their support can't possibly be real humans
-
rewby
There's a few places you can get a hold of humans. But even then it's rare
-
steering
rewby: they claim they are real humans, but I don't believe it ;)
-
rewby
A real human with a sufficiently restrictive script they have to follow is basically a machine anyway
-
rewby
Had an interesting convo with some of their network ops
-
nicolas17
omg
-
nicolas17
support reps with a sufficiently restrictive script are doing the Chinese room thought experiment irl
-
rewby
Have you ever seen conditions in callcenters?
-
nicolas17
"Searle asserts that there is no essential difference between the roles of the computer and himself in the experiment. Each simply follows a program, step-by-step, producing behavior that is then interpreted by the user as demonstrating intelligent conversation."
-
steering
Ugh, don't make me think back to my call center days
-
steering
It took Google Store's support a week to figure out how to send me an email with a link to fulfill a promotion...
-
steering
(which should have been sent automatically, of course)
-
rewby
The only time I've ever gotten a response out of google that didn't feel like a template was when I was going back and forth with NOC trying to fix a peering issue
-
steering
or that time I had to cold-email a Director at google to get them to honor their warranty
-
rewby
I do need to email them again actually
-
nyany
Can't be any worse than OVH's "the reason you were billed is because you received an invoice" email I got a few years ago when I was asking about a prorated refund
-
steering
Sounds about on par.
-
immibis
Reddit banned one of my accounts for the reason "your account has been banned"
-
immibis
I appealed, and they reversed it, strangely.
-
imer
OVH sent me an email telling me to redo my payment method since they were switching providers and it was broken in firefox, "You don't need to redo your payment method, I never told you to"
-
imer
alrighty, guess you don't need my free bug reports then
-
nyany
OVH changes their payment provider more than they change their server configs
-
imer
don't get me started on the payment garbage they've done >_>
-
nyany
They started using Adyen recently
-
imer
"we'll just stop sending you reminder emails for part of a product and not include it in the normal renew anymore so it runs out. silently"
-
imer
10/10
-
nyany
story of my live
-
imer
thankfully i wasnt away for a few weeks and was able to renew when I noticed my ips were gone
-
fireonlive
-_-
-
Nulo|m
has someone requested a grab of cocinerosargentinos.com already? cooking tv show that ended
-
archivst
Hi, could anyone here help me archive some specific subreddits? There are few I am concerned may be removed in the near future.
-
rewby
Hi! If you can detail what subreddits you are looking for and how much of them, someone can probably help!
-
archivst
Controversial political subreddits related to Ukraine war. Reddit recently killed a subreddit with hererodox views on the Ukraine war.
-
JAA
There isn't really a way to do that fully because Reddit restricts how much history of a subreddit you can see (1k submissions for each 'view'). Also, they're banning aggressively.
-
archivst
I want archive these subreddits: TrueAnon, StupIdPol, Dongistan, EuropeanSocialists, EndlessWar, InternationalWar, Palestine, BDS, AskARussian.
-
JAA
We've been archiving all of Reddit continuously for a couple years now (#shreddit), but it's been paused for a while due to the bans.
-
archivst
InternationalNews*
-
archivst
> while due to the bans
-
archivst
what kinds of bans?
-
nicolas17
IP bans I think
-
nicolas17
they seem to detect us based on TLS fingerprinting?
-
JAA
Something like that.
-
nicolas17
yes I'm oversimplifying
-
JAA
Not account bans, anyway, if that's what you're thinking.