-
h2ibot
Usernam edited List of websites excluded from the Wayback Machine (+24, added
epiclper.com which was seen…):
wiki.archiveteam.org/?diff=50900&oldid=50892
-
pabs
(done in AB)
-
pabs
(Twitter/YouTube not done, both not currently archivable)
-
pokechu22
From #archivebot (I've saved the support posts+blogs, but the whole site is probably too big for archivebot and I'm not sure how to deal with the apps):
-
pokechu22
05:51 <Mannie> Also the app for Android and iOS is in danger
-
pokechu22
05:51 <Mannie> Google Podcasts on the App Store - iOS
-
h2ibot
JustAnotherArchivist edited Deathwatch (+280, /* 2024 */ Add Google Podcasts):
wiki.archiveteam.org/?diff=50901&oldid=50899
-
arkiver
sigh :/ Google Podcasts is up next?
-
arkiver
they had this for a year or so?
-
JAA
No deadline yet, sometime in 2024 it seems.
-
arkiver
another one for the Google graveyard
-
JAA
And I assume they just index and proxy/cache the audio from their original servers? Not sure.
-
JAA
Most reasonably popular podcasts are available on numerous platforms.
-
flashfire42|m
I use iHeartRadio
-
nighthnh099_
is there a way to save a url (warc) with a token being in local storage?
-
nighthnh099_
I don't have the time to try at the moment which is why I'm asking here
-
nighthnh099_
6th.anniv.magireco.com/settoken this page sets a token in local storage, the token is really long so I won't post it here for now
-
JAA
localStorage is irrelevant to HTTP; it's never transmitted, only accessible via JavaScript.
-
nighthnh099_
wait, so saving the page and manually manipulating localStorage is enough to make the page work?
-
JAA
So you just have to fetch all the relevant HTTP responses the regular way.
-
nighthnh099_
oh okay, thanks
-
JAA
It's possible that some have nasty URLs or cookies are involved, of course.
-
nighthnh099_
it's already been saved, also no, no cookies
-
nighthnh099_
okay so I just checked, it sends a request to a website using that auth token
-
nighthnh099_
don't know why network activity didn't catch that
-
nighthnh099_
definitely need help archiving that specific request then
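
If that one request can be identified, it could in principle be replayed by a script through a recording proxy rather than through a browser. A minimal sketch, assuming the token is sent as a bearer `Authorization` header (the chat does not say how it is sent) and that a recording proxy is listening on `localhost:8000`; the URL, token, and function names are hypothetical placeholders:

```python
import urllib.request

def build_request(url, token):
    # Assumption: the site expects the localStorage token as a bearer header.
    # The real header name has to be read from the browser's network tools.
    req = urllib.request.Request(url)
    req.add_header("Authorization", f"Bearer {token}")
    return req

def build_proxy_opener(proxy="http://localhost:8000"):
    # Route both HTTP and HTTPS traffic through the recording proxy.
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    return urllib.request.build_opener(handler)

# Usage (not executed here; needs a running proxy and a real URL and token):
#   opener = build_proxy_opener()
#   with opener.open(build_request("https://example.invalid/api", "TOKEN")) as resp:
#       body = resp.read()
```

For HTTPS the proxy's CA certificate would also have to be trusted by the script, which this sketch leaves out.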
-
JAA
Maybe you can just use warcprox and a browser?
-
JAA
Might not be worth the reverse-engineering effort if it's just a single page.
-
nighthnh099_
I'm busy so I'm not sure when I can do it
-
nighthnh099_
also I'm not sure how to use warcprox either
-
TheTechRobo
It's not too difficult - it just involves running warcprox and making it the proxy used by your browser
-
TheTechRobo
And accepting its certificate
-
JAA
You'd probably want to use a separate browser profile to avoid leaking personal things into the WARC.
-
TheTechRobo
That too
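
The setup described above (warcprox as the browser's proxy, plus a separate browser profile) can be sketched as two command lines. This is only an illustration: the port, WARC directory, filename prefix, profile path, the `build_commands` helper, and the choice of Chromium are all assumptions, not anything specified in the chat. The warcprox CA certificate still has to be accepted in that profile by hand.

```python
import shlex

def build_commands(port=8000, warc_dir="warcs", prefix="capture",
                   profile_dir="/tmp/warc-profile"):
    # warcprox: listen on `port`, write gzipped WARCs named `prefix`-... into `warc_dir`.
    warcprox_cmd = ["warcprox", "-p", str(port), "-d", warc_dir, "-n", prefix, "-z"]
    # Chromium with a throwaway profile, so no personal cookies or history leak
    # into the WARC, and with all traffic sent through the proxy.
    browser_cmd = [
        "chromium",
        f"--proxy-server=http://localhost:{port}",
        f"--user-data-dir={profile_dir}",
    ]
    return warcprox_cmd, browser_cmd

if __name__ == "__main__":
    # Print the commands to run in two separate terminals.
    for cmd in build_commands():
        print(shlex.join(cmd))
```

Run the first command, then the second, and browse to the page; the requests the page makes should end up in the WARC files in the chosen directory.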
-
nighthnh099_
TheTechRobo: thanks, I'll try that when I'm not busy
-
JAA
First pass of FOIAonline will finish in a couple hours it looks like. There are also a few thousand errors to retry/figure out what's wrong with them.
-
JAA
First pass of FOIAonline is done, looking into the ~2k errors etc. in a bit.
-
DogsRNice
discord cdn links might be at risk
-
DogsRNice
probably from people hosting tons of assets for websites on discord