-
GhostyTongue
Someone hasn't given this a page even though it was archived by archive team (
wiki.archiveteam.org/index.php/Pixie_Hollow_Online)
-
JAA
A lot of smaller projects don't get their own page.
-
GhostyTongue
Well it was from a big project
-
JAA
Also, first time I hear of that.
-
GhostyTongue
Basically it was in a big project and went under everyone's noses
-
JAA
[citation needed]
-
GhostyTongue
It was in the Disney cdn go and dolimg archive project
-
GhostyTongue
Disney CDN Name is [Go] [Dolimg]
-
JAA
Right, so it wasn't its own project, which would explain why there isn't a page with that name.
-
JAA
Do you have a link?
-
GhostyTongue
I'll link it
-
JAA
Oh, I found it, that was in 2013, well before my time.
-
JAA
Can't find anything relevant in IRC logs regarding Disney or dolimg.
-
GhostyTongue
I can't find it
-
GhostyTongue
I think I lost the link
-
JAA
We archive a lot of random things, especially through the URLs project and ArchiveBot. That doesn't mean there was a project for it.
-
GhostyTongue
Huh weird I was told it was
-
JAA
Who told you that?
-
JAA
And what did they tell you exactly?
-
GhostyTongue
It's been months since it happened I don't remember but they had a purple name
-
» fireonlive collates the list of purple users
-
JAA
Maybe in your client, users don't have colours on IRC.
-
GhostyTongue
Oh
-
JAA
But I see now that you requested those sites specifically in July, right.
-
JAA
Yes, pokechu22 listed those things and ran them through ArchiveBot.
-
GhostyTongue
Yes
-
JAA
We don't usually document such small projects.
-
GhostyTongue
But it was a large CDN and you guys go nuts for CDNs
-
JAA
It was 250-ish GiB?
-
JAA
That's tiny. :-)
-
GhostyTongue
More than 250 GB
-
GhostyTongue
Not ish
-
nicolas17
JAA: does Knowledge Adventure CDN have a wiki page? :D
-
JAA
Maybe 300, and I'm going by the figures pokechu22 mentioned at the time for the actual bucket size.
-
GhostyTongue
Yeah, I thought it wasn't archived; someone told me they weren't able to
-
GhostyTongue
Reply to nicolas17
-
JAA
nicolas17: Nope, case in point :-)
-
GhostyTongue
JAA: It was more than 300 GB, 325 GB to my knowledge
-
JAA
GhostyTongue: We archive a couple dozen terabytes per day usually.
-
JAA
Ain't nobody got time for documenting all of that.
-
GhostyTongue
Ok I know
-
GhostyTongue
Also, someone sent me this image; I think we need to archive this
gcdnb.pbrd.co/images/pEH7wdRNi6mR.jpg?o=1
-
nicolas17
yo is that even supposed to be public?
-
GhostyTongue
Idk
-
nicolas17
that looks like it could be a compromised account
-
JAA
→ #codearchiver
-
nicolas17
gitlab.wdi.disney.com asks for employee login
-
JAA
Ah yeah, nevermind then.
-
GhostyTongue
Oh OK, bc I didn't check the URL
-
GhostyTongue
I should have checked lmao
-
JAA
Nothing public on it.
-
nicolas17
"if your issue requires urgent and immediate attention, contact the WDI SRE team via PagerDuty email" oh fun, a public email address to wake up on-call staff!
-
GhostyTongue
Lmao
-
fireonlive
send your best memes
-
nicolas17
yikes
-
GhostyTongue
Yeah, I think it's bad. I wasn't even involved in whatever he did, but he claimed to have access to wonderland, gitlab, jira; it's wild what the dude claims
-
cas
Hello, I want to inform you guys about a dead satire anime news site called Anime Maru. I recently found out its domain has expired, and there appears to be no news about this on its social media, so I want to bring it to your attention in hopes of archiving whatever remnants of it exist.
-
nicolas17
cas: how long ago did it expire? can you find what IP address it had before it expired?
-
cas
I don't know how to do that, sorry. But this is the site in question
animemaru.com
-
nicolas17
ugh seems that expired in february
-
JAA
Doesn't look like it to me.
-
nicolas17
or hm
-
JAA
It expires in February.
-
nicolas17
completedns.com/dns-history/?domain=animemaru.com says dns10.parkpage.foundationapi.com was added as a nameserver in feb 2023
-
JAA
It resolved to 67.222.153.247 as of last week, but that server doesn't respond.
-
JAA
Oh
-
JAA
It has four NS, and they resolve to two different IPs. Beautiful.
-
fireonlive
lovely
-
JAA
The one above from ns{1,2}.exonhost.com and 35.186.223.180 from dns1{0,1}.parkpage.foundationapi.com.
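-
A minimal sketch of the comparison described above: expand the `ns{1,2}` shorthand into full nameserver names and group the nameservers by the address each one returned. The addresses are the ones quoted in the discussion and are hard-coded here rather than looked up live; `expand_braces` and `group_by_answer` are hypothetical helper names, not part of any tool mentioned in the channel.

```python
import re
from itertools import product


def expand_braces(pattern: str) -> list[str]:
    """Expand shell-style {a,b} groups (no nesting) into all combinations."""
    parts = re.split(r"\{([^{}]*)\}", pattern)
    # Even indices are literal text, odd indices are comma-separated choices.
    choices = [p.split(",") if i % 2 else [p] for i, p in enumerate(parts)]
    return ["".join(combo) for combo in product(*choices)]


def group_by_answer(answers: dict[str, str]) -> dict[str, list[str]]:
    """Group nameservers by the address each one handed out."""
    groups: dict[str, list[str]] = {}
    for ns, ip in answers.items():
        groups.setdefault(ip, []).append(ns)
    return groups


nameservers = expand_braces("ns{1,2}.exonhost.com") + expand_braces(
    "dns1{0,1}.parkpage.foundationapi.com"
)

# Assumption: answers as reported in the chat, not the result of a live query.
answers = {
    ns: "67.222.153.247" if ns.endswith("exonhost.com") else "35.186.223.180"
    for ns in nameservers
}

for ip, servers in group_by_answer(answers).items():
    print(ip, servers)
```

More than one key in the grouped result means the domain's nameservers disagree, which is the situation described for animemaru.com.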
-
nicolas17
maybe WBM can say when the site was last... usable
-
cas
hmmm so what's the status? Are things bad?
-
JAA
According to the WBM, it was up in April, and I'm seeing 67.222.153.247 at that time in DNSHistory, which is dead.
-
nicolas17
according to the WBM, it was dead in May, so there's nothing to archive here, the ship has sailed
-
nicolas17
ofc there's still the facebook/twitter/patreon but
-
cas
sucks to hear, that's unfortunate
-
cas
ig the remaining social medias can be archived, perhaps?
-
thuban
facebook unfortunately no, twitter after a fashion
-
fireonlive
added twitter/@AnimeMaru to my todo
-
thuban
ty
-
fireonlive
=]
-
thuban
idk what we normally do about patreon, but it looks like there aren't any posts anyway, so maybe just !ao?
-
fireonlive
sounds reasonable
-
cas
nice thanks fireonlive
-
fireonlive
:) welcome
-
cas
it's a bummer that I was too late in notifying AT about its death, but it is what it is.
-
JAA
Sudden shutdowns without announcements are almost impossible to catch, unfortunately.
-
cas
yeah I imagine it's difficult to catch such cases on time, if ever
-
cas
btw what's WBM?
-
fireonlive
wayback machine -
web.archive.org
-
cas
ahhh ok cool
-
arkiver
if anyone feels like queuing all (official) youtube channels for all of
en.wikipedia.org/wiki/Telewizja_Polska#TV_channels in #down-the-tube , please feel free to
-
arkiver
else i can have a look later today
-
JAA
AB, DPoS, IA, SPN, WBM; any other acronyms we commonly use?
-
arkiver
FOS and the new one
-
JAA
Not used much anymore, but true.
-
arkiver
does // in #// count?
-
fireonlive
sure :3
-
thuban
WARC, arguably
-
arkiver
hah yeah
-
arkiver
CDX
-
fireonlive
AT :p
-
arkiver
!!
-
thuban
SWH/SCN in #gitgud
-
fireonlive
BS/OT
-
arkiver
yeah maybe like #-bs , etc., that we often use
-
arkiver
it could confuse new people
-
JAA
Is CDX actually an acronym? I never figured out what it's supposed to mean.
-
fireonlive
gitgud has AFN for add forge now as well
-
fireonlive
well, not really gitgud
-
fireonlive
but ye :p
-
fireonlive
(codearchiver/SWH)
-
» fireonlive um actuallys everyone and introduces the word 'initialism'
-
fireonlive
"Traditionally, an index for a web archive (WARC or ARC) file has been called a CDX file, probably from Capture/Crawl inDeX (CDX)" ~
pywb.readthedocs.io/en/latest/manual/indexing.html
-
fireonlive
though, that's from webrecorder
-
fireonlive
so uh
-
fireonlive
back up a dumptruck full of salt
-
» JAA slaps fireonlive around a bit with a large trout
-
fireonlive
;)
-
JAA
It had to be done.
-
fireonlive
:D
-
JAA
Yeah, good enough.
-
thuban
they're equivalent over irc, since we pronounce all of them '...' :3
-
fireonlive
:3
-
» fireonlive asks chatgpt
-
JAA
fireonlive: Actually, if it's 'inDeX', it wouldn't be an initialism. :-P
-
fireonlive
i more meant the other ones :D
-
JAA
I know, I just had to.
-
fireonlive
:3
-
fireonlive
there's a whole "um, actually" game show, too!
-
thuban
hm, i really can't find a canonical expansion of 'cdx' anywhere!
-
arkiver
i asked the main wayback machine guy
-
thuban
nice
-
fireonlive
thanks arkiver :3
-
fireonlive
would be cool to get that documented
-
h2ibot
JustAnotherArchivist created Archiveteam:Acronyms (+1104, Created page with "This is a list of topical…):
wiki.archiveteam.org/?title=Archiveteam%3AAcronyms
-
nulldata
bird.co , electric scooter rental, filed for chapter 11 - potential for it to disappear.
-
fireonlive
no longer the word
-
fireonlive
a scanned printed README from 2002, neat :3
-
fireonlive
oh my it's all perl
-
JAA
> describing the content of the archive
-
JAA
So could be Content inDeX, too.
-
thuban
interesting, but authority unclear
-
pokechu22
Yeah
-
fireonlive
oh neat
-
fireonlive
oh hey, *the* brewster left a review on that item
-
fireonlive
:)
-
fireonlive
ooh
-
thuban
as does
commoncrawl.org/blog/announcing-the-common-crawl-index by ilya kreymer, who apparently ought to know
-
fireonlive
>Ilya Kreymer is Lead Software Engineer at Webrecorder Software.
-
fireonlive
hmmmm
-
fireonlive
;)
-
thuban
i know
-
thuban
but he _did_ work at ia on the wayback machine, so...
-
fireonlive
hmmm
-
fireonlive
TIL
-
thuban
then again, pywb is a webrecorder project, so why is its documentation so diffident on the subject >:?
-
fireonlive
>:(
-
thuban
i suppose kreymer (or indeed other ex-iaers) need not have looked at that section personally
-
h2ibot
Petchea edited Deathwatch (+278, /* 2023 */ China Judgments Online (court…):
wiki.archiveteam.org/?diff=51393&oldid=51391
-
h2ibot
Nulldata edited Deathwatch (+215, Added Today's Plan):
wiki.archiveteam.org/?diff=51394&oldid=51393
-
h2ibot
JustAnotherArchivist changed the user rights of User:Nulldata
-
fireonlive
🥳
-
fireonlive
congrats
-
h2ibot
ClubBBC TV edited List of lost online videos/list (+409, added alkinboy7500 hd cuz we need to restore…):
wiki.archiveteam.org/?diff=51395&oldid=49247
-
Megame
tvpworld.com and others redirect to
tvp.pl now. Any way to still grab them?
-
Megame
JAA, ^
-
arkiver
ah :/
-
arkiver
so it's happening right now
-
Megame
TVP Info, TVP3, TVP World and TVP Parlament were closed so far
-
Megame
according to wikipedia
-
Megame
all redirect to tvp.pl
-
fireonlive
crt.sh search for tvpworld.com only shows www.tvpworld.com sadly
-
JAA
Yes, grab everything we can.
-
fireonlive
"The manufacturer closed the site in February of the following year, then in March the cache was uploaded by the Wayback Machine Archive Team"
-
fireonlive
we have a new name
-
magmaus3
lol
-
JAA
*facepalm*
-
nulldata
no no they were uploaded by the notorious archive hacker James Scott
-
fireonlive
xP
-
nulldata
RIP h2ibot
-
arkiver
RIP
-
fireonlive
f
-
pabs
do we have a way to archive an entire domain+outlinks starting at *multiple* input pages on the domain at different levels of dirs? (IIRC AB !a and !a < aren't suitable)
-
pabs
I was thinking of saving all of the
www2u.biglobe.ne.jp (no index) pages I can find on search engines
-
thuban
nope!
-
thuban
i was just thinking about this use case again the other day. afaict best you could do is sans outlinks using grab-site with `--span-hosts` and `--domains`, and then extract outlinks from the results
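-
A rough sketch of the "extract outlinks from the results" step, assuming the crawled pages are already available as HTML strings (reading them out of the crawl output is not shown). It uses only the Python standard library; the page path and sample HTML are made up for illustration, and `LinkCollector` and `outlinks` are hypothetical names.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collect absolute URLs from <a href="..."> tags on one page."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links: set[str] = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page's own URL.
                self.links.add(urljoin(self.base_url, value))


def outlinks(page_url: str, html: str, domain: str) -> set[str]:
    """Return the http(s) links on the page that point away from `domain`."""
    collector = LinkCollector(page_url)
    collector.feed(html)
    result = set()
    for link in collector.links:
        parts = urlsplit(link)
        if parts.scheme in ("http", "https") and parts.hostname not in (None, domain):
            result.add(link)
    return result


# Hypothetical page and content, for illustration only.
sample_html = """
<a href="page2.html">next</a>
<a href="http://example.org/mirror/">mirror</a>
<a href="mailto:someone@example.org">mail</a>
"""
found = outlinks(
    "http://www2u.biglobe.ne.jp/~example/index.html",
    sample_html,
    "www2u.biglobe.ne.jp",
)
print(sorted(found))
```

The resulting set could then be deduplicated across pages and submitted as a separate list of URLs.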
-
qwertyasdfuiopghjkl
I'm guessing the gearrice.com article might have been scraped from somewhere else (or possibly AI-generated) because there's the sentence "To find your way around, a search interface has been put online at this address, which links to applications recorded in the Wayback Machine." but no actual link anywhere I can see.
-
qwertyasdfuiopghjkl
Also, when viewing the source of the page, there's "<!-- AI CONTENT END 1 -->" after the text of the article.
-
JAA
lol
-
JAA
Yeah, sounds about right.
-
fireonlive
ahh lol
-
that_lurker
o7
-
arkiver
*poof* another 2.5 billion of value went up in nothing
-
fireonlive
coulda been better used at IA
-
project10
brewster's billions
-
fireonlive
:D
-
tech234a
someone on Wikipedia apparently tagged us as "Organizations disestablished in 2023"
en.wikipedia.org/wiki/Archive_Team
-
fireonlive
wtf
-
fireonlive
(if he's the cofounder btw, who are the other cofounders?)
-
murb
their other changes already reverted
-
thuban
same ip editor has done the same thing to other articles; just revert it
-
thuban
ninja'd
-
murb
yeah
-
fireonlive
weird
-
fireonlive
"fireonlive" doesn't have a wikipedia account sadly
-
tech234a
Reverted
-
fireonlive
=]
-
fireonlive
tech234a++
-
eggdrop
[karma] 'tech234a' now has 1 karma!
-
masterX244
BIOS archive of ASRock server mainboards, might be useful to pull
-
masterX244
Pulling a dumb dump into my own server
-
qwertyasdfuiopghjkl
hempuli.itch.io/mobile-suit-baba is free for a limited time (~6 days). Is there any way to archive it?
-
kpcyrd
9ccf8c095c2c22deaba2c92ef66700b720b9d960ebbee
-
qwertyasdfuiopghjkl
Thanks. (the first time I tried SPN it got a 403 for some reason, guess I'll try it again to get the other file)
-
qwertyasdfuiopghjkl
aders=host&X-Amz-Signature=a45e6d7cb97e1a88dfbe65e25ec7bb35e8e13b5befdd5be77664a0b9a202ac90
-
SketchCow
Regarding Co-Founding
-
SketchCow
There were a set of us, I was just the person who had the idea, and people had all sorts of ideas
-
SketchCow
Like, I'd count chronomex
-
SketchCow
(Checking mail)
-
SketchCow
Hmm.
-
SketchCow
My mail seems to go back to 2009.
-
SketchCow
Oh, I switched to Gmail in June of 2009.
-
SketchCow
I'll put it on the list... find my 2009 mail, find where I talked with Archive Team members, get Co-Founders listed.
-
SketchCow
Motherfuckers, I am re-installing PINE
-
project10
hell yeah
-
anarcat
mastodon.xyz/@johl/111618899554454932 <- "Wikimedia Russia has been dissolved" (the org, not the website)
-
nulldata
Can someone please throw
medium.com/@hyperloop_one into AB? Hyperloop One is shutting down and is scrubbing socials
-
pokechu22
Medium is a bit of a pain, but I can try
-
thuban
scribe.rip might do in a pinch
-
pokechu22
hmm, 403s... does medium also need a special UA?
-
pokechu22
ah, seems like it's usually done with -u firefox
-
fireonlive
oh awesome (re: PINE/finding co-founders) :)
-
fireonlive
looks like Beeper has given up
-
nulldata
Thanks!
-
fireonlive
(on iMessage, not completely!)
-
nicolas17
they released their bridge as open source
-
nicolas17
maybe gitgud that stuff?
-
fireonlive
pushed the imessage repo to that chan
-
fireonlive
well, relayed