-
fireonlive
-
eggdrop
-
fireonlive
no. bad. stop.
-
OrIdow6
^ some rando using the goo.gl shutdown to promote their own "successor"
-
cantap
cantap64⊙gc
-
cantap
suthamidea30⊙gc
-
OrIdow6
cantap: ?
-
arkiver
thanks for the list yzqzss
-
arkiver
this goo.gl shut down is going to be hugely damaging
-
arkiver
yzqzss: how was this list collected?
-
arkiver
on goo.gl, note that the URLTeam2 project does not create WARCs; anything found there we'll reprocess with a custom project
-
arkiver
i'll bring it up early so we have a lot of time to find and queue items to it
-
arkiver
URLs redirected to will be stashed for now, not immediately fed to #//
-
fireonlive
rock the WARC :)
-
arkiver
sigh bot is down due to yet another power outage
-
arkiver
power outage is not related to IA itself
-
arkiver
can't think of a place that has more power outages :/
-
fireonlive
:( yeah
-
arkiver
bot is back
-
fireonlive
:)
-
fireonlive
-
eggdrop
-
fireonlive
all the immutable places (including print media) that have used goo.gl...
-
arkiver
let's make a goo.gl channel! any suggestions?
-
fireonlive
there's #googlecrash for all things google-ish
-
fireonlive
ah, google drive
-
arkiver
also i need opinion on the name of this thing...
en.wikipedia.org/wiki/Google_URL_Shortener is it "Google URL Shortener" or "goo.gl" :P
-
arkiver
fireonlive: yeah let's make a new one, this one will get traffic i think
-
fireonlive
could re-use microsoft's #scroogled
en.wikipedia.org/wiki/Scroogled :p
-
fireonlive
i don't think i've ever heard it called 'Google URL Shortener' myself, interesting
-
fireonlive
(or a more-explicit #screwgled)
-
fireonlive
#googone
-
arkiver
is #sgroogled too far fetched? :P (replace c by g)
-
arkiver
i'm not native english speaker... but i think scroogled and sgroogled would be pronounced the same?
-
arkiver
at 6 characters with 0-9a-zA-Z (62 symbols) it's only like 570 billion requests
-
arkiver
oops
-
arkiver
57 billion i mean
-
arkiver
not bad
-
arkiver
we can scan through that
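-
(sketch)
A quick check of the keyspace arithmetic above. This is a minimal sketch that assumes six-character codes (as in goo.gl/Y5VIoG) drawn from 0-9a-zA-Z; the code length is inferred from the 57 billion figure, not stated in the announcement, and the function names are made up for illustration.

```python
import string

# Alphabet described in the chat: 0-9, a-z, A-Z (62 symbols).
ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase


def keyspace(length: int, alphabet_size: int = len(ALPHABET)) -> int:
    """Number of distinct codes of exactly `length` characters."""
    return alphabet_size ** length


def keyspace_up_to(max_length: int, alphabet_size: int = len(ALPHABET)) -> int:
    """Number of distinct codes of 1..max_length characters."""
    return sum(alphabet_size ** n for n in range(1, max_length + 1))


if __name__ == "__main__":
    print(f"{keyspace(6):,}")        # 56,800,235,584 (about 57 billion)
    print(f"{keyspace_up_to(6):,}")  # 57,731,386,986 including shorter codes
```

Shorter codes add under 2% on top of the six-character space, so the "57 billion" estimate holds either way.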
-
fireonlive
hmm the translate bot pronounces it as s-groog-led
-
arkiver
what will be difficult to archive is the *.app.goo.gl stuff though
-
IDK
wtf another product to the google graveyard in 2 weeks?
-
fireonlive
looks like the old-style goo.gl/Y5VIoG+ and goo.gl/Y5VIoG+ links died when they moved to firebase dynamic links
-
fireonlive
...
-
fireonlive
goo.gl/Y5VIoG.info for the second one
-
arkiver
?d=1 is nice though
-
arkiver
what was the +?
-
fireonlive
ah nice
-
fireonlive
iirc it used to show where it resolved to as well as stats?
-
arkiver
fireonlive: are we looking at the same stackexchange page? :P
-
arkiver
or is goo.gl/Y5VIoG that universal
-
fireonlive
-
arkiver
we are indeed haha
-
fireonlive
:D
-
arkiver
.qr would have been nice, looks like that is dead too already
-
fireonlive
yeah :(
-
fireonlive
i guess there's a lot of routing options, so good that d=1 exists..
firebase.google.com/docs/dynamic-links/debug .. though unsure how much if any that was used on goo.gl itself
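-
(sketch)
A minimal sketch of appending the d=1 debug flag mentioned above to a short link before fetching it. The helper name and the example URL are hypothetical, the https:// scheme is an assumption (the log shows scheme-less links), and whether d=1 was honoured on goo.gl itself is not confirmed in the chat.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_debug_flag(url: str) -> str:
    """Return `url` with d=1 set in its query string, keeping other parameters."""
    parts = urlsplit(url)
    # Drop any existing "d" parameter so the flag is not duplicated.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != "d"]
    query.append(("d", "1"))
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # Hypothetical short link, used only to show the transformation.
    print(with_debug_flag("https://example.app.goo.gl/abc123"))
```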
-
arkiver
yeah there's a ton of routing options
-
arkiver
... time to get them all :)
-
fireonlive
:D
-
arkiver
found a nice example at f7td5.app.goo.gl/VzgJeH?d=1 (just a random shortened *.app.goo.gl/* URL, no idea about the page it leads to)
-
fireonlive
-
arkiver
blegh
-
fireonlive
ah, wow
-
fireonlive
yeah lot of character space :/
-
arkiver
yeah we can't scan through that
-
fireonlive
firebase dynamic links also work on custom domains; but that might just be too big too
-
fireonlive
(since they can be anything arbitrary/programmatic)
-
arkiver
we'll support it when people want to queue it and try to discover as much as possible, but can't scan through that all
-
arkiver
the goo.gl/* URLs alone we can scan through though
-
fireonlive
ah that sounds good
-
OrIdow6
#goo.gone
-
OrIdow6
And yes sg... and sc... would be pronounced the same
-
fireonlive
(don't have an example handy, but say bbc.co.uk running one at b.bc or sth)
-
DigitalDragons
hmm, are they even shutting down *.app.goo.gl?
-
DigitalDragons
if I open google maps right now, it still gives me links like maps.app.goo.gl/W4NXZ2ghG4TcyPpU9 and they are pretty specific about "goo.gl/*" in the announcement
-
IDK
Also btw, is anyone aware that the target is dead
-
fireonlive
dead or full
-
fireonlive
IA had a power outage for a bit so it could be backed up
-
fireonlive
DigitalDragons: hm, it seems to be running on firebase dynamic links too
-
fireonlive
-
pabs
images.app.goo.gl is another subdomain
-
pabs
and f7td5.app.goo.gl photos.app.goo.gl
-
pabs
hmm, wonder if goo.gle is affected too, I found goo.gle/patchz-nomination
-
pabs
also found URLs like goo.gl/photos...
-
pabs
#gonegl
-
DigitalDragons
-
fireonlive
:|
-
IDK
(Replying to fireonlive) Yep the target is full
-
fireonlive
ah
-
IDK
also I dont think the *.app.goo.gl is affected
-
IDK
"Any developers using links built with the Google URL Shortener in the form goo.gl/* will be impacted, and these URLs will no longer return a response after August 25th, 2025."
-
IDK
I dont think they are intending to shut down firebase dynamic links
-
fireonlive
they are
-
fireonlive
-
IDK
wtf
-
IDK
why, just WHY
-
thuban
the necrotic engines of the google graveyard demand sacrifice
-
thuban
<@arkiver> i'm not native english speaker... but i think scroogled and sgroogled would be pronounced the same?
-
thuban
^ not imo, [c] is /k/ (voiceless) and [g] is /g/ (voiced). i might approximate initial /sgɹ/ as /skɹ/ but would not really regard it as pronounceable
-
thuban
sounds like there isn't a channel yet? (i like #googone; however, consider also: #ruegl)
-
c3manu
IPA++
-
eggdrop
[karma] 'IPA' now has 1 karma!
-
c3manu
😎
-
masterx244|m
maybe for the app.goo.gl crap we can sift through the old project WARCs again to catch them like we have done for imgur and friends
-
yzqzss
<arkiver> "yzqzss: how was this list..." <- 1. get a blog's homepage URI via feed.cnblogs.com/blog/u{BlogID}/rss/ (<feed><author><uri>) 2. iterate through {URI}?page={page} page by page
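-
(sketch)
The two steps above could look roughly like this. This is a sketch only: it assumes the feed is Atom-namespaced, uses an inline made-up sample instead of a real response, and leaves fetching and the stop condition for pagination (not described in the chat) to the caller. All function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from itertools import islice
from typing import Iterator, Optional

ATOM = "{http://www.w3.org/2005/Atom}"  # assumption: the feed uses the Atom namespace


def feed_url(blog_id: int) -> str:
    """Step 1a: feed URL for a numeric blog ID (https scheme assumed)."""
    return f"https://feed.cnblogs.com/blog/u{blog_id}/rss/"


def homepage_uri(feed_xml: str) -> Optional[str]:
    """Step 1b: read <feed><author><uri> from the feed body, if present."""
    root = ET.fromstring(feed_xml)
    node = root.find(f"{ATOM}author/{ATOM}uri")
    if node is None or not node.text:
        return None
    return node.text.strip()


def page_urls(uri: str, start: int = 1) -> Iterator[str]:
    """Step 2: yield {URI}?page={page} indefinitely; the caller decides when to stop."""
    page = start
    while True:
        yield f"{uri}?page={page}"
        page += 1


# Made-up sample feed, used only to exercise the parser.
SAMPLE = (
    '<feed xmlns="http://www.w3.org/2005/Atom"><title>demo</title>'
    "<author><name>demo</name><uri>https://www.cnblogs.com/demo/</uri></author></feed>"
)

if __name__ == "__main__":
    uri = homepage_uri(SAMPLE)
    print(feed_url(123))
    print(list(islice(page_urls(uri), 3)))
```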
-
yzqzss
-
h2ibot
Exorcism edited Bugzilla (+4, /* Status */):
wiki.archiveteam.org/?diff=52941&oldid=52940
-
IDK
thuban: would #scroogled be pronounced screw oogled?
-
expert
hello people
-
expert
can anyone give me information related to hello_solver123
-
masterx244|m
maybe we should back up the crowdstrike public facing stuff too, in case they go belly-up due to their goofup
-
stupiddoumin
hi
-
stupiddoumin
can i ask about archiving?
-
c3manu
hi! sure, what would you like to know?
-
stupiddoumin
the homepage service of vector, a japanese software distribution site, is closing down on 2024/12/20 (jst?)
-
stupiddoumin
-
stupiddoumin
-
c3manu
ah, we already know about that one. thanks! :)
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52942&oldid=52941
-
c3manu
fyi: since i didn't really get an answer regarding the hp.vector.co.jp stuff, i just queued the list nullpeta requested in #archivebot:
files.catbox.moe/ooe4xx.txt
-
c3manu
depending on how large this will get it can probably get re-run in december, but i share the concern that some users might replace their sites with redirects to their new locations (which is often enough paired with a rebuild of it)
-
c3manu
(or a makeover, rather)
-
c3manu
job is currently waiting for the pipeline
-
nullpeta
I am working on hp.vector.co.jp and will share my results.
-
nullpeta
Here is the list of URLs that return 200 on http and the script we used to extract them. Feel free to use them!
files.catbox.moe/ooe4xx.txt files.catbox.moe/5ga5db.py
-
c3manu
nullpeta: hehe, just told them i queued your list ;)
-
asie
nullpeta: that's not complete
-
asie
there was a public homepage list removed after october 2016... it had two URLs which do not fit that scheme
-
asie
-
asie
not a big deal, but
-
asie
c3manu: ^
-
asie
and yeah rerunning it in december makes sense
-
c3manu
ah, i must have missed that >.<
-
nullpeta
asie: Wow, I did not notice this. Excellent point!
-
asie
-
asie
but a bruteforce is good to run, as thuban previously found six pages created after mid-2016 which were not on that page, but fit the VAnnnnnn scheme
-
asie
the two combined should be fairly comprehensive
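-
(sketch)
A minimal sketch of the "bruteforce plus archived list" combination described above. It assumes the VAnnnnnn scheme means "VA" followed by six zero-padded digits; the function names are hypothetical, and the archived-list entries in the demo are placeholders, not real data.

```python
from typing import Iterable, Iterator, List


def va_ids(start: int = 0, stop: int = 1_000_000) -> Iterator[str]:
    """Yield candidate IDs VA000000..VA999999 (or a sub-range) for a bruteforce pass."""
    for n in range(start, stop):
        yield f"VA{n:06d}"


def combine(*sources: Iterable[str]) -> List[str]:
    """Merge several lists, dropping duplicates and keeping first-seen order."""
    seen = set()
    merged = []
    for source in sources:
        for item in source:
            item = item.strip()
            if item and item not in seen:
                seen.add(item)
                merged.append(item)
    return merged


if __name__ == "__main__":
    # Placeholder inputs: bruteforce hits plus entries from an archived list,
    # including one that does not fit the VAnnnnnn scheme.
    bruteforce_hits = list(va_ids(1028, 1030))
    archived_list = ["VA001028", "some-other-path"]
    print(combine(bruteforce_hits, archived_list))
```

Merging by first-seen order keeps the bruteforce results intact and only appends the extra entries that do not fit the scheme.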
-
nullpeta
The problem is that some URLs return 403 instead of 404. Maybe there is some kind of hidden page. Here is the list of IDs that return 403. (sorry it's not sorted!) files.catbox.moe/7esoln.txt
-
c3manu
or maybe banned users?
-
c3manu
some forums return 403 for those
-
asie
-
asie
so IMO this could be something like deleted accounts
-
c3manu
if you can give me a combined list, i'll run that one instead
-
nullpeta
asie: Indeed, it seems likely. I will report back if I find out anything on this.
-
c3manu
aborted nullpeta's list in favor of a combined one
-
asie
-
asie
combined one
-
c3manu
merci :)
-
nullpeta
I checked and the combined list looks complete! This page (web.archive.org/web/20161012170825/…or.co.jp/vpack/author/listpage.html) seems to be missing a few sites (e.g. VA001028), but it looks like the brute force found those as well. Thank you for the help.
-
Blackb|rd
'ello. I am running a podman based worker and set an HTTP basic auth password that is lost to the sands of time. How can I reset it?
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52943&oldid=52942
-
that_lurker
Blackb|rd: stop the container and remove it and start from the beginning
-
Blackb|rd
I tried using systemctl to stop it, and then deleting it using the podman cmdline, but there was no container anymore
-
that_lurker
podman system prune -a
-
nullpeta
Could someone insert honz.jp into archivebot? honz is a Japanese book review site that has already been discontinued and is no longer being updated. The site will be deleted in the near future.
-
that_lurker
-
Blackb|rd
So podman system prune -a did delete stuff, and I started the worker again using systemctl, but it still prompts me for a password
-
nulldata
nullpeta - queued in AB
-
nullpeta
nulldata: Thank you!
-
Blackb|rd
huh, it somehow got into the state of remembering the username, but with an empty pw, and logging in with the username and no pw worked. It also remembered the concurrency setting "somehow"
-
that_lurker
when using podman the config persists if I remember correctly, but good that you got in and can set a new password you can remember
-
Blackb|rd
yeah, thanks lurker!
-
yzqzss
arkiver: will cnblogs have a warrior project or go to #//?
-
JaffaCakes118
-
JaffaCakes118
All google short URLs will start returning 404
-
nulldata
JaffaCakes118 - Yes, it has been discussed
-
that_lurker
JAA: Would it be a good idea to have the same topic edits for this channel that are made in #archiveteam when topics are known already? Stuff gets buried here at times too :-)
-
h2ibot
Exorcism edited Bugzilla (+0, /* Status */):
wiki.archiveteam.org/?diff=52944&oldid=52943
-
Ravenloft
fireonlive sure, sorry, my bad
-
fireonlive
Ravenloft: not at all! :) the initial 'hey this is on fire' is perfect for that channel (though not a lot of people read the topic unfortunately)
-
fireonlive
we just move here for the more nitty-gritty after
-
fireonlive
(and, soon a dedicated channel for the project itself to contain it in one area)
-
imer
speaking of, any rate limiting on goo.gl?
-
fireonlive
-
JAA
I propose: #goop.gl
-
JAA
that_lurker: That'll go out of sync immediately. Better to keep it in one place, I think.
-
that_lurker
true and i'll second goop :-) I was thinking about gone.gl or grave.gl but that sounds better
-
Vokun
goop.gl is good. it keeps it linked to the shortener, which is good, because we're all but guaranteed to have more google related projects
-
JAA
!status 4zne9j84uh36an0c5rscw855p
-
JAA
Not here.
-
SootBector
shortening the shortener to #gl might be fitting
-
that_lurker
that could overlap if something happens to the gl tld and that needs a project
-
fireonlive
wouldn't be a pun though
-
fireonlive
(or a joke of some kind)
-
that_lurker
-
fireonlive
🥵
-
SootBector
gig.gl not much connection but made me smile