-
ljcool2006
i hope you saved photobucket images because i recently got an email stating that my account was deactivated
-
ljcool2006
the wiki page still says "not saved yet"
-
ljcool2006
the only thing i could find about it is this but it seems to be outdated:
archive.org/details/photobucketgrabs
-
TheTechRobo
someone's leaving reviews on items in that collection lmao
-
TheTechRobo
2 stars - "A couple of personal photos, photos of second hand cars and a few comics sketches and computers getting built. "
-
OrIdow6
The IA comments seem to attract the most unhinged people for some reason
-
qwertyasdfuiopghjkl
-
h2ibot
Entartet edited List of websites excluded from the Wayback Machine (+34, Added vintagebigblue.org.):
wiki.archiveteam.org/?diff=50451&oldid=50315
-
flashfire42
um h2ibot you wanna get your ass over to some of the other channels
-
pabs
JAA: found a former opensource.com team member on HN:
news.ycombinator.com/item?id=37040739
-
pabs
dev.getsol.us (phabricator) is migrating to github
getsol.us/2023/08/07/state-of-solus-august-2023
-
pabs
I'm working on getting a full list of the git repos for SWH/Codearchiver
-
pabs
-
Antonin
test
-
Antonin
[For French users] Hi! Is there something planned for saving Pages Perso from Orange? You may have seen the news on
pages.perso.orange.fr :/
-
OrIdow6
Antonin: I'm not French, but yes, we know about it
-
OrIdow6
Thanks for reporting anyway
-
Antonin
Great, thanks! :D What's planned? Haven't found anything on the Wiki
-
OrIdow6
As far as I know we haven't done much yet, it's still a month out and usually ISP hosting sites are small enough that we can safely get them within ~2 weeks before shutdown
-
OrIdow6
Not sure what kind of grab it will be, depends on the site
-
OrIdow6
Once it starts we will (try to) have discussion in #webroasting
-
Antonin
Well, there's lot of websites, but okay we'll see :) Have you any Matrix room? I'm not on IRC and if I keep this tab open it will disappear with others
-
OrIdow6
Our IRC network implements a Matrix bridge, I believe
hackint.org/transport/matrix is the page about it
-
OrIdow6
But I don't use it so if you have problems you might want to ask in #hackint
-
OrIdow6
(Or wait til someone who does comes on)
-
OrIdow6
:)
-
AntoninDelFabbro|m
Bonjour from FR !
-
Antonin
Nice, it's `#archiveteam-bs:hackint.org` fyi. Thanks!
-
OrIdow6
Np
-
flashfire42|m
I’ll start on the orange stuff in the next few days
-
AntoninDelFabbro|m
Awesome! Would be happy to help 🙂
-
that_lurker
-
qyxojzh|m
Requiescat in pace
-
pokechu22
I'll also look into doing an archivebot job for orange. I can't tell if
pages.perso.orange.fr provides a list of all the sites directly or not; it seems to list some things but I don't know French
-
qyxojzh|m
<pokechu22> "I'll also look into doing an..." <- I do, if you'd like some help
-
pokechu22
Sure, a list like that would be useful no matter how the project is done
-
qyxojzh|m
September 5 2023 is when it closes
-
qyxojzh|m
9th January 2024 is when all access is revoked
-
qyxojzh|m
Corrections welcome, this is the same Orange that used an Aphex Twin song in their commercials, isn't it?
-
qyxojzh|m
To Cure a Weakling Child (Contour Regard) specifically
-
JAA
pabs, pokechu22: For simple things that don't need custom stuff, just some hardcoded cookie(s), I usually just use wpull with the --load-cookies option. Of course it's also possible with qwarc.
-
JAA
pabs: Also, uh, I think I forgor about opensource.com, but fortunately it's still alive. Will grab soon.
-
JAA
nicolas17: Yeah, it went down a while ago I believe.
-
pokechu22
It might need special checking to make sure that it's not on the challenge page, but I can try with wpull to see if that's sufficient still
-
pokechu22
(I had it randomly give challenges on some of the images when loading the page normally, which means the site didn't work right; it would be better to capture the site without those)
-
JAA
Yeah, that can be checked with one of the hooks.
-
JAA
Don't think you can control the writing to WARC though. qwarc could do that.
-
TheTechRobo
wget-at can do that, I know that much
-
qyxojzh|m
Not being shut down, but worth archiving imo:
jobim.org
-
pokechu22
-
AntoninDelFabbro|m
What could help, is this directory :
annuaire-pp.orange.fr
-
AntoninDelFabbro|m
Not all websites are listed there, but... It's a beginning.
-
flashfire42
AntoninDelFabbro|m um either I am getting some weird errors or there is already something wrong with the perso.orange.fr stuff
-
flashfire42
Did a random sampling on the first page bing and none of them resolve for me but the one you linked there above does
-
flashfire42
Ok our info on the orange site is out of date they are hosted on monsite-orange.fr ranger than pagesperso-orange.fr
-
flashfire42
there is still some stuff in the cache for pageperso-orange.fr tho so I will do grabs for those and then move on to monsite-orange.fr
-
AntoninDelFabbro|m
There's both (tho the first one is "more recent" I think)
-
flashfire42
I mean I will check both but just from my sampling the pagesperso ones didnt resolve for me. so I can do jobs for the cache stuff and whatever resolves on pagesperso and then move to monsite
-
AntoninDelFabbro|m
pagesperso-orange.fr/
-
AntoninDelFabbro|m
monsite-orange.fr/
-
AntoninDelFabbro|m
Alright 😁
-
flashfire42
-
fireonlive
(someone uses microsoft edge? :P)
-
flashfire42
Fuck you microsoft will give me free windows 11 pro soon for the rewards. Plus I am too lazy to switch all my stuff over. I may have to if they start doing that screenshot bullshit or whatever tho
-
pokechu22
proaction.pagespro-orange.fr works for me, and perso.orange.fr/stephane.busson redirects to
stephane.busson.perso.orange.fr
-
pokechu22
Yeah, what doesn't resolve for you?
-
pokechu22
-
flashfire42
Yeah ok those resolve so I will have to check each of them individually it seems
-
pokechu22
Can you give an example of one that doesn't work?
-
flashfire42
-
flashfire42
it appears fine in the cache just doesnt otherwise resolve for me
-
flashfire42
Maybe I am blocked?
-
pokechu22
Yeah that one works fine for me
-
fireonlive
flashfire42: just poking a bit of fun :D
-
pokechu22
so I guess they just hate australia?
-
flashfire42
I have been running skyblogs that is also french maybe they decided FUCK YOU
-
flashfire42
So I may not be able to assist in full scraping at this time until skyblogs is done?
-
pokechu22
This is annoyingly one where !a < list doesn't work, since it's a bunch of different domains
-
AntoninDelFabbro|m
<flashfire42> "
server8.kiska.pw/uploads..." <- And I have a 404 here 😆 No, those doesn't work, they do only with subdomains
-
AntoninDelFabbro|m
<fireonlive> "(someone uses microsoft edge? :P..." <- I have it I czn watch sth for you tomorrow
-
fireonlive
ooh what are you going to watch for me
-
AntoninDelFabbro|m
Idk, why have you asked "someone uses Edge ? :P" ? 😆
-
AntoninDelFabbro|m
Oh nevermind, gotcha
-
fireonlive
bash.org still ded