-
egallager
-
fireonlive
tf
-
fireonlive
hope it's some sort of like 'old system' issue but... hm
-
egallager
-
fireonlive
i don't have that power, but J\AA will do so when they're available
-
fireonlive
could take a bit though but no worries they'll be looked at :)
-
egallager
-
fireonlive
-
DigitalDragons
maybe eggdrop can do bsky links too?
-
fireonlive
if no one objects.....
-
fireonlive
it doesn't look as feature complete though sadly :(
-
fireonlive
missing the image that was embedded, can't click to see the quoted post
-
fireonlive
-
fireonlive
here's what it looks like on real-blue-sky
-
egallager
-
egallager
(or maybe that's just for previews on Discord...)
-
fireonlive
looks like that's just embeds
-
pokechu22
view-source:https://psky.app/profile/littlegreenfootballs.com/post/3k5dy2qbgvw27 - "Redirecting you to the tweet in a moment."
-
pokechu22
ah, because it's based on
vxtwitter.com or something
-
LukeMax
i keep getting stuck in a 404 loop waht do i do
-
LukeMax
s/waht/what
-
fireonlive
? what project
-
LukeMax
uh skyblog
-
flashfire42
switch projects then
-
flashfire42
they are ban happy
-
LukeMax
oh
-
LukeMax
wdym
-
flashfire42
It means they are not happy we are scraping them so switch to another project to continue getting work
-
LukeMax
lmao
-
LukeMax
bunch of dum dums
-
LukeMax
so i just wait for it to go back to normal
-
LukeMax
when that happens
-
fireonlive
for skyblog and deadcat you'll want to run them with a concurrency of 1
-
fireonlive
s/deadcat/gfycat/
-
LukeMax
ohh ok
-
LukeMax
makes sense
-
LukeMax
so that way i dont get banned
-
imer
you’ll get banned slower :/
-
fireonlive
gfycat i think was ok? or at least a shorter ban... but skyblog is a ban anyways
-
nulldata
Safest conc on Skyblog is 0
-
fireonlive
can we do -1
-
h2ibot
-
h2ibot
Roachbones edited Discord (+216, /* Active */ add Discordless):
wiki.archiveteam.org/?diff=50520&oldid=50463
-
h2ibot
Cooljeanius edited Imgur (+32, use URL template more):
wiki.archiveteam.org/?diff=50521&oldid=49751
-
h2ibot
H2g2bob edited Deathwatch (+216, /* 2023 */):
wiki.archiveteam.org/?diff=50522&oldid=50510
-
h2ibot
Gullah edited Deathwatch (+230, Added June 27th 2023 IRL.com shutdown):
wiki.archiveteam.org/?diff=50523&oldid=50522
-
h2ibot
Segergren edited Plays.tv (+1393, Added methods for users to recover their videos…):
wiki.archiveteam.org/?diff=50525&oldid=48085
-
h2ibot
Jarshua edited List of websites excluded from the Wayback Machine (+39, add
eljamesauthor.com/): wiki.archiveteam.org/?diff=50526&oldid=50495
-
h2ibot
-
h2ibot
DigitalDragon edited WikiTeam (+110, /* Tools and source code */ Add footnote about…):
wiki.archiveteam.org/?diff=50529&oldid=50484
-
h2ibot
Nulldata edited Deathwatch (+292, Added Opera News):
wiki.archiveteam.org/?diff=50530&oldid=50523
-
h2ibot
-
h2ibot
Jwm uploaded
File:WikiTide screenshot.png (Screenshot of the new WikiTide main page):
wiki.archiveteam.org/?title=File%3AWikiTide%20screenshot.png
-
h2ibot
Jwm edited WikiTide (+209, Update with information about their new…):
wiki.archiveteam.org/?diff=50533&oldid=50008
-
h2ibot
Jwm created WikiForge (+1424, Creating a page for WikiForge to fix red links…):
wiki.archiveteam.org/?title=WikiForge
-
h2ibot
-
h2ibot
Jwm edited Template:Wikis (+21, Add [[WikiForge]]):
wiki.archiveteam.org/?diff=50536&oldid=50009
-
h2ibot
Jwm edited WikiTeam (+198, /* Wikifarms */ Added WikiForge and added…):
wiki.archiveteam.org/?diff=50537&oldid=50529
-
qwertyasdfuiopghjkl
-
h2ibot
-
h2ibot
FireonLive edited Current Projects (+0, It's "Xuite" that Xuite is running -- move it…):
wiki.archiveteam.org/?diff=50539&oldid=50504
-
h2ibot
JAABot edited List of websites excluded from the Wayback Machine (+0):
wiki.archiveteam.org/?diff=50540&oldid=50526
-
cas
Does anyone know of any Twitter file dumps or archival projects for Twitter? It's going crazy on the site again and Im here to look for anything comprehensive about Twitter, with tweets and media and all
-
DigitalDragons
I think #archivebot is/was okay for grabbing specific profiles/users but I don't personally know of any "let's grab everything!" projects
-
cas
hmmm, is that so? I see
-
cas
I don't necessarily need anything new, I'm hoping for dumps of older stuff to peruse if possible
-
cas
But #archivebot sounds interesting, I'll see that as well, thank you
-
pokechu22
I don't think archivebot works well for twitter anymore - socialbot used to handle it, but hasn't worked for a while
-
pokechu22
The closest we have now is saving individual nitter instances... which doesn't work well
-
fireonlive
nitter.net works now, but the owner pleads people not to scrape it
-
cas
oh that is unfortunate to hear
-
cas
Man, I hope I can find something soon ;-;
-
doublejay
hi - i'm suddenly getting an impassable captcha. firefox / safari / multiple IP loations
-
doublejay
captcha just loops.
-
fireonlive
hi we aren’t archive.is/today/ph/etc
-
doublejay
oh
-
fireonlive
could be the site is having issues
-
doublejay
sorry!
-
fireonlive
all good :)
-
doublejay
(y) (y)
-
fireonlive
it does happen occasionally to me too but usually just coming back later works
-
doublejay
i seem to have hit hard stop.
-
fireonlive
if clearing cookies for the site doesn’t help try waiting a few hours but that’s all i know
-
fireonlive
im just some guy who uses it as is everyone else here :3
-
doublejay
:-) it does work better than the others
-
doublejay
or did!
-
doublejay
appreciate the help.
-
doublejay
have a good one.
-
fireonlive
you too!
-
fireonlive
:)
-
pabs
-
JAA
qwertyasdfuiopghjkl: Good catch, thanks.
-
h2ibot
JustAnotherArchivist edited List of websites excluded from the Wayback Machine (-6, Partially revert revision 50526 by…):
wiki.archiveteam.org/?diff=50541&oldid=50540
-
h2ibot
JAABot edited List of websites excluded from the Wayback Machine (+0):
wiki.archiveteam.org/?diff=50542&oldid=50541
-
LukeMax
bruh my warrior has been going for half an hour constantly getting 404s
-
LukeMax
i think ive been fully ip blocked
-
kiryu
LukeMax: What project are you running on?
-
LukeMax
skyblog
-
JAA
Please use the relevant project channel for project-specific questions.
-
LukeMax
bruh
-
LukeMax
ill do that in the future but can i get help on that now because its n
-
LukeMax
still not working
-
JAA
Not here.
-
JAA
We have project channels for a reason: to keep discussion related to a project in one place.
-
JAA
So ask in #bowlofpetunias instead.
-
nulldata
-
qyxojzh|m
Beautiful website name
-
nulldata
https//x.com:2083 now redirects to Twitter.com, but definitely was CPanel according to Bing
-
kiryu
nulldata: The whole Twitter was hosted on a single GoDaddy server ??
web.archive.org/web/20230412141443/http://x.com:2083
-
fireonlive
🤨 y’all memers
-
qyxojzh|m
@erkinalp is it an old website?
-
qyxojzh|m
I could invite him still, speed up the process
-
pokechu22
erkinalp: any estimate of how many posts and topics it has? I'm not seeing that info on the main page
-
pokechu22
... though I can guess at what "1 milyon Türkiye fotoğrafı" means
-
qyxojzh|m
One million photographs of Turkey
-
erkinalp
qyxojzh|m: it's active since 2004, has about 200k topics, 1.5M+ photos, ~200k users,
-
qyxojzh|m
Because Turkish photographs could be anything ig?
-
qyxojzh|m
erkinalp: No wonder it's HTTP only
-
erkinalp
also high quality turkish transportation discussions
-
pokechu22
Hmm, that might be too big for archivebot... but I can try it at least
-
erkinalp
you could basically trace any iett updates faster than iett's own website publishes
-
erkinalp
pokechu22: a specialised bot for phpbb could help
-
pokechu22
That's true, JAA's qwarc would be better (assuming no rate-limiting)
-
qyxojzh|m
erkinalp: Y'all should make a new website dedicated to this and get the WowTurkey userbase to migrate there, that'd be quite handy
-
arkiver
erkinalp: you noted only high quality photos can be seen with account, but can the direct URL to them still be requested without account?
-
pokechu22
-
arkiver
erkinalp: if you're in contact with an admin - perhaps they can open up high quality to everyone without account?
-
erkinalp
qyxojzh|m: yeah a few wowturkey users including me are considering that
-
erkinalp
arkiver: if i know the specs of the server right, it wouldn't be able to cope with that much traffic
-
erkinalp
iirc it's a single debian box with about 300tb of hdd and 512gb of ram
-
arkiver
erkinalp: alright, so we'll just get it without high res versions
-
arkiver
sounds not too bad
-
arkiver
we'll get a job started on archivebot for it, without account
-
arkiver
ah
-
arkiver
thread IDs are nicely sequential
-
arkiver
JAA: is it possible for qwarc to make an easy/fast copy of wowturkey.com ?
-
arkiver
erkinalp: job is running, fast site
-
pokechu22
-
erkinalp
it's topic leaderboard
-
qyxojzh|m
cevap yazanlar?
-
erkinalp
how many people replied to this thread, and how many posts each wrote
-
pokechu22
It requires being logged in, so probably it's reasonable to ignore it
-
erkinalp
well we could fake it afterwards
-
pokechu22
We might also want to ignore e.g.
wowturkey.com/forum/rating.php?p=9198381 but that at least works when not logged in and has somewhat intersting info... and it looks like most posts have ratings on them too, so it's not getting an empty list for most of them (unlike on some forums)
-
arkiver
fake what?
-
pokechu22
I think the point more is that the page doesn't give you information you couldn't get by looking at all the posts (not that we should re-create it to add it into the warc and on web.archive.org)
-
arkiver
well one thing is for sure - nothing will be faked
-
qyxojzh|m
Maybe we could create a few throwaway accounts?
-
erkinalp
pokechu22: ratings are actually useful, it's more than just like/dislike
-
arkiver
i'm not a big fan of that
-
arkiver
anyone have an example of a high quality image?
-
erkinalp
-
erkinalp
-
erkinalp
(requires login to view)
-
arkiver
ah yeah that gives me a 404
-
pokechu22
-
arkiver
well, it may be worth asking the admins if they can please open up images behind a login wall?
-
pokechu22
-
arkiver
yeah
-
erkinalp
one of the positive ratings is "i'd sign that off"
-
erkinalp
("altına imzamı atarım")
-
erkinalp
arkiver: as i mentioned, the server wouldn't be able to cope with that; they already disabled hi-res uploads for many users; just a few very aged and privileged users can upload now
-
arkiver
alright we'll keep going as is without account
-
erkinalp
messages posted to threads *can* be edited, the edit window is 24 hours
-
erkinalp
non-mods can't delete their own messages
-
erkinalp
this is going to be important for continued archival efforts towards this forum
-
arkiver
this is now being archived under the assumption it will go offline
-
arkiver
it's not currently a long term archiving project
-
erkinalp
yeah, but if doesn't go offline soon enough, the incremental archive can be resumed from (today minus 24h)
-
erkinalp
arkiver: what's the progress now?
-
nulldata
-
fireonlive
-
nulldata
-
erkinalp
oh wowturkey archive is building up quite small without all those hires images
-
JAA
Taking a look at this now.
-
JAA
erkinalp: I assume all photos are linked from thread pages?
-
erkinalp
JAA: yes
-
erkinalp
a few featured ones are linked from the index pages
-
JAA
Some images are linked directly, others go through that t.php thing. I wonder why.
-
erkinalp
yeah the "featured images" thing
-
JAA
-
erkinalp
-
JAA
First two have links, the other three are directly embedded without a link.
-
JAA
Ah
-
erkinalp
low res is up to 430px by 430px
-
erkinalp
not all subforums accept hi-res uploads
-
h2ibot
Nano412510 edited URLTeam (+153, /* Alive */):
wiki.archiveteam.org/?diff=50543&oldid=50421
-
nicolas17
-
h2ibot
-
eggdrop
-
erkinalp
qyxojzh|m: sadly not possible, admin disabled registrations about a week ago
-
nicolas17
JAA: is there anything functional for twitter archival nowadays?
twitter.com/jbeda/status/1693290822370787697
-
eggdrop
-
JAA
nicolas17: Nope :-/
-
JAA
And yeah, I saw that earlier, already threw here other things into the appropriate channels.
-
nicolas17
what about twitch? Kris's last two stream VODs are still up (I downloaded them locally)
-
JAA
I threw it into #burnthetwitch (though that only archives metadata, not the VODs themselves).
-
nicolas17
if I just feed the .m3u8 and .ts URLs into archivebot, the result would have zero discoverability; idk how our twitch stuff works normally
-
erkinalp
nicolas17: they're not archival friendly; non-premium users get baked-in ads in their streams, muxed by twitch
-
nicolas17
isn't that in the live streams rather than the VODs?
-
erkinalp
VODs are opt in
-
nicolas17
I've never seen ads baked into VODs
-
erkinalp
not like youtube
-
erkinalp
the streamer opts into having non-clip VODs available
-
nicolas17
well yes
-
nicolas17
I'm not talking about cases where VODs aren't available to begin with :P
-
erkinalp
arkiver: if we were to assume it's going to shut down exactly in the anniversary, we have about a week in which we could do a second attempt
-
immibis
twitter deleted 10 years worth of media. It's already gone, beyond saving. Next week, tomorrow, or next hour they could delete another 10 years.
-
immibis
(according to what people are saying on the internet)
-
erkinalp
another 10 years makes it to the present
-
erkinalp
*would make
-
erkinalp
and the next thing, shut down entirely
-
immibis
corect
-
immibis
correct and corrrect as well
-
JAA
immibis: They didn't delete the media, they fucked up something on their URL shortener t.co, which broke links. (Yes, media use links, it's weird.) The media still exist and have partially started working again.
-
fireonlive
sadly i doubt there will be a public post mortem on that... too bad; would be neat to read
-
nicolas17
>a Musk company admitting error
-
nicolas17
lol
-
fireonlive
x3
-
arkiver
:P
-
h2ibot
Gullah edited Deathwatch (+256, Added August 16th 2023 Anonfiles.com shutdown):
wiki.archiveteam.org/?diff=50545&oldid=50530
-
LukeMax
ugh my warrior isnt connecting to the internet
-
h2ibot
-
LukeMax
it was working yesterday
-
LukeMax
and i did all the stuff on the wiki page
-
LukeMax
(im on virtualbox if you need that info)
-
LukeMax
what do i do
-
fireonlive
see #warrior - they'd probably know more
-
h2ibot
Pokechu22 edited Deathwatch (+273, /* Pining for the Fjords (Dying) */ various…):
wiki.archiveteam.org/?diff=50547&oldid=50545
-
LukeMax
nobodys responding on #warrior
-
nicolas17
it's sunday people are touching grass
-
fireonlive
patience, young grasshopper
-
LukeMax
fair