00:12:17 more on the recent Twitter deletions: https://nitter.net/bubbaprog/status/1693021147447632062 00:13:37 tf 00:13:53 hope it's some sort of like 'old system' issue but... hm 00:18:07 could someone please approve my wiki edits? https://wiki.archiveteam.org/index.php/Special:Contributions/Cooljeanius 00:19:09 i don't have that power, but J\AA will do so when they're available 00:19:19 could take a bit though but no worries they'll be looked at :) 00:33:27 one more on today's Twitter deletions: https://bsky.app/profile/littlegreenfootballs.com/post/3k5dy2qbgvw27 00:38:43 login bypass: https://skeeet.xyz/?url=https://bsky.app/profile/littlegreenfootballs.com/post/3k5dy2qbgvw27 00:48:59 maybe eggdrop can do bsky links too? 00:49:28 if no one objects..... 00:49:54 it doesn't look as feature complete though sadly :( 00:50:14 missing the image that was embedded, can't click to see the quoted post 00:50:56 https://mkx9delh5a.execute-api.ca-central-1.amazonaws.com/uploads/e5a7339da95ec83e/image.png 00:51:04 here's what it looks like on real-blue-sky 00:53:39 I hear that switching the "b" to a "p" also works: https://psky.app/profile/littlegreenfootballs.com/post/3k5dy2qbgvw27 00:54:13 (or maybe that's just for previews on Discord...) 00:54:57 looks like that's just embeds 00:58:41 view-source:https://psky.app/profile/littlegreenfootballs.com/post/3k5dy2qbgvw27 - "Redirecting you to the tweet in a moment." 00:59:00 ah, because it's based on https://vxtwitter.com/ or something 01:19:15 i keep getting stuck in a 404 loop waht do i do 01:19:20 s/waht/what 01:19:38 ? what project 01:19:47 uh skyblog 01:19:56 switch projects then 01:20:11 they are ban happy 01:20:36 oh 01:20:39 wdym 01:21:29 It means they are not happy we are scraping them so switch to another project to continue getting work 01:21:50 lmao 01:21:56 bunch of dum dums 01:35:15 so i just wait for it to go back to normal 01:35:21 when that happens 01:35:49 for skyblog and deadcat you'll want to run them with a concurrency of 1 01:36:07 s/deadcat/gfycat/ 01:37:25 ohh ok 01:37:40 makes sense 01:37:44 so that way i dont get banned 01:40:15 you’ll get banned slower :/ 01:51:29 gfycat i think was ok? or at least a shorter ban... but skyblog is a ban anyways 01:51:47 Safest conc on Skyblog is 0 01:51:55 can we do -1 02:09:36 Arkiver uploaded File:Xuite-icon.png: https://wiki.archiveteam.org/?title=File%3AXuite-icon.png 02:12:36 Roachbones edited Discord (+216, /* Active */ add Discordless): https://wiki.archiveteam.org/?diff=50520&oldid=50463 02:12:37 Cooljeanius edited Imgur (+32, use URL template more): https://wiki.archiveteam.org/?diff=50521&oldid=49751 02:12:38 H2g2bob edited Deathwatch (+216, /* 2023 */): https://wiki.archiveteam.org/?diff=50522&oldid=50510 02:12:39 Gullah edited Deathwatch (+230, Added June 27th 2023 IRL.com shutdown): https://wiki.archiveteam.org/?diff=50523&oldid=50522 02:12:40 Segergren edited Plays.tv (+1393, Added methods for users to recover their videos…): https://wiki.archiveteam.org/?diff=50525&oldid=48085 02:12:41 Jarshua edited List of websites excluded from the Wayback Machine (+39, add https://www.eljamesauthor.com/): https://wiki.archiveteam.org/?diff=50526&oldid=50495 02:13:36 Jurta edited Twitter (+211): https://wiki.archiveteam.org/?diff=50528&oldid=50237 02:13:37 DigitalDragon edited WikiTeam (+110, /* Tools and source code */ Add footnote about…): https://wiki.archiveteam.org/?diff=50529&oldid=50484 02:13:38 Nulldata edited Deathwatch (+292, Added Opera News): https://wiki.archiveteam.org/?diff=50530&oldid=50523 02:13:40 Jwm uploaded File:WikiTide logo.png ([[WikiTide]] logo): https://wiki.archiveteam.org/?title=File%3AWikiTide%20logo.png 02:13:41 Jwm uploaded File:WikiTide screenshot.png (Screenshot of the new WikiTide main page): https://wiki.archiveteam.org/?title=File%3AWikiTide%20screenshot.png 02:13:42 Jwm edited WikiTide (+209, Update with information about their new…): https://wiki.archiveteam.org/?diff=50533&oldid=50008 02:13:43 Jwm created WikiForge (+1424, Creating a page for WikiForge to fix red links…): https://wiki.archiveteam.org/?title=WikiForge 02:13:44 Jwm uploaded File:WikiForge screenshot.png (Screenshot of [[WikiForge]] main page): https://wiki.archiveteam.org/?title=File%3AWikiForge%20screenshot.png 02:13:45 Jwm edited Template:Wikis (+21, Add [[WikiForge]]): https://wiki.archiveteam.org/?diff=50536&oldid=50009 02:13:46 Jwm edited WikiTeam (+198, /* Wikifarms */ Added WikiForge and added…): https://wiki.archiveteam.org/?diff=50537&oldid=50529 02:35:57 Looks like https://wiki.archiveteam.org/?diff=50526&oldid=50495 broke a few urls. 02:36:41 FireonLive edited Xuite (+2, In progress): https://wiki.archiveteam.org/?diff=50538&oldid=50508 02:36:42 FireonLive edited Current Projects (+0, It's "Xuite" that Xuite is running -- move it…): https://wiki.archiveteam.org/?diff=50539&oldid=50504 03:00:44 JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=50540&oldid=50526 05:28:17 Does anyone know of any Twitter file dumps or archival projects for Twitter? It's going crazy on the site again and Im here to look for anything comprehensive about Twitter, with tweets and media and all 05:41:25 I think #archivebot is/was okay for grabbing specific profiles/users but I don't personally know of any "let's grab everything!" projects 05:43:01 hmmm, is that so? I see 05:43:51 I don't necessarily need anything new, I'm hoping for dumps of older stuff to peruse if possible 05:44:09 But #archivebot sounds interesting, I'll see that as well, thank you 05:48:27 I don't think archivebot works well for twitter anymore - socialbot used to handle it, but hasn't worked for a while 05:48:37 The closest we have now is saving individual nitter instances... which doesn't work well 05:51:06 nitter.net works now, but the owner pleads people not to scrape it 06:29:33 oh that is unfortunate to hear 06:42:53 Man, I hope I can find something soon ;-; 08:48:55 hi - i'm suddenly getting an impassable captcha. firefox / safari / multiple IP loations 08:49:10 captcha just loops. 08:49:56 hi we aren’t archive.is/today/ph/etc 08:50:07 oh 08:50:09 could be the site is having issues 08:50:18 sorry! 08:50:23 all good :) 08:50:40 (y) (y) 08:51:17 it does happen occasionally to me too but usually just coming back later works 08:51:54 i seem to have hit hard stop. 08:52:40 if clearing cookies for the site doesn’t help try waiting a few hours but that’s all i know 08:52:54 im just some guy who uses it as is everyone else here :3 08:53:07 :-) it does work better than the others 08:53:15 or did! 08:53:23 appreciate the help. 08:53:27 have a good one. 08:54:08 you too! 08:54:17 :) 08:58:56 https://www.jornada.com.mx/2023/08/17/mundo/027n5mun 12:53:27 qwertyasdfuiopghjkl: Good catch, thanks. 12:53:38 JustAnotherArchivist edited List of websites excluded from the Wayback Machine (-6, Partially revert revision 50526 by…): https://wiki.archiveteam.org/?diff=50541&oldid=50540 13:00:39 JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=50542&oldid=50541 13:43:16 bruh my warrior has been going for half an hour constantly getting 404s 13:43:25 i think ive been fully ip blocked 13:44:09 LukeMax: What project are you running on? 13:45:47 skyblog 13:45:59 Please use the relevant project channel for project-specific questions. 13:46:15 bruh 13:46:32 ill do that in the future but can i get help on that now because its n 13:46:37 still not working 13:46:55 Not here. 13:47:11 We have project channels for a reason: to keep discussion related to a project in one place. 13:47:27 So ask in #bowlofpetunias instead. 16:13:10 X.com, powered by CPanel. https://elonsucks.org/@adam/110919365416183544 16:15:31 Beautiful website name 16:17:03 https//x.com:2083 now redirects to Twitter.com, but definitely was CPanel according to Bing 16:38:32 nulldata: The whole Twitter was hosted on a single GoDaddy server ?? https://web.archive.org/web/20230412141443/http://x.com:2083/ 17:11:43 🤨 y’all memers 17:30:27 @erkinalp is it an old website? 17:30:51 I could invite him still, speed up the process 17:30:54 erkinalp: any estimate of how many posts and topics it has? I'm not seeing that info on the main page 17:31:12 ... though I can guess at what "1 milyon Türkiye fotoğrafı" means 17:31:48 One million photographs of Turkey 17:32:00 qyxojzh|m: it's active since 2004, has about 200k topics, 1.5M+ photos, ~200k users, 17:32:15 Because Turkish photographs could be anything ig? 17:32:28 erkinalp: No wonder it's HTTP only 17:32:40 also high quality turkish transportation discussions 17:33:02 Hmm, that might be too big for archivebot... but I can try it at least 17:33:06 you could basically trace any iett updates faster than iett's own website publishes 17:33:30 pokechu22: a specialised bot for phpbb could help 17:34:01 That's true, JAA's qwarc would be better (assuming no rate-limiting) 17:34:02 erkinalp: Y'all should make a new website dedicated to this and get the WowTurkey userbase to migrate there, that'd be quite handy 17:34:22 erkinalp: you noted only high quality photos can be seen with account, but can the direct URL to them still be requested without account? 17:34:44 It looks like images are all hosted on-site, e.g. http://wowturkey.com/forum/viewtopic.php?p=9333400#9333400 uses http://wowturkey.com/t.php?p=/tr871/Abdullah4434_Galata_kulesi_Ayasofya2.jpg and http://wowturkey.com/tr871/k_Abdullah4434_Galata_kulesi_Ayasofya2.jpg. Not sure if a higher-resolution version is available? 17:34:45 erkinalp: if you're in contact with an admin - perhaps they can open up high quality to everyone without account? 17:34:46 qyxojzh|m: yeah a few wowturkey users including me are considering that 17:35:23 arkiver: if i know the specs of the server right, it wouldn't be able to cope with that much traffic 17:35:46 iirc it's a single debian box with about 300tb of hdd and 512gb of ram 17:35:51 erkinalp: alright, so we'll just get it without high res versions 17:35:57 sounds not too bad 17:36:24 we'll get a job started on archivebot for it, without account 17:37:47 ah 17:37:51 thread IDs are nicely sequential 17:38:08 JAA: is it possible for qwarc to make an easy/fast copy of wowturkey.com ? 17:41:25 erkinalp: job is running, fast site 17:44:39 What is http://wowturkey.com/forum/cevap_yazanlar.php?t=118684 ? 17:45:34 it's topic leaderboard 17:45:37 cevap yazanlar? 17:45:57 how many people replied to this thread, and how many posts each wrote 17:46:20 It requires being logged in, so probably it's reasonable to ignore it 17:47:13 well we could fake it afterwards 17:47:38 We might also want to ignore e.g. http://wowturkey.com/forum/rating.php?p=9198381 but that at least works when not logged in and has somewhat intersting info... and it looks like most posts have ratings on them too, so it's not getting an empty list for most of them (unlike on some forums) 17:47:42 fake what? 17:48:26 I think the point more is that the page doesn't give you information you couldn't get by looking at all the posts (not that we should re-create it to add it into the warc and on web.archive.org) 17:48:51 well one thing is for sure - nothing will be faked 17:49:05 Maybe we could create a few throwaway accounts? 17:49:21 pokechu22: ratings are actually useful, it's more than just like/dislike 17:49:24 i'm not a big fan of that 17:49:30 anyone have an example of a high quality image? 17:49:53 http://wowturkey.com/t.php?p=/tr869/Eray_Hasirci_DSCN4747_Karagol.jpg 17:50:05 http://wowturkey.com/tr869/Eray_Hasirci_DSCN4747_Karagol.jpg 17:50:23 (requires login to view) 17:50:36 ah yeah that gives me a 404 17:50:48 ah, and the thumbnail version that's public is http://wowturkey.com/tr869/k_Eray_Hasirci_DSCN4747_Karagol.jpg 17:50:52 well, it may be worth asking the admins if they can please open up images behind a login wall? 17:50:56 (which is what http://wowturkey.com/t.php?p=/tr869/Eray_Hasirci_DSCN4747_Karagol.jpg uses for me) 17:51:00 yeah 17:51:08 one of the positive ratings is "i'd sign that off" 17:51:20 ("altına imzamı atarım") 17:53:06 arkiver: as i mentioned, the server wouldn't be able to cope with that; they already disabled hi-res uploads for many users; just a few very aged and privileged users can upload now 17:53:41 alright we'll keep going as is without account 17:57:11 messages posted to threads *can* be edited, the edit window is 24 hours 17:57:30 non-mods can't delete their own messages 17:57:59 this is going to be important for continued archival efforts towards this forum 18:00:43 this is now being archived under the assumption it will go offline 18:01:02 it's not currently a long term archiving project 18:02:22 yeah, but if doesn't go offline soon enough, the incremental archive can be resumed from (today minus 24h) 18:06:33 arkiver: what's the progress now? 18:14:44 erkinalp - http://archivebot.com/3 18:15:01 08:15:18 AM -+rss- Kris Nova passed away: https://nivenly.org/blog/2023/08/19/an-announcement-regarding-kris-n%C3%B3va/ https://news.ycombinator.com/item?id=37199495 18:23:45 https://www.reuters.com/technology/adobes-co-founder-john-warnock-dies-82-2023-08-20/ 18:25:19 oh wowturkey archive is building up quite small without all those hires images 18:26:45 Taking a look at this now. 18:26:56 erkinalp: I assume all photos are linked from thread pages? 18:28:13 JAA: yes 18:28:30 a few featured ones are linked from the index pages 18:35:36 Some images are linked directly, others go through that t.php thing. I wonder why. 18:36:45 yeah the "featured images" thing 18:37:11 I'm talking about this: http://wowturkey.com/forum/viewtopic.php?t=12171&start=530 18:37:29 http://wowturkey.com/tr869/IMG_20230528_150548_36155349440.jpgthat's a low-res only 18:37:31 First two have links, the other three are directly embedded without a link. 18:37:35 Ah 18:37:48 low res is up to 430px by 430px 18:38:51 not all subforums accept hi-res uploads 18:40:40 Nano412510 edited URLTeam (+153, /* Alive */): https://wiki.archiveteam.org/?diff=50543&oldid=50421 19:10:45 https://twitter.com/jbeda/status/1693290822370787697 19:10:45 Exorcism edited Communpedia (-3): https://wiki.archiveteam.org/?diff=50544&oldid=35969 19:10:45 nitter: https://nitter.net/jbeda/status/1693290822370787697 19:14:17 qyxojzh|m: sadly not possible, admin disabled registrations about a week ago 19:28:10 JAA: is there anything functional for twitter archival nowadays? https://twitter.com/jbeda/status/1693290822370787697 19:28:11 nitter: https://nitter.net/jbeda/status/1693290822370787697 19:32:19 nicolas17: Nope :-/ 19:32:31 And yeah, I saw that earlier, already threw here other things into the appropriate channels. 19:33:45 what about twitch? Kris's last two stream VODs are still up (I downloaded them locally) 19:34:17 I threw it into #burnthetwitch (though that only archives metadata, not the VODs themselves). 19:34:30 if I just feed the .m3u8 and .ts URLs into archivebot, the result would have zero discoverability; idk how our twitch stuff works normally 19:39:11 nicolas17: they're not archival friendly; non-premium users get baked-in ads in their streams, muxed by twitch 19:39:36 isn't that in the live streams rather than the VODs? 19:39:55 VODs are opt in 19:39:58 I've never seen ads baked into VODs 19:40:01 not like youtube 19:40:23 the streamer opts into having non-clip VODs available 19:40:58 well yes 19:41:10 I'm not talking about cases where VODs aren't available to begin with :P 19:54:02 arkiver: if we were to assume it's going to shut down exactly in the anniversary, we have about a week in which we could do a second attempt 20:24:48 twitter deleted 10 years worth of media. It's already gone, beyond saving. Next week, tomorrow, or next hour they could delete another 10 years. 20:26:52 (according to what people are saying on the internet) 20:27:24 another 10 years makes it to the present 20:27:34 *would make 20:28:12 and the next thing, shut down entirely 20:29:41 corect 20:29:45 correct and corrrect as well 20:37:46 immibis: They didn't delete the media, they fucked up something on their URL shortener t.co, which broke links. (Yes, media use links, it's weird.) The media still exist and have partially started working again. 20:39:18 sadly i doubt there will be a public post mortem on that... too bad; would be neat to read 20:39:39 >a Musk company admitting error 20:39:40 lol 20:39:42 x3 20:40:54 :P 20:42:01 Gullah edited Deathwatch (+256, Added August 16th 2023 Anonfiles.com shutdown): https://wiki.archiveteam.org/?diff=50545&oldid=50530 21:05:01 ugh my warrior isnt connecting to the internet 21:05:04 Yts98 created Xuite 隨意窩 (+19, Redirected page to [[Xuite]]): https://wiki.archiveteam.org/?title=Xuite%20%E9%9A%A8%E6%84%8F%E7%AA%A9 21:05:13 it was working yesterday 21:05:23 and i did all the stuff on the wiki page 21:05:35 (im on virtualbox if you need that info) 21:07:30 what do i do 21:09:14 see #warrior - they'd probably know more 21:12:06 Pokechu22 edited Deathwatch (+273, /* Pining for the Fjords (Dying) */ various…): https://wiki.archiveteam.org/?diff=50547&oldid=50545 21:28:06 nobodys responding on #warrior 21:28:35 it's sunday people are touching grass 21:29:29 patience, young grasshopper 21:32:39 fair