01:23:27 After resistance from Elon Musk, they have now agreed to buy Twitter for the original stated price of 44 billion dollars: https://www.wsj.com/articles/elon-musk-proposes-closing-twitter-deal-on-original-terms-11664901454 01:29:21 Any potential of archiving Twitter? Not just the latest goods but from the past? 01:44:44 Hola all. I'm trying to reach someone who holds a copy of the FilePlanet /ftp2/ contents. I'm building an archive of Q3 levels and am hpping someone can run a search over the stash to list out any .pk3 files it contains. 01:47:09 On a related note if any data hoarders would like a ~60Gb stash of Q3 levels, there's a torrent up right now with everything I've managed to find so far =) 02:06:43 Any potential of archiving Twitter? Not just the latest goods but from the past? 02:07:16 I did think of an implementation awhile ago, but not sure if it's AT-ready and it's also resource intensive. 02:07:57 Basically, there'd be hashtag: items, search: items, user: items, etc. user: items would use both the search method of scraping and the user profile method (to miss as few tweets as possible). Then they'd queue t: items, which are individual tweets. 02:08:22 Foo_Q3W: I believe this is everything we were allowed to share from ftp2. https://archive.org/details/Fileplanet_ftp2_FILES_FROM_PUBLIC_ID_GRAB 02:08:36 search: and hashtag: and user: items queue t: items, and t: items queue hashtag: and user: and t: items (from the tweet contents, parent tweet, replies, mentions, etc) 02:08:42 it'd use a lot of memory with redis though 02:08:51 and the backfeed filter 02:09:35 Ah thanks Jake. I have that on download currently to sift through. I was hoping someone with full access would be able to do an offline search for *.PK3, then let me know what's a there so I can identify any lost levels. I figure that'd be a privacy-respecting way of handling it, since PK3s were only packaged game levels. 02:09:57 However I'm hopeful that the public id grab content is going to turn up some lost stuff, so bonus either way! 02:10:27 I believe that's spirit: ^ 02:10:57 Oh Spirit is in here, sweet. I pinged him on email a lil while back but no dice, figured must be busy =) 02:15:32 TheTechRobo, I hope it runs sooner or later, ideally sooner :c 03:23:31 Oh, hello there, all who archive for humanity's sake. 03:23:43 How's it all going this week? 03:28:58 good! 03:35:12 Have you all been versed in the recent Fandom/Wikia acquisition as of now? 03:36:14 Amongst those gobbled up were GameFAQs and Metacritic, both of them holding quite some important video game history and info, especially for the obscure ones. 03:49:37 yep, we've heard. no concrete plans at the moment (we've discussed gamefaqs a couple of times), but we're keeping an eye out for operational changes, so ping us if you see or hear of anything specific 05:42:26 OrIdow6 uploaded File:Yahoo Groups provenance.png (State of information from Yahoo Groups as of…): https://wiki.archiveteam.org/?title=File%3AYahoo%20Groups%20provenance.png 05:53:28 OrIdow6 edited Yahoo! Groups (+13680, Adding this in): https://wiki.archiveteam.org/?diff=49057&oldid=49033 06:38:57 Foo_Q3W: sorry, i am very good at replying to some mails after weeks or months =) 06:39:02 will mail to you tomorrow or so 06:39:30 gist is sadly "no", those pk3 files might be private betas or other files not meant for the public 06:39:39 hehe, no worries. Thanks for the update 06:40:27 the only way i can give out files from that part of the archive is from publically archived proof of public availability, e.g. fileplanet urls archived on some pages in the wayback machine or the live website 06:40:29 Am I right in thinking that if I can pull dl.fileplanet links, that can be tied back to files? 06:40:34 yeah! 06:40:37 Ah, yep, we're on the same page. Got it. 06:40:40 I'll work up that list. 06:41:10 ty ty 06:46:00 :)) 06:46:32 with online sources please, just URLs can be forged and bruteforced 08:31:13 https://blog.archive.org/2022/10/04/internet-archive-seeks-donations-of-materials-to-build-a-digital-library-of-amateur-radio-and-communications/ https://news.ycombinator.com/item?id=33089535 08:31:32 some possible links to archive in the HN thread 14:17:07 Ok, trying grab-site on this annoying cloudflare-protected forum. Is there a way to make grab-site use cookies from a regular firefox browsing session ? I.e. do the clownflare "check" in a regular browser, then start a crawl with whatever cookies it generated 14:17:21 (https://forum.tek.com) 14:18:45 nvm, I had missed the instructions in the Readme 14:25:09 aaand it doesn't work 14:27:11 fenugrec: i am no expert on grab-site, so can't answer that, but I do wonder - is that one shutting down? 14:28:52 https://forum.tek.com/viewtopic.php?f=583&t=143177 says "If you're a frequent visitor of this forum, please be aware that the forum is moving and will be decommissioned before the end of the year." 14:29:34 arkiver, yes, it's scheduled to be "decomissioned", as qwerty posted above 16:20:19 Hi 16:21:38 I am wondering if there is any script made for auto-starting the archivebot service 16:21:44 s 20:52:47 no 20:53:23 there is for grab-site though. 21:43:23 Kaz: look like that grab-site is very user-friendly ;) 21:44:52 It's almost as if that's what you should use... ;-) 21:46:23 JAA: The problem is that.. I need the IRC bot and pipeline system. 21:47:55 Anyways.. I made a simple bash script to start all of archivebot's services. 21:52:27 I can't remember if I asked this before, but why do you need a IRC bot or a pipeline? 21:55:02 Jake: Pipeline will be useful for anyone (who has a lot of space/bandwidth) wants to help me in archiving project 21:56:58 irc bot make it easier for a group of people to manage the archiving process 22:00:44 I.... guess? but you've also spent like a month trying to get it setup. seems way easier to me to go with something like grab-site or something similar? 22:04:43 Yeah, it took me some time to set it up. But, in my opinion... that is easier than creating a full bot for a grab-site and a pipeline-like system.  22:07:56 also, I upload the finished warc to my project's ftp, so... uploader.py is so useful for me... 22:23:26 a tech person arrested in Iran: https://twitter.com/jadi https://jadi.net https://youtube.com/c/JadiMirmirani https://fosstodon.org/@Mehrad/109117086408879957