00:10:12 arkiver, how's progress on uploadir looking? :) 00:11:11 currently checking the links myself & more than half of them are 404 07:34:41 (replying to https://hackint.logs.kiska.pw/archiveteam-bs/20221217#c333766 ) I currently don't have time to make an account on the wiki, could someone else add it please? 13:30:05 qwertyasdfuiopghjkl: done 14:18:17 schwarzkatz: coming up today 14:18:32 nice 14:19:55 rewby: can you please create a target for https://tracker.archiveteam.org/uploadir/ ? 14:19:59 this would be 14:20:03 archiveteam_uploadir_ 14:20:05 uploadir_ 14:20:09 Archive Team Uploader: 14:20:19 not huge, some 1.4 TB expected 14:20:27 schwarzkatz: see PM 14:22:51 I checked 265k out of 416k links and so far 208k are 404 14:23:22 so it probably is > 1 TB 14:25:16 *less than 1 TB 18:37:54 Looks like there may be incoming mass content removals by Twitter: https://twitter.com/TwitterSupport/status/1604531261791522817 18:38:27 "We recognize that many of our users are active on other social media platforms. However, we will no longer allow free promotion of certain social media platforms on Twitter. Specifically, we will remove accounts created solely for the purpose of promoting other social platforms and content that contains links or usernames for the following 18:38:27 platforms: Facebook, Instagram, Mastodon, Truth Social, Tribel, Nostr and Post." 18:40:35 From https://help.twitter.com/en/rules-and-policies/social-platforms-policy : What is a violation of this policy? At both the Tweet level and the account level, we will remove any free promotion of prohibited 3rd-party social media platforms, such as linking out (i.e. using URLs) to any of the below platforms on Twitter, or providing your handle 18:40:36 without a URL: Prohibited platforms: * Facebook, Instagram, Mastodon, Truth Social, Tribel, Post and Nostr * 3rd-party social media link aggregators such as linktr.ee, lnk.bio 18:43:29 What is wrong with them 18:43:46 Thanks for sharing 19:01:01 hello 19:03:10 Hello egallager, what can we do for you? 19:03:33 so I was trying to edit the article on Twitter on the ArchiveTeam wiki 19:04:01 qwertyasdfuiopghjkl: that ban wave sadly already happened 2 days ago. 19:04:01 and apparently I'm still stuck with editing an old version of it, because I have a previous submission that hasn't been approved yet? 19:04:37 "You are editing your version of this article. It is currently awaiting moderation.Once approved, it will be visible to other users.For any questions, Please see the IRC channel #archiveteam-bs (on hackint) for more details" 19:04:54 pcr: I think this is about a new one 19:10:07 egallager: Well hopefully bringing it up here will get someone who can approve/reject it (I'm not included in that group) to get to it faster 19:10:32 yeah... there might be merge conflicts by this point now, though... 19:14:12 Schwarzkatz edited Deathwatch (+343, added forums.furaffinity.net): https://wiki.archiveteam.org/?diff=49248&oldid=49242 19:44:06 qwertyasdfuiopghjkl: thought you were talking specifically about @JoinMastodon being banned, you are probably right that there will be more. 19:57:08 qwertyasdfuiopghjkl: probably better to ask in here instead of #archivebot, it is really spammy there. I think it would be a good idea to get a list of *all* the linktree clones and use the search to find tweets/profiles to archive 19:59:25 ok, copied from #archivebot : Would it be possible to archive all tweets that mention linktr.ee or lnk.bio since those sites are specifically mentioned as being banned? 20:08:58 The policy also says that if a violation is found in the username or bio they will ban the whole account until it is removed (which is effectively a permaban if the user has died/lost the password/stopped using Twitter/etc.), so such accounts should also be archived (but that would probably require a more complicated/larger project) 20:11:20 The search does find matches in usernames, but not partial ones. Searching for 'textfil' won't find @textfiles, for example. Searching bios or profile links isn't a thing, I believe. 20:12:36 The search also finds word matches in display names. 20:27:49 JAA: https://twitter.com/search?f=user&q=%22i+draw+the+comic%22 seems to find https://twitter.com/xkcd by the text in the bio for me, for example? (or is that something different?) 21:14:26 qwertyasdfuiopghjkl: Hmm, indeed, last time I tried that, it didn't work. 21:14:38 Doesn't match on links though, whether in the bio or as the dedicated link. 21:15:06 Or rather, you have to search for the t.co URL, but that's impractical: https://twitter.com/search?q=t.co%2FsdyjXHCZF7&f=user 21:15:52 And even then, that only matches links in the bio, not the profile link: https://twitter.com/search?q=t.co%2FgADSbeGBoi&f=user 21:16:06 (That should find @xkcd otherwise.) 21:35:34 Arkiver uploaded File:Uploadir-icon.png: https://wiki.archiveteam.org/?title=File%3AUploadir-icon.png 22:05:42 I compiled a list of linktree clones, there are probably hundreds more out there. You can find it here: https://transfer.archivete.am/rdYeu/linktree-clones.md 22:15:25 JAA: can you please trigger building uploadir-grab? 22:15:34 we're starting when a target is up 22:19:20 One not in that list: https://band.link/ (example link: https://band.link/counttothree ) 22:21:17 Twitter banned Paul Graham, likely using some variation of the new policy 22:21:39 qwertyasdfuiopghjkl: feel free to edit the file with your additions and post it here :) 22:21:48 tech234a: after his tweet about leaving twitter? 22:21:50 was that the reason? 22:22:00 it appears to be 22:22:01 https://news.ycombinator.com/item?id=34044047 22:22:07 some related discussion ^ 22:22:08 arkiver: Building 22:22:10 mastodon mention? 22:22:11 pff 22:22:28 not even a direct link, apparently people said he just said it was on his website 22:23:23 thanks JAA 22:23:47 recent SPN capture https://web.archive.org/web/20221218215229/https://twitter.com/paulg/ 22:25:52 schwarzkatz: I don't really have any more to add at the moment, band.link was just one I remembered seeing before and noticed wasn't listed 22:29:03 nuts 22:42:09 arkiver: https://twitter.com/TwitterSupport/status/1604531261791522817 https://help.twitter.com/en/rules-and-policies/social-platforms-policy 23:53:39 how do I tell if and when AB was last run on a site? 23:56:05 You can check the viewer, although it's not 100% reliable. Other than that, grepping IRC logs. 23:56:07 (including in-progress, recently completed and older) 23:56:29 Yeah, grepping logs is the only way that covers all of them. 23:57:59 ok. my logs don't go very far back, can someone grep theirs for faif.us? 23:58:18 No match in mine. 23:59:01 And not listed in the viewer, so almost certainly hasn't been run before. 23:59:52 ok, I'll run it today then