-
schwarzkatz
arkiver, how's progress on uploadir looking? :)
-
schwarzkatz
currently checking the links myself & more than half of them are 404
-
qwertyasdfuiopghjkl
(replying to
hackint.logs.kiska.pw/archiveteam-bs/20221217#c333766 ) I currently don't have time to make an account on the wiki, could someone else add it please?
-
schwarzkatz
qwertyasdfuiopghjkl: done
-
arkiver
schwarzkatz: coming up today
-
schwarzkatz
nice
-
arkiver
rewby: can you please create a target for
tracker.archiveteam.org/uploadir ?
-
arkiver
this would be
-
arkiver
archiveteam_uploadir_
-
arkiver
uploadir_
-
arkiver
Archive Team Uploader:
-
arkiver
not huge, some 1.4 TB expected
-
arkiver
schwarzkatz: see PM
-
schwarzkatz
I checked 265k out of 416k links and so far 208k are 404
-
schwarzkatz
so it probably is > 1 TB
-
schwarzkatz
*less than 1 TB
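
(Editorial sketch, not part of the log.) The arithmetic behind that correction can be written out as below. The three counts come from the messages above; the assumption that the unchecked links are dead at the same rate as the checked ones is ours, and no size estimate is derived because the log does not say how the 1.4 TB figure was computed.

```python
# Counts quoted in the chat; the extrapolation below is an assumption.
checked = 265_000  # links checked so far
dead = 208_000     # of those, links that returned 404
total = 416_000    # all known links

dead_fraction = dead / checked

# Assumption: unchecked links are dead at the same rate as checked ones.
projected_live = total * (1 - dead_fraction)

print(f"dead so far: {dead_fraction:.1%}")
print(f"projected live links: {projected_live:,.0f}")
```

If the dead rate holds for the remaining links, only a bit over a fifth of them would still need to be fetched.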
-
qwertyasdfuiopghjkl
Looks like there may be incoming mass content removals by Twitter:
twitter.com/TwitterSupport/status/1604531261791522817
-
qwertyasdfuiopghjkl
"We recognize that many of our users are active on other social media platforms. However, we will no longer allow free promotion of certain social media platforms on Twitter. Specifically, we will remove accounts created solely for the purpose of promoting other social platforms and content that contains links or usernames for the following
-
qwertyasdfuiopghjkl
platforms: Facebook, Instagram, Mastodon, Truth Social, Tribel, Nostr and Post."
-
qwertyasdfuiopghjkl
From
help.twitter.com/en/rules-and-policies/social-platforms-policy : What is a violation of this policy? At both the Tweet level and the account level, we will remove any free promotion of prohibited 3rd-party social media platforms, such as linking out (i.e. using URLs) to any of the below platforms on Twitter, or providing your handle
-
qwertyasdfuiopghjkl
without a URL: Prohibited platforms: * Facebook, Instagram, Mastodon, Truth Social, Tribel, Post and Nostr * 3rd-party social media link aggregators such as linktr.ee, lnk.bio
-
schwarzkatz
What is wrong with them
-
schwarzkatz
Thanks for sharing
-
egallager
hello
-
OrIdow6
Hello egallager, what can we do for you?
-
egallager
so I was trying to edit the article on Twitter on the ArchiveTeam wiki
-
pcr
qwertyasdfuiopghjkl: that ban wave sadly already happened 2 days ago.
-
egallager
and apparently I'm still stuck with editing an old version of it, because I have a previous submission that hasn't been approved yet?
-
egallager
"You are editing your version of this article. It is currently awaiting moderation. Once approved, it will be visible to other users. For any questions, Please see the IRC channel #archiveteam-bs (on hackint) for more details"
-
qwertyasdfuiopghjkl
pcr: I think this is about a new one
-
OrIdow6
egallager: Well hopefully bringing it up here will get someone who can approve/reject it (I'm not included in that group) to get to it faster
-
egallager
yeah... there might be merge conflicts by this point now, though...
-
h2ibot
Schwarzkatz edited Deathwatch (+343, added forums.furaffinity.net):
wiki.archiveteam.org/?diff=49248&oldid=49242
-
pcr
qwertyasdfuiopghjkl: thought you were talking specifically about @JoinMastodon being banned, you are probably right that there will be more.
-
schwarzkatz
qwertyasdfuiopghjkl: probably better to ask in here instead of #archivebot, it is really spammy there. I think it would be a good idea to get a list of *all* the linktree clones and use the search to find tweets/profiles to archive
-
qwertyasdfuiopghjkl
ok, copied from #archivebot : Would it be possible to archive all tweets that mention linktr.ee or lnk.bio since those sites are specifically mentioned as being banned?
-
qwertyasdfuiopghjkl
The policy also says that if a violation is found in the username or bio they will ban the whole account until it is removed (which is effectively a permaban if the user has died/lost the password/stopped using Twitter/etc.), so such accounts should also be archived (but that would probably require a more complicated/larger project)
-
JAA
The search does find matches in usernames, but not partial ones. Searching for 'textfil' won't find @textfiles, for example. Searching bios or profile links isn't a thing, I believe.
-
JAA
The search also finds word matches in display names.
-
qwertyasdfuiopghjkl
JAA:
twitter.com/search?f=user&q=%22i+draw+the+comic%22 seems to find
twitter.com/xkcd by the text in the bio for me, for example? (or is that something different?)
-
JAA
qwertyasdfuiopghjkl: Hmm, indeed, last time I tried that, it didn't work.
-
JAA
Doesn't match on links though, whether in the bio or as the dedicated link.
-
JAA
Or rather, you have to search for the t.co URL, but that's impractical:
twitter.com/search?q=t.co%2FsdyjXHCZF7&f=user
-
JAA
And even then, that only matches links in the bio, not the profile link:
twitter.com/search?q=t.co%2FgADSbeGBoi&f=user
-
JAA
(That should find @xkcd otherwise.)
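
(Editorial sketch, not part of the log.) The search links quoted above are plain query strings, so they can be generated for many search terms at once. `user_search_url` is a hypothetical helper name; the `f=user` and `q` parameters are taken from the URLs in the messages, and nothing here is an official Twitter API.

```python
from urllib.parse import urlencode


def user_search_url(query: str) -> str:
    """Hypothetical helper: build a Twitter user-search URL for `query`.

    Mirrors the shape of the links pasted in the chat (f=user plus q).
    """
    return "https://twitter.com/search?" + urlencode({"f": "user", "q": query})


# Phrase search in bios, and the t.co form JAA mentions for links.
print(user_search_url('"i draw the comic"'))
print(user_search_url("t.co/sdyjXHCZF7"))
```

`urlencode` percent-encodes the quotes and the slash and turns spaces into `+`, which matches the encoding seen in the pasted links.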
-
schwarzkatz
I compiled a list of linktree clones, there are probably hundreds more out there. You can find it here:
transfer.archivete.am/rdYeu/linktree-clones.md
-
arkiver
JAA: can you please trigger building uploadir-grab?
-
arkiver
we're starting when a target is up
-
qwertyasdfuiopghjkl
One not in that list:
band.link (example link:
band.link/counttothree )
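
(Editorial sketch, not part of the log.) Once a list of link-aggregator domains exists, checking whether a given URL points at one of them is a simple host comparison. The three domains below are the ones named in this conversation; the set name and function name are hypothetical, and the full contents of linktree-clones.md are not reproduced here.

```python
from urllib.parse import urlsplit

# Only the domains mentioned in the chat; a real run would load the full list.
AGGREGATOR_DOMAINS = {"linktr.ee", "lnk.bio", "band.link"}


def is_aggregator_link(url: str) -> bool:
    """Return True if `url` points at a listed domain or one of its subdomains."""
    # Links in the log are written without a scheme, so add a
    # network-location prefix to make urlsplit parse the host.
    if "://" not in url:
        url = "//" + url
    host = (urlsplit(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in AGGREGATOR_DOMAINS)


print(is_aggregator_link("band.link/counttothree"))
print(is_aggregator_link("twitter.com/xkcd"))
```

Matching on the exact host or a dot-prefixed suffix avoids false positives from unrelated domains that merely end in the same characters.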
-
tech234a
Twitter banned Paul Graham, likely using some variation of the new policy
-
schwarzkatz
qwertyasdfuiopghjkl: feel free to edit the file with your additions and post it here :)
-
arkiver
tech234a: after his tweet about leaving twitter?
-
arkiver
was that the reason?
-
tech234a
it appears to be
-
tech234a
-
tech234a
some related discussion ^
-
JAA
arkiver: Building
-
arkiver
mastodon mention?
-
arkiver
pff
-
tech234a
not even a direct link, apparently people said he just said it was on his website
-
arkiver
thanks JAA
-
qwertyasdfuiopghjkl
schwarzkatz: I don't really have any more to add at the moment, band.link was just one I remembered seeing before and noticed wasn't listed
-
arkiver
nuts
-
pabs
how do I tell if and when AB was last run on a site?
-
JAA
You can check the viewer, although it's not 100% reliable. Other than that, grepping IRC logs.
-
pabs
(including in-progress, recently completed and older)
-
JAA
Yeah, grepping logs is the only way that covers all of them.
-
pabs
ok. my logs don't go very far back, can someone grep theirs for faif.us?
-
JAA
No match in mine.
-
JAA
And not listed in the viewer, so almost certainly hasn't been run before.
-
pabs
ok, I'll run it today then
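
(Editorial sketch, not part of the log.) "Grepping IRC logs" for a site amounts to a case-insensitive substring search over saved log lines. The helper name `grep_logs` and the sample lines are made up for illustration; real logs would be read from wherever a client stores them, in whatever format it uses.

```python
from typing import Iterable, Iterator, Tuple


def grep_logs(lines: Iterable[str], needle: str) -> Iterator[Tuple[int, str]]:
    """Yield (line number, line) for each line containing `needle`, ignoring case."""
    needle = needle.lower()
    for number, line in enumerate(lines, start=1):
        if needle in line.lower():
            yield number, line.rstrip("\n")


# Made-up log lines standing in for a real log file.
sample = [
    "<someone> please archive https://example.org/\n",
    "<bot> job queued\n",
]

for number, line in grep_logs(sample, "example.org"):
    print(number, line)
```

To search a file instead, pass an open file object as `lines`; an empty result corresponds to JAA's "No match in mine."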