-
Ryz
Has a PR been made yet?
-
Ryz
Alternatively, how about this; I continue to try find websites that haven't been covered in #// via Wikidata, to both benefit here and Wikidata, and if I find a edge case, you folks have to solve it instead~
-
fireonlive
JAA: looks like some of them work, just not the 'Latest Press Releases'
-
fireonlive
some are 503 though
-
JAA
Ah, didn't check the categories, yeah.
-
fireonlive
weird they'd kill all though
-
JAA
Hanlon's razor?
-
fireonlive
:)
-
AK
Seeing rsync errors at the mo: `@ERROR: max connections (-1) reached -- try again later`
-
AK
Not sure who's targets these are running on, are they yours rewby?
-
AK
-
rewby|backup
Ah yeah. Might be a bit busy
-
imer
-
imer
eeeeee :)
-
imer
we're due to "run out" soon as well :)
-
datechnoman
Still 80 million in out. Plenty to go
-
datechnoman
Massive stashes are ready also
-
datechnoman
Plus we have urls from #mataringa
-
datechnoman
We wont ever run out ;)
-
michaelblob
some things never change :)
-
imer
hence the quotes haha
-
datechnoman
The blogger outlinks stash has over 8.30 Billion Urls xD
-
imer
mostly dupes i'm sure :D
-
imer
but yeah, lotsa work to get through
-
imer
not enough money for hardware and power :(
-
datechnoman
Everything is de-duped on that list :P
-
imer
dupes to this tracker of course!
-
imer
JAA: can we get a filter on airbest.com? looks to be spam and is super slow
transfer.archivete.am/BZSzj/2024-03-19_23-27-11.txt ~15req/s here
-
eggdrop
-
imer
pages have been blank for me once they load after like 20s
-
imer
oh, just got an error message leaking their db credentials (to localhost), thats funny
-
imer
Fatal error: Uncaught exception 'Suco_Db_Exception' with message '无法连接数据库服务器. [No such file or directory]' in /repo/_master/librarys/Suco/Db/Adapter/Mysql.php:177 Stack trace: #0 /repo/_master/librarys/Suco/Db.php(65): Suco_Db_Adapter_Mysql->connect('localhost', '3306', '*CENSORED', '*CENSORED*', false)
-
imer
hope thats fine to post :_
-
JAA
*facepalm*
-
JAA
I bet that made it into the archives a few times, too.
-
imer
mh, their main site seems legit
-
imer
not sure what's with the subdomains though
-
nstrom|m
yeah the 80mil reclaims should keep workers busy for a while, no worries here
-
JAA
Yeah, those [a-z0-9]*\d{2}www\.airbest\.com look suspicious.
-
fireonlive
A+ website security
-
imer
JAA:
transfer.archivete.am/7bFrA/2024-03-19_23-42-44.txt www.dragonline86.com looks spammy too. just typing in random garbage in the urls seems to return "content" of sorts, recursive queuing from the looks of it
-
eggdrop
-
imer
airbest seems to be calming down, not sure if we just worked through urls or if you filtered
-
JAA
Didn't filter, was looking at it and then got sidetracked.
-
imer
no worries :)
-
JAA
Looks like ~3% of the queue are airbest.com and ~16% are dragonline86.com.
-
JAA
Of todo:backfeed, that is.
-
JAA
Some of the latter appears to be a mirror of
casino.guru/news or something like that.
-
JAA
Other parts match
bonusmaniac.com
-
JAA
And other sites, too.
-
JAA
Weird spam site, I'll yeet it.
-
imer
thanks :)
-
JAA
^https?://www%.dragonline86%.com/ added
-
JAA
Filter rate 20%, which matches the number I saw just before adding the filter.
-
imer
had been growing quite a bit 5/s -> 10/s in the past 5min or so, guess that makes sense if every page queues a few more
-
JAA
airbest.com continues to drop, so I think I'll just leave it for now.
-
imer
yep, seems fine - seems fast as well again