-
nstrom
not sure if it counts as a loop but I'm seeing some significant crawling of some vietnamese lookalike SEO spam sites for some online casino, all using randomly generated subdomains. examples
hgy22.churchofisolation.net 1qg24.therealestateintucson.com
-
nstrom
ersiering.net/
-
nstrom
-
nstrom
viewing source for any of those shows a bunch of links to other randomly generated subdomains at each domain, that change each time the page is loaded
-
datechnoman
Great pickup. arkiver can we get those filtered out please?
-
nstrom
-
nstrom
cheers
-
datechnoman
Only hard part is that they have nothing in commen except for the .net so will be hard to filter out :(
-
arkiver
yeah this had been going on since yesterday
-
arkiver
annoying casino website spam loop
-
arkiver
has*
-
arkiver
they don't even have .net in common
-
arkiver
some have .info
-
arkiver
or .com even i elieve
-
arkiver
believe*
-
arkiver
i have a plan, but it'll require some work
-
Sanqui
I've been collecting quite a few forums recently, would there be interest in adding "recent posts" pages and rss feeds from those to this project?
-
nyany
datechnoman: F for you my friend
-
nyany
Hetzner was pissed with me because I'm a "Brand new customer" to them and I generated an abuse notice and several Spamhaus listings within a week of account ownership
-
Sanqui
!ig 8of2baxzumf11k98412mrflil /(showthread\.php\?.*(&p=\d+|&mode=(threaded|hybrid))|search\.php)
-
Sanqui
wrong chat apologies
-
nyany
Don't run this project over wifi lol
-
nyany
"No HTTP response received from tracker" yeah, because I'm saturating the life out of this line
-
nyany
lol those spam domains are great. churchofisolation, reptilekeepers..
-
arkiver
fix is coming up
-
datechnoman
nyany I think the spamhaus listings ended up getting me locked for the month as ive never had an issue with any other abuse notices
-
datechnoman
Hopefully they dont do the same thing to you!
-
nyany
datechnoman: They didn't; I was able to remove my listings
-
nyany
Basically explained that the IP addresses were running a copy of the "URLs" ArchiveTeam project, linked to the wiki
-
nyany
Spamhaus were happy to remove it
-
datechnoman
Nice. Sounds like you dodged a bullet
-
nyany
Might also help that I'm associated with DroneBL
-
nyany
lol
-
arkiver
an update is in
-
arkiver
let's see how it goes
-
arkiver
if this works that'd be great
-
arkiver
then we also have a handy new method of removing difficult stuff in the future
-
datechnoman
ohhh that is great to hear :D
-
arkiver
:)
-
datechnoman
Fingers crossed!
-
arkiver
yeah!
-
arkiver
if it works, I'll also start blocking the annoying loop from some days ago with this
-
arkiver
the solution is implemented back then has a higher chance of losing 'good' URLs than the new method
-
datechnoman
Code improvements is always welcomed :)
-
arkiver
well old loop is gone
-
arkiver
new one is in
-
arkiver
this time with a ton of /news/ URLs :P
-
datechnoman
hehehe. Nom nom nom
-
datechnoman
So backlog will increase alot again :P
-
arkiver
not a lot I suspect
-
arkiver
I think this is a left over of the loop over a few days (a week?) ago that was not completely fixed
-
arkiver
i don't see any signs anymore of the one that was supposedly just fixed!
-
arkiver
blegh IA is down. can't check the CDX
-
arkiver
(power issue)
-
datechnoman
:( always when you need it haha
-
arkiver
yep
-
datechnoman
Joys of not running out of DC's
-
datechnoman
Put cost is a killer
-
arkiver
hopefully fixed in a few minutes