-
datechnoman
-
h2ibotdatechnoman: Skipped 1 invalid URLs: transfer.archivete.am/HFcoF/discord_urls.txt.bad-urls.txt (for 'transfer.archivete.am/9bOyy/discord_urls.txt')
-
h2ibotdatechnoman: Deduplicating and queuing 223919 items. (for 'transfer.archivete.am/9bOyy/discord_urls.txt')
-
h2ibotdatechnoman: Deduplicated and queued 223919 items. (for 'transfer.archivete.am/9bOyy/discord_urls.txt')
-
arkiveryeah
-
arkiverlooking into it
-
[42]BornOn420_: you mentioned electrowoz.pl - did hetzner tell you it was electrowoz? i got an abuse report today mentioning sitemaps but there's no indication of a domain, but the log timestamps are from the same day as your messages
-
DLoaderI got both, for this second one today they didn't tell the domain, so no idea
-
[42]i've asked them if they have more information, specifically the domain
-
arkiverthe ?ver= stuff should be handled now
-
BornOn420_[42]: yes they did
-
BornOn420_'I kindly request you to stop your client from automatically downloading tag pages from Elektrowoz.pl. '
-
fireonlivehm, tag pages only
-
BornOn420_'Generating tag pages is a task that puts a heavy load on the CPU and database, in the long term or on a larger scale (automated reading of dozens or hundreds of pages) it is a DDoS attack.'
-
BornOn420_'Today the action was carried out in parallel from many different machines, which WAS a DDoS attack.'
-
BornOn420_and then there's ---- and generic boilerplate text
-
fireonlivenot sure if it's fair to call it a 'DDoS' attack if the intent wasn't there but ¯\_(ツ)_/¯
-
datechnomanAgreed. Not our fault the website doesnt scale or the code isnt efficient to handle a few queries at once....
-
datechnomanIt might seem malicious to them but it is accessing what they have published on the internet :/
-
nicolas17yeah I wouldn't call it DDoS
-
nicolas17but can we rate limit those URLs specifically?
-
BornOn420_You mean like anything with sitemap? URLs are shaped like this:
-
BornOn420_
-
BornOn420_
-
BornOn420_I looked at one of those sitemaps in the Wayback machine and it looks like we're indexing them about 6 times a year
-
BornOn420_And since the maps are generated by a WP plugin this might not be the only site we're over-asking
-
nicolas17are the sitemaps what's causing them CPU load, or the /tag/ pages listed in them?
-
BornOn420_I mainly find the sitemaps in my logs, so I _guess_ it's the sitemaps.
-
BornOn420_For some reason they didn't send a full server log with their abuse request :)
-
BornOn420_nicolas17: This are all the elektrowoz items I found in my logs when the complaint came in: transfer.archivete.am/eG7n6/elektrowoz.txt
-
eggdropinline (for browser viewing): transfer.archivete.am/inline/eG7n6/elektrowoz.txt
-
fireonliveinteresting those are made on the fly