04:17:10 arkiver Request to blackhole the follow URL's on the tracker due to 403 errors for all URL's trying to be archived on "http://epicvin.com/*****" 04:17:44 eg; 8=403 http://epicvin.com/check-vin-number-and-get-the-vehicle-history-report/checkout/5n1az2ms2kn110860 04:18:41 Also hitting thousands of URL's on the domain "http://www.rosarioalerta.com" which gives only Server returned 0 (CONERROR) errors 04:18:51 eg; 1=0 http://www.rosarioalerta.com/img/?id=jQJxG 04:20:22 Id say its blocking our IP's so we are just wasting our time on that 10:54:58 thanks datechnoman looking into that 10:56:31 rosario is filtered out 10:56:42 no worries mate. Chewing through URL's atm but that should speed it up a lot more. Half the logs in seeing Server returned 0 (CONERROR) errors results 10:57:15 thanks for running this project :) 10:57:21 we got into a bit of a backlog 10:58:31 nice 60% filter rate now 10:59:09 Sure thing. I had to drop off for a month as life has been busy with a new baby and had to change my priorities but hopefully should be back more often again :) 10:59:21 Slurping URL's up lol 11:04:31 Thanks for filtering those out. Im heading to bed but if I see anymore ill ping you. Much more traffic flowing now 11:05:38 new baby!! 11:05:43 congratulations! 11:05:49 Thanks so much. First one :) 11:06:03 awesome! very nice 11:06:23 i hope you'll still be able to get enough sleep at night 11:06:34 (no baby here, but I heard things :P) 11:08:59 !a https://transfer.archivete.am/Ffk0C/twitter-big-scrapes-batch4-outlinks.zst 11:09:04 !a https://transfer.archivete.am/Ffk0C/twitter-big-scrapes-batch4-outlinks 11:09:09 (whoops) 11:11:48 arkiver: Skipped 3430104 bad URLs: https://transfer.archivete.am/13o95O/twitter-big-scrapes-batch4-outlinks.zst.bad-urls.txt 11:12:02 yeah, it doesn't handle compressed data 11:12:27 arkiver: Fixed 2737001 unprintable URLs: https://transfer.archivete.am/wHwwf/twitter-big-scrapes-batch4-outlinks.zst.not-printable.txt 11:12:28 arkiver: Deduplicating and queuing 0 items. 11:12:29 arkiver: Deduplicated and queued 0 items. 11:14:59 arkiver: Skipped 431 bad URLs: https://transfer.archivete.am/15PZpn/twitter-big-scrapes-batch4-outlinks.bad-urls.txt 11:15:19 arkiver: Deduplicating and queuing 7926111 items. 11:23:49 arkiver: Deduplicated and queued 7926111 items. 11:40:49 This little one has/is amazing and sleeps all the way through the night which is pretty much unheard of. Let's hope he keeps it up! 19:13:59 !ignore 4hlsa83cwub4ozjqxjxk8iy1g ^https?://www\.byzcath\.org/forums/ubbthreads\.php/ubb/showday/day/\d+/month/\d+/year/\d{3}$ 19:14:00 Oops 23:44:36 !a https://transfer.archivete.am/12WDva/discord-Fosshost 23:44:41 TheTechRobo: Skipped 21 bad URLs: https://transfer.archivete.am/VSSip/discord-Fosshost.bad-urls.txt 23:44:43 TheTechRobo: Fixed 2 unprintable URLs: https://transfer.archivete.am/SFBn/discord-Fosshost.not-printable.txt 23:44:44 TheTechRobo: Deduplicating and queuing 8846 items. 23:44:45 TheTechRobo: Deduplicated and queued 8846 items. 23:45:10 qwertyasdfuiopghjkl: ^