13:10:07 arkiver: did you ever put the common crawl pdf list into #// or is that waiting until we "run out"? 13:11:35 imer: I'd like to wait until the queue is down 13:11:41 which should not take too long i think 13:12:02 ack, just checking :) 21:59:26 https://transfer.archivete.am/Rwhvg/2024-01-26_21-57-04.txt gotta love redirect loops that just add a slash :D 21:59:27 inline (for browser viewing): https://transfer.archivete.am/inline/Rwhvg/2024-01-26_21-57-04.txt 22:00:31 lordy lol 23:11:51 been seeing *.xuite.net persistantly for a while here now (which is dead), not sure if that's worth filtering out? 23:12:57 32=0 https://%D0%BC%D0%BE%D0%B6%D0%B3%D0%B8%D0%BD%D1%81%D0%BA%D0%B8%D0%B5-%D0%B2%D0%B5%D1%81%D1%82%D0%B8.%D1%80%D1%84/ unsure if thats how that works :D 23:14:05 whoops lol 23:16:26 At least it's not Punycode! 23:17:37 JAA: oh go on then, punycode is weird, should they just have forced utf8 for domain names? 23:17:47 sounds like you have an opinion here 23:20:00 I just know I don't like Punycode. :-) 23:20:17 haha, fair 23:20:17 Something like it was probably unavoidable to not break legacy DNS software. 23:20:37 Even though the DNS protocol could certainly support raw UTF-8. 23:23:41 https://transfer.archivete.am/5DvNe/2024-01-26_23-22-54.txt these are just amusing to me (way too long domain name) 23:23:42 inline (for browser viewing): https://transfer.archivete.am/inline/5DvNe/2024-01-26_23-22-54.txt