05:47:52 if they can hold out until I get _the london box_ sorted we're good, if not we're fucked
06:22:08 Local news? Potential source? https://dallasinnovates.com/ - from me finding https://dallasinnovates.com/addison-based-biostat-imaging-acquired-by-houstons-principle-health-systems/ while researching recent acquisitions
08:30:35 datechnoman: Here's about 11M goo.gl urls https://transfer.archivete.am/DCPFe/goo.gl.txt
08:30:36 inline (for browser viewing): https://transfer.archivete.am/inline/DCPFe/goo.gl.txt
08:47:04 7.4m pdf urls https://transfer.archivete.am/inline//ny1aS/pdf_urls.txt
08:47:13 umm. https://transfer.archivete.am/inline/ny1aS/pdf_urls.txt
08:48:00 7.7M edu urls https://transfer.archivete.am/inline/mLpUZ/edu_urls.txt
10:12:03 Thanks legend Vokun! Ill queue them (pdf and edu urls) once we start chewing the backlog down :)
10:12:19 I dont believe we can queue the goo.gl.txt one as we will smash their servers
10:41:26 ill put my 40g nodes on urls for a bit after mildom to assist with that backlog
10:51:49 monoxane sounds great! Will help out
10:53:38 I've put extra nodes on here to chew through the backlog
10:53:48 Just working to get over the monthly sitemaps queuing
12:17:24 still waiting for #mildou stuff to clear and then ill be back on this in full again too
15:43:43 datechnoman: are we doing as usual now with this project?
15:44:12 yes on goo.gl , those will go into the upcoming project
16:29:52 (→ #urlteamwasright )
20:50:10 arkiver - we are back to normal operations :)
20:51:38 The backlog is slowly going down so it should speed up over the coming days as the sitemaps are processed
23:29:49 Just a matter of the target (IA S3) keeping up with the extra workers ive thrown in. Once Mildom has finished up we should be sweet