04:00:23 !a https://transfer.archivete.am/aGn6u/lnk.bio-filtered-outlinks
04:00:29 JAA: Skipped 1309 invalid URLs: https://transfer.archivete.am/13md4v/lnk.bio-filtered-outlinks.bad-urls.txt (for 'https://transfer.archivete.am/aGn6u/lnk.bio-filtered-outlinks')
04:00:30 JAA: Deduplicating and queuing 142168 items. (for 'https://transfer.archivete.am/aGn6u/lnk.bio-filtered-outlinks')
04:00:38 JAA: Deduplicated and queued 142168 items. (for 'https://transfer.archivete.am/aGn6u/lnk.bio-filtered-outlinks')
04:01:24 Outlinks from AB crawl of https://lnk.bio/ a couple months ago (job cn0hyzgpvabbskvg8f08rg1l6), filtered somewhat aggressively to remove the things we can't process here.
04:02:07 grep -Fv -e //kauth.kakao.com/oauth/ -e //access.line.me/oauth2/ -e //nid.naver.com/oauth2.0/ -e //discord.com/oauth2/ -e //www.tiktok.com/v2/auth/authorize/ -e //accounts.google.com/o/oauth2/ -e //login.microsoftonline.com/common/oauth2/ -e //appleid.apple.com/auth/authorize urls | grep -Fv -e //s3.us-west-2.amazonaws.com/ -e //cdn2.lnk.bi/ -e //cdn.lnk.bi/ | grep -Pv
04:02:12 '^https?://([^/]*\.)?(facebook\.com|instagram\.com|cdninstagram\.com|twitter\.com|tiktok\.com|tiktokcdn\.com|youtube\.com|linkedin\.com|wa\.me)/'
04:02:43 And prior to that: sqlite3 wpull.db 'SELECT url FROM queued_urls JOIN url_strings ON url_string_id = url_strings.id WHERE status = "skipped" AND url NOT LIKE "%//lnk.bio/%" AND inline_level is NULL' >urls
05:26:18 !a https://transfer.archivete.am/K24Zr/lbry-discord-urls.txt
05:26:18 fireonlive: Invalid privileges, need one of ('@', '+').
05:26:25 .voice
05:26:27 !a https://transfer.archivete.am/K24Zr/lbry-discord-urls.txt
05:26:30 fireonlive: Deduplicating and queuing 24886 items. (for 'https://transfer.archivete.am/K24Zr/lbry-discord-urls.txt')
05:26:32 fireonlive: Deduplicated and queued 24886 items. (for 'https://transfer.archivete.am/K24Zr/lbry-discord-urls.txt')
05:34:19 included link previews too; in case the cached previews linked to content that has since died
08:35:35 .voice
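
For readers who want to reproduce the outlink filter quoted at 04:02:07 and 04:02:12 without a shell, here is a rough Python sketch of the same three grep stages. The substrings and the regular expression are copied from the log; the names FIXED_SUBSTRINGS, HOST_PATTERN, keep_url and filter_urls are made up for illustration and are not part of any tool mentioned above. It assumes one URL per string with no trailing newline.

```python
import re

# Fixed substrings dropped by the two `grep -Fv` stages in the log
# (OAuth/login endpoints, then storage and CDN hosts).
FIXED_SUBSTRINGS = (
    "//kauth.kakao.com/oauth/",
    "//access.line.me/oauth2/",
    "//nid.naver.com/oauth2.0/",
    "//discord.com/oauth2/",
    "//www.tiktok.com/v2/auth/authorize/",
    "//accounts.google.com/o/oauth2/",
    "//login.microsoftonline.com/common/oauth2/",
    "//appleid.apple.com/auth/authorize",
    "//s3.us-west-2.amazonaws.com/",
    "//cdn2.lnk.bi/",
    "//cdn.lnk.bi/",
)

# Same pattern as the `grep -Pv` stage: the listed domains and any subdomain.
HOST_PATTERN = re.compile(
    r"^https?://([^/]*\.)?"
    r"(facebook\.com|instagram\.com|cdninstagram\.com|twitter\.com|tiktok\.com"
    r"|tiktokcdn\.com|youtube\.com|linkedin\.com|wa\.me)/"
)


def keep_url(url: str) -> bool:
    """Return True if the URL survives all three filter stages."""
    if any(fragment in url for fragment in FIXED_SUBSTRINGS):
        return False
    return HOST_PATTERN.search(url) is None


def filter_urls(urls):
    """Keep only the URLs that pass keep_url, preserving input order."""
    return [u for u in urls if keep_url(u)]
```

As in the original regex, a bare host without a trailing slash (for example https://facebook.com) is not matched by the pattern and would be kept.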