-
arkiver
-
h2ibotarkiver: Skipped 1 bad URLs: transfer.archivete.am/11mgf4/dripr_urls.txt.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/UgJEj/dripr_urls.txt.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 35975 items.
-
h2ibotarkiver: Deduplicated and queued 35975 items.
-
arkiver
-
h2ibotarkiver: Invalid command message.
-
arkiver
-
h2ibotarkiver: Invalid command message.
-
arkiver
-
h2ibotarkiver: Skipped 31 bad URLs: transfer.archivete.am/CKtm5/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/vnq32/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112516 items.
-
h2ibotarkiver: Deduplicated and queued 112516 items.
-
arkiverhah check out those ugly URLs transfer.archivete.am/CKtm5/discord-Anchor.bad-urls.txt
-
arkiveranyway fixed TheTechRobo :) we can now handle even more bad URLs here
-
JAAHmm, the on.aws one could be valid. URLs are not required to have a path component.
-
JAASame with localhost etc. actually.
-
arkivertrue
-
arkiverURL pattern i use here requires at least one dot
-
arkiverbefore the first /
-
JAAYeah, that's fine I think. But the on.aws one is still valid even when requiring a domain with at least two labels.
-
arkiveryeah
-
JAAI actually had to look it up in the URL standard to make sure, but it's here, point 2 in the path start state: url.spec.whatwg.org/#path-start-state
-
arkiver
-
h2ibotarkiver: Skipped 31 bad URLs: transfer.archivete.am/5sAkw/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/mFFNm/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112516 items.
-
h2ibotarkiver: Deduplicated and queued 112516 items.
-
arkiverah IDN encoding problems
-
arkiver
-
h2ibotarkiver: Skipped 31 bad URLs: transfer.archivete.am/BuWnv/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/kCnMa/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112516 items.
-
h2ibotarkiver: Deduplicated and queued 112516 items.
-
arkiver
-
h2ibotarkiver: Skipped 1 bad URLs: transfer.archivete.am/ZnQCQ/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/WFgAC/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112546 items.
-
h2ibotarkiver: Deduplicated and queued 112546 items.
-
arkiver
-
h2ibotarkiver: Skipped 1 bad URLs: transfer.archivete.am/2TUjc/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/ZoSr/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112546 items.
-
h2ibotarkiver: Deduplicated and queued 112546 items.
-
arkivergoing to keep in the requirement of a dot in the domain
-
arkiver
-
h2ibotarkiver: Skipped 30 bad URLs: transfer.archivete.am/12mcVj/discord-Anchor.bad-urls.txt
-
h2ibotarkiver: Skipped 1 unprintable URLs: transfer.archivete.am/M7vCx/discord-Anchor.not-printable.txt
-
h2ibotarkiver: Deduplicating and queuing 112517 items.
-
h2ibotarkiver: Deduplicated and queued 112517 items.
-
arkiverJAA: we're adding the / now
-
arkiverinserting actually, one could say 'fixing'
-
» arkiver is afk for the night
-
JAASounds good, and good night! :-)
-
TheTechRobo`sh.rustup.rs](https://sh.rustup.rs` is probably a bug in the regex
-
TheTechRobogranted, it is not valid formatting, but a lot of people think that it works on discord since it uses mini markdown iirc
-
RyzAnother social media platform to go through for offsite links called 'Minds'; exameple, minds.com/RussLeachDraws - came from russleach.com
-
Ryzarkiver ^
-
RyzHmm, what about mining outlinks from links like bio.site/momcmasters ? (Came from nitter.net/momcmasters ) - considering that stuff like Twitter only allows one URL to link out (whereas in the past, multiple links can be outputted), websites like this would bypass this limit
-
RyzThe originating website that runs those kinds of links is biosites.com
-
schwarzkatz|mI also thought about that, but they are sooo js hevy most of the time
-
schwarzkatz|mheavy*
-
schwarzkatz|mAnd there are like 100 of them, they can go down anyday and links get lost