00:14:15 mikolaj|m: I'm currently using pup for the scraping side of that, looks similar to yours https://github.com/ericchiang/pup 00:19:42 pabs: pup is great, but it can't crawl on its own (you need to have something else to drive it, e.g. a script). The tool im working on can do that, without still becoming too complicated 00:21:13 Btw, in case anyone wants to try, the repo is here, but please dont post it anywhere on social media: https://github.com/mikwielgus/skrob . i sadly havent had the time to write documentation :( 00:21:37 sounds useful 00:23:37 ooh 00:23:53 i've been using htmlq but this sounds nicer 00:24:50 It's almost done, but i also want it to be able to resume its own state from a logfile, this is the last feature before i finish it i believe 00:26:03 It also works on JSON endpoints by converting JSON to XML, as cursed as it sounds 00:28:00 So you can crawl and scrape JSON endpoints like Discourse's or Hacker News's 00:28:28 mikolaj|m: https://dl.fireon.live/irc/6a2fbc492b5563b8/whathaveyoubuilt.png 00:28:58 It's also concurrent 00:29:25 So you can have multiple connections working concurrently 00:29:26 :P 00:30:18 Here's an example Discourse crawler/scraper: https://github.com/mikwielgus/skrob/blob/develop/tests/test_skrob.py#L131 00:39:44 Neat 00:45:51 :) 00:50:06 <+rss> [@textfiles⊙mao] There was a post on /r/DataHoarder on Reddit with concerns about the future of Internet Archive. Brewster decided to respond. https://www.reddit.com/r/DataHoarder/comments/1bswhdj/comment/kxm65da/?utm_source=reddit&utm_medium=web2x&context=3 https://mastodon.archive.org/@textfiles/112198791321549063 00:50:16 (via #archiveteam-twitter) 01:01:31 update on rpi and wpa3 i missed earlier: https://rachelbythebay.com/w/2024/02/07/feedback/ 02:24:01 [1588] My 36 Inch King Dick Wrecks Locks! https://www.youtube.com/watch?v=7X2PuE6ELmg 02:31:28 cc JAA 02:31:30 :3 02:32:46 Oh yeah! Thanks :-) 02:33:34 :) 02:47:43 the puns are amazing 03:20:58 yeee 03:41:02 I always wonder how many takes these ones require to get one where he stays completely on-brand throughout. 03:50:35 hmm yeah. part of me just assumed he one taked it but no way right? 04:34:09 huh, https://thedailywtf.com/ still exists 04:56:55 "I noticed the discmaster2.html web page updated today with a new feature, the ability to open any file/folder/cd directly in an in-browser emulator with 1 click: https://discmaster.textfiles.com/emuMacVideo.mp4" 05:36:25 TIL there's an Australian island (Lord Howe Island) that has a DST shift of only 30 minutes. UTC+10:30 in winter and UTC+11 in summer. wat 05:36:40 wat 05:36:45 -_- 06:03:07 https://genius.com/Kesha-tik-tok-lyrics#:~:text=Wake%20up%20in%20the%20morning%20feelin%27%20like%20P.%20Diddy 06:03:08 oh no 06:31:45 Barto: https://owasp.org/blog/2024/03/29/OWASP-data-breach-notification < this real? 06:32:00 >I think I am affected. What do I need to do? OWASP has already removed your information from the Internet, so no immediate action on your part is required. 09:08:11 do you think the machines would have honoured their deal with Cypher? 09:08:31 would he actually get what he wanted after? 09:17:05 is this... the matrix? 09:21:10 yes 13:29:47 https://news-patreon.com/articles/f95zone-acquisition-update 13:47:46 what even is f95zone? I looked at it yesterday and it looks like a piracy site for hentai games 14:02:03 that_lurker: It's fake 14:02:18 nukke: that's precisely what it is basically 14:02:54 nyany: I know. Did you open the that link? 14:03:00 -the 14:03:22 no 14:03:29 and at this point i don't trust it 14:03:32 lol 14:03:42 :| 14:03:54 i believe nukke's reaction is very telling 14:04:22 turns out it's real, though. click the link 14:04:41 I'm surprised they paid so little for it 14:05:53 Visa and mastercard don't like spicy stuff so that might be why 14:06:31 i don't trust either of you 14:09:33 ahhhhh 14:09:35 i can see it 14:12:55 https://usercontent.irccloud-cdn.com/file/V5lz2RVj/image.png 14:12:59 my money's on that being a rickroll 14:15:25 the one and only 14:33:08 discord's loot boxes trailer now has 600m+ views after they looped it in the corner of every active discord user accidentally making a view bot lmao https://youtu.be/cc2-4ci4G84 14:39:41 discord can go pound sand lol 14:40:10 discord++ 14:40:10 -eggdrop- [karma] 'discord' now has -6 karma! 14:40:14 Hahaha 14:40:58 No, seriously 14:41:29 If they're willing to take a strong stance on being anti-ad and then suddenly go "well, maybe sponsored quests won't hurt" i can't be confident they won't backtrack on other things 15:19:19 DigitalDragons: "accidentally" 15:19:55 i saw an article about how "you can choose not to interact with it" and my only thought was "for now" 15:28:41 yeah 15:29:24 For me it currently shows 1 406 948 041 views 15:33:38 What frustrates me the most is that I'm paying Discord a monthly fee plus all of these bonuses so they don't have to look to serving "ads" as a way to boost revenue and apparently that wasn't good enough 15:45:12 nitro never had a chance of generating enough revenue to cover their costs 15:46:36 paying money is never enough for these sorts of investor-led corporations, they will always want more 15:46:49 that's basically their business 15:46:51 I would be shocked if Discord is even breaking even 15:47:38 also important to remember that whether a company is "profitable" is generally calculated in very dubious ways for VC-funded stuff in particular 15:48:25 so if a VC-funded company says "we are losing money" then that may very well mean "we are losing money after paying down massive amounts of money to our investors" 15:48:48 which is mostly just a bookkeeping trick 15:49:30 (that is; a company that says they are losing money may well have a higher revenue than their operational expenses anyway) 15:50:25 I mean, that's also true, consistent profitability isn't enough, you need growth 15:51:11 but more fundamentally you need profitability and discord had no chance of that with nitro and game partnerships -vs- media hosting, call hosting, moderation, dev... 15:59:25 doesn't discord already have an opt-out "we can use your data to train AI" thing? 16:00:26 https://www.reddit.com/r/discordapp/comments/11ihqq6/discord_will_possibly_record_your_video_calls/ 16:05:32 No they do not 16:05:58 that was for a "clips" feature when screen sharing 16:07:11 https://support.discord.com/hc/en-us/articles/16861982215703-Clips 16:42:43 fireonlive: owap? sounds like it 16:42:50 ye 16:43:41 i've raised my criticism recently about owasp and its governance, but this aint related, except that their reputation is again tanking 16:43:57 so yeah, they didnt need that 16:51:34 >_< 17:21:55 might need to archive mcdonalds canada, 'cause what the fuck https://foodology.ca/mcdonalds-canada-new-chicken-cheeseburger-surf-n-turf-burger-sweet-chili-junior-chicken-and-apple-pie-mcflurry/ 17:22:32 This seems like a dying breath move to me 17:33:11 wat 17:33:25 and god damn it i'm going to have to try that aren't i 17:33:38 nyany: you going to risk it? 17:33:43 body and mind, yes 17:33:54 in the name of discovery 17:34:17 excellent 17:34:25 always try once lol 17:36:00 nyany go brrrrr https://usercontent.irccloud-cdn.com/file/H14MpyNQ/image.png 17:40:06 one of the AT targets is loving me right now https://usercontent.irccloud-cdn.com/file/yNdCdcET/image.png 17:42:38 :D 18:08:45 got an SMS from MSCHF: https://mschf.com/shop/candy-airpods/ 18:08:50 candy airpods are here lol 18:09:32 finally it's not shoes ™ 18:17:06 sad that i cant get them 18:19:21 yeah :( us only 18:19:45 i've sometimes looked at like a reship service but never had anything really worth it 18:52:35 <+rss> [@textfiles⊙mao] I've visited Internet Archive Canada today!!!! https://mastodon.archive.org/@textfiles/112202989954274089 18:52:50 see also: the whole building https://mastodon.archive.org/@mhoye⊙ms/112203087275036013 19:45:41 <+rss> [front page] Receive push notifications from your rice cooker: https://shkspr.mobi/blog/2024/03/receive-push-notifications-from-your-rice-cooker/ → https://news.ycombinator.com/item?id=39902207 19:48:04 i'd love something like this for my washer/dryer combo unit.. 19:50:37 wonder if there's a smart plug that works with the typical 'dryer socket' 19:58:39 https://news.ycombinator.com/item?id=39903742#39904842 < haven't heard of this 'JOSH' before 19:58:51 https://github.com/josh-project/josh 19:59:20 you make it smart for yourself 19:59:21 fireonlive: should be "easy" to do with esp32/esp8266 modules https://hackaday.com/2023/10/20/spinning-up-a-new-laundry-monitor/ 19:59:34 you can get one of the plugs that measure draw and do stuff based on that 20:02:04 or just conenct it to samsung's servers 20:02:31 oh neat :) 20:02:53 clamp meter might be easier 20:04:11 i'd have to check because i cannot remember, but otherwise i think i have a 14-30? https://dl.fireon.live/irc/e78910d019c54e2d/plug-types.png 20:04:52 something odd looking :p 20:05:13 so unsure if they make a smart polug for that 20:08:47 weird 20:12:04 the washer/dryer unit i have is very dumb, and also doesn't make any noise... when it's done you get a green led under the 'done' label and when the dryer is done you get .... an absence of noise 20:12:18 no tones or anything 21:11:10 Re Discord profitability, I have my doubts they reached that yet. Definitely hadn't as of a couple years ago, and I can't imagine they would keep it secret (and not move towards IPO etc.) if they did by now. 21:12:20 fireonlive: Just rig it up with some wires, WCGW? 21:13:11 :3 21:43:10 why is making new accounts suddenly so much worse everywhere? 21:44:24 like instead of "click 2-3 images that match/don't match 3-5 times" it's "do 5-10 steps 7-10 times, and maybe we'll make you do it again once you click the link in the email" 21:49:23 steering: that may have to do with your IP reputation; did you recently start using a VPN or something? 21:50:50 nah, it's been a few years (& I don't really use VPN, occasionally an SSH tunnel) 21:51:36 sooooooome of it might be my move from $big_state to $flyover_state but it seems like it was getting worse before that 21:51:58 more of what i'm talking about is brand new types of captcha that just suck 21:51:59 Maybe an uptick in spam activity using AI image classifiers and we're watching captchas hurtle towards obsolescence? 21:52:09 and flail madly on the way 21:52:09 the "point it in the right direction" stuff for example 21:52:27 yeah... I guess... 21:54:01 the worst is when it's from a site that *isn't* Shwitter and you'd expect them to care about making the signup flow smooth, lol 22:04:06 +rss- Stop using the Internet Archive as the sole host for preservation projects: https://old.reddit.com/r/DataHoarder/comments/1bswhdj/if_there_is_a_book_on_internet_archive_your/ https://news.ycombinator.com/item?id=39908676 < it made it to HN 22:05:51 someone reported that some site had you do like 20 of those 'point the object in this way' and 'click on the object that doesn't belong' and other like harder/stupider ones and if you failed any one at any point (even 20/20) you had to start over and do 20 of them again 22:13:20 Ok so real talk: for those image grid captchas, do y'all select squares that have like, tiny parts of the object? ex. A square with a slice of the tire 22:15:45 nukke: https://i.pinimg.com/originals/0a/1f/ed/0a1fed4c300f8973ebc7ee1c0ba5b0a0.jpg 22:17:21 Yes except 22:17:25 Exactly** 22:18:59 Good thing you only need to select 4 to get through. (Unless they changed it) 22:22:52 i usually do lol 22:25:06 Official #AT 2024 summer wardrobe https://www.youtube.com/watch?v=DuDjtpUDSu8 22:36:11 *bumps bitrate to 1080p premium* 22:37:54 perfect 22:38:02 yass hunty slaaaaaayyyyyy 22:38:05 boots the house down 22:38:09 💅 22:40:54 nukke: no 22:41:07 i find as long as I get the major parts it lets me slide 22:43:42 🤔 22:43:50 * fireonlive reads that sentence in various fun ways 22:50:58 fireonlive: https://www.youtube.com/watch?v=VdrEIGeDY2Q 22:51:21 that_lurker: :D