01:15:02 it looks like some Google Sites are going down November 1, 2020 per https://www.archiveteam.org/index.php?title=Google_Sites and #nearlylostmygoogles doesn't seem to have started 01:15:22 oh, only if they haven't been viewed in a while 01:16:02 It has started 01:16:14 ah 01:19:37 JAA: #containerspill 01:20:15 Looks like we're using #failwhale despite the Twitter association. 01:22:06 don't stereotype, failwhales are v. diverse 01:40:34 ivan: Google Sites has started, but complete discovery remains an issue. About 17 million sites have been found by searching for links on the internet as well as scraping the internal Google Sites search engine on various keywords (this is still ongoing). 01:44:28 what's the deal with the name failwhale? 01:46:20 cadence: https://en.wikipedia.org/wiki/Failwhale 01:46:48 I see 02:25:15 OrIdow6: yes framabee, framadrop, framanews, framastory 02:26:45 framadrop is still up 02:28:28 https://usercontent.irccloud-cdn.com/file/QRwyqSPf/ima15059258469648676385.png 02:28:36 https://usercontent.irccloud-cdn.com/file/KgiPhAhU/ima7450677160033218090.png 02:29:03 Images from cadence in Invidious Matrix 02:30:02 Glad to see the CEO of GitHub wants to see the YouTube-DL repo restored 02:33:25 Though it seems they have been having difficulty contacting the maintainer 02:34:02 This is not the right channel for that. 02:38:35 Ok so that is for -ot, sorry 02:55:59 https://arstechnica.com/gaming/2020/10/how-indiana-jones-fought-the-communists-and-led-an-era-of-activist-video-games/ 02:56:12 i am sending the url into IA 02:56:24 need to check after if it run from wbm 05:08:11 Welp, Playstation seems to have banned my IP lol 05:08:36 They allowed me to download all of my links and that finished more than 12 hours ago, hopefully the unban me soon 06:49:59 What does scripts only mean for the google sites warrior project? 06:54:43 it won't run directly on the warrior VMs, you'll have to clone the script and run it or use the docker container. 07:09:35 It seems like there have been quite a few of those lately 07:09:57 So that would have nothing to do with why there is no console output on warrior web UI for google sites projects? 07:10:46 yes, the "scripts only" one's won't work on regular warriors. It should have an error somewhere. 07:14:17 It does not give me an error when I select it in the warrior. I am using the VM. 07:17:45 According to the repo, I can just select it from the UI 07:22:34 have you seen this issue? https://github.com/blackjack4494/yt-dlc/issues/9 07:22:51 I like the idea that was posted there of moving discussion to a mailing list 07:23:09 mailing lists are not hard to use, and I think they're good because they don't require signing up for yet another account 07:23:44 of course the main advantage is that if the mailing list server gets taken offline, people can use their inboxes to reconstruct everything 07:39:56 Ill move to #nearlylostmygoogles 08:02:16 Would it be appropriate to change the 'active' projects that are not available in warrior to 'hiatus'? 08:02:26 on https://archiveteam.org/index.php?title=Warrior_projects 08:11:02 mgrandi Jean-Fred: The main problem I saw in WBM crawls of the Playstation store is that M-rated games got stuck on the age gate. Regarding completeness, I don't know. 08:21:05 Hmm, might need a cookie or smarter handling of those 08:21:21 I'm currently ip banned from playstation LOL, will need to do it on my DO box 08:21:31 Really hope this is temporary or else my ps4 is quite useless 08:58:52 so, hot tip, don't be lazy and run stuff on your home network 11:11:59 I'm sure it'll be temporary since home IPs typically rotate. PS wouldn't want to ban people who got rotated onto a formerly banned address. Give it a couple of days. 11:12:15 Or if your address rotates, unplug and replug the router. 11:13:28 yeah. was being lazy , now its running on a Digital ocean instance 11:13:56 the funny thing was is that it completed successfully? so like if they thought i was flooding them then they aren't doing a good job at i 11:15:03 the site is awful, i can't get to the other regions without going to this specific page, clicking on a link, and then copying either the cookie or referer URL and putting that in my script 11:54:21 wget-at is suddenly not getting stuff it was retrieving before as page prereqs =/ skipping images 16:34:19 i installed this https://hub.docker.com/r/archiveteam/warrior-dockerfile/ shouldnt it work with more then only URLTeam 2? 16:38:04 "so, hot tip, don't be lazy and run stuff on your home network" 👀 i run way too many things on my home network 16:39:25 LuMa: That is a docker for the whole warrior, you’d need to pull an image for the specific project you want to run https://hub.docker.com/u/warcforceone 16:48:22 So i install it and run it as a seperate thing? 18:29:18 -purplebot- Warrior projects edited by S-crypt (+364, Added google sites, github, and …) just now -- https://www.archiveteam.org/?diff=45709&oldid=44433 18:30:06 LuMa: yes. I think you have to pull and run the project image. 18:33:45 I could get the whole warrior running but i don't know how to run a specific image 18:44:39 Elsewhere someone provided this example: docker run warcforceone/google-sites-grab:latest username 18:44:47 (replace username with your name) 19:01:34 LuMa: ^ 19:09:20 Got it running any way to interface with it and can i run multiple different projects 19:23:56 LuMa: not really sure... I think there is a web server that runs by default. And there are a few CLI parameters you can set like --concurrent N 19:24:21 If you want to run multiple separate docker containers, you can run multiple projects 19:25:41 Tagging arkiver and Orldow6 just to make sure what I said above is correct 19:25:43 Doe you know what port it runs on? 19:26:59 I think default is 8001 (based on looking at source code) 19:28:01 8688 or 8681 also appear in the code but I don't think it's on those 19:29:22 Nope doesnt work for me but i noticed it doesnt say any port on the docker like the normal warrior 19:29:46 https://cdn.discordapp.com/attachments/399975242417569795/770368555186651186/unknown.png 19:31:45 Also pinging wessel1512 in case they know anything about this ^ 19:38:43 LuMa: here is a list of parameters for the CLI, you can try specifying a port https://www.irccloud.com/pastebin/zbfBGBgM/ 19:42:33 So say i want to specifiy the port 8002 how would i type it? 19:42:58 I think you need to expose the port through Docker as well, else it's only available in the container? I'm not very familiar with Docker though. 19:48:13 run-pipeline3: error: the following arguments are required: PIPELINE, DOWNLOADER 19:48:38 What should i type? 19:50:14 On one project I just built the image with the command I needed https://www.irccloud.com/pastebin/DJfnmyjs/Dockerfile 19:50:38 (edited to remove timestamp) 19:52:51 LuMa: try building and running the image like these examples in the Docker docs: https://docs.docker.com/get-started/part2/#build-and-test-your-image https://docs.docker.com/get-started/part2/#build-and-test-your-image 19:55:57 Might do that later but i can see it working in the logs and i see my name leaderboard so for now i will let it be but if i start another project i will try with what you linked 19:58:34 Alright :) 19:59:07 The args are the pipeline script and your username for the tracker 20:17:25 The only thing was that the docket image couldn't build the wget-at binary, it needs to be upgraded to a later version of ubuntu 22:48:23 Wondering if RedBubble is worth a proactive archive because of https://twitter.com/DAVID_FIRTH/status/1320570963117903878 - problem is, shopping website of the wazoo 22:59:35 a project is coming up archiving URLs (outlinks) discovered in some projects and from other sources 22:59:46 not sure if it was mentioned here already 23:00:04 was talked about a bit in other channels 23:00:58 in this case the outlinks would be archived without page requisites (at least for now), so it'll mostly be HTML, but without the images 23:01:10 this is due to size, to keep it down, but at the same time archive valuable data 23:54:19 -purplebot- File:Docker-Logo-White-RGB Vertical.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45710&oldid=0 23:55:20 -purplebot- File:Docker-Logo-White-RGB Horizontal.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45711&oldid=0 23:55:20 -purplebot- File:Horizontal-logo-monochromatic-white.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45712&oldid=0 23:55:20 -purplebot- File:Vertical-logo-monochromatic.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45713&oldid=0 23:55:20 -purplebot- File:Docker-Logo-White-RGB Moby.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45714&oldid=0 23:55:20 -purplebot- File:Moby-logo.png uploaded by Arkiver (+0) just now -- https://www.archiveteam.org/?diff=45715&oldid=0