-
fireonlive
hmm ye :3
-
TheTechRobo
IIRC I asked this before and it’s ‘bullshit’
-
TheTechRobo
this used to be the off topic channel
-
TheTechRobo
#Archiveteam used to be for discussion
-
qyxojzh|m
Oh, hence the BS
-
» pabs wonders if its time to change the channel list to be more self explanatory: #archiveteam-announce #archiveteam-discuss #archiveteam-offtopic etc
-
pabs
(and make the existing channels redirects to the new ones, if hackint can do that)
-
fireonlive
-
fireonlive
Cannot join #archiveteam-announce - Channel is invite only.; Cannot join #archiveteam-discuss - Channel is invite only.; Cannot join #archiveteam-offtopic - Channel is invite only.
-
fireonlive
damn it.
-
fireonlive
pabs: ye, forwarding is possible
-
fireonlive
see #helloworld for an example
-
fireonlive
-
fireonlive
it does mean bullshit :D
-
fireonlive
i guess the 2012 was more canonical, bute h
-
pabs
oh, someone already created those, huh
-
fireonlive
not registered with chanserv, but presumably JAA or someone is idling in there :)
-
thuban
i kind of prefer 'bs' to 'discuss' anyway
-
fireonlive
we can retcon it to bikeshed ;)
-
fireonlive
though ot is said to be for that :D
-
fireonlive
cow dung it stays
-
nulldata
This is for on-topic bikeshedding - ot is for off-topic bikeshedding
-
fireonlive
that works :D
-
fireonlive
bikeshedding it is
-
fireonlive
fuck.
-
fireonlive
oh never mind, crisis averted
-
fireonlive
community.poly.com bought by HP, links broken/gone. but, content seems to now be merged to the 'HP community' under HP's GENIUS naming scheme of
h30434.www3.hp.com
-
fireonlive
i've read somewhere the h#####.www3 is basically an asset number for the server whatever runs on. because that's how they do it in enterprise shit land
-
nulldata
Polycom had some pretty rock solid desk phones. I have no doubt HP will run what is left right into the ground. RIP
-
» fireonlive pours one out
-
pabs
fireonlive: terrifying naming scheme, wonder if that qualifies all of hp.com for archiving :)
-
pabs
or at least h*.www3.hp.com :)
-
fireonlive
hmm might do :)
-
fireonlive
anything poly.com most likely as it is slowly absorbed like community.poly.com suddenly was
-
fireonlive
i like how https?://poly.com is conrefused for me, but www. works
-
fireonlive
very, very enterprise
-
fireonlive
also their old domain which was polycom.com pre-poly re-branding; e.g.
documents.polycom.com which goes to
docs.poly.com now. dunno if anything lives on at the old one
-
fireonlive
-
fireonlive
(open directory)
-
fireonlive
the root redirects to
spectralink.com
-
fireonlive
-
fireonlive
-
fireonlive
-
h2ibot
Yts98 edited CUE! -See You Everyday- (+6, Update project status):
wiki.archiveteam.org/?diff=50347&oldid=46609
-
fireonlive
-
fireonlive
-
fireonlive
-
h2ibot
Yts98 edited ITunes U (+43, Mark status as offline, clean up formatting):
wiki.archiveteam.org/?diff=50348&oldid=47927
-
fireonlive
huh, it said pastebin lol
-
fireonlive
:)
-
h2ibot
FireonLive edited WikiTeam (+25, archiving_type = other (thanks yts98, missed that)):
wiki.archiveteam.org/?diff=50349&oldid=50332
-
yts98
trying to get rid of outdated "closing" projects :p
-
fireonlive
:)
-
fireonlive
wiki needs a lot of love
-
h2ibot
FireonLive edited LGTM.com (-186, It's dead):
wiki.archiveteam.org/?diff=50350&oldid=48894
-
h2ibot
FireonLive edited Karayou.com (+39, still kicking, sorta):
wiki.archiveteam.org/?diff=50351&oldid=49563
-
yts98
Is blog.pl eventually saved or not? I didn't find it in archivebot or AT IA items
-
fireonlive
-
fireonlive
not sure about the "roughly 100k of these were retrieved" (maybe that was another method before fart.website?)
-
fireonlive
-
fireonlive
actually could be some of the blog.pls in the first fart has the homepages of the ~100k blogs (ab does one level of outlink by default)
-
thuban
JAA would know more, having been the project lead
-
h2ibot
Yts98 edited Blox (+40, Mark status as offline; several pages were…):
wiki.archiveteam.org/?diff=50352&oldid=36088
-
fireonlive
yts98++
-
h2ibot
Tech234a edited Gfycat (+80, Note Snap Inc. ownership):
wiki.archiveteam.org/?diff=50353&oldid=50279
-
Buizel
-
fireonlive
i think they're looking to link to the data
-
Buizel
Ah
-
yts98
Oh, I was going to ask blox.pl instead of blog.pl lol
-
fireonlive
😠
-
fireonlive
lol
-
fireonlive
easy mistake x3
-
yts98
-
fireonlive
hmm yeah it's nowhere on
hackint.logs.kiska.pw/?q=blox.pl&w=a (except right now) either
-
fireonlive
arkiver might have to dig deep into the memory banks
-
fireonlive
but it's possible it's a lost one
-
h2ibot
Switchnode edited ArchiveTeam Warrior (+1385, begin to distinguish between warrior and…):
wiki.archiveteam.org/?diff=50354&oldid=50040
-
thuban
JAA: would appreciate a review ^
-
thuban
this doesn't actually address most of the faq/troubleshooting section (either the fact that it doesn't clearly distinguish what's the warrior and what's dpos generally, or the fact that there's a lot of text shared between it and the 'running with docker' page), but it lays the groundwork for doing so
-
thuban
(and i did what i REALLY wanted to do, which was get the big long newbie-scaring command lines out from in front of the toc)
-
thuban
if we're serious about this we should move 'warrior projects' to 'dpos projects' and edit 'dev/infrastructure' accordingly. and probably decide whether it's "warrior" or "Warrior". i'm not gonna though cause i need sleep
-
Sanqui
I think it's a Warrior
-
Sanqui
well done though
-
thuban
btw, Sanqui, do you happen to have corresponding podman instructions for the docker commands in faq/troubleshooting? we might want to add them
-
thuban
(or wiki user Cody, but i don't know whether they're on irc)
-
Sanqui
Cody is probably a better bet, my setup was very basic and they improved it
-
Sanqui
I think my warrior is broken atm and I'm not even sure if it's podman's fault
-
Sanqui
been kinda doing other stuff but if I get back into it I can try to fix it up
-
thuban
it do be like that sometimes
-
thuban
night all
-
Sanqui
nin
-
Buizel
Another idea for LLAMA or similar, have a bot that can rewrite wiki pages automatically
-
Buizel
From chat logs
-
Buizel
Maybe you'd ping it when something noteworthy happens
-
rewby
thuban, Sanqui: Re podman commands: We still have a PR open on
ArchiveTeam/warrior-dockerfile #77 about that
-
yakabuff
hello
-
fireonlive
bye
-
JAA
thuban: You're expecting me to remember anything from a project over 5 years ago? :-)
-
JAA
fireonlive: Checked my notes, I did some independent wpull stuff for blog.pl. The raw data is on IA but not included in the WBM because it's infested with rate limiting errors. I was going to produce a filtered version that would make browsing bearable, but the tooling didn't and still doesn't exist (WIP though, slowly).
-
fireonlive
you are a great note keeper! thanks for checking :)
-
fireonlive
and ye, having WBM full of 'you are rate limited' would be super gross
-
mpeter|m
<TheTechRobo> "I think Google Groips Files..." <- oh, so that was google groups **files**.. I see
-
mpeter|m
do you perhaps also have archives of google groups message boards?
-
JAA
thuban: The warrior doesn't use project images. It just clones the project repo and tries to run the pipeline. Which is also why wget-at upgrades might break for a while on the warrior, because it still ships with an older version (until we rebuild the warrior image, which can only be done when the important projects are using the new wget-at version).
-
fireonlive
important eh :3
-
» fireonlive imagines the politiking
-
JAA
thuban: Other than that, it seems good, thanks a lot!
-
JAA
fireonlive: Not much. It can be summarised as 'nobody* runs URLs in the warrior, so we can ignore that; are all other active projects updated? good, rebuild triggered'.
-
fireonlive
ah :)
-
yakabuff
If anyone's interested, I've been working on a Reddit archiver/viewer:
github.com/yakabuff/redarc
-
yakabuff
It can display pushshift dumps and fetch new threads if you have an API key
-
Ryz
So, with Google starting to push for deleting inactive accounts starting in December, I'm pondering on what stuff to start saving, a real big one would be Blogspot :C
-
Ryz
Would like help on what other areas to archive regarding Google user content... YouTube is very much out of our league since we have to much more picky on what to grab because hard drive space is something that has to be managed x_x;
-
Ryz
There's also Google Drive stuff, but uhh...
-
Ryz
...Is YouTube included...?
-
yakabuff
-
fireonlive
yakabuff: postgres 😍
-
fireonlive
thanks! that looks super neat
-
yakabuff
:)
-
fireonlive
:)
-
yakabuff
I used to have a demo instance but reddit forced me to take it down
-
fireonlive
oof :/
-
Ryz
yakabuff, bet the only reason they said that is that otherwise everyone would be mad-rushing to archive anything YouTube related :/
-
fireonlive
i guess they’re on a warpath
-
VickoSaviour
So, i think Wysp is saved. Tracker shows that everything is done. I think you can edit the page about the Wysp.
-
VickoSaviour
What about the Gfycat platform, is there some kind of work to set up the 2019-2023 project up?
-
imer
soon!
-
imer
ah they gone
-
fireonlive
VickoSaviour never stays in one place for too long
-
fireonlive
one might say too short, even
-
JAA
They also don't seem to grasp the concept of project channels.
-
h2ibot
Petchea created Kienu (+212, Created page with "{{Infobox project | URL =…):
wiki.archiveteam.org/?title=Kienu
-
h2ibot
Tomodachi94 edited INTERNETARCHIVE.BAK/ipfs implementation (+214, Note about…):
wiki.archiveteam.org/?diff=50356&oldid=22152
-
h2ibot
Switchnode edited ArchiveTeam Warrior (+46, /* Warrior architecture and alternatives */…):
wiki.archiveteam.org/?diff=50357&oldid=50354
-
h2ibot
-
h2ibot
-
h2ibot
-
h2ibot
-
h2ibot
-
h2ibot
-
h2ibot
Yts98 edited Blox (-10, No, it's blog.pl rather than blox.pl that is…):
wiki.archiveteam.org/?diff=50364&oldid=50352
-
h2ibot
Yts98 edited Picosong (+21, Mark status of Disqus comments as online):
wiki.archiveteam.org/?diff=50365&oldid=47749
-
h2ibot
Yts98 edited The Lighthouse Directory (+82, Update project status):
wiki.archiveteam.org/?diff=50366&oldid=47592
-
fireonlive
=== The instructions to run the software/scripts are awful and they are difficult to set up. ===
-
fireonlive
Well, excuuuuse me, princess!
-
fireonlive
lmfao
-
h2ibot
FireonLive edited ArchiveTeam Warrior (+67, update docker commands per imer, include…):
wiki.archiveteam.org/?diff=50367&oldid=50357
-
fireonlive
didn't touch podman... they should be the same for logs though?
-
h2ibot
FireonLive edited Running Archive Team Projects with Docker (+255, Include --include-restarting as per imer):
wiki.archiveteam.org/?diff=50368&oldid=49744
-
fireonlive
hmmmmmm
-
fireonlive
maybe urls shouldn't be our 'example' project there, with all our caveats
-
rewby
reddit probably makes more sense
-
fireonlive
it is the longest running one though. could pop shreddit in there I guess.
-
fireonlive
hey twins
-
fireonlive
will update
-
h2ibot
Yts98 edited Apple Daily (+230, Update project status, move {{inprogress}} to…):
wiki.archiveteam.org/?diff=50369&oldid=48908
-
arkiver
fireonlive: URLs is also the most dangerous project
-
arkiver
so might want to be careful with setting it as example or default anywhere
-
h2ibot
FireonLive edited Running Archive Team Projects with Docker (+5, Change URLs project example to Reddit. URLs…):
wiki.archiveteam.org/?diff=50370&oldid=50368
-
fireonlive
for sure.. don't want our first interaction with a new contributor/user to be 'hey i wanted to help but my ISP is threatening to disconnect me???'
-
arkiver
:P
-
fireonlive
even if the notice is bullshit, it leaves a unexpected sour taste in their mouth
-
fireonlive
lol
-
fireonlive
:3
-
arkiver
yep
-
appledash
Why is URLs "dangerous"?
-
appledash
I figured the "risk" was just getting your IP banned from random web sites
-
arkiver
URLs is partially aimed at archiving outlinks from other sites
-
arkiver
for example reddit
-
arkiver
it can cause you to download any random crap someone might link to online
-
fireonlive
ye, and there's 'sinkholes' that (genuinely helpfully) try to alert you if you contact certain domains that you might be infected with X, or people might get angry if you access a file that was linked somewhere because <???>
-
fireonlive
also, you might cause facebook/youtube/etc to start rate-limiting your IP, which would make you sad at your house
-
fireonlive
URLs/outlinks very important, but also quite the beat
-
fireonlive
s/beat/beast/
-
Barto
while running URLs, i had multiple instances where my ISP would suddenly "temporarily suspend" my internet because of those abuse reports. This would get unsuspended by just clicking a checkbox in the router panel, complete a captcha (lol), or change the router password, no permanent things.
-
Barto
but yeah, don't run URLs at home :D
-
h2ibot
Yts98 edited This Is My Jam (+288, Update project status):
wiki.archiveteam.org/?diff=50371&oldid=46192
-
fireonlive
i have found the source of the rewby-pings: I see messages about rsync errors. Uh-oh! Something is not right. Please notify us immediately in the appropriate IRC channel.
-
fireonlive
-
fireonlive
:p
-
h2ibot
FireonLive edited Running Archive Team Projects with Docker (-87, *spiderman pointing meme*):
wiki.archiveteam.org/?diff=50372&oldid=50370
-
fireonlive
-
fireonlive
snenssbible
-
h2ibot
FireonLive edited Running Archive Team Projects with Docker (+255, save rewby's life):
wiki.archiveteam.org/?diff=50373&oldid=50372
-
fireonlive
:p
-
h2ibot
Yts98 edited Tribe.net (+346, Update project status):
wiki.archiveteam.org/?diff=50374&oldid=47641