-
nicolas17
JAA:
opencollective.com/archiveteam/expenses/32327 what's this about? was it paid, is it still needed, etc
-
JAA
nicolas17: I was just trying to find it because I remembered there being something. Thanks.
-
nicolas17
also, seems there's $3700 balance, can that already be used if (say) rewby needs money for a target?
-
JAA
Some of it can be used, but the donations that came from those removed tiers have to wait until early July as explained in the update.
-
nicolas17
ah right, gotta wait 30d in case there's refunds
-
JAA
Yeah
-
fireonlive
if i had a grab-site that finished and just wanted to sploosh somewhere, would i just shoot the whole directory to internet archive as-is? or are crtain files not needed
-
fireonlive
(full well knowing i'm not cool enough to shove them into wayback)
-
TheTechRobo
fireonlive: I'd just upload the whole folder, no reason not to
-
TheTechRobo
If they're big I suggest, zstd-compressing the wpull.db and wpull.log files
-
fireonlive
sounds good :)
-
fireonlive
tks
-
TheTechRobo
Also, if there are wpull.db-shm and wpull.db-wal files, try doing a `gs-dump-urls` to get rid of them. You can just extract the (now zero) todo urls.
-
TheTechRobo
But it doesn't really matter.
-
fireonlive
ah! good tip
-
fireonlive
oops; i accidentally my grab-sites by letting the disk run out of space.. is thee a way to resume them?
-
icedice
You mean you deleted them?
-
icedice
Try Recuva
-
icedice
If you meant to say corrupted them then I have no idea
-
icedice
But you probably meant that since running out of storage space doesn't delete files
-
icedice
I'm tired
-
icedice
I should go to sleep
-
fireonlive
ran out of space on the file system and the python/grab-site processes crashed
-
fireonlive
-
fireonlive
ERROR OSError: [Errno 28] No space left on device
-
fireonlive
accidentally ran something which was like sure let's preallocate space lol
-
fireonlive
uwu
-
andrew
oh dear, all my .ga domains stopped resolving
-
andrew
RIP
-
andrew
seems their registration has been transferred to the registry itself
-
andrew
if you're desperate to archive these sites you can try directing your DNS queries to ns01.freenom.com and maybe some other common DNS providers like bob.ns.cloudflare.com
-
andrew
huh, Freenom's nameservers for the .ga TLD are still active
-
andrew
185.21.168.49, 185.21.169.49, 185.21.170.49, 185.21.171.49 used to be {a,b,c,d}.ns.ga
-
andrew
they still respond to NS lookups
-
andrew
have fun archiving :D
-
fireonlive
some will come up again.. i think?
-
fireonlive
but not all
-
fireonlive
it's all quite confusing lol
-
ect0s
Not sure if this is the correct place to ask, I'm looking for a bunch of files that were hosted on fileplanet, and I see the archived dump online - but I've come up short searching through the archived dumps.
-
ect0s
In case someone knows, or could lend a hand, basically the things I'm after were all hosted under
dl.fileplanet.com/dl/dl.asp
-
SketchCow
Hey, folks, there is now an Unofficial, "Friends of the Internet Archive" discord server you are all invited to.
-
SketchCow
-
SketchCow
It is not an archiveteam server, nor an official Internet Archive server. Meant to be a help setup between people.
-
fireonlive
i'm surprised the word discord isn't auto kickbanned here :p
-
fireonlive
thanks though jason :)
-
masterX244
main page warrior projects section needs a update. Reddit info is outdated there
-
h2ibot
MasterX244 edited Reddit (+251, Updated the subreddit protest news into vital…):
wiki.archiveteam.org/?diff=49875&oldid=49829
-
fullpwnmedia
JAA sorry for the long ass reply, they do. some updates have been removed. its not urgent but it should be done as soon as possible
-
duck
hi, how do I view the archived data? I got to here
archive.org/details/archiveteam_reddit and I am totally lost
-
rewby|backup
So duck's left, but future reference: With a few days of delay the reddit stuff gets ingested into web.archive.org
-
TheTechRobo
duck, assuming you're reading logs: you can just browse the Wayback Machine
-
TheTechRobo
why do I keep getting ninja'd :P
-
rewby|backup
So just take a link and chuck it in
-
rewby|backup
TheTechRobo: Jinx
-
rewby|backup
Also, please actually stay for more than a few minutes. People might take a few minutes to respond.
-
fireonlive
but i want immediate answers!
-
fireonlive
-
Ivan226
-
fireonlive
anyone know a way (or is it impossible)? to resume a dead grab-site?
-
fireonlive
or am i f-ed up the a
-
fireonlive
i had a little looks at docs but nothing stood out
-
fireonlive
or i'm blind
-
alexshpilkin
does anyone have a suggestion on how to track down the FTP data from csrd.uiuc.edu (U of Illinois now-defunct supercomputing group)?
-
alexshpilkin
it’s not on AT’s FTP crawl list, but it may have been already shut down at that time
-
alexshpilkin
it’s possible that it’s been mirrored on one of the sites that *were* archived, but there’s no searchable file index for those that I can see
-
alexshpilkin
(but I’m not hopeful)
-
» alexshpilkin prepares to be kicked to #effteepe but is not sure anybody’d actually respond there
-
alexshpilkin
(FWIW, I’m trying to track down a particular old piece of software that used to be hosted there, “Application executive” by Brian Bliss. unfortunately a file called at.tar.Z is not really findable with web search)
-
alexshpilkin
(if it’s even there)
-
icedice
I wonder if anyone uploaded it to VirusTotal
-
icedice
cc: joepie91|m
-
alexshpilkin
it was a library distributed in source form so doubt it
-
alexshpilkin
but I’m already preparing myself to contact the PR person behind the “contact webmaster” link on their current website so anything goes at this point
-
JAA
I was going to say 'have you tried contacting Mr Bliss?', but it looks like he passed away two years ago.
-
JAA
You're right that there's no index of our FTP data.
-
alexshpilkin
ah so that was him after all? I saw some articles like that but I wasn’t sure they were talking about the same person
-
JAA
Well, the obituary I found mentions that 'he was currently employed as a Software Engineer for National Center for Super Computing Applications for the University of Illinois in Champaign'.
-
JAA
-
flashfire42
I did a lot of work on the FTP list years ago but I never coded anything so I was unable to do any grabs
-
alexshpilkin
oh. yeah, then that’s probably him
-
alexshpilkin
(I couldn’t track him down past his employment at Convex Computer Corp in the 2000s)
-
alexshpilkin
(and their ex-employees site is on its last legs)
-
alexshpilkin
JAA: I was just idly fantasizing about making such an index but I looked through the items on Archive.org and the format seems kinda cursed, also it would probably require downloading everything and I don’t currently have the disks for that even if I do surprisingly have the bandwidth
-
JAA
Yeah, cursed sounds about right.
-
JAA
Some of it is WARC, where it's relatively easy, but I think there's non-WARC stuff as well.
-
alexshpilkin
I mean no problem, crawls do tend to turn out like that, mine included :)
-
alexshpilkin
but it’s not exactly something I’m prepared to code up this evening
-
alexshpilkin
still if anyone has full copies and a good idea of what variations there are I could try to work on test cases and then throw executables at them
-
alexshpilkin
flashfire42: if that was intended for me I couldn’t really understand it, sorry
-
imer
alexshpilkin: been down a rabbit hole, there is a mention of
ftp://ftp.cs.uiuc.edu/pub/research-groups/csrd/oldftp but that is also gone
-
imer
-
JAA
-
JAA
But TIL people are mirroring Jason's site to GitHub.
-
imer
of course :) was searching github, cause you know, code
-
sonick
Did anyone mention the Ragtag Archive that is shutting down on July 24?
-
sonick
They seem to have a 1.38 PB Vtuber video collection.
-
sonick
-
sonick
-
sonick
The collection includes videos of VTubers no longer in existence there, so it is worth archiving.