04:11:00 | <ivan`> | https://www.youtube.com/user/JPCMHD Japanese TV Commercials |
04:14:00 | <ivan`> | (is there a list of channels that have been backed up?) |
04:42:00 | <godane> | i got all of starcade episodes on to archive.org |
04:59:00 | <DFJustin> | I'm out of disk space to back up any more channels lol |
06:05:00 | <ivan`> | well, I've got that one, as long as you trust the 4TB ext4 partition to survive a few years |
10:26:00 | <Nemo_bis> | what sort of mail server does give such a response? http://p.defau.lt/?fGiPeGk9JlJJrqvDmVmxjg |
10:36:00 | <ersi> | Nemo_bis: looks like some costum message from a spam filter or whatever |
10:36:00 | <ersi> | or reputation filter thing |
14:01:00 | <SketchCow> | Next up to upload as MegaWARCs... Splinder! |
14:41:00 | <godane> | my wifi is hating me |
14:41:00 | <SmileyG> | hmmm |
14:41:00 | <SmileyG> | SOLAR FLARE WARNING BY SMILEY. |
14:42:00 | <godane> | i can't upload 128 iso of linux format |
14:42:00 | <SmileyG> | THIS WILL BE PROCEEDED BY OFFICAL WARNINGS BY NASA IN THE NEXT FEW DAYS. |
14:43:00 | | SmileyG checks spaceweather.com |
14:43:00 | <SmileyG> | Oh look, more warnings ¬_¬ Wifi is so damn sensitive and no ones realised yet its the perfect tool for spotting solar flare interferance. |
14:44:00 | <godane> | is archive.org s3 having problems? |
14:45:00 | <SketchCow> | I'm uploading to it as I go. |
14:46:00 | <SmileyG> | solar flares! (Ok, I'll stop now). |
14:52:00 | <SketchCow> | So, I'm uploading Splinder into the wayback. |
14:53:00 | <SketchCow> | That leaves a couple more, according to the spreadsheet, that have to go through the wringer. |
14:53:00 | <SketchCow> | Until we're 100% sure this works, we're not touching MobileMe. |
14:54:00 | <SketchCow> | But after I finish the last couple on the spreadsheet, specifically Picplz and Fortunecity, I'd appreciate some effort going through the archiveteam sets and finding stragglers. |
14:54:00 | <SketchCow> | Now, bear in mind - if something already has a CDX file, it's already in the wayback. I.e. we have a TON of stuff Godane added, that's all gone in, even though it's not on the spreadsheet. |
14:55:00 | <SketchCow> | But I'm looking specifically for uploads done where they're .WARC files inside .tar or .zip files. |
14:55:00 | <ersi> | How would one go about 'going through the sets and finding stragglers'? |
14:55:00 | <SketchCow> | http://archive.org/details/archiveteam |
14:55:00 | <SketchCow> | Go through "All items" |
14:56:00 | <SketchCow> | In multiple cases, we have two sets of the same files. |
14:57:00 | <SketchCow> | That is, we have 26 UMICH items, but we ALSO have 26 WARC items. That means it's done - I'm just being careful and doubling data until we're secure it's working as advertised. |
14:57:00 | <SketchCow> | Plus, we have orphan items that could probably stand to go into collections where possible. |
14:57:00 | <SketchCow> | This is all a function of #whatnow but I'd like it done, so we're cleaned up for 2013. |
14:59:00 | <SketchCow> | This is JUST for stuff in which WARCs play a part. Obviously we have tons of items with no WARCs, like the Mypodcast rescue |
15:00:00 | <SketchCow> | Also: Damn, http://archive.org/details/archiveteam is one fucking impressive wall of items, downloads, and sites |
15:01:00 | <SketchCow> | Good job |
15:04:00 | <SketchCow> | OK, going after fortunecity |
15:07:00 | <SmileyG> | Shit is wack. |
15:34:00 | <ersi> | http://i.imgur.com/eEGC2.jpg |
15:48:00 | <DFJustin> | SketchCow: not warc but needs to go in the archiveteam collection somewhere I would think https://archive.org/details/archiveteam-thingiverse-2012-09 |
15:49:00 | <SketchCow> | Done. |
15:49:00 | <SketchCow> | You will all be shocked to know that transferring millions of files using the megawarc converter slows down other disk operations on that drive. |
16:36:00 | <Ymgve> | http://www.metafilter.com/121172/Goodbye-Cruel-World |
16:36:00 | <Ymgve> | I wonder how hard it would be to extract and archive teletext |
16:41:00 | <SketchCow> | Not |
16:46:00 | <DFJustin> | seems to be a bunch of it on youtube, might be worth doing a keyword grab of some sort |
20:03:00 | <mistym> | Deos Warrior delete downloaded content after uploading? I noticed my Warrior image is growing. |
20:06:00 | <alard> | mistym: It should. Have you checked with df? (The warrior image will grow to 60GB, since the empty space on the disk is not claimed back.) |
20:07:00 | <mistym> | alard: I'll take a look when I get home. |
20:07:00 | <alard> | If you don't want to give it 60GB you can replace the disk with a smaller disk image; it will be formatted when the warrior boots. |
20:13:00 | <ersi> | or a larger one ;) |
22:11:00 | <tef> | Ymgve: ceefax.tv did it |
22:23:00 | <Ymgve> | tef: they are scraping the tv signal directly? |
22:24:00 | <chronomex> | looks like |
22:26:00 | <Ymgve> | apparently http://www.theregister.co.uk/2005/04/01/ceefax_google/ |
22:27:00 | <Ymgve> | but the content is out of date |
22:28:00 | <Ymgve> | still, nice |
22:31:00 | <dashcloud> | so reading the scrollback, I did a brief check of the items, and I came across Coming Soon, which has one item as WARCS, and there's a second item with a WARC file inside a zipfile |
23:33:00 | <godane> | i'm on the net for now |
23:33:00 | <godane> | my wireless internet is not working |
23:34:00 | <godane> | i had to go wired |