00:02:19Bedivere joins
01:05:46Sir_Bedivere joins
01:08:02Bedivere quits [Ping timeout: 252 seconds]
01:11:20pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
01:13:33<pokechu22>managed to run out of disk space after 68585014466 bytes of XML :|
01:13:43<pokechu22>hopefully should be able to recover and get more data afterwards
01:15:51pabs (pabs) joins
01:16:20pabs quits [Remote host closed the connection]
01:19:18pabs (pabs) joins
01:39:05igloo22225 quits [Quit: Ping timeout (120 seconds)]
01:40:00igloo22225 (igloo22225) joins
01:40:02nepeat_ quits [Client Quit]
01:40:34nepeat (nepeat) joins
03:13:06igloo222253 joins
03:13:16igloo22225 quits [Client Quit]
03:13:16qwertyasdfuiopghjkl quits [Client Quit]
03:13:17igloo222253 is now known as igloo22225
03:45:50qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
05:44:16Nemo_bis (Nemo_bis) joins
05:51:46<pokechu22>I was able to resume the rationalwiki.org job but I had to modify the tools to do so because the page it died on had invalid utf-8 on one of its revisions, and the resume logic was treating the full file as corrupt in that case. It seems to be successfully continuing from where it left off now (and successfully trimmed and redid the missing chunk) though
05:53:18<pokechu22>I've got NTFS file compression on now so 69 GB becomes 47.6 GB (which is pretty crappy - zstd can get it down to a few hundred megs I think - but it's better than nothing), and hopefully the ~50 GB I have free now will be enough :|
05:54:44<pokechu22>(my workaround was to use yield segment.decode('utf-8', 'ignore')
05:55:23<pokechu22>in reverse_readline - and it detected the file was corrupt because the unicode exception thrown otherwise was caught in resumePreviousDump by an except block with "probably file does not exists")
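[editor's note] A minimal sketch of the workaround described above, assuming the dump is read back as byte segments. decode_segment and last_line_status are hypothetical names for illustration, not the actual wikiteam functions; only the decode('utf-8', 'ignore') call comes from the log. It also shows why catching the decode error separately avoids misreporting a bad byte as a missing file.

```python
def decode_segment(segment: bytes, lenient: bool = True) -> str:
    """Decode one chunk of the dump; lenient mode drops undecodable bytes."""
    return segment.decode("utf-8", "ignore" if lenient else "strict")


def last_line_status(raw_lines, lenient):
    """Hypothetical stand-in for a resume check that reads lines back."""
    try:
        lines = [decode_segment(s, lenient) for s in raw_lines]
    except UnicodeDecodeError:
        # Handled on its own so an invalid byte is not reported
        # as a missing or corrupt file by a catch-all except block.
        return "invalid utf-8"
    except OSError:
        return "file missing"
    return lines[-1] if lines else "empty"


raw = [b"<page>", b"caf\xff", b"</page>"]
print(last_line_status(raw, lenient=True))
print(last_line_status(raw, lenient=False))
```

With lenient decoding the invalid byte is silently dropped, so the resume check can still find the last complete line; the cost is that the affected revision text is no longer byte-identical to the original.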
06:13:46jtagcat quits [Quit: Bye!]
06:14:28jtagcat (jtagcat) joins
06:16:49<@JAA>I thought NTFS was a bad idea here because it's case-insensitive?
06:19:01<pokechu22>Yeah, but I'm already using it so I can't do much about that at the moment - I've got an incomplete patch to work around that which I probably should tidy up
06:19:10<pokechu22>It doesn't affect the main XML file though which is the most important thing
06:19:33<pokechu22>(and also the one most likely to cause me to run out of space; Special:MediaStatistics says that files themselves are only 6GB or so)
06:21:41<pokechu22>In this case, it looks like there are 24 files (of 29572) that will be affected by it
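[editor's note] One way to count affected files like this is to group names by a case-folded key. This is a sketch under assumptions: case_collisions is a hypothetical helper, and str.casefold() only approximates how NTFS compares names, so the result may differ slightly from what the filesystem actually does.

```python
from collections import defaultdict


def case_collisions(names):
    """Return groups of names that differ only by letter case."""
    groups = defaultdict(list)
    for name in names:
        # Names that share a folded key would overwrite each other
        # on a case-insensitive filesystem.
        groups[name.casefold()].append(name)
    return [group for group in groups.values() if len(group) > 1]


sample = ["Foo.png", "foo.png", "Bar.jpg"]
clashes = case_collisions(sample)
print(clashes)
print(sum(len(group) for group in clashes), "of", len(sample), "files affected")
```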
07:04:09Sir_Bedivere quits [Ping timeout: 258 seconds]
07:08:59Sir_Bedivere joins
09:11:51<DigitalDragons>I made a little IRC bot
09:12:18<DigitalDragons>that manages running wikiteam3/dokuwikidumper*
09:12:35@rewby quits [Ping timeout: 252 seconds]
09:12:53<DigitalDragons>it's up and running in #wikibot if people here would like to try it out
10:47:08rewby (rewby) joins
10:47:08@ChanServ sets mode: +o rewby
12:03:14TastyWiener953 (TastyWiener95) joins
12:03:42TastyWiener95 quits [Read error: Connection reset by peer]
12:03:43TastyWiener953 is now known as TastyWiener95
13:38:36TheTechRobo quits [Client Quit]
13:40:07AnotherTechRobo joins
13:50:45AnotherTechRobo quits [Remote host closed the connection]
13:54:17TheTechRobo (TheTechRobo) joins
14:23:17Megame (Megame) joins
16:32:58Sir_Bedivere quits [Read error: Connection reset by peer]
16:34:40TheTechRobo quits [Excess Flood]
16:37:41Bedivere joins
16:44:14TheTechRobo (TheTechRobo) joins
17:32:42Sir_Bedivere joins
17:35:17Bedivere quits [Ping timeout: 252 seconds]
18:00:41Megame quits [Client Quit]
18:55:11TheTechRobo quits [Client Quit]
19:38:55jtagcat quits [Killed (ing.hackint.org (Nickname regained by services))]
19:38:56TastyWiener95 quits [Client Quit]
19:39:02jtagcat (jtagcat) joins
19:39:41TastyWiener95 (TastyWiener95) joins
19:43:47TastyWiener95 quits [Client Quit]
19:44:03TastyWiener95 (TastyWiener95) joins
20:47:06nulldata quits [Quit: Ping timeout (120 seconds)]
20:48:19nulldata (nulldata) joins
21:28:11atphoenix_ quits [Ping timeout: 258 seconds]
21:28:44atphoenix_ (atphoenix) joins
21:59:35that_lurker quits [Quit: Clowning around is not the same as fooling around...I am a clown, not a fool]
22:01:36that_lurker (that_lurker) joins
22:57:02Matthww1 quits [Ping timeout: 252 seconds]
22:58:00Matthww1 joins
23:24:59TheTechRobo (TheTechRobo) joins