00:00:37kaz (Kaz) joins
00:00:37@ChanServ sets mode: +o kaz
00:05:30@kaz quits [*.net *.split]
00:28:58kaz (Kaz) joins
00:28:58@ChanServ sets mode: +o kaz
00:29:32igloo22225 (igloo22225) joins
00:30:22linuxgemini (linuxgemini) joins
01:10:23kokos- joins
01:37:26katia_ (katia) joins
03:39:37PredatorIWD26 joins
03:42:58PredatorIWD2 quits [Ping timeout: 260 seconds]
03:42:58PredatorIWD26 is now known as PredatorIWD2
03:59:53DogsRNice quits [Read error: Connection reset by peer]
04:00:11DogsRNice joins
04:35:58tech234a (tech234a) joins
04:56:54DogsRNice quits [Read error: Connection reset by peer]
05:43:08nukke quits [Ping timeout: 260 seconds]
06:07:17BearFortress_ quits []
06:45:04BearFortress joins
06:46:58nukke (nukke) joins
06:52:12nukke quits [Ping timeout: 250 seconds]
07:06:30nukke (nukke) joins
07:11:42nukke quits [Ping timeout: 250 seconds]
07:24:16nukke (nukke) joins
07:29:02nukke quits [Ping timeout: 250 seconds]
07:42:53nukke (nukke) joins
08:57:57<Nemo_bis>I've just saved 75 % disk space in a staging directory of mine by switching WARCs from gzip to zstd -19 compression (without custom dictionary)... Is it ok to upload *.warc.zst files to archive.org collections or is there some special procedure to follow compared to gz?
09:08:19BornOn420 quits [Remote host closed the connection]
09:15:16nulldata quits [Quit: So long and thanks for all the fish!]
09:16:53nulldata (nulldata) joins
09:24:19Matthww joins
09:24:47BornOn420 (BornOn420) joins
09:55:35myself9 (myself) joins
09:56:12myself quits [Read error: Connection reset by peer]
09:56:12myself9 is now known as myself
10:25:14SootBector quits [Remote host closed the connection]
10:32:20katia_ quits [Ping timeout: 250 seconds]
10:33:03kokos- quits [Ping timeout: 260 seconds]
11:03:36kokos- joins
11:20:06katia_ (katia) joins
11:28:51<@arkiver>Nemo_bis: did you compress the records individually?
11:29:02<@arkiver>or just decompressed the .gz, and compressed the entire WARC as .zst ?
11:29:55<@arkiver>if your .zst WARC are valid, there is no special procedure, they can just be uploaded as normal and will be handled
11:50:27<Nemo_bis>arkiver: I just decompressed and recompressed as is. How do I verify if it's still valid?
11:50:55<Nemo_bis>The entire WARC file I mean.
11:55:06<Nemo_bis>I see that the WARC-Filename headers inside still reference *.gz but they don't need to match, right? They already didn't after smaller WARC filed were megawarc'ed
12:54:08<OrIdow6>Nemo_bis: I think what arkiver means is that your zstd-compressed version needs to have each record zstd-compressed separately, then the compressed versions concatenated together
12:54:12<OrIdow6>Not as a single stream
12:54:23<OrIdow6>Because that lets you seek
12:56:45<@arkiver>yes what OrIdow6 says
12:57:01<@arkiver>Nemo_bis: no they don't need to match
12:58:00<@arkiver>the way you recompressed it creates an invalid WARC, and explains the 75% disk space savings over .gz. if you compress the records individually for a correct WARC, the percentage saved should be much smaller
13:06:27funderscore is now known as f_
14:48:43<Nemo_bis>Right, makes sense. Thanks! (I wasn't planning to upload these, they're just a local cache of mine.)
15:48:40katia_ quits [Ping timeout: 250 seconds]
15:49:13kokos- quits [Ping timeout: 260 seconds]
15:56:59kokos- joins
15:59:12th3z0l4_ joins
16:01:28th3z0l4 quits [Ping timeout: 260 seconds]
16:25:18katia_ (katia) joins
16:32:29SootBector (SootBector) joins
16:36:58SootBector quits [Remote host closed the connection]
16:39:48katia_ quits [Ping timeout: 250 seconds]
16:39:58kokos- quits [Ping timeout: 260 seconds]
18:04:06<nicolas17>turns out the reason I'm uploading at 5s/MiB instead of 5MiB/s is a much more global problem
18:06:14<nicolas17>I see friends complaining about crazy high packet loss to AWS us-east1 even
18:17:06<nicolas17>602/631 [46:28<03:15, 6.73s/MiB]
18:44:33kokos- joins
19:06:16kokos- quits [Ping timeout: 250 seconds]
19:28:42kokos- joins
19:29:06katia_ (katia) joins
19:37:18that_lurker quits [Read error: Connection reset by peer]
19:37:22that_lurker (that_lurker) joins
19:38:46kokos- quits [Ping timeout: 250 seconds]
19:39:03katia_ quits [Ping timeout: 260 seconds]
19:39:19SootBector (SootBector) joins
19:52:12DogsRNice joins
19:57:35kokos- joins
20:07:41BornOn420 quits [Remote host closed the connection]
20:08:15katia_ (katia) joins
20:41:28kokos- quits [Client Quit]
20:41:29katia_ quits [Client Quit]
21:47:12Sidpatchy3 (Sidpatchy) joins
21:47:58Sidpatchy quits [Ping timeout: 260 seconds]
21:47:58Sidpatchy3 is now known as Sidpatchy
22:09:18BornOn420 (BornOn420) joins
22:38:42PredatorIWD2 quits [Read error: Connection reset by peer]
22:42:00PredatorIWD2 joins
23:54:20PredatorIWD2 quits [Read error: Connection reset by peer]