00:02:54<@JAA>> Got status 400 from IA S3 on uploading part 0, retrying after 3 seconds
00:02:57<@JAA>u ok, IA?
00:03:28Exorcism quits [Remote host closed the connection]
00:04:24Exorcism (exorcism) joins
00:04:33<@JAA>Looks like I'm discarding the response details, so no idea what the exact error is. I've only seen it on Content-MD5 mismatches in the past, but it's highly unlikely that's it.
00:05:46<@JAA>The machine has ECC RAM and the retry succeeded, so yeah.
00:06:16<@arkiver>the RAM at IA is ECC too
00:06:41<@arkiver>there have been mismatches yeah, and those have been on unsecured connection
00:08:41<@JAA>Just for added context, this is with ia-upload-stream, which first reads the data into memory (from disk in this case), then calculates the MD5 and uploads from the buffer.
00:09:34<@JAA>Two uploads failed with that error in the past couple hours, both succeeding on the retry. If it keeps happening, I'll start logging the response contents.
00:24:57<@arkiver>do you have the task?
00:25:24<@JAA>No task for it since it failed already on uploading.
00:25:31<@JAA>But foiaonline.gov_requests_202309_part2 is the item.
00:26:09<@JAA>214 and 227 behaved like that.
01:14:51<@JAA>Ok, just got two more of these in the past 20 minutes on 253 and 255.
01:16:45<balrog>btw regarding IA issues: doing a cdx search on assemblergames.com freezes on page 48. I think JAA reproduced it
01:18:50<@JAA>Yeah
01:20:06<@JAA>little-things/ia-cdx-search --concurrency 4 --tries 3 'url=assemblergames.com&collapse=urlkey&fl=original&matchType=domain'
01:20:50<@JAA>From -ot:
01:20:50<@JAA>2023-09-19 19:38:27 UTC <@JAA> balrog: It appears that the API struggles with an extremely long URL.
01:20:53<@JAA>2023-09-19 19:39:01 UTC <@JAA> The last few it returns before breaking all start with 'http://www.assemblergames.com/forums/image/jpeg;base64,'...
01:25:59<@JAA>The ia-cdx-search command above eventually fails with something like:
01:25:59<@JAA>Error retrieving /cdx/search/cdx?url=assemblergames.com&collapse=urlkey&fl=original&matchType=domain&output=json&page=48: http.client.IncompleteRead IncompleteRead(3407076 bytes read)
01:26:17<@JAA>s,\s*$,,
01:28:01<@JAA>That may be in part due to the one-minute timeout, but if it takes one minute to transmit 3.4 MB of data, something's clearly wrong on the server side anyway.
04:13:56flashfire42 quits [Client Quit]
04:13:56kiska quits [Client Quit]
04:13:57s-crypt2 quits [Client Quit]
04:16:11flashfire42 joins
04:16:23s-crypt2 (s-crypt) joins
04:18:11kiska (kiska) joins
06:22:34qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
06:32:24Exorcism quits [Remote host closed the connection]
06:34:46Exorcism (exorcism) joins
06:37:11Exorcism quits [Remote host closed the connection]
06:37:58Exorcism (exorcism) joins
07:13:14Arcorann (Arcorann) joins
07:46:28Exorcism9 (exorcism) joins
07:48:31Exorcism quits [Read error: Connection reset by peer]
07:48:31Exorcism9 is now known as Exorcism
12:13:44imer quits [Ping timeout: 252 seconds]
13:36:36imer (imer) joins
13:52:59<HP_Archivist>pokechu22 and fireonlive: It's still working on some derivations, but the data from that Google Drive RL archive is now archived here: https://archive.org/details/radiolab-wnyc-radio-series
13:53:30<HP_Archivist>I also included a .zip of the entire thing as an alternative way to parse the data locally vs going through all the many files manually.
13:55:29<HP_Archivist>I specifically configured it to *not* create derivatives, but IA still created png audio spectrograms for each mp3
13:56:32<HP_Archivist>I'll probably post in Datahoarder as a follow up to this original post https://www.reddit.com/r/DataHoarder/comments/16n6sbg/redditor_provides_a_personal_archive_of_every/
13:58:47Exorcism quits [Remote host closed the connection]
14:01:09Exorcism (exorcism) joins
14:40:42Arcorann quits [Ping timeout: 265 seconds]
15:30:05HP_Archivist quits [Ping timeout: 252 seconds]
15:46:55Exorcism quits [Remote host closed the connection]
15:47:36Exorcism (exorcism) joins
15:55:05Exorcism quits [Remote host closed the connection]
15:55:43Exorcism (exorcism) joins
15:56:06<fireonlive>:)
16:23:09HP_Archivist (HP_Archivist) joins
18:15:05HP_Archivist quits [Ping timeout: 252 seconds]
18:15:47Exorcism quits [Remote host closed the connection]
18:16:47Exorcism (exorcism) joins
19:50:41HP_Archivist (HP_Archivist) joins
19:57:57Exorcism1 (exorcism) joins
19:59:55Exorcism quits [Read error: Connection reset by peer]
19:59:56Exorcism1 is now known as Exorcism
20:06:11systwi quits [Ping timeout: 252 seconds]
20:24:28systwi (systwi) joins
20:46:09Exorcism quits [Remote host closed the connection]
20:47:13Exorcism (exorcism) joins
21:19:44Exorcism quits [Remote host closed the connection]
21:20:44Exorcism (exorcism) joins
22:37:59igloo22225 quits [Ping timeout: 252 seconds]
22:38:13igloo222250 joins
22:39:28flashfire42 quits [Client Quit]
22:39:28kiska quits [Client Quit]
22:39:29s-crypt2 quits [Client Quit]
22:42:07flashfire42 joins
22:42:18s-crypt2 (s-crypt) joins
22:44:06kiska (kiska) joins
23:21:26Matthww11 quits [Quit: Ping timeout (120 seconds)]
23:22:23Matthww11 joins