00:05:12 | | sralracer quits [Quit: Ooops, wrong browser tab.] |
02:17:11 | | nicolas17 joins |
02:17:13 | | nicolas17 is now authenticated as nicolas17 |
03:03:45 | | SootBector quits [Remote host closed the connection] |
03:04:07 | | SootBector (SootBector) joins |
03:20:43 | | nicolas17 quits [Client Quit] |
03:28:05 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
03:29:24 | | datechnoman (datechnoman) joins |
04:11:50 | | Jake quits [Quit: Leaving for a bit!] |
04:12:46 | | Jake (Jake) joins |
04:47:28 | | DogsRNice quits [Read error: Connection reset by peer] |
08:00:52 | | qwertyasdfuiopghjkl2 quits [Ping timeout: 260 seconds] |
09:20:48 | | qwertyasdfuiopghjkl2 (qwertyasdfuiopghjkl2) joins |
10:05:40 | | mls (mls) joins |
11:08:42 | | SF quits [Ping timeout: 260 seconds] |
11:21:20 | | SF joins |
11:22:52 | | sralracer (sralracer) joins |
12:30:15 | | mls quits [Client Quit] |
13:58:25 | <pabs> | arkiver: I noticed that SPN thinks connection closed mid-way through a download means success. should I report that to the email contact or ? |
13:58:43 | <pabs> | examples: |
13:58:43 | <pabs> | https://web.archive.org/web/20241204042513/https://static.riseup.net/mtl4riseup/mtl4riseup-wav.zip |
13:58:44 | <pabs> | https://web.archive.org/web/20241204042527/https://static.riseup.net/mtl4riseup/mtl4riseup-flac.zip |
13:59:42 | <pabs> | J_A_A mentioned that the WBM doesn't support merging resumed downloads btw |
14:13:37 | | ThreeHM_ quits [Ping timeout: 260 seconds] |
14:47:37 | | ThreeHM (ThreeHeadedMonkey) joins |
15:41:52 | | DigitalDragons quits [Quit: Ping timeout (120 seconds)] |
15:41:52 | | Exorcism quits [Quit: Ping timeout (120 seconds)] |
15:50:05 | | AlsoHP_Archivist quits [Quit: Leaving] |
18:22:24 | | DigitalDragons (DigitalDragons) joins |
18:24:55 | | HP_Archivist (HP_Archivist) joins |
19:52:21 | <@JAA> | Notably, the WBM doesn't return a 'warning' header about the truncation either. |
19:54:06 | <@JAA> | The CDX API does return the truncated length value though: {"urlkey": "net,riseup,static)/mtl4riseup/mtl4riseup-flac.zip", "timestamp": "20241204042527", "original": "https://static.riseup.net/mtl4riseup/mtl4riseup-flac.zip", "mimetype": "application/zip", "statuscode": "200", "digest": "YXWKFZPZIA5DRGN2YFH3U3TOJBEKELOH", "length": "252874111"} |
20:04:50 | <@arkiver> | they really should return a warning |
20:33:21 | | DigitalDragons quits [Client Quit] |
20:35:36 | | DigitalDragons (DigitalDragons) joins |
21:28:35 | | Exorcism (exorcism) joins |
21:49:55 | | KoalaBear joins |
21:50:22 | | KoalaBear84 quits [Ping timeout: 260 seconds] |
22:44:34 | <TheTechRobo> | Is it possible to make the CDX API return captures with any combination of query parameters, except require one specific parameter? |
22:45:25 | <TheTechRobo> | Specifically for https://www.youtube.com/oembed?format=json&url=... — I don't care if the `format` parameter is set to json or even if it's there or not, I just care that the URL field is something specific. |
22:45:55 | <TheTechRobo> | I'd also be fine with getting all captures for /oembed that contain a url parameter and filtering it on my end. Although that might get large. |
22:48:45 | <pokechu22> | Are there captures of oembed that don't contain a URL parameter? |
22:48:54 | <pokechu22> | oh, you want oembed for a specific video |
22:50:15 | <TheTechRobo> | pokechu22: I tried https://web.archive.org/cdx/search/cdx?output=json&url=https://www.youtube.com/oembed to get everything (to filter on my end) but I don'T see anything with query parameters. |
22:51:57 | <pokechu22> | I generally steal URLs from https://web.archive.org/web/*/https://www.youtube.com/oembed* - you want at least match=prefix and collapse=urlkey |
22:52:02 | <@JAA> | collapse=urlkey should take care of that. |
22:52:17 | <@JAA> | There's a bit of rubbish, but the query params start soon enough. |
22:52:39 | <@JAA> | Not sure about filtering by parameter value directly. |
22:52:51 | <TheTechRobo> | Thank you! I'll sort it on my end assuming it isn't too big |
22:53:01 | <TheTechRobo> | Does ia-cdx-search handle pagination on its own? (Is pagination still broken?) |
22:53:03 | <@JAA> | It might be possible with `filter`. |
22:53:29 | <@JAA> | Uh, I think I worked around the bugs. |
22:54:04 | <@JAA> | Which doesn't mean there might not be new ones by now. :-P |
22:55:57 | <TheTechRobo> | ia-cdx-search 'url=https://www.youtube.com/oembed&collapse=urlkey&matchType=prefix' > stuff |
22:55:57 | <TheTechRobo> | Is that correct? |
22:57:08 | <@JAA> | That looks about right, yeah. |
23:00:02 | | SootBector quits [Remote host closed the connection] |
23:00:28 | | SootBector (SootBector) joins |
23:57:19 | | PredatorIWD2 quits [Read error: Connection reset by peer] |