00:01:22dm4v quits [Read error: Connection reset by peer]
00:02:09dm4v joins
00:02:11dm4v quits [Changing host]
00:02:11dm4v (dm4v) joins
00:24:42Ryz quits [Remote host closed the connection]
00:26:29Ryz (Ryz) joins
00:32:04Ryz quits [Remote host closed the connection]
00:33:50Ryz (Ryz) joins
00:48:44tzt quits [Read error: Connection reset by peer]
01:31:33OrIdow6 (OrIdow6) joins
01:31:33@ChanServ sets mode: +o OrIdow6
01:43:29tzt (tzt) joins
01:47:29wickedplayer494 quits [Ping timeout: 244 seconds]
01:52:13wickedplayer494 joins
02:12:40heart_ joins
02:12:45AntiLiberal joins
02:14:16heart_ quits [Changing host]
02:14:16heart_ (heart) joins
02:58:36britmob joins
03:55:13qw3rty__ joins
03:59:02qw3rty_ quits [Ping timeout: 250 seconds]
04:07:03sec^nd quits [Ping timeout: 258 seconds]
04:11:49sec^nd (second) joins
04:18:06DogsRNice quits [Read error: Connection reset by peer]
04:56:40nicolas17 quits [Ping timeout: 250 seconds]
05:53:58Viniter69 (Viniter) joins
06:01:04<tech234a>Super Mario 64 for web, doubt this will stay up long https://froggi.es/mario/
06:34:27<@OrIdow6>tech234a: Looks like people have SPNd it, does it work for you like that?
06:34:31<@OrIdow6>Live version does not work for me
06:35:15<@OrIdow6>DW review:
06:35:28<@OrIdow6>FloydHub (still alive) - paid, not much to archive (IIRC)
06:36:04<@OrIdow6>mods.io - todo
06:36:11<@OrIdow6>i.e. not archived AFAICT
06:36:23<@OrIdow6>xtube - in progress
06:37:07<@OrIdow6>Google Drive - I have finally put some time aside and written a sort-of-working API client for it today, if all goes well should be on time
06:52:50Justin[home] joins
06:54:06DopefishJustin quits [Ping timeout: 250 seconds]
06:58:38<tech234a>OrIdow6: live version works for me, SPN version does not
06:58:48<tech234a>iOS 14.7.1 Safari
07:01:39<@OrIdow6>tech234a: Trying on another browser, looks like Wombat isn't catching the wasm request
07:02:54<@OrIdow6>So it gets blocked
07:03:24<@OrIdow6>It's there, though
07:03:27<@OrIdow6>In the WBM
07:04:05<@OrIdow6>Isn't catching and consequently isn't rewriting, I should say
07:05:18<@OrIdow6>For those who don't know what this means, it's all in the Wayback Machine already, but at present it does not play back
07:10:26<@OrIdow6>Also:
07:10:31<@OrIdow6>e2bn - not responding for me
07:10:44<@OrIdow6>spectrum.chat - only getting frozen, not shut down
07:51:15Megame quits [Client Quit]
08:18:39BlueMaxima quits [Read error: Connection reset by peer]
08:18:53BlueMaxima joins
09:29:04BlueMaxima quits [Client Quit]
09:44:20bitbit (bitbit) joins
09:49:05<rewby>OrIdow6, tech234a: I grabbed a copy of it using grab-site and when I play that back via pywb it works just fine. So looks like something about the WBM is funky if it has the files.
10:01:35rsn joins
10:03:37rsn_ quits [Ping timeout: 252 seconds]
10:08:42Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
10:09:03Terbium joins
10:40:24rsn_ joins
10:42:14rsn quits [Ping timeout: 244 seconds]
11:04:22qwertyasdfuiopghjkl joins
11:36:55rsn joins
11:38:46rsn_ quits [Ping timeout: 252 seconds]
11:59:08<@JAA>FloydHub has a lot to archive. Jake's size estimate was ~35 TB.
11:59:45<@JAA>Cf. in here on 2021-07-24.
12:00:49<@HCross>OrIdow6: thoughts on Google Drive playback in the WBM?
12:01:04<@HCross>will it just be a case of getting the data safe into the IA
12:01:10<@HCross>and then waiting for the WBM to support it?
12:03:17<@JAA>FloydHub data is still accessible it seems despite the precise deadline of 2021-08-20 17:00 PDT on https://www.floydhub.com/shutdown .
12:25:34bitbit quits [Client Quit]
12:30:59<@OrIdow6>HCross: Technically I haven't seen anything to exclude WBM playback, but it strikes me as unlikely
12:31:20<@OrIdow6>URLs look like this: https://clients6.google.com/batch/drive/v2beta?$ct=multipart/mixed; boundary="=====yerrqj4as8f====="&key=AIzaSyC1qbk75NzWBvSaDh6KnsjjA9pIrP4lYIE
12:32:33<@OrIdow6>(https://clients6.google.com/batch/drive/v2beta?%24ct=multipart%2Fmixed%3B%20boundary%3D%22%3D%3D%3D%3D%3Dyerrqj4as8f%3D%3D%3D%3D%3D%22&key=AIzaSyC1qbk75NzWBvSaDh6KnsjjA9pIrP4lYIE)
12:33:06<@OrIdow6>And then those are multipart POST requests
12:33:49<@OrIdow6>I suspect that at least some of these parameters somewhere are generated randomly in JS
12:35:30<@OrIdow6>Namely the multipart boundary
12:39:28<@OrIdow6>And then of course the WBM doesn't care about the POST request body anyway, so the only way it would work with today's WBM would be if the boundaries were all deterministic from the request for the first page, and didn't collide
12:40:02bitbit (bitbit) joins
12:40:51<@OrIdow6>(It may or may not use real multipart, as usual I don't know enough about HTTP)
12:41:55<@OrIdow6>Anyhow, maybe that can be tested, but my plan was to skip the multipart and directly make the requests that it proxies
12:42:38<@OrIdow6>Since it should be easy to write a small tool to get them from the WBM in that case
12:43:27<@OrIdow6>And I do think that individual file downloads at or past the uc?export stage will work
12:47:14<@OrIdow6>Maybe
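The two forms of the batch URL quoted above differ only in percent-encoding of the `$ct` query parameter. A minimal sketch with Python's standard library reproduces the encoded form from the raw one. Encoding the name and value separately with `safe=""` is an assumption chosen to match the quoted URL, not a statement about how Google's client builds it, and the `key` parameter is left out.

```python
from urllib.parse import quote, unquote

# Raw parameter name and value as they appear in the first quoted URL.
raw_name = "$ct"
raw_value = 'multipart/mixed; boundary="=====yerrqj4as8f====="'

# Percent-encode everything outside the unreserved set (safe="" also encodes "/").
encoded_name = quote(raw_name, safe="")
encoded_value = quote(raw_value, safe="")
encoded_pair = f"{encoded_name}={encoded_value}"

print(encoded_pair)

# Decoding gives back the raw value, boundary string included.
assert unquote(encoded_value) == raw_value
```

Because the boundary string is part of the URL, a randomly generated boundary would yield a different URL for every request, which is the playback concern raised above.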
13:14:48rsn_ joins
13:17:13rsn quits [Ping timeout: 252 seconds]
13:19:25AntiLiberal quits [Ping timeout: 252 seconds]
13:25:32LeGoupil joins
14:21:49qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
14:21:58qwertyasdfuiopghjkl joins
14:26:32nertzy (nertzy) joins
14:42:10nertzy quits [Client Quit]
14:48:48nertzy (nertzy) joins
14:55:06yawkat` quits [Ping timeout: 250 seconds]
15:04:11Arcorann quits [Ping timeout: 244 seconds]
15:45:22AntiLiberal joins
15:58:01nertzy quits [Client Quit]
15:59:58yawkat (yawkat) joins
16:07:44Megame (Megame) joins
16:37:16nertzy (nertzy) joins
17:27:18qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
17:28:39qwertyasdfuiopghjkl joins
17:30:29nicolas17 joins
17:51:28<Ryz>From #archivebot - in response to http://thetruthofgodsword.com/ content being relocated to https://www.sermonaudio.com/source_detail.asp?sourceid=hggueho - <@JAA> "Someone should archive SermonAudio someday, I guess." "Although that could be quite sizeable at 2 million recordings..."
17:52:46Ruthalas quits [Ping timeout: 252 seconds]
18:03:59qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
18:12:41qwertyasdfuiopghjkl joins
18:22:13bitbit quits [Client Quit]
18:27:27Ruthalas (Ruthalas) joins
18:30:51qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
18:34:07Ruthalas quits [Client Quit]
18:34:29Ruthalas (Ruthalas) joins
18:40:00nicolas17 quits [Ping timeout: 250 seconds]
18:42:49nicolas17 joins
18:47:32DogsRNice (Webuser299) joins
18:49:28qwertyasdfuiopghjkl joins
18:58:45qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
19:10:36nertzy quits [Client Quit]
19:17:06qwertyasdfuiopghjkl joins
19:20:50Iki1 joins
19:24:04Iki quits [Ping timeout: 244 seconds]
19:24:17spirit quits [Client Quit]
19:28:43Iki1 quits [Ping timeout: 244 seconds]
19:36:52<@OrIdow6>Has anyone done any substantial work on that video game thing?
19:36:59<@OrIdow6>mods.io
20:07:06Gereon6 quits [Ping timeout: 250 seconds]
20:09:48<@OrIdow6>Also, anyone know a statistically valid way to make estimates for Google Drive? Folders may include other folders and other files, but those subfolders and files might be sourced separately from the parent
20:10:53<@OrIdow6>That is, some links from the discovery phase may be contained within other links
20:11:30<@OrIdow6>I am planning on doing deduplication (a folder queues items for each file in it, or similar), so I don't know how to do the estimate
20:13:32<@OrIdow6>Again, since otherwise someone will tell me the obvious: I know I can "randomly pick some and check their sizes"; I am looking for a way to get the overall size estimate, preferably with margin of error etc.
20:13:50<@OrIdow6>Don't want to think about this if someone already knows
20:15:31<[42]>i'm assuming you already considered querying the official drive api for that stuff?
20:15:54<[42]>i don't know about the volume for that and relevant rate limits though
20:16:24<@OrIdow6>I don't think the official drive API has a way to see the total size of everything stored in Google Drive?
20:16:47<[42]>not for everything but for individual items at least
20:16:56<@OrIdow6>I already have that capability
20:19:54<[42]>would it be that bad to query all of them? or at least a significant number?
20:20:35<[42]>looks like you could easily do just over 17m api calls per day per account
20:21:19<@OrIdow6>It is undesirable to have to do so
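One textbook way to get a total with a margin of error, without querying every item, is to expand a simple random sample of the discovered links to the full population. The sketch below is not something proposed in the channel. It assumes each sampled link can be given an "attributed size" that counts every file's bytes exactly once across all links, for instance by dividing each file's size by the number of discovered links that contain it. That attribution is the hard part and is not solved here. The function name `estimate_total` and its parameters are hypothetical.

```python
import math
import statistics

def estimate_total(sample_sizes, population_count, z=1.96):
    """Estimate a population total from a simple random sample drawn without replacement.

    sample_sizes: attributed sizes (e.g. bytes) of the sampled links (assumption:
        overlapping folders and files were already split so nothing is double-counted).
    population_count: total number of discovered links.
    z: normal quantile for the interval (1.96 is roughly 95%).
    Returns (estimated_total, margin_of_error).
    """
    n = len(sample_sizes)
    if n < 2:
        raise ValueError("need at least two sampled items")
    if n > population_count:
        raise ValueError("sample cannot be larger than the population")
    mean = statistics.fmean(sample_sizes)
    sd = statistics.stdev(sample_sizes)  # sample standard deviation (n - 1)
    # Finite population correction: the interval shrinks as the sample nears a census.
    fpc = math.sqrt(1 - n / population_count)
    total = population_count * mean
    margin = z * population_count * sd / math.sqrt(n) * fpc
    return total, margin
```

File sizes tend to be heavy-tailed, so the normal-approximation interval can be too narrow unless the sample is large or the links are stratified, for example folders separately from single files.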
20:46:41LeGoupil quits [Client Quit]
20:53:54sec^nd quits [Remote host closed the connection]
20:54:20sec^nd (second) joins
21:05:04Iki joins
21:50:55knecht420 quits [Read error: Connection reset by peer]
21:51:03knecht420 (knecht420) joins
21:55:31xit quits [Quit: Ping timeout (120 seconds)]
21:55:51xit joins
22:27:33nertzy (nertzy) joins
22:32:37Jake0 (Jake) joins
22:34:22Jake quits [Ping timeout: 252 seconds]
22:34:22Jake0 is now known as Jake
22:42:28nertzy quits [Client Quit]
22:54:55Ruthalas quits [Client Quit]
22:55:14Ruthalas (Ruthalas) joins
23:15:10Arcorann (Arcorann) joins
23:50:07nertzy (nertzy) joins