| 00:04:50 | | IDK quits [Client Quit] |
| 00:19:20 | <@JAA> | New scan is running now with events. I also realised that the API doesn't return caption URLs on the list endpoints, so I need to retrieve each entry individually as well. |
| 00:19:26 | <@JAA> | Haha qwarc goes brrrrr :-) |
| 00:22:13 | <@JAA> | 1066 events and 29453 sessions (= individual talks at those events) according to the API |
| 01:00:01 | | dm4v quits [Client Quit] |
| 01:01:34 | | dm4v joins |
| 01:01:36 | | dm4v is now authenticated as dm4v |
| 01:01:36 | | dm4v quits [Changing host] |
| 01:01:36 | | dm4v (dm4v) joins |
| 01:04:41 | | piennu joins |
| 01:05:21 | <piennu> | the people running yuku.com are doing something fucky, half of the links at https://www.google.com/search?q=blinkies+site%3Ayuku.com are 404s |
| 01:06:54 | | lennier1 quits [Client Quit] |
| 01:07:31 | | lennier1 (lennier1) joins |
| 01:10:36 | <@JAA> | Not surprised. Tapatalk is horrible. |
| 01:11:44 | | piennu leaves |
| 02:02:38 | | dm4v quits [Read error: Connection reset by peer] |
| 02:04:20 | | dm4v joins |
| 02:04:22 | | dm4v is now authenticated as dm4v |
| 02:04:22 | | dm4v quits [Changing host] |
| 02:04:22 | | dm4v (dm4v) joins |
| 02:08:40 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 02:16:07 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 02:16:21 | | BlueMaxima joins |
| 02:16:42 | | fuzzy8021 quits [Read error: Connection reset by peer] |
| 02:17:09 | | fuzzy8021 (fuzzy8021) joins |
| 02:52:03 | | Ctrl-S quits [Read error: Connection reset by peer] |
| 02:52:07 | | Ryz2 quits [Quit: Ping timeout (120 seconds)] |
| 02:52:07 | | Ctrl-S joins |
| 02:52:20 | | Ryz2 (Ryz) joins |
| 03:14:49 | | ThreeHM quits [Ping timeout: 265 seconds] |
| 03:16:17 | | ThreeHM (ThreeHeadedMonkey) joins |
| 04:04:07 | | ThreeHM quits [Ping timeout: 265 seconds] |
| 04:04:48 | | ThreeHM (ThreeHeadedMonkey) joins |
| 04:15:56 | | qw3rty__ joins |
| 04:18:55 | | wizzito joins |
| 04:19:05 | <wizzito> | Hey guys. back again with a few more things |
| 04:19:12 | <wizzito> | first is that making an edit gives me a 503 now |
| 04:19:22 | <wizzito> | second is more of an archival request |
| 04:19:40 | | qw3rty_ quits [Ping timeout: 252 seconds] |
| 04:22:13 | <wizzito> | there's an organization named Common Sense Media which has been posting reviews for kids' things and popular media for years, one of the categories being websites |
| 04:22:20 | <wizzito> | and a lot of those websites are either dead or dying |
| 04:22:36 | <wizzito> | some of them havent updated since 2008-2010 but the reviews are still up |
| 04:22:50 | <wizzito> | I was wondering if we could go through those reviews and archive some of the lesser known or older sites |
| 04:23:05 | <wizzito> | https://www.commonsensemedia.org/website-reviews?sort=field_review_recommended_age&order=asc |
| 04:23:29 | <wizzito> | ex. 3rd result on this page (Tikety Toc) is a dead link |
| 04:24:36 | <wizzito> | they do have some websites as 'This product is no longer available' though so they care somewhat but probably not enough |
| 04:27:24 | <wizzito> | another example: this one hasnt been updated since 2004 and the link is dead (leads to a 404) but no 'no longer available' message https://www.commonsensemedia.org/website-reviews/diamond-r-ranch-web-site |
| 04:27:37 | <wizzito> | so quite a bit of these websites might not have much time left |
| 04:32:26 | <@JAA> | Yeah, that sounds like a good idea. |
| 04:33:59 | <wizzito> | so many dead links on there that havent been given any attention |
| 04:47:22 | <@JAA> | Ok, second analysis of Channel 9 is done. Looks like it's a fair bit larger. From another 1% sample of all 51k entries (= show episodes) and 29k sessions (= event talks), I'm getting a total size estimate of 23 TiB when getting the largest video file on each item. The majority of this are the sessions, which tend to be hour-long videos while the entries are often shorter. |
| 04:49:17 | <@JAA> | With this check (largest file per item rather than high-quality MP4 or WMV separately), the earlier sample gets to 9.7 TiB, by the way. |
| 04:49:41 | <@JAA> | Also, there are some really fun cases. https://channel9.msdn.com/Blogs/pdc2008/TL42 is both an entry and a session, for example. |
| 04:50:21 | <@JAA> | Classic Microsoft, I guess. |
| 04:50:42 | <@JAA> | arkiver: ^ |
| 04:51:49 | <@JAA> | Not an insignificant size, but I think it'd be worth it to grab in full. It's a quite significant and popular tech reference for the Windows world as I understand it. |
| 04:53:38 | <wizzito> | wow, 9 tb |
| 04:53:55 | <@JAA> | 23, the 9.7 is just a subset. |
| 04:55:18 | <@JAA> | Oh yeah, there'd be some small miscellaneous stuff like thumbnails, captions, and of course the web pages, but that's surely well below 1 TiB anyway. |
| 04:55:20 | <wizzito> | oh |
| 04:55:23 | <wizzito> | neat |
| 05:01:51 | | wizzito quits [Remote host closed the connection] |
| 05:59:08 | <h2ibot> | Tech234a edited YouTube (+216, /* Discussions/Channel Comments (October 2021)…): https://wiki.archiveteam.org/?diff=47758&oldid=47716 |
| 06:02:08 | <h2ibot> | Tech234a edited YouTube (+4, /* Discussions/Channel Comments (October 2021)…): https://wiki.archiveteam.org/?diff=47759&oldid=47758 |
| 06:33:01 | | wizards_ quits [Ping timeout: 258 seconds] |
| 06:34:34 | | wizards_ joins |
| 06:35:27 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
| 06:36:18 | | datechnoman (datechnoman) joins |
| 06:50:08 | | jtagcat quits [Quit: Bye!] |
| 06:52:29 | | jtagcat (jtagcat) joins |
| 06:59:58 | | BlueMaxima_ joins |
| 07:03:34 | | BlueMaxima quits [Ping timeout: 252 seconds] |
| 07:20:50 | | Eighty quits [Ping timeout: 265 seconds] |
| 07:22:21 | | Eighty (Eighty) joins |
| 08:14:55 | | Atom joins |
| 08:16:25 | | Atom__ quits [Ping timeout: 265 seconds] |
| 09:12:04 | | sonick quits [Client Quit] |
| 10:33:10 | | BlueMaxima_ quits [Client Quit] |
| 11:10:29 | | sonick (sonick) joins |
| 11:39:54 | | IDK (IDK) joins |
| 12:04:15 | | netherx86 joins |
| 12:05:28 | <netherx86> | cs61a |
| 12:13:12 | | netherx86 quits [Remote host closed the connection] |
| 13:02:45 | | TheTechRobo (TheTechRobo) joins |
| 13:05:07 | | Atom quits [Read error: Connection reset by peer] |
| 13:06:33 | | Atom joins |
| 13:26:15 | | Atom quits [Ping timeout: 258 seconds] |
| 13:26:38 | | HackMii quits [Ping timeout: 258 seconds] |
| 13:36:22 | | atphoenix_ (atphoenix) joins |
| 13:36:50 | | superkuh__ joins |
| 13:36:50 | | superkuh_ quits [Read error: Connection reset by peer] |
| 13:39:34 | | atphoenix quits [Ping timeout: 252 seconds] |
| 13:42:49 | | HP_Archivist (HP_Archivist) joins |
| 13:51:06 | | qwertyasdfuiopghjkl joins |
| 13:55:31 | | HackMii (hacktheplanet) joins |
| 14:02:02 | | Atom joins |
| 14:07:22 | | datechnoman3 (datechnoman) joins |
| 14:08:02 | | datechnoman quits [Ping timeout: 258 seconds] |
| 14:08:02 | | datechnoman3 is now known as datechnoman |
| 14:08:37 | | Atom quits [Read error: Connection reset by peer] |
| 14:12:12 | | Atom joins |
| 14:15:37 | | Atom quits [Read error: Connection reset by peer] |
| 14:18:46 | | Atom joins |
| 14:30:11 | | Megame (Megame) joins |
| 14:30:16 | | AlsoHP_Archivist joins |
| 14:33:43 | | HP_Archivist quits [Ping timeout: 258 seconds] |
| 14:36:41 | | HackMii quits [Remote host closed the connection] |
| 14:42:49 | | HackMii (hacktheplanet) joins |
| 15:32:53 | | wyatt8750 joins |
| 15:34:17 | | wyatt8740 quits [Ping timeout: 258 seconds] |
| 15:42:59 | | hexa- quits [Quit: WeeChat 3.1] |
| 15:46:05 | | hexa- (hexa-) joins |
| 15:55:03 | | Megame quits [Client Quit] |
| 16:01:23 | | Nick joins |
| 16:01:52 | | Nick is now known as RJHacker91576 |
| 16:05:15 | | RJHacker91576 quits [Remote host closed the connection] |
| 16:15:21 | | AlsoHP_Archivist quits [Client Quit] |
| 16:15:39 | | HP_Archivist (HP_Archivist) joins |
| 16:59:00 | | Eighty quits [Ping timeout: 258 seconds] |
| 17:00:46 | | Eighty (Eighty) joins |
| 17:14:37 | | HackMii quits [Remote host closed the connection] |
| 17:15:11 | | HackMii (hacktheplanet) joins |
| 17:49:26 | | LeGoupil joins |
| 17:54:32 | | qwertyasdfuiopghjkl47 joins |
| 17:56:56 | | qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds] |
| 18:22:35 | | phuzion quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 18:22:42 | | phuzion (phuzion) joins |
| 18:24:52 | | phuzion quits [Client Quit] |
| 18:25:13 | | phuzion (phuzion) joins |
| 18:27:12 | | wizards joins |
| 18:28:19 | | wizards_ quits [Ping timeout: 265 seconds] |
| 18:28:25 | | wizards_ joins |
| 18:31:37 | | wizards quits [Ping timeout: 252 seconds] |
| 19:04:37 | <h2ibot> | Ayanami edited List of website hosts (+140, +carrd): https://wiki.archiveteam.org/?diff=47760&oldid=47166 |
| 19:25:26 | | HP_Archivist quits [Ping timeout: 258 seconds] |
| 20:21:59 | | HackMii_ (hacktheplanet) joins |
| 20:22:55 | | HackMii quits [Remote host closed the connection] |
| 20:30:39 | | Megame (Megame) joins |
| 20:44:02 | | HP_Archivist (HP_Archivist) joins |
| 20:53:40 | | AlsoHP_Archivist joins |
| 20:57:22 | | HP_Archivist quits [Ping timeout: 252 seconds] |
| 21:34:32 | | LeGoupil quits [Client Quit] |
| 21:55:48 | | Megame quits [Client Quit] |
| 22:14:06 | | AlsoHP_Archivist quits [Ping timeout: 258 seconds] |
| 23:14:54 | | BlueMaxima joins |
| 23:34:13 | | katocala quits [Ping timeout: 258 seconds] |
| 23:34:24 | | katocala joins |
| 23:55:14 | | katocala is now authenticated as katocala |