00:07:09cy joins
00:09:35onetruth joins
00:19:45sepro quits [Quit: Bye!]
00:31:14qwertyasdfuiopghjkl quits [Client Quit]
00:38:01qwertyasdfuiopghjkl joins
00:38:25bonga quits [Ping timeout: 265 seconds]
00:39:08bonga joins
00:45:23cy quits [Remote host closed the connection]
00:51:08bonga quits [Read error: Connection reset by peer]
00:52:08bonga joins
02:03:31dm4v_ joins
02:04:06dm4v quits [Ping timeout: 265 seconds]
02:04:06dm4v_ is now known as dm4v
02:04:07dm4v quits [Changing host]
02:04:07dm4v (dm4v) joins
02:34:52<@OrIdow6>So I wrote a much simpler script (function inside of a script) to extract the dicts from the zstd warcs
02:36:20<@OrIdow6>Did have me thinking, would there be any interest in a program that, given a gzipped or plaintext warc, trains a dictionary on it specifically then "converts" it to a zstd warc with that dict? Might make for a space-saving measure in some places
02:53:00GarveyPatrickD joins
02:53:26<@arkiver>OrIdow6: yeah that's been an idea for a while
02:54:08<@arkiver>next to that also some URL agnostic deduplication on a certain scale
02:54:14<@arkiver>not sure yet what
02:55:36<@arkiver>it's a delicate process though
03:05:04SM quits [Remote host closed the connection]
03:05:29katocala quits [Ping timeout: 265 seconds]
03:25:50<@JAA>Yeah, it's something I've kept at the back of my mind in the planning of pywarc as well.
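Note on the dictionary idea raised at 02:36:20: the following is a minimal sketch of why a shared dictionary can save space when records are compressed one at a time. It is not OrIdow6's script or any existing tool. As assumptions for illustration only, it uses the preset-dictionary support of Python's standard zlib module as a stand-in for zstd, it hand-picks the dictionary instead of training one, and the sample records and all function names are made up.

```python
import zlib

# Hypothetical shared dictionary: byte strings that recur across records.
# A real converter would train this from the WARC itself; zlib has no
# trainer, so the dictionary here is written by hand. zlib recommends
# placing the most common strings at the end of the dictionary.
SHARED_DICT = (
    b"<html><head><title></title></head><body></body></html>"
    b"HTTP/1.1 200 OK\r\nContent-Type: text/html; charset=utf-8\r\n"
    b"Server: nginx\r\n\r\n"
)


def compress_record(record: bytes, zdict: bytes = b"") -> bytes:
    """Compress one record on its own, optionally primed with a dictionary."""
    if zdict:
        comp = zlib.compressobj(level=9, zdict=zdict)
    else:
        comp = zlib.compressobj(level=9)
    return comp.compress(record) + comp.flush()


def decompress_record(blob: bytes, zdict: bytes = b"") -> bytes:
    """Reverse compress_record; the same dictionary must be supplied."""
    if zdict:
        decomp = zlib.decompressobj(zdict=zdict)
    else:
        decomp = zlib.decompressobj()
    return decomp.decompress(blob) + decomp.flush()


def make_record(n: int) -> bytes:
    """Build a made-up HTTP response that shares boilerplate with its siblings."""
    return (
        b"HTTP/1.1 200 OK\r\nContent-Type: text/html; charset=utf-8\r\n"
        b"Server: nginx\r\n\r\n"
        b"<html><head><title>Page %d</title></head>"
        b"<body>Item %d</body></html>" % (n, n)
    )


def total_size(records, zdict: bytes = b"") -> int:
    """Sum of the per-record compressed sizes."""
    return sum(len(compress_record(r, zdict)) for r in records)


RECORDS = [make_record(i) for i in range(3)]

if __name__ == "__main__":
    print("without dictionary:", total_size(RECORDS))
    print("with dictionary:   ", total_size(RECORDS, SHARED_DICT))
```

Because each record is compressed independently, the compressor cannot learn the shared boilerplate from neighbouring records; the dictionary supplies it up front. The dictionary has to be stored alongside the data, since decompression fails without it.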
03:34:03syntaxx1 (syntaxx) joins
03:34:58syntaxx quits [Ping timeout: 265 seconds]
03:34:58syntaxx1 is now known as syntaxx
03:35:12<Jake>pywarc 👀
03:35:26Sluggs quits [Ping timeout: 240 seconds]
03:36:03Sluggs joins
03:42:32katocala joins
03:54:41GarveyPatrickD quits [Remote host closed the connection]
03:59:19<@JAA>Still WIP albeit stalled for several months now. I need to get back to that.
04:40:58nirv joins
04:41:09hackbug quits [Remote host closed the connection]
04:42:59hackbug (hackbug) joins
04:55:13fuzzy8021 quits [Read error: Connection reset by peer]
04:56:49fuzzy8021 (fuzzy8021) joins
05:35:41<@JAA>TIL BPO has some really high issue numbers. Most are under 47k or so, but then there's this: https://bugs.python.org/issue433030 https://bugs.python.org/issue672115 https://bugs.python.org/issue1703592 FUN!
05:36:40<@JAA>The migration was pushed back by two weeks and is planned to begin tomorrow, specifically 2022-03-24 20:00Z.
05:39:04<@JAA>Total number of issues is about 59k according to the stats page, so that suggests there are quite a few of those oddly numbered issues.
05:47:31<@JAA>I got a list of what I think should be all issues. Will continue to look into this later.
05:49:12<@JAA>I intend to grab a copy of at least the issue pages and attachments before the migration just in case. (They did say they'll keep it online in read-only mode.)
05:51:13BlueMaxima quits [Client Quit]
06:11:12<h2ibot>Petchea edited Deathwatch (+25): https://wiki.archiveteam.org/?diff=48414&oldid=48404
06:11:13<h2ibot>Adrmcr edited Roblox (+0, made audios offline): https://wiki.archiveteam.org/?diff=48415&oldid=48397
06:12:12<h2ibot>JustAnotherArchivist edited Deathwatch (+16, /* 2022 */ BPO deadline shifted): https://wiki.archiveteam.org/?diff=48416&oldid=48414
06:18:13themadpro89 joins
06:19:40<themadpro89>Hey, there's about a week to go until the Stack Overflow Jobs/Dev Story shutdown https://meta.stackoverflow.com/questions/415293/sunsetting-jobs-developer-story
06:20:09<themadpro89>Was there never a project organised for this? Since SE dumps self-archives anyway? https://wiki.archiveteam.org/index.php/Stack_Exchange
06:20:11themadpro89 quits [Remote host closed the connection]
06:21:56<@JAA>I'm pretty sure the job board isn't included in those dumps.
06:23:06<@JAA>But no, don't think anyone did anything about this. The dev story stuff is behind a login wall, I believe.
06:59:08<themadpro>Dunno about job board, but dev story isn't https://stackoverflow.com/users/story/137794
06:59:22<themadpro>I can access this one without logging in just fine
07:00:15<themadpro>https://usercontent.irccloud-cdn.com/file/RV2zWSxU/devstorysample-user%3A137794.png
07:39:47Lord_Nightmare quits [Quit: ZNC - http://znc.in]
07:42:07Lord_Nightmare (Lord_Nightmare) joins
08:21:53LeGoupil joins
08:32:34LeGoupil quits [Ping timeout: 265 seconds]
09:47:37march_happy quits [Ping timeout: 265 seconds]
09:48:30march_happy (march_happy) joins
09:56:47<h2ibot>Flashfire42 edited URLTeam (-11, /* Warrior projects */): https://wiki.archiveteam.org/?diff=48417&oldid=48176
10:31:42<@OrIdow6>binzyboi: Any updates on Buzzly?
10:36:17<binzyboi>Not yet. No announcements about the site have been made, but people are still leaving the site and posting their socials.
10:37:10<binzyboi>Three of the staff members from last time have officially been removed from the team, so it's just CHStark and Vay.
10:37:59<binzyboi>CHStark has deleted a lot of posts made by users on his page, but that's not of too much concern I don't think since it was mainly people shitposting there
10:41:38<@OrIdow6>Alright, thank you
11:00:10syntaxx9 (syntaxx) joins
11:00:20syntaxx quits [Read error: Connection reset by peer]
11:00:20syntaxx9 is now known as syntaxx
11:31:16LeGoupil joins
11:34:47mikael quits [Ping timeout: 265 seconds]
11:43:21Megame (Megame) joins
11:55:43mikael joins
13:13:31keykey joins
13:14:00keykey quits [Remote host closed the connection]
13:17:46girst quits [Ping timeout: 240 seconds]
13:19:53HackMii quits [Client Quit]
13:22:10HackMii (hacktheplanet) joins
13:26:33girst (girst) joins
13:28:51Arcorann quits [Ping timeout: 265 seconds]
13:53:22sepro joins
14:13:53GarveyPatrickD joins
14:58:06<@JAA>themadpro: Huh, maybe that changed? I know I got loginwalled when I looked into it briefly after the announcement. Or maybe users can set their dev story to private and I just got unlucky.
15:31:46<@arkiver>VerifiedJ: I just saw the s.mil.ru URLs list on IA
15:32:05<@arkiver>if you have important lists of random URLs, feel free to ping me and we can queue them in the #// (URLs) project
15:32:19qwertyasdfuiopghjkl quits [Client Quit]
15:39:58LeGoupil quits [Ping timeout: 265 seconds]
15:40:24qwertyasdfuiopghjkl joins
16:25:37LeGoupil joins
16:27:41LeGoupil quits [Read error: Connection reset by peer]
16:27:45LeGoupil1 joins
16:30:09LeGoupil1 is now known as LeGoupil
16:58:11sonick quits [Client Quit]
17:02:51LeGoupil1 joins
17:04:54LeGoupil quits [Ping timeout: 265 seconds]
17:04:54LeGoupil1 is now known as LeGoupil
17:21:19mgrytbak quits [Quit: The Lounge - https://thelounge.chat]
17:22:08mgrytbak joins
17:54:56Ryz quits [Quit: Ping timeout (120 seconds)]
17:54:56IDK_ quits [Quit: Ping timeout (120 seconds)]
17:55:43IDK_ joins
17:55:49Ryz (Ryz) joins
18:08:30Mateon1 quits [Remote host closed the connection]
18:09:00Mateon1 joins
18:10:09bonga quits [Ping timeout: 265 seconds]
18:12:12bonga joins
18:32:27sonick (sonick) joins
18:45:55sepro quits [Ping timeout: 265 seconds]
19:01:45GarveyPatrickD quits [Remote host closed the connection]
19:25:11LeGoupil quits [Client Quit]
19:31:51pabs quits [Read error: Connection reset by peer]
19:32:39Megame quits [Client Quit]
20:17:45bonga quits [Ping timeout: 265 seconds]
20:18:35bonga joins
20:19:42VerifiedJ quits [Quit: The Lounge - https://thelounge.chat]
20:20:27VerifiedJ (VerifiedJ) joins
20:23:42bonga quits [Read error: Connection reset by peer]
20:23:54bonga joins
20:44:57Megame (Megame) joins
21:20:13GarveyPatrickD (GarveyPatrickD) joins
21:36:35<h2ibot>GarveyPatrickD edited ArchiveBot (+155, /* People */ Internet Archive serves ->…): https://wiki.archiveteam.org/?diff=48420&oldid=48411
21:36:36<h2ibot>Themadprogramer edited Deathwatch (+16, Chowhound shutdown deadline updated): https://wiki.archiveteam.org/?diff=48421&oldid=48416
21:36:37<h2ibot>GarveyPatrickD edited Talk:ArchiveBot (+160, DigitalOcean price change): https://wiki.archiveteam.org/?diff=48422&oldid=37782
22:27:03bonga quits [Remote host closed the connection]
22:30:00bonga joins
22:44:49BlueMaxima joins
22:45:00Arcorann (Arcorann) joins
23:24:17march_happy quits [Read error: Connection reset by peer]