00:08:37benjins3 joins
00:31:34xkey (xkey) joins
00:39:21Mateon2 joins
00:40:53Mateon1 quits [Ping timeout: 272 seconds]
00:40:53Mateon2 is now known as Mateon1
00:50:49Wohlstand quits [Client Quit]
00:51:00Doran quits [Client Quit]
01:12:01pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
01:15:48Wohlstand (Wohlstand) joins
01:35:14pabs (pabs) joins
02:07:05<fireonlive>gmax youtubes thrown in
03:14:59Wohlstand quits [Client Quit]
03:26:19za4k joins
03:28:18<TheTechRobo>JAA: Hmm, TAU track 900b3a seems to have the original_files thing already done for its track metadata
03:29:16za4k quits [Client Quit]
03:30:34<TheTechRobo>Do you have an example of a track that uses the cloudfront URL format _and_ stream_files? (I know you sent a URL earlier, but that doesn't include the ID.)
03:36:40<TheTechRobo>Ah, 0000ac seems to work
03:38:58<@JAA>:-)
03:39:36<@JAA>Maybe you can extract all the possible prefixes from one of the CDXs.
03:52:36<TheTechRobo>Do you mean all of the possible transformations?
03:52:42<TheTechRobo>If so, maybe, but I won't be doing that anytime soon.
03:53:09<TheTechRobo>But if someone reports a URL outside the known formats, I'll fix it.
03:53:25<@JAA>At least all the patterns, but yeah, that works, too. :-)
03:53:58<TheTechRobo>Currently trying to figure out why my substitution isn't substituting.
03:54:05<TheTechRobo>Python's regex module humbles me.
04:04:07Ruthalas598 (Ruthalas) joins
04:08:10Ruthalas59 quits [Ping timeout: 255 seconds]
04:08:10Ruthalas598 is now known as Ruthalas59
04:11:26HP_Archivist quits [Quit: Leaving]
04:11:50HP_Archivist (HP_Archivist) joins
04:18:14<TheTechRobo>Done! https://tau.thetechrobo.ca/
04:18:30<TheTechRobo>Source code: https://github.com/TheTechRobo/the-artist-union-getAudio/tree/master
04:18:47<@JAA>\o/
04:18:52<@JAA>TheTechRobo++
04:18:52<eggdrop>[karma] 'TheTechRobo' now has 4 karma!
04:21:18<fireonlive>TheTechRobo++
04:21:19<eggdrop>[karma] 'TheTechRobo' now has 5 karma!
04:21:39<TheTechRobo>:D
04:23:16DogsRNice_ quits [Read error: Connection reset by peer]
05:06:32TheTechRobo quits [Read error: Connection reset by peer]
05:06:32ScenarioPlanet quits [Read error: Connection reset by peer]
05:06:32Pedrosso quits [Read error: Connection reset by peer]
05:06:59Pedrosso joins
05:07:03ScenarioPlanet (ScenarioPlanet) joins
05:07:16TheTechRobo (TheTechRobo) joins
05:11:32qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds]
05:14:22qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
05:39:19xkey quits [Client Quit]
05:48:10xkey (xkey) joins
05:48:10xkey quits [Client Quit]
05:52:57JohnnyJ quits [Client Quit]
05:55:27JohnnyJ joins
06:30:39BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com]
07:05:03Unholy2361 quits [Remote host closed the connection]
07:06:18Unholy2361 (Unholy2361) joins
07:28:14michaelblob quits [Read error: Connection reset by peer]
07:29:23michaelblob (michaelblob) joins
07:30:02HP_Archivist quits [Read error: Connection reset by peer]
07:31:18Dango360_ joins
07:33:49Dango360 quits [Ping timeout: 255 seconds]
08:01:43_Dango360 joins
08:06:07Dango360_ quits [Ping timeout: 272 seconds]
08:08:15<fireonlive>which one of you is "Bear" :p
08:09:22nicolas17 quits [Ping timeout: 255 seconds]
08:09:26<h2ibot>Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+81, + Eric Graebener): https://wiki.archiveteam.org/?diff=52013&oldid=51975
08:10:26<h2ibot>Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+16, also time ranges in introduction): https://wiki.archiveteam.org/?diff=52014&oldid=52013
08:12:26<h2ibot>Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+48, bus-stop.net date cut off at 2021): https://wiki.archiveteam.org/?diff=52015&oldid=52014
08:13:26nicolas17 joins
09:00:04Bleo182600 quits [Client Quit]
09:01:23Bleo182600 joins
09:53:47michaelblob quits [Ping timeout: 272 seconds]
09:58:43PredatorIWD_ joins
10:01:23PredatorIWD quits [Ping timeout: 272 seconds]
10:56:44fuzzy8021 quits [Read error: Connection reset by peer]
10:57:07decky_e_ joins
10:57:10fuzzy8021 (fuzzy8021) joins
10:57:28Bleo182600 quits [Client Quit]
10:57:46Bleo182600 joins
10:59:09michaelblob (michaelblob) joins
11:00:17decky quits [Ping timeout: 272 seconds]
11:01:23fuzzy8021 quits [Read error: Connection reset by peer]
11:01:49decky joins
11:01:59_Dango360 quits [Client Quit]
11:02:02fuzzy8021 (fuzzy8021) joins
11:02:20Dango360 (Dango360) joins
11:05:21decky_e_ quits [Ping timeout: 272 seconds]
11:08:31michaelblob quits [Ping timeout: 272 seconds]
11:13:25Barto quits [Ping timeout: 255 seconds]
11:14:37xkey (xkey) joins
11:37:15JaffaCakes118 quits [Client Quit]
11:37:33JaffaCakes118 (JaffaCakes118) joins
11:59:47nic8 quits [Quit: The Lounge - https://thelounge.chat]
12:00:32nic8 (nic) joins
12:04:00<immibis>Im Weltenbaulab ist ein Switch auf 172.30.0.0/23 "alien" netz. Im c-lab ist 10... netz. Kann ich zur 10... im Weltenbaulab verbinden?
12:05:39<@arkiver>immibis: wrong channel?
12:17:51a joins
12:18:52a quits [Client Quit]
12:37:36raxxy-137409 quits [Quit: No Ping reply in 180 seconds.]
12:37:48raxxy-137409 joins
12:38:04<immibis>yes
12:43:52benjins3 quits [Ping timeout: 255 seconds]
12:50:10Arcorann quits [Ping timeout: 255 seconds]
12:59:48nic8 quits [Client Quit]
13:00:36nic8 (nic) joins
13:23:26UwU quits [Quit: bye]
13:28:28MrMcNuggets (MrMcNuggets) joins
13:29:26MrMcNuggets quits [Client Quit]
13:30:25VerifiedJ9 quits [Quit: The Lounge - https://thelounge.chat]
13:31:00VerifiedJ9 (VerifiedJ) joins
13:31:54JaffaCakes118 quits [Remote host closed the connection]
13:32:17JaffaCakes118 (JaffaCakes118) joins
14:02:56itachi1706 quits [Client Quit]
14:10:09itachi1706 (itachi1706) joins
14:10:56benjins3 joins
15:02:01midou quits [Ping timeout: 255 seconds]
15:10:34benjins3 quits [Ping timeout: 255 seconds]
15:40:36kiryu quits [Remote host closed the connection]
15:44:19kiryu joins
15:44:19kiryu quits [Changing host]
15:44:19kiryu (kiryu) joins
15:44:31kiryu quits [Read error: Connection reset by peer]
15:48:54kiryu (kiryu) joins
16:07:08midou joins
16:09:55kiryu_ joins
16:13:34kiryu quits [Ping timeout: 255 seconds]
16:21:14kiryu_ quits [Read error: Connection reset by peer]
16:21:41kiryu_ joins
16:42:04JaffaCakes118 quits [Remote host closed the connection]
16:47:31JaffaCakes118 (JaffaCakes118) joins
16:52:15Guest joins
16:53:19<Guest>is it possible to get html content from archived pages in a warc file?
16:58:56JaffaCakes118 quits [Client Quit]
16:59:11JaffaCakes118 (JaffaCakes118) joins
17:00:23BearFortress_ quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
17:11:56benjins3 joins
17:21:48<kiska>If you have the WARC use warcio or something similar
17:22:44<kiska>If you want to get it programatically, otherwise use pywb or webrecorder
17:52:46RJHacker59000 quits [Client Quit]
17:52:59hexa- joins
17:53:29hexa- is now known as RJHacker21239
17:57:36RJHacker21239 quits [Client Quit]
17:57:51hexa- joins
17:58:20hexa- is now known as RJHacker20063
17:58:47RJHacker20063 quits [Client Quit]
17:59:02hexa joins
17:59:31hexa is now known as RJHacker77361
18:01:45RJHacker77361 quits [Client Quit]
18:01:59hexa- joins
18:02:28hexa- is now known as RJHacker91973
18:03:27RJHacker91973 quits [Changing host]
18:03:27RJHacker91973 (hexa-) joins
18:03:31RJHacker91973 is now known as hexa-
18:03:55hexa- quits [Quit: WeeChat 4.1.1]
18:04:10hexa- (hexa-) joins
18:10:08<Guest>i have the warc file and im using warcio, but record.content_stream().read() does not return anything
18:11:43<Guest>replayweb.page is able to load the contents when i upload the file though
18:38:28zhongfu quits [Remote host closed the connection]
18:38:50zhongfu (zhongfu) joins
18:47:10<Guest>currently experimenting with reading the raw warc file, i think i found something there
19:00:23nulldata quits [Quit: The Lounge - https://thelounge.chat]
19:00:46nulldata (nulldata) joins
19:03:12katocala joins
19:08:30etnguyen03 (etnguyen03) joins
19:45:35eroc1990 quits [Quit: The Lounge - https://thelounge.chat]
19:46:05eroc1990 (eroc1990) joins
19:54:20ThreeHM_ (ThreeHeadedMonkey) joins
19:56:05ThreeHM quits [Ping timeout: 272 seconds]
20:04:49etnguyen03 quits [Client Quit]
20:08:54superkuh_ quits [Read error: Connection reset by peer]
20:11:41ThreeHM_ is now known as ThreeHM
20:21:31tzt quits [Ping timeout: 255 seconds]
20:22:26<@OrIdow6>Is it Zstd-compressed?
20:22:40<@OrIdow6>Don't remember if warcio supports those
20:25:15<@JAA>It does not.
20:29:00<fireonlive>warcio--
20:29:00<eggdrop>[karma] 'warcio' now has -2 karma!
21:12:26Unholy2361 quits [Client Quit]
21:13:13Unholy2361 (Unholy2361) joins
21:14:59jacksonchen666 (jacksonchen666) joins
21:15:31BlueMaxima joins
21:17:56treora quits [Remote host closed the connection]
21:17:57treora joins
21:18:05treora quits [Remote host closed the connection]
21:18:06treora joins
21:18:10treora quits [Remote host closed the connection]
21:18:10treora joins
21:33:35tzt (tzt) joins
21:53:44jacksonchen666 quits [Client Quit]
21:55:02etnguyen03 (etnguyen03) joins
22:00:31wickedplayer494 quits [Ping timeout: 255 seconds]
22:00:34jacksonchen666 (jacksonchen666) joins
22:08:10wickedplayer494 joins
22:21:15etnguyen03 quits [Client Quit]
22:21:57etnguyen03 (etnguyen03) joins
22:31:44etnguyen03 quits [Client Quit]
22:32:26etnguyen03 (etnguyen03) joins
22:42:11etnguyen03 quits [Client Quit]
22:42:53etnguyen03 (etnguyen03) joins
22:54:53pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat]
23:13:41Arcorann (Arcorann) joins
23:24:07<h2ibot>Bear edited List of websites excluded from the Wayback Machine (+209, cybersecurityacademy.com - redirected from two…): https://wiki.archiveteam.org/?diff=52016&oldid=51993
23:24:19<Guest>kiska i couldnt get it by reading the raw warc file, do you know any other way?
23:29:08<h2ibot>Bear edited List of websites excluded from the Wayback Machine (+57, brightkite.com, [[LinkTree]]): https://wiki.archiveteam.org/?diff=52017&oldid=52016
23:31:09<h2ibot>Bear edited List of websites excluded from the Wayback Machine (+40, BrightKite currently returns an HTTP 502.): https://wiki.archiveteam.org/?diff=52018&oldid=52017
23:43:11<h2ibot>Bear created Brightkite (+902, Another tombstone in the Internet cemetery.): https://wiki.archiveteam.org/?title=Brightkite
23:53:59etnguyen03 quits [Client Quit]
23:54:41etnguyen03 (etnguyen03) joins
23:58:31HP_Archivist (HP_Archivist) joins