| 00:08:37 | | benjins3 joins |
| 00:31:34 | | xkey (xkey) joins |
| 00:39:21 | | Mateon2 joins |
| 00:40:53 | | Mateon1 quits [Ping timeout: 272 seconds] |
| 00:40:53 | | Mateon2 is now known as Mateon1 |
| 00:50:49 | | Wohlstand quits [Client Quit] |
| 00:51:00 | | Doran quits [Client Quit] |
| 01:12:01 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
| 01:15:48 | | Wohlstand (Wohlstand) joins |
| 01:35:14 | | pabs (pabs) joins |
| 02:07:05 | <fireonlive> | gmax youtubes thrown in |
| 03:14:59 | | Wohlstand quits [Client Quit] |
| 03:26:19 | | za4k joins |
| 03:28:18 | <TheTechRobo> | JAA: Hmm, TAU track 900b3a seems to have the original_files thing already done for its track metadata |
| 03:29:16 | | za4k quits [Client Quit] |
| 03:30:34 | <TheTechRobo> | Do you have an example of a track that uses the cloudfront URL format _and_ stream_files? (I know you sent a URL earlier, but that doesn't include the ID.) |
| 03:36:40 | <TheTechRobo> | Ah, 0000ac seems to work |
| 03:38:58 | <@JAA> | :-) |
| 03:39:36 | <@JAA> | Maybe you can extract all the possible prefixes from one of the CDXs. |
| 03:52:36 | <TheTechRobo> | Do you mean all of the possible transformations? |
| 03:52:42 | <TheTechRobo> | If so, maybe, but I won't be doing that anytime soon. |
| 03:53:09 | <TheTechRobo> | But if someone reports a URL outside the known formats, I'll fix it. |
| 03:53:25 | <@JAA> | At least all the patterns, but yeah, that works, too. :-) |
| 03:53:58 | <TheTechRobo> | Currently trying to figure out why my substitution isn't substituting. |
| 03:54:05 | <TheTechRobo> | Python's regex module humbles me. |
| 04:04:07 | | Ruthalas598 (Ruthalas) joins |
| 04:08:10 | | Ruthalas59 quits [Ping timeout: 255 seconds] |
| 04:08:10 | | Ruthalas598 is now known as Ruthalas59 |
| 04:11:26 | | HP_Archivist quits [Quit: Leaving] |
| 04:11:50 | | HP_Archivist (HP_Archivist) joins |
| 04:18:14 | <TheTechRobo> | Done! https://tau.thetechrobo.ca/ |
| 04:18:30 | <TheTechRobo> | Source code: https://github.com/TheTechRobo/the-artist-union-getAudio/tree/master |
| 04:18:47 | <@JAA> | \o/ |
| 04:18:52 | <@JAA> | TheTechRobo++ |
| 04:18:52 | <eggdrop> | [karma] 'TheTechRobo' now has 4 karma! |
| 04:21:18 | <fireonlive> | TheTechRobo++ |
| 04:21:19 | <eggdrop> | [karma] 'TheTechRobo' now has 5 karma! |
| 04:21:39 | <TheTechRobo> | :D |
| 04:23:16 | | DogsRNice_ quits [Read error: Connection reset by peer] |
| 05:06:32 | | TheTechRobo quits [Read error: Connection reset by peer] |
| 05:06:32 | | ScenarioPlanet quits [Read error: Connection reset by peer] |
| 05:06:32 | | Pedrosso quits [Read error: Connection reset by peer] |
| 05:06:59 | | Pedrosso joins |
| 05:07:03 | | ScenarioPlanet (ScenarioPlanet) joins |
| 05:07:16 | | TheTechRobo (TheTechRobo) joins |
| 05:11:32 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
| 05:14:22 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 05:39:19 | | xkey quits [Client Quit] |
| 05:48:10 | | xkey (xkey) joins |
| 05:48:10 | | xkey quits [Client Quit] |
| 05:52:57 | | JohnnyJ quits [Client Quit] |
| 05:55:27 | | JohnnyJ joins |
| 06:30:39 | | BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com] |
| 07:05:03 | | Unholy2361 quits [Remote host closed the connection] |
| 07:06:18 | | Unholy2361 (Unholy2361) joins |
| 07:28:14 | | michaelblob quits [Read error: Connection reset by peer] |
| 07:29:23 | | michaelblob (michaelblob) joins |
| 07:30:02 | | HP_Archivist quits [Read error: Connection reset by peer] |
| 07:31:18 | | Dango360_ joins |
| 07:33:49 | | Dango360 quits [Ping timeout: 255 seconds] |
| 08:01:43 | | _Dango360 joins |
| 08:06:07 | | Dango360_ quits [Ping timeout: 272 seconds] |
| 08:08:15 | <fireonlive> | which one of you is "Bear" :p |
| 08:09:22 | | nicolas17 quits [Ping timeout: 255 seconds] |
| 08:09:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+81, + Eric Graebener): https://wiki.archiveteam.org/?diff=52013&oldid=51975 |
| 08:10:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+16, also time ranges in introduction): https://wiki.archiveteam.org/?diff=52014&oldid=52013 |
| 08:12:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+48, bus-stop.net date cut off at 2021): https://wiki.archiveteam.org/?diff=52015&oldid=52014 |
| 08:13:26 | | nicolas17 joins |
| 09:00:04 | | Bleo182600 quits [Client Quit] |
| 09:01:23 | | Bleo182600 joins |
| 09:53:47 | | michaelblob quits [Ping timeout: 272 seconds] |
| 09:58:43 | | PredatorIWD_ joins |
| 10:01:23 | | PredatorIWD quits [Ping timeout: 272 seconds] |
| 10:56:44 | | fuzzy8021 quits [Read error: Connection reset by peer] |
| 10:57:07 | | decky_e_ joins |
| 10:57:10 | | fuzzy8021 (fuzzy8021) joins |
| 10:57:28 | | Bleo182600 quits [Client Quit] |
| 10:57:46 | | Bleo182600 joins |
| 10:59:09 | | michaelblob (michaelblob) joins |
| 11:00:17 | | decky quits [Ping timeout: 272 seconds] |
| 11:01:23 | | fuzzy8021 quits [Read error: Connection reset by peer] |
| 11:01:49 | | decky joins |
| 11:01:59 | | _Dango360 quits [Client Quit] |
| 11:02:02 | | fuzzy8021 (fuzzy8021) joins |
| 11:02:20 | | Dango360 (Dango360) joins |
| 11:05:21 | | decky_e_ quits [Ping timeout: 272 seconds] |
| 11:08:31 | | michaelblob quits [Ping timeout: 272 seconds] |
| 11:13:25 | | Barto quits [Ping timeout: 255 seconds] |
| 11:14:37 | | xkey (xkey) joins |
| 11:37:15 | | JaffaCakes118 quits [Client Quit] |
| 11:37:33 | | JaffaCakes118 (JaffaCakes118) joins |
| 11:59:47 | | nic8 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:00:32 | | nic8 (nic) joins |
| 12:04:00 | <immibis> | Im Weltenbaulab ist ein Switch auf 172.30.0.0/23 "alien" netz. Im c-lab ist 10... netz. Kann ich zur 10... im Weltenbaulab verbinden? |
| 12:05:39 | <@arkiver> | immibis: wrong channel? |
| 12:17:51 | | a joins |
| 12:18:52 | | a quits [Client Quit] |
| 12:37:36 | | raxxy-137409 quits [Quit: No Ping reply in 180 seconds.] |
| 12:37:48 | | raxxy-137409 joins |
| 12:38:04 | <immibis> | yes |
| 12:43:52 | | benjins3 quits [Ping timeout: 255 seconds] |
| 12:50:10 | | Arcorann quits [Ping timeout: 255 seconds] |
| 12:59:48 | | nic8 quits [Client Quit] |
| 13:00:36 | | nic8 (nic) joins |
| 13:23:26 | | UwU quits [Quit: bye] |
| 13:28:28 | | MrMcNuggets (MrMcNuggets) joins |
| 13:29:26 | | MrMcNuggets quits [Client Quit] |
| 13:30:25 | | VerifiedJ9 quits [Quit: The Lounge - https://thelounge.chat] |
| 13:31:00 | | VerifiedJ9 (VerifiedJ) joins |
| 13:31:54 | | JaffaCakes118 quits [Remote host closed the connection] |
| 13:32:17 | | JaffaCakes118 (JaffaCakes118) joins |
| 14:02:56 | | itachi1706 quits [Client Quit] |
| 14:10:09 | | itachi1706 (itachi1706) joins |
| 14:10:56 | | benjins3 joins |
| 15:02:01 | | midou quits [Ping timeout: 255 seconds] |
| 15:10:34 | | benjins3 quits [Ping timeout: 255 seconds] |
| 15:40:36 | | kiryu quits [Remote host closed the connection] |
| 15:44:19 | | kiryu joins |
| 15:44:19 | | kiryu is now authenticated as kiryu |
| 15:44:19 | | kiryu quits [Changing host] |
| 15:44:19 | | kiryu (kiryu) joins |
| 15:44:31 | | kiryu quits [Read error: Connection reset by peer] |
| 15:48:54 | | kiryu (kiryu) joins |
| 16:07:08 | | midou joins |
| 16:09:55 | | kiryu_ joins |
| 16:13:34 | | kiryu quits [Ping timeout: 255 seconds] |
| 16:21:14 | | kiryu_ quits [Read error: Connection reset by peer] |
| 16:21:41 | | kiryu_ joins |
| 16:42:04 | | JaffaCakes118 quits [Remote host closed the connection] |
| 16:47:31 | | JaffaCakes118 (JaffaCakes118) joins |
| 16:52:15 | | Guest joins |
| 16:53:19 | <Guest> | is it possible to get html content from archived pages in a warc file? |
| 16:58:56 | | JaffaCakes118 quits [Client Quit] |
| 16:59:11 | | JaffaCakes118 (JaffaCakes118) joins |
| 17:00:23 | | BearFortress_ quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 17:11:56 | | benjins3 joins |
| 17:21:48 | <kiska> | If you have the WARC use warcio or something similar |
| 17:22:44 | <kiska> | If you want to get it programatically, otherwise use pywb or webrecorder |
| 17:52:46 | | RJHacker59000 quits [Client Quit] |
| 17:52:59 | | hexa- joins |
| 17:53:29 | | hexa- is now known as RJHacker21239 |
| 17:57:36 | | RJHacker21239 quits [Client Quit] |
| 17:57:51 | | hexa- joins |
| 17:58:20 | | hexa- is now known as RJHacker20063 |
| 17:58:47 | | RJHacker20063 quits [Client Quit] |
| 17:59:02 | | hexa joins |
| 17:59:31 | | hexa is now known as RJHacker77361 |
| 18:01:45 | | RJHacker77361 quits [Client Quit] |
| 18:01:59 | | hexa- joins |
| 18:02:28 | | hexa- is now known as RJHacker91973 |
| 18:03:27 | | RJHacker91973 is now authenticated as hexa- |
| 18:03:27 | | RJHacker91973 quits [Changing host] |
| 18:03:27 | | RJHacker91973 (hexa-) joins |
| 18:03:31 | | RJHacker91973 is now known as hexa- |
| 18:03:55 | | hexa- quits [Quit: WeeChat 4.1.1] |
| 18:04:10 | | hexa- (hexa-) joins |
| 18:10:08 | <Guest> | i have the warc file and im using warcio, but record.content_stream().read() does not return anything |
| 18:11:43 | <Guest> | replayweb.page is able to load the contents when i upload the file though |
| 18:38:28 | | zhongfu quits [Remote host closed the connection] |
| 18:38:50 | | zhongfu (zhongfu) joins |
| 18:47:10 | <Guest> | currently experimenting with reading the raw warc file, i think i found something there |
| 19:00:23 | | nulldata quits [Quit: The Lounge - https://thelounge.chat] |
| 19:00:46 | | nulldata (nulldata) joins |
| 19:03:12 | | katocala joins |
| 19:03:12 | | katocala is now authenticated as katocala |
| 19:08:30 | | etnguyen03 (etnguyen03) joins |
| 19:45:35 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
| 19:46:05 | | eroc1990 (eroc1990) joins |
| 19:54:20 | | ThreeHM_ (ThreeHeadedMonkey) joins |
| 19:56:05 | | ThreeHM quits [Ping timeout: 272 seconds] |
| 20:04:49 | | etnguyen03 quits [Client Quit] |
| 20:08:54 | | superkuh_ quits [Read error: Connection reset by peer] |
| 20:11:41 | | ThreeHM_ is now known as ThreeHM |
| 20:21:31 | | tzt quits [Ping timeout: 255 seconds] |
| 20:22:26 | <@OrIdow6> | Is it Zstd-compressed? |
| 20:22:40 | <@OrIdow6> | Don't remember if warcio supports those |
| 20:25:15 | <@JAA> | It does not. |
| 20:29:00 | <fireonlive> | warcio-- |
| 20:29:00 | <eggdrop> | [karma] 'warcio' now has -2 karma! |
| 21:12:26 | | Unholy2361 quits [Client Quit] |
| 21:13:13 | | Unholy2361 (Unholy2361) joins |
| 21:14:59 | | jacksonchen666 (jacksonchen666) joins |
| 21:15:31 | | BlueMaxima joins |
| 21:17:56 | | treora quits [Remote host closed the connection] |
| 21:17:57 | | treora joins |
| 21:18:05 | | treora quits [Remote host closed the connection] |
| 21:18:06 | | treora joins |
| 21:18:10 | | treora quits [Remote host closed the connection] |
| 21:18:10 | | treora joins |
| 21:33:35 | | tzt (tzt) joins |
| 21:53:44 | | jacksonchen666 quits [Client Quit] |
| 21:55:02 | | etnguyen03 (etnguyen03) joins |
| 22:00:31 | | wickedplayer494 quits [Ping timeout: 255 seconds] |
| 22:00:34 | | jacksonchen666 (jacksonchen666) joins |
| 22:08:10 | | wickedplayer494 joins |
| 22:08:42 | | wickedplayer494 is now authenticated as wickedplayer494 |
| 22:21:15 | | etnguyen03 quits [Client Quit] |
| 22:21:57 | | etnguyen03 (etnguyen03) joins |
| 22:31:44 | | etnguyen03 quits [Client Quit] |
| 22:32:26 | | etnguyen03 (etnguyen03) joins |
| 22:42:11 | | etnguyen03 quits [Client Quit] |
| 22:42:53 | | etnguyen03 (etnguyen03) joins |
| 22:54:53 | | pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat] |
| 23:13:41 | | Arcorann (Arcorann) joins |
| 23:24:07 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+209, cybersecurityacademy.com - redirected from two…): https://wiki.archiveteam.org/?diff=52016&oldid=51993 |
| 23:24:19 | <Guest> | kiska i couldnt get it by reading the raw warc file, do you know any other way? |
| 23:29:08 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+57, brightkite.com, [[LinkTree]]): https://wiki.archiveteam.org/?diff=52017&oldid=52016 |
| 23:31:09 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+40, BrightKite currently returns an HTTP 502.): https://wiki.archiveteam.org/?diff=52018&oldid=52017 |
| 23:43:11 | <h2ibot> | Bear created Brightkite (+902, Another tombstone in the Internet cemetery.): https://wiki.archiveteam.org/?title=Brightkite |
| 23:53:59 | | etnguyen03 quits [Client Quit] |
| 23:54:41 | | etnguyen03 (etnguyen03) joins |
| 23:58:31 | | HP_Archivist (HP_Archivist) joins |