00:08:37 | | benjins3 joins |
00:31:34 | | xkey (xkey) joins |
00:39:21 | | Mateon2 joins |
00:40:53 | | Mateon1 quits [Ping timeout: 272 seconds] |
00:40:53 | | Mateon2 is now known as Mateon1 |
00:50:49 | | Wohlstand quits [Client Quit] |
00:51:00 | | Doran quits [Client Quit] |
01:12:01 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
01:15:48 | | Wohlstand (Wohlstand) joins |
01:35:14 | | pabs (pabs) joins |
02:07:05 | <fireonlive> | gmax youtubes thrown in |
03:14:59 | | Wohlstand quits [Client Quit] |
03:26:19 | | za4k joins |
03:28:18 | <TheTechRobo> | JAA: Hmm, TAU track 900b3a seems to have the original_files thing already done for its track metadata |
03:29:16 | | za4k quits [Client Quit] |
03:30:34 | <TheTechRobo> | Do you have an example of a track that uses the cloudfront URL format _and_ stream_files? (I know you sent a URL earlier, but that doesn't include the ID.) |
03:36:40 | <TheTechRobo> | Ah, 0000ac seems to work |
03:38:58 | <@JAA> | :-) |
03:39:36 | <@JAA> | Maybe you can extract all the possible prefixes from one of the CDXs. |
03:52:36 | <TheTechRobo> | Do you mean all of the possible transformations? |
03:52:42 | <TheTechRobo> | If so, maybe, but I won't be doing that anytime soon. |
03:53:09 | <TheTechRobo> | But if someone reports a URL outside the known formats, I'll fix it. |
03:53:25 | <@JAA> | At least all the patterns, but yeah, that works, too. :-) |
03:53:58 | <TheTechRobo> | Currently trying to figure out why my substitution isn't substituting. |
03:54:05 | <TheTechRobo> | Python's regex module humbles me. |
04:04:07 | | Ruthalas598 (Ruthalas) joins |
04:08:10 | | Ruthalas59 quits [Ping timeout: 255 seconds] |
04:08:10 | | Ruthalas598 is now known as Ruthalas59 |
04:11:26 | | HP_Archivist quits [Quit: Leaving] |
04:11:50 | | HP_Archivist (HP_Archivist) joins |
04:18:14 | <TheTechRobo> | Done! https://tau.thetechrobo.ca/ |
04:18:30 | <TheTechRobo> | Source code: https://github.com/TheTechRobo/the-artist-union-getAudio/tree/master |
04:18:47 | <@JAA> | \o/ |
04:18:52 | <@JAA> | TheTechRobo++ |
04:18:52 | <eggdrop> | [karma] 'TheTechRobo' now has 4 karma! |
04:21:18 | <fireonlive> | TheTechRobo++ |
04:21:19 | <eggdrop> | [karma] 'TheTechRobo' now has 5 karma! |
04:21:39 | <TheTechRobo> | :D |
04:23:16 | | DogsRNice_ quits [Read error: Connection reset by peer] |
05:06:32 | | TheTechRobo quits [Read error: Connection reset by peer] |
05:06:32 | | ScenarioPlanet quits [Read error: Connection reset by peer] |
05:06:32 | | Pedrosso quits [Read error: Connection reset by peer] |
05:06:59 | | Pedrosso joins |
05:07:03 | | ScenarioPlanet (ScenarioPlanet) joins |
05:07:16 | | TheTechRobo (TheTechRobo) joins |
05:11:32 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
05:14:22 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
05:39:19 | | xkey quits [Client Quit] |
05:48:10 | | xkey (xkey) joins |
05:48:10 | | xkey quits [Client Quit] |
05:52:57 | | JohnnyJ quits [Client Quit] |
05:55:27 | | JohnnyJ joins |
06:30:39 | | BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com] |
07:05:03 | | Unholy2361 quits [Remote host closed the connection] |
07:06:18 | | Unholy2361 (Unholy2361) joins |
07:28:14 | | michaelblob quits [Read error: Connection reset by peer] |
07:29:23 | | michaelblob (michaelblob) joins |
07:30:02 | | HP_Archivist quits [Read error: Connection reset by peer] |
07:31:18 | | Dango360_ joins |
07:33:49 | | Dango360 quits [Ping timeout: 255 seconds] |
08:01:43 | | _Dango360 joins |
08:06:07 | | Dango360_ quits [Ping timeout: 272 seconds] |
08:08:15 | <fireonlive> | which one of you is "Bear" :p |
08:09:22 | | nicolas17 quits [Ping timeout: 255 seconds] |
08:09:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+81, + Eric Graebener): https://wiki.archiveteam.org/?diff=52013&oldid=51975 |
08:10:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+16, also time ranges in introduction): https://wiki.archiveteam.org/?diff=52014&oldid=52013 |
08:12:26 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine/Partial exclusions (+48, bus-stop.net date cut off at 2021): https://wiki.archiveteam.org/?diff=52015&oldid=52014 |
08:13:26 | | nicolas17 joins |
09:00:04 | | Bleo182600 quits [Client Quit] |
09:01:23 | | Bleo182600 joins |
09:53:47 | | michaelblob quits [Ping timeout: 272 seconds] |
09:58:43 | | PredatorIWD_ joins |
10:01:23 | | PredatorIWD quits [Ping timeout: 272 seconds] |
10:56:44 | | fuzzy8021 quits [Read error: Connection reset by peer] |
10:57:07 | | decky_e_ joins |
10:57:10 | | fuzzy8021 (fuzzy8021) joins |
10:57:28 | | Bleo182600 quits [Client Quit] |
10:57:46 | | Bleo182600 joins |
10:59:09 | | michaelblob (michaelblob) joins |
11:00:17 | | decky quits [Ping timeout: 272 seconds] |
11:01:23 | | fuzzy8021 quits [Read error: Connection reset by peer] |
11:01:49 | | decky joins |
11:01:59 | | _Dango360 quits [Client Quit] |
11:02:02 | | fuzzy8021 (fuzzy8021) joins |
11:02:20 | | Dango360 (Dango360) joins |
11:05:21 | | decky_e_ quits [Ping timeout: 272 seconds] |
11:08:31 | | michaelblob quits [Ping timeout: 272 seconds] |
11:13:25 | | Barto quits [Ping timeout: 255 seconds] |
11:14:37 | | xkey (xkey) joins |
11:37:15 | | JaffaCakes118 quits [Client Quit] |
11:37:33 | | JaffaCakes118 (JaffaCakes118) joins |
11:59:47 | | nic8 quits [Quit: The Lounge - https://thelounge.chat] |
12:00:32 | | nic8 (nic) joins |
12:04:00 | <immibis> | In the Weltenbaulab there is a switch on the 172.30.0.0/23 "alien" network. In the c-lab there is the 10... network. Can I connect to the 10... in the Weltenbaulab? |
12:05:39 | <@arkiver> | immibis: wrong channel? |
12:17:51 | | a joins |
12:18:52 | | a quits [Client Quit] |
12:37:36 | | raxxy-137409 quits [Quit: No Ping reply in 180 seconds.] |
12:37:48 | | raxxy-137409 joins |
12:38:04 | <immibis> | yes |
12:43:52 | | benjins3 quits [Ping timeout: 255 seconds] |
12:50:10 | | Arcorann quits [Ping timeout: 255 seconds] |
12:59:48 | | nic8 quits [Client Quit] |
13:00:36 | | nic8 (nic) joins |
13:23:26 | | UwU quits [Quit: bye] |
13:28:28 | | MrMcNuggets (MrMcNuggets) joins |
13:29:26 | | MrMcNuggets quits [Client Quit] |
13:30:25 | | VerifiedJ9 quits [Quit: The Lounge - https://thelounge.chat] |
13:31:00 | | VerifiedJ9 (VerifiedJ) joins |
13:31:54 | | JaffaCakes118 quits [Remote host closed the connection] |
13:32:17 | | JaffaCakes118 (JaffaCakes118) joins |
14:02:56 | | itachi1706 quits [Client Quit] |
14:10:09 | | itachi1706 (itachi1706) joins |
14:10:56 | | benjins3 joins |
15:02:01 | | midou quits [Ping timeout: 255 seconds] |
15:10:34 | | benjins3 quits [Ping timeout: 255 seconds] |
15:40:36 | | kiryu quits [Remote host closed the connection] |
15:44:19 | | kiryu joins |
15:44:19 | | kiryu is now authenticated as kiryu |
15:44:19 | | kiryu quits [Changing host] |
15:44:19 | | kiryu (kiryu) joins |
15:44:31 | | kiryu quits [Read error: Connection reset by peer] |
15:48:54 | | kiryu (kiryu) joins |
16:07:08 | | midou joins |
16:09:55 | | kiryu_ joins |
16:13:34 | | kiryu quits [Ping timeout: 255 seconds] |
16:21:14 | | kiryu_ quits [Read error: Connection reset by peer] |
16:21:41 | | kiryu_ joins |
16:42:04 | | JaffaCakes118 quits [Remote host closed the connection] |
16:47:31 | | JaffaCakes118 (JaffaCakes118) joins |
16:52:15 | | Guest joins |
16:53:19 | <Guest> | is it possible to get html content from archived pages in a warc file? |
16:58:56 | | JaffaCakes118 quits [Client Quit] |
16:59:11 | | JaffaCakes118 (JaffaCakes118) joins |
17:00:23 | | BearFortress_ quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
17:11:56 | | benjins3 joins |
17:21:48 | <kiska> | If you have the WARC use warcio or something similar |
17:22:44 | <kiska> | If you want to get it programmatically, otherwise use pywb or webrecorder |
17:52:46 | | RJHacker59000 quits [Client Quit] |
17:52:59 | | hexa- joins |
17:53:29 | | hexa- is now known as RJHacker21239 |
17:57:36 | | RJHacker21239 quits [Client Quit] |
17:57:51 | | hexa- joins |
17:58:20 | | hexa- is now known as RJHacker20063 |
17:58:47 | | RJHacker20063 quits [Client Quit] |
17:59:02 | | hexa joins |
17:59:31 | | hexa is now known as RJHacker77361 |
18:01:45 | | RJHacker77361 quits [Client Quit] |
18:01:59 | | hexa- joins |
18:02:28 | | hexa- is now known as RJHacker91973 |
18:03:27 | | RJHacker91973 is now authenticated as hexa- |
18:03:27 | | RJHacker91973 quits [Changing host] |
18:03:27 | | RJHacker91973 (hexa-) joins |
18:03:31 | | RJHacker91973 is now known as hexa- |
18:03:55 | | hexa- quits [Quit: WeeChat 4.1.1] |
18:04:10 | | hexa- (hexa-) joins |
18:10:08 | <Guest> | i have the warc file and i'm using warcio, but record.content_stream().read() does not return anything |
18:11:43 | <Guest> | replayweb.page is able to load the contents when i upload the file though |
18:38:28 | | zhongfu quits [Remote host closed the connection] |
18:38:50 | | zhongfu (zhongfu) joins |
18:47:10 | <Guest> | currently experimenting with reading the raw warc file, i think i found something there |
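Reading the raw WARC file by hand, as Guest describes, can be sketched with the Python standard library alone. This is a minimal sketch under stated assumptions, not what Guest or warcio actually does: it assumes the file is uncompressed or gzip-compressed (not Zstandard), the function names `open_warc`, `iter_warc_records`, and `html_payloads` are hypothetical, and it does not undo HTTP `Transfer-Encoding: chunked` or `Content-Encoding`, so bodies stored that way come back still encoded.

```python
import gzip
import io


def open_warc(path):
    """Open a WARC file, transparently handling gzip (hypothetical helper)."""
    with open(path, "rb") as f:
        magic = f.read(2)
    if magic == b"\x1f\x8b":  # gzip magic bytes
        return gzip.open(path, "rb")
    return open(path, "rb")


def iter_warc_records(stream):
    """Yield (headers, block) for each record in a decompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if not line.strip():
            continue  # blank lines separating records
        if not line.startswith(b"WARC/"):
            raise ValueError("expected a WARC version line, got %r" % line[:40])
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the WARC header section
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        # Content-Length gives the size of the record block in bytes.
        block = stream.read(int(headers["content-length"]))
        yield headers, block


def html_payloads(stream):
    """Yield (target URI, HTTP body) for response records served as text/html."""
    for headers, block in iter_warc_records(stream):
        if headers.get("warc-type") != "response":
            continue
        # The block of a response record is the raw HTTP response:
        # status line and headers, a blank line, then the body.
        http_head, sep, body = block.partition(b"\r\n\r\n")
        if sep and b"text/html" in http_head.lower():
            yield headers.get("warc-target-uri"), body
```

Usage would look like `for uri, body in html_payloads(open_warc("example.warc.gz")): ...`, where the file name is a placeholder.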
19:00:23 | | nulldata quits [Quit: The Lounge - https://thelounge.chat] |
19:00:46 | | nulldata (nulldata) joins |
19:03:12 | | katocala joins |
19:03:12 | | katocala is now authenticated as katocala |
19:08:30 | | etnguyen03 (etnguyen03) joins |
19:45:35 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
19:46:05 | | eroc1990 (eroc1990) joins |
19:54:20 | | ThreeHM_ (ThreeHeadedMonkey) joins |
19:56:05 | | ThreeHM quits [Ping timeout: 272 seconds] |
20:04:49 | | etnguyen03 quits [Client Quit] |
20:08:54 | | superkuh_ quits [Read error: Connection reset by peer] |
20:11:41 | | ThreeHM_ is now known as ThreeHM |
20:21:31 | | tzt quits [Ping timeout: 255 seconds] |
20:22:26 | <@OrIdow6> | Is it Zstd-compressed? |
20:22:40 | <@OrIdow6> | Don't remember if warcio supports those |
20:25:15 | <@JAA> | It does not. |
20:29:00 | <fireonlive> | warcio-- |
20:29:00 | <eggdrop> | [karma] 'warcio' now has -2 karma! |
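OrIdow6's question can be checked without any WARC library by looking at the first bytes of the file. The sketch below is an illustration, not part of the conversation: `sniff_compression` is a hypothetical name, the magic numbers are those of the gzip and Zstandard formats, and the skippable-frame case assumes a `.warc.zst` that stores a dictionary in a leading skippable frame.

```python
GZIP_MAGIC = b"\x1f\x8b"
ZSTD_FRAME_MAGIC = b"\x28\xb5\x2f\xfd"  # 0xFD2FB528, little-endian


def sniff_compression(head: bytes) -> str:
    """Guess how a WARC file is compressed from its first few bytes."""
    if head.startswith(GZIP_MAGIC):
        return "gzip"
    if head.startswith(ZSTD_FRAME_MAGIC):
        return "zstd"
    # Zstandard skippable frames use magic 0x184D2A50..0x184D2A5F,
    # stored little-endian: first byte 0x50..0x5F, then 2A 4D 18.
    if len(head) >= 4 and (head[0] & 0xF0) == 0x50 and head[1:4] == b"\x2a\x4d\x18":
        return "zstd (skippable frame first)"
    if head.startswith(b"WARC/"):
        return "uncompressed"
    return "unknown"
```

Reading four bytes is enough, for example `sniff_compression(open(path, "rb").read(4))` with `path` pointing at the file in question.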
21:12:26 | | Unholy2361 quits [Client Quit] |
21:13:13 | | Unholy2361 (Unholy2361) joins |
21:14:59 | | jacksonchen666 (jacksonchen666) joins |
21:15:31 | | BlueMaxima joins |
21:17:56 | | treora quits [Remote host closed the connection] |
21:17:57 | | treora joins |
21:18:05 | | treora quits [Remote host closed the connection] |
21:18:06 | | treora joins |
21:18:10 | | treora quits [Remote host closed the connection] |
21:18:10 | | treora joins |
21:33:35 | | tzt (tzt) joins |
21:53:44 | | jacksonchen666 quits [Client Quit] |
21:55:02 | | etnguyen03 (etnguyen03) joins |
22:00:31 | | wickedplayer494 quits [Ping timeout: 255 seconds] |
22:00:34 | | jacksonchen666 (jacksonchen666) joins |
22:08:10 | | wickedplayer494 joins |
22:08:42 | | wickedplayer494 is now authenticated as wickedplayer494 |
22:21:15 | | etnguyen03 quits [Client Quit] |
22:21:57 | | etnguyen03 (etnguyen03) joins |
22:31:44 | | etnguyen03 quits [Client Quit] |
22:32:26 | | etnguyen03 (etnguyen03) joins |
22:42:11 | | etnguyen03 quits [Client Quit] |
22:42:53 | | etnguyen03 (etnguyen03) joins |
22:54:53 | | pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat] |
23:13:41 | | Arcorann (Arcorann) joins |
23:24:07 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+209, cybersecurityacademy.com - redirected from two…): https://wiki.archiveteam.org/?diff=52016&oldid=51993 |
23:24:19 | <Guest> | kiska i couldn't get it by reading the raw warc file, do you know any other way? |
23:29:08 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+57, brightkite.com, [[LinkTree]]): https://wiki.archiveteam.org/?diff=52017&oldid=52016 |
23:31:09 | <h2ibot> | Bear edited List of websites excluded from the Wayback Machine (+40, BrightKite currently returns an HTTP 502.): https://wiki.archiveteam.org/?diff=52018&oldid=52017 |
23:43:11 | <h2ibot> | Bear created Brightkite (+902, Another tombstone in the Internet cemetery.): https://wiki.archiveteam.org/?title=Brightkite |
23:53:59 | | etnguyen03 quits [Client Quit] |
23:54:41 | | etnguyen03 (etnguyen03) joins |
23:58:31 | | HP_Archivist (HP_Archivist) joins |