00:01:54 | | etnguyen03 (etnguyen03) joins |
00:08:34 | | DogsRNice quits [Read error: Connection reset by peer] |
00:16:59 | | Dango360 quits [Read error: Connection reset by peer] |
00:49:34 | | nicolas17 quits [Ping timeout: 258 seconds] |
00:53:49 | | nicolas17 joins |
00:56:42 | | Xanthon quits [Remote host closed the connection] |
00:58:45 | | lennier2_ quits [Ping timeout: 260 seconds] |
01:01:25 | | lennier2_ joins |
01:01:40 | | thalia quits [Quit: Connection closed for inactivity] |
01:09:08 | | lennier2 joins |
01:12:58 | | lennier2_ quits [Ping timeout: 258 seconds] |
01:29:31 | | Xanthon joins |
01:29:31 | | Xanthon is now authenticated as Xanthon |
01:29:31 | | Xanthon quits [Changing host] |
01:29:31 | | Xanthon (Xanthon) joins |
02:10:20 | | BlueMaxima quits [Read error: Connection reset by peer] |
02:19:53 | | DogsRNice joins |
02:20:02 | | Chris50105 (Chris5010) joins |
02:21:58 | | Chris5010 quits [Ping timeout: 258 seconds] |
02:21:58 | | Chris50105 is now known as Chris5010 |
02:34:59 | | Dango360 (Dango360) joins |
02:45:34 | | Doranwen (Doranwen) joins |
02:52:56 | | etnguyen03 quits [Remote host closed the connection] |
03:21:20 | | bills joins |
03:21:55 | | bills quits [Client Quit] |
03:25:28 | | JaffaCakes118 quits [Remote host closed the connection] |
03:26:10 | | JaffaCakes118 (JaffaCakes118) joins |
04:01:38 | | nicolas17 quits [Ping timeout: 258 seconds] |
04:05:09 | | nicolas17 joins |
04:11:58 | | rktk quits [Ping timeout: 258 seconds] |
04:15:15 | | Commander001 joins |
04:18:33 | <that_lurker> | "Yes, this is true currently. If you need nice WARCs I recommend Browsertrix by our friends at Webrecorder instead." :-( |
04:18:49 | <@JAA> | Sigh |
04:19:06 | <h2ibot> | PaulWise edited Mailman/2 (+5, launchpad-users list done): https://wiki.archiveteam.org/?diff=53596&oldid=53479 |
04:19:51 | <@JAA> | I haven't checked what Browsertrix does exactly, but I bet it isn't right. |
04:21:10 | <that_lurker> | https://img.kuhaon.fun/u/h2gf1Q.png |
04:21:24 | <that_lurker> | ^The whole conversation |
04:23:08 | <Jake> | nothing good.... 😆 |
04:23:10 | <@JAA> | Not one word in that response from Ilya surprises me. |
04:31:05 | | superkuh quits [Ping timeout: 260 seconds] |
04:35:19 | | DogsRNice quits [Read error: Connection reset by peer] |
04:41:48 | | pokechu22 quits [Quit: Physically moving pi, should be back in at most 15 minutes] |
04:46:03 | <Flashfire42> | Well my warrior force rebooted after 7 daus lol |
04:46:34 | <steering> | looking at Merklemap, it looks like its just from CT logs? Is there a project for CT logs? :P |
04:46:44 | <steering> | (CT being designed for short-term durability rather than long-term) |
04:52:18 | | pokechu22 (pokechu22) joins |
05:14:50 | | adryd quits [Read error: Connection reset by peer] |
05:21:43 | <pabs> | has anyone ever tried curl-impersonate to defeat TLS fingerprinting? https://github.com/lwthiker/curl-impersonate https://daniel.haxx.se/blog/2022/09/02/curls-tls-fingerprint/ |
05:28:17 | <h2ibot> | Ka edited List of micronations (-292, /* Blogs */ not sure if I'm meant to edit this…): https://wiki.archiveteam.org/?diff=53597&oldid=45146 |
05:28:18 | <h2ibot> | Ka edited List of Reddit subs by country and territory (+258, adding some more): https://wiki.archiveteam.org/?diff=53598&oldid=45056 |
05:28:19 | <h2ibot> | Ka edited WikiLeaks (+2, update url): https://wiki.archiveteam.org/?diff=53599&oldid=27550 |
05:38:08 | | matoro quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
05:39:59 | | matoro joins |
05:48:11 | <Barto> | steering: just CT logs? Damn, disappointed |
05:52:55 | <steering> | idk, processed into a big CSV is still useful |
05:54:17 | <steering> | their homepage says (under "How does MerkleMap work?") "MerkleMap continuously syncs with major public CT logs to maintain an up-to-date index of issued SSL/TLS certificates" |
05:54:42 | <@JAA> | Also the combination with DNS records can be useful. |
05:56:14 | | midou quits [Ping timeout: 258 seconds] |
06:00:46 | <Barto> | An internet friend of mine presented this at defcon: https://www.youtube.com/watch?v=IJT6_OcY_dc That might interest you :-) |
06:03:29 | <steering> | I have always wished for good open-source sets of DNS and whois history. |
06:03:37 | <steering> | I wonder how expensive it would end up being. |
06:03:51 | | adryd (adryd) joins |
06:05:18 | <steering> | Probably not that big, the whois results especially should compress quite well |
06:05:35 | | Snivy quits [Ping timeout: 260 seconds] |
06:08:58 | | Miki_57 joins |
06:16:06 | | fuzzy80211 quits [Read error: Connection reset by peer] |
06:16:12 | | fuzzy8021 (fuzzy80211) joins |
06:18:51 | | magmaus3 quits [Ping timeout: 258 seconds] |
06:21:34 | | magmaus3 (magmaus3) joins |
06:31:42 | | superkuh joins |
06:41:17 | | corentin joins |
06:42:23 | <eggdrop> | [remind] OrIdow6: niconino |
06:42:33 | <@OrIdow6> | !remindme 1d niconino |
06:42:34 | <eggdrop> | [remind] ok, i'll remind you at 2024-10-18T06:42:33Z |
06:48:18 | | adryd quits [Client Quit] |
06:48:40 | | adryd (adryd) joins |
06:57:20 | | Snivy (Snivy) joins |
06:57:56 | | midou joins |
07:03:59 | | awauwa joins |
07:04:10 | | awauwa is now authenticated as awauwa |
07:05:50 | | Unholy236192464537713 (Unholy2361) joins |
07:06:10 | <corentin> | Did anyone ever try the WARC support of pywb for CREATING warcs? https://news.ycombinator.com/item?id=41864927#41866285 |
07:07:18 | <corentin> | I'd like to answer to answer "And you're talking to one of the only person that wrote a proper WARC library" or "Well if it's THAT good why is nobody using it in the industry" but Jake is trying to convince me to just shut up, and he is right |
07:19:36 | | igloo22225 quits [Quit: The Lounge - https://thelounge.chat] |
07:24:20 | | Snivy quits [Ping timeout: 260 seconds] |
07:25:24 | | loug831814 joins |
07:28:15 | | Snivy (Snivy) joins |
07:30:37 | | igloo22225 (igloo22225) joins |
07:40:34 | | igloo22225 quits [Client Quit] |
07:45:24 | <pabs> | corentin: info on https://wiki.archiveteam.org/index.php/The_WARC_Ecosystem (and I posted that to the thread) |
07:48:16 | <corentin> | Thank my man. I wrote a message in response to WACZ here https://news.ycombinator.com/item?id=41864675, then Jake pushed me to delete it. I was basically saying that WACZ is useless because it's just a zip with (non-compliant) WARC file and a bunch of metadata that could be in metadata records. And that state-of-the-art should mean respecting the |
07:48:16 | <corentin> | specs it's claiming to respect. :) I won't post anymore but I support anyone that answer hahahahahah |
07:48:55 | <corentin> | Ilya also said "Every archiving tool out there makes trade-offs about what is archived and how." which is so false, wget-at & Zeno do not make trade-offs, we respect the WARC lib and that's it, we don't bend it. |
08:03:20 | | igloo22225 (igloo22225) joins |
08:05:19 | | D00maholic quits [Ping timeout: 255 seconds] |
08:05:39 | | Doomaholic (Doomaholic) joins |
09:21:00 | | Commander001 quits [Ping timeout: 260 seconds] |
09:21:08 | | Commander001 joins |
09:35:52 | | sralracer joins |
09:36:11 | | sralracer is now authenticated as sralracer |
09:51:41 | | Stagnant quits [Remote host closed the connection] |
10:02:59 | | Stagnant (Stagnant) joins |
10:03:31 | | Commander001 quits [Read error: Connection reset by peer] |
10:03:45 | | Commander001 joins |
11:00:06 | | Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:53 | | Bleo18260072271962 joins |
11:12:36 | | Grzesiek11_ joins |
11:12:36 | | Grzesiek11 quits [Read error: Connection reset by peer] |
11:50:53 | | SkilledAlpaca418 quits [Quit: SkilledAlpaca418] |
11:52:46 | | SkilledAlpaca418 joins |
12:14:17 | | shgaqnyrjp quits [Remote host closed the connection] |
12:14:17 | | SootBector quits [Remote host closed the connection] |
12:14:37 | | shgaqnyrjp (shgaqnyrjp) joins |
12:14:38 | | SootBector (SootBector) joins |
12:15:52 | <TheTechRobo> | I don't necessarily have a huge issue with rewriting HTTP/2 data as HTTP/1.1 (as long as it's made exceedingly clear and none of the actual information gets changed) but this is a big enough change that I wouldn't just do it. This is *archival* of all things, we need a proper standard so future people know what they're dealing with |
12:18:13 | <TheTechRobo> | I also don't really get the "easy-to-consume" argument. Why not just put the WARC body through an HTTP parser? You don't need to be reading the WARC completely by yourself. |
12:19:13 | | TheTechRobo has noticed Zeno does not appear to be listed on https://wiki.archiveteam.org/index.php/The_WARC_Ecosystem |
12:28:27 | | SootBector quits [Remote host closed the connection] |
12:28:49 | | SootBector (SootBector) joins |
12:43:37 | | qinplus_mobile joins |
12:49:33 | <myself> | steering: Seriously, I've found myself wanting DNS and whois history plenty of times over the years. I wonder if that data is out there somewhere. |
12:54:53 | <steering> | myself: mostly $$$, ad-filled and poor coverage, or both :P |
12:56:36 | | rktk (rktk) joins |
13:20:42 | | klaffty joins |
13:34:13 | | Radzig quits [Remote host closed the connection] |
13:45:51 | <kiska> | corentin: I think you can answer with that response |
13:56:23 | | Guest54 joins |
14:01:00 | | Guest54 quits [Ping timeout: 260 seconds] |
14:01:56 | | Commander001 quits [Ping timeout: 258 seconds] |
14:02:07 | | Commander001 joins |
14:18:52 | <@JAA> | corentin: The trade-off we make is that we don't support HTTP/2, HTTP/3, or WebSockets. And that is something that needs to be addressed in WARC sooner rather than later. But yeah, doesn't excuse what they are doing at all. |
15:00:30 | | loug831814 quits [Ping timeout: 260 seconds] |
15:00:53 | | loug831814 joins |
15:17:00 | <h2ibot> | Nulldata edited Deathwatch (+344, /* 2024 */ Added Accord's Library): https://wiki.archiveteam.org/?diff=53600&oldid=53589 |
15:17:14 | <nulldata> | Square Enix-- |
15:17:15 | <eggdrop> | [karma] 'Square Enix' now has -1 karma! |
15:30:07 | | Commander001 quits [Read error: Connection reset by peer] |
15:30:19 | | Commander001 joins |
15:30:24 | <@arkiver> | maybe ArchiveBox can use Wget-AT instead |
15:32:14 | <thuban> | iirc (i looked into this last time it came up) the wget binary is a config variable, so indeed it can |
15:34:57 | <thuban> | https://github.com/ArchiveBox/ArchiveBox/blob/315c9f3844d63f897e1c73c3bbbab7bf9f3e0c11/archivebox/config.py#L229 yup (maybe worth mentioning? i don't have an hn account) |
15:34:57 | <@arkiver> | i'll send them a message |
15:35:16 | <@arkiver> | thuban: yeah if someone can mention it on HN, that is great as well |
15:36:00 | <@arkiver> | (sending an email) |
15:45:23 | <@arkiver> | email sent |
15:53:29 | | qinplus_mobile quits [Quit: Connection closed for inactivity] |
16:08:26 | | Commander001 quits [Remote host closed the connection] |
16:14:53 | | Commander001 joins |
16:50:14 | <nulldata> | ArchiveBox supports arm so that might be a blocker for using wget-at |
16:56:02 | | vix5110_ joins |
17:08:59 | | cow_2001 joins |
17:19:17 | <thuban> | using wget-at on arm wouldn't be worse than using wget on arm |
17:23:21 | <@JAA> | It would because wget has been tested on ARM while wget-at hasn't. |
17:23:45 | <@JAA> | (Not that the wget WARC code is great.) |
17:25:10 | <thuban> | oh, i thought that neither had been correctness-tested on arm. |
17:25:57 | <@JAA> | I think I saw something about it at one point, but this would've been many years ago, definitely before the angle brackets disaster. |
17:26:12 | <@JAA> | So yeah, maybe not too relevant. |
17:27:44 | <thuban> | i've never seen anybody be specific on what the issue was--something about endianness, but what? would like to know more if anyone recalls... |
17:43:53 | | aninternettroll quits [Remote host closed the connection] |
17:46:19 | | aninternettroll (aninternettroll) joins |
17:56:05 | | aninternettroll quits [Ping timeout: 260 seconds] |
18:02:39 | | pedantic-darwin4 joins |
18:02:57 | | pedantic-darwin quits [Read error: Connection reset by peer] |
18:02:57 | | pedantic-darwin4 is now known as pedantic-darwin |
18:06:30 | | awauwa quits [Client Quit] |
18:07:20 | | aninternettroll (aninternettroll) joins |
18:23:30 | <nicolas17> | thuban: ARM and x86 have the same endianness |
18:27:25 | <thuban> | aiui arm can be either in principle, although in practice that's true. but endianness is the only thing i've ever seen specifically cited |
18:32:48 | <Jake> | https://youtu.be/K590t6szNLI -- new root key signing ceremony |
18:39:50 | <that_lurker> | oh it was today |
18:45:22 | <katia> | oh! |
18:56:58 | <Barto> | Jake: xfce in the wild |
18:58:50 | <Jake> | indeed |
19:31:43 | <@JAA> | Xfce++ |
19:31:43 | <eggdrop> | [karma] 'Xfce' now has 1 karma! |
20:10:32 | | Dango360_ (Dango360) joins |
20:13:46 | | Dango360 quits [Ping timeout: 258 seconds] |
20:31:28 | | vix5110_ quits [Client Quit] |
20:36:46 | <@OrIdow6> | TheTechRobo: Yeah I wouldn't mind a standard for re-serialized DOM after the JS messes with it - or maybe even some way to pass around screenshots, skip the text layer entirely |
20:49:50 | | etnguyen03 (etnguyen03) joins |
21:22:49 | | loug831814 quits [Quit: The Lounge - https://thelounge.chat] |
21:23:00 | <nicolas17> | ok I'm now pretty sure my stuck-uploading workers are growing their RAM usage |
21:27:15 | | Chris5010 quits [Ping timeout: 260 seconds] |
21:29:28 | | Chris5010 (Chris5010) joins |
21:31:12 | | lennier2 quits [Ping timeout: 258 seconds] |
21:33:40 | | lennier2 joins |
21:34:08 | | Xanthos joins |
21:34:20 | | Xanthon quits [Read error: Connection reset by peer] |
21:34:20 | | Xanthos is now known as Xanthon |
21:34:22 | | Xanthon is now authenticated as Xanthon |
21:34:22 | | Xanthon quits [Changing host] |
21:34:22 | | Xanthon (Xanthon) joins |
21:35:49 | | lennier2_ joins |
21:38:55 | | lennier2 quits [Ping timeout: 260 seconds] |
22:17:06 | | nicolas17 is now authenticated as nicolas17 |
22:45:17 | | ymgve__ joins |
22:48:55 | | ymgve_ quits [Ping timeout: 260 seconds] |
22:51:31 | | Xanthos joins |
22:52:25 | | Sidpatchy quits [Ping timeout: 260 seconds] |
22:53:59 | | Xanthon quits [Ping timeout: 258 seconds] |
22:54:00 | | Xanthos is now known as Xanthon |
22:54:02 | | Xanthon is now authenticated as Xanthon |
22:54:02 | | Xanthon quits [Changing host] |
22:54:02 | | Xanthon (Xanthon) joins |
23:22:02 | | etnguyen03 quits [Client Quit] |
23:31:25 | | sralracer quits [Client Quit] |
23:42:18 | | Snivy quits [Ping timeout: 258 seconds] |
23:43:44 | | etnguyen03 (etnguyen03) joins |
23:47:51 | | Snivy (Snivy) joins |