00:01:54etnguyen03 (etnguyen03) joins
00:08:34DogsRNice quits [Read error: Connection reset by peer]
00:16:59Dango360 quits [Read error: Connection reset by peer]
00:49:34nicolas17 quits [Ping timeout: 258 seconds]
00:53:49nicolas17 joins
00:56:42Xanthon quits [Remote host closed the connection]
00:58:45lennier2_ quits [Ping timeout: 260 seconds]
01:01:25lennier2_ joins
01:01:40thalia quits [Quit: Connection closed for inactivity]
01:09:08lennier2 joins
01:12:58lennier2_ quits [Ping timeout: 258 seconds]
01:29:31Xanthon joins
01:29:31Xanthon quits [Changing host]
01:29:31Xanthon (Xanthon) joins
02:10:20BlueMaxima quits [Read error: Connection reset by peer]
02:19:53DogsRNice joins
02:20:02Chris50105 (Chris5010) joins
02:21:58Chris5010 quits [Ping timeout: 258 seconds]
02:21:58Chris50105 is now known as Chris5010
02:34:59Dango360 (Dango360) joins
02:45:34Doranwen (Doranwen) joins
02:52:56etnguyen03 quits [Remote host closed the connection]
03:21:20bills joins
03:21:55bills quits [Client Quit]
03:25:28JaffaCakes118 quits [Remote host closed the connection]
03:26:10JaffaCakes118 (JaffaCakes118) joins
04:01:38nicolas17 quits [Ping timeout: 258 seconds]
04:05:09nicolas17 joins
04:11:58rktk quits [Ping timeout: 258 seconds]
04:15:15Commander001 joins
04:18:33<that_lurker>"Yes, this is true currently. If you need nice WARCs I recommend Browsertrix by our friends at Webrecorder instead." :-(
04:18:49<@JAA>Sigh
04:19:06<h2ibot>PaulWise edited Mailman/2 (+5, launchpad-users list done): https://wiki.archiveteam.org/?diff=53596&oldid=53479
04:19:51<@JAA>I haven't checked what Browsertrix does exactly, but I bet it isn't right.
04:21:10<that_lurker>https://img.kuhaon.fun/u/h2gf1Q.png
04:21:24<that_lurker>^The whole conversation
04:23:08<Jake>nothing good.... 😆
04:23:10<@JAA>Not one word in that response from Ilya surprises me.
04:31:05superkuh quits [Ping timeout: 260 seconds]
04:35:19DogsRNice quits [Read error: Connection reset by peer]
04:41:48pokechu22 quits [Quit: Physically moving pi, should be back in at most 15 minutes]
04:46:03<Flashfire42>Well my warrior force rebooted after 7 daus lol
04:46:34<steering>looking at Merklemap, it looks like its just from CT logs? Is there a project for CT logs? :P
04:46:44<steering>(CT being designed for short-term durability rather than long-term)
04:52:18pokechu22 (pokechu22) joins
05:14:50adryd quits [Read error: Connection reset by peer]
05:21:43<pabs>has anyone ever tried curl-impersonate to defeat TLS fingerprinting? https://github.com/lwthiker/curl-impersonate https://daniel.haxx.se/blog/2022/09/02/curls-tls-fingerprint/
05:28:17<h2ibot>Ka edited List of micronations (-292, /* Blogs */ not sure if I'm meant to edit this…): https://wiki.archiveteam.org/?diff=53597&oldid=45146
05:28:18<h2ibot>Ka edited List of Reddit subs by country and territory (+258, adding some more): https://wiki.archiveteam.org/?diff=53598&oldid=45056
05:28:19<h2ibot>Ka edited WikiLeaks (+2, update url): https://wiki.archiveteam.org/?diff=53599&oldid=27550
05:38:08matoro quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
05:39:59matoro joins
05:48:11<Barto>steering: just CT logs? Damn, disappointed
05:52:55<steering>idk, processed into a big CSV is still useful
05:54:17<steering>their homepage says (under "How does MerkleMap work?") "MerkleMap continuously syncs with major public CT logs to maintain an up-to-date index of issued SSL/TLS certificates"
05:54:42<@JAA>Also the combination with DNS records can be useful.
05:56:14midou quits [Ping timeout: 258 seconds]
06:00:46<Barto>An internet friend of mine presented this at defcon: https://www.youtube.com/watch?v=IJT6_OcY_dc That might interest you :-)
06:03:29<steering>I have always wished for good open-source sets of DNS and whois history.
06:03:37<steering>I wonder how expensive it would end up being.
06:03:51adryd (adryd) joins
06:05:18<steering>Probably not that big, the whois results especially should compress quite well
06:05:35Snivy quits [Ping timeout: 260 seconds]
06:08:58Miki_57 joins
06:16:06fuzzy80211 quits [Read error: Connection reset by peer]
06:16:12fuzzy8021 (fuzzy80211) joins
06:18:51magmaus3 quits [Ping timeout: 258 seconds]
06:21:34magmaus3 (magmaus3) joins
06:31:42superkuh joins
06:41:17corentin joins
06:42:23<eggdrop>[remind] OrIdow6: niconino
06:42:33<@OrIdow6>!remindme 1d niconino
06:42:34<eggdrop>[remind] ok, i'll remind you at 2024-10-18T06:42:33Z
06:48:18adryd quits [Client Quit]
06:48:40adryd (adryd) joins
06:57:20Snivy (Snivy) joins
06:57:56midou joins
07:03:59awauwa joins
07:05:50Unholy236192464537713 (Unholy2361) joins
07:06:10<corentin>Did anyone ever try the WARC support of pywb for CREATING warcs? https://news.ycombinator.com/item?id=41864927#41866285
07:07:18<corentin>I'd like to answer to answer "And you're talking to one of the only person that wrote a proper WARC library" or "Well if it's THAT good why is nobody using it in the industry" but Jake is trying to convince me to just shut up, and he is right
07:19:36igloo22225 quits [Quit: The Lounge - https://thelounge.chat]
07:24:20Snivy quits [Ping timeout: 260 seconds]
07:25:24loug831814 joins
07:28:15Snivy (Snivy) joins
07:30:37igloo22225 (igloo22225) joins
07:40:34igloo22225 quits [Client Quit]
07:45:24<pabs>corentin: info on https://wiki.archiveteam.org/index.php/The_WARC_Ecosystem (and I posted that to the thread)
07:48:16<corentin>Thank my man. I wrote a message in response to WACZ here https://news.ycombinator.com/item?id=41864675, then Jake pushed me to delete it. I was basically saying that WACZ is useless because it's just a zip with (non-compliant) WARC file and a bunch of metadata that could be in metadata records. And that state-of-the-art should mean respecting the
07:48:16<corentin>specs it's claiming to respect. :) I won't post anymore but I support anyone that answer hahahahahah
07:48:55<corentin>Ilya also said "Every archiving tool out there makes trade-offs about what is archived and how." which is so false, wget-at & Zeno do not make trade-offs, we respect the WARC lib and that's it, we don't bend it.
08:03:20igloo22225 (igloo22225) joins
08:05:19D00maholic quits [Ping timeout: 255 seconds]
08:05:39Doomaholic (Doomaholic) joins
09:21:00Commander001 quits [Ping timeout: 260 seconds]
09:21:08Commander001 joins
09:35:52sralracer joins
09:51:41Stagnant quits [Remote host closed the connection]
10:02:59Stagnant (Stagnant) joins
10:03:31Commander001 quits [Read error: Connection reset by peer]
10:03:45Commander001 joins
11:00:06Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat]
11:02:53Bleo18260072271962 joins
11:12:36Grzesiek11_ joins
11:12:36Grzesiek11 quits [Read error: Connection reset by peer]
11:50:53SkilledAlpaca418 quits [Quit: SkilledAlpaca418]
11:52:46SkilledAlpaca418 joins
12:14:17shgaqnyrjp quits [Remote host closed the connection]
12:14:17SootBector quits [Remote host closed the connection]
12:14:37shgaqnyrjp (shgaqnyrjp) joins
12:14:38SootBector (SootBector) joins
12:15:52<TheTechRobo>I don't necessarily have a huge issue with rewriting HTTP/2 data as HTTP/1.1 (as long as it's made exceedingly clear and none of the actual information gets changed) but this is a big enough change that I wouldn't just do it. This is *archival* of all things, we need a proper standard so future people know what they're dealing with
12:18:13<TheTechRobo>I also don't really get the "easy-to-consume" argument. Why not just put the WARC body through an HTTP parser? You don't need to be reading the WARC completely by yourself.
12:19:13TheTechRobo has noticed Zeno does not appear to be listed on https://wiki.archiveteam.org/index.php/The_WARC_Ecosystem
12:28:27SootBector quits [Remote host closed the connection]
12:28:49SootBector (SootBector) joins
12:43:37qinplus_mobile joins
12:49:33<myself>steering: Seriously, I've found myself wanting DNS and whois history plenty of times over the years. I wonder if that data is out there somewhere.
12:54:53<steering>myself: mostly $$$, ad-filled and poor coverage, or both :P
12:56:36rktk (rktk) joins
13:20:42klaffty joins
13:34:13Radzig quits [Remote host closed the connection]
13:45:51<kiska>corentin: I think you can answer with that response
13:56:23Guest54 joins
14:01:00Guest54 quits [Ping timeout: 260 seconds]
14:01:56Commander001 quits [Ping timeout: 258 seconds]
14:02:07Commander001 joins
14:18:52<@JAA>corentin: The trade-off we make is that we don't support HTTP/2, HTTP/3, or WebSockets. And that is something that needs to be addressed in WARC sooner rather than later. But yeah, doesn't excuse what they are doing at all.
15:00:30loug831814 quits [Ping timeout: 260 seconds]
15:00:53loug831814 joins
15:17:00<h2ibot>Nulldata edited Deathwatch (+344, /* 2024 */ Added Accord's Library): https://wiki.archiveteam.org/?diff=53600&oldid=53589
15:17:14<nulldata>Square Enix--
15:17:15<eggdrop>[karma] 'Square Enix' now has -1 karma!
15:30:07Commander001 quits [Read error: Connection reset by peer]
15:30:19Commander001 joins
15:30:24<@arkiver>maybe ArchiveBox can use Wget-AT instead
15:32:14<thuban>iirc (i looked into this last time it came up) the wget binary is a config variable, so indeed it can
15:34:57<thuban>https://github.com/ArchiveBox/ArchiveBox/blob/315c9f3844d63f897e1c73c3bbbab7bf9f3e0c11/archivebox/config.py#L229 yup (maybe worth mentioning? i don't have an hn account)
15:34:57<@arkiver>i'll send them a message
15:35:16<@arkiver>thuban: yeah if someone can mention it on HN, that is great as well
15:36:00<@arkiver>(sending an email)
15:45:23<@arkiver>email sent
15:53:29qinplus_mobile quits [Quit: Connection closed for inactivity]
16:08:26Commander001 quits [Remote host closed the connection]
16:14:53Commander001 joins
16:50:14<nulldata>ArchiveBox supports arm so that might be a blocker for using wget-at
16:56:02vix5110_ joins
17:08:59cow_2001 joins
17:19:17<thuban>using wget-at on arm wouldn't be worse than using wget on arm
17:23:21<@JAA>It would because wget has been tested on ARM while wget-at hasn't.
17:23:45<@JAA>(Not that the wget WARC code is great.)
17:25:10<thuban>oh, i thought that neither had been correctness-tested on arm.
17:25:57<@JAA>I think I saw something about it at one point, but this would've been many years ago, definitely before the angle brackets disaster.
17:26:12<@JAA>So yeah, maybe not too relevant.
17:27:44<thuban>i've never seen anybody be specific on what the issue was--something about endianness, but what? would like to know more if anyone recalls...
17:43:53aninternettroll quits [Remote host closed the connection]
17:46:19aninternettroll (aninternettroll) joins
17:56:05aninternettroll quits [Ping timeout: 260 seconds]
18:02:39pedantic-darwin4 joins
18:02:57pedantic-darwin quits [Read error: Connection reset by peer]
18:02:57pedantic-darwin4 is now known as pedantic-darwin
18:06:30awauwa quits [Client Quit]
18:07:20aninternettroll (aninternettroll) joins
18:23:30<nicolas17>thuban: ARM and x86 have the same endianness
18:27:25<thuban>aiui arm can be either in principle, although in practice that's true. but endianness is the only thing i've ever seen specifically cited
18:32:48<Jake>https://youtu.be/K590t6szNLI -- new root key signing ceremony
18:39:50<that_lurker>oh it was today
18:45:22<katia>oh!
18:56:58<Barto>Jake: xfce in the wild
18:58:50<Jake>indeed
19:31:43<@JAA>Xfce++
19:31:43<eggdrop>[karma] 'Xfce' now has 1 karma!
20:10:32Dango360_ (Dango360) joins
20:13:46Dango360 quits [Ping timeout: 258 seconds]
20:31:28vix5110_ quits [Client Quit]
20:36:46<@OrIdow6>TheTechRobo: Yeah I wouldn't mind a standard for re-serialized DOM after the JS messes with it - or maybe even some way to pass around screenshots, skip the text layer entirely
20:49:50etnguyen03 (etnguyen03) joins
21:22:49loug831814 quits [Quit: The Lounge - https://thelounge.chat]
21:23:00<nicolas17>ok I'm now pretty sure my stuck-uploading workers are growing their RAM usage
21:27:15Chris5010 quits [Ping timeout: 260 seconds]
21:29:28Chris5010 (Chris5010) joins
21:31:12lennier2 quits [Ping timeout: 258 seconds]
21:33:40lennier2 joins
21:34:08Xanthos joins
21:34:20Xanthon quits [Read error: Connection reset by peer]
21:34:20Xanthos is now known as Xanthon
21:34:22Xanthon quits [Changing host]
21:34:22Xanthon (Xanthon) joins
21:35:49lennier2_ joins
21:38:55lennier2 quits [Ping timeout: 260 seconds]
22:45:17ymgve__ joins
22:48:55ymgve_ quits [Ping timeout: 260 seconds]
22:51:31Xanthos joins
22:52:25Sidpatchy quits [Ping timeout: 260 seconds]
22:53:59Xanthon quits [Ping timeout: 258 seconds]
22:54:00Xanthos is now known as Xanthon
22:54:02Xanthon quits [Changing host]
22:54:02Xanthon (Xanthon) joins
23:22:02etnguyen03 quits [Client Quit]
23:31:25sralracer quits [Client Quit]
23:42:18Snivy quits [Ping timeout: 258 seconds]
23:43:44etnguyen03 (etnguyen03) joins
23:47:51Snivy (Snivy) joins