03:13:22monoxane quits [Quit: Ping timeout (120 seconds)]
03:13:50monoxane (monoxane) joins
04:14:36myself quits [Ping timeout: 260 seconds]
04:59:57AlsoHP_Archivist quits [Quit: Leaving]
05:00:11HP_Archivist (HP_Archivist) joins
06:04:32<@arkiver>tech234a: yeah it sucks :/
06:04:35<@arkiver>you could search for `NOT _exists_:access-restricted-item` i think
06:04:37<@arkiver>access-restricted-item signals the WARCs are likely unavailable for download
06:04:38<@arkiver>there's a good chance though that more will become unavailable for download in the near future, and only accessible through the Wayback Machine
11:30:06BornOn420 quits [Remote host closed the connection]
12:37:26linuxgemini quits [Ping timeout: 250 seconds]
12:38:31linuxgemini (linuxgemini) joins
12:47:56DLoader quits [Ping timeout: 260 seconds]
13:16:01DLoader (DLoader) joins
15:06:06BornOn420 (BornOn420) joins
15:21:56Sanqui quits [Ping timeout: 260 seconds]
15:53:35Sanqui joins
15:53:37Sanqui quits [Changing host]
15:53:37Sanqui (Sanqui) joins
16:27:28<Grzesiek11_>arkiver: why is that?
16:28:35Grzesiek11_ is now known as Grzesiek11
16:42:47Dango360 quits [Quit: Leaving]
16:43:02Dango360 (Dango360) joins
17:06:07<@arkiver>because we don't want to provide these to LLM training companies
17:06:21<@arkiver>while we also still want them to be available, which is through the Wayback Machine
18:20:48<Grzesiek11>oh. AI must ruin everything huh.
18:20:55<Grzesiek11>can't have good things with AI
18:48:26magmaus3 quits [Ping timeout: 260 seconds]
18:52:13<tech234a>Is it primarily an issue of LLM companies using too much bandwidth or is it more of a policy issue?
19:17:20SootBector quits [Remote host closed the connection]
19:17:38SootBector (SootBector) joins
19:22:33magmaus3 (magmaus3) joins
20:09:57magmaus3 quits [Read error: Connection reset by peer]
20:10:00<nicolas17>tech234a: if websites find out we have warcs that LLMs can train with, they are more likely to block us from archiving
20:10:07magmaus3 (magmaus3) joins
20:17:40<pokechu22>As long as that doesn't affect archivebot I'm fine. I do occasionally need to download archivebot warcs (and often need to download the meta-warcs, which contain job logs) to do further processing though
20:25:16magmaus3 quits [Ping timeout: 260 seconds]
20:32:51<tech234a>hopefully when the LLM bubble pops in like 5 years things can be opened up again ;)
20:59:46NatTheCat quits [Quit: nya~]
21:45:54NatTheCat (NatTheCat) joins
22:38:26SootBector quits [Remote host closed the connection]
22:38:46SootBector (SootBector) joins
23:58:59fuzzy80211 quits [Read error: Connection reset by peer]
23:59:09fuzzy8021 (fuzzy80211) joins