00:00:01etnguyen03 quits [Client Quit]
00:03:41Naruyoko5 quits [Ping timeout: 260 seconds]
00:05:29Naruyoko joins
00:07:47kansei- (kansei) joins
00:08:10kansei quits [Ping timeout: 250 seconds]
00:30:18khaoohs_ joins
00:30:18khaoohs quits [Read error: Connection reset by peer]
00:36:12etnguyen03 (etnguyen03) joins
00:38:07gust quits [Read error: Connection reset by peer]
01:17:25<pabs>!tell VoynichCR TuxFamily archiving is being co-ordinated on https://pad.notkiska.pw/p/archivebot-tuxfamily
01:17:25<eggdrop>[tell] ok, I'll tell VoynichCR when they join next
01:17:28<pabs>pokechu22: ^
01:17:37<pabs>steering: ^
01:17:56<pabs>there are a *lot* of jobs to do
01:18:25<pabs>I've been working on writing a classifier and AB job generator, so I can get the simple ones done with h2ibot
01:27:33<eggdrop>[remind] arkiver: indafoto bzc6p
02:32:53Webuser213709 quits [Quit: Ooops, wrong browser tab.]
02:33:38etnguyen03 quits [Client Quit]
02:37:36etnguyen03 (etnguyen03) joins
02:45:14etnguyen03 quits [Remote host closed the connection]
02:51:08<pabs>https://sethmlarson.dev/i-fear-for-the-unauthenticated-web https://news.ycombinator.com/item?id=43424340
02:55:03<nicolas17>I just checked my blog access logs to see if there was any such LLM scrapers
02:55:40<nicolas17>but it seems to be mostly security scanners and such
02:56:04<nicolas17>I should probably configure fail2ban to trip on accesses to /.git/config and such
02:56:40Megame (Megame) joins
02:58:54<nicolas17>can't hide from crawlers because they scrape TLS CT logs nowadays
03:15:46katocala quits [Remote host closed the connection]
03:16:03katocala joins
03:19:41<pabs>sounds like Greenpeace US might be bankrupted https://apnews.com/article/greenpeace-dakota-access-pipeline-lawsuit-verdict-5036944c1d2e7d3d7b704437e8110fbb https://news.ycombinator.com/item?id=43422556
03:25:12notSokar is now known as Sokar
03:55:51yasomi quits [Ping timeout: 260 seconds]
03:57:01yasomi (yasomi) joins
03:58:43windyshadow32 joins
04:00:58windyshadow32 quits [Client Quit]
04:02:08Megame quits [Client Quit]
05:36:14vitzli (vitzli) joins
05:41:24sec^nd quits [Remote host closed the connection]
05:41:38sec^nd (second) joins
05:42:44vitzli quits [Client Quit]
05:57:11JayEmbee quits [Ping timeout: 260 seconds]
05:58:45JayEmbee (JayEmbee) joins
06:11:46Sidpatchy quits [Ping timeout: 260 seconds]
06:24:38<@arkiver>joepie91|m: i see it's specifically on some VPS, does it only _only_ happen on that VPS, or is it due to some special settings you are using?
06:24:41Island quits [Read error: Connection reset by peer]
06:25:07<@arkiver>i'd be interesting in looking deeper into it if you have a way to reproduce it (and which is accessible to me)
07:07:48snel joins
07:11:17BornOn420 quits [Remote host closed the connection]
07:11:40BornOn420 (BornOn420) joins
07:23:01nulldata quits [Quit: Ping timeout (120 seconds)]
07:23:39nulldata (nulldata) joins
07:26:28Webuser840765 joins
07:31:42Webuser840765 quits [Client Quit]
07:47:06<masterx244|m>nicolas17: subdomains can be hidden from those crawlers though. wildcard certificate since that does not tell which subdomain gets used
08:14:24Ketchup901 quits [Remote host closed the connection]
08:14:36Ketchup901 (Ketchup901) joins
08:22:26makeworld quits [Ping timeout: 260 seconds]
08:25:38makeworld joins
08:57:46gatagoto (gatagoto) joins
09:04:24vitzli (vitzli) joins
09:15:30nulldata quits [Client Quit]
09:16:04nulldata (nulldata) joins
09:17:26snel quits [Client Quit]
09:19:21<@JAA>https://forum.nginx.org/ was taken down at some point in the past few months. Per the notice there, the server still exists though, so I'll try to grab a copy of that soon.
09:21:39<@JAA>Also per the notice there, it was mostly not unique content, so there's that.
10:01:05LunarianBunny1147 quits [Quit: The Lounge - https://thelounge.chat]
10:16:11ymgve quits [Ping timeout: 260 seconds]
10:24:57LunarianBunny1147 (LunarianBunny1147) joins
10:25:56ymgve joins
10:44:31notarobot13 joins
10:46:28notarobot1 quits [Ping timeout: 250 seconds]
10:46:28ArchivalEfforts quits [Ping timeout: 250 seconds]
10:46:28SketchCo1 quits [Ping timeout: 250 seconds]
10:46:29notarobot13 is now known as notarobot1
10:46:42arch quits [Remote host closed the connection]
10:46:50arch joins
10:48:13<@arkiver>JAA: on arzon, there's also image_*.html (similarly sequential), anime_*.html, goods_*.html
10:48:20ArchivalEfforts joins
10:51:45SketchCo1 joins
10:52:06anarcat quits [Ping timeout: 250 seconds]
11:00:01Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
11:02:46Bleo18260072271962345 joins
11:24:13arch quits [Remote host closed the connection]
11:24:21arch joins
11:31:16<joepie91|m>arkiver: it happens on every system I've looked at; two VPSes at different providers and a Hetzner dedi (but all running identical environments otherwise)
11:32:11<joepie91|m>arkiver: relevant configuration: https://gist.github.com/joepie91/f557d9bc0cb6963105560ce69fd46cf2
11:34:18SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
11:34:46SkilledAlpaca418962 joins
12:09:01arch quits [Remote host closed the connection]
12:09:09arch joins
12:35:55kuroger quits [Quit: ZNC 1.9.1 - https://znc.in]
12:40:55kuroger (kuroger) joins
12:45:56kuroger quits [Client Quit]
12:47:16SootBector quits [Ping timeout: 276 seconds]
12:47:45SootBector (SootBector) joins
12:49:23kuroger (kuroger) joins
13:02:52FiTheArchiver joins
13:10:10<c3manu>pabs: i got www.greenpeace.org (which redirects to /usa/) running in #archivebot, since there hasn’t been a full grab yet according to the viewer
13:10:59<c3manu>the subdomains are going to be a lot of work though. i think i’d just go and pick what look like the most important, or the ones most relevant to current events
13:18:58kuroger quits [Client Quit]
13:20:54kuroger (kuroger) joins
13:28:19NeonGlitch (NeonGlitch) joins
13:29:25NeonGlitch quits [Client Quit]
13:47:49NeonGlitch (NeonGlitch) joins
13:49:06NeonGlitch quits [Client Quit]
13:55:28NeonGlitch (NeonGlitch) joins
13:58:30VoynichCR (VoynichCR) joins
13:58:31<eggdrop>[tell] VoynichCR: [2025-03-21T01:17:25Z] <pabs> TuxFamily archiving is being co-ordinated on https://pad.notkiska.pw/p/archivebot-tuxfamily
13:59:07<pabs>(and I've been working on writing a classifier and AB job generator, so I can get the simple ones done with h2ibot)
14:00:45Wohlstand (Wohlstand) joins
14:05:00kuroger quits [Client Quit]
14:10:57kuroger (kuroger) joins
14:11:31BornOn420 quits [Remote host closed the connection]
14:11:45BornOn420 (BornOn420) joins
14:14:47notarobot1 quits [Quit: The Lounge - https://thelounge.chat]
14:15:10notarobot1 joins
14:17:59lemuria (lemuria) joins
14:37:47th3z0l4 joins
14:38:12JayEmbee quits [Quit: WeeChat 2.3]
14:39:16th3z0l4_ quits [Ping timeout: 260 seconds]
15:02:11gust joins
15:17:56lemuria quits [Client Quit]
15:20:47Megame (Megame) joins
15:34:38NeonGlitch quits [Client Quit]
15:35:56VoynichCR quits [Client Quit]
15:38:51NeonGlitch (NeonGlitch) joins
15:39:01loug83181422 quits [Quit: The Lounge - https://thelounge.chat]
15:39:23loug83181422 joins
15:39:54NeonGlitch quits [Client Quit]
15:51:21NeonGlitch (NeonGlitch) joins
15:52:08NeonGlitch quits [Client Quit]
15:52:24arch quits [Ping timeout: 250 seconds]
15:57:41arch joins
15:58:07arch quits [Remote host closed the connection]
15:58:11arch_ joins
15:58:17arch_ is now known as arch
16:00:45kuroger quits [Client Quit]
16:12:47anarcat (anarcat) joins
16:16:12kuroger (kuroger) joins
16:20:58NeonGlitch (NeonGlitch) joins
16:23:03gust quits [Remote host closed the connection]
16:23:21gust joins
17:00:56SpikedCola quits [Quit: Ooops, wrong browser tab.]
17:42:43kuroger quits [Client Quit]
17:45:39kuroger (kuroger) joins
17:50:36Megame quits [Ping timeout: 260 seconds]
17:52:54Megame (Megame) joins
17:56:49ducky quits [Ping timeout: 260 seconds]
18:06:26snel joins
18:08:45kuroger quits [Client Quit]
18:18:24ducky (ducky) joins
18:20:29kuroger (kuroger) joins
18:36:45kuroger quits [Client Quit]
18:43:12kuroger (kuroger) joins
18:51:13lflare quits [Quit: Bye]
18:51:37lflare (lflare) joins
18:54:47linuxgemini quits [Quit: Ping timeout (120 seconds)]
18:55:02linuxgemini (linuxgemini) joins
19:00:22NeonGlitch quits [Client Quit]
19:17:07<h2ibot>HadeanEon edited Deaths in 2003 (-265, BOT - Updating page: {{saved}} (2),…): https://wiki.archiveteam.org/?diff=55007&oldid=54682
19:17:08<h2ibot>HadeanEon edited Deaths in 2003/list (-25, BOT - Updating list): https://wiki.archiveteam.org/?diff=55008&oldid=54683
19:18:35vitzli quits [Quit: Leaving]
19:20:18kuroger quits [Client Quit]
19:21:07<h2ibot>HadeanEon edited Deaths in 2004/list (-3, BOT - Updating list): https://wiki.archiveteam.org/?diff=55009&oldid=54685
19:21:44kuroger (kuroger) joins
19:40:11<h2ibot>HadeanEon edited Deaths in 2007 (-290, BOT - Updating page: {{saved}} (4),…): https://wiki.archiveteam.org/?diff=55010&oldid=54691
19:40:12<h2ibot>HadeanEon edited Deaths in 2007/list (-56, BOT - Updating list): https://wiki.archiveteam.org/?diff=55011&oldid=54692
19:57:04sparky14921 (sparky1492) joins
20:00:41sparky1492 quits [Ping timeout: 260 seconds]
20:00:41sparky14921 is now known as sparky1492
20:02:03adamus1red quits [Quit: SigTerm]
20:04:49adamus1red (adamus1red) joins
20:07:51kuroger quits [Client Quit]
20:10:48kuroger (kuroger) joins
20:12:07Island joins
20:12:56Snivy quits [Ping timeout: 260 seconds]
20:16:13snel quits [Client Quit]
20:17:41VoynichCR (VoynichCR) joins
20:18:17<h2ibot>HadeanEon edited Deaths in 2011 (+287, BOT - Updating page: {{saved}} (204),…): https://wiki.archiveteam.org/?diff=55012&oldid=54699
20:18:18<h2ibot>HadeanEon edited Deaths in 2011/list (+36, BOT - Updating list): https://wiki.archiveteam.org/?diff=55013&oldid=54700
20:19:40Snivy (Snivy) joins
20:29:57Sidpatchy (Sidpatchy) joins
20:38:53kuroger quits [Client Quit]
20:42:20kuroger (kuroger) joins
20:55:19etnguyen03 (etnguyen03) joins
20:57:24<h2ibot>HadeanEon edited Deaths in 2015 (+276, BOT - Updating page: {{saved}} (318),…): https://wiki.archiveteam.org/?diff=55014&oldid=54712
20:57:25<h2ibot>HadeanEon edited Deaths in 2015/list (+27, BOT - Updating list): https://wiki.archiveteam.org/?diff=55015&oldid=54713
21:03:24kuroger quits [Client Quit]
21:05:21<jacksonchen666>not sure how worthy this is for #archiveteam since OMDB (osu! rating site) has full db and source code available at https://omdb.nyahh.net/data.html (63MB db), which shut down on 2025-03(-01)
21:11:34VoynichCR quits [Client Quit]
21:12:26<h2ibot>HadeanEon edited Deaths in 2016/list (+6, BOT - Updating list): https://wiki.archiveteam.org/?diff=55016&oldid=54723
21:13:02etnguyen03 quits [Client Quit]
21:13:26BlueMaxima joins
21:13:52kuroger (kuroger) joins
21:19:07Wohlstand quits [Remote host closed the connection]
21:27:16<pokechu22>jacksonchen666: I've started an archivebot job for that and put the github repo in #gitgud
21:28:14BlueMaxima quits [Read error: Connection reset by peer]
21:28:34<pokechu22>uh, and I guess there's also outlinks from that database, will try to do something with those
21:35:18NeonGlitch (NeonGlitch) joins
21:36:26NeonGlitch quits [Client Quit]
21:51:03chrismeller quits [Quit: chrismeller]
21:51:21chrismeller (chrismeller) joins
21:57:55pixel (pixel) joins
22:03:03arch quits [Remote host closed the connection]
22:03:29arch joins
22:03:31arch quits [Remote host closed the connection]
22:03:39arch joins
22:11:59Webuser499490 joins
22:16:49BlueMaxima joins
22:26:08wickedplayer494 quits [Read error: Connection reset by peer]
22:27:16wickedplayer494 joins
22:46:48etnguyen03 (etnguyen03) joins
22:57:31NeonGlitch (NeonGlitch) joins
23:05:34etnguyen03 quits [Client Quit]
23:20:47lunik1 quits [Quit: :x]
23:21:16lunik1 joins
23:22:11simon816 quits [Quit: ZNC 1.9.1 - https://znc.in]
23:30:21etnguyen03 (etnguyen03) joins
23:30:51simon816 (simon816) joins