00:43:32Vito` quits [Client Quit]
00:43:39etnguyen03 (etnguyen03) joins
00:46:46Hackerpcs quits [Quit: Hackerpcs]
00:47:41Hackerpcs (Hackerpcs) joins
00:51:02wickedplayer494 quits [Ping timeout: 268 seconds]
00:52:03wickedplayer494 (wickedplayer494) joins
00:53:28etnguyen03 quits [Client Quit]
01:04:59etnguyen03 (etnguyen03) joins
01:26:37etnguyen03 quits [Client Quit]
01:33:23etnguyen03 (etnguyen03) joins
01:36:40azalea_sh_ quits [Ping timeout: 268 seconds]
01:36:53azalea_sh_ (azalea_sh_) joins
01:41:50retrograde quits [Remote host closed the connection]
01:43:51retrograde (retrograde) joins
01:55:55etnguyen03 quits [Client Quit]
02:13:50<systwi_>Would anybody know how to extract errored URLs from a -meta.warc?
02:14:25<@JAA>`little-things/wpull2-log-extract-errors`
02:15:08<@JAA>You can simply pipe the decompressed meta WARC to it; it'll ignore the non-log lines.
02:15:32<@JAA>The self-test example should clarify what's being extracted.
02:20:07<systwi_>Ahh, close! I was looking at warc-dump-responses initially. I saw "log" on that and wasn't sure if that was for the right thing.
02:20:11<systwi_>Thanks JAA. :-)
02:20:50etnguyen03 (etnguyen03) joins
03:18:14etnguyen03 quits [Remote host closed the connection]
03:31:31DogsRNice quits [Read error: Connection reset by peer]
03:53:34PC quits [Ping timeout: 268 seconds]
04:28:20steering7253 (steering) joins
04:28:40steering7253 quits [Client Quit]
05:04:29n9nes quits [Ping timeout: 268 seconds]
05:05:50n9nes joins
05:09:30nexussfan quits [Quit: Konversation terminated!]
05:10:15<h2ibot>Limon edited Angelfire (+583, added shutdown of angelfire by lycos): https://wiki.archiveteam.org/?diff=60757&oldid=60102
05:10:16<h2ibot>TheCarbonFreeze edited Deathwatch (+685, /* 2026 */): https://wiki.archiveteam.org/?diff=60758&oldid=60728
05:10:17<h2ibot>John5433 edited 4chan (+1844, /* ultra.gondola.pics */ added all the boards): https://wiki.archiveteam.org/?diff=60759&oldid=60730
05:10:18<h2ibot>John5433 edited Soyjak.party (+556, /* Archives */): https://wiki.archiveteam.org/?diff=60760&oldid=60732
05:10:19<h2ibot>John5433 edited Deathwatch (+1, spelling mistake): https://wiki.archiveteam.org/?diff=60761&oldid=60758
05:12:15<h2ibot>JustAnotherArchivist edited Angelfire (+30, Datetimeify): https://wiki.archiveteam.org/?diff=60762&oldid=60757
05:12:16<h2ibot>JustAnotherArchivist edited Angelfire (+0, Not (officially) offline yet): https://wiki.archiveteam.org/?diff=60763&oldid=60762
06:25:53mr_sarge quits [Ping timeout: 268 seconds]
06:26:17mr_sarge (sarge) joins
06:29:27Island quits [Read error: Connection reset by peer]
06:44:28TheEnbyperor_ quits [Ping timeout: 268 seconds]
07:03:52TheEnbyperor joins
07:04:53<pabs>!tell v01d a bunch of archive.org alternatives are on the wiki: https://wiki.archiveteam.org/index.php/Archive_Services https://wiki.archiveteam.org/index.php/Category:Web_archiving_services
07:04:54<eggdrop>[tell] ok, I'll tell v01d when they join next
07:06:41<pabs>triplecamera|m: AX_* are from the autoconf-archive package in Debian at least
07:14:54<pabs>klea: discuss.ropensci.org shut already in Feb it seems (NXDOMAIN now). last AB was 20251203
07:15:40<pabs>schwarzkatz|m: might be worth moving linktree-clones.md to a page Linktree/Clones on the wiki?
07:44:36PC (PC) joins
07:54:38steering (steering) joins
07:54:52steering quits [Client Quit]
07:55:37steering (steering) joins
08:17:29LddPotato quits [Read error: Connection reset by peer]
08:18:44LddPotato (LddPotato) joins
08:19:13Webuser032448 joins
08:19:13Webuser032448 quits [Client Quit]
08:20:25Webuser244402 joins
08:26:07Webuser235954 joins
08:27:54Webuser235954 quits [Client Quit]
08:28:22APOLLO03 quits [Quit: .]
08:32:47LddPotato quits [Read error: Connection reset by peer]
08:33:25LddPotato (LddPotato) joins
08:43:17LddPotato quits [Read error: Connection reset by peer]
08:44:02LddPotato (LddPotato) joins
08:47:49LddPotato quits [Read error: Connection reset by peer]
08:48:27LddPotato (LddPotato) joins
08:58:02Webuser389403 joins
08:58:18Webuser389403 quits [Client Quit]
09:00:04LddPotato quits [Read error: Connection reset by peer]
09:00:57LddPotato (LddPotato) joins
09:30:53pabs quits [Ping timeout: 268 seconds]
09:33:32pabs (pabs) joins
11:00:02Bleo18260072271962345522201 quits [Quit: The Lounge - https://thelounge.chat]
11:00:46Webuser578944 joins
11:00:53Webuser578944 quits [Client Quit]
11:02:48Bleo18260072271962345522201 joins
11:03:16<alexlehm>pabs: where is the linktree-clones file now?
11:12:45VerifiedJ quits [Remote host closed the connection]
11:13:22VerifiedJ7 (VerifiedJ) joins
11:19:57<pabs>alexlehm: see the backlog: <schwarzkatz|m> azalea_sh_: I have not followed the conversation, but I gathered you might be interested: I created a list of linktree clones quite a while ago, idk how many of them still work https://transfer.archivete.am/GPs4e/linktree-clones.md
11:19:58<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/GPs4e/linktree-clones.md
11:20:39<alexlehm>thank you, i missed it in the backlog for some reason
12:00:27APOLLO03 joins
12:06:31<klea>oh :(
12:06:50<klea>I should move it then to the past.
12:07:07<klea>I wonder if we should make a wiki page for them.
12:07:29<klea>It seems like it'd be better, since the list is already public.
12:21:43Wohlstand1 (Wohlstand) joins
12:22:01Webuser244402 quits [Quit: Ooops, wrong browser tab.]
12:24:09Wohlstand1 is now known as Wohlstand
12:37:33Webuser526815 joins
12:38:19Webuser526815 quits [Client Quit]
13:32:42Arcorann_ quits [Ping timeout: 268 seconds]
13:37:03<gamer191-1|m>Regarding archive.today, has archiveteam archived any projects with captchas in the past?
13:48:28pseudorizer quits [Quit: ZNC 1.10.1 - https://znc.in]
13:49:09pseudorizer (pseudorizer) joins
13:50:00Webuser512333 joins
13:50:11Webuser512333 quits [Client Quit]
13:52:26pseudorizer quits [Client Quit]
13:52:36nexussfan (nexussfan) joins
13:54:06pseudorizer (pseudorizer) joins
13:56:53revi quits [Quit: Connection closed for inactivity]
14:08:24sepro5 (sepro) joins
14:10:51sepro quits [Ping timeout: 268 seconds]
14:10:51sepro5 is now known as sepro
14:20:10nexussfan quits [Client Quit]
14:25:44sepro quits [Ping timeout: 268 seconds]
14:28:16sepro (sepro) joins
14:29:25FiTheArchiver joins
14:36:11sepro5 (sepro) joins
14:37:49FiTheArchiver quits [Client Quit]
14:38:41sepro quits [Ping timeout: 268 seconds]
14:38:41sepro5 is now known as sepro
14:59:52MrMcNuggets (MrMcNuggets) joins
14:59:53MrMcNuggets quits [Client Quit]
15:01:12MrMcNuggets (MrMcNuggets) joins
15:06:00MrMcNuggets quits [Read error: Connection reset by peer]
15:17:07croissant` quits [Quit: Leaving]
15:25:20croissant` joins
15:52:33hamouda joins
16:17:12Webuser515876 joins
16:18:12Webuser515876 quits [Client Quit]
16:21:59etnguyen03 (etnguyen03) joins
16:36:12MrMcNuggets (MrMcNuggets) joins
16:41:56etnguyen03 quits [Client Quit]
16:46:54ducky_ (ducky) joins
16:48:48ducky quits [Ping timeout: 268 seconds]
16:48:50ducky_ is now known as ducky
17:08:43etnguyen03 (etnguyen03) joins
17:43:55etnguyen03 quits [Client Quit]
17:51:03BearFortress quits []
17:53:52dabs joins
17:54:50dabs quits [Remote host closed the connection]
17:55:02dabs joins
18:04:46<PC>steering: a bit late but i wanted to say re: archive.is/archive.today, there are some URLs that i've been able to archive on there that haven't worked anywhere else, so i think it'd definitely be good to keep the idea of mirroring those archives somehow on the backburner (though its URLs don't give any info for what the archived URL is, meaning one has to rely on its search, as something to keep in mind. ideally i'd just see those archives mirrored on the WBM d
18:05:26<klea>PC: there's a timegate thing to get the full uri
18:06:42<klea>http://archive.today/2023.05.25-100732/https://github.com/
18:07:30<PC>oh that's cool! didn't know that
18:07:39<klea>Click "Share" button to get those.
18:07:43<PC>nice, thanks
18:07:50<klea>You're welcome.
18:08:40<PC>if its archives do end up up somewhere, would be good to have a way to find them with just the archived site's URL though. given that the site's URL is after a specific timestamp... that'd make finding them if they were on the WBM a lot trickier, mm
18:13:27<skankhunt42>hey :) I was wondering if there is a medica wiki page already? I just threw a few workers in but couldn't find a overall description for that project
18:15:09nexussfan (nexussfan) joins
18:23:21BearFortress joins
18:30:43iseaup (iseaup) joins
18:30:51Webuser735964 joins
18:34:31Webuser735964 quits [Client Quit]
19:14:20ducky quits [Ping timeout: 268 seconds]
19:19:25nexussfan quits [Client Quit]
19:36:19etnguyen03 (etnguyen03) joins
19:45:52nexussfan (nexussfan) joins
20:02:33MrMcNuggets quits [Quit: WeeChat 4.3.2]
20:04:40hamouda quits [Quit: Ooops, wrong browser tab.]
20:24:16ducky (ducky) joins
20:50:10Island joins
20:50:52<nulldata>gamer191-1|m - For Yahoo Groups there was a browser extension that had people join different groups and answer captchas. https://github.com/davidferguson/yahoogroups-joiner
21:09:47Webuser310088 joins
21:10:09Webuser310088 quits [Client Quit]
21:10:39<h2ibot>Systwi created Wordpress (+27, Created an article redirecting "Wordpress" to…): https://wiki.archiveteam.org/?oldid=60764
21:13:40<h2ibot>Systwi edited Wordpress.com (+149, /* Useful pages */ Mentioned wp-content/plugins/.): https://wiki.archiveteam.org/?diff=60765&oldid=60718
21:14:16hamouda joins
21:14:51Webuser563360 joins
21:15:22SootBector quits [Remote host closed the connection]
21:15:33Webuser563360 quits [Client Quit]
21:16:30SootBector (SootBector) joins
21:17:03<klea>Wordpress!=Wordpress.com imho.
21:17:06<@JAA>The internet could always use more confusion between WordPress the software and wordpress.com the commercial hosting service.
21:19:09<@JAA>Ah, [[WordPress]] already exists since 2012 with the same redirect.
21:19:36<@JAA>Yes, they should be separated. And also the P is capitalised in the canonical spelling.
21:19:42<@JAA>(For both the software and the service)
21:21:43<klea>Indeed.
21:22:20<klea>I don't think making an empty page that says not to be confused with [[WordPress.com]], {{underconstruction}}.
21:24:15<pokechu22>Also, the PHP files in wp-content/plugins and wp-includes give 500s or otherwise aren't useful 99% of the time
21:25:18<nicolas17>hi i'm not home can someone look into this https://mastodon.social/@jackyan/116269031342079180 admin complains about archivebot traffic
21:25:27Webuser354350 joins
21:25:31Webuser354350 quits [Client Quit]
21:25:35Webuser858468 joins
21:26:31<pokechu22>Ryz: that's your job
21:28:09TheEnbyperor quits [Ping timeout: 268 seconds]
21:28:18<pokechu22>I guess it's also worth noting that *some*, but not all, wordpress sites have wp-content and wp-includes open directories but will ban your IP if you load too many of them, and also archivebot tends to naturally discover wp-content/plugins and wp-includes from JS extraction
21:30:19<klea>So a normal ignore to put on AB jobs for sites using WordPress is for those, or is that included in the blogs igset?
21:31:30<@JAA>Most don't need anything beyond blogs.
21:31:54<@JAA>There are a couple wordpress.com-specific ignores that are missing from the igset currently.
21:32:27<@JAA>But except for very large blogs, those aren't too problematic either.
21:32:39<klea>AFAIK Updating the ignores database things doesn't require testing?
21:32:40<pokechu22>Normally I don't ignore it unless it runs into problems on the first run, and if it does I add ^https?://kennethforcongress2026\.com/wp-(content|includes)/(.*/)?($|\?) and ^https?://kennethforcongress2026\.com/wp-(content|includes)/.*\.php$
21:32:55<@JAA>xmlrpc.php is another one that can cause bans sometimes.
21:38:43<h2ibot>KleaBot made 2 bot changes: https://wiki.archiveteam.org/index.php?title=Special:Contributions/KleaBot&offset=20260321213844&limit=2&namespace=2&wpfilters%5B%5D=nsInvert&wpfilters%5B%5D=associated
21:40:01<klea>Moved [[Wordpress.com]] into [[WordPress.com]] per JAA's mention of correct capitalization, and also changed the [[Wordpress]] redirect to [[WordPress]]
21:40:09<klea>AAAA
21:40:11<klea>I got one wrong.
21:43:44<h2ibot>Klea edited WordPress (+457, Make a WordPress page.): https://wiki.archiveteam.org/?diff=60769&oldid=7783
21:43:45<h2ibot>Klea edited Wordpress.com (+4, Undo revision 60768 by…): https://wiki.archiveteam.org/?diff=60770&oldid=60768
21:44:44<h2ibot>Klea edited Wordpress (-4, Changed redirect target to [[WordPress]] from…): https://wiki.archiveteam.org/?diff=60771&oldid=60764
21:45:14<klea>aaa idk why i got that wrong.
21:45:37TheEnbyperor joins
21:45:44<h2ibot>Klea edited Wordpress (+0, Changed redirect target from [[Wordpress]] to…): https://wiki.archiveteam.org/?diff=60772&oldid=60771
21:46:32<klea>pokechu22, JAA: Do any of you know if WordPress.com has a open directory like that or xmlrpc bans, or only other WordPress sites do.
21:47:10<pokechu22>I think it's some self-hosted wordpress installs (or perhaps a specific commercial wordpress host other than wordpress.com) that is configured that way
21:47:21<pokechu22>I don't think I've seen it on wordpress.com
21:47:55Vito` joins
21:57:14SootBector quits [Remote host closed the connection]
21:58:46<h2ibot>Klea edited WordPress.com (-1245, Split between [[WordPress]] and [[WordPress.com]]): https://wiki.archiveteam.org/?diff=60773&oldid=60766
21:58:47<h2ibot>Klea edited WordPress (+1586, Split between [[WordPress]] and [[WordPress.com]]): https://wiki.archiveteam.org/?diff=60774&oldid=60769
21:59:03Wohlstand quits [Quit: Wohlstand]
21:59:03SootBector (SootBector) joins
21:59:46<h2ibot>Klea edited WordPress.com (+27, Fix references): https://wiki.archiveteam.org/?diff=60775&oldid=60773
22:02:04^ quits [Ping timeout: 268 seconds]
22:03:41^ (^) joins
22:05:26Webuser655004 joins
22:06:15Webuser655004 quits [Client Quit]
22:10:51hamouda quits [Client Quit]
22:10:55SootBector quits [Remote host closed the connection]
22:12:00SootBector (SootBector) joins
22:14:04Webuser858468 quits [Client Quit]
22:18:06TheEnbyperor quits [Read error: Connection reset by peer]
22:18:14Webuser444492 joins
22:18:26Webuser444492 quits [Client Quit]
22:32:48Webuser817259 joins
22:33:36Webuser817259 quits [Client Quit]
22:39:12TheEnbyperor joins
22:39:22TheEnbyperor_ (TheEnbyperor) joins
22:43:56Webuser241918 joins
22:44:45Webuser241918 quits [Client Quit]
22:46:00SootBector quits [Remote host closed the connection]
22:47:05SootBector (SootBector) joins
23:01:07skyrocket joins
23:05:35skankhunt42 quits [Ping timeout: 268 seconds]
23:07:50Webuser984529 joins
23:09:08Webuser984529 quits [Client Quit]
23:19:10<klea>Huh, I got a bad take for managing Deathwatch.
23:19:35<klea>https://www.mediawiki.org/wiki/Extension:Wikibase?useskin=vector To have our own Wikidata like thing.
23:20:29<klea>Oh, that extension doesn't provide the other fun things Wikidata offers, and in any case would probably not be a good fit for a tracking system for websites.
23:20:59polypeptide (polypeptide) joins
23:37:47Shard111582 quits [Read error: Connection reset by peer]
23:39:45Arcorann_ (Arcorann) joins
23:41:06Shard111582 (Shard) joins
23:44:31Webuser692424 joins
23:45:34Shard111582 quits [Read error: Connection reset by peer]
23:45:57Shard111582 (Shard) joins
23:51:05DrowsyCrow joins
23:59:14TastyWiener95 quits [Ping timeout: 268 seconds]