| 00:43:32 | | Vito` quits [Client Quit] |
| 00:43:39 | | etnguyen03 (etnguyen03) joins |
| 00:46:46 | | Hackerpcs quits [Quit: Hackerpcs] |
| 00:47:41 | | Hackerpcs (Hackerpcs) joins |
| 00:51:02 | | wickedplayer494 quits [Ping timeout: 268 seconds] |
| 00:52:03 | | wickedplayer494 (wickedplayer494) joins |
| 00:53:28 | | etnguyen03 quits [Client Quit] |
| 01:04:59 | | etnguyen03 (etnguyen03) joins |
| 01:26:37 | | etnguyen03 quits [Client Quit] |
| 01:33:23 | | etnguyen03 (etnguyen03) joins |
| 01:36:40 | | azalea_sh_ quits [Ping timeout: 268 seconds] |
| 01:36:53 | | azalea_sh_ (azalea_sh_) joins |
| 01:41:50 | | retrograde quits [Remote host closed the connection] |
| 01:43:51 | | retrograde (retrograde) joins |
| 01:55:55 | | etnguyen03 quits [Client Quit] |
| 02:13:50 | <systwi_> | Would anybody know how to extract errored URLs from a -meta.warc? |
| 02:14:25 | <@JAA> | `little-things/wpull2-log-extract-errors` |
| 02:15:08 | <@JAA> | You can simply pipe the decompressed meta WARC to it; it'll ignore the non-log lines. |
| 02:15:32 | <@JAA> | The self-test example should clarify what's being extracted. |
| 02:20:07 | <systwi_> | Ahh, close! I was looking at warc-dump-responses initially. I saw "log" on that and wasn't sure if that was for the right thing. |
| 02:20:11 | <systwi_> | Thanks JAA. :-) |
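A minimal Python sketch of the piping JAA describes, under stated assumptions: the meta WARC is gzip-compressed, the file name is a placeholder, and `wpull2-log-extract-errors` is available as a local executable. The function name `extract_errors` is hypothetical, and the extractor's output format is not assumed here; the sketch only returns whatever lines it prints.

```python
import gzip
import subprocess


def extract_errors(meta_warc_path, extractor_cmd):
    """Decompress a gzip file and feed it to extractor_cmd on stdin.

    Returns the command's stdout as a list of text lines.
    The whole file is read into memory, which is fine for a sketch
    but may not suit very large meta WARCs.
    """
    with gzip.open(meta_warc_path, "rb") as f:
        data = f.read()
    result = subprocess.run(
        extractor_cmd,
        input=data,
        stdout=subprocess.PIPE,
        check=True,  # raise if the extractor exits with a non-zero status
    )
    return result.stdout.decode("utf-8", errors="replace").splitlines()


# Hypothetical usage (path and file name are assumptions):
# for line in extract_errors("example-meta.warc.gz",
#                            ["./little-things/wpull2-log-extract-errors"]):
#     print(line)
```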
| 02:20:50 | | etnguyen03 (etnguyen03) joins |
| 03:18:14 | | etnguyen03 quits [Remote host closed the connection] |
| 03:31:31 | | DogsRNice quits [Read error: Connection reset by peer] |
| 03:53:34 | | PC quits [Ping timeout: 268 seconds] |
| 04:28:20 | | steering7253 (steering) joins |
| 04:28:40 | | steering7253 quits [Client Quit] |
| 05:04:29 | | n9nes quits [Ping timeout: 268 seconds] |
| 05:05:50 | | n9nes joins |
| 05:09:30 | | nexussfan quits [Quit: Konversation terminated!] |
| 05:10:15 | <h2ibot> | Limon edited Angelfire (+583, added shutdown of angelfire by lycos): https://wiki.archiveteam.org/?diff=60757&oldid=60102 |
| 05:10:16 | <h2ibot> | TheCarbonFreeze edited Deathwatch (+685, /* 2026 */): https://wiki.archiveteam.org/?diff=60758&oldid=60728 |
| 05:10:17 | <h2ibot> | John5433 edited 4chan (+1844, /* ultra.gondola.pics */ added all the boards): https://wiki.archiveteam.org/?diff=60759&oldid=60730 |
| 05:10:18 | <h2ibot> | John5433 edited Soyjak.party (+556, /* Archives */): https://wiki.archiveteam.org/?diff=60760&oldid=60732 |
| 05:10:19 | <h2ibot> | John5433 edited Deathwatch (+1, spelling mistake): https://wiki.archiveteam.org/?diff=60761&oldid=60758 |
| 05:12:15 | <h2ibot> | JustAnotherArchivist edited Angelfire (+30, Datetimeify): https://wiki.archiveteam.org/?diff=60762&oldid=60757 |
| 05:12:16 | <h2ibot> | JustAnotherArchivist edited Angelfire (+0, Not (officially) offline yet): https://wiki.archiveteam.org/?diff=60763&oldid=60762 |
| 06:25:53 | | mr_sarge quits [Ping timeout: 268 seconds] |
| 06:26:17 | | mr_sarge (sarge) joins |
| 06:29:27 | | Island quits [Read error: Connection reset by peer] |
| 06:44:28 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 07:03:52 | | TheEnbyperor joins |
| 07:04:53 | <pabs> | !tell v01d a bunch of archive.org alternatives are on the wiki: https://wiki.archiveteam.org/index.php/Archive_Services https://wiki.archiveteam.org/index.php/Category:Web_archiving_services |
| 07:04:54 | <eggdrop> | [tell] ok, I'll tell v01d when they join next |
| 07:06:41 | <pabs> | triplecamera|m: AX_* are from the autoconf-archive package in Debian at least |
| 07:14:54 | <pabs> | klea: discuss.ropensci.org shut already in Feb it seems (NXDOMAIN now). last AB was 20251203 |
| 07:15:40 | <pabs> | schwarzkatz|m: might be worth moving linktree-clones.md to a page Linktree/Clones on the wiki? |
| 07:44:36 | | PC (PC) joins |
| 07:54:38 | | steering (steering) joins |
| 07:54:52 | | steering quits [Client Quit] |
| 07:55:37 | | steering (steering) joins |
| 08:17:29 | | LddPotato quits [Read error: Connection reset by peer] |
| 08:18:44 | | LddPotato (LddPotato) joins |
| 08:19:13 | | Webuser032448 joins |
| 08:19:13 | | Webuser032448 quits [Client Quit] |
| 08:20:25 | | Webuser244402 joins |
| 08:26:07 | | Webuser235954 joins |
| 08:27:54 | | Webuser235954 quits [Client Quit] |
| 08:28:22 | | APOLLO03 quits [Quit: .] |
| 08:32:47 | | LddPotato quits [Read error: Connection reset by peer] |
| 08:33:25 | | LddPotato (LddPotato) joins |
| 08:43:17 | | LddPotato quits [Read error: Connection reset by peer] |
| 08:44:02 | | LddPotato (LddPotato) joins |
| 08:47:49 | | LddPotato quits [Read error: Connection reset by peer] |
| 08:48:27 | | LddPotato (LddPotato) joins |
| 08:58:02 | | Webuser389403 joins |
| 08:58:18 | | Webuser389403 quits [Client Quit] |
| 09:00:04 | | LddPotato quits [Read error: Connection reset by peer] |
| 09:00:57 | | LddPotato (LddPotato) joins |
| 09:30:53 | | pabs quits [Ping timeout: 268 seconds] |
| 09:33:32 | | pabs (pabs) joins |
| 11:00:02 | | Bleo18260072271962345522201 quits [Quit: The Lounge - https://thelounge.chat] |
| 11:00:46 | | Webuser578944 joins |
| 11:00:53 | | Webuser578944 quits [Client Quit] |
| 11:02:48 | | Bleo18260072271962345522201 joins |
| 11:03:16 | <alexlehm> | pabs: where is the linktree-clones file now? |
| 11:12:45 | | VerifiedJ quits [Remote host closed the connection] |
| 11:13:22 | | VerifiedJ7 (VerifiedJ) joins |
| 11:19:57 | <pabs> | alexlehm: see the backlog: <schwarzkatz|m> azalea_sh_: I have not followed the conversation, but I gathered you might be interested: I created a list of linktree clones quite a while ago, idk how many of them still work https://transfer.archivete.am/GPs4e/linktree-clones.md |
| 11:19:58 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/GPs4e/linktree-clones.md |
| 11:20:39 | <alexlehm> | thank you, i missed it in the backlog for some reason |
| 12:00:27 | | APOLLO03 joins |
| 12:06:31 | <klea> | oh :( |
| 12:06:50 | <klea> | I should move it then to the past. |
| 12:07:07 | <klea> | I wonder if we should make a wiki page for them. |
| 12:07:29 | <klea> | It seems like it'd be better, since the list is already public. |
| 12:21:43 | | Wohlstand1 (Wohlstand) joins |
| 12:22:01 | | Webuser244402 quits [Quit: Ooops, wrong browser tab.] |
| 12:24:09 | | Wohlstand1 is now known as Wohlstand |
| 12:37:33 | | Webuser526815 joins |
| 12:38:19 | | Webuser526815 quits [Client Quit] |
| 13:32:42 | | Arcorann_ quits [Ping timeout: 268 seconds] |
| 13:37:03 | <gamer191-1|m> | Regarding archive.today, has archiveteam archived any projects with captchas in the past? |
| 13:48:28 | | pseudorizer quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 13:49:09 | | pseudorizer (pseudorizer) joins |
| 13:50:00 | | Webuser512333 joins |
| 13:50:11 | | Webuser512333 quits [Client Quit] |
| 13:52:26 | | pseudorizer quits [Client Quit] |
| 13:52:36 | | nexussfan (nexussfan) joins |
| 13:54:06 | | pseudorizer (pseudorizer) joins |
| 13:56:53 | | revi quits [Quit: Connection closed for inactivity] |
| 14:08:24 | | sepro5 (sepro) joins |
| 14:10:51 | | sepro quits [Ping timeout: 268 seconds] |
| 14:10:51 | | sepro5 is now known as sepro |
| 14:20:10 | | nexussfan quits [Client Quit] |
| 14:25:44 | | sepro quits [Ping timeout: 268 seconds] |
| 14:28:16 | | sepro (sepro) joins |
| 14:29:25 | | FiTheArchiver joins |
| 14:36:11 | | sepro5 (sepro) joins |
| 14:37:49 | | FiTheArchiver quits [Client Quit] |
| 14:38:41 | | sepro quits [Ping timeout: 268 seconds] |
| 14:38:41 | | sepro5 is now known as sepro |
| 14:59:52 | | MrMcNuggets (MrMcNuggets) joins |
| 14:59:53 | | MrMcNuggets quits [Client Quit] |
| 15:01:12 | | MrMcNuggets (MrMcNuggets) joins |
| 15:06:00 | | MrMcNuggets quits [Read error: Connection reset by peer] |
| 15:17:07 | | croissant` quits [Quit: Leaving] |
| 15:25:20 | | croissant` joins |
| 15:52:33 | | hamouda joins |
| 16:17:12 | | Webuser515876 joins |
| 16:18:12 | | Webuser515876 quits [Client Quit] |
| 16:21:59 | | etnguyen03 (etnguyen03) joins |
| 16:36:12 | | MrMcNuggets (MrMcNuggets) joins |
| 16:41:56 | | etnguyen03 quits [Client Quit] |
| 16:46:54 | | ducky_ (ducky) joins |
| 16:48:48 | | ducky quits [Ping timeout: 268 seconds] |
| 16:48:50 | | ducky_ is now known as ducky |
| 17:08:43 | | etnguyen03 (etnguyen03) joins |
| 17:43:55 | | etnguyen03 quits [Client Quit] |
| 17:51:03 | | BearFortress quits [] |
| 17:53:52 | | dabs joins |
| 17:54:50 | | dabs quits [Remote host closed the connection] |
| 17:55:02 | | dabs joins |
| 18:04:46 | <PC> | steering: a bit late but i wanted to say re: archive.is/archive.today, there are some URLs that i've been able to archive on there that haven't worked anywhere else, so i think it'd definitely be good to keep the idea of mirroring those archives somehow on the backburner (though its URLs don't give any info for what the archived URL is, meaning one has to rely on its search, as something to keep in mind. ideally i'd just see those archives mirrored on the WBM d |
| 18:05:26 | <klea> | PC: there's a timegate thing to get the full uri |
| 18:06:42 | <klea> | http://archive.today/2023.05.25-100732/https://github.com/ |
| 18:07:30 | <PC> | oh that's cool! didn't know that |
| 18:07:39 | <klea> | Click "Share" button to get those. |
| 18:07:43 | <PC> | nice, thanks |
| 18:07:50 | <klea> | You're welcome. |
| 18:08:40 | <PC> | if its archives do end up up somewhere, would be good to have a way to find them with just the archived site's URL though. given that the site's URL is after a specific timestamp... that'd make finding them if they were on the WBM a lot trickier, mm |
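A rough Python sketch of pulling the capture time and the original URL out of a link shaped like the one klea posted. The layout `YYYY.MM.DD-HHMMSS` is inferred from that single example and is an assumption, as is accepting both the `archive.today` and `archive.is` hosts; `parse_snapshot_url` is a hypothetical helper name.

```python
import re
from datetime import datetime

# Assumed layout: http(s)://archive.today/<YYYY.MM.DD-HHMMSS>/<original URL>
SNAPSHOT_RE = re.compile(
    r"^https?://archive\.(?:today|is)/(\d{4}\.\d{2}\.\d{2}-\d{6})/(.+)$"
)


def parse_snapshot_url(url):
    """Return (capture_datetime, original_url), or None if the URL
    does not follow the assumed timestamped layout."""
    match = SNAPSHOT_RE.match(url)
    if match is None:
        return None
    captured_at = datetime.strptime(match.group(1), "%Y.%m.%d-%H%M%S")
    return captured_at, match.group(2)


captured_at, original = parse_snapshot_url(
    "http://archive.today/2023.05.25-100732/https://github.com/"
)
print(captured_at.isoformat(), original)
```

With links in this form, an index keyed on the original URL could be built from the second element, which is the kind of lookup PC asks about.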
| 18:13:27 | <skankhunt42> | hey :) I was wondering if there is a medica wiki page already? I just threw a few workers in but couldn't find an overall description for that project |
| 18:15:09 | | nexussfan (nexussfan) joins |
| 18:23:21 | | BearFortress joins |
| 18:30:43 | | iseaup (iseaup) joins |
| 18:30:51 | | Webuser735964 joins |
| 18:34:31 | | Webuser735964 quits [Client Quit] |
| 19:14:20 | | ducky quits [Ping timeout: 268 seconds] |
| 19:19:25 | | nexussfan quits [Client Quit] |
| 19:36:19 | | etnguyen03 (etnguyen03) joins |
| 19:45:52 | | nexussfan (nexussfan) joins |
| 20:02:33 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
| 20:04:40 | | hamouda quits [Quit: Ooops, wrong browser tab.] |
| 20:24:16 | | ducky (ducky) joins |
| 20:50:10 | | Island joins |
| 20:50:52 | <nulldata> | gamer191-1|m - For Yahoo Groups there was a browser extension that had people join different groups and answer captchas. https://github.com/davidferguson/yahoogroups-joiner |
| 21:09:47 | | Webuser310088 joins |
| 21:10:09 | | Webuser310088 quits [Client Quit] |
| 21:10:39 | <h2ibot> | Systwi created Wordpress (+27, Created an article redirecting "Wordpress" to…): https://wiki.archiveteam.org/?oldid=60764 |
| 21:13:40 | <h2ibot> | Systwi edited Wordpress.com (+149, /* Useful pages */ Mentioned wp-content/plugins/.): https://wiki.archiveteam.org/?diff=60765&oldid=60718 |
| 21:14:16 | | hamouda joins |
| 21:14:51 | | Webuser563360 joins |
| 21:15:22 | | SootBector quits [Remote host closed the connection] |
| 21:15:33 | | Webuser563360 quits [Client Quit] |
| 21:16:30 | | SootBector (SootBector) joins |
| 21:17:03 | <klea> | Wordpress!=Wordpress.com imho. |
| 21:17:06 | <@JAA> | The internet could always use more confusion between WordPress the software and wordpress.com the commercial hosting service. |
| 21:19:09 | <@JAA> | Ah, [[WordPress]] already exists since 2012 with the same redirect. |
| 21:19:36 | <@JAA> | Yes, they should be separated. And also the P is capitalised in the canonical spelling. |
| 21:19:42 | <@JAA> | (For both the software and the service) |
| 21:21:43 | <klea> | Indeed. |
| 21:22:20 | <klea> | I don't think making an empty page that says not to be confused with [[WordPress.com]], {{underconstruction}}. |
| 21:24:15 | <pokechu22> | Also, the PHP files in wp-content/plugins and wp-includes give 500s or otherwise aren't useful 99% of the time |
| 21:25:18 | <nicolas17> | hi i'm not home can someone look into this https://mastodon.social/@jackyan/116269031342079180 admin complains about archivebot traffic |
| 21:25:27 | | Webuser354350 joins |
| 21:25:31 | | Webuser354350 quits [Client Quit] |
| 21:25:35 | | Webuser858468 joins |
| 21:26:31 | <pokechu22> | Ryz: that's your job |
| 21:28:09 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 21:28:18 | <pokechu22> | I guess it's also worth noting that *some*, but not all, wordpress sites have wp-content and wp-includes open directories but will ban your IP if you load too many of them, and also archivebot tends to naturally discover wp-content/plugins and wp-includes from JS extraction |
| 21:30:19 | <klea> | So a normal ignore to put on AB jobs for sites using WordPress is for those, or is that included in the blogs igset? |
| 21:31:30 | <@JAA> | Most don't need anything beyond blogs. |
| 21:31:54 | <@JAA> | There are a couple wordpress.com-specific ignores that are missing from the igset currently. |
| 21:32:27 | <@JAA> | But except for very large blogs, those aren't too problematic either. |
| 21:32:39 | <klea> | AFAIK Updating the ignores database things doesn't require testing? |
| 21:32:40 | <pokechu22> | Normally I don't ignore it unless it runs into problems on the first run, and if it does I add ^https?://kennethforcongress2026\.com/wp-(content|includes)/(.*/)?($|\?) and ^https?://kennethforcongress2026\.com/wp-(content|includes)/.*\.php$ |
| 21:32:55 | <@JAA> | xmlrpc.php is another one that can cause bans sometimes. |
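To see what pokechu22's two ignore patterns cover, here is a small Python sketch that checks them against sample URLs. The patterns are copied verbatim from the message above; the sample URLs and the helper name `is_ignored` are made up for illustration, and Python's `re` is assumed to treat these patterns the same way ArchiveBot does.

```python
import re

# Patterns as posted in the channel.
IGNORES = [
    re.compile(r"^https?://kennethforcongress2026\.com/wp-(content|includes)/(.*/)?($|\?)"),
    re.compile(r"^https?://kennethforcongress2026\.com/wp-(content|includes)/.*\.php$"),
]


def is_ignored(url):
    """True if any ignore pattern matches the URL."""
    return any(pattern.search(url) for pattern in IGNORES)


# Hypothetical sample URLs.
samples = [
    "https://kennethforcongress2026.com/wp-content/plugins/",              # directory listing
    "https://kennethforcongress2026.com/wp-content/plugins/foo/?C=N;O=D",  # sorted listing
    "https://kennethforcongress2026.com/wp-includes/version.php",          # PHP file
    "https://kennethforcongress2026.com/wp-content/uploads/2024/01/photo.jpg",
    "https://kennethforcongress2026.com/about/",
]
for url in samples:
    print(is_ignored(url), url)
```

The first pattern targets directory-style URLs (ending in `/` or followed by a query string) under `wp-content` and `wp-includes`, and the second targets paths ending in `.php`, so regular files such as uploaded images are still fetched. As written, a `.php` URL with a query string, such as `.../a.php?x=1`, is matched by neither pattern.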
| 21:38:43 | <h2ibot> | KleaBot made 2 bot changes: https://wiki.archiveteam.org/index.php?title=Special:Contributions/KleaBot&offset=20260321213844&limit=2&namespace=2&wpfilters%5B%5D=nsInvert&wpfilters%5B%5D=associated |
| 21:40:01 | <klea> | Moved [[Wordpress.com]] into [[WordPress.com]] per JAA's mention of correct capitalization, and also changed the [[Wordpress]] redirect to [[WordPress]] |
| 21:40:09 | <klea> | AAAA |
| 21:40:11 | <klea> | I got one wrong. |
| 21:43:44 | <h2ibot> | Klea edited WordPress (+457, Make a WordPress page.): https://wiki.archiveteam.org/?diff=60769&oldid=7783 |
| 21:43:45 | <h2ibot> | Klea edited Wordpress.com (+4, Undo revision 60768 by…): https://wiki.archiveteam.org/?diff=60770&oldid=60768 |
| 21:44:44 | <h2ibot> | Klea edited Wordpress (-4, Changed redirect target to [[WordPress]] from…): https://wiki.archiveteam.org/?diff=60771&oldid=60764 |
| 21:45:14 | <klea> | aaa idk why i got that wrong. |
| 21:45:37 | | TheEnbyperor joins |
| 21:45:44 | <h2ibot> | Klea edited Wordpress (+0, Changed redirect target from [[Wordpress]] to…): https://wiki.archiveteam.org/?diff=60772&oldid=60771 |
| 21:46:32 | <klea> | pokechu22, JAA: Do any of you know if WordPress.com has an open directory like that or xmlrpc bans, or only other WordPress sites do? |
| 21:47:10 | <pokechu22> | I think it's some self-hosted wordpress installs (or perhaps a specific commercial wordpress host other than wordpress.com) that is configured that way |
| 21:47:21 | <pokechu22> | I don't think I've seen it on wordpress.com |
| 21:47:55 | | Vito` joins |
| 21:57:14 | | SootBector quits [Remote host closed the connection] |
| 21:58:46 | <h2ibot> | Klea edited WordPress.com (-1245, Split between [[WordPress]] and [[WordPress.com]]): https://wiki.archiveteam.org/?diff=60773&oldid=60766 |
| 21:58:47 | <h2ibot> | Klea edited WordPress (+1586, Split between [[WordPress]] and [[WordPress.com]]): https://wiki.archiveteam.org/?diff=60774&oldid=60769 |
| 21:59:03 | | Wohlstand quits [Quit: Wohlstand] |
| 21:59:03 | | SootBector (SootBector) joins |
| 21:59:46 | <h2ibot> | Klea edited WordPress.com (+27, Fix references): https://wiki.archiveteam.org/?diff=60775&oldid=60773 |
| 22:02:04 | | ^ quits [Ping timeout: 268 seconds] |
| 22:03:41 | | ^ (^) joins |
| 22:05:26 | | Webuser655004 joins |
| 22:06:15 | | Webuser655004 quits [Client Quit] |
| 22:10:51 | | hamouda quits [Client Quit] |
| 22:10:55 | | SootBector quits [Remote host closed the connection] |
| 22:12:00 | | SootBector (SootBector) joins |
| 22:14:04 | | Webuser858468 quits [Client Quit] |
| 22:18:06 | | TheEnbyperor quits [Read error: Connection reset by peer] |
| 22:18:14 | | Webuser444492 joins |
| 22:18:26 | | Webuser444492 quits [Client Quit] |
| 22:32:48 | | Webuser817259 joins |
| 22:33:36 | | Webuser817259 quits [Client Quit] |
| 22:39:12 | | TheEnbyperor joins |
| 22:39:22 | | TheEnbyperor_ (TheEnbyperor) joins |
| 22:43:56 | | Webuser241918 joins |
| 22:44:45 | | Webuser241918 quits [Client Quit] |
| 22:46:00 | | SootBector quits [Remote host closed the connection] |
| 22:47:05 | | SootBector (SootBector) joins |
| 23:01:07 | | skyrocket joins |
| 23:05:35 | | skankhunt42 quits [Ping timeout: 268 seconds] |
| 23:07:50 | | Webuser984529 joins |
| 23:09:08 | | Webuser984529 quits [Client Quit] |
| 23:19:10 | <klea> | Huh, I got a bad take for managing Deathwatch. |
| 23:19:35 | <klea> | https://www.mediawiki.org/wiki/Extension:Wikibase?useskin=vector To have our own Wikidata like thing. |
| 23:20:29 | <klea> | Oh, that extension doesn't provide the other fun things Wikidata offers, and in any case would probably not be a good fit for a tracking system for websites. |
| 23:20:59 | | polypeptide (polypeptide) joins |
| 23:37:47 | | Shard111582 quits [Read error: Connection reset by peer] |
| 23:39:45 | | Arcorann_ (Arcorann) joins |
| 23:41:06 | | Shard111582 (Shard) joins |
| 23:44:31 | | Webuser692424 joins |
| 23:45:34 | | Shard111582 quits [Read error: Connection reset by peer] |
| 23:45:57 | | Shard111582 (Shard) joins |
| 23:51:05 | | DrowsyCrow joins |
| 23:59:14 | | TastyWiener95 quits [Ping timeout: 268 seconds] |