| 00:06:34 | | SootBector quits [Ping timeout: 276 seconds] |
| 00:07:13 | | HackMii quits [Ping timeout: 276 seconds] |
| 00:08:55 | | SootBector (SootBector) joins |
| 00:09:42 | | HackMii (hacktheplanet) joins |
| 00:35:42 | | rohvani joins |
| 00:59:30 | | whimsysciences joins |
| 01:49:01 | <pabs> | klea: haha, easy bypass for the anti-scraper thing on that madhouse site: set-cookie: x-robot-challenge=passed;path=/ |
| 01:50:43 | | Island joins |
| 02:11:33 | | etnguyen03 (etnguyen03) joins |
| 02:42:45 | | second (second) joins |
| 02:43:13 | | sec^nd quits [Ping timeout: 276 seconds] |
| 02:43:13 | | second is now known as sec^nd |
| 02:43:55 | | sg72 quits [Remote host closed the connection] |
| 02:45:03 | | sg72 joins |
| 02:45:55 | <steering> | pabs: any chance your https://wiki.archiveteam.org/index.php/ArchiveBot/Monitoring regexes are somewhere |
| 02:52:52 | | etnguyen03 quits [Client Quit] |
| 02:56:42 | | etnguyen03 (etnguyen03) joins |
| 02:56:52 | | SootBector quits [Ping timeout: 276 seconds] |
| 02:59:18 | | SootBector (SootBector) joins |
| 03:26:35 | | etnguyen03 quits [Client Quit] |
| 03:33:18 | <pabs> | haven't put them anywhere yet |
| 03:33:53 | <pabs> | need to move that stuff to a server, but the one option for that fell through |
| 03:37:07 | | etnguyen03 (etnguyen03) joins |
| 03:39:00 | <pabs> | steering: current ones https://transfer.archivete.am/10CTLV/archivebot-monitoring-categories-match-ignore-regexes.txt |
| 03:39:00 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/10CTLV/archivebot-monitoring-categories-match-ignore-regexes.txt |
| 03:40:30 | <h2ibot> | PaulWise edited ArchiveBot/Monitoring (+97, link current regexes): https://wiki.archiveteam.org/?diff=58503&oldid=58390 |
| 03:56:17 | | etnguyen03 quits [Remote host closed the connection] |
| 04:07:47 | | Island quits [Read error: Connection reset by peer] |
| 04:08:46 | | Hackerpcs (Hackerpcs) joins |
| 05:26:30 | | cyanbox quits [Read error: Connection reset by peer] |
| 05:28:04 | | cyanbox joins |
| 05:33:50 | | DogsRNice quits [Read error: Connection reset by peer] |
| 06:10:50 | | stepney141 quits [Quit: Ping timeout (120 seconds)] |
| 06:11:08 | | stepney141 (stepney141) joins |
| 06:21:31 | | Wohlstand (Wohlstand) joins |
| 06:22:44 | | nexussfan quits [Quit: Konversation terminated!] |
| 06:25:00 | | croissant quits [Read error: Connection reset by peer] |
| 06:27:48 | | croissant joins |
| 06:34:54 | <h2ibot> | Klea edited Deathwatch (+173, Add land.to): https://wiki.archiveteam.org/?diff=58504&oldid=58411 |
| 06:40:21 | <klea> | stepney141: i've added it to the deathwatch, if you're interested in seeing more about FC2, it's likely a good idea that you join #forbiddencravingcenter :-) |
| 06:57:57 | <h2ibot> | Klea edited ArchiveBot/Ignore (+33, Add references section): https://wiki.archiveteam.org/?diff=58505&oldid=57890 |
| 06:58:57 | <h2ibot> | Klea edited ArchiveBot/Ignore (+0, Use [[Template:Url]] directly rather than…): https://wiki.archiveteam.org/?diff=58506&oldid=58505 |
| 07:07:22 | | Wohlstand quits [Client Quit] |
| 07:19:44 | | Wohlstand (Wohlstand) joins |
| 07:19:53 | | Wohlstand quits [Client Quit] |
| 07:52:19 | <pabs> | @canonical.com folks responded to my mail about the Ubuntu wikis that are being shut down, seem positive about archiving |
| 07:53:21 | <pabs> | they also want us to save the already-shutdown ubuntuforums.org site by opening it up to us /cc dxrt |
| 08:13:27 | | Shyy46 quits [Quit: Ping timeout (120 seconds)] |
| 08:37:15 | | Wohlstand (Wohlstand) joins |
| 09:11:08 | | Pedrosso0 joins |
| 09:11:11 | | ScenarioPlanet6 (ScenarioPlanet) joins |
| 09:11:31 | | TheTechRobo1 (TheTechRobo) joins |
| 09:12:08 | | Pedrosso quits [Ping timeout: 256 seconds] |
| 09:12:08 | | ScenarioPlanet quits [Ping timeout: 256 seconds] |
| 09:12:08 | | TheTechRobo quits [Ping timeout: 256 seconds] |
| 09:12:08 | | Pedrosso0 is now known as Pedrosso |
| 09:12:08 | | ScenarioPlanet6 is now known as ScenarioPlanet |
| 09:12:08 | | TheTechRobo1 is now known as TheTechRobo |
| 09:13:17 | <stepney141> | @klea: thanks! |
| 09:26:52 | | wessel1512 quits [Ping timeout: 256 seconds] |
| 09:27:07 | <steering> | pabs: thanks :) |
| 09:34:13 | | wessel1512 joins |
| 09:37:30 | <h2ibot> | PaulWise edited ArchiveBot/Ignore (+191, add !ignd eggdrop command for ignoring offsite…): https://wiki.archiveteam.org/?diff=58507&oldid=58506 |
| 09:37:31 | <h2ibot> | PaulWise edited ArchiveBot/Ignore (+2, formatting): https://wiki.archiveteam.org/?diff=58508&oldid=58507 |
| 09:39:30 | <h2ibot> | PaulWise edited ArchiveBot/Ignore (-2, formatting bleh): https://wiki.archiveteam.org/?diff=58509&oldid=58508 |
| 09:40:30 | <h2ibot> | PaulWise edited ArchiveBot/Ignore (-4, typo): https://wiki.archiveteam.org/?diff=58510&oldid=58509 |
| 09:40:40 | | Dada joins |
| 10:00:43 | | rohvani quits [Quit: The Lounge - https://thelounge.chat] |
| 10:03:54 | | rohvani joins |
| 10:56:24 | | rohvani quits [Ping timeout: 256 seconds] |
| 11:19:19 | | pabs is now authenticated as * |
| 11:19:19 | | pabs is now known as RJHacker35549 |
| 11:19:36 | | pabs (pabs) joins |
| 11:20:15 | | RJHacker35549 quits [Ping timeout: 272 seconds] |
| 12:01:25 | | pabs quits [Ping timeout: 272 seconds] |
| 12:04:07 | | pabs (pabs) joins |
| 12:08:54 | | APOLLO03a joins |
| 12:10:08 | | APOLLO03a quits [Client Quit] |
| 12:11:18 | | APOLLO03a joins |
| 12:12:11 | | APOLLO03 quits [Ping timeout: 272 seconds] |
| 12:20:10 | <@dxrt> | great news pabs |
| 12:27:29 | <steering> | will non-http schemes even show up in the AB websocket? (i.e. gemini://) |
| 12:27:53 | <steering> | i assume not |
| 12:35:49 | | Juesto (Juest) joins |
| 12:37:50 | | Juest quits [Ping timeout: 256 seconds] |
| 12:37:50 | | Juesto is now known as Juest |
| 12:59:54 | | NF885 (NF885) joins |
| 13:00:56 | <NF885> | klea Nintendofan885 is me |
| 13:01:10 | <NF885> | I meant the page being added to https://wiki.archiveteam.org/index.php/Category:Infobox_project_pages_without_URL |
| 13:05:40 | | NF885 quits [Client Quit] |
| 13:07:44 | | bakedsilica (bakedsilica) joins |
| 13:12:06 | | bakedsilica quits [Remote host closed the connection] |
| 13:17:07 | | bakedsilica (bakedsilica) joins |
| 13:20:04 | | Webuser834516 joins |
| 13:20:36 | <bakedsilica> | FYI, Discourse-based Techlore Forum is now read-only and will be deleted June 1, 2026 |
| 13:20:43 | | Webuser834516 quits [Client Quit] |
| 13:21:03 | <bakedsilica> | didn't see it listed on the wiki or elsewhere |
| 13:22:33 | <bakedsilica> | url: https://discuss.techlore.tech/ |
| 13:22:33 | <bakedsilica> | announcement: https://techlore.tech/techlores-new-home-our-platform-transition-whats-next/ |
| 13:23:34 | | bakedsilica quits [Remote host closed the connection] |
| 13:24:15 | | bakedsilica (bakedsilica) joins |
| 13:24:57 | <bakedsilica> | as an important resource in the privacy space it should be of interest to the Deathwatch |
| 13:34:17 | <pabs> | steering: they don't indeed, only stuff wpull understands. see also https://wiki.archiveteam.org/index.php/SmolNet |
| 13:35:12 | <pabs> | next step for gemini:// etc is to get them into the WARC standard |
| 13:37:02 | <pabs> | bakedsilica: looks like it got saved already https://archive.fart.website/archivebot/viewer/job/202512102113535kb7a |
| 13:41:11 | <bakedsilica> | pabs: ty, i didn't see it in the dashboard and somehow assumed no job could have finished so soon. hope the result is reasonably complete |
| 13:46:14 | <h2ibot> | Klea edited Discourse/uncategorized (+50, Add https://discuss.techlore.tech/): https://wiki.archiveteam.org/?diff=58511&oldid=58498 |
| 13:46:17 | <klea> | oh im stupid |
| 13:47:00 | <pabs> | bakedsilica: 79252 onsite URLs fetched, 72142 200 OK, looks plausible |
| 13:48:15 | <h2ibot> | Klea edited Discourse/archived (+96, Add [https://discuss.techlore.tech/]): https://wiki.archiveteam.org/?diff=58512&oldid=58315 |
| 13:50:15 | <h2ibot> | Klea edited Discourse/archived (+4, Replace jobid with url based jobid): https://wiki.archiveteam.org/?diff=58513&oldid=58512 |
| 14:02:17 | <h2ibot> | Klea edited Discourse/uncategorized (-50, Undo revision 58511 by…): https://wiki.archiveteam.org/?diff=58514&oldid=58511 |
| 14:08:42 | | BearFortress quits [] |
| 14:21:22 | <klea> | https://lavozdeibiza.com/en/society/robe-iniesta-dies-a-farewell-that-shakes-spanish-rock-and-leaves-a-legacy-impossible-to-ignore/ |
| 14:22:41 | <klea> | Robe Iniesta died two days ago, https://lavozdeibiza.com/en/society/robe-iniesta-dies-a-farewell-that-shakes-spanish-rock-and-leaves-a-legacy-impossible-to-ignore/ |
| 14:22:45 | <klea> | oh im stupid |
| 14:22:47 | <klea> | i already sent it |
| 14:25:15 | <justauser> | https://www.extremoduro.com/ - obituary already published. |
| 14:26:10 | <justauser> | Social links are at the bottom, behind the cookie notice. |
| 14:32:01 | | BearFortress joins |
| 14:33:53 | <justauser> | Main website running in AB. |
| 14:35:27 | <klea> | justauser: thanks |
| 14:38:52 | | sg72 quits [Remote host closed the connection] |
| 14:44:50 | | sg72 joins |
| 14:45:23 | <h2ibot> | Klea edited Discourse (-7063, Move active Discourses to subpage): https://wiki.archiveteam.org/?diff=58515&oldid=58497 |
| 14:45:24 | <h2ibot> | Klea created Discourse/active (+7074, Created page with "*…): https://wiki.archiveteam.org/?title=Discourse/active |
| 14:46:23 | <h2ibot> | KleaBot edited Discourse/archived (+0, Reordered websites): https://wiki.archiveteam.org/?diff=58517&oldid=58513 |
| 14:51:24 | <h2ibot> | Klea edited Discourse (+84, Add "edit" links, and include uncategorized): https://wiki.archiveteam.org/?diff=58518&oldid=58515 |
| 14:53:15 | <klea> | not sure if we should include the list of uncategorized pages, if not, we could remove that |
| 15:15:13 | | nathang2184 quits [Ping timeout: 272 seconds] |
| 15:46:48 | | gosc joins |
| 15:57:52 | | sg72 quits [Ping timeout: 256 seconds] |
| 15:57:52 | | mrfooooo quits [Remote host closed the connection] |
| 15:58:05 | | mrfooooo joins |
| 15:58:05 | | mrfooooo quits [Remote host closed the connection] |
| 15:58:17 | | mrfooooo joins |
| 15:58:17 | | mrfooooo quits [Remote host closed the connection] |
| 15:59:06 | | chrismeller quits [Quit: Ping timeout (120 seconds)] |
| 15:59:35 | | chrismeller (chrismeller) joins |
| 16:00:27 | | sg72 joins |
| 16:07:34 | <h2ibot> | Justauser edited The WARC Ecosystem (+743, /* Tools */ added browsertrix and crocoite): https://wiki.archiveteam.org/?diff=58519&oldid=57706 |
| 16:23:21 | | gosc_1 joins |
| 16:26:46 | | gosc quits [Ping timeout: 256 seconds] |
| 16:32:44 | <justauser> | https://www.robe.es/inicio |
| 16:32:44 | <justauser> | klea: Looks like it's a different person, but I'm not sure - maybe you have a better idea? |
| 16:35:05 | | graham9 joins |
| 16:44:41 | <gosc_1> | hey so I've got a bunch of urls for archivebot with not needed query parameters (eg. ?v=texthere), would you guys like the version of the url with or without? or both? |
| 16:49:08 | <justauser> | Rough non-authoritative opinion: leave as-is unless it causes massive duplication. |
| 16:49:41 | <gosc_1> | okay then, will leave it there |
| 16:51:26 | <gosc_1> | there's tons of pages in this list I made which are login-only, remove or keep? |
| 16:51:31 | <gosc_1> | they're all just going to redirect |
| 16:52:35 | <justauser> | On the same terms: my vote goes to removing. |
| 16:52:50 | <justauser> | Ryz: ^ |
| 16:53:01 | <gosc_1> | okay then so remove the login pages but keep the url query stuff |
| 16:53:14 | <gosc_1> | just making sure because this list is a little big |
| 17:04:49 | | lemuria quits [Remote host closed the connection] |
| 17:12:08 | | bakedsilica quits [Remote host closed the connection] |
| 17:16:56 | | gosc_1 quits [Client Quit] |
| 17:17:38 | <klea> | justauser: it seems to be his, but instead of the music group, his personal site? |
| 17:18:08 | <justauser> | Okay, running as well. |
| 17:18:37 | <klea> | thanks |
| 17:19:36 | <klea> | btw, extremoduro is a group that according to spanish's wikipedia, had a activity period of 1987 - 2019 ( https://es.wikipedia.org/wiki/Extremoduro ) |
| 18:21:54 | <klea> | i just got this email from newsletter@hypixel.net |
| 18:21:56 | <klea> | > As a valued member of the Hypixel community, you're getting early access to reserve your Hytale username before early access launches! |
| 18:21:58 | <klea> | > We have acquired Hytale back from Riot Games and are preparing early access for January 13, 2026. We are using the legacy engine from the 2018 trailer and focusing on the original vision of the game. |
| 18:31:56 | | gosc joins |
| 18:33:48 | <gosc> | is there a way to check if a url exists en masse or something? |
| 18:34:06 | <justauser> | CDX API, probably... |
| 18:34:25 | <gosc> | I mean live urls |
| 18:35:14 | <gosc> | I've got a huge list, but all the urls are number based (from 000-999), and there's clearly gaps between the range |
| 18:36:05 | <justauser> | I'd try some variation of binary search. |
| 18:36:50 | <gosc> | there doesn't seem to be any pattern between the urls that don't exist and the ones that do |
| 18:37:15 | | alexlehm quits [Ping timeout: 272 seconds] |
| 18:37:15 | | klea quits [Ping timeout: 272 seconds] |
| 18:37:20 | <justauser> | If there are no patterns, than you are probably left with requesting each of them anyways. |
| 18:37:29 | <gosc> | oh okay |
| 18:37:37 | <gosc> | so the wget HEAD method is truly the only way |
| 18:37:53 | <gosc> | I didn't want to submit a list of 1k urls if most of it was gonna 404 lol |
| 18:38:02 | <justauser> | Try wget2 or some other that can work in parallel. |
| 18:38:12 | <justauser> | 1K is tiny on AB scale. |
| 18:38:32 | <justauser> | It hallucinates more 404s on a serious job. |
| 18:38:33 | <gosc> | yes but most of them being 404s wouldn't be a good thing to me I guess |
| 18:38:43 | <gosc> | I have the original wget but yeah I will get wget 2 |
| 18:42:04 | | gosc quits [Client Quit] |
| 18:46:16 | | gosc joins |
| 18:47:03 | <gosc> | justauser, okay I dug up an old script of mine, python requests does the job |
| 18:57:18 | | lukash98 joins |
| 18:57:23 | | gosc quits [Client Quit] |
| 18:59:01 | | chrismeller4 (chrismeller) joins |
| 19:01:57 | | chrismeller quits [Ping timeout: 272 seconds] |
| 19:01:57 | | chrismeller4 is now known as chrismeller |
| 19:08:47 | | NF885 (NF885) joins |
| 19:15:25 | | NF885 quits [Client Quit] |
| 19:20:08 | | gosc joins |
| 19:22:27 | | Lord_Nightmare quits [Quit: ZNC - http://znc.in] |
| 19:26:23 | | Lord_Nightmare (Lord_Nightmare) joins |
| 19:29:42 | | gosc quits [Client Quit] |
| 19:49:43 | | Webuser898721 joins |
| 19:50:42 | | Webuser898721 quits [Client Quit] |
| 20:06:51 | | kansei- (kansei) joins |
| 20:08:27 | | kansei quits [Ping timeout: 272 seconds] |
| 20:22:30 | | DogsRNice joins |
| 20:23:14 | <h2ibot> | Nulldata edited Deathwatch (+169, /* 2026 */ Added EyeEm): https://wiki.archiveteam.org/?diff=58520&oldid=58504 |
| 20:30:24 | | Wohlstand quits [Quit: Wohlstand] |
| 20:46:31 | | klea (jmjl) joins |
| 20:48:34 | | alexlehm (alexlehm) joins |
| 21:22:45 | | Webuser797948 joins |
| 21:23:13 | <Webuser797948> | Anyway to save this? https://x.com/realKalos/status/1999435419394490601 |
| 21:23:13 | <eggdrop> | nitter: https://nitter.net/realKalos/status/1999435419394490601 |
| 21:27:58 | | rohvani joins |
| 21:29:09 | <pokechu22> | Unfortunately we don't have a way of saving twitter still, as far as I know :/ |
| 21:36:00 | | Dada quits [Remote host closed the connection] |
| 21:37:54 | | Dada joins |
| 21:46:46 | | Webuser797948 quits [Client Quit] |
| 22:05:36 | | nexussfan (nexussfan) joins |
| 22:31:45 | | etnguyen03 (etnguyen03) joins |
| 22:44:57 | | etnguyen03 quits [Client Quit] |