00:06:34SootBector quits [Ping timeout: 276 seconds]
00:07:13HackMii quits [Ping timeout: 276 seconds]
00:08:55SootBector (SootBector) joins
00:09:42HackMii (hacktheplanet) joins
00:35:42rohvani joins
00:59:30whimsysciences joins
01:49:01<pabs>klea: haha, easy bypass for the anti-scraper thing on that madhouse site: set-cookie: x-robot-challenge=passed;path=/
01:50:43Island joins
02:11:33etnguyen03 (etnguyen03) joins
02:42:45second (second) joins
02:43:13sec^nd quits [Ping timeout: 276 seconds]
02:43:13second is now known as sec^nd
02:43:55sg72 quits [Remote host closed the connection]
02:45:03sg72 joins
02:45:55<steering>pabs: any chance your https://wiki.archiveteam.org/index.php/ArchiveBot/Monitoring regexes are somewhere
02:52:52etnguyen03 quits [Client Quit]
02:56:42etnguyen03 (etnguyen03) joins
02:56:52SootBector quits [Ping timeout: 276 seconds]
02:59:18SootBector (SootBector) joins
03:26:35etnguyen03 quits [Client Quit]
03:33:18<pabs>haven't put them anywhere yet
03:33:53<pabs>need to move that stuff to a server, but the one option for that fell through
03:37:07etnguyen03 (etnguyen03) joins
03:39:00<pabs>steering: current ones https://transfer.archivete.am/10CTLV/archivebot-monitoring-categories-match-ignore-regexes.txt
03:39:00<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/10CTLV/archivebot-monitoring-categories-match-ignore-regexes.txt
03:40:30<h2ibot>PaulWise edited ArchiveBot/Monitoring (+97, link current regexes): https://wiki.archiveteam.org/?diff=58503&oldid=58390
03:56:17etnguyen03 quits [Remote host closed the connection]
04:07:47Island quits [Read error: Connection reset by peer]
04:08:46Hackerpcs (Hackerpcs) joins
05:26:30cyanbox quits [Read error: Connection reset by peer]
05:28:04cyanbox joins
05:33:50DogsRNice quits [Read error: Connection reset by peer]
06:10:50stepney141 quits [Quit: Ping timeout (120 seconds)]
06:11:08stepney141 (stepney141) joins
06:21:31Wohlstand (Wohlstand) joins
06:22:44nexussfan quits [Quit: Konversation terminated!]
06:25:00croissant quits [Read error: Connection reset by peer]
06:27:48croissant joins
06:34:54<h2ibot>Klea edited Deathwatch (+173, Add land.to): https://wiki.archiveteam.org/?diff=58504&oldid=58411
06:40:21<klea>stepney141: i've added it to the deathwatch, if you're interested in seeing more about FC2, it's likely a good idea that you join #forbiddencravingcenter :-)
06:57:57<h2ibot>Klea edited ArchiveBot/Ignore (+33, Add references section): https://wiki.archiveteam.org/?diff=58505&oldid=57890
06:58:57<h2ibot>Klea edited ArchiveBot/Ignore (+0, Use [[Template:Url]] directly rather than…): https://wiki.archiveteam.org/?diff=58506&oldid=58505
07:07:22Wohlstand quits [Client Quit]
07:19:44Wohlstand (Wohlstand) joins
07:19:53Wohlstand quits [Client Quit]
07:52:19<pabs>@canonical.com folks responded to my mail about the Ubuntu wikis that are being shut down, seem positive about archiving
07:53:21<pabs>they also want us to save the already-shutdown ubuntuforums.org site by opening it up to us /cc dxrt
08:13:27Shyy46 quits [Quit: Ping timeout (120 seconds)]
08:37:15Wohlstand (Wohlstand) joins
09:11:08Pedrosso0 joins
09:11:11ScenarioPlanet6 (ScenarioPlanet) joins
09:11:31TheTechRobo1 (TheTechRobo) joins
09:12:08Pedrosso quits [Ping timeout: 256 seconds]
09:12:08ScenarioPlanet quits [Ping timeout: 256 seconds]
09:12:08TheTechRobo quits [Ping timeout: 256 seconds]
09:12:08Pedrosso0 is now known as Pedrosso
09:12:08ScenarioPlanet6 is now known as ScenarioPlanet
09:12:08TheTechRobo1 is now known as TheTechRobo
09:13:17<stepney141>@klea: thanks!
09:26:52wessel1512 quits [Ping timeout: 256 seconds]
09:27:07<steering>pabs: thanks :)
09:34:13wessel1512 joins
09:37:30<h2ibot>PaulWise edited ArchiveBot/Ignore (+191, add !ignd eggdrop command for ignoring offsite…): https://wiki.archiveteam.org/?diff=58507&oldid=58506
09:37:31<h2ibot>PaulWise edited ArchiveBot/Ignore (+2, formatting): https://wiki.archiveteam.org/?diff=58508&oldid=58507
09:39:30<h2ibot>PaulWise edited ArchiveBot/Ignore (-2, formatting bleh): https://wiki.archiveteam.org/?diff=58509&oldid=58508
09:40:30<h2ibot>PaulWise edited ArchiveBot/Ignore (-4, typo): https://wiki.archiveteam.org/?diff=58510&oldid=58509
09:40:40Dada joins
10:00:43rohvani quits [Quit: The Lounge - https://thelounge.chat]
10:03:54rohvani joins
10:56:24rohvani quits [Ping timeout: 256 seconds]
11:19:19pabs is now known as RJHacker35549
11:19:36pabs (pabs) joins
11:20:15RJHacker35549 quits [Ping timeout: 272 seconds]
12:01:25pabs quits [Ping timeout: 272 seconds]
12:04:07pabs (pabs) joins
12:08:54APOLLO03a joins
12:10:08APOLLO03a quits [Client Quit]
12:11:18APOLLO03a joins
12:12:11APOLLO03 quits [Ping timeout: 272 seconds]
12:20:10<@dxrt>great news pabs
12:27:29<steering>will non-http schemes even show up in the AB websocket? (i.e. gemini://)
12:27:53<steering>i assume not
12:35:49Juesto (Juest) joins
12:37:50Juest quits [Ping timeout: 256 seconds]
12:37:50Juesto is now known as Juest
12:59:54NF885 (NF885) joins
13:00:56<NF885>klea Nintendofan885 is me
13:01:10<NF885>I meant the page being added to https://wiki.archiveteam.org/index.php/Category:Infobox_project_pages_without_URL
13:05:40NF885 quits [Client Quit]
13:07:44bakedsilica (bakedsilica) joins
13:12:06bakedsilica quits [Remote host closed the connection]
13:17:07bakedsilica (bakedsilica) joins
13:20:04Webuser834516 joins
13:20:36<bakedsilica>FYI, Discourse-based Techlore Forum is now read-only and will be deleted June 1, 2026
13:20:43Webuser834516 quits [Client Quit]
13:21:03<bakedsilica>didn't see it listed on the wiki or elsewhere
13:22:33<bakedsilica>url: https://discuss.techlore.tech/
13:22:33<bakedsilica>announcement: https://techlore.tech/techlores-new-home-our-platform-transition-whats-next/
13:23:34bakedsilica quits [Remote host closed the connection]
13:24:15bakedsilica (bakedsilica) joins
13:24:57<bakedsilica>as an important resource in the privacy space it should be of interest to the Deathwatch
13:34:17<pabs>steering: they don't indeed, only stuff wpull understands. see also https://wiki.archiveteam.org/index.php/SmolNet
13:35:12<pabs>next step for gemini:// etc is to get them into the WARC standard
13:37:02<pabs>bakedsilica: looks like it got saved already https://archive.fart.website/archivebot/viewer/job/202512102113535kb7a
13:41:11<bakedsilica>pabs: ty, i didn't see it in the dashboard and somehow assumed no job could have finished so soon. hope the result is reasonably complete
13:46:14<h2ibot>Klea edited Discourse/uncategorized (+50, Add https://discuss.techlore.tech/): https://wiki.archiveteam.org/?diff=58511&oldid=58498
13:46:17<klea>oh im stupid
13:47:00<pabs>bakedsilica: 79252 onsite URLs fetched, 72142 200 OK, looks plausible
13:48:15<h2ibot>Klea edited Discourse/archived (+96, Add [https://discuss.techlore.tech/]): https://wiki.archiveteam.org/?diff=58512&oldid=58315
13:50:15<h2ibot>Klea edited Discourse/archived (+4, Replace jobid with url based jobid): https://wiki.archiveteam.org/?diff=58513&oldid=58512
14:02:17<h2ibot>Klea edited Discourse/uncategorized (-50, Undo revision 58511 by…): https://wiki.archiveteam.org/?diff=58514&oldid=58511
14:08:42BearFortress quits []
14:21:22<klea>https://lavozdeibiza.com/en/society/robe-iniesta-dies-a-farewell-that-shakes-spanish-rock-and-leaves-a-legacy-impossible-to-ignore/
14:22:41<klea>Robe Iniesta died two days ago, https://lavozdeibiza.com/en/society/robe-iniesta-dies-a-farewell-that-shakes-spanish-rock-and-leaves-a-legacy-impossible-to-ignore/
14:22:45<klea>oh im stupid
14:22:47<klea>i already sent it
14:25:15<justauser>https://www.extremoduro.com/ - obituary already published.
14:26:10<justauser>Social links are at the bottom, behind the cookie notice.
14:32:01BearFortress joins
14:33:53<justauser>Main website running in AB.
14:35:27<klea>justauser: thanks
14:38:52sg72 quits [Remote host closed the connection]
14:44:50sg72 joins
14:45:23<h2ibot>Klea edited Discourse (-7063, Move active Discourses to subpage): https://wiki.archiveteam.org/?diff=58515&oldid=58497
14:45:24<h2ibot>Klea created Discourse/active (+7074, Created page with "*…): https://wiki.archiveteam.org/?title=Discourse/active
14:46:23<h2ibot>KleaBot edited Discourse/archived (+0, Reordered websites): https://wiki.archiveteam.org/?diff=58517&oldid=58513
14:51:24<h2ibot>Klea edited Discourse (+84, Add "edit" links, and include uncategorized): https://wiki.archiveteam.org/?diff=58518&oldid=58515
14:53:15<klea>not sure if we should include the list of uncategorized pages, if not, we could remove that
15:15:13nathang2184 quits [Ping timeout: 272 seconds]
15:46:48gosc joins
15:57:52sg72 quits [Ping timeout: 256 seconds]
15:57:52mrfooooo quits [Remote host closed the connection]
15:58:05mrfooooo joins
15:58:05mrfooooo quits [Remote host closed the connection]
15:58:17mrfooooo joins
15:58:17mrfooooo quits [Remote host closed the connection]
15:59:06chrismeller quits [Quit: Ping timeout (120 seconds)]
15:59:35chrismeller (chrismeller) joins
16:00:27sg72 joins
16:07:34<h2ibot>Justauser edited The WARC Ecosystem (+743, /* Tools */ added browsertrix and crocoite): https://wiki.archiveteam.org/?diff=58519&oldid=57706
16:23:21gosc_1 joins
16:26:46gosc quits [Ping timeout: 256 seconds]
16:32:44<justauser>https://www.robe.es/inicio
16:32:44<justauser>klea: Looks like it's a different person, but I'm not sure - maybe you have a better idea?
16:35:05graham9 joins
16:44:41<gosc_1>hey so I've got a bunch of urls for archivebot with unneeded query parameters (e.g. ?v=texthere), would you guys like the version of the url with or without? or both?
16:49:08<justauser>Rough non-authoritative opinion: leave as-is unless it causes massive duplication.
16:49:41<gosc_1>okay then, will leave it there
16:51:26<gosc_1>there's tons of pages in this list I made which are login-only, remove or keep?
16:51:31<gosc_1>they're all just going to redirect
16:52:35<justauser>On the same terms: my vote goes to removing.
16:52:50<justauser>Ryz: ^
16:53:01<gosc_1>okay then so remove the login pages but keep the url query stuff
16:53:14<gosc_1>just making sure because this list is a little big
17:04:49lemuria quits [Remote host closed the connection]
17:12:08bakedsilica quits [Remote host closed the connection]
17:16:56gosc_1 quits [Client Quit]
17:17:38<klea>justauser: it seems to be his, but instead of the music group, his personal site?
17:18:08<justauser>Okay, running as well.
17:18:37<klea>thanks
17:19:36<klea>btw, extremoduro is a group that, according to Spanish Wikipedia, had an activity period of 1987 - 2019 ( https://es.wikipedia.org/wiki/Extremoduro )
18:21:54<klea>i just got this email from newsletter@hypixel.net
18:21:56<klea>> As a valued member of the Hypixel community, you're getting early access to reserve your Hytale username before early access launches!
18:21:58<klea>> We have acquired Hytale back from Riot Games and are preparing early access for January 13, 2026. We are using the legacy engine from the 2018 trailer and focusing on the original vision of the game.
18:31:56gosc joins
18:33:48<gosc>is there a way to check if a url exists en masse or something?
18:34:06<justauser>CDX API, probably...
18:34:25<gosc>I mean live urls
18:35:14<gosc>I've got a huge list, but all the urls are number based (from 000-999), and there's clearly gaps between the range
18:36:05<justauser>I'd try some variation of binary search.
18:36:50<gosc>there doesn't seem to be any pattern between the urls that don't exist and the ones that do
18:37:15alexlehm quits [Ping timeout: 272 seconds]
18:37:15klea quits [Ping timeout: 272 seconds]
18:37:20<justauser>If there are no patterns, then you are probably left with requesting each of them anyways.
18:37:29<gosc>oh okay
18:37:37<gosc>so the wget HEAD method is truly the only way
18:37:53<gosc>I didn't want to submit a list of 1k urls if most of it was gonna 404 lol
18:38:02<justauser>Try wget2 or some other that can work in parallel.
18:38:12<justauser>1K is tiny on AB scale.
18:38:32<justauser>It hallucinates more 404s on a serious job.
18:38:33<gosc>yes but most of them being 404s wouldn't be a good thing to me I guess
18:38:43<gosc>I have the original wget but yeah I will get wget 2
18:42:04gosc quits [Client Quit]
18:46:16gosc joins
18:47:03<gosc>justauser, okay I dug up an old script of mine, python requests does the job
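[editor's note] gosc's script was not posted in the channel, so the following is only a rough sketch of the approach discussed above: send a HEAD request to each numbered URL, several at a time, and keep the ones that answer 200. It uses the standard library rather than python requests so it runs without extra packages. The function names, the URL template, and the worker count are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
import urllib.error
import urllib.request


def candidate_urls(template, start=0, stop=999, width=3):
    """Build zero-padded numbered URLs, e.g. .../000 through .../999."""
    return [template.format(str(n).zfill(width)) for n in range(start, stop + 1)]


def head_status(url, timeout=10):
    """Send a HEAD request; return the HTTP status code, or None on network failure."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # 404, 403, etc. still carry a status code
    except (urllib.error.URLError, OSError):
        return None  # DNS failure, timeout, refused connection


def live_urls(urls, fetch=head_status, workers=8):
    """Return the URLs whose status is 200, checking several in parallel."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        statuses = list(pool.map(fetch, urls))
    return [url for url, status in zip(urls, statuses) if status == 200]


if __name__ == "__main__":
    # Hypothetical template; substitute the real site pattern.
    for url in live_urls(candidate_urls("https://example.com/item/{}")):
        print(url)
```

Two caveats: urllib follows redirects, so a login-only page that redirects to a working login form would still count as 200 here, and some servers answer HEAD differently from GET. Keeping the worker count low is probably kinder to the target site.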
18:57:18lukash98 joins
18:57:23gosc quits [Client Quit]
18:59:01chrismeller4 (chrismeller) joins
19:01:57chrismeller quits [Ping timeout: 272 seconds]
19:01:57chrismeller4 is now known as chrismeller
19:08:47NF885 (NF885) joins
19:15:25NF885 quits [Client Quit]
19:20:08gosc joins
19:22:27Lord_Nightmare quits [Quit: ZNC - http://znc.in]
19:26:23Lord_Nightmare (Lord_Nightmare) joins
19:29:42gosc quits [Client Quit]
19:49:43Webuser898721 joins
19:50:42Webuser898721 quits [Client Quit]
20:06:51kansei- (kansei) joins
20:08:27kansei quits [Ping timeout: 272 seconds]
20:22:30DogsRNice joins
20:23:14<h2ibot>Nulldata edited Deathwatch (+169, /* 2026 */ Added EyeEm): https://wiki.archiveteam.org/?diff=58520&oldid=58504
20:30:24Wohlstand quits [Quit: Wohlstand]
20:46:31klea (jmjl) joins
20:48:34alexlehm (alexlehm) joins
21:22:45Webuser797948 joins
21:23:13<Webuser797948>Any way to save this? https://x.com/realKalos/status/1999435419394490601
21:23:13<eggdrop>nitter: https://nitter.net/realKalos/status/1999435419394490601
21:27:58rohvani joins
21:29:09<pokechu22>Unfortunately we don't have a way of saving twitter still, as far as I know :/
21:36:00Dada quits [Remote host closed the connection]
21:37:54Dada joins
21:46:46Webuser797948 quits [Client Quit]
22:05:36nexussfan (nexussfan) joins
22:31:45etnguyen03 (etnguyen03) joins
22:44:57etnguyen03 quits [Client Quit]