00:00:07<klea>cruller: should we also write about the things for tomorrow and next day (ie 2026-01-01)?
00:01:46<@JAA>klea: Yeah, see recent discussion about section transclusions. I haven't looked at that closer yet though.
00:01:52<klea>> 2026-01-01: Pair.com mailing lists (PairLists) will be shut down. While this has not yet been announced on their website, this has been emailed to list managers. https://coco3.org/TCCN/TCCN-432.html https://pairlist7.pair.net/pipermail/techtoolslist/2025q3/003870.html
00:02:32<klea>is there some way to get the list of all lists' names?, based on https://pairlist7.pair.net/pipermail/techtoolslist/, i suppose other lists would also have indexes that can be ABed
00:04:10lunik1 joins
00:05:21<cruller>klea: At the very least, I will add a note regarding the canceled and postponed ones.
00:05:33<klea>good
00:06:31<klea>https://en.wikipedia.org/wiki/Help:Labeled_section_transclusion
00:07:05<klea>JAA: i was thinking of doing the inverse, but yeah it seems like a fine idea, making a bunch of labels and then smaller pages that transclude the deathwatch using those labels
00:07:18<cruller>That aside, I believe Deathwatch should have an archiving status field. However, it shouldn't be so resource-intensive to input that it hinders archiving.
00:07:47<hexagonwin>arkiver is ktoon finished? if 'series:' items are HTML, doesn't it mean the latest/last downloads shouldn't be 'series:' since they always discover new stuff (episode/image)?
00:07:55<@JAA>The thing is more that it clutters the page even further.
00:08:09<@JAA>Not that the current approach is good, mind you.
00:08:19<hexagonwin>i'm just slightly worried due to its small size, and we have about 15hrs left till deadline
00:08:49<nicolas17>didn't we talk about turning deathwatch into a ticket system once? :P
00:09:31<klea>inb4 we start running phorge
00:09:45<klea>nicolas17: if you have time could you link me to irclogs about it?
00:10:33<klea>JAA: <section begin=dying-2025-01 />¨CONTENT¨<section end=dying-2025-01 /> doesn't look so bad
00:10:54<@JAA>nicolas17: We talked about that kind of thing, but as an additional thing, not a replacement, I think.
00:11:19<klea>i don't know how we could let users easily edit those sections; i was thinking of doing the inverse, making deathwatch as small as possible and linking outwards
00:11:43<klea>then those sections of the deathwatch per year would be smaller, even if it'd make moving things between sections slightly annoying.
00:12:22<klea>we could do per year and have the section tag thingies per month (and also add to some specific days if there's too many entries)
00:12:28<@JAA>If that moving were a rare task, whatever, but it's definitely not (and should be more frequent than we do).
00:12:35<nicolas17>maybe leave Deathwatch as it is, but create a new page for every site and add more info there, how hard could it be (:
00:12:54<@JAA>I mean, Deathwatch already links to the pages when they exist.
00:13:56<klea>we can add more section labels for dates, add more spaces for when editing the wikitext (is there a way to make those not show?) and have a section of machine generated transclusion pages for easier viewing
00:14:07<klea>but that still clutters it a bit much.
00:14:08Dada joins
00:14:18<@JAA>Keeping editing simple is important, too.
00:14:39<@JAA>As in, random people adding things that are shutting down.
00:15:04Dada quits [Remote host closed the connection]
00:16:19<@OrIdow6>Yeah
00:18:58<@OrIdow6>I do like the wiki being a big public notepad without understanding much machinery needed to edit
00:19:55<klea>that's why i said putting lots of spaces (i meant newlines), and maybe comments to make it clear how to add stuff easily
00:23:02<cruller>13. https://www.greun.nu/ — This must be one of "some that are still unfinished but can be completed even now." Just 25 pages.
00:23:07<cruller>14. https://community.ns.nl/ — Saved with AB on 2025-06-21. It has been offline from December 15.
00:23:12<cruller>15. https://www.myktoon.com/ — Saved with DPoS.
00:23:26<cruller>Ex1. https://arca.live/ https://refsheet.net/ https://horaro.org/ are still online.
00:25:39<cruller>Sorry for sending so many messages :P
00:29:47<klea>cruller: don't worry, the messages are actually useful :)
00:36:44<DigitalDragons>Maybe issues in a repo on the gitea is a nice simple ticketing system? Then you can either make an issue as you add something to deathwatch or someone else can go back and make an issue later
00:36:52<DigitalDragons>(is the gitea open registration?)
00:39:43<@OrIdow6>cruller: It's tradition - https://irclogs.archivete.am/archiveteam-bs/2024-12-16#laf550d06
00:41:38<@OrIdow6>I feel like we need a wiki page summarizing discussion of the gitea idea
00:43:11<klea>DigitalDragons: i believe gitea's not configured as open registration
00:43:12etnguyen03 quits [Client Quit]
00:44:02<DigitalDragons>right, okay
00:44:11<DigitalDragons>Phorge or something would be cool but someone would have to set that up and I think people are busy
00:44:45Guest58 quits [Read error: Connection reset by peer]
00:44:57klea kindly asks busy for a little small container to setup a lamp stack
00:45:27<@OrIdow6>Previous discussions on this: https://irclogs.archivete.am/archiveteam-bs/2021-06-05#lf2101768 https://irclogs.archivete.am/archiveteam-bs/2022-08-27#l5344ff5e https://irclogs.archivete.am/archiveteam-bs/2024-11-06#la3a5dcb3
00:45:56<klea>OrIdow6: thanks!
00:46:30<@OrIdow6>Interesting how it was originally merged with the idea of scanning social media for shutdown discussion
00:47:57<klea>https://irclogs.archivete.am/archiveteam-bs/2024-11-06#l2fe6fe7a highlighting **no one wanted to host it**, i wonder, are people trying to avoid maintenance?
00:49:30<nicolas17>tbh I think we should try gitlab.com or something just to experiment with the ticket workflow, rather than bikeshedding what exact software to use or who will host it
00:49:40<nicolas17>and either way it can wait until january :D
00:50:32<klea>or next year :p
00:50:47klea sees opp
00:50:50<klea>sorry
00:52:52<eggdrop>[remind] OrIdow6: add realkalos to hoaxes
00:53:06cyanbox joins
00:53:12<@OrIdow6>eggdrop: What does that mean, how long ago did I ask for this remind
00:53:34<nicolas17>dec 16
00:53:40<klea>OrIdow6: https://irclogs.archivete.am/archiveteam-bs/2025-12-17#l641ab70f
00:53:42<@OrIdow6>Whatever
00:53:42<@OrIdow6>!remindme 2w add realkalos to hoaxes
00:53:43<eggdrop>[remind] ok, i'll remind you at 2026-01-14T00:53:42Z
00:54:03<klea>lol
00:54:12<nicolas17>procrastination++
00:54:12<eggdrop>[karma] 'procrastination' now has 79 karma!
00:54:25<klea>procrastination++
00:54:26<eggdrop>[karma] 'procrastination' now has 80 karma!
00:55:25<@OrIdow6>I'm evidently on IRC around the same time
00:56:10<klea>[remind] OrIdow6 that is a request by yourself to add realkalos (what is that?) to https://wiki.archiveteam.org/index.php/Shutdown_rumors,_hoaxes,_and_scares
01:10:42<@JAA>OrIdow6: Yeah, that general plan is still on the menu. There were some blockers that were only very recently sorted out.
01:11:43etnguyen03 (etnguyen03) joins
01:22:10<h2ibot>DigitalDragon created Better Tracking for Deathwatch (+1671, Created page with "''See past discussions on…): https://wiki.archiveteam.org/?title=Better%20Tracking%20for%20Deathwatch
01:22:55<DigitalDragons>I did my best to summarize but might've missed some stuff
01:34:11<h2ibot>Cruller edited Deathwatch (+373, Update regarding CreativeLive and the Shogi…): https://wiki.archiveteam.org/?diff=59058&oldid=59030
02:04:21nexussfan quits [Quit: Konversation terminated!]
02:43:32Island joins
02:59:23<h2ibot>PaulWise edited Internet Archive/Save Page Now (+198, add more SPN blocklist entries): https://wiki.archiveteam.org/?diff=59059&oldid=58724
03:23:53<tzt>Baker & Taylor, a library services company, will shut down in 2026. A lot of library catalogs rely on it for book metadata. It looks like baker-taylor.com was saved in AB, but contentcafe2.btol.com, which has the book metadata, has not been archived. The SSL certificate for baker-taylor.com expired 12 days ago and has not been renewed, so it seems that shutdown is imminent.
03:42:00<tzt>The service uses ASP.NET Web Forms, with a POST request link on each of the pages with book data, but it can be reformatted to GET. Here is an example URL: https://contentcafe2.btol.com/ContentCafeClient/ContentCafe.aspx?UserID=ebsco-test&Password=ebsco-test&ItemKey=0679405593
03:42:12etnguyen03 quits [Client Quit]
03:42:45etnguyen03 (etnguyen03) joins
03:43:09<tzt>The ItemKey is the book ISBN
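
A minimal sketch of how a URL list for ArchiveBot could be generated from a list of ISBNs, following the GET form of the example URL above. The function names `content_cafe_url` and `url_list` are hypothetical, and the sketch assumes the `ebsco-test` credentials from the example keep working for every ItemKey, which nobody in the channel has confirmed.

```python
from urllib.parse import urlencode

# Endpoint taken from the example URL quoted in the channel.
BASE = "https://contentcafe2.btol.com/ContentCafeClient/ContentCafe.aspx"


def content_cafe_url(isbn, user_id="ebsco-test", password="ebsco-test"):
    """Build the GET URL for one book; the ItemKey is the book's ISBN."""
    query = urlencode({"UserID": user_id, "Password": password, "ItemKey": isbn})
    return f"{BASE}?{query}"


def url_list(isbns):
    """Turn an iterable of ISBN strings into URLs, skipping blank entries."""
    return [content_cafe_url(i.strip()) for i in isbns if i.strip()]


if __name__ == "__main__":
    for url in url_list(["0679405593"]):
        print(url)
```

The resulting list would still depend on having a source of ISBNs, which is the open question raised below.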
03:46:32etnguyen03 quits [Remote host closed the connection]
03:56:16<nicolas17>yikes
04:04:09<nulldata>Do we have a list of ISBNs?
04:29:28Wohlstand (Wohlstand) joins
04:36:53<@arkiver>hexagonwin: i think it's finished... we went through the user uploaded stuff, since as you mentioned the other comics seemed to be behind a wall
04:37:34DogsRNice quits [Read error: Connection reset by peer]
04:38:20<@arkiver>justauser: can https://transfer.archivete.am/aqSnd/pair.com_pair.net_pairlist.net_ddg.txt simply go in ArchiveBot?
04:38:20<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/aqSnd/pair.com_pair.net_pairlist.net_ddg.txt
04:38:32<@arkiver>and https://transfer.archivete.am/dlfpw/pair.com_pair.net_pairlist.net.cdx.zst
04:39:17<@arkiver>nicolas17: klea: you can use https://archive.fart.website/archivebot/viewer/ to see if at least something ran for a site
04:41:24<@arkiver>tzt: is there a more precise deadline than "2026"?
04:48:55<@OrIdow6>DigitalDragons: Thanks!
04:53:39<h2ibot>OrIdow6 edited Better Tracking for Deathwatch (+422, On Mediawiki extensions): https://wiki.archiveteam.org/?diff=59060&oldid=59057
04:55:16<@arkiver>good a web page is being made on this :)
04:56:00<@arkiver>i think it is important to keep things simple, so anyone would be able to add an entry without requiring technical knowledge
04:56:09<@arkiver>or too many accounts in various places
05:00:41<@JAA>https://transfer.archivete.am/dlfpw/pair.com_pair.net_pairlist.net.cdx.zst is surts, so definitely not AB-able.
05:05:17<@arkiver>interesting, why format it like that :P
05:05:52<@arkiver>nearly 500k URLs
05:06:05<@arkiver>may be doable with AB in time
05:06:13<@arkiver>with high concurrency
05:07:57<@JAA>Probably took the first field from the CDX API response instead of the original URL.
05:11:49<@arkiver>ah didn't know it was taken from the CDX API
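
A rough sketch of the two ways around the SURT problem described above: either take the original-URL field from each CDX API line, or turn a SURT key back into a URL. It assumes the Wayback CDX API's default space-separated field order (urlkey, timestamp, original, ...); the function names are hypothetical. Reversing a SURT is lossy (scheme, case, and query ordering are not recoverable), so the first option is preferable when the full CDX lines are available.

```python
def original_url_from_cdx_line(line):
    """Return the third field (the original URL) of a space-separated CDX line."""
    fields = line.split(" ")
    if len(fields) < 3:
        raise ValueError(f"not a CDX line: {line!r}")
    return fields[2]


def surt_to_url(surt, scheme="https"):
    """Best-effort reversal of a SURT key such as 'net,pair,pairlist7)/pipermail/'.

    The scheme is a guess (an assumption of this sketch), since SURT keys
    do not record it.
    """
    host_part, sep, path = surt.partition(")")
    if not sep:
        raise ValueError(f"not a SURT key: {surt!r}")
    host_part, _, port = host_part.partition(":")
    # SURT stores host labels in reverse order, separated by commas.
    host = ".".join(reversed([label for label in host_part.split(",") if label]))
    if port:
        host = f"{host}:{port}"
    return f"{scheme}://{host}{path or '/'}"


if __name__ == "__main__":
    print(surt_to_url("net,pair,pairlist7)/pipermail/techtoolslist/"))
```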
05:17:49<@JAA>And I had somehow missed the context of this disappearing.
05:18:41<@JAA>Looking into throwing these into AB.
05:22:43<h2ibot>PaulWise edited Obstacles (+61, Vercel Security Checkpoint, Deflect): https://wiki.archiveteam.org/?diff=59061&oldid=58626
05:23:43<h2ibot>JustAnotherArchivist edited Deathwatch (+161, /* 2026 */ Add PE-Community.eu): https://wiki.archiveteam.org/?diff=59062&oldid=59058
05:30:50<hexagonwin>arkiver: i see... it does seem like *some* non-user-uploaded comics are still public, like https://v2.myktoon.com/web/works/list.kt?worksseq=3
05:31:44<hexagonwin>seems like stuff linked on https://v2.myktoon.com/web/homescreen/box_webtoon.kt ("박스오피스" on top navbar) is largely not paywalled?
05:34:32SootBector quits [Remote host closed the connection]
05:35:39SootBector (SootBector) joins
05:37:11<@JAA>Ugh, there's duplication, e.g. https://pairlist2.pair.net/pipermail/wolfdev/ == https://two.pairlist.net/pipermail/wolfdev/
05:48:43<tzt>tzt: it appears that Content Cafe has been acquired according to this post but it will be transitioned to "LibraryOne" https://old.reddit.com/r/librarians/comments/1pq6fog/baker_and_taylor_not_dead_yet_well_at_least_part/
05:48:49<tzt>arkiver:
05:59:47Webuser041704 joins
06:07:51Webuser041704 quits [Client Quit]
06:13:18<@JAA>Oh
06:13:45<@JAA>I think I found a way to get a complete list of the mailing lists.
06:15:14SootBector quits [Remote host closed the connection]
06:16:21SootBector (SootBector) joins
06:21:13<pabs>JAA++
06:21:14<eggdrop>[karma] 'JAA' now has 326 karma!
06:28:59gosc joins
06:34:53<h2ibot>PaulWise edited Better Tracking for Deathwatch (+440, custom software idea): https://wiki.archiveteam.org/?diff=59063&oldid=59060
06:45:29<gosc>archivebot can handle 700k urls right?
06:47:09<pabs>yes, could take a while though
06:47:27<gosc>oh nice
06:47:35<pabs>if you also need fast, DPoS is the only option
06:47:43<gosc>I see
06:47:57<pokechu22>or split the list into several and do several archivebot jobs
06:48:03<pabs>or qwarc I guess
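
The "split the list into several" suggestion could be sketched like this. It only shows the list handling; `split_evenly` is a hypothetical helper, and how many jobs are sensible is a judgment call for whoever runs them.

```python
def split_evenly(urls, n_jobs):
    """Split a URL list into n_jobs contiguous chunks whose sizes differ by at most one."""
    if n_jobs < 1:
        raise ValueError("n_jobs must be at least 1")
    base, extra = divmod(len(urls), n_jobs)
    chunks, start = [], 0
    for i in range(n_jobs):
        # The first `extra` chunks take one additional URL each.
        size = base + (1 if i < extra else 0)
        chunks.append(urls[start:start + size])
        start += size
    return chunks


if __name__ == "__main__":
    urls = [f"https://example.com/page/{i}" for i in range(10)]
    for n, chunk in enumerate(split_evenly(urls, 3), 1):
        print(f"job {n}: {len(chunk)} URLs")
```

Each chunk would then be written to its own file and submitted as a separate job.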
07:04:58<h2ibot>PaulWise edited In The Media (+124, Honey fraud video): https://wiki.archiveteam.org/?diff=59064&oldid=58985
07:14:51Island quits [Read error: Connection reset by peer]
07:16:16<@arkiver>qwarc may be a good fit in this case
07:18:35<@JAA>Which case?
07:27:32<@arkiver>to get a copy of pair.com lists
07:27:37<@arkiver>JAA: ^
07:29:16driib97 quits [Quit: Ping timeout (120 seconds)]
07:29:51<@arkiver>cruller: from what you checked - do you think there are any that need a custom project?
07:30:13driib97 (driib) joins
07:30:24Island joins
07:36:44driib97 quits [Ping timeout: 256 seconds]
07:40:47<@JAA>arkiver: Not a terrible fit, but qwarc isn't really good at generic recursion.
07:44:35<cruller>arkiver: I've only checked whether starting AB now would finish it today, but AFAIK there aren't.
08:04:07<@JAA>It appears that pairlistN.pair.net is the canonical location for the lists. The {one,two,...}.pairlist.net servers redirect there on some URLs.
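
One possible way to collapse the duplicate hosts before queueing, sketched under the assumption that the word-named servers map one-to-one onto the numbered ones (`two.pairlist.net` to `pairlist2.pair.net`, and likewise for one through ten). Only the `two`/`2` pair is confirmed above; the rest of the mapping and the helper names are assumptions.

```python
from urllib.parse import urlsplit

# Assumed mapping: only "two" -> pairlist2 was actually observed in the channel.
WORDS = ["one", "two", "three", "four", "five",
         "six", "seven", "eight", "nine", "ten"]
WORD_TO_HOST = {
    f"{word}.pairlist.net": f"pairlist{n}.pair.net"
    for n, word in enumerate(WORDS, 1)
}


def canonicalize(url):
    """Rewrite a {one,two,...}.pairlist.net URL to its pairlistN.pair.net form."""
    parts = urlsplit(url)
    new_host = WORD_TO_HOST.get((parts.hostname or "").lower())
    if new_host is None:
        return url  # Not a known alias; leave untouched.
    # Note: any explicit port on the alias host is dropped here.
    return parts._replace(netloc=new_host).geturl()


def dedupe(urls):
    """Canonicalize every URL and drop repeats, keeping first-seen order."""
    seen, out = set(), []
    for url in urls:
        canonical = canonicalize(url)
        if canonical not in seen:
            seen.add(canonical)
            out.append(canonical)
    return out


if __name__ == "__main__":
    print(dedupe([
        "https://pairlist2.pair.net/pipermail/wolfdev/",
        "https://two.pairlist.net/pipermail/wolfdev/",
    ]))
```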
08:09:32<cruller>https://digizeitschriften.de/id/{id} won't load with noscript, but since there are no POST requests or complex processing, I think creating a URL and doing AB should suffice. (If you want to archive each document's metadata)
08:10:59<cruller>s/URL/URL list/
08:13:23<cruller>However, the metadata for each document may be available from another database if its ID is known.
08:17:48<cruller>Oh, digizeitschriften seems to have just gone offline... nvm
08:23:33tzt quits [Ping timeout: 272 seconds]
08:25:14tzt (tzt) joins
08:31:33Island quits [Read error: Connection reset by peer]
08:34:04beastbg8_ joins
08:38:07beastbg8__ quits [Ping timeout: 272 seconds]
09:16:26<@JAA>AB jobs for pairlistN.pair.net are running. One !ao < job for the /mailman/listinfo/x and one !a < job per host (1 through 10) for /pipermail/x.
09:17:10<@JAA>The CDX API data has a lot of noise, but I left everything that was workable in the list, so that gets some 404s.
09:20:33APOLLO03 quits [Ping timeout: 272 seconds]
09:25:00lennier2 joins
09:25:36APOLLO03 joins
09:27:14lennier2_ quits [Ping timeout: 256 seconds]
09:35:07nathang2184 quits [Ping timeout: 272 seconds]
09:42:48nathang2184 joins
09:53:47<cruller>I've also checked the 6 things on 2026-01-01. It seems none have been archived yet except for channel attitude.com.
09:57:43<@JAA>PairLists and PE-Community are both running through AB.
09:59:20<cruller>Yeah, I forgot to check the dashboard...
10:03:36<cruller>I just saw a job for quantum-mirror.hu. Thanks.
10:07:13<cruller>When running noscript, https://www.pe-community.eu/ displays the message “JavaScript ist deaktiviert. Für eine bessere Darstellung aktiviere bitte JavaScript in deinem Browser, bevor du fortfährst.” However, at first glance, it seems to be working fine.
10:12:19<@JAA>Yeah, XenForo works fine without JS.
10:16:08Dada joins
10:22:12Wohlstand quits [Quit: Wohlstand]
10:32:54archiveDrill quits [Quit: The Lounge - https://thelounge.chat]
10:35:41gosc quits [Quit: Leaving]
10:38:26<klea>pabs: can you check if the SPN blocklist is for /generate_204 directly, iirc google has other domains that also generate 204s
10:38:33<h2ibot>Cruller edited Deathwatch (+184, The Berean Test cancelled its planned shutdown.): https://wiki.archiveteam.org/?diff=59065&oldid=59062
11:01:04<klea>i'm going to try to setup a phorge instance to toy with it, if it becomes more production desired use, i'll probably ask for a subdomain for it, so it's not relying on a third party i'm not entirely sure i trust to keep paying a domain :)
11:14:16<pabs>not easily, I only saw the error in my SPN email response
11:19:59<klea>thanks
11:25:54<cruller>https://support.google.com/chrome/a/answer/6334001 These "generate_204"s are also blocked.
11:34:55<cruller>Ah yes, generate_204 in any domain or directory is blocked. (Even if it's example.com)
11:35:53<cruller>It's a very simple blocking rule.
11:40:41<h2ibot>Cruller edited Internet Archive/Save Page Now (-27, /* Blocks */ Any generate_204 is blocked): https://wiki.archiveteam.org/?diff=59066&oldid=59059
11:42:06<cruller>Incidentally, https://example.com/generate_204/a.txt isn't blocked.
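
The observations above are consistent with a rule that matches on the last path segment only. The sketch below is a guess at that behavior based purely on the three cases reported here, not SPN's actual implementation; `looks_blocked` is a hypothetical name, and cases such as a trailing slash or a query string were not tested in the channel.

```python
from urllib.parse import urlsplit


def looks_blocked(url):
    """Guess whether SPN would refuse a URL under the observed generate_204 rule.

    Assumption: the block applies when the final path segment is exactly
    'generate_204', regardless of domain or directory.
    """
    path = urlsplit(url).path
    return path.rsplit("/", 1)[-1] == "generate_204"


if __name__ == "__main__":
    for url in (
        "https://example.com/generate_204",
        "https://example.com/some/dir/generate_204",
        "https://example.com/generate_204/a.txt",
    ):
        print(url, looks_blocked(url))
```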
11:43:32PaiMei quits [Quit: Ping timeout (120 seconds)]
11:48:27PaiMei (PaiMei) joins
11:50:05Dada quits [Remote host closed the connection]
11:54:54croissant_ joins
11:57:37croissant quits [Ping timeout: 272 seconds]
12:00:02Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat]
12:02:45Bleo182600722719623455222 joins
12:03:26lennier2_ joins
12:06:29lennier2 quits [Ping timeout: 272 seconds]
12:10:51Wohlstand (Wohlstand) joins
12:19:21T31M quits [Quit: ZNC - https://znc.in]
12:21:13T31M joins
12:26:48<h2ibot>Manu edited Discourse/archived (+93, Queued forums.fast.ai): https://wiki.archiveteam.org/?diff=59067&oldid=59056
12:29:27<c3manu>lucky coincidence, https://forum.posit.co/ has a notice at the top about sunsetting bookdown.org
12:29:48<h2ibot>Manu edited Discourse/archived (+93, Queued forum.posit.co): https://wiki.archiveteam.org/?diff=59068&oldid=59067
12:31:50Dada joins
12:33:01<c3manu>https://posit.co/blog/bookdown-org-sunset/
12:37:49<h2ibot>Manu edited Deathwatch (+192, 2027: bookdown.org): https://wiki.archiveteam.org/?diff=59069&oldid=59065
12:38:04Dada quits [Remote host closed the connection]
12:38:28<c3manu>it’s only taken down end of Jan 2027, but the source files won’t be available starting Jan 31 in 2026 already
12:38:56<c3manu>also some books listed in the archive https://bookdown.org/home/archive/ are hosted on custom domains, like https://r-graphics.org/ for example
12:39:31<c3manu>not sure whether the source files are public or only accessible to the author/uploader
12:40:32<c3manu>^ does it make sense to create small wiki pages for as little information as that?
12:42:30Dada joins
13:09:49TheEnbyperor quits [Ping timeout: 272 seconds]
13:10:54<h2ibot>Manu edited Discourse (+385, Known Victims: Add discussions.udacity.com): https://wiki.archiveteam.org/?diff=59070&oldid=58834
13:11:04TheEnbyperor_ quits [Ping timeout: 256 seconds]
13:11:54<h2ibot>Manu edited Discourse/active (-62, Udacity Discussion Board no longer active): https://wiki.archiveteam.org/?diff=59071&oldid=58835
13:13:54<h2ibot>Manu edited Discourse/archived (+99, Queued forum.torproject.org): https://wiki.archiveteam.org/?diff=59072&oldid=59068
13:16:55<h2ibot>Manu edited Discourse/archived (+95, Queued devforum.zoom.us): https://wiki.archiveteam.org/?diff=59073&oldid=59072
13:22:37TheEnbyperor joins
13:25:37TheEnbyperor_ (TheEnbyperor) joins
13:31:01<h2ibot>Manu edited Discourse/archived (+100, Queued community.toradex.com): https://wiki.archiveteam.org/?diff=59074&oldid=59073
13:38:23sec^nd quits [Remote host closed the connection]
13:38:48sec^nd (second) joins