01:16:40seadog007__ joins
01:23:04ericgallager joins
01:35:09ducky_ (ducky) joins
01:37:10ducky quits [Ping timeout: 268 seconds]
01:37:11ducky_ is now known as ducky
01:38:20cyanbox joins
01:38:24polypept1 (polypeptide) joins
01:41:42polypeptide quits [Ping timeout: 240 seconds]
02:06:41ats quits [Ping timeout: 268 seconds]
02:10:54ats (ats) joins
02:13:53Arcorann (Arcorann) joins
02:17:50<h2ibot>Cooljeanius edited SmolNet (+306, copyedit for readability (section headings and…): https://wiki.archiveteam.org/?diff=61137&oldid=60427
02:34:24beastbg8_ joins
02:38:45beastbg8 quits [Ping timeout: 268 seconds]
02:45:55<h2ibot>PaulWise edited Web Roasting (+51, add WHTop /cc Ryz): https://wiki.archiveteam.org/?diff=61138&oldid=61116
02:51:55<h2ibot>PaulWise edited Web Roasting (+147, more URL sources): https://wiki.archiveteam.org/?diff=61139&oldid=61138
02:57:51etnguyen03 quits [Remote host closed the connection]
03:11:27Starchives_ quits [Quit: Leaving]
03:12:56Island quits [Quit: Leaving]
03:17:25DogsRNice quits [Read error: Connection reset by peer]
03:19:15wickedplayer494 (wickedplayer494) joins
03:44:35hackbug quits [Remote host closed the connection]
03:46:48hackbug joins
04:00:02Starchives (Starchives) joins
04:04:33n9nes quits [Ping timeout: 268 seconds]
04:05:38n9nes joins
04:09:38michaelblob7641 quits [Quit: yoop]
04:10:17michaelblob7641 joins
04:15:07<h2ibot>PaulWise edited Discord (+194, add urgent archiving procedures, pad for…): https://wiki.archiveteam.org/?diff=61140&oldid=61123
04:24:08<h2ibot>John5433 edited 4chan (+921, /* Lost Archives */): https://wiki.archiveteam.org/?diff=61141&oldid=61126
04:25:08<h2ibot>John5433 edited 4chan (+7, /* yeet.net */): https://wiki.archiveteam.org/?diff=61142&oldid=61141
04:31:28Nekroschizofrenetyk joins
04:32:09<h2ibot>John5433 uploaded File:Yukila.png: https://wiki.archiveteam.org/?title=File%3AYukila.png
04:34:34Nekroschizofrenetyk quits [Client Quit]
04:36:10<h2ibot>John5433 edited 4chan (+190, /* yuki.la */): https://wiki.archiveteam.org/?diff=61144&oldid=61142
04:38:10<h2ibot>John5433 uploaded File:Screenshot 2026-04-22 16-36-57.png: https://wiki.archiveteam.org/?title=File%3AScreenshot%202026-04-22%2016-36-57.png
04:39:10<h2ibot>John5433 edited 4chan (+268, /* boards.deniableplausibility.net */): https://wiki.archiveteam.org/?diff=61146&oldid=61144
04:39:29Nekroschizofrenetyk joins
04:42:11<h2ibot>John5433 uploaded File:Firedenn.png: https://wiki.archiveteam.org/?title=File%3AFiredenn.png
04:46:11<h2ibot>John5433 edited 4chan (+236, /* foolz.fireden.net (Static Archive.moe…): https://wiki.archiveteam.org/?diff=61148&oldid=61146
05:25:46seadog007__ quits [Quit: Connection closed for inactivity]
05:45:12nexussfan quits [Quit: Konversation terminated!]
05:59:19Nekroschizofrenetyk quits [Client Quit]
06:10:31MPThLee quits [Quit: bye]
06:12:34MPThLee (MPThLee) joins
06:56:48Webuser108409 joins
06:56:49Webuser108409 quits [Client Quit]
07:33:20Webuser232291 joins
07:33:41Webuser232291 quits [Client Quit]
07:54:01Sluggs quits [Quit: ZNC - http://znc.in]
08:00:40Sluggs (Sluggs) joins
08:05:02SootBector quits [Remote host closed the connection]
08:06:09SootBector (SootBector) joins
08:10:36barry quits [Ping timeout: 268 seconds]
08:12:44barry joins
08:13:20HP_Archivist quits [Read error: Connection reset by peer]
08:18:50Paw-chivist joins
08:18:56<Paw-chivist>Hi everyone ! :)
08:19:04<Paw-chivist>(from HexChat this time :D)
08:27:11<cruller>Hello~
08:28:26<Paw-chivist>I forgot to say "Good night" yesterday, sorry to everyone that was here.
08:28:58<Paw-chivist>I don't have the logs, so, did we figured what to do with my massive list of URLs for French Public librairies ?
08:29:42<cruller>You want https://irclogs.archivete.am/archiveteam-bs/2026-04-22 ?
08:31:19<Paw-chivist>Yes, thanks a lot ! <3
08:32:33<cruller>It doesn't seem like there's been any progress since you quit yesterday.
08:33:50nathang21843 quits [Read error: Connection reset by peer]
08:34:37<Paw-chivist>Yep, good news, I don't have to read a lot :P
08:35:00<Paw-chivist>So, I will create a list for the already dead ones, like JAA suggested
08:35:55<h2ibot>User edited France (-105253, /* Public librairies */): https://wiki.archiveteam.org/?diff=61149&oldid=61136
08:36:55<h2ibot>User created France/Public librairies (+105372, Created page with "== Still up links == ===…): https://wiki.archiveteam.org/?oldid=61150
08:37:05pseudorizer quits [Quit: ZNC 1.10.1 - https://znc.in]
08:37:41pseudorizer (pseudorizer) joins
08:40:55<h2ibot>User edited France/Public librairies (+25529, /* Already dead links */): https://wiki.archiveteam.org/?diff=61151&oldid=61150
08:41:13<Paw-chivist>Done ! :3
08:41:15nathang21843 joins
08:44:03<cruller>Good!
08:47:36<Paw-chivist>klea told me on archivebot to send a list of URLs to JAA, so here is the list of the up websites : https://transfer.archivete.am/6hnhp/public_libraries_france_list.txt
08:47:37<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/6hnhp/public_libraries_france_list.txt
08:56:46<cruller>I think libraries' websites are difficult to crawl because of their OPAC, but experienced ABers will deal with them well.
09:02:17<Paw-chivist>You're right ! :)
09:03:04<Paw-chivist>According to you, is it useful to archive OPACs ?
09:09:43JTL quits [Ping timeout: 268 seconds]
09:10:46<cruller>OPAC pages can generally be divided into the following two types: (1) lists of books (i.e., search results) and (2) detailed bibliographic information for each book.
09:11:17<cruller>While (1) is worth archiving, (2) has a lower priority because it can be supplemented from other sources such as WorldCat.
09:11:25JTL (JTL) joins
09:15:59<cruller>If their data are distributed in a more efficient format (such as BibTeX) rather than HTML, that should also be taken into account.
09:16:00<h2ibot>User edited List of website hosts (+38, /* R */): https://wiki.archiveteam.org/?diff=61152&oldid=61114
09:16:27<Paw-chivist>Oh, okay. So it is worth of archiving ! :sparkles:
09:17:29<cruller>BTW Japan’s NDL WARP preserves some OPACs in their entirety.
09:18:01<h2ibot>User edited List of website hosts (+40, /* 0-9 */): https://wiki.archiveteam.org/?diff=61153&oldid=61152
09:18:08<cruller>Oh, everyone might forgot https://wiki.archiveteam.org/index.php/ArchiveBot/Other_libraries
09:19:01<h2ibot>User edited List of website hosts (+39, /* C */): https://wiki.archiveteam.org/?diff=61154&oldid=61153
09:19:32<Paw-chivist>So nice of Japan to have a such archiving system.
09:20:04<Paw-chivist>Yep, we talked about bot pages yesterday, it seems like nobody is checking them those times :(
09:21:01<h2ibot>John5433 uploaded File:Archivedmoe.png: https://wiki.archiveteam.org/?title=File%3AArchivedmoe.png
09:21:02<h2ibot>User edited List of website hosts (+40, /* F */): https://wiki.archiveteam.org/?diff=61156&oldid=61154
09:22:01<h2ibot>User edited List of website hosts (+39, /* F */): https://wiki.archiveteam.org/?diff=61157&oldid=61156
09:22:02<h2ibot>User edited List of website hosts (+41, /* G */): https://wiki.archiveteam.org/?diff=61158&oldid=61157
09:23:01<h2ibot>User edited List of website hosts (+47, /* G */): https://wiki.archiveteam.org/?diff=61159&oldid=61158
09:23:02<h2ibot>John5433 edited 4chan (+194, /* Archived.Moe */): https://wiki.archiveteam.org/?diff=61160&oldid=61148
09:24:02<h2ibot>User edited List of website hosts (+38, /* G */): https://wiki.archiveteam.org/?diff=61161&oldid=61159
09:24:03<h2ibot>User edited List of website hosts (+48, /* I */): https://wiki.archiveteam.org/?diff=61162&oldid=61161
09:26:02<h2ibot>User edited List of website hosts (+142): https://wiki.archiveteam.org/?diff=61163&oldid=61162
09:32:03<h2ibot>User edited List of website hosts (+1110): https://wiki.archiveteam.org/?diff=61164&oldid=61163
09:35:03<h2ibot>User edited List of website hosts (+26, /* P */): https://wiki.archiveteam.org/?diff=61165&oldid=61164
09:36:03<h2ibot>User edited List of website hosts (+25, /* P */): https://wiki.archiveteam.org/?diff=61166&oldid=61165
09:36:04<h2ibot>User edited List of website hosts (+16, /* P */): https://wiki.archiveteam.org/?diff=61167&oldid=61166
09:37:03<h2ibot>User edited List of website hosts (+25, /* 0-9 */): https://wiki.archiveteam.org/?diff=61168&oldid=61167
09:40:47Paw-chivist quits [Client Quit]
09:41:06Paw-chivist joins
09:41:34<Paw-chivist>I go AFK, thanks for helping ! <3
09:42:01Paw-chivist quits [Client Quit]
10:09:00nine quits [Ping timeout: 268 seconds]
10:10:58polypept1 quits [Remote host closed the connection]
10:11:34polypeptide (polypeptide) joins
10:18:47Webuser056608 joins
10:18:47Webuser056608 quits [Client Quit]
10:39:00Paw-chivist joins
10:39:14<Paw-chivist>I'm back :)
10:47:13<h2ibot>User edited ArchiveBot/Museums/Poland/list (+45): https://wiki.archiveteam.org/?diff=61169&oldid=49374
10:55:14<h2ibot>User edited Deathwatch (+255, /* 2025 */): https://wiki.archiveteam.org/?diff=61170&oldid=61119
10:58:30<Paw-chivist>Can someone archive https://tygodniksanocki.pl with archivebot please ? It is still up but can shut down at any time. The website is on WordPress so it coult be easy for the bot. Thanks in advance ! <3
11:00:02Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat]
11:02:44Bleo1826007227196234552220110 joins
11:25:26<@arkiver>imer: we got a little bit of an emergency project coming up, can we have a target for wikimediaetherpad? with
11:25:33<@imer>ack
11:25:34<@arkiver>Archive Team Wikimedia Etherpad:
11:25:43<@arkiver>archiveteam_wikimediaetherpad_
11:25:49<@arkiver>wikimediaetherpad_
11:27:37<@imer>arkiver: target's up
11:29:44<@arkiver>imer: thank you :)
11:29:49<@arkiver>this is starting asap, deadline today
11:29:55<@arkiver>will not be much data
11:34:10<Paw-chivist>etherpad table size is 233GB btw
11:39:19<Paw-chivist>tracker seems empty and target is not showing on my warrior, is it normal ?
11:41:38<@imer>Paw-chivist: project code likely isn't ready yet, stand by :)
11:42:07<Paw-chivist>Oh, Okay, sorry. :(
11:42:20<Paw-chivist>I'm creating the wiki page for this project :3
11:46:05nine joins
11:47:21<h2ibot>User created Wikimedia Etherpad (+594, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?oldid=61171
11:48:21<h2ibot>User edited Wikimedia Etherpad (-10, /* References */): https://wiki.archiveteam.org/?diff=61172&oldid=61171
11:49:21<h2ibot>User edited Wikimedia Etherpad (+14, /* References */): https://wiki.archiveteam.org/?diff=61173&oldid=61172
11:49:28<Paw-chivist>Here it is ^ :)
11:52:49nine quits [Client Quit]
11:53:01nine joins
11:58:19<@imer>thanks!
12:00:53annie (annie) joins
12:02:28<Paw-chivist><3
12:02:59<annie>Hiya :)
12:03:49<Paw-chivist>Hello ! :)
13:04:40bladem quits [Ping timeout: 268 seconds]
13:12:16<klea>OPAC is a standardized term? (TIL)
13:26:47nine quits [Client Quit]
13:26:59nine joins
13:28:02goecho quits [Quit: Ping timeout (120 seconds)]
13:28:12goecho (goecho) joins
13:41:16<cruller>Probably the de facto standard. Sites that aren't just catalogs are often called that too.
14:04:29Arcorann quits [Ping timeout: 268 seconds]
14:10:05<justauser>Where did you get the deadline of today? https://etherpad.wikimedia.org/ mentions end of May.
14:15:41Nekroschizofrenetyk joins
14:19:45Nekroschizofrenetyk quits [Client Quit]
14:19:53<klea>> If there are no further comments I intend to perform the above change on April 30 when the etherpad instance is wiped. * Pppery * it has begun 01:00, 20 March 2026 (UTC)
14:20:09<klea>https://meta.wikimedia.org/wiki/Talk:Interwiki_map?useskin=vector#Etherpad:~:text=April%2030%20when%20the%20etherpad%20instance%20is%20wiped
14:33:12<@arkiver>wikimediaetherpad project starting shortly
14:33:26<@arkiver>imer: if you happen to be around, can you poke drone please?
14:33:31<@arkiver>on the -grab repo
14:38:15klea wonders what's the ETA to migrate to WoodPecker.
14:46:35Webuser673323 joins
14:46:39Webuser673323 quits [Client Quit]
14:49:01Island joins
14:52:24<justauser>Still can't see it in projects.json.
14:55:27<@imer>arkiver: did that earlier :(
14:55:30<@imer>:)*
14:56:50<@arkiver>thank you
14:56:52<@arkiver>pushing the code
14:56:53<@arkiver>items are up
14:58:06<@arkiver>it's up
15:04:14<@imer>arkiver++
15:04:15<eggdrop>[karma] 'arkiver' now has 102 karma!
15:04:25<@imer>we may have overwhelmed the site already
15:04:29<@arkiver>yeah :/
15:04:45<@imer>oh there it goes
15:04:50<@arkiver>justauser: someone on the inside, decided to start it anyway
15:05:27<justauser>8K is a suprisingly small number for 200GB DB...
15:15:58<h2ibot>Qazwsxplm edited Formspring (+27, /* Later developments */): https://wiki.archiveteam.org/?diff=61174&oldid=59524
15:15:59<h2ibot>Qazwsxplm edited Template:Stub (+12): https://wiki.archiveteam.org/?diff=61175&oldid=9869
15:16:00<h2ibot>Qazwsxplm edited .hu domains seed (+10, /* Justification */): https://wiki.archiveteam.org/?diff=61176&oldid=60230
15:17:58<h2ibot>Arkiver uploaded File:Wikimedia-etherpad-icon.png: https://wiki.archiveteam.org/?title=File%3AWikimedia-etherpad-icon.png
15:17:59<h2ibot>Arkiver uploaded File:Etherpad.wikimedia.org-screenshot.png: https://wiki.archiveteam.org/?title=File%3AEtherpad.wikimedia.org-screenshot.png
15:18:25<@arkiver>Paw-chivist: maybe those two can be added to the wiki page? ^
15:19:10<Paw-chivist>I'll do that, thanks <3
15:19:38<@arkiver>thank you :)
15:21:05<justauser>How did you enumerate the items?
15:21:28<Paw-chivist>Done
15:21:50<@arkiver>justauser: it's from a list, not really enumerated
15:21:59<h2ibot>User edited Wikimedia Etherpad (+85): https://wiki.archiveteam.org/?diff=61179&oldid=61173
15:23:59<justauser>Running mwlinkscrape in case it finds something new. Still not a complete list of Wikimedia projects on the page...
15:24:53<@arkiver>justauser: thank you! let me know if you have something
15:24:58<Paw-chivist>Why do we rate limit so much ?
15:25:17<@arkiver>Paw-chivist: because their server seems to fall over as soon as you blow a little wind against it
15:25:38<@arkiver>(but, seriously, i think the etherpad exports may be heavy on it, causing it to fall over)
15:25:53<Paw-chivist>Oh, okay :/
15:26:01<Paw-chivist>In my mind Wikimedia servers were massive
15:26:42<Paw-chivist>(I use massive things on PAWS and there never were a single problem)
15:33:32<justauser>https://transfer.archivete.am/Bgzyc/etherpad.wikimedia.org_cdx.txt
15:33:32<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/Bgzyc/etherpad.wikimedia.org_cdx.txt
15:34:11<justauser>CDX also has some pad names as parts of socket.io URLs - they are probably redundant but I kept them.
15:34:21<justauser>https://transfer.archivete.am/iXvAH/etherpad.wikimedia.org_mwlinkscrape.txt
15:34:21<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/iXvAH/etherpad.wikimedia.org_mwlinkscrape.txt
15:40:01<h2ibot>Justauser edited Wikimedia Etherpad (+176, Infobox filled): https://wiki.archiveteam.org/?diff=61180&oldid=61179
15:48:59moth3 joins
15:50:04Paw-chivist quits [Quit: Leaving]
15:52:04<moth3>Regarding Sora videos, this user was doing some really interesting short animations with it, many of them with over 1k likes. https://sora.chatgpt.com/profile/keigo_matsumaru
15:55:37<moth3>I saw there was talk of archiving stuff with 1k+ likes, but I'm not sure if that's materialized in anything yet?
16:03:36Webuser687386 joins
16:03:59Webuser687386 quits [Client Quit]
16:14:02<justauser>Not yet I believe.
16:17:04<Juest>hey, anyone aware of vocaroo updated expiry policy that's going to come into effect soon? would it be worth compensating their storage loss due to costs by archiving?
16:17:36<Juest>oh sorry, it was posted on feb 21 and i guess its starting to be in effect since its a few months later now
16:19:33<Juest>i dont remember who was interested in vocaroo as a side project
16:20:49fluke joins
16:26:48<klea>Might be neat to try to get Archiefweb.eu to upload their data to IA if they aren't doing that yet, and possibly make a web page about them, like we have for Arquivo.pt.
16:30:04<justauser>TIL
16:33:06<moth3>I'll probably at least archive the videos from that user for myself, but I imagine a personal archive, not done via the warrior, of a single user's videos, probably isn't very useful in the grand scheme of things.
16:34:01<justauser>They have a reference to the WARC spec in the page source, but it's commented out.
16:34:44<justauser>https://archief06.archiefweb.eu/archives/archiefweb/ is this pywb?
17:00:32<justauser>Looks like Wikimedia Etherpad should have been an AB job. Unless they have bad IP ratelimits?
17:11:17Webuser016300 joins
17:26:50ducky quits [Ping timeout: 268 seconds]
17:43:14ducky (ducky) joins
17:44:01Wohlstand (Wohlstand) joins
18:19:47UwU quits [Ping timeout: 268 seconds]
18:22:04UwU joins
18:26:26<IDK>wikipedia etherpad really hates it when I do the export thingy
18:26:31<IDK>constant 0s
18:37:57<justauser>I wonder how bad would it be to run an AB job against https://stuff.mit.edu/afs/ .
18:46:25Cuphead2527480 (Cuphead2527480) joins
18:46:55<@JAA>Exports are pretty resource-intensive on the server side. That's true for Etherpad instances in general.
18:47:14<@JAA>And yeah, probably could've been done with AB.
18:47:44<@JAA>Doesn't the actual pad loading happen with a WebSocket, which we can't archive anyway?