00:06:51AFly quits [Quit: 🪰]
00:07:06AFly (AFly) joins
00:15:29Arcorann__ (Arcorann) joins
00:15:59etnguyen03 (etnguyen03) joins
00:34:21Webuser573334 joins
00:34:38Webuser573334 quits [Client Quit]
00:37:34SootBector quits [Remote host closed the connection]
00:38:42SootBector (SootBector) joins
00:39:04etnguyen03 quits [Client Quit]
00:41:54hackbug quits [Remote host closed the connection]
00:42:07Goofybally quits [Killed (NickServ (GHOST command used by Goofybally2!~Goofyball@141.179.9.229))]
00:42:12Goofybally joins
00:45:10HermanToothrot quits [Ping timeout: 268 seconds]
00:47:11hackbug joins
00:51:25etnguyen03 (etnguyen03) joins
01:02:54HermanToothrot joins
01:16:37HermanToothrot quits [Ping timeout: 268 seconds]
01:30:29HermanToothrot joins
01:44:22HermanToothrot quits [Ping timeout: 268 seconds]
01:52:08chrismeller3 quits [Quit: chrismeller3]
01:52:28chrismeller3 (chrismeller) joins
01:58:53HermanToothrot joins
02:15:49HermanToothrot quits [Ping timeout: 268 seconds]
02:15:53cyanbox joins
02:17:14<klea>uhh
02:17:19<klea>https://about.gitlab.com/blog/gitlab-act-2/ https://mastodon.me.uk/@pikesley/116560335270718717
02:28:36notarobot quits [Quit: The Lounge - https://thelounge.chat]
02:28:57notarobot joins
02:31:09HermanToothrot joins
02:38:41HermanToothrot quits [Ping timeout: 268 seconds]
02:40:15Teraunce joins
02:40:52<Teraunce>Hey, is there any way to use Archivebot to search for a specific file in the archives?
02:44:29HermanToothrot joins
02:44:39<Teraunce>I've been trying to pull up a PlanetElderScrolls page on the Wayback Machine to find the filename to search for it?
02:49:14<Teraunce>nevermind. I found it by doing very unintuitive searches on a search website because the archive it was searching did not have the author's name listed.
02:49:38<Teraunce>have a goodnight.
02:49:45Teraunce quits [Client Quit]
02:52:12nothere quits [Ping timeout: 268 seconds]
02:52:12ivan quits [Ping timeout: 268 seconds]
02:55:17HermanToothrot quits [Ping timeout: 268 seconds]
02:59:46ivan joins
03:03:31nothere_ joins
03:09:21HermanToothrot joins
03:24:06Webuser830993 joins
03:24:39Webuser830993 quits [Client Quit]
03:26:10HermanToothrot quits [Ping timeout: 268 seconds]
03:30:16etnguyen03 quits [Remote host closed the connection]
03:40:46HermanToothrot joins
03:57:34HermanToothrot quits [Ping timeout: 268 seconds]
04:16:02ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
04:16:52ThetaDev joins
04:29:40HermanToothrot joins
04:32:38DogsRNice quits [Read error: Connection reset by peer]
04:49:22HermanToothrot quits [Ping timeout: 268 seconds]
05:03:42<h2ibot>Himond000 edited Deathwatch (+338, /* 2026-07 */ add OPENREC.tv): https://wiki.archiveteam.org/?diff=61375&oldid=61324
05:04:50StarletCharlotte joins
05:04:51<eggdrop>[tell] StarletCharlotte: [2026-02-12T17:53:19Z] <JAA> Dots in IA item names are perfectly fine; I use them all the time. And there's a script in little-things for metadata as well. Feel free to ask in #internetarchive if you have more questions.
05:04:52<eggdrop>[tell] StarletCharlotte: [2026-03-19T17:54:27Z] <c3manu> i recently queued https://horizon.meta.com/, unaware of the upcoming announcement. i can't guarantee for its completeness though, since it was adding some junk parameter which made it fetch the same picture files over and over again, so i eventually ignored it.
05:05:38<StarletCharlotte>parallaxstellar I'm here now.
05:05:53<parallaxstellar>ah, it looks like you havent connected in a while. sorry for baiting you with banter
05:06:01<StarletCharlotte>it's fine, dw
05:07:52<parallaxstellar>StarletCharlotte: uh sorry for being intrusive. is that your reddit username?
05:08:10<StarletCharlotte>No, I don't use this name elsewhere.
05:09:16<parallaxstellar>ditto
05:10:08<StarletCharlotte>JAA: Is there a good place for proactive archiving of YouTube channels and metadata? Or is that just... Not an option?
05:10:11DigitalDragons quits [Quit: Leaving]
05:10:11Exorcism quits [Quit: Bye!]
05:10:33<parallaxstellar>you could lie about the reason /s
05:11:51<StarletCharlotte>No.
05:12:06HermanToothrot joins
05:12:12<parallaxstellar>you are a well mannered person then
05:13:18<@JAA>#down-the-tube is the channel for YouTube archival, so I answered there.
05:14:28nexussfan quits [Remote host closed the connection]
05:14:49<parallaxstellar>JAA: im sorry for acting delirious but... do you wish I was dead?
05:15:21DigitalDragons (DigitalDragons) joins
05:25:13DopefishJustin quits [Remote host closed the connection]
05:31:55HermanToothrot quits [Ping timeout: 268 seconds]
05:32:54<exorcism|m>wha
05:34:40<parallaxstellar>exorcism|m back off
05:34:49DopefishJustin (DopefishJustin) joins
05:34:51<parallaxstellar>go to twitter or something
05:35:15<parallaxstellar>im clearly depressed and sometimes i think people wish i was dead
05:35:21<parallaxstellar>so ya
05:35:25<parallaxstellar>i should get some sleep
05:35:51<exorcism|m>huh
05:38:54<@JAA>That sounds like a good idea, yeah.
05:41:03<parallaxstellar>+OK t15Ua1kjF.j1jYY1e/CcAjW/aqsfT0d9iet1kXRet0S2SOV1W8D8p.ugVlq.sJwK9.vdp2e.WQQnb1TIBQe/5OMgj1DsJY9.1flsE/r8BFg1vsPWa.jSwGg1
05:45:49HermanToothrot joins
05:52:26Island quits [Read error: Connection reset by peer]
05:53:03Exorcism (exorcism) joins
05:58:33<parallaxstellar>+OK ygomg.y/BYm/sGMPg13BnUF/GgMOC0HCKcO00Z2GH.WkwwB0bxBJN.CK4MF/i41.H.CvXD10tnbcQ.SA82v0J/e4R0faX.z.zr9Ey/q4Fm7/Ufnz6/zszaV0UuN64/QaEAS.nLdO6/LQA0s.5lvp8/gbHjP/ibG0j.0Vrgc1zozs8/6nknY101g9S.xEZkq.cHdKe/52RnG.YsVEV0WTxL11sAC2Z1f3zDX/ecI5Y1WywiL03Y2pV.qimUw0Y8X6r/z.Le50h/gck.ATDsA0cY7yC0Raar21p28Yw.PNX4h09SuYN.a04wg1
05:58:33<parallaxstellar>+OK A1l4Z1cStjh/5mp9N12ZQGb.kRWE6.BdmxV/KvsRp0WdOc/0
05:58:43<parallaxstellar>+OK dcQ4k.1z16b04TBn..LmfZ40T4SuO01yvJW/5A.Lx/9BbJg/Q/MVJ.VEWVy0h2l7901zw0L1RNEvF/xJeYC1ko5OF0HhrbB1NR9zV1UbW5p1tQH/T.4xtA10
05:58:49<parallaxstellar>+OK 8LjRP1Ucexh.Ut0Yf.fFEJo.Y8ap717HWlX1
06:02:45HermanToothrot quits [Ping timeout: 268 seconds]
06:17:52<h2ibot>PaulWise edited Medium (+113, RIP scribe.rip/Freedium, add LibMedium): https://wiki.archiveteam.org/?diff=61376&oldid=60513
06:18:52<h2ibot>PaulWise edited Medium (+49, add scribe code): https://wiki.archiveteam.org/?diff=61377&oldid=61376
06:20:52<h2ibot>PaulWise edited Medium (+41, Freedium code): https://wiki.archiveteam.org/?diff=61378&oldid=61377
06:22:52<h2ibot>PaulWise edited Medium (+2, better freedium link): https://wiki.archiveteam.org/?diff=61379&oldid=61378
06:31:41<pabs>do we have something that can archive https://freeclassicaudiobooks.com/ https://www.freeclassicaudiobooks.com/ and dedup the files between them? there is interlinking and I don't want to dupe in AB
06:32:37HermanToothrot joins
06:36:01<pokechu22>You can !a < list them together, but if the same file is hosted on both sites it'd still get duplicated
06:58:15HermanToothrot quits [Ping timeout: 268 seconds]
07:01:58Cupping1285 joins
07:02:08midou quits [Remote host closed the connection]
07:04:33midou joins
07:14:24HermanToothrot joins
07:16:56<pabs>yeah, the files are on both
07:24:11<h2ibot>Cruller edited Obstacles (+1005, /* Anti-bot & related */ Converted the list of…): https://wiki.archiveteam.org/?diff=61380&oldid=61092
07:27:11<h2ibot>Cruller edited Obstacles (+59, /* List of softwares */ Added Usage Example for…): https://wiki.archiveteam.org/?diff=61381&oldid=61380
07:29:12<h2ibot>PaulWise edited Obstacles (+145, add Prot.li Shieldwall, Centinel Analytica): https://wiki.archiveteam.org/?diff=61382&oldid=61381
07:33:12<h2ibot>Cruller edited Obstacles (+105, /* List of softwares */ Added Centinel…): https://wiki.archiveteam.org/?diff=61383&oldid=61382
07:33:49<cruller>oops
07:34:04HermanToothrot quits [Ping timeout: 268 seconds]
07:34:12<h2ibot>PaulWise edited Obstacles (-79, dedup): https://wiki.archiveteam.org/?diff=61384&oldid=61383
07:35:08<cruller>pabs++
07:35:09<eggdrop>[karma] 'pabs' now has 185 karma!
07:36:13<h2ibot>PaulWise edited Obstacles (-308, dedup links): https://wiki.archiveteam.org/?diff=61385&oldid=61384
07:40:13<h2ibot>PaulWise edited Obstacles (+11, move hugin to the table): https://wiki.archiveteam.org/?diff=61386&oldid=61385
07:40:14<h2ibot>PaulWise edited Obstacles (+1, typo): https://wiki.archiveteam.org/?diff=61387&oldid=61386
07:51:15<h2ibot>PaulWise edited Obstacles (+67, Pyison poisoner): https://wiki.archiveteam.org/?diff=61388&oldid=61387
07:51:45<pabs>arkiver: another poisoner https://github.com/JonasLong/Pyison
07:53:03lossless (lossless) joins
07:56:32<cruller>RE: https://irclogs.archivete.am/archiveteam-bs/2026-05-14#l5d4f66d7 It appears that ScienceDirect uses Cloudflare solutions, but I’m not sure exactly which ones. They’re probably using a combination of several, and they might even be using some that are quite expensive and rare.
07:58:55<cruller>It seems Zotero is also struggling with that: https://forums.zotero.org/discussion/109940/zotero-is-not-saving-pdfs-from-sciencedirect/p3
08:00:06HermanToothrot joins
08:02:31<cruller>IIRC the Chrome extension for Zotero saves webpages as SingleFile rather than WARC, and this CAPTCHA might be one of the reasons why. (Though the main reason is probably convenience.)
08:06:32<cruller>(However, I found it a bit inconvenient that when I used Chrome’s built-in translation feature and then clicked “Save as Single File,” only the translated page was saved.)
08:14:18<h2ibot>Cruller edited Obstacles (+773, /* List of softwares */ Replaced “Cloudflare”…): https://wiki.archiveteam.org/?diff=61389&oldid=61388
08:19:39HermanToothrot quits [Ping timeout: 268 seconds]
08:28:57McAfee leaves [Disconnected: Replaced by new connection]
08:29:03McAfee joins
08:30:20<h2ibot>Cruller edited List of websites excluded from the Wayback Machine (+110, Added https://www.libero.it/ — Thx…): https://wiki.archiveteam.org/?diff=61390&oldid=61373
08:34:08HermanToothrot joins
08:39:03Wohlstand (Wohlstand) joins
08:45:23<h2ibot>KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites and/or updated count.): https://wiki.archiveteam.org/?diff=61391&oldid=61390
08:48:41TheEnbyperor quits [Ping timeout: 268 seconds]
08:49:22TheEnbyperor_ is now known as TheEnbyperor
08:51:06HermanToothrot quits [Ping timeout: 268 seconds]
08:57:46TheEnbyperor_ joins
09:01:05<klea>Oh, woah, people have worked on that Obstacles page a lot.
09:02:41<klea>Time to develop a packet dump → WARC tool? You'd also need the fun encryption secrets, but IIRC those can be obtained with ${SSLKEYLOGFILE}.
09:17:21<pabs>hmm, I wonder if wireshark can open WARC yet
09:22:05HermanToothrot joins
09:33:29<h2ibot>Cruller edited Obstacles (+78, /* List of softwares */ Added Official Site and…): https://wiki.archiveteam.org/?diff=61392&oldid=61389
09:34:00<cruller>BTW https://builtwith.com/ is great
09:39:15<klea>Uhh
09:39:52<klea>https://developers.google.com/recaptcha/ claims it's deprecated, and tells me to look at https://docs.cloud.google.com/recaptcha/docs which I suppose does more things since it says "Google Cloud Fraud Defense".
09:42:30<h2ibot>Cruller edited Discourse (+110, Added links to https://builtwith.com/?discourse): https://wiki.archiveteam.org/?diff=61393&oldid=61067
09:42:31<pabs>the latter is a new thing, where you have to own a Google/Apple phone to bypass the protection
09:43:26<klea>https://en.wikipedia.org/wiki/ReCAPTCHA?useskin=vector links to the latter.
09:43:31<h2ibot>Klea edited Obstacles (+276, Added Official Site for reCaptcha and usage…): https://wiki.archiveteam.org/?diff=61394&oldid=61392
09:43:34<klea>uhh, not to either.
09:44:31<h2ibot>Klea edited Obstacles (-12, /* List of softwares */ Link to the legacy…): https://wiki.archiveteam.org/?diff=61395&oldid=61394
09:45:42<klea>cruller: I don't exactly like it requiring js.
09:48:11<cruller>klea: I agree with you on that point.
09:48:25<cruller>Over the past few days, there’s been a lot of talk about the reCAPTCHA update
09:48:28<klea>And also it doesn't give their full list, which is a bit misleading.
09:53:11<cruller>Despite the message "Download Full Lead List — Create a Free Account to see more results."? That's too bad.
09:53:33<klea>I mean, not WARCable!
09:54:10<klea>Funs: https://cloud.google.com/recaptcha-enterprise → https://cloud.google.com/security/products/recaptcha
10:03:52phuzion quits [Ping timeout: 268 seconds]
10:04:39<klea>cruller: Do you think making a Template:builtwith that returns something like [https://trends.builtwith.com/websitelist/{{{{1|}} builtwith:{{{1|}}] is something useful or probably just annoying?
10:06:57TastyWiener952 (TastyWiener95) joins
10:07:34TastyWiener95 quits [Ping timeout: 268 seconds]
10:07:34TastyWiener952 is now known as TastyWiener95
10:08:14Wohlstand quits [Client Quit]
10:09:34<h2ibot>Klea edited Obstacles (+533, /* List of softwares */ Split reCAPTCHA versions): https://wiki.archiveteam.org/?diff=61397&oldid=61395
10:10:35<h2ibot>Klea edited Obstacles (-104, Remove extra (builtwith website lists) text,…): https://wiki.archiveteam.org/?diff=61398&oldid=61397
10:11:35<h2ibot>Klea edited Obstacles (+23, /* List of softwares */ Make table smaller by…): https://wiki.archiveteam.org/?diff=61399&oldid=61398
10:16:48<cruller>I have no idea, but I don't think it will be a template that's used very often.
10:24:53phuzion (phuzion) joins
10:27:37<h2ibot>Klea edited Vine (+10, Project data available only via WBM, since IA…): https://wiki.archiveteam.org/?diff=61400&oldid=59805
10:29:09Juest quits [Ping timeout: 268 seconds]
10:33:02<klea>I should make a list of all the AT collections that got ARId, and have KleaBot modify the wiki adding the padlock in the [[Template:IA id]] invocations that don't yet have it.
10:36:39<cruller>The use cases for Template:builtwith would likely be archival targets (e.g., Discourse) and obstacles?
10:37:16<klea>I think it might be better to just link it, and also download their full lists and put them on the wiki too.
10:38:57<klea>grep -Po '\{\{IA (id|file|item|collection)\|\K[^}]*\}\}' archiveteam-dump-2026-05-16-nohistory-nofiles.xml|sed 's,}}$,,' |sed 's,^1=,,' >collections_with_tags.txt; sed 's,|.*,,' collections_with_tags.txt|sort|uniq >collections.txt
10:39:07<klea>Now I have to probe at IA 1588 times.
10:40:56Juest (Juest) joins
10:44:12<cruller>If you made their full lists available for anyone, they would be upset :P
10:46:24<klea>Uhh, they publish the data already. :>
10:49:02<cruller>Yeah, the area behind the free login wall is indeed "public"
10:50:20<klea>cruller: Here's a fun list of IA collections: :P https://wiki.archiveteam.org/index.php/Internet_Archive/Collections
10:51:59<klea>JAA: I've noticed that https://internetarchive.archiveteam.org/dumps/ doesn't exist.
10:53:54<klea>WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
10:54:13<klea>I am sending it here rather than on EFNet's #archiveteam channel for reasons.
10:56:17<klea>JAA: Can you tell it to me, you know the secret word!
10:56:28Wohlstand (Wohlstand) joins
10:59:11<klea>My quest is trying to update the wiki somehow, and also setting up a KleaBot there too, for if I need to do mass-edits.
10:59:13<cruller>Oh it seems I was completely mistaken lol
11:00:12Bleo18260072271962345522201107 quits [Quit: The Lounge - https://thelounge.chat]
11:00:18<klea>I guess I can try the fileformats secret word.
11:02:59Bleo18260072271962345522201107 joins
11:03:04HermanToothrot quits [Ping timeout: 268 seconds]
11:03:17@imer quits [Quit: Oh no]
11:04:09imer (imer) joins
11:04:09@ChanServ sets mode: +o imer
11:06:25<klea>JAA: Disregard, I have gotten enlightened.
11:10:03<klea>JAA: May you change the message to ask people to come to HackINT's #archiveteam-bs instead of to EFNet's #archiveteam, possibly change the captcha word, and give bot rights to User:KleaBot?
11:11:43<klea>Uhh
11:11:52<klea>Well, first register I guess.
11:11:57Wohlstand quits [Client Quit]
11:15:43<h2ibot>Cruller edited Deathwatch (+161, /* 2026-06 */ Add Holoearth): https://wiki.archiveteam.org/?diff=61401&oldid=61375
11:18:27HermanToothrot joins
11:19:38<klea>JAA: May you update the ArchiveTeam wiki (wiki.archiveteam.org) to a MediaWiki version that supports 2FA?