| 00:06:51 | | AFly quits [Quit: 🪰] |
| 00:07:06 | | AFly (AFly) joins |
| 00:15:29 | | Arcorann__ (Arcorann) joins |
| 00:15:59 | | etnguyen03 (etnguyen03) joins |
| 00:34:21 | | Webuser573334 joins |
| 00:34:38 | | Webuser573334 quits [Client Quit] |
| 00:37:34 | | SootBector quits [Remote host closed the connection] |
| 00:38:42 | | SootBector (SootBector) joins |
| 00:39:04 | | etnguyen03 quits [Client Quit] |
| 00:41:54 | | hackbug quits [Remote host closed the connection] |
| 00:42:07 | | Goofybally quits [Killed (NickServ (GHOST command used by Goofybally2!~Goofyball@141.179.9.229))] |
| 00:42:12 | | Goofybally joins |
| 00:45:10 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 00:47:11 | | hackbug joins |
| 00:51:25 | | etnguyen03 (etnguyen03) joins |
| 01:02:54 | | HermanToothrot joins |
| 01:16:37 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 01:30:29 | | HermanToothrot joins |
| 01:44:22 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 01:52:08 | | chrismeller3 quits [Quit: chrismeller3] |
| 01:52:28 | | chrismeller3 (chrismeller) joins |
| 01:58:53 | | HermanToothrot joins |
| 02:15:49 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 02:15:53 | | cyanbox joins |
| 02:17:14 | <klea> | uhh |
| 02:17:19 | <klea> | https://about.gitlab.com/blog/gitlab-act-2/ https://mastodon.me.uk/@pikesley/116560335270718717 |
| 02:28:36 | | notarobot quits [Quit: The Lounge - https://thelounge.chat] |
| 02:28:57 | | notarobot joins |
| 02:31:09 | | HermanToothrot joins |
| 02:38:41 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 02:40:15 | | Teraunce joins |
| 02:40:52 | <Teraunce> | Hey, is there any way to use Archivebot to search for a specific file in the archives? |
| 02:44:29 | | HermanToothrot joins |
| 02:44:39 | <Teraunce> | I've been trying to pull up a PlanetElderScrolls page on the Wayback Machine to find the filename to search for it? |
| 02:49:14 | <Teraunce> | nevermind. I found it by doing very unintuitive searches on a search website because the archive it was searching did not have the author's name listed. |
| 02:49:38 | <Teraunce> | have a goodnight. |
| 02:49:45 | | Teraunce quits [Client Quit] |
| 02:52:12 | | nothere quits [Ping timeout: 268 seconds] |
| 02:52:12 | | ivan quits [Ping timeout: 268 seconds] |
| 02:55:17 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 02:59:46 | | ivan joins |
| 03:03:31 | | nothere_ joins |
| 03:09:21 | | HermanToothrot joins |
| 03:24:06 | | Webuser830993 joins |
| 03:24:39 | | Webuser830993 quits [Client Quit] |
| 03:26:10 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 03:30:16 | | etnguyen03 quits [Remote host closed the connection] |
| 03:40:46 | | HermanToothrot joins |
| 03:57:34 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 04:16:02 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 04:16:52 | | ThetaDev joins |
| 04:29:40 | | HermanToothrot joins |
| 04:32:38 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:49:22 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 05:03:42 | <h2ibot> | Himond000 edited Deathwatch (+338, /* 2026-07 */ add OPENREC.tv): https://wiki.archiveteam.org/?diff=61375&oldid=61324 |
| 05:04:50 | | StarletCharlotte joins |
| 05:04:51 | <eggdrop> | [tell] StarletCharlotte: [2026-02-12T17:53:19Z] <JAA> Dots in IA item names are perfectly fine; I use them all the time. And there's a script in little-things for metadata as well. Feel free to ask in #internetarchive if you have more questions. |
| 05:04:52 | <eggdrop> | [tell] StarletCharlotte: [2026-03-19T17:54:27Z] <c3manu> i recently queued https://horizon.meta.com/, unaware of the upcoming announcement. i can't guarantee for its completeness though, since it was adding some junk parameter which made it fetch the same picture files over and over again, so i eventually ignored it. |
| 05:05:38 | <StarletCharlotte> | parallaxstellar I'm here now. |
| 05:05:53 | <parallaxstellar> | ah, it looks like you havent connected in a while. sorry for baiting you with banter |
| 05:06:01 | <StarletCharlotte> | it's fine, dw |
| 05:07:52 | <parallaxstellar> | StarletCharlotte: uh sorry for being intrusive. is that your reddit username? |
| 05:08:10 | <StarletCharlotte> | No, I don't use this name elsewhere. |
| 05:09:16 | <parallaxstellar> | ditto |
| 05:10:08 | <StarletCharlotte> | JAA: Is there a good place for proactive archiving of YouTube channels and metadata? Or is that just... Not an option? |
| 05:10:11 | | DigitalDragons quits [Quit: Leaving] |
| 05:10:11 | | Exorcism quits [Quit: Bye!] |
| 05:10:33 | <parallaxstellar> | you could lie about the reason /s |
| 05:11:51 | <StarletCharlotte> | No. |
| 05:12:06 | | HermanToothrot joins |
| 05:12:12 | <parallaxstellar> | you are a well mannered person then |
| 05:13:18 | <@JAA> | #down-the-tube is the channel for YouTube archival, so I answered there. |
| 05:14:28 | | nexussfan quits [Remote host closed the connection] |
| 05:14:49 | <parallaxstellar> | JAA: im sorry for acting delirious but... do you wish I was dead? |
| 05:15:21 | | DigitalDragons (DigitalDragons) joins |
| 05:25:13 | | DopefishJustin quits [Remote host closed the connection] |
| 05:31:55 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 05:32:54 | <exorcism|m> | wha |
| 05:34:40 | <parallaxstellar> | exorcism|m back off |
| 05:34:49 | | DopefishJustin (DopefishJustin) joins |
| 05:34:51 | <parallaxstellar> | go to twitter or something |
| 05:35:15 | <parallaxstellar> | im clearly depressed and sometimes i think people wish i was dead |
| 05:35:21 | <parallaxstellar> | so ya |
| 05:35:25 | <parallaxstellar> | i should get some sleep |
| 05:35:51 | <exorcism|m> | huh |
| 05:38:54 | <@JAA> | That sounds like a good idea, yeah. |
| 05:41:03 | <parallaxstellar> | +OK t15Ua1kjF.j1jYY1e/CcAjW/aqsfT0d9iet1kXRet0S2SOV1W8D8p.ugVlq.sJwK9.vdp2e.WQQnb1TIBQe/5OMgj1DsJY9.1flsE/r8BFg1vsPWa.jSwGg1 |
| 05:45:49 | | HermanToothrot joins |
| 05:52:26 | | Island quits [Read error: Connection reset by peer] |
| 05:53:03 | | Exorcism (exorcism) joins |
| 05:58:33 | <parallaxstellar> | +OK ygomg.y/BYm/sGMPg13BnUF/GgMOC0HCKcO00Z2GH.WkwwB0bxBJN.CK4MF/i41.H.CvXD10tnbcQ.SA82v0J/e4R0faX.z.zr9Ey/q4Fm7/Ufnz6/zszaV0UuN64/QaEAS.nLdO6/LQA0s.5lvp8/gbHjP/ibG0j.0Vrgc1zozs8/6nknY101g9S.xEZkq.cHdKe/52RnG.YsVEV0WTxL11sAC2Z1f3zDX/ecI5Y1WywiL03Y2pV.qimUw0Y8X6r/z.Le50h/gck.ATDsA0cY7yC0Raar21p28Yw.PNX4h09SuYN.a04wg1 |
| 05:58:33 | <parallaxstellar> | +OK A1l4Z1cStjh/5mp9N12ZQGb.kRWE6.BdmxV/KvsRp0WdOc/0 |
| 05:58:43 | <parallaxstellar> | +OK dcQ4k.1z16b04TBn..LmfZ40T4SuO01yvJW/5A.Lx/9BbJg/Q/MVJ.VEWVy0h2l7901zw0L1RNEvF/xJeYC1ko5OF0HhrbB1NR9zV1UbW5p1tQH/T.4xtA10 |
| 05:58:49 | <parallaxstellar> | +OK 8LjRP1Ucexh.Ut0Yf.fFEJo.Y8ap717HWlX1 |
| 06:02:45 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 06:17:52 | <h2ibot> | PaulWise edited Medium (+113, RIP scribe.rip/Freedium, add LibMedium): https://wiki.archiveteam.org/?diff=61376&oldid=60513 |
| 06:18:52 | <h2ibot> | PaulWise edited Medium (+49, add scribe code): https://wiki.archiveteam.org/?diff=61377&oldid=61376 |
| 06:20:52 | <h2ibot> | PaulWise edited Medium (+41, Freedium code): https://wiki.archiveteam.org/?diff=61378&oldid=61377 |
| 06:22:52 | <h2ibot> | PaulWise edited Medium (+2, better freedium link): https://wiki.archiveteam.org/?diff=61379&oldid=61378 |
| 06:31:41 | <pabs> | do we have something that can archive https://freeclassicaudiobooks.com/ https://www.freeclassicaudiobooks.com/ and dedup the files between them? there is interlinking and I don't want to dupe in AB |
| 06:32:37 | | HermanToothrot joins |
| 06:36:01 | <pokechu22> | You can !a < list them together, but if the same file is hosted on both sites it'd still get duplicated |
| 06:58:15 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 07:01:58 | | Cupping1285 joins |
| 07:02:08 | | midou quits [Remote host closed the connection] |
| 07:04:33 | | midou joins |
| 07:14:24 | | HermanToothrot joins |
| 07:16:56 | <pabs> | yeah, the files are on both |
| 07:24:11 | <h2ibot> | Cruller edited Obstacles (+1005, /* Anti-bot & related */ Converted the list of…): https://wiki.archiveteam.org/?diff=61380&oldid=61092 |
| 07:27:11 | <h2ibot> | Cruller edited Obstacles (+59, /* List of softwares */ Added Usage Example for…): https://wiki.archiveteam.org/?diff=61381&oldid=61380 |
| 07:29:12 | <h2ibot> | PaulWise edited Obstacles (+145, add Prot.li Shieldwall, Centinel Analytica): https://wiki.archiveteam.org/?diff=61382&oldid=61381 |
| 07:33:12 | <h2ibot> | Cruller edited Obstacles (+105, /* List of softwares */ Added Centinel…): https://wiki.archiveteam.org/?diff=61383&oldid=61382 |
| 07:33:49 | <cruller> | oops |
| 07:34:04 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 07:34:12 | <h2ibot> | PaulWise edited Obstacles (-79, dedup): https://wiki.archiveteam.org/?diff=61384&oldid=61383 |
| 07:35:08 | <cruller> | pabs++ |
| 07:35:09 | <eggdrop> | [karma] 'pabs' now has 185 karma! |
| 07:36:13 | <h2ibot> | PaulWise edited Obstacles (-308, dedup links): https://wiki.archiveteam.org/?diff=61385&oldid=61384 |
| 07:40:13 | <h2ibot> | PaulWise edited Obstacles (+11, move hugin to the table): https://wiki.archiveteam.org/?diff=61386&oldid=61385 |
| 07:40:14 | <h2ibot> | PaulWise edited Obstacles (+1, typo): https://wiki.archiveteam.org/?diff=61387&oldid=61386 |
| 07:51:15 | <h2ibot> | PaulWise edited Obstacles (+67, Pyison poisoner): https://wiki.archiveteam.org/?diff=61388&oldid=61387 |
| 07:51:45 | <pabs> | arkiver: another poisoner https://github.com/JonasLong/Pyison |
| 07:53:03 | | lossless (lossless) joins |
| 07:56:32 | <cruller> | RE: https://irclogs.archivete.am/archiveteam-bs/2026-05-14#l5d4f66d7 It appears that ScienceDirect uses Cloudflare solutions, but I’m not sure exactly which ones. They’re probably using a combination of several, and they might even be using some that are quite expensive and rare. |
| 07:58:55 | <cruller> | It seems Zotero is also struggling with that: https://forums.zotero.org/discussion/109940/zotero-is-not-saving-pdfs-from-sciencedirect/p3 |
| 08:00:06 | | HermanToothrot joins |
| 08:02:31 | <cruller> | IIRC the Chrome extension for Zotero saves webpages as SingleFile rather than WARC, and this CAPTCHA might be one of the reasons why. (Though the main reason is probably convenience.) |
| 08:06:32 | <cruller> | (However, I found it a bit inconvenient that when I used Chrome’s built-in translation feature and then clicked “Save as Single File,” only the translated page was saved.) |
| 08:14:18 | <h2ibot> | Cruller edited Obstacles (+773, /* List of softwares */ Replaced “Cloudflare”…): https://wiki.archiveteam.org/?diff=61389&oldid=61388 |
| 08:19:39 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 08:28:57 | | McAfee leaves [Disconnected: Replaced by new connection] |
| 08:29:03 | | McAfee joins |
| 08:30:20 | <h2ibot> | Cruller edited List of websites excluded from the Wayback Machine (+110, Added https://www.libero.it/ — Thx…): https://wiki.archiveteam.org/?diff=61390&oldid=61373 |
| 08:34:08 | | HermanToothrot joins |
| 08:39:03 | | Wohlstand (Wohlstand) joins |
| 08:45:23 | <h2ibot> | KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites and/or updated count.): https://wiki.archiveteam.org/?diff=61391&oldid=61390 |
| 08:48:41 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 08:49:22 | | TheEnbyperor_ is now known as TheEnbyperor |
| 08:51:06 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 08:57:46 | | TheEnbyperor_ joins |
| 09:01:05 | <klea> | Oh, woah, people have worked on that Obstacles page a lot. |
| 09:02:41 | <klea> | Time to develop a packet dump → WARC tool? You'd also need the fun encryption secrets, but IIRC those can be obtained with ${SSLKEYLOGFILE}. |
| 09:17:21 | <pabs> | hmm, I wonder if wireshark can open WARC yet |
| 09:22:05 | | HermanToothrot joins |
| 09:33:29 | <h2ibot> | Cruller edited Obstacles (+78, /* List of softwares */ Added Official Site and…): https://wiki.archiveteam.org/?diff=61392&oldid=61389 |
| 09:34:00 | <cruller> | BTW https://builtwith.com/ is great |
| 09:39:15 | <klea> | Uhh |
| 09:39:52 | <klea> | https://developers.google.com/recaptcha/ claims it's deprecated, and tells me to look at https://docs.cloud.google.com/recaptcha/docs which I suppose does more things since it says "Google Cloud Fraud Defense". |
| 09:42:30 | <h2ibot> | Cruller edited Discourse (+110, Added links to https://builtwith.com/?discourse): https://wiki.archiveteam.org/?diff=61393&oldid=61067 |
| 09:42:31 | <pabs> | the latter is a new thing, where you have to own a Google/Apple phone to bypass the protection |
| 09:43:26 | <klea> | https://en.wikipedia.org/wiki/ReCAPTCHA?useskin=vector links to the latter. |
| 09:43:31 | <h2ibot> | Klea edited Obstacles (+276, Added Official Site for reCaptcha and usage…): https://wiki.archiveteam.org/?diff=61394&oldid=61392 |
| 09:43:34 | <klea> | uhh, not to either. |
| 09:44:31 | <h2ibot> | Klea edited Obstacles (-12, /* List of softwares */ Link to the legacy…): https://wiki.archiveteam.org/?diff=61395&oldid=61394 |
| 09:45:42 | <klea> | cruller: I don't exactly like it requiring js. |
| 09:48:11 | <cruller> | klea: I agree with you on that point. |
| 09:48:25 | <cruller> | Over the past few days, there’s been a lot of talk about the reCAPTCHA update |
| 09:48:28 | <klea> | And also it doesn't give their full list, which is a bit misleading. |
| 09:53:11 | <cruller> | Despite the message "Download Full Lead List — Create a Free Account to see more results."? That's too bad. |
| 09:53:33 | <klea> | I mean, not WARCable! |
| 09:54:10 | <klea> | Funs: https://cloud.google.com/recaptcha-enterprise → https://cloud.google.com/security/products/recaptcha |
| 10:03:52 | | phuzion quits [Ping timeout: 268 seconds] |
| 10:04:39 | <klea> | cruller: Do you think making a Template:builtwith that returns something like [https://trends.builtwith.com/websitelist/{{{{1|}} builtwith:{{{1|}}] is something useful or probably just annoying? |
| 10:06:57 | | TastyWiener952 (TastyWiener95) joins |
| 10:07:34 | | TastyWiener95 quits [Ping timeout: 268 seconds] |
| 10:07:34 | | TastyWiener952 is now known as TastyWiener95 |
| 10:08:14 | | Wohlstand quits [Client Quit] |
| 10:09:34 | <h2ibot> | Klea edited Obstacles (+533, /* List of softwares */ Split reCAPTCHA versions): https://wiki.archiveteam.org/?diff=61397&oldid=61395 |
| 10:10:35 | <h2ibot> | Klea edited Obstacles (-104, Remove extra (builtwith website lists) text,…): https://wiki.archiveteam.org/?diff=61398&oldid=61397 |
| 10:11:35 | <h2ibot> | Klea edited Obstacles (+23, /* List of softwares */ Make table smaller by…): https://wiki.archiveteam.org/?diff=61399&oldid=61398 |
| 10:16:48 | <cruller> | I have no idea, but I don't think it will be a template that's used very often. |
| 10:24:53 | | phuzion (phuzion) joins |
| 10:27:37 | <h2ibot> | Klea edited Vine (+10, Project data available only via WBM, since IA…): https://wiki.archiveteam.org/?diff=61400&oldid=59805 |
| 10:29:09 | | Juest quits [Ping timeout: 268 seconds] |
| 10:33:02 | <klea> | I should make a list of all the AT collections that got ARId, and have KleaBot modify the wiki adding the padlock in the [[Template:IA id]] invocations that don't yet have it. |
| 10:36:39 | <cruller> | The use cases for Template:builtwith would likely be archival targets (e.g., Discourse) and obstacles? |
| 10:37:16 | <klea> | I think it might be better to just link it, and also download their full lists and put them on the wiki too. |
| 10:38:57 | <klea> | grep -Po '\{\{IA (id|file|item|collection)\|\K[^}]*\}\}' archiveteam-dump-2026-05-16-nohistory-nofiles.xml|sed 's,}}$,,' |sed 's,^1=,,' >collections_with_tags.txt; sed 's,|.*,,' collections_with_tags.txt|sort|uniq >collections.txt |
| 10:39:07 | <klea> | Now I have to probe at IA 1588 times. |
| 10:40:56 | | Juest (Juest) joins |
| 10:44:12 | <cruller> | If you made their full lists available for anyone, they would be upset :P |
| 10:46:24 | <klea> | Uhh, they publish the data already. :> |
| 10:49:02 | <cruller> | Yeah, the area behind the free login wall is indeed "public" |
| 10:50:20 | <klea> | cruller: Here's a fun list of IA collections: :P https://wiki.archiveteam.org/index.php/Internet_Archive/Collections |
| 10:51:59 | <klea> | JAA: I've noticed that https://internetarchive.archiveteam.org/dumps/ doesn't exist. |
| 10:53:54 | <klea> | WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
| 10:54:13 | <klea> | I am sending it here rather than on EFNet's #archiveteam channel for reasons. |
| 10:56:17 | <klea> | JAA: Can you tell it to me, you know the secret word! |
| 10:56:28 | | Wohlstand (Wohlstand) joins |
| 10:59:11 | <klea> | My quest is trying to update the wiki somehow, and also setting up a KleaBot there too, for if I need to do mass-edits. |
| 10:59:13 | <cruller> | Oh it seems I was completely mistaken lol |
| 11:00:12 | | Bleo18260072271962345522201107 quits [Quit: The Lounge - https://thelounge.chat] |
| 11:00:18 | <klea> | I guess I can try the fileformats secret word. |
| 11:02:59 | | Bleo18260072271962345522201107 joins |
| 11:03:04 | | HermanToothrot quits [Ping timeout: 268 seconds] |
| 11:03:17 | | @imer quits [Quit: Oh no] |
| 11:04:09 | | imer (imer) joins |
| 11:04:09 | | @ChanServ sets mode: +o imer |
| 11:06:25 | <klea> | JAA: Disregard, I have gotten enlightened. |
| 11:10:03 | <klea> | JAA: May you change the message to ask people to come to HackINT's #archiveteam-bs instead of to EFNet's #archiveteam, possibly change the captcha word, and give bot rights to User:KleaBot? |
| 11:11:43 | <klea> | Uhh |
| 11:11:52 | <klea> | Well, first register I guess. |
| 11:11:57 | | Wohlstand quits [Client Quit] |
| 11:15:43 | <h2ibot> | Cruller edited Deathwatch (+161, /* 2026-06 */ Add Holoearth): https://wiki.archiveteam.org/?diff=61401&oldid=61375 |
| 11:18:27 | | HermanToothrot joins |
| 11:19:38 | <klea> | JAA: May you update the ArchiveTeam wiki (wiki.archiveteam.org) to a MediaWiki version that supports 2FA? |