00:00:36<v01d>ok you are the irc channel of archiveteam website
00:01:30<v01d>that was misleading to put your address as a reference to archive. today
00:01:44<klea>should i add "The tables have turned…", TechLinked, for that shoutout?, https://www.youtube.com/watch?v=2sl0NwUCZlg&t=417s ( i guess not since https://irclogs.archivete.am/archiveteam/2025-07-05#l5afc0c30 it was said)
00:11:44<TheTechRobo>klea: Warcprox would need hardening for that idea to work. It exposes an API with a fair amount of attack surface (e.g. WARCPROX_WRITE_RECORD): https://github.com/internetarchive/warcprox/blob/master/api.rst
00:11:55<klea>oh
00:12:05<TheTechRobo>It'd definitely be useful, though. An archiveweb.page alternative.
00:12:22<TheTechRobo>(that presumably wouldn't violate the WARC standard :P)
00:12:47v01d quits [Remote host closed the connection]
00:13:09<klea>yes, so long you trust AT to not manipulate your data, and add the cert from the service to your CA roots
00:13:27<TheTechRobo>it'd be the other way around, no?
00:13:38<klea>wdym?
00:13:43<TheTechRobo>If AT trusts you not to abuse the warcprox API
00:13:49<TheTechRobo>Unless you mean in general, with a hardened warcprox
00:14:09<klea>no, the the service provided by AT would probably not give you any api to poke at
00:14:24<klea>the thingy i was thinking of would connect to warcprox instead of you to warcprox
00:14:41<TheTechRobo>Oh, I see
00:14:51<klea>mostly because it needs to run a separate instance per user, to have all your traffic be sent back to you
00:14:56<klea>but mitmed at AT's end
00:15:37<TheTechRobo>You'd have to make sure to replace any Warcprox-Meta header in the request, but other than that, that'd work
00:16:32<TheTechRobo>or at least certain fields, since that header does have non-dangerous uses
00:20:27Webuser524414 joins
00:20:48etnguyen03 quits [Client Quit]
00:21:26Webuser5244140 joins
00:21:42Webuser524414 quits [Client Quit]
00:21:42Webuser5244140 quits [Client Quit]
00:24:04v01d joins
00:27:58<v01d>still nice to have stumbled here
00:28:44<v01d>I didn't know about you before
00:29:07lennier2 joins
00:32:11lennier2_ quits [Ping timeout: 272 seconds]
00:35:09Cuphead2527480 quits [Client Quit]
00:43:22graham9 joins
00:45:29graham9 quits [Client Quit]
00:49:44<h2ibot>Klea edited Manual projects (-28, update statuses somewhat): https://wiki.archiveteam.org/?diff=58297&oldid=47657
00:52:29etnguyen03 (etnguyen03) joins
00:53:45<h2ibot>Klea edited Woohoo (+0, Close table properly so it doesn't appear at…): https://wiki.archiveteam.org/?diff=58298&oldid=46764
01:15:32Wohlstand (Wohlstand) joins
01:16:25ell7 quits [Remote host closed the connection]
01:22:21Webuser906815 quits [Client Quit]
01:27:36cyanbox joins
01:56:43<nicolas17>I didn't know of the Manual Projects page
01:56:49<nicolas17>I should document my samsung-opensource stuff
01:59:02<nicolas17>although tbf it *could* be less manual than it is
02:00:20guest1 joins
02:04:45ericgallager quits [Quit: This computer has gone to sleep]
02:10:51<klea>nicolas17: was last updated in 2019, so makes sense
02:11:15<klea>nicolas17: afaik you have to manually add more files to download?
02:17:23<pabs>anyone know of global website checkers other than check-host.net and globalping.io? (both need JS to display the results)
02:17:52pabs . o O 0 ( maybe we already have a DPoS one? )
02:27:00<klea>it'd be a fun project to setup for warriors to run :p
02:29:21<@JAA>FWIW, check-host.net needs JS to check or render, but the data is all in the HTML, so AB still works fine with it.
02:29:39<@JAA>Or at least it did last time I checked.
02:31:01<pabs>huh
02:31:57<pabs>yep, confirmed that is still true
02:32:24<pabs>JavaScript--
02:32:25<eggdrop>[karma] 'JavaScript' now has -23 karma!
02:36:09<h2ibot>Klea edited Manual projects (+6, [[Current_Projects]] says it's On hiatus): https://wiki.archiveteam.org/?diff=58299&oldid=58297
02:37:32ericgallager joins
02:43:43<nicolas17>klea: yeah when Samsung publishes new stuff I have to manually add it to the queue
02:45:24<nicolas17>and for items with multiple files like this one it's easier to just download them myself https://opensource.samsung.com/uploadSearch?searchValue=F731BXXU5EYD9
02:48:24<jinn6>+6
02:53:22guest1 quits [Read error: Connection reset by peer]
02:54:13guest1 joins
02:59:34<klea>oh
02:59:44<klea>nicolas17: do you still add it to the tracker?
03:04:07nexussfan quits [Quit: Konversation terminated!]
03:05:17nexussfan (nexussfan) joins
03:17:22guest1 quits [Read error: Connection reset by peer]
03:18:49guest1 joins
03:21:54guest1 quits [Read error: Connection reset by peer]
03:24:39etnguyen03 quits [Client Quit]
03:32:54nine quits [Quit: See ya!]
03:33:06nine joins
03:33:06nine quits [Changing host]
03:33:06nine (nine) joins
03:36:59Island quits [Read error: Connection reset by peer]
03:41:21guest1 joins
03:43:32guest2 joins
03:45:27guest2 quits [Client Quit]
03:47:50guest1 quits [Ping timeout: 256 seconds]
03:49:43Wohlstand quits [Client Quit]
03:53:19nathang2184 quits [Quit: Ping timeout (120 seconds)]
03:53:37nathang2184 joins
03:59:17SootBector quits [Remote host closed the connection]
04:00:24SootBector (SootBector) joins
04:01:28Justin14p_ joins
04:04:16Justin14p quits [Ping timeout: 256 seconds]
04:07:59etnguyen03 (etnguyen03) joins
04:11:22<h2ibot>Zen edited List of websites excluded from the Wayback Machine (+29, https://elnea.wicurio.com/): https://wiki.archiveteam.org/?diff=58300&oldid=58277
04:17:25etnguyen03 quits [Remote host closed the connection]
04:20:11DogsRNice quits [Read error: Connection reset by peer]
04:25:39Webuser407710 joins
04:32:23ericgallager quits [Client Quit]
04:32:54Webuser407710 quits [Client Quit]
05:13:40HackMii quits [Remote host closed the connection]
05:14:19HackMii (hacktheplanet) joins
05:52:14guest1 joins
06:14:16SootBector quits [Remote host closed the connection]
06:15:26SootBector (SootBector) joins
06:21:02nexussfan quits [Quit: Konversation terminated!]
06:24:34tzt quits [Quit: tzt]
06:24:44tzt (tzt) joins
06:24:47tzt quits [Client Quit]
06:24:57tzt (tzt) joins
06:40:01Guest58 quits [Read error: Connection reset by peer]
06:40:09Guest58_ joins
06:47:26guest1 quits [Read error: Connection reset by peer]
06:48:24guest1 joins
06:52:56guest1 quits [Read error: Connection reset by peer]
06:53:58ericgallager joins
06:54:44Wohlstand (Wohlstand) joins
07:01:41lemuria quits [Ping timeout: 272 seconds]
07:02:22lemuria (lemuria) joins
07:09:54v01d quits [Remote host closed the connection]
07:18:56midou quits [Read error: Connection reset by peer]
07:20:47midou joins
07:35:42Webuser421602 joins
07:36:09Webuser421602 quits [Client Quit]
07:39:36Webuser882091 joins
07:39:49Webuser882091 quits [Client Quit]
08:03:36michaelblob quits [Quit: yoop]
08:04:11michaelblob joins
08:15:58emphie joins
08:32:36ericgallager quits [Client Quit]
08:43:27Dada joins
09:52:07Sluggs quits [Excess Flood]
09:56:12Sluggs (Sluggs) joins
10:13:17evergreen55 joins
10:16:45evergreen5 quits [Ping timeout: 272 seconds]
10:16:45evergreen55 is now known as evergreen5
10:31:20ymgve_ joins
10:35:07ymgve__ quits [Ping timeout: 272 seconds]
10:51:53despot joins
10:52:26<despot>question, is there any way to still look at xoom.com archives? besides using archive.today
11:06:47pabs quits [Ping timeout: 272 seconds]
11:17:36void09 quits [Read error: Connection reset by peer]
11:18:00void09 joins
11:20:02@imer quits [Ping timeout: 256 seconds]
11:31:47imer (imer) joins
11:31:47@ChanServ sets mode: +o imer
11:38:01nine quits [Quit: See ya!]
11:38:14nine joins
11:38:14nine quits [Changing host]
11:38:14nine (nine) joins
11:38:24BlackWinnerYoshi joins
11:54:31despot quits [Client Quit]
11:54:43<cruller>despot: A WARC file created by ArchiveBot in 2022 is public, but it appears to contain only the homepage. https://archive.fart.website/archivebot/viewer/?q=xoom.com
11:57:52<cruller>Services listed on https://wiki.archiveteam.org/index.php/Archive_Services may have some snapshots.
11:58:22awauwa (awauwa) joins
11:59:34awauwa quits [Client Quit]
11:59:48awauwa (awauwa) joins
12:02:29<BlackWinnerYoshi>Can IcebergCharts.com be added to Deathwatch? On 2025-11-14T14:16:24.840Z, Coda (admin) stated on Discord that "site will still be up for a year or so" (registration and creating new icebergs already disabled) and that editing will be disabled "in one or two months or so, maybe later". ArchiveBot might have problems with it due to the NSFW toggle
12:02:29<BlackWinnerYoshi>(JS?)
12:15:40<h2ibot>Cruller edited Archive Services (+43, Replace the dead link…): https://wiki.archiveteam.org/?diff=58301&oldid=57898
12:24:59<cruller>Should MemWeb be removed from https://wiki.archiveteam.org/index.php/Template:Url ? It would be nice if there were any good alternative web services...
12:31:24Dada quits [Remote host closed the connection]
12:39:32<cruller>https://github.com/oduwsdl/MemGator is self-hostable, but I don't know any public servers.
12:46:06pabs (pabs) joins
13:04:15<cruller>!tell despot see https://irclogs.archivete.am/archiveteam-bs/2025-12-08#l50ec5a4b
13:04:16<eggdrop>[tell] ok, I'll tell despot when they join next
13:10:55awauwa quits [Ping timeout: 272 seconds]
13:11:46awauwa (awauwa) joins
13:52:05TheEnbyperor_ quits [Ping timeout: 272 seconds]
13:53:02TheEnbyperor quits [Ping timeout: 256 seconds]
14:02:17TheEnbyperor (TheEnbyperor) joins
14:02:39TheEnbyperor_ joins
14:35:00<nicolas17>https://mastodon.green/@cbruegg/115683118868291321 "I always thought the Internet Archive automatically indexed pages. Is that not true at all?"
14:35:09<nicolas17>I already replied to this but maybe someone has something to add
14:42:22<nicolas17>in particular I'm out of the loop about how much crawling IA does on their own
14:42:41<nicolas17>or what other non-archiveteam sources they have for WARCs
14:42:54<nulldata>https://commoncrawl.org/ is also a data source
14:45:52<nicolas17>BlackWinnerYoshi: I heard icebergcharts was already archived, maybe you can check on wayback machine if it has the latest posts and if it has some nsfw ones?
14:46:11<h2ibot>Calmevening edited Android Applications (+366): https://wiki.archiveteam.org/?diff=58302&oldid=58212
14:46:56<nicolas17>but if edits are still possible, maybe we need to revisit closer to shutdown
14:53:12<h2ibot>Calmevening edited Android Applications (+406): https://wiki.archiveteam.org/?diff=58303&oldid=58302
14:58:13<h2ibot>Calmevening edited Android Applications (+225): https://wiki.archiveteam.org/?diff=58304&oldid=58303
14:59:21guest1 joins
15:19:11<BlackWinnerYoshi>nicolas17: oh yeah, I forgot ArchiveBot Viewer exists, and it did archive both NSFW (suf. `/nsfw`) and SFW (suf. `?sfw=true`) versions of categories, though I'm not sure to what extent. It should be rechecked on ~ Feb 2026 anyway, so I put it onto my watchlist
15:24:51<justauser>pabs: ping.pe partially works without JS, ping-admin.com seems to work fully.
15:28:54nicolas17 quits [Quit: power outage]
15:49:15guest1 quits [Read error: Connection reset by peer]
15:50:05DopefishJustin quits [Remote host closed the connection]
15:54:34guest1 joins
16:05:47DopefishJustin joins
16:11:05Wohlstand quits [Quit: Wohlstand]
16:38:27guest1 quits [Read error: Connection reset by peer]
16:47:51Wohlstand (Wohlstand) joins
16:51:36<justauser>Should I do something with color.io's Unsplash?
16:51:38<justauser>https://unsplash.com/@colorio/collections
16:51:59<justauser>Doesn't look easy to paste to some bot and probably not directly endangered.
16:57:30<h2ibot>KleaBot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=58305&oldid=58300
16:58:12<justauser>Any special consideration for Instagram image direct links?
16:58:22<justauser>Don't seem to be IP-restricted.
16:58:51<klea>maybe somewhat hard to later find?
17:06:08<justauser>Definitely so. Probably the best way would be finding the !ao list.
17:07:28<klea>yea
17:07:47<justauser>JAA, arkiver: ^
17:08:42<justauser>https://transfer.archivete.am/pm4sW/instagram_color.io.txt
17:08:42<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/pm4sW/instagram_color.io.txt
17:08:42<justauser>https://transfer.archivete.am/Y99I5/app.color.io_full.txt
17:15:33<h2ibot>Justauser edited Instagram (+278, /* Accessibility */ imginn.com, kilogram.makeup): https://wiki.archiveteam.org/?diff=58306&oldid=56438
17:26:19guest1 joins
17:55:03Dada joins
18:09:24guest2 joins
18:13:26<kline>does anyone know if scene.org throttles? they say to use their rsync service but i get <200kBps download speeds
18:13:39guest1 quits [Ping timeout: 272 seconds]
18:14:18<kline>https://files.scene.org/faq/ - currently: 61.20kB/s in rsync
18:16:32<justauser>Try one of the mirrors?
18:18:42<kline>the mirrors dont support rsync - i can download files from them but it's hard to sync the entire tree unless i crawl it the hard way
18:19:01<kline>I will if I need to, I'd just like to use their preferred method, which is apparently this
18:19:35<kline>(if you know of a mirror with unpublished rsync, that would be nice though)
18:28:11simon816 quits [Remote host closed the connection]
18:30:50simon816 (simon816) joins
18:32:56BlackWinnerYoshi quits [Quit: Ooops, wrong browser tab.]
18:42:43Wohlstand quits [Client Quit]
18:56:47<h2ibot>Klea edited List of websites excluded from the Wayback Machine (-39, Remove count to make bot regenerate it): https://wiki.archiveteam.org/?diff=58307&oldid=58305
18:59:47<h2ibot>KleaBot edited List of websites excluded from the Wayback Machine (+39, Rendered from template): https://wiki.archiveteam.org/?diff=58308&oldid=58307
19:00:54<klea>yay!
19:01:53<klea>oh no, it filled my archiveteam directory with a bunch of files :p
19:02:48nicolas17 (nicolas17) joins
19:05:12guest3 joins
19:07:47guest1 joins
19:09:23guest2 quits [Ping timeout: 272 seconds]
19:10:39guest3 quits [Ping timeout: 272 seconds]
19:10:48<@JAA>Aw, I was just thinking we could maybe use the count parser function to replace that part of the page, but nope, it blocks the transclusion loop.
19:11:40<@JAA>E.g. {{#invoke:String|count|{{:List of websites excluded from the Wayback Machine}}|\* [h]ttps?://|plain=false}}
19:11:54<@JAA>> Template loop detected: List of websites excluded from the Wayback Machine
19:12:12<klea>oh, sad :(
19:13:53<@JAA>Could move it to a template though.
19:14:19<klea>would it work with the subpages too?
19:14:53<@JAA>We'd pass the list into the template and make it return the count + the list itself.
19:15:01<@JAA>So it doesn't matter where it's located.
19:15:50<klea>nice :)
19:34:11<@JAA>We don't actually have the necessary stuff enabled on the wiki, so nope.
19:34:54guest2 joins
19:38:02<klea>oh
19:38:42guest1 quits [Ping timeout: 256 seconds]
19:44:04<klea>i made it all work with pywikibot :)
19:44:11<klea>i wonder where i should stuff the source code
19:44:19<klea>sad that i can't register on git. nor gitea. on AT
19:55:08cyanbox quits [Read error: Connection reset by peer]
19:58:44awauwa quits [Quit: awauwa]
20:04:09guest2 quits [Read error: Connection reset by peer]
20:09:39guest2 joins
20:09:50ericgallager joins
20:11:24Webuser610079 joins
20:14:32Webuser610079 quits [Client Quit]
20:19:47<steering>Soon(TM) :P
20:26:56sec^nd quits [Remote host closed the connection]
20:27:16sec^nd (second) joins
20:29:35Wohlstand (Wohlstand) joins
20:41:07<h2ibot>Klea edited Main Page/Current Projects (+12, [[CurrentWarirorProject]] is a redirect to…): https://wiki.archiveteam.org/?diff=58309&oldid=58147
20:42:12<klea>i typoed CurrentWarriorProject on the edit reason :(
20:50:13guest2 quits [Client Quit]
21:07:35Justin14p_ quits [Remote host closed the connection]
21:35:15<h2ibot>Cooljeanius edited Miiverse (+102, add "See Also" section for MVClonapedia): https://wiki.archiveteam.org/?diff=58310&oldid=57594
21:36:21DogsRNice joins
21:39:21<ericgallager>https://bsky.app/profile/zacklabe.com/post/3m7isxeqlzc2k
21:40:31<ericgallager>oh wait actually I meant to put that in #UncleSamsArchive
22:03:43<klea>btw, what's the state of archiving single pages of archive.ph?
22:18:21<h2ibot>Cooljeanius edited Discourse/uncategorized (+0, add https://community.pikminbloom.com/): https://wiki.archiveteam.org/?diff=58311&oldid=56271
22:20:56Guest58_ quits [Quit: My Mac has gone to sleep. ZZZzzz…]
22:28:30<ericgallager>oops, I didn't mean to remove the emojis in that last edit... I think I have some browser extension acting up...
22:29:17<ericgallager>possibly this one? https://addons.mozilla.org/en-US/firefox/addon/twemojify/
22:51:44Wohlstand quits [Client Quit]
22:59:26<h2ibot>Klea edited Discourse/uncategorized (+0, Undo revision 58311 by…): https://wiki.archiveteam.org/?diff=58312&oldid=58311
23:01:01<klea>possibly
23:01:20klea wonders if she should configure the url bot to order the urls on that page
23:01:27<klea>/cc JAA ^
23:01:27<h2ibot>Klea edited Discourse/uncategorized (+70, Readd https://community.pikminbloom.com/…): https://wiki.archiveteam.org/?diff=58313&oldid=58312
23:03:24<@JAA>Hmm, not sure either sorting by 'name' or by URL makes sense on that page.
23:04:06nexussfan (nexussfan) joins
23:04:31<klea>i think by url, removing common prefixes like community.
23:07:32<klea>(forum|community|discourse|forums|www|discuss(ions?)?|foro|support|bbs|help|ask|chat|comunidad)
23:07:59<@JAA>Seems overly complicated for what is basically a dumping ground. :-P
23:08:05<klea>lol :p
23:08:29<klea>is the [[Discourse]] page ordered or it's done manually too?
23:08:41<klea>afaik your code was made to support ordering that
23:08:55<@JAA>I never did anything with [[Discourse]], I think.
23:09:37<klea>i meant that your code for [[URLTeam/Dead]] could easily be repurposed for [[Discourse]]
23:10:38<@JAA>My code was only for the WBM exclusion list, but yes, sure.
23:11:15<@JAA>Oh, yeah, it did that one, too, true. But that's not URLs.
23:11:36<@JAA>That's really just sorting the list entries as they are.
23:12:18Guest58 joins
23:12:20<klea>the wbm exclusion list is separated in subpages, whilst that one uses only list entries, so i'll have to make a merge between the two :p
23:25:53Czechball quits [Ping timeout: 272 seconds]
23:29:40Czechball joins
23:32:39etnguyen03 (etnguyen03) joins
23:44:30Guest58 quits [Client Quit]