00:00:30<Webuser446199>Anyone else think VRChat should be kept on watch since Rec Room is getting taken down and VRChat is like a very very sketchy thing with some pretty sketchy devs
00:02:00pedantic-darwin joins
00:06:55etnguyen03 (etnguyen03) joins
00:10:50<pokechu22>VRChat would be neat to do at some point, but it probably would also be a lot of work
00:11:38<Webuser446199>Wdym by that?
00:13:19<Webuser446199>I know a lot of people in VRChat people who make clients and tinker with the API I feel like if anything like the grabber needed to get made I could try to contact some people I know to get that cooked up
00:13:38<pokechu22>It would require a lot of reeverse-engineering, but if people have already looked into it that would help
00:16:18<Webuser446199>Yeah the only documentation for the API is at https://vrchat.community/
00:16:27<Webuser446199>Worlds would be easy as they don't have any encryption
00:17:58<Webuser446199>There is already a lil community that archive avatars https://files.catbox.moe/79mhkm.mp4
00:20:04<Webuser446199>The way people decrypt avatars is pretty unknown but
00:23:24SootBector quits [Remote host closed the connection]
00:26:42SootBector (SootBector) joins
00:27:37<Webuser446199>But I feel like getting world, group, and account data would be the easiest stuff to archive
00:33:33fangfufu_ is now known as fangfufu
01:04:03Arcorann__ (Arcorann) joins
01:10:37Webuser864614 quits [Client Quit]
01:11:52BlueBlaziken quits [Quit: See you all soon!]
01:21:51moth3 joins
01:23:44<@JAA>The bulk of CBS News Radio's news feeds is done. Some small parts still to be dealt with, mostly the segments from the past couple months, which I'll do later today but before the final broadcasts. I didn't do all the redirects yet either because there are 18 of them (6 patterns on 3 domains) and their server's too slow, but they're not critical; one domain and pattern is covered, and if things stay
01:23:50<@JAA>online past the end of broadcasting, I'll run the others then.
01:25:18<@JAA>I also found an endpoint that directly returns the most recent ID, by the way. But they're sequential anyway, so it would've worked fine without that as well.
01:25:26<@JAA>Vito`: ^^
01:26:30<@JAA>About 3039731 successful downloads and 7634 errors (likely mostly 403s, still to be analysed), 1.13 TiB
01:27:15<Vito`>JAA: great to hear! What was the endpoint you discovered?
01:38:37dabs quits [Read error: Connection reset by peer]
01:39:17<Webuser446199>guess youre muted
01:42:12Webuser271342 joins
01:49:52<@JAA>Vito`: https://www.cbsradionewsfeed.com/RemotePlayer/newscast.php
01:50:24<@JAA>I haven't checked whether it's the exact last ID or just the last hourly newscast, but close enough.
02:10:12Webuser446199 quits [Client Quit]
02:53:07etnguyen03 quits [Client Quit]
02:54:11etnguyen03 (etnguyen03) joins
03:14:43etnguyen03 quits [Remote host closed the connection]
03:28:45Island quits [Read error: Connection reset by peer]
03:54:25Muad-Dib quits [Quit: ZNC - http://znc.in]
04:03:08Muad-Dib joins
04:04:15Webuser991334 joins
04:04:24Webuser991334 quits [Client Quit]
04:09:31<Vito`>JAA: oh interesting, I had that in my list of scraped URLs but I didn't realize what it was, good catch
04:18:49<@JAA>Vito`: Do you know of any way to get metadata on the episodes/files? (Apart from the little that is in the URL structure, obviously.)
04:25:30DogsRNice quits [Read error: Connection reset by peer]
04:26:13<Vito`>JAA: ugh, so, the original various RSS feeds used to link to the files directly. After the Audacy acquisition they rewrote all of the enclosure URLs and GUIDs to a dynamic ad injection provider. You can scrape the WayBack for all the captured RSS feeds pre-acquisition and map those, but it's only for things syndicated that way. Nothing has ID3 tags in all the files I spot-checked.
04:26:40<Vito`>JAA: Your best bet for the rest, especially the NXXX_STATION_COUNTER files, would be to try and crack a login to the actual web UI
04:30:22<Vito`>Could probably get lucky and phish someone's JWT, surely no-one is paying that close attention to what they're clicking on for their last 24 hours on air
04:59:11ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
05:00:00ThetaDev joins
05:08:36n9nes quits [Ping timeout: 268 seconds]
05:26:41nexussfan quits [Remote host closed the connection]
05:27:22SootBector quits [Ping timeout: 240 seconds]
05:30:16SootBector (SootBector) joins
05:48:52hexa- quits [Quit: Disconnected]
05:49:48hexa- (hexa-) joins
06:33:45Webuser254902 joins
06:37:21_Apgap joins
06:41:06Boppen_ quits [Ping timeout: 268 seconds]
06:45:29unknownsrc quits [Ping timeout: 268 seconds]
06:47:27unknownsrc (unknownsrc) joins
06:58:45BC01 quits [Quit: Ping timeout (120 seconds)]
07:01:10BC01 joins
07:08:55leo60228_ (leo60228) joins
07:09:05n9nes joins
07:10:45leo60228 quits [Ping timeout: 268 seconds]
07:10:46leo60228_ is now known as leo60228
07:15:04n9nes quits [Ping timeout: 268 seconds]
07:15:18n9nes joins
07:16:15driib975 quits [Ping timeout: 268 seconds]
07:23:39ATinySpaceMarine quits [Ping timeout: 268 seconds]
07:36:39rohvani quits [Ping timeout: 268 seconds]
07:37:50ThreeHM quits [Ping timeout: 268 seconds]
07:42:51Arcorann_ (Arcorann) joins
07:42:57Starchives_ (Starchives) joins
07:43:41Boppen_ joins
07:43:46nine quits [Quit: See ya!]
07:43:58nine joins
07:44:17LddPotato quits [Read error: Connection reset by peer]
07:44:43LddPotato joins
07:46:28Starchives__ quits [Ping timeout: 268 seconds]
07:47:05_Apgap quits [Ping timeout: 268 seconds]
07:47:05Arcorann__ quits [Ping timeout: 268 seconds]
07:47:09ThreeHM (ThreeHeadedMonkey) joins
07:47:09ThreeHM quits [K-Lined]
07:48:19ThreeHM (ThreeHeadedMonkey) joins
07:50:01Webuser815946 quits [Quit: Ooops, wrong browser tab.]
07:53:35beardicus quits [Quit: Ping timeout (120 seconds)]
07:53:49beardicus (beardicus) joins
08:13:30barry quits [Quit: Ping timeout (120 seconds)]
08:15:00driib975 (driib) joins
08:19:23McAfee leaves [Disconnected: Hibernating too long]
08:19:25barry joins
08:35:55fluke joins
09:10:02lun4 quits [Read error: Connection reset by peer]
09:10:12lun4 joins
09:10:16beastbg8_ quits [Read error: Connection reset by peer]
09:11:00beastbg8_ joins
09:12:48colona quits [Ping timeout: 268 seconds]
09:13:14HP_Archivist (HP_Archivist) joins
09:13:25efi quits [Ping timeout: 268 seconds]
09:16:09BC01 quits [Client Quit]
09:16:44BC01 joins
09:23:43McAfee joins
09:24:19colona joins
09:24:26yasomimi (yasomi) joins
09:26:25yasomi quits [Ping timeout: 268 seconds]
09:26:26yasomimi is now known as yasomi
09:38:35efi (efi) joins
09:49:10<h2ibot>Exorcism edited Kakao TV (+286): https://wiki.archiveteam.org/?diff=61935&oldid=61931
09:55:23gosc joins
09:56:02<gosc>did anyone notice or attempt to archive this? blog.nicovideo.jp just randomly deleted seemingly all of their articles prior to the year 2017
09:56:31<gosc>it must've been very recent (within the last three or so months), every single article is still listed on google but gives a 404
09:56:48<gosc>every article under https://blog.nicovideo.jp/niconews/ni.*.html now gives a 404
09:57:07<gosc>articles from 2017 have broken css, I suspect these are at massive risk of being deleted next
09:58:41<gosc>some samples of pre and post 2017 niconico blog articles here: https://www.google.com/search?q=https%3A%2F%2Fblog.nicovideo.jp%2F&tbs=cdr%3A1%2Ccd_min%3A1%2F1%2F2016%2Ccd_max%3A12%2F31%2F2017&tbm=
10:00:14gosc quits [Client Quit]
10:00:14<h2ibot>Exorcism edited Main Page/Current Projects (+265): https://wiki.archiveteam.org/?diff=61936&oldid=61335
10:00:39gosc joins
10:06:12<gosc>seems the images used for the blog entries are still around though
10:11:53SootBector quits [Remote host closed the connection]
10:13:03SootBector (SootBector) joins
10:17:38hyperreal quits [Quit: hyperreal]
10:23:34hyperreal (hyperreal) joins
10:44:03<c3manu>gosc: doesn't look like it, but i'm gonna run it in #archivebot now just in case
10:53:48BC01 quits [Client Quit]
10:56:07<c3manu>gosc: hm, pages like https://blog.nicovideo.jp/niconews/1954 seem to existstill at least
10:56:48<c3manu>oh nvm, those all seem to be from 2026 :/
11:00:12Bleo18260072271962345522201107 quits [Quit: The Lounge - https://thelounge.chat]
11:00:50BC01 joins
11:03:01Bleo18260072271962345522201107 joins
11:05:35<c3manu>gosc: i briefly tried finding an announcement or anything but didn't find anything. but i'm not really at home in that environment and also have to rely on machine translation for that
11:05:39<hexagonwin>gosc the top result on your google link is https://blog.nicovideo.jp/niconews/3609.html for me and it loads
11:06:27<hexagonwin>the one right below it https://blog.nicovideo.jp/niconews/ni064932.html doesn't, but the third one loads too https://blog.nicovideo.jp/niconews/37350.html
11:06:28<hexagonwin>hmm...
11:08:41<hexagonwin>maybe can we save some from yandex webcache? for example https://yandexwebcache.net/yandbtm?fmode=inject&tm=1779448086&tld=ru&lang=ja&la=1776790784&text=https%3A//blog.nicovideo.jp/niconews/ni000684.html&url=https%3A//blog.nicovideo.jp/niconews/ni036869.html&l10n=ru&mime=html&sign=da5d439ac647d3b53993f5c3c85f0cb2&keyno=0
11:09:09<hexagonwin>(i barely use yandex so idk well, but afaik they're the only search engine that has webcache feature now)
11:10:54<c3manu>we should check whether the #archivebot found URLs like https://blog.nicovideo.jp/niconews/120766.html - it seems to me like it could possibly discover most other old posts from that, if they still exist?
11:11:25<c3manu>looks like the old ones start with the numer '1', while the new ones start with '2': https://blog.nicovideo.jp/niconews/269254.html
11:11:36<c3manu>would be curious to see whether that holds up
11:13:05<c3manu>https://blog.nicovideo.jp/niconews/120766.html has a few navigation links at the bottom. looks like pages below http://blog.nicovideo.jp/seiga/ are also gone
11:19:11c3manu quits [Remote host closed the connection]
11:19:16c3manu (c3manu) joins
11:22:35<c3manu>hm..i can see a bunch of hist on the index.xml in January from #// (also a few single ones starting 2025-10), but i don't see a mention of nicosomething in the logs
11:22:36<c3manu>https://web.archive.org/web/20260000000000*/https://blog.nicovideo.jp/niconews/index.xml
11:24:07Bleo18260072271962345522201107 quits [Client Quit]
11:24:21Bleo18260072271962345522201107 joins
11:25:12LddPotato quits [Read error: Connection reset by peer]
11:25:37LddPotato joins
11:28:59<c3manu>looks like the job has discovered https://blog.nicovideo.jp/niconews/122345.html, so at least some of the old content will be found
11:31:34Cupping1285 quits [Quit: Ping timeout (120 seconds)]
11:31:45Cupping1285 joins
11:34:22chrismeller3 quits [Quit: Ping timeout (120 seconds)]
11:35:07<c3manu>if someone can get all versions of https://blog.nicovideo.jp/niconews/index.xml that have ever been recorded in the WBM, one could create at least an impartial list. but i have never done that before
11:35:20chrismeller3 (chrismeller) joins
11:38:12chrismeller3 quits [Client Quit]
11:38:32chrismeller3 (chrismeller) joins
12:00:47AlsoHP_Archivist (HP_Archivist) joins
12:02:26HP_Archivist quits [Ping timeout: 268 seconds]
12:28:56<cruller>I'm downloading https://transfer.archivete.am/inline/HoZ5W/All%20versions%20of%20https%EF%BC%8F%EF%BC%8Fblog.nicovideo.jp%EF%BC%8Fniconews%EF%BC%8Findex.xml.txt now. Also, shall we move onto #niconino:hackint.org ? This discussion is likely to be a bit lengthy.
12:53:22Webuser298522 joins
13:03:19Webuser298522 quits [Client Quit]
13:07:30<gosc>c3manu, thanks! sorry was out for a bit
13:07:45<gosc>hexagonwin, didn't know about yandex webcache at all, was looking for something like that!
13:08:08<gosc>cruller, forgot there was a channel for niconico sorry
13:27:24anarcat quits [Quit: rebooting]
13:30:41anarcat (anarcat) joins
13:46:56ATinySpaceMarine joins
13:57:15Chris50102 (Chris5010) joins
13:59:33Chris5010 quits [Ping timeout: 268 seconds]
13:59:33Chris50102 is now known as Chris5010
14:12:02Island joins
14:15:02<cruller>Re MikuMikuDance, the last mass deletion took place in January 2025, and there are no signs that the same thing is about to happen again AFAIK.
14:18:50<cruller>However, while it’s not urgent at all, I think the official MikuMikuDance website https://sites.google.com/view/vpvp/ is worth archiving. It hasn’t been updated much in seven years, and AB has never crawled it.
14:21:12<cruller>Well, I'm not a NicoNico freak anymore, so I might be wrong.
14:22:20Cupping1285 quits [Client Quit]
14:22:37Cupping1285 joins
14:29:26rohvani joins
14:57:34Arcorann_ quits [Ping timeout: 268 seconds]
15:12:00Exorcism quits [Quit: Ping timeout (120 seconds)]
15:12:00DigitalDragons quits [Quit: Ping timeout (120 seconds)]
15:12:19DigitalDragons (DigitalDragons) joins
15:12:20Exorcism (exorcism) joins
15:22:32<that_lurker>If someone has a gaming computer https://endix-expo.com/ contains a lot of qr codes and maybe videos that should maybe be archived
15:23:05<that_lurker>another company is againg trying the expo as a game thing
15:42:41gosc quits [Quit: Leaving]
15:44:21<exorcism|m>Too many project what's going on this week 😭😭
15:45:24<@arkiver>exorcism|m: end of the month :P
15:54:32<@JAA>Vito`: Yeah, definitely not doing any of that latter part. Are the RSS feeds also under cbsradionewsfeed.com?
15:59:51ThreeHM quits [Ping timeout: 268 seconds]
16:01:39ThreeHM (ThreeHeadedMonkey) joins
16:05:36nine quits [Quit: See ya!]
16:05:48nine joins
16:21:03michaelblob7641 quits [Quit: yoop]
16:21:46michaelblob7641 joins
16:22:14<Vito`>JAA: here's a list of everything that looked like an RSS feed from my WB scrape: https://gist.github.com/vitorio/5e239ae2dd7f534c5462b4030eb2ab38
16:26:08<@JAA>Ack, thanks.
16:26:18<@JAA>There's the newscast.php again. :-P
16:26:42<@JAA>Those URLs in the CDX API data are exactly how I found it.
16:30:41michaelblob7641 quits [Client Quit]
16:31:26michaelblob7641 joins
16:32:35michaelblob7641 quits [Client Quit]
16:36:04michaelblob7641 joins
16:38:02Cupping1285 quits [Ping timeout: 268 seconds]
16:40:45Cupping1285 joins
17:26:05McAfee leaves [Disconnected: Hibernating too long]
17:29:14McAfee joins
17:43:20hyperreal7 (hyperreal) joins
17:43:22hyperreal7 quits [Client Quit]
18:09:27Kabaya quits [Quit: おつかなた~]
18:15:31ThreeHM quits [Ping timeout: 268 seconds]
18:17:00ThreeHM (ThreeHeadedMonkey) joins
18:51:54BC01 quits [Ping timeout: 268 seconds]
19:25:22nine quits [Quit: See ya!]
19:25:34nine joins
19:53:55Wohlstand1 (Wohlstand) joins
19:56:21Wohlstand1 is now known as Wohlstand
20:00:15etnguyen03 (etnguyen03) joins
20:07:08moth3 quits [Ping timeout: 268 seconds]
20:08:06moth3 joins
20:13:03Webuser079494 joins
20:17:41Webuser079494 quits [Client Quit]
20:19:17Webuser319031 joins
20:22:00Webuser845400 joins
20:22:32Webuser845400 quits [Client Quit]
20:24:10Webuser271342 quits [Quit: Ooops, wrong browser tab.]
20:27:18nathang2184384 joins
20:29:54nathang218438 quits [Ping timeout: 268 seconds]
20:29:54nathang2184384 is now known as nathang218438
20:31:54BearFortress_ quits []
20:32:38<Dango360>is may the most popular month for websites to shut down?
20:33:39moth3 quits [Ping timeout: 268 seconds]
20:34:16<katia>maybe deathwatch wiki page can answer this for you
20:36:27<@JAA>My perception is December, but maybe not.
20:59:33Webuser319031 quits [Client Quit]
21:01:08Webuser364319 joins
21:02:38klea quits [Ping timeout: 268 seconds]
21:02:39alexlehm quits [Ping timeout: 268 seconds]
21:03:00alexlehm (alexlehm) joins
21:03:05klea (jmjl) joins
21:11:04<Webuser364319>To archive rec room would the address be this? docker run -d --name archiveteam --label=com.centurylinklabs.watchtower.enable=true --restart=unless-stopped atdr.meo.ws/archiveteam/grab-base:nss --concurrent 1 logged-zip
21:12:27<@JAA>The image names match the grab repo name listed on the wiki page: atdr.meo.ws/archiveteam/recroom-grab
21:13:18<Webuser364319>oh yeah i just noticed that i thought it was the thing in the Dockerfile file
21:13:56<@JAA>That's the base image.
21:14:15dabs joins
21:15:09dabs quits [Remote host closed the connection]
21:15:22dabs joins
21:16:21Webuser254902 quits [Quit: Ooops, wrong browser tab.]
21:17:14<Webuser364319>Wait say I want to run like roblox groups and rec room should i change --name archiveteam to specificy what it is or can it just stay as archiveteam and it will be all put under like one thing
21:17:52<@JAA>I don't think you can have multiple containers with the same name.
21:19:51<Webuser364319>Do you know what the max concurrent can be without getting rate limited for recroom
21:20:21<@JAA>→ #wreckroom
21:21:31nexussfan (nexussfan) joins
21:22:05BearFortress joins
21:24:13klea quits [Ping timeout: 268 seconds]
21:25:27alexlehm quits [Ping timeout: 268 seconds]
21:28:29alexlehm (alexlehm) joins
21:36:30alexlehm quits [Ping timeout: 268 seconds]
21:43:49etnguyen03 quits [Client Quit]
21:45:51alexlehm (alexlehm) joins
21:47:25klea (jmjl) joins
21:51:18alexlehm quits [Ping timeout: 268 seconds]
21:55:20alexlehm (alexlehm) joins
22:03:01moth3 joins
22:06:51Shard7 quits [Remote host closed the connection]
22:15:56moth3 quits [Read error: Connection reset by peer]
22:18:16moth3 joins
22:22:48moth3 quits [Ping timeout: 268 seconds]
22:28:25moth3 joins
22:43:02etnguyen03 (etnguyen03) joins
22:49:32dabs quits [Read error: Connection reset by peer]
22:57:13etnguyen03 quits [Client Quit]
22:59:43Wohlstand quits [Quit: Wohlstand]
23:06:01wickedplayer494 quits [Remote host closed the connection]
23:06:50wickedplayer494 (wickedplayer494) joins
23:18:08SootBector quits [Remote host closed the connection]
23:24:04SootBector (SootBector) joins
23:30:03etnguyen03 (etnguyen03) joins
23:32:26f_ quits [Ping timeout: 268 seconds]
23:32:40f_ (funderscore) joins
23:35:56McAfee leaves [Disconnected: Replaced by new connection]
23:36:00McAfee joins
23:37:22f_ quits [Ping timeout: 268 seconds]
23:40:20etnguyen03 quits [Client Quit]
23:40:54Shard7 (Shard) joins
23:42:33etnguyen03 (etnguyen03) joins
23:42:51f_ (funderscore) joins
23:43:32McAfee leaves
23:43:34McAfee joins
23:47:51f_ quits [Ping timeout: 268 seconds]
23:50:51f_ (funderscore) joins