| 00:03:01 | <h2ibot> | PaulWise edited SourceForge (+120, adjust file deletions wording. for Yakov): https://wiki.archiveteam.org/?diff=57826&oldid=57821 |
| 00:13:21 | | etnguyen03 quits [Quit: Konversation terminated!] |
| 00:18:03 | <h2ibot> | PaulWise edited SourceForge (+480, update status of the developer web DPoS): https://wiki.archiveteam.org/?diff=57827&oldid=57826 |
| 00:32:23 | | SootBector quits [Remote host closed the connection] |
| 00:33:27 | | SootBector (SootBector) joins |
| 00:54:10 | | @imer quits [Ping timeout: 256 seconds] |
| 00:58:27 | | etnguyen03 (etnguyen03) joins |
| 01:01:05 | <Lord_Nightmare> | ok, so the rebecca heineman stuff needs to be done, now that she's passed |
| 01:01:16 | <Lord_Nightmare> | if it hasn't been fed to AB already |
| 01:01:36 | | imer (imer) joins |
| 01:01:36 | | @ChanServ sets mode: +o imer |
| 01:02:09 | <Lord_Nightmare> | I'm assuming the livejournal site is covered by archivebot -i blogs? |
| 01:03:46 | <pokechu22> | Livejournal uses dreamwidth and also has some other weird rate-limiting |
| 01:05:30 | <cruller> | pabs: Thank you for letting me know. For now, I should look into existing approaches before thinking more. |
| 01:07:19 | | Sokar joins |
| 01:09:20 | <pabs> | cruller: more examples: I often find people who died or company shutdowns on HN in the newest articles firehose, but have no time to read that fully, let alone archive them all, especially doing it fully (like all subdomains, social media etc). sometimes LWN has them too. also Wikipedia has a page of recent deaths, and an AT person has a bot sometimes updating https://wiki.archiveteam.org/index.php?title=Deaths_in_2025 |
| 01:09:59 | | notSokar quits [Ping timeout: 272 seconds] |
| 01:11:19 | <Lord_Nightmare> | how would i get a number of sites added to that list? |
| 01:11:47 | <Lord_Nightmare> | rebecca heineman just passed away and has a whole bunch of sites hosted on a vps, which already is having cloudflare/certificate problems |
| 01:12:04 | <Lord_Nightmare> | see backscroll from this morning |
| 01:12:29 | <Lord_Nightmare> | at the very least it includes burgerbecky.com contrabandgames.com deadreckoningstudios.com jaquays.com logicware.com myreflectioncomic.com oldeskuul.com sailorranko.com |
| 01:12:30 | <pabs> | if you have a list of domains, folks here can throw them into #archivebot |
| 01:13:02 | <Lord_Nightmare> | plus several social media accounts: https://burgerbecky.livejournal.com/ https://www.deviantart.com/burgerbecky https://bsky.app/profile/burgerbecky.bsky.social https://x.com/burgerbecky/ http://www.fanfiction.net/~burgerbecky and https://www.linkedin.com/in/burgerbecky |
| 01:13:02 | <eggdrop> | nitter: https://nitter.net/burgerbecky/ |
| 01:13:40 | <Lord_Nightmare> | i'm somewhat less worried about the social media and moreso about the sites she hosted herself |
| 01:13:55 | <pabs> | many subdomains on those sites? |
| 01:14:34 | <pabs> | looks like balrog did burgerbecky.com |
| 01:14:43 | <Lord_Nightmare> | dev.oldeskuul.com is bypassing the broken cloudflare gateway to oldeskuul.com but ends up reaching the site endpoint for myreflectioncomic.com |
| 01:14:57 | <Lord_Nightmare> | apparently oldeskuul.com can be accessed with some DNS hackery |
| 01:15:44 | <Lord_Nightmare> | according to justauser|m although i don't know how they did it |
| 01:16:09 | <pabs> | contrabandgames.com gone already (Cloudflare Invalid SSL certificate Error code 526) |
| 01:16:31 | <Lord_Nightmare> | i suspect the same DNS trick justauser|m used can get through that... somehow. |
| 01:16:43 | <pabs> | (but there was a 2022 AB already) |
| 01:16:59 | <Lord_Nightmare> | sailorranko.com has the ?same? bad SSL cert |
| 01:17:07 | <pabs> | started AB for https://deadreckoningstudios.com/ |
| 01:18:03 | <pabs> | started AB for https://jaquays.com/ |
| 01:18:16 | <Lord_Nightmare> | becky was bedridden for the last 2 months with aggressive cancer and i guess wasn't able to manually update the SSL certs, and/or whatever automation she used broke |
| 01:18:54 | <pabs> | logicware had a 2023 save, adding another one tho |
| 01:19:12 | <h2ibot> | JustAnotherArchivist edited Goo Blog (+348, Add exact shutdown time announcement): https://wiki.archiveteam.org/?diff=57828&oldid=57777 |
| 01:19:40 | <pabs> | balrog did https://www.myreflectioncomic.com/ |
| 01:20:44 | <pabs> | balrog did http://www.sailorranko.com/ |
| 01:21:19 | <pabs> | added the livejournal to https://pad.notkiska.pw/p/archivebot-livejournal |
| 01:22:14 | <Lord_Nightmare> | sadly, becky never got to finish this, as far as I know: https://www.dexerto.com/gaming/interplay-cofounder-reveals-source-code-for-fallout-1-2-thought-lost-still-exists-3189721/ |
| 01:22:17 | <h2ibot> | JustAnotherArchivist edited Goo Blog (+122, Shutdown time announcement date; URL template): https://wiki.archiveteam.org/?diff=57829&oldid=57828 |
| 01:22:49 | <pabs> | AB for https://www.deviantart.com/burgerbecky/about failed :/ |
| 01:22:52 | <Lord_Nightmare> | I'm hoping her daughter has digital access to her collection/PCs/etc and can finish that project |
| 01:24:18 | <h2ibot> | PaulWise edited DokuWiki (+194, mention indexer.php too): https://wiki.archiveteam.org/?diff=57830&oldid=57218 |
| 01:24:23 | <pabs> | bluesky is JSy |
| 01:25:49 | <pabs> | did SPN/archive.today/mnbot and AB !ao for it |
| 01:27:31 | <nicolas17> | so oldeskuul and contrabandgames are inaccessible, the rest is done or in progress? |
| 01:27:43 | <pabs> | twitter is JSy, did archive.today, mnbot and added to the pad https://pad.notkiska.pw/p/archivebot-twitter |
| 01:28:43 | <pabs> | http://www.fanfiction.net/~burgerbecky not easy to do, will create a list |
| 01:30:15 | <nicolas17> | can confirm DNS trickery works for the two inaccessible sites |
| 01:30:56 | <nicolas17> | curl -insecure --resolve contrabandgames.com:443:99.120.219.105 https://contrabandgames.com/ |
| 01:31:52 | <pabs> | JAA arkiver - can we put DNS trickery stuff in the WBM? |
| 01:32:25 | <nicolas17> | needs ignoring the bad cert too, but maybe our tooling does that already |
| 01:33:08 | <nicolas17> | pabs: did your AB jobs finish? I only see the livejournal in the dashboard |
| 01:33:43 | <pabs> | yeah already finished |
| 01:33:52 | <pabs> | I didn't start the lj one |
| 01:34:02 | <@JAA> | pabs: Generally no, but I can grab a copy at least. |
| 01:34:39 | <pabs> | nicolas17: /finished should have the recently completed stuff |
| 01:35:01 | <@JAA> | Oh, that's just a single page? |
| 01:35:29 | <nicolas17> | JAA: the broken sites due to expired cert are contrabandgames.com and oldeskuul.com |
| 01:35:47 | <nicolas17> | same IP works for both |
| 01:36:07 | <@JAA> | Ah, two sites. https://contrabandgames.com/ seems to just be a page with a few images. |
| 01:36:34 | <@JAA> | That's the origin IP, I assume? |
| 01:36:53 | <pabs> | oldeskuul.com has a bit more, /humans.txt for eg |
| 01:37:03 | <nicolas17> | I *suspect* it's the origin IP, I got it from dev.oldeskuul.com |
| 01:37:49 | <nicolas17> | I assume oldeskuul.com and dev.oldeskuul.com are hosted on the same server, with the former then using cloudflare in front and dev pointing directly at origin in DNS |
| 01:38:31 | <pabs> | https://www.fanfiction.net/~burgerbecky blocks curl with all AB UAs, and curl with a browser UA, but isn't TLS fingerprinting |
| 01:40:02 | <pabs> | (doing Mnbot, SPN, archive.today at least) |
| 01:40:26 | <@JAA> | Ah, there's a bit more on contrabandgames.com as well. |
| 01:45:01 | <pabs> | did Mnbot, SPN, archive.today for https://www.linkedin.com/in/burgerbecky |
| 01:46:33 | <pabs> | did Mnbot, SPN, archive.today for https://www.deviantart.com/burgerbecky |
| 01:49:48 | <@JAA> | grab-sited https://contrabandgames.com/ and https://oldeskuul.com/ with the DNS override. |
| 01:51:40 | <Lord_Nightmare> | I also missed a social media site: https://www.youtube.com/@RebeccaHeineman |
| 01:51:51 | <Lord_Nightmare> | I probably missed several others |
| 01:52:15 | <nicolas17> | oh I'll do youtube |
| 01:52:23 | <Lord_Nightmare> | I missed facebook.com/burgerbecky as well |
| 01:58:22 | <pabs> | added facebook to AB, Mnbot, SPN, archive.today |
| 01:59:14 | <pabs> | found https://www.instagram.com/burgerbecky/ - doing Mnbot/SPN/archive.today |
| 02:07:45 | | Guest58 quits [Client Quit] |
| 02:13:57 | | Sokar quits [Ping timeout: 272 seconds] |
| 02:15:18 | | Sokar joins |
| 02:16:54 | <nulldata> | Added https://macplay.oldeskuul.com/ to AB |
| 02:19:32 | <Stagnant_> | Could someone add this to AB? A 25 year old finnish forum https://www.dvdplaza.fi is shutting down "at the end of 2025". Announcement: https://www.dvdplaza.fi/threads/tiedote-dvdplaza-fi-p%C3%A4%C3%A4tt%C3%A4%C3%A4-toimintansa.97779/ |
| 02:20:09 | <nulldata> | Stagnant_ Yeah I can |
| 02:20:16 | <nulldata> | Thanks for the report! |
| 02:20:20 | <Stagnant_> | Thanks! |
| 02:23:58 | | Guest58 joins |
| 02:25:44 | <pokechu22> | arkiver: note that webtoons' sitemap double-escapes & to &amp; which leadds to broken URLs |
| 02:27:20 | <@JAA> | sitemaps are hard++ |
| 02:27:21 | <eggdrop> | [karma] 'sitemaps are hard' now has 7 karma! |
| 02:30:31 | | etnguyen03 quits [Client Quit] |
| 02:39:21 | | etnguyen03 (etnguyen03) joins |
| 02:42:27 | | Chris5010 quits [Ping timeout: 272 seconds] |
| 02:44:36 | <pabs> | cruller: perhaps we need something like this pipeline: #// or other DPoS to ingest data, process it for shutdown leads, file leads with discovery, discover subdomains/URLs, save front pages, detect software used, pass those on to AB/Mnbot/Wikibot/etc queues, then people click approve |
| 02:45:47 | <pabs> | (and insert some DPoS based human workers too, for captchas, looking at search results etc) |
| 02:46:36 | | Chris5010 (Chris5010) joins |
| 02:48:38 | <Lord_Nightmare> | another two social medias i missed for rebecca heineman: https://www.reddit.com/u/burgerbecky and https://www.twitch.tv/burgerbecky |
| 02:49:54 | <nicolas17> | no videos on twitch |
| 02:50:00 | | twiswist (twiswist) joins |
| 02:53:24 | <DigitalDragons> | i think a lot could be accomplished with just a google form equivalent on the wiki/tracker homepage for people to submit dying things to |
| 02:58:17 | | Guest58 quits [Ping timeout: 272 seconds] |
| 02:58:39 | | Guest58 joins |
| 03:18:36 | | Webuser288763 joins |
| 03:18:58 | | Webuser288763 quits [Client Quit] |
| 03:25:15 | | etnguyen03 quits [Client Quit] |
| 03:25:35 | <h2ibot> | Glaps edited List of websites excluded from the Wayback Machine (+27, Added saabnet.com; I wanted to access its…): https://wiki.archiveteam.org/?diff=57831&oldid=57805 |
| 03:25:36 | <h2ibot> | Brad edited In The Media (+2437, added DiVine link for TechLinked): https://wiki.archiveteam.org/?diff=57832&oldid=57090 |
| 03:25:37 | <h2ibot> | Brad edited Alive... OR ARE THEY (+242, Added Moerdijk): https://wiki.archiveteam.org/?diff=57833&oldid=57789 |
| 03:27:27 | | Guest58 quits [Client Quit] |
| 03:31:26 | | etnguyen03 (etnguyen03) joins |
| 03:40:29 | | PredatorIWD258 joins |
| 03:41:13 | | ymgve joins |
| 03:41:21 | | ymgve_ quits [Ping timeout: 272 seconds] |
| 03:41:59 | | PredatorIWD25 quits [Ping timeout: 272 seconds] |
| 03:41:59 | | PredatorIWD258 is now known as PredatorIWD25 |
| 03:49:21 | | notSokar joins |
| 03:49:33 | | HackMii quits [Remote host closed the connection] |
| 03:49:41 | | etnguyen03 quits [Client Quit] |
| 03:49:49 | | HackMii (hacktheplanet) joins |
| 03:50:13 | | Sokar quits [Ping timeout: 272 seconds] |
| 03:51:23 | | etnguyen03 (etnguyen03) joins |
| 03:57:23 | | ummmSokar joins |