00:00:51<u>What do you mean?
00:04:02<u>The thing I'm thinking of would use IPFS; it's the other stuff which is important too (code+data).
00:05:09FiTheArchiver quits [Read error: Connection reset by peer]
00:12:05cascode quits [Read error: Connection reset by peer]
00:13:49cascode joins
00:22:58TheTechRobo (TheTechRobo) joins
00:26:08lennier1 (lennier1) joins
00:39:01Webuser908585 joins
00:39:26Webuser908585 quits [Client Quit]
00:40:42ATinySpaceMarine joins
00:47:19ATinySpaceMarine quits [Client Quit]
00:47:59ATinySpaceMarine joins
00:48:16<u>Block 4 in the chain which is pretty boring right now: ipfs://bafybeib2vdle5dh2tafiaijkztcwto7augp6tykkwwpk7jrb73npd3f3we (latest). I made it work with some crappy Bash code. Next: make it work with an HTML HTML generator thing.
00:49:47Webuser818667 joins
00:49:54cascode quits [Ping timeout: 250 seconds]
00:50:36cascode joins
00:50:38<Webuser818667>I have collected approximately 130,000 blog URLs from So-net Blog (SS Blog), which is scheduled to shut down on March 31, 2025. Could you please register these URLs with ArchiveBot? SS Blog is a blog hosting service that has been in operation since 2004.
00:50:38<Webuser818667>Here is the blog URL list: https://transfer.archivete.am/W4clI/ssblog_urls.txt
00:50:39<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/W4clI/ssblog_urls.txt
00:59:10cascode quits [Read error: Connection reset by peer]
00:59:12cascode joins
00:59:36<pokechu22>I think we're doing a separate project for SS Blog instead of using archivebot... let me check the details
01:00:07<pokechu22>ah, I don't think we've created a separate channel for that project yet? At least https://wiki.archiveteam.org/index.php/SS_Blog doesn't mention it yet
01:00:11<pokechu22>arkiver: you might find that list useful
01:04:36<nstrom|m>#sos-blog for ss blog
01:05:00xkey quits [Quit: WeeChat 4.4.3]
01:05:14xkey (xkey) joins
01:06:20<Webuser818667>Does this mean that the archiving of SS Blog is already in progress?
01:06:36<h2ibot>Pokechu22 edited SS Blog (+8, #sos-blog): https://wiki.archiveteam.org/?diff=54921&oldid=53950
01:08:03<pokechu22>I don't think downloading anything has started yet, but work on it is going on in that channel
01:08:23<Webuser818667>Sorry, I didn't realize that a channel for the blog had already been set up. My apologies.
01:16:05<Webuser818667>pokechu22 thank you
01:19:02iram quits [Quit: Ping timeout (120 seconds)]
01:19:07iram joins
01:19:19graham9 joins
01:20:42<u>Did you know that the original episodes of "Jeopardy!" from the 1960/1970s have been lost? NBC destroyed them. So "episode 1" is the first episode in the 80s revived series. But episode #1 is actual episode #200? Don't know how many episodes were in the lost media series.
01:21:15<u>*1960s
01:31:33<u>NBC scum
01:57:36<nulldata>Wait till u learn about the BBC lol
01:59:50kansei quits [Quit: ZNC 1.9.1 - https://znc.in]
02:06:52<pabs>https://www.icann.org/en/announcements/details/icann-update-launching-rdap-sunsetting-whois-27-01-2025-en
02:07:01kansei (kansei) joins
02:14:33etnguyen03 quits [Client Quit]
02:22:19etnguyen03 (etnguyen03) joins
02:24:58raccoon quits [Ping timeout: 260 seconds]
02:26:58cascode quits [Ping timeout: 250 seconds]
02:27:42cascode joins
02:37:49etnguyen03 quits [Remote host closed the connection]
02:38:31gust quits [Read error: Connection reset by peer]
02:40:00cascode quits [Read error: Connection reset by peer]
02:40:10cascode joins
02:58:14sparky14928 (sparky1492) joins
03:01:38sparky1492 quits [Ping timeout: 250 seconds]
03:01:39sparky14928 is now known as sparky1492
03:06:31raccoon (raccoon) joins
03:06:51Mateon2 joins
03:08:08Mateon1 quits [Ping timeout: 250 seconds]
03:08:12Mateon2 is now known as Mateon1
03:13:29<u>For some reason I have like more than 7 torrents of Goblet Of Fire movie (which IIRC I disliked more than previous movie in the series): weird. (such as "Harry.Potter.and.The.Goblet.Of.Fire[2005]DvDrip[Eng]-aXXo")
03:13:59<u>pabs: looking at https://web.archive.org/web/20250127224222/https://www.icann.org/en/announcements/details/icann-update-launching-rdap-sunsetting-whois-27-01-2025-en - what's it about...
03:16:20<u>It says "As of 28 January 2025, the Registration Data Access Protocol (RDAP) will be the definitive source for delivering generic top-level domain name (gTLD) registration information in place of sunsetted WHOIS services."
03:16:33<u>So whois sites will be disappearing? IDK
04:52:55ericgallager joins
05:04:43atphoenix_ (atphoenix) joins
05:05:50DogsRNice quits [Read error: Connection reset by peer]
05:07:43atphoenix__ quits [Ping timeout: 260 seconds]
05:08:08atphoenix__ (atphoenix) joins
05:09:54atphoenix_ quits [Ping timeout: 250 seconds]
05:33:23atphoenix__ quits [Ping timeout: 260 seconds]
05:34:51atphoenix__ (atphoenix) joins
05:47:02tmg1|michelson leaves
05:55:59Island quits [Read error: Connection reset by peer]
07:10:58Jens quits []
07:11:31Jens (JensRex) joins
07:20:05ahm2587 quits [Quit: The Lounge - https://thelounge.chat]
07:20:20ahm2587 joins
07:47:48Webuser818667 quits [Quit: Ooops, wrong browser tab.]
07:57:30<steering>hmm, my warrior seems a bit borken, it's been running at 2.5 cores for... well presumably all day, I noticed the fans earlier too. also connection reset when i try to go to the web thingy
08:01:22<steering>https://transfer.archivete.am/inline/iDt7v/Screenshot%20from%202025-03-17%2002-00-51.png
08:02:34<steering>host disk seems fine but vm disk seems trashed
08:30:06Ketchup901 quits [Remote host closed the connection]
08:30:19Ketchup901 (Ketchup901) joins
08:31:00gamer191 joins
08:32:22<gamer191>I noticed that my Github warrior is endlessly looping this:
08:32:22<gamer191>```
08:32:22<gamer191>Requesting targets.
08:32:22<gamer191>Trying target rsync://at-rsync3.phirephly.design:8892/ateam-airsync/:downloader/.
08:32:22<gamer191>Trying target rsync://hel1.targets.rewby.archivete.am:8892/ateam-airsync/:downloader/.
08:32:22<gamer191>Picking target rsync://hel1.targets.rewby.archivete.am:8892/ateam-airsync/:downloader/.
08:32:22<gamer191>Starting RsyncUpload for Item web:complete:idaholab/Location-Generalizer
08:32:23<gamer191>@ERROR: max connections (-1) reached -- try again later
08:32:23<gamer191>rsync error: error starting client-server protocol (code 5) at main.c(1863) [sender=3.2.7]
08:32:24<gamer191>Process RsyncUpload returned exit code 5 for Item web:complete:idaholab/Location-Generalizer
08:32:24<gamer191>Failed RsyncUpload for Item web:complete:idaholab/Location-Generalizer
08:32:25<gamer191>Failed to upload, retrying...
08:33:10<gamer191>```
08:33:39<gamer191>Apologies, I didn't expect that to send a different message for every line
08:38:18cascode quits [Ping timeout: 260 seconds]
08:38:28cascode joins
08:44:22gamer191 quits [Client Quit]
08:48:40loug83181422 joins
09:15:45<@arkiver>pokechu22: thanks for the ping, indeed useful!
09:56:13ljcool2006 joins
09:57:39<ljcool2006>we have a month to archive twitch apparently
09:57:59<ljcool2006>related to the 100 hour cap
09:58:28<ljcool2006>oh wrong channel
10:02:00igloo22225 quits [Quit: The Lounge - https://thelounge.chat]
10:02:32igloo22225 (igloo22225) joins
10:03:28gamer191 joins
10:07:17gamer191 quits [Client Quit]
10:10:12pabs quits [Read error: Connection reset by peer]
10:10:55pabs (pabs) joins
10:17:30le0n quits [Quit: see you later, alligator]
10:29:08emphatic quits [Ping timeout: 260 seconds]
10:53:06ljcool2006 quits [Ping timeout: 250 seconds]
11:00:04Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
11:00:15<u>Felt like trying for bit of a challenge, just for fun I guess. How would you download this webpage from a CLI? Failed "yt-dlp --cookies-from-browser brave+gnomekeyring --write-pages --skip-download --extractor-args "generic:impersonate" https://rateyourmusic.com/release/album/the-residents/tweedles/ " -> "ERROR: [generic] Got HTTP Error 403 caused by Cloudflare anti-bot challenge; see
11:00:21<u>https://github.com/yt-dlp/yt-dlp#impersonation for how to install the required impersonation dependency"
11:00:35<u>rateyourmusic.com = horribly walled site
11:02:50Bleo18260072271962345 joins
11:08:06<u>So even with cookies and after completing the CF challenge, it still failed
11:10:00ericgallager quits [Quit: This computer has gone to sleep]
11:16:04nine quits [Ping timeout: 250 seconds]
11:26:31nine joins
11:26:31nine quits [Changing host]
11:26:31nine (nine) joins
11:31:43<u>I ran this command -- https://dweb.link/ipfs/bafkreiccgwhjfyel7ug2je4o4d6btjmcvfc6aqlexlugli3zpy6y65gw5i -- and it worked. It opens up Selenium/browser and you still gotta do the CF checkbox thing. Next ideas: how would I click the checkbox from a different computer? How would I keep the automated browser open but open and close tabs and get the pages that show up in that? How would I get Selenium to
11:31:49<u>download all page resources and not just the source code of the URL?
11:33:54FiTheArchiver joins
11:34:03SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
11:34:31SkilledAlpaca418962 joins
11:36:00BlueMaxima quits [Read error: Connection reset by peer]
11:38:35ericgallager joins
12:00:35tertu quits [Quit: so long...]
12:01:05tertu (tertu) joins
12:11:12terry joins
12:11:13NotGLaDOS quits [Ping timeout: 260 seconds]
12:31:59PredatorIWD25 joins
12:33:23cascode quits [Ping timeout: 260 seconds]
12:34:55cascode joins
12:59:23kansei quits [Quit: ZNC 1.9.1 - https://znc.in]
13:19:56gamer191 joins
13:24:00gamer191 leaves
13:28:20<u>It's like DRM in the web. Using Selenium with mitmdump = can't get past the https://challenges.cloudflare.com/ thing in https://rateyourmusic.com/release/album/residents/meet-the-residents
13:28:52<u>Code: sudo cp -n ~/.mitmproxy/mitmproxy-ca-cert.pem /usr/local/share/ca-certificates/mitmproxy-ca-cert.crt; time=$(TZ=UTC date -u +%Y%m%d%H%M%S); sudo mitmdump --listen-port 85 -w $time.mitm 1>$time.mitm1.txt 2>$time.mitm2.txt & disown; sleep 4; ps aux | grep mitm; ./selenium_.py # sudo update-ca-certificates
13:29:05loug831814224 joins
13:29:45loug83181422 quits [Read error: Connection reset by peer]
13:29:45loug831814224 is now known as loug83181422
13:30:34Wohlstand quits [Quit: Wohlstand]
13:34:54th3z0l4 joins
13:36:00<u>Here's the little python script: https://nftstorage.link/ipfs/bafkreifdi6oeetwzw443gnna6yf2yk2eccxwzb66lqrvxp7nefgcwjm2zy (don't name it "selenium.py" because that will result in an error). I can complete the CF checkbox multiple times but it still is stuck on the "Verifying you are human" page.
13:36:45<u>So I can use selenium to download webpage "raws" but so far nothing similar to WARC or WARCs exactly.
13:52:47graham9 quits [Quit: The Lounge - https://thelounge.chat]
14:17:13notarobot1 quits [Ping timeout: 260 seconds]
14:20:07ericgallager quits [Client Quit]
14:22:25Webuser488769 joins
14:23:07Webuser488769 quits [Client Quit]
14:27:51@imer quits [Quit: Ping timeout (120 seconds)]
14:28:15imer (imer) joins
14:28:16@ChanServ sets mode: +o imer
14:28:50@imer quits [Excess Flood]
14:29:42imer (imer) joins
14:29:42@ChanServ sets mode: +o imer
14:37:54SootBector quits [Remote host closed the connection]
14:38:16SootBector (SootBector) joins
14:39:44Sluggs quits [Ping timeout: 250 seconds]
14:45:37kansei (kansei) joins
14:48:34Sluggs joins
14:49:34graham9 joins
14:51:00magmaus3 quits [Remote host closed the connection]
14:52:53BornOn420 quits [Remote host closed the connection]
14:53:22magmaus3 (magmaus3) joins
14:53:58BornOn420 (BornOn420) joins
14:58:23graham9 quits [Client Quit]
15:22:51gust joins
15:24:22atphoenix__ quits [Ping timeout: 250 seconds]
15:26:38JayEmbee quits [Ping timeout: 260 seconds]
15:28:47atphoenix__ (atphoenix) joins
15:37:48atphoenix__ quits [Ping timeout: 250 seconds]
15:38:18ATinySpaceMarine quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
15:40:49wyatt8750 joins
15:41:13wyatt8740 quits [Ping timeout: 260 seconds]
15:44:16ATinySpaceMarine joins
15:54:42wyatt8750 quits [Ping timeout: 250 seconds]
15:57:42sparky14929 (sparky1492) joins
15:59:06wyatt8740 joins
16:01:38sparky1492 quits [Ping timeout: 260 seconds]
16:01:39sparky14929 is now known as sparky1492
16:52:35sparky14927 (sparky1492) joins
16:54:55sparky149277 (sparky1492) joins
16:56:28sparky1492 quits [Ping timeout: 260 seconds]
16:56:29sparky149277 is now known as sparky1492
16:58:48sparky14927 quits [Ping timeout: 260 seconds]
17:07:49graham9 joins
17:13:03riteo (riteo) joins
17:19:20ljcool2006 joins
17:26:50atphoenix__ (atphoenix) joins
17:30:48atphoenix_ (atphoenix) joins
17:34:22atphoenix__ quits [Ping timeout: 250 seconds]
17:41:07nyakase quits [Remote host closed the connection]
17:43:10nyakase (nyakase) joins
17:43:21nyakase quits [Remote host closed the connection]
17:43:51nyakase (nyakase) joins
17:43:56Island joins
17:44:02nyakase quits [Read error: Connection reset by peer]
17:46:38wyatt8740 quits [Ping timeout: 260 seconds]
17:48:23wyatt8740 joins
17:51:06devkev quits [Quit: The Lounge - https://thelounge.chat]
17:52:39devkev (devkev) joins
17:56:34nyakase (nyakase) joins
17:56:48nyakase quits [Remote host closed the connection]
17:57:20nyakase (nyakase) joins
17:57:37nyakase quits [Read error: Connection reset by peer]
17:58:13nyakase (nyakase) joins
18:04:05yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/]
18:07:18yano (yano) joins
18:14:01driib9 quits [Quit: The Lounge - https://thelounge.chat]
18:14:35JayEmbee (JayEmbee) joins
18:15:29<h2ibot>VoynichCr edited Citizendium (+0, /* Dumps */ The latest complete dump was…): https://wiki.archiveteam.org/?diff=54922&oldid=45635
18:16:34driib9 (driib) joins
18:17:29<h2ibot>VoynichCr edited Citizendium (+45, …): https://wiki.archiveteam.org/?diff=54923&oldid=54922
18:19:30<h2ibot>VoynichCr created Template:Gallery (+104, Created page with "<gallery>…): https://wiki.archiveteam.org/?title=Template%3AGallery
18:20:30<h2ibot>VoynichCr edited Template:Gallery (+390): https://wiki.archiveteam.org/?diff=54925&oldid=54924
18:21:30<h2ibot>VoynichCr edited Template:Gallery (+351): https://wiki.archiveteam.org/?diff=54926&oldid=54925
18:21:31<h2ibot>VoynichCr edited Template:Gallery (+234): https://wiki.archiveteam.org/?diff=54927&oldid=54926
18:27:31<h2ibot>VoynichCr edited Template:Gallery (-2, bypassing <gallery> tag to allow parserfunctions): https://wiki.archiveteam.org/?diff=54928&oldid=54927
18:27:32<h2ibot>VoynichCr edited Template:Gallery (+104): https://wiki.archiveteam.org/?diff=54929&oldid=54928
18:30:31<h2ibot>VoynichCr uploaded File:Citizendium in 2025.png: https://wiki.archiveteam.org/?title=File%3ACitizendium%20in%202025.png
18:31:32<h2ibot>VoynichCr edited Citizendium (+129): https://wiki.archiveteam.org/?diff=54931&oldid=54923
18:31:33<h2ibot>VoynichCr created Category:Archived in 2012 (+29, Created page with "{{Category archives by year}}"): https://wiki.archiveteam.org/?title=Category%3AArchived%20in%202012
18:32:32<h2ibot>VoynichCr created Category:Archived in 2011 (+29, Created page with "{{Category archives by year}}"): https://wiki.archiveteam.org/?title=Category%3AArchived%20in%202011
18:32:33<h2ibot>VoynichCr edited Template:Category archives by year (+86, 2011, 2012): https://wiki.archiveteam.org/?diff=54934&oldid=54886
18:35:33<h2ibot>VoynichCr edited Template:Gallery (+97): https://wiki.archiveteam.org/?diff=54935&oldid=54929
18:36:32<h2ibot>VoynichCr edited Wikispaces (+11, was): https://wiki.archiveteam.org/?diff=54936&oldid=47356
18:41:33<h2ibot>VoynichCr uploaded File:ScribbleWiki.jpg (…): https://wiki.archiveteam.org/?title=File%3AScribbleWiki.jpg
18:42:33<h2ibot>VoynichCr edited ScribbleWiki (+16, logo): https://wiki.archiveteam.org/?diff=54938&oldid=42137
18:43:43LunarianBunny1147 quits [Quit: The Lounge - https://thelounge.chat]
18:45:34<h2ibot>VoynichCr uploaded File:ScribbleWiki stickers.jpg (Source:…): https://wiki.archiveteam.org/?title=File%3AScribbleWiki%20stickers.jpg
18:45:35<h2ibot>VoynichCr edited ScribbleWiki (+25, image): https://wiki.archiveteam.org/?diff=54940&oldid=54938
18:47:34<h2ibot>Pokechu22 edited Mailing Lists (+30, /* Software */ https://listes.existrans.org/ Sympa): https://wiki.archiveteam.org/?diff=54941&oldid=54844
18:48:34<h2ibot>VoynichCr edited ScribbleWiki (+223, 14,000 wikis?): https://wiki.archiveteam.org/?diff=54942&oldid=54940
18:51:52VoynichCR (VoynichCR) joins
19:01:56Megame (Megame) joins
19:02:46katocala quits [Ping timeout: 250 seconds]
19:02:59katocala joins
19:06:52ericgallager joins
19:10:03rappet quits [Ping timeout: 260 seconds]
19:15:20sparky1492 quits [Ping timeout: 250 seconds]
19:29:53riteo quits [Ping timeout: 260 seconds]
19:35:42katocala quits [Ping timeout: 250 seconds]
19:35:55katocala joins
19:48:49<h2ibot>VoynichCr edited Miraheze (-9, 14,000 wikis): https://wiki.archiveteam.org/?diff=54943&oldid=53836
19:55:41sec^nd quits [Remote host closed the connection]
19:55:57sparky1492 (sparky1492) joins
19:55:58sec^nd (second) joins
19:56:51VoynichCR quits [Client Quit]
20:09:04graham9 quits [Quit: The Lounge - https://thelounge.chat]
20:33:58<h2ibot>VoynichCr edited Referata (+1, was): https://wiki.archiveteam.org/?diff=54944&oldid=27558
20:34:58<h2ibot>VoynichCr edited Referata (-19): https://wiki.archiveteam.org/?diff=54945&oldid=54944
20:34:59<h2ibot>VoynichCr edited Referata (+9): https://wiki.archiveteam.org/?diff=54946&oldid=54945
20:39:32BornOn420 quits [Ping timeout: 276 seconds]
20:43:35BornOn420 (BornOn420) joins
20:54:40kuroger quits [Quit: ZNC 1.9.1 - https://znc.in]
20:58:08kuroger (kuroger) joins
21:00:52BlueMaxima joins
21:31:36iram quits [Quit: Ping timeout (120 seconds)]
21:31:42iram joins
21:37:38cascode quits [Ping timeout: 260 seconds]
21:37:51cascode joins
21:43:08cascode quits [Read error: Connection reset by peer]
21:43:21cascode joins
21:45:20Webuser161207 joins
21:48:10sparky14929 (sparky1492) joins
21:51:20sparky1492 quits [Ping timeout: 250 seconds]
21:51:21sparky14929 is now known as sparky1492
21:56:27NeonGlitch (NeonGlitch) joins
21:59:21emphatic joins
22:01:18lunik1 quits [Quit: :x]
22:01:49lunik1 joins
22:10:30loug83181422 quits [Quit: The Lounge - https://thelounge.chat]
22:13:33FiTheArchiver1 joins
22:16:02riteo (riteo) joins
22:17:20FiTheArchiver quits [Ping timeout: 250 seconds]
22:18:12FiTheArchiver1 quits [Ping timeout: 250 seconds]
22:19:03ThreeHM quits [Ping timeout: 260 seconds]
22:20:47ThreeHM (ThreeHeadedMonkey) joins
22:27:00etnguyen03 (etnguyen03) joins
22:40:15graham9 joins
22:49:15NeonGlitch quits [Client Quit]
22:50:47kuroger quits [Client Quit]
22:52:47kuroger (kuroger) joins
23:05:49<ljcool2006>are tucows downloads from 2005 onwards archived
23:07:46<ljcool2006>also what happened to the irc logs
23:09:23<nicolas17>which version of irc logs
23:09:51<ljcool2006>irclogs.archivete.am
23:10:11<nicolas17>afaik, logs still being saved, but temporarily unavailable on the web
23:19:10graham9 quits [Client Quit]
23:21:33etnguyen03 quits [Client Quit]
23:35:35<u>Guess I'm giving up on using Selenium+Brave with mitmdump to bypass title="Just a moment..." webpages. Python script: https://ar.4everland.io/1YSJOiqr11Vewtz1_lfjdl5fuK6yMwDxc9EHvAV38QI (latest).
23:42:37<u>Learned or was reminded of the following. 1. PORTS. mitmdump at its default port of 8080 works better than making its port 85 (have to run as sudo for that). Port 8080 = don't have to run it as sudo. 2. CANONICAL CERT. Go to http://mitm.it to get the cert and copy it into /usr/local/share/ca-certificates/ (then run sudo update-ca-certificates). 3. CERT IN BROWSER: makes https trusted when mitm'd. Also
23:42:44<u>have to put the certificate into the browser. Go to brave://settings/certificates > Authorities tab > import cert > trust it with everything. 4. CODE. "options.accept_untrusted_certs = True" seems to do nothing, so use "options.add_argument('--ignore-certificate-errors')" in addition or instead: it does do something and it does work.
23:43:36<u>So even after having the browser and the system trust the mitmdump cert, Cloudfarle still managed to disallow me from getting the webpage!
23:46:18nine quits [Quit: See ya!]
23:46:20kuroger quits [Client Quit]
23:46:31nine joins
23:46:31nine quits [Changing host]
23:46:31nine (nine) joins
23:48:46kuroger (kuroger) joins