00:03:57 | <nicolas17> | ugh I wish I could just rsync the NCBI data server to a local disk |
00:05:51 | <nicolas17> | but I don't have anywhere near enough local storage, and also the server feels like it's running on a highly contended magnetic disk |
00:06:01 | <nicolas17> | high latency for metadata ops |
00:06:40 | <@JAA> | nicolas17: → #UncleSamsArchive ? |
00:07:51 | <nicolas17> | my current level of "fucking around with unclear goal" is almost worth -ot, but sure |
00:07:55 | <nicolas17> | :p |
00:20:21 | <nicolas17> | JAA: what do you think is the best approach for https://wiki.archiveteam.org/index.php/Foro_3DJuegos ? would we need DPoS? |
00:21:04 | <@JAA> | nicolas17: I haven't looked at it yet as other things are more urgent. |
00:22:18 | <nicolas17> | I haven't looked at the size yet, maybe a single-computer wget-lua can do it, but I have no experience writing such lua scripts |
00:23:46 | <Vokun> | we've got about 32 days left on it. Probably enough time, hopefully, if unclesam doesn't take 100% of dev time for the 3 weeks |
00:23:47 | <@JAA> | Yeah, or qwarc. The IDs seem sequential enough, but I'm guessing they're using post IDs for topics, too, since 54M topics seems a bit too large. |
00:24:03 | <@JAA> | And it seems you need to know which forum the post is in, too. |
00:24:06 | <pabs> | c3manu: I have a browser console based scraper for the API that the securitytrails.com frontend uses, sec |
00:24:42 | <nicolas17> | oh, I didn't realize if you go to the ID alone it redirects to the correct slug |
00:25:04 | <nicolas17> | that makes sequential scraping much easier |
00:25:07 | <@JAA> | Ah no, /foros/temas/$ID/0/ does work, actually. |
00:25:17 | <@JAA> | /foros/tema/$ID/0/ * |
00:25:49 | <nicolas17> | yeah I just thought you needed the correct slug too, and then you'd have no option but crawling links |
00:25:50 | <pabs> | c3manu: you have to adjust the domain at the top and the API URL further down (it changes): https://transfer.archivete.am/5f2XC/securitytrails-domain-scraper.js |
00:25:51 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/5f2XC/securitytrails-domain-scraper.js |
00:25:56 | | etnguyen03 quits [Client Quit] |
00:26:21 | <pabs> | c3manu: I only use it when there is more than one page of results though :) |
00:26:25 | <@JAA> | Yeah, I thought you needed the forum slug (foros or community-foros or maybe others), but I made a typo. |
00:26:36 | <@JAA> | Which would be less bad but still annoying. |
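A minimal sketch of the sequential enumeration discussed above. It assumes the forum is served from https://www.3djuegos.com (the log gives only the path pattern, not the host). `topic_url` and `topic_urls` are hypothetical helper names. The trailing `0` is copied from the pattern JAA posted; the log does not say what it means.

```python
from typing import Iterator

# Assumption: the host is not stated in the log; only the path pattern is.
BASE_URL = "https://www.3djuegos.com"


def topic_url(topic_id: int, base_url: str = BASE_URL) -> str:
    """Build the slug-less topic URL, following the /foros/tema/$ID/0/ pattern."""
    return f"{base_url}/foros/tema/{topic_id}/0/"


def topic_urls(start: int, stop: int, base_url: str = BASE_URL) -> Iterator[str]:
    """Yield topic URLs for IDs in the half-open range [start, stop)."""
    for topic_id in range(start, stop):
        yield topic_url(topic_id, base_url)


if __name__ == "__main__":
    # Print a handful of candidate URLs; a real crawl would feed these
    # to a fetcher and follow the redirect to the full slug.
    for url in topic_urls(1, 4):
        print(url)
```

Because the IDs may be shared between posts and topics, many of the generated URLs would likely not resolve to a topic, so a real run would need to tolerate misses.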
00:27:00 | <Webuser849303> | Thanks JAA, got the docker working with your help |
00:27:15 | <nicolas17> | throwing 54M URLs into archivebot is probably a bad idea (: |
00:27:51 | | drfish joins |
00:28:18 | <@JAA> | nicolas17: Also wouldn't work properly for pagination URLs. |
00:28:19 | <Vokun> | there've been bigger projects |
00:28:46 | <@JAA> | Anyway, I'll take a closer look in a couple days, it's just not top priority for me right now. |
00:28:52 | <nicolas17> | yeah |
00:29:05 | <Vokun> | if only copy pasting another #y project worked |
00:29:11 | <@JAA> | If you want to poke around, it'd be good to know whether there's any content that isn't in topics. |
00:31:16 | <h2ibot> | PaulWise edited Finding subdomains (+184, add securitytrails-domain-scraper.js): https://wiki.archiveteam.org/?diff=54352&oldid=53857 |
00:32:30 | | drfish quits [Client Quit] |
00:50:24 | | Dango360_ (Dango360) joins |
00:53:33 | | _Dango360 quits [Ping timeout: 260 seconds] |
00:58:18 | | sec^nd quits [Remote host closed the connection] |
00:58:34 | | spirit quits [Ping timeout: 250 seconds] |
00:58:45 | | sec^nd (second) joins |
01:11:30 | | spirit joins |
01:18:30 | | cascode quits [Ping timeout: 250 seconds] |
01:19:05 | | cascode joins |
01:28:33 | | cascode quits [Ping timeout: 260 seconds] |
01:28:57 | | cascode joins |
01:32:23 | | gust joins |
01:39:22 | | Naruyoko joins |
01:41:39 | | etnguyen03 (etnguyen03) joins |
01:41:58 | | Naruyoko5 quits [Ping timeout: 260 seconds] |
01:45:29 | | notarobot19 joins |
01:49:33 | | notarobot1 quits [Ping timeout: 260 seconds] |
01:49:34 | | notarobot19 is now known as notarobot1 |
01:49:42 | | cascode quits [Read error: Connection reset by peer] |
01:50:01 | | cascode joins |
01:54:48 | | cascode quits [Ping timeout: 260 seconds] |
01:57:43 | | cascode joins |
01:59:03 | | monoxane (monoxane) joins |
02:05:45 | | cascode quits [Read error: Connection reset by peer] |
02:05:53 | | cascode joins |
02:13:21 | | etnguyen03 quits [Client Quit] |
02:20:58 | | ramsey (ramsey) joins |
02:24:43 | | ramsey quits [Changing host] |
02:24:43 | | ramsey (ramsey) joins |
02:27:26 | | etnguyen03 (etnguyen03) joins |
02:27:28 | | Barto quits [Ping timeout: 260 seconds] |
02:31:54 | | Wohlstand (Wohlstand) joins |
02:36:16 | | gust quits [Read error: Connection reset by peer] |
02:39:57 | | BornOn420 quits [Remote host closed the connection] |
02:40:35 | | BornOn420 (BornOn420) joins |
02:41:24 | | Wohlstand quits [Client Quit] |
02:45:01 | <h2ibot> | Nulldata uploaded File:Dailymotion logo 2023.png: https://wiki.archiveteam.org/?title=File%3ADailymotion%20logo%202023.png |
02:56:03 | <h2ibot> | Nulldata edited Dailymotion (+898, Added project and information regarding…): https://wiki.archiveteam.org/?diff=54356&oldid=49222 |
02:59:05 | <h2ibot> | Nulldata edited Current Projects (+86, Added Dailymotion to upcoming projects): https://wiki.archiveteam.org/?diff=54357&oldid=54315 |
02:59:32 | | etnguyen03 quits [Client Quit] |
03:18:59 | | HP_Archivist (HP_Archivist) joins |
03:19:08 | | etnguyen03 (etnguyen03) joins |
03:19:22 | <pabs> | https://sublimemusic.app/ is EOL |
03:19:30 | <pabs> | http://sumnerevans.com/posts/projects/sublime-music-eom/ |
03:19:47 | | Retroity joins |
03:22:13 | <Retroity> | Hi all, I'm currently running the warrior on my Windows PC using Docker. I'm currently running the US Government project. The CheckIP step keeps failing, with the error noting "AssertionError: Invalid return code 4 on http://legacy-api.arpa.li/now." Not sure what the issue is. Any help would be appreciated. Thanks! |
03:22:51 | | Wohlstand (Wohlstand) joins |
03:24:03 | | scurvy_duck quits [Ping timeout: 260 seconds] |
03:25:13 | <TheTechRobo> | Retroity: Are you using a VPN? |
03:26:04 | <Retroity> | No. I'm on my university's network but that's the only thing I can think of that might be causing issues |
03:32:17 | <that_lurker> | can you open that url on the computer? |
03:33:14 | <Retroity> | Yes, opening it on my web browser displays "1738812779.243" |
03:47:25 | | HP_Archivist quits [Client Quit] |
03:49:41 | | icedice quits [Quit: Leaving] |
03:53:19 | <h2ibot> | PaulWise edited Dailymotion (+134, add dailymotion-dl): https://wiki.archiveteam.org/?diff=54358&oldid=54356 |
03:53:20 | <h2ibot> | PaulWise edited Dailymotion (+0, format typo): https://wiki.archiveteam.org/?diff=54359&oldid=54358 |
03:55:38 | | etnguyen03 quits [Remote host closed the connection] |
03:58:00 | | Retroity quits [Client Quit] |
03:58:03 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
03:58:20 | <h2ibot> | PaulWise edited Mailing Lists (+76, groups.io supports custom domains :(): https://wiki.archiveteam.org/?diff=54360&oldid=54252 |
03:58:41 | | @imer quits [Quit: Oh no] |
03:58:56 | | Craigle (Craigle) joins |
03:59:12 | | imer (imer) joins |
03:59:13 | | @ChanServ sets mode: +o imer |
04:00:21 | <h2ibot> | PaulWise edited Mailing Lists (+165, fix up message-id text, add mapping/search idea): https://wiki.archiveteam.org/?diff=54361&oldid=54360 |
04:00:44 | | Webuser809883 joins |
04:01:12 | | Webuser809883 quits [Client Quit] |
04:05:11 | | Craigle quits [Client Quit] |
04:06:18 | | Craigle (Craigle) joins |
04:11:19 | | Wohlstand quits [Client Quit] |
04:13:03 | | Krownest (Krownest) joins |
04:13:28 | <TheTechRobo> | Retroity: Your university is likely blocking custom DNS, unfortunately. We use Quad9 in our projects. |
04:13:33 | <TheTechRobo> | RIP |
04:15:06 | <@JAA> | Perhaps we should do the DNS check before the time check so it's more obvious what's wrong. |
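A rough sketch of the ordering JAA suggests: run the DNS check before the time check and report whichever fails first, so a blocked resolver is not misreported as a clock problem. The check callables are hypothetical stand-ins, not the warrior's actual code. The timestamp format is taken from the value Retroity saw in the browser.

```python
from typing import Callable, List, Optional, Tuple

# A named check: (label, zero-argument callable returning True on success).
Check = Tuple[str, Callable[[], bool]]


def first_failure(checks: List[Check]) -> Optional[str]:
    """Run checks in order; return the name of the first one that fails or raises."""
    for name, check in checks:
        try:
            ok = check()
        except Exception:
            ok = False
        if not ok:
            return name
    return None


def clock_skew(server_text: str, local_time: float) -> float:
    """Seconds by which the local clock differs from the server's epoch timestamp."""
    return local_time - float(server_text.strip())


if __name__ == "__main__":
    checks = [
        ("dns", lambda: False),  # hypothetical: custom DNS is blocked on this network
        ("time", lambda: True),
    ]
    print(first_failure(checks))
```

With DNS listed first, a network that blocks custom resolvers would fail on the `dns` step instead of surfacing as an error on the time URL.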
04:17:30 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:06:22 | | ahm258 joins |
05:09:50 | | scurvy_duck joins |
05:22:12 | | HP_Archivist (HP_Archivist) joins |
05:22:12 | | TerritoryJazz quits [Quit: Ooops, wrong browser tab.] |
05:27:30 | | Webuser741750 joins |
05:35:07 | | Webuser741750 quits [Client Quit] |
05:35:26 | | abahbob joins |
06:18:01 | | ArchivalEfforts joins |
06:22:51 | | fuzzy8021 (fuzzy80211) joins |
06:23:42 | | fuzzy80211 quits [Read error: Connection reset by peer] |
06:44:44 | | spirit quits [Quit: Leaving] |
07:12:06 | | ahm258 quits [Ping timeout: 250 seconds] |
07:18:10 | | monika quits [Ping timeout: 250 seconds] |
07:19:11 | <c3manu> | pabs: i was looking to get a subdomain list for gatech.edu (since c99 and subdomain.center both had obvious ones missing). securitytrails just said "10,000+" or sth ^^" |
07:19:17 | <c3manu> | so yeah, more than one page |
07:19:46 | <pabs> | hmm, seems like too many domains? |
07:20:02 | <c3manu> | no idea how i would use that js file you sent me |
07:20:08 | <c3manu> | too many for what? |
07:20:16 | <pabs> | too many to be realistic |
07:20:47 | <pabs> | basically, open your browser console, paste the code in there and run it. you will get a urls.txt file saved with the URLs |
07:20:49 | <c3manu> | it’s a university. i would of course expect some junk in it, but also an awful lot of valid ones |
07:20:56 | | monika (boom) joins |
07:20:59 | <pabs> | hmm |
07:22:03 | | LunarianBunny11474 (LunarianBunny1147) joins |
07:23:48 | | LunarianBunny1147 quits [Ping timeout: 250 seconds] |
07:23:48 | | LunarianBunny11474 is now known as LunarianBunny1147 |
07:26:54 | <c3manu> | pabs: might be a little late now anyways, some search results for "DEI" google has indexed return 404s already. i found hits on other subdomains, but going through them one by one is tedious, at best |
07:27:06 | <c3manu> | that was the reason for grabbing those: https://cyberplace.social/@GossiTheDog/113921481331311737/ |
07:27:32 | <pabs> | ah yeah |
07:31:23 | | abirkill quits [Ping timeout: 260 seconds] |
07:33:33 | | Webuser849303 quits [Quit: Ooops, wrong browser tab.] |
07:36:36 | | abirkill (abirkill) joins |
07:40:44 | | ahm258 joins |
08:18:03 | | scurvy_duck quits [Ping timeout: 260 seconds] |
08:40:53 | | Webuser108513 joins |
08:41:39 | | Webuser108513 quits [Client Quit] |
09:03:36 | | myself quits [Read error: Connection reset by peer] |
09:03:58 | | myself (myself) joins |
09:41:25 | <@JAA> | So Equinix Metal, we should maybe start a list of projects that are in danger. Not all of them may be able to find adequate hosting that allows them to transfer all existing data. |
09:46:32 | <@JAA> | Freedesktop, Alpine Linux, and WireGuard are the ones I'm aware of so far. |
09:51:11 | <monoxane> | JAA is something going on with Equinix Metal? |
09:51:41 | <monoxane> | oh fuck I see it ignore me |
09:51:44 | <monoxane> | thats a big rip |
09:52:31 | <@JAA> | Yeah, https://deploy.equinix.com/blog/sunsetting-equinix-metal/ |
09:53:04 | <monoxane> | particularly unfortunate for us because the 100tb storage box for $500/mo would make a wonderful target 😆 |
09:53:05 | <@JAA> | And those projects who were given free hosting resources are getting kicked within a couple months. |
10:00:08 | | Hackerpcs quits [Ping timeout: 260 seconds] |
10:03:40 | | icedice (icedice) joins |
10:17:45 | | Hackerpcs (Hackerpcs) joins |
10:21:43 | | threedeeitguy (threedeeitguy) joins |
10:50:53 | | Church quits [Ping timeout: 260 seconds] |
11:05:54 | | Church (Church) joins |
11:19:06 | | abahbob quits [Quit: Ooops, wrong browser tab.] |
11:22:06 | | @arkiver quits [Remote host closed the connection] |
11:23:18 | | arkiver (arkiver) joins |
11:23:18 | | @ChanServ sets mode: +o arkiver |
11:41:41 | | Webuser048517 joins |
11:42:15 | | Webuser048517 quits [Client Quit] |
11:46:10 | | loug8318142 joins |
11:46:35 | | yasomi quits [Read error: Connection reset by peer] |
11:49:00 | | yasomi (yasomi) joins |
11:55:04 | | cascode quits [Ping timeout: 250 seconds] |
11:55:17 | | cascode joins |
12:00:02 | | Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat] |
12:02:53 | | Bleo18260072271962345 joins |
12:05:03 | | mls (mls) joins |
12:10:01 | | @arkiver quits [Remote host closed the connection] |
12:10:58 | | arkiver (arkiver) joins |
12:10:58 | | @ChanServ sets mode: +o arkiver |
12:18:55 | | moth_ quits [Remote host closed the connection] |
12:27:28 | | Freiner quits [Quit: Ooops, wrong browser tab.] |
12:35:32 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
12:36:03 | | SkilledAlpaca418962 joins |
12:55:25 | | nomead joins |
13:06:26 | | Naruyoko5 joins |
13:10:02 | | Naruyoko quits [Ping timeout: 250 seconds] |
13:13:09 | | pixel (pixel) joins |
13:31:09 | | Barto (Barto) joins |
13:44:15 | | tmob joins |
13:59:14 | | pixel leaves |
14:18:53 | <h2ibot> | Imer edited Deathwatch (+170, /* 2025 */ add splits.io): https://wiki.archiveteam.org/?diff=54362&oldid=54349 |
14:21:54 | <h2ibot> | Imer edited Deathwatch (+327, /* Frozen Solid */ add evga forums): https://wiki.archiveteam.org/?diff=54363&oldid=54362 |
14:56:46 | <Hans5958> | What is the "web server" on the Docker containers? Are there any benefits on turning them on for monitoring purposes? |
15:06:11 | | tmob quits [Read error: Connection reset by peer] |
15:07:23 | | tmob joins |
15:17:33 | <Hans5958> | Btw Livestream is brewing (minus videos) |
15:18:50 | <@arkiver> | oh yeah |
15:21:32 | | caylin quits [Read error: Connection reset by peer] |
15:21:49 | | caylin (caylin) joins |
15:24:19 | | scurvy_duck joins |
15:28:53 | | Wohlstand (Wohlstand) joins |
15:54:18 | | Webuser182716 joins |
15:54:24 | | Webuser182716 quits [Client Quit] |
15:55:06 | | Naruyoko joins |
15:58:18 | | Naruyoko5 quits [Ping timeout: 260 seconds] |
15:58:53 | | Wohlstand quits [Remote host closed the connection] |
15:59:15 | | Wohlstand (Wohlstand) joins |
16:01:23 | | ahm258 quits [Quit: The Lounge - https://thelounge.chat] |
16:03:31 | | VickoSaviour joins |
16:05:21 | | ahm258 joins |
16:06:26 | | Wohlstand quits [Client Quit] |
16:07:55 | <VickoSaviour> | I just randomly saw this while browsing on answers.microsoft.com |
16:07:55 | <VickoSaviour> | "Windows Client Forum Moving to Microsoft Q&A |
16:07:55 | <VickoSaviour> | We are excited to announce that soon, the Windows Client for IT Pros forum will be available exclusively in the Microsoft Q&A.. This change will help us provide a more streamlined and efficient experience for all your questions and discussions... |
16:07:55 | <VickoSaviour> | Starting February 21, you will no longer be able to create new questions here in the Microsoft Support Community. However, you can continue to participate in ongoing discussions and create new questions in the Microsoft Q&A." |
16:07:55 | <VickoSaviour> | This includes the Windows Server forum, and i'm afraid that maybe the rest of the forums may be included and inaccessible in the near future. It's up to you if you would archive Microsoft Support Community. |
16:08:17 | <@arkiver> | whenever it goes "we are excited [...]", it's bad news |
16:26:48 | | breadbrix (breadbrix) joins |
17:19:40 | | lennier2_ joins |
17:22:53 | | lennier2 quits [Ping timeout: 260 seconds] |
17:30:43 | | SootBector quits [Remote host closed the connection] |
17:31:05 | | SootBector (SootBector) joins |
17:41:38 | | lennier2 joins |
17:44:20 | | lennier2_ quits [Ping timeout: 250 seconds] |
18:10:50 | | lennier2_ joins |
18:11:18 | | scurvy_duck quits [Ping timeout: 260 seconds] |
18:13:22 | | lennier2 quits [Ping timeout: 250 seconds] |
18:13:30 | <that_lurker> | ohh microsoft changing stuff. That means millions of links that will take you Dictionary |
18:13:41 | <that_lurker> | s/Dictionary/nowhere |
18:18:54 | | that_lurker quits [Remote host closed the connection] |
18:18:59 | | that_lurker (that_lurker) joins |
18:39:32 | | scurvy_duck joins |
18:45:00 | | benjins3 quits [Ping timeout: 250 seconds] |
18:49:13 | | NatTheCat quits [Ping timeout: 260 seconds] |
18:56:13 | | HP_Archivist quits [Ping timeout: 260 seconds] |
18:57:08 | | Sluggs quits [Ping timeout: 250 seconds] |
19:01:26 | | NatTheCat joins |
19:04:36 | | Sluggs joins |
19:09:18 | | VickoSaviour quits [Quit: Ooops, wrong browser tab.] |
19:14:18 | | wyatt8740 quits [Ping timeout: 260 seconds] |
19:15:18 | | wyatt8740 joins |
19:23:55 | | scurvy_duck quits [Client Quit] |
19:42:20 | | khaoohs quits [Read error: Connection reset by peer] |
19:45:34 | | lennier2 joins |
19:48:43 | | lennier2_ quits [Ping timeout: 260 seconds] |
19:54:14 | | Webuser544416 joins |
19:55:39 | | khaoohs joins |
20:01:16 | | loug8318142 quits [Ping timeout: 250 seconds] |
20:04:00 | <nicolas17> | brickshelf reached 1TB |
20:04:10 | | loug8318142 joins |
20:06:17 | | loug8318142 quits [Read error: Connection reset by peer] |
20:06:33 | | loug8318142 joins |
20:07:34 | | HP_Archivist (HP_Archivist) joins |
20:17:55 | | AlsoHP_Archivist joins |
20:21:38 | | HP_Archivist quits [Ping timeout: 250 seconds] |
20:24:40 | | loug8318142 quits [Ping timeout: 250 seconds] |
20:32:50 | | Webuser544416 quits [Client Quit] |
20:37:28 | <@JAA> | answers.microsoft.com is fun. It has an OAuth flow even for anonymous access. |
20:44:30 | <h2ibot> | JustAnotherArchivist edited ArchiveTeam Warrior (+231, /* Can I use whatever internet access for the…): https://wiki.archiveteam.org/?diff=54364&oldid=53514 |
20:45:37 | | DogsRNice joins |
20:57:48 | | loug8318142 joins |
20:59:58 | | SootBector quits [Remote host closed the connection] |
21:15:49 | | Webuser312814 joins |
21:16:54 | | Webuser312814 quits [Client Quit] |
21:21:24 | | hyenatown joins |
21:26:29 | | lennier2_ joins |
21:29:14 | | lennier2 quits [Ping timeout: 250 seconds] |
21:46:33 | | tmob quits [Ping timeout: 260 seconds] |
21:53:18 | | AlsoHP_Archivist quits [Client Quit] |
21:53:34 | | HP_Archivist (HP_Archivist) joins |
21:59:31 | | Webuser705947 joins |
22:00:20 | <Webuser705947> | Hello |
22:04:09 | <Webuser705947> | JAA Hello |
22:04:25 | <@JAA> | Hi |
22:06:03 | <Webuser705947> | JAA So, is there a reason why you can't list the URL that I just requested to archive including the others? |
22:06:58 | | etnguyen03 (etnguyen03) joins |
22:07:06 | <Webuser705947> | JAA I meant to say the URL's contents. My bad |
22:07:26 | <nicolas17> | how can we possibly know what files are in wdig-2.ocs.llnw.net? |
22:08:58 | <pokechu22> | do *you* have a list of all URLs on http://wdig-2.ocs.llnw.net/? If you do we can run that, but I'm not seeing any way to do it |
22:09:19 | <pokechu22> | that page is a 403 and https://duckduckgo.com/?t=ffab&q=site%3Awdig-2.ocs.llnw.net&ia=web and https://www.google.com/search?hl=en&q=site%3Awdig%2D2.ocs.llnw.net give no results |
22:11:45 | <Webuser705947> | maybe you can ask a user who is good at archiving websites that return 403 Forbidden, or maybe use the FileZilla application. And no, I don't have a list of all URLs for that host. |
22:13:57 | <nicolas17> | FileZilla is for FTP |
22:13:59 | <nicolas17> | this is not an FTP server |
22:14:32 | <Webuser705947> | Oops |
22:14:40 | <@JAA> | Well, it actually is an FTP server, but not an open one we can access. |
22:15:29 | | nomead quits [Quit: Leaving] |
22:15:29 | <@JAA> | If you have a login, maybe that can be used to list it. |
22:15:50 | <@JAA> | If you know of *any* content on there, maybe it's possible to guess more URLs. |
22:16:17 | <@JAA> | We don't have a magic wand, unfortunately. |
22:16:35 | <nicolas17> | bizarre, the unencrypted-rsync port is open too, but you can't retrieve anything from there either |
22:17:51 | | SootBector (SootBector) joins |
22:19:38 | | benjins3 joins |
22:55:05 | <that_lurker> | They do have some interesting subdomains https://subdomainfinder.c99.nl/scans/2025-02-06/llnw.net |
22:55:51 | | territoryjazz joins |
22:55:52 | | territoryjazz quits [Client Quit] |
22:56:13 | | territoryjazz joins |
22:57:39 | <@JAA> | TIL they're called Edgio now. It was Limelight, but they acquired/merged with Edgecast recently. |
23:01:14 | <that_lurker> | and now they are part of akamai |
23:01:36 | <that_lurker> | https://learn.microsoft.com/en-us/azure/cdn/edgio-retirement-faq |
23:03:03 | | loug8318142 quits [Client Quit] |
23:05:25 | <@JAA> | Akamai-- |
23:05:25 | <eggdrop> | [karma] 'Akamai' now has -83 karma! |
23:07:15 | <pokechu22> | https://github.com/ufs-community https://github.com/NOAA-EMC https://github.com/NOAA-OWP https://github.com/NOAA-GFDL https://github.com/NOAA-GSL https://github.com/NOAA-PMEL https://github.com/NOAA-ORR-ERD https://github.com/NOAA-FIMS https://github.com/NCAR |
23:07:25 | <pokechu22> | err, that's supposed to be #gitgud |
23:46:08 | | cascode quits [Ping timeout: 260 seconds] |
23:46:35 | | cascode joins |