00:07:20 | | aninternettroll (aninternettroll) joins |
00:11:06 | | balrog quits [Ping timeout: 260 seconds] |
00:18:34 | | Webuser423063 joins |
00:18:41 | | Webuser423063 quits [Client Quit] |
00:19:31 | | Wohlstand quits [Quit: Wohlstand] |
00:26:00 | | emphatic quits [Ping timeout: 258 seconds] |
00:30:56 | | aninternettroll quits [Ping timeout: 260 seconds] |
00:37:15 | | lennier2_ joins |
00:39:52 | | aninternettroll (aninternettroll) joins |
00:40:11 | | lennier2 quits [Ping timeout: 258 seconds] |
00:43:11 | | ymgve_ quits [Ping timeout: 260 seconds] |
00:51:49 | | ymgve joins |
00:52:04 | | threedeeitguy6 quits [Ping timeout: 258 seconds] |
00:52:54 | | threedeeitguy6 (threedeeitguy) joins |
00:56:22 | <pabs> | JAA arkiver - any thoughts on Medaka's diary.fc2.com messages above? |
01:04:43 | | ymgve quits [Ping timeout: 258 seconds] |
01:32:03 | | Megame quits [Read error: Connection reset by peer] |
01:34:32 | | Megame (Megame) joins |
01:46:15 | | DogsRNice joins |
01:57:50 | | etnguyen03 (etnguyen03) joins |
02:42:48 | | dabs quits [Read error: Connection reset by peer] |
02:42:51 | | balrog (balrog) joins |
02:43:37 | | etnguyen03 quits [Remote host closed the connection] |
03:09:48 | | ymgve joins |
03:13:08 | <nicolas17> | I collected URLs for all crestron HTML docs |
03:13:43 | <nicolas17> | excluding https://docs.crestron.com/en-us/8525/Content/Topics/Home.htm (which we already archived, and happens to be the largest), it's 5353 URLs, plus prerequisites to be discovered |
03:14:12 | <nicolas17> | /8525/ was 436 URLs by itself, dunno how much it grew when archivebot added prerequisites |
03:14:50 | <nicolas17> | hm I found a few cases where I extracted url "___", need to figure out where that came from |
03:19:32 | <@JAA> | You mean page requisites? |
03:20:26 | <nicolas17> | yes |
03:20:31 | <nicolas17> | -.- |
03:21:02 | <nicolas17> | words |
03:22:57 | <@JAA> | words are hard++ |
03:22:58 | <eggdrop> | [karma] 'words are hard' now has 1 karma! |
03:47:23 | | pedantic-darwin quits [Quit: Ping timeout (120 seconds)] |
03:48:09 | | pedantic-darwin joins |
04:03:41 | | DogsRNice quits [Read error: Connection reset by peer] |
04:10:02 | | camrod636 (camrod) joins |
04:30:35 | | archiveDrill quits [Quit: The Lounge - https://thelounge.chat] |
04:34:09 | | archiveDrill joins |
04:39:37 | | BornOn420 quits [Ping timeout: 258 seconds] |
04:52:15 | | BornOn420 (BornOn420) joins |
04:53:45 | <pabs> | https://www.thewrap.com/dilbert-scott-adams-prostate-cancer-biden/ https://news.ycombinator.com/item?id=44031917 |
05:02:11 | | ichdasich quits [Ping timeout: 260 seconds] |
05:02:11 | | pabs quits [Ping timeout: 260 seconds] |
05:03:42 | | ichdasich joins |
05:21:49 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
05:22:16 | | eroc1990 (eroc1990) joins |
05:30:02 | | pabs (pabs) joins |
05:37:34 | <@arkiver> | pabs yeah, interesting |
05:44:46 | | cm quits [Ping timeout: 260 seconds] |
05:44:54 | <pabs> | I'm guessing it would require DPoS, don't think the other options support arbitrary UAs? |
05:47:56 | | cm joins |
05:52:23 | <@JAA> | Well, grab-site |
05:52:53 | <@JAA> | I haven't looked at the site at all though. Maybe it requires scripting anyway. |
05:54:00 | | Megame quits [Quit: Leaving] |
05:55:32 | <h2ibot> | PaulWise edited Discord (+149, searchcord.io): https://wiki.archiveteam.org/?diff=55709&oldid=55239 |
05:56:26 | <pabs> | sounds like it would require the sitemap trick, IIRC grab-site has the parent issue too |
05:57:54 | <@JAA> | Yeah |
05:58:59 | <@JAA> | Well, actually, maybe not. |
06:00:22 | <@JAA> | Looks like the blogs reside entirely in /cgi-sys/ed.cgi/$whatever/, so a recursive crawl would be fine. Omitting the final slash would also let it recurse to other blogs that might not be in the list. |
06:00:51 | <@JAA> | --no-parent would only be a problem if there were parts outside of /cgi-sys/ed.cgi/. |
06:04:56 | <pabs> | hmm, I thought making !a < work needed /cgi-sys/ed.cgi/$whatever rather than /cgi-sys/ed.cgi/$whatever/ (i.e. no trailing slash). I get 403s, does the slashless URL work? |
06:06:14 | <@JAA> | It's not required, but it's preferred in case of crosslinks between blogs. Else some parts may be missed. |
06:06:21 | <@JAA> | Both versions work fine for me with the Googlebot UA. |
06:07:34 | | pabs needs a UA webextension |
06:11:39 | | @JAA was just looking at it with curl-ua. |
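The trailing-slash point discussed above can be sketched as follows. This is a simplified model of a wget-style "no parent" rule (a candidate URL is in scope only if its path sits under the start URL's directory). It is not the actual scoping logic of ArchiveBot or grab-site, and the blog names `blogA` and `blogB` are hypothetical placeholders for `$whatever`.

```python
from urllib.parse import urlsplit


def parent_dir(url: str) -> str:
    """Return the directory part of the URL path, up to and including the last slash."""
    path = urlsplit(url).path or "/"
    return path[: path.rfind("/") + 1]


def in_scope(start_url: str, candidate: str) -> bool:
    """Simplified 'no parent' check: same host, and candidate path under the start directory."""
    start, cand = urlsplit(start_url), urlsplit(candidate)
    if start.netloc != cand.netloc:
        return False
    return (cand.path or "/").startswith(parent_dir(start_url))


# Hypothetical blog names, for illustration only.
with_slash = "https://diary.fc2.com/cgi-sys/ed.cgi/blogA/"
without_slash = "https://diary.fc2.com/cgi-sys/ed.cgi/blogA"
other_blog = "https://diary.fc2.com/cgi-sys/ed.cgi/blogB/"

print(parent_dir(with_slash))               # /cgi-sys/ed.cgi/blogA/
print(parent_dir(without_slash))            # /cgi-sys/ed.cgi/
print(in_scope(with_slash, other_blog))     # False: blogB is outside blogA/
print(in_scope(without_slash, other_blog))  # True: both sit under /cgi-sys/ed.cgi/
```

Under this model, starting from the slashless URL widens the scope to everything under /cgi-sys/ed.cgi/, which is why crosslinks to other blogs would be followed rather than skipped.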
06:23:23 | | cooljeanius quits [Quit: This computer has gone to sleep] |
07:14:07 | | beastbg8 quits [Read error: Connection reset by peer] |
07:21:15 | | Island quits [Read error: Connection reset by peer] |
07:48:54 | | emphatic joins |
07:58:09 | | cooljeanius joins |
08:11:34 | <@JAA> | pabs: I have some tooling for MEGA and am grabbing a copy of that upload now. But that's just a plain file, not proper archival. It's a bunch of POST requests, so playback wouldn't work anyway. |
08:14:40 | <@JAA> | (The tooling is mostly to avoid the crappy web interface.) |
08:17:26 | <@JAA> | > Data download failed: Server returned 509 (over quota) |
08:17:31 | <@JAA> | Oh yeah, I forgot about that crap. |
08:22:01 | | khaoohs quits [Read error: Connection reset by peer] |
08:22:21 | | lennier2_ quits [Read error: Connection reset by peer] |
08:22:36 | | lennier2_ joins |
08:22:40 | | khaoohs joins |
08:23:09 | | makeworld quits [Quit: Ping timeout (120 seconds)] |
08:23:11 | | Snivy quits [Quit: Ping timeout (120 seconds)] |
08:23:18 | | Bleo182600722719623455 quits [Quit: Ping timeout (120 seconds)] |
08:23:29 | | eroc19909 (eroc1990) joins |
08:23:31 | | Bleo182600722719623455 joins |
08:23:39 | | Snivy (Snivy) joins |
08:24:13 | | Dada joins |
08:24:38 | | eroc1990 quits [Read error: Connection reset by peer] |
08:43:32 | | threedeeitguy6 quits [Read error: Connection reset by peer] |
08:43:47 | | threedeeitguy6 (threedeeitguy) joins |
09:14:46 | | threedeeitguy6 quits [Ping timeout: 260 seconds] |
09:15:08 | | threedeeitguy6 (threedeeitguy) joins |
09:21:11 | | threedeeitguy6 quits [Ping timeout: 260 seconds] |
09:29:40 | | beastbg8 (beastbg8) joins |
09:34:08 | | threedeeitguy6 (threedeeitguy) joins |
10:31:23 | | threedeeitguy60 (threedeeitguy) joins |
10:31:52 | | threedeeitguy6 quits [Read error: Connection reset by peer] |
10:31:52 | | threedeeitguy60 is now known as threedeeitguy6 |
10:37:51 | | notSokar joins |
10:40:06 | | Sokar quits [Ping timeout: 258 seconds] |
11:00:03 | | Bleo182600722719623455 quits [Client Quit] |
11:01:32 | | cooljeanius quits [Quit: This computer has gone to sleep] |
11:02:47 | | Bleo182600722719623455 joins |
11:40:38 | | pedantic-darwin9 joins |
11:41:49 | | pedantic-darwin quits [Ping timeout: 258 seconds] |
11:41:49 | | pedantic-darwin9 is now known as pedantic-darwin |
11:47:22 | | nstrom joins |
11:50:39 | | nstrom quits [Client Quit] |
11:53:02 | | nstrom joins |
12:31:27 | | Wohlstand (Wohlstand) joins |
13:34:30 | | Wohlstand quits [Client Quit] |
14:30:50 | | balrog quits [Quit: Bye] |
14:34:37 | | legoktm quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.] |
14:34:39 | | Wohlstand (Wohlstand) joins |
14:35:15 | | legoktm joins |
14:38:36 | | balrog (balrog) joins |
14:56:34 | | abirkill (abirkill) joins |
15:04:14 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
15:18:02 | | cooljeanius joins |
16:29:06 | | ahm258760 quits [Quit: The Lounge - https://thelounge.chat] |
16:29:21 | | ahm258760 joins |
16:38:31 | <h2ibot> | Exorcism edited Discourse/archived (+92): https://wiki.archiveteam.org/?diff=55710&oldid=55707 |
16:41:52 | | nyakase quits [Read error: Connection reset by peer] |
16:41:58 | | nyakase (nyakase) joins |
16:51:29 | | Matthww quits [Quit: The Lounge - https://thelounge.chat] |
16:56:31 | | Matthww joins |
16:57:21 | | nyakase quits [Ping timeout: 260 seconds] |
16:59:22 | | nyakase (nyakase) joins |
17:05:21 | | i_have_n0_idea3 quits [Quit: The Lounge - https://thelounge.chat] |
17:05:38 | | i_have_n0_idea3 (i_have_n0_idea) joins |
17:09:00 | | grill (grill) joins |
17:20:37 | <h2ibot> | HadeanEon edited Deaths in 2025 (+7066, BOT - Updating page: {{saved}} (128),…): https://wiki.archiveteam.org/?diff=55711&oldid=55702 |
17:20:38 | <h2ibot> | HadeanEon edited Deaths in 2025/list (+622, BOT - Updating list): https://wiki.archiveteam.org/?diff=55712&oldid=55682 |
17:25:28 | | lennier2 joins |
17:28:16 | | lennier2_ quits [Ping timeout: 260 seconds] |
17:47:23 | | lennier2_ joins |
17:50:26 | | lennier2 quits [Ping timeout: 260 seconds] |
18:10:06 | | nyakase5 (nyakase) joins |
18:12:36 | | nyakase quits [Ping timeout: 260 seconds] |
18:12:36 | | nyakase5 is now known as nyakase |
18:26:00 | | flotwig is now authenticated as flotwig |
18:26:00 | | flotwig quits [Changing host] |
18:26:00 | | flotwig (flotwig) joins |
18:27:35 | <legoktm> | https://social.freedom.press/@freedomofpress/114541366194039556 a panel on Friday discussing attacks on Voice of America featuring Jason Scott |
18:28:43 | | lennier2 joins |
18:31:51 | | lennier2_ quits [Ping timeout: 260 seconds] |
18:39:57 | | f_ is now known as f_|DEPRECATED |
18:40:02 | | f_|DSR is now known as f_ |
18:40:48 | | grill quits [Ping timeout: 258 seconds] |
18:42:26 | | grill (grill) joins |
19:03:07 | | nine quits [Quit: See ya!] |
19:03:20 | | nine joins |
19:03:20 | | nine is now authenticated as nine |
19:03:20 | | nine quits [Changing host] |
19:03:20 | | nine (nine) joins |
19:32:29 | | Island joins |
19:44:11 | | Church quits [Ping timeout: 260 seconds] |
19:45:29 | | Megame (Megame) joins |
19:48:16 | | kansei quits [Ping timeout: 258 seconds] |
19:48:30 | | kansei- (kansei) joins |
19:58:38 | | rohvani joins |
19:59:56 | | grill quits [Ping timeout: 260 seconds] |
20:18:33 | | APOLLO03 quits [Ping timeout: 258 seconds] |
20:23:09 | | dabs joins |
20:55:58 | | cooljeanius quits [Quit: This computer has gone to sleep] |
21:10:34 | | nstrom quits [Quit: Ooops, wrong browser tab.] |
21:21:31 | | dabs quits [Remote host closed the connection] |
21:21:49 | | dabs joins |
21:27:45 | | nine quits [Client Quit] |
21:28:02 | | nine joins |
21:28:03 | | nine is now authenticated as nine |
21:28:03 | | nine quits [Changing host] |
21:28:03 | | nine (nine) joins |
21:32:53 | | sepro (sepro) joins |
21:58:31 | | Matthww quits [Quit: The Lounge - https://thelounge.chat] |
22:03:09 | | Matthww joins |
22:09:21 | | beastbg8_ joins |
22:12:21 | | beastbg8 quits [Ping timeout: 260 seconds] |
22:15:28 | | PredatorIWD25 quits [Read error: Connection reset by peer] |
22:28:30 | | riteo quits [Read error: Connection reset by peer] |
22:30:43 | | riteo (riteo) joins |
22:57:43 | | Dada quits [Remote host closed the connection] |
23:08:21 | | CYBERDEV quits [Ping timeout: 260 seconds] |
23:17:53 | | CYBERDEV joins |
23:34:39 | | bladem (bladem) joins |