| 00:15:37 | | nerdguy1138 quits [Ping timeout: 258 seconds] |
| 00:31:27 | | nerdguy1138 (nerdguy1138) joins |
| 00:34:09 | | Sylirana quits [Ping timeout: 244 seconds] |
| 00:34:58 | | Sylirana (Sylirana) joins |
| 00:40:15 | | Mineroboter_ joins |
| 00:41:00 | | Mineroboter quits [Ping timeout: 250 seconds] |
| 00:47:56 | | Arcorann_ quits [Ping timeout: 250 seconds] |
| 01:03:32 | | dm4v quits [Ping timeout: 250 seconds] |
| 01:05:07 | | dm4v joins |
| 01:05:09 | | dm4v is now authenticated as dm4v |
| 01:05:09 | | dm4v quits [Changing host] |
| 01:05:09 | | dm4v (dm4v) joins |
| 01:52:57 | | HP_Archivist quits [Client Quit] |
| 02:26:18 | | Zerote_ quits [Ping timeout: 250 seconds] |
| 02:31:47 | | Zerote joins |
| 03:07:37 | | Zerote_ joins |
| 03:10:30 | | Zerote quits [Ping timeout: 250 seconds] |
| 03:25:29 | | DopefishJustin quits [Remote host closed the connection] |
| 03:33:14 | | DopefishJustin joins |
| 03:34:07 | | DopefishJustin is now authenticated as DopefishJustin |
| 03:37:12 | | DogsRNice quits [Read error: Connection reset by peer] |
| 03:39:19 | | qw3rty__ joins |
| 03:41:11 | | pcr leaves |
| 03:41:13 | | pcr joins |
| 03:43:00 | | qw3rty_ quits [Ping timeout: 258 seconds] |
| 03:51:52 | | webdownload quits [Remote host closed the connection] |
| 04:08:19 | | superkuh joins |
| 04:21:48 | | etnguyen03 quits [Client Quit] |
| 04:43:57 | | benjins quits [Ping timeout: 258 seconds] |
| 05:18:45 | | cmlow quits [Quit: Connection closed for inactivity] |
| 05:45:38 | | howardad quits [Ping timeout: 250 seconds] |
| 05:56:49 | | rbraun joins |
| 06:20:41 | | benjins joins |
| 06:39:25 | | howardad (howardad) joins |
| 06:42:11 | | VukkyWork (VukkyWork) joins |
| 06:54:34 | | MaxG joins |
| 07:00:25 | | VukkyWork quits [Remote host closed the connection] |
| 07:28:17 | | duce1337 (duce1337) joins |
| 07:39:51 | | Arcorann_ joins |
| 07:52:45 | | tzt is now authenticated as tzt |
| 07:53:52 | | tzt quits [Changing host] |
| 07:53:52 | | tzt (tzt) joins |
| 08:31:33 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 08:59:20 | | icedice quits [Ping timeout: 250 seconds] |
| 09:33:44 | | bobbyb quits [Remote host closed the connection] |
| 09:33:56 | | bobbyb joins |
| 09:35:31 | | lennier1 (lennier1) joins |
| 10:02:09 | | hilda quits [Read error: Connection reset by peer] |
| 10:06:40 | | hilda joins |
| 10:27:15 | | benjins is now authenticated as benjins |
| 10:28:49 | | duce1337_ (duce1337) joins |
| 10:28:49 | | duce1337 quits [Read error: Connection reset by peer] |
| 11:15:43 | | Gereon quits [Ping timeout: 258 seconds] |
| 11:16:30 | | Gereon (Gereon) joins |
| 11:52:38 | | LeGoupil joins |
| 11:54:15 | | pcr leaves |
| 11:54:17 | | pcr joins |
| 12:18:48 | | ThreeHea1 (ThreeHeadedMonkey) joins |
| 12:18:58 | | ThreeHeadedMonkey quits [Ping timeout: 258 seconds] |
| 12:19:32 | | ThreeHea1 is now known as ThreeHeadedMonkey |
| 12:27:26 | | IKI joins |
| 13:08:14 | | benjinsmith joins |
| 13:11:06 | | benjins quits [Ping timeout: 250 seconds] |
| 13:24:43 | | benjinsmith is now known as benjins |
| 13:24:45 | | benjins is now authenticated as benjins |
| 14:03:51 | | hilda quits [Client Quit] |
| 14:14:01 | | cmlow (cmlow) joins |
| 14:17:25 | | HackMii_ quits [Ping timeout: 258 seconds] |
| 14:19:15 | | HackMii_ (hacktheplanet) joins |
| 14:21:53 | | nuroten quits [Remote host closed the connection] |
| 14:24:44 | | duce1337_ quits [Read error: Connection reset by peer] |
| 14:24:44 | | duce1337 (duce1337) joins |
| 14:25:39 | | sonick quits [Quit: Connection closed for inactivity] |
| 14:52:51 | | nuroten joins |
| 15:14:32 | <betamax> | jodizzle: FYI I've just reprocessed the lists of party / candidate websites, to only have base URLs (and then removed duplicates). This will have removed any that are just subsections of a larger site. New lists linked on the wiki page. |
| 15:15:18 | | Arcorann_ quits [Ping timeout: 258 seconds] |
| 15:15:27 | <betamax> | The one danger with the new lists is that there could be sites like "about.me/<candidate>" that are now just "about.me" - I've already removed "about.me" and "youtube.com" from the lists, but there could be more. |
| 15:18:58 | <betamax> | I've also just put the candidate web pages (not sites, the single pages) into AB as an '!ao <' job |
| 16:24:38 | | lennier1 quits [Client Quit] |
| 16:33:17 | | Sylirana quits [Read error: Connection reset by peer] |
| 16:34:25 | | Sylirana (Sylirana) joins |
| 16:43:26 | | endrift quits [Ping timeout: 250 seconds] |
| 16:43:36 | | endrift joins |
| 16:47:49 | | lennier1 (lennier1) joins |
| 16:48:02 | | lennier2 quits [Client Quit] |
| 16:52:57 | | DogsRNice (Webuser299) joins |
| 17:49:19 | | duce1337_ (duce1337) joins |
| 17:49:19 | | duce1337 quits [Read error: Connection reset by peer] |
| 17:55:00 | | rbraun quits [Client Quit] |
| 17:59:30 | <Sanqui> | comic genesis would be nice to archive |
| 18:00:18 | <Sanqui> | hmm |
| 18:00:28 | <Sanqui> | their search is broken, but I'm going to start a forums grab and we can derive domains from that |
| 18:02:00 | | Daloader joins |
| 18:32:15 | | yarrow leaves |
| 18:42:22 | <AK> | ori |
| 18:42:25 | <AK> | Well oops |
| 18:44:42 | <@EggplantN> | What you done now AK |
| 18:44:58 | <AK> | Attempted to launch origin to play some titanfall 2 |
| 18:45:37 | <@EggplantN> | Fuck sake AK |
| 18:47:37 | | spirit joins |
| 19:30:42 | | Daloader quits [Ping timeout: 250 seconds] |
| 19:39:00 | | spirit quits [Client Quit] |
| 19:41:48 | | @EggplantN is now known as @EggplantBot |
| 19:41:59 | | @EggplantBot is now known as @EggplantN |
| 19:51:41 | | LeighR (LeighR) joins |
| 21:07:23 | <betamax> | JAA: my plan is to start feeding the candidate / party sites into AB via '!a <'. If I do 100 per job then there will be around 16 jobs total. |
| 21:07:53 | <betamax> | For the first one or two I may try with outlinks enabled, and can turn that off in future jobs if it proves to be an issue. |
| 21:10:55 | <betamax> | An alternative approach would be for me to archive them all manually and upload the WARCs to IA, like I did for the US 2018 midterms - https://archive.org/details/2018_us_midterm_campaign_site_archive |
| 21:11:00 | <betamax> | But I don't think there's really any benefits to that approach aside form not clogging up AB pipelines - the resulting WARCs are less complete (no outlinks) and can't go into the wayback |
| 21:20:03 | | duce1337_ quits [Read error: Connection reset by peer] |
| 21:20:18 | | duce1337 (duce1337) joins |
| 21:23:13 | | godane joins |
| 21:23:13 | | godane is now authenticated as godane |
| 21:43:11 | | duce1337 quits [Client Quit] |
| 21:43:27 | | sonick (sonick) joins |
| 21:44:10 | | Wayward quits [Ping timeout: 250 seconds] |
| 21:45:11 | | Wayward (wayward) joins |
| 21:48:34 | | LeGoupil quits [Client Quit] |
| 22:00:10 | | LeighR quits [Client Quit] |
| 22:01:43 | | sec^nd quits [Remote host closed the connection] |
| 22:02:07 | | sec^nd (second) joins |
| 22:16:53 | <@JAA> | betamax: Yeah, let's try. !a < is restricted though as it has many pitfalls. Let me know when you have the lists ready, and I'll look over them and throw them in. |
| 22:17:16 | <@JAA> | Try to group them such that there's little chance of crosslinks as those mess with the recursion. |
| 22:29:20 | | webdownload joins |
| 22:30:09 | <betamax> | JAA: thanks. It's late here (got distrated - oops) so I'll make the lists tomorow. By "crosslinks", you mean sites that refer to each other? I'm not sure if there's an easy / obvious way to do that... |
| 22:31:31 | <webdownload> | I pronounce www.ted.com to be fully archived at Heatengine. |
| 22:36:53 | <@JAA> | betamax: Yeah, that's what I mean. The problem is that if you !a < a list that has example.org and example.net, and then the former has a link to example.net/foo/ and gets retrieved before a page from example.net linking there, it won't recurse further from that page. |
| 22:40:02 | <@JAA> | And nope, there isn't an easy way to do this. You'd have to group the sites accordingly, e.g. build lists of candidates all from different parties or parts of the country, which while obviously not a guarantee would at least lower the risk considerably. |
| 22:40:56 | | MaxG quits [Remote host closed the connection] |
| 23:00:54 | | BlueMaxima joins |
| 23:27:14 | | Arcorann_ joins |
| 23:52:58 | | sonick quits [Client Quit] |