00:06:26<Ryz>Regarding http://courses.umass.edu/ and http://people.umass.edu/ - yeah, this needs to be run into queueh2ibot, since there's an unknown amount of time til the content disappears as someone had reported
00:06:34@JAA sets the topic to: Finding ISP and university web hosting services before the Grim Reaper finds them. | https://wiki.archiveteam.org/index.php/ISP_Hosting https://wiki.archiveteam.org/index.php/University_Web_Hosting
00:07:37seacow joins
00:11:16<pokechu22>That should be fine to do an !a < list; I don't think queueh2ibot is needed
00:11:31<pokechu22>and I'd just list them both with and without tildes - if they both appear then that's fine
00:11:56<@JAA>If we do just one job with everything we can find, yeah.
01:59:58Aoede_ (Aoede) joins
02:01:58Aoede quits [Ping timeout: 240 seconds]
02:28:16<Ryz>Updates on the listings creation, JAA and pokechu22?
02:28:41<pokechu22>uh, I started scraping duckduckgo and then got distracted
02:36:46<@JAA>I haven't gotten around to it yet.
02:37:10<@JAA>But poking briefly, I see that the same things are available with and without the tilde, e.g. http://people.umass.edu/~aef6000/ and http://people.umass.edu/aef6000/
02:37:29<@JAA>So that's something to keep in mind for the generation. I have no idea what's canonical.
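[Note: the idea of listing every user directory both with and without the tilde could be sketched as below. This is only an illustration, assuming the user name is always the first path segment, as in the people.umass.edu example above. The function name tilde_variants is hypothetical and is not part of any ArchiveTeam tooling.]

```python
from urllib.parse import urlsplit, urlunsplit


def tilde_variants(url):
    """Return the URL with and without a leading tilde on its first path segment.

    Assumption: the user name is the first path segment, e.g. /~aef6000/ or /aef6000/.
    URLs without a first path segment are returned unchanged.
    """
    parts = urlsplit(url)
    segments = parts.path.split("/")
    # For an absolute path, segments[0] is "" and segments[1] is the first real segment.
    if len(segments) < 2 or not segments[1]:
        return [url]
    user = segments[1].lstrip("~")
    if not user:
        return [url]
    variants = []
    for first in ("~" + user, user):
        path = "/".join([segments[0], first] + segments[2:])
        variants.append(
            urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))
        )
    return variants


if __name__ == "__main__":
    for line in tilde_variants("http://people.umass.edu/aef6000/"):
        print(line)
```

[Running every discovered URL through something like this and deduplicating the result would give a single list containing both forms, so it would not matter which one is canonical.]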
03:41:51qwertyasdfuiopghjkl quits [Quit: Client closed]
03:52:25qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
09:44:27atphoenix quits [Read error: Connection reset by peer]
09:44:57atphoenix (atphoenix) joins
11:33:58Sluggs quits [Ping timeout: 240 seconds]
11:44:39Sluggs joins
12:03:41Sluggs quits [Excess Flood]
12:06:34Sluggs joins
15:11:35kiska quits [Ping timeout: 260 seconds]
15:11:35@Flashfire42 quits [Ping timeout: 260 seconds]
15:29:48Flashfire42 joins
15:35:37kiska (kiska) joins
16:58:58Flashfire42 quits [Ping timeout: 240 seconds]
16:59:30kiska quits [Ping timeout: 260 seconds]
17:33:40Flashfire42 joins
17:34:46kiska (kiska) joins
17:36:50Maturion joins
18:02:28qwertyasdfuiopghjkl quits [Ping timeout: 255 seconds]
18:20:31nulldata quits [Quit: Ping timeout (120 seconds)]
18:22:30nulldata (nulldata) joins
18:52:29qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
19:29:07Craigle quits [Quit: The Lounge - https://thelounge.chat]
19:31:35Craigle (Craigle) joins
19:32:29@ChanServ sets mode: +o Flashfire42
20:39:30qwertyasdfuiopghjkl20 joins
20:42:13qwertyasdfuiopghjkl quits [Ping timeout: 255 seconds]
21:32:19qwertyasdfuiopghjkl28 joins
21:35:19qwertyasdfuiopghjkl20 quits [Ping timeout: 255 seconds]
21:35:55qwertyasdfuiopghjkl59 joins
21:38:55qwertyasdfuiopghjkl28 quits [Ping timeout: 255 seconds]
23:22:36qwertyasdfuiopghjkl59 quits [Client Quit]
23:23:31qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
23:49:45Maturion quits [Remote host closed the connection]
23:53:14<@JAA>%E2%88%BCstatdata
23:53:46<@JAA>E2 88 BC is U+223C TILDE OPERATOR. <but_why.gif>
23:56:09<@JAA>There are also some &tilde;aizen URLs in the WBM.
23:56:25<@JAA>Parsing CDX API output is always a lot of fun due to things like this.
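[Note: one way to cope with the tilde variants mentioned above when comparing or deduplicating paths is sketched below. It is a rough illustration only, covering the standard %7E escape plus the two oddities seen here (U+223C TILDE OPERATOR and a literal &tilde; string). The function name normalize_tilde is hypothetical, and percent-decoding the whole path is an assumption that may be too aggressive for some URLs, so the result is meant for matching rather than for fetching.]

```python
from urllib.parse import unquote

# Strings observed in place of a plain "~" in the discussion above.
TILDE_LOOKALIKES = ("\u223c", "&tilde;")


def normalize_tilde(path):
    """Return a comparison-friendly form of a URL path with tilde variants folded to "~".

    Percent-escapes are decoded first, so "%7E" and "%E2%88%BC" are both handled.
    """
    decoded = unquote(path)
    for variant in TILDE_LOOKALIKES:
        decoded = decoded.replace(variant, "~")
    return decoded


if __name__ == "__main__":
    for sample in ("%E2%88%BCstatdata", "&tilde;aizen", "%7Eaef6000"):
        print(sample, "->", normalize_tilde(sample))
```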
23:59:31<pokechu22>Still working on scraping links, but I can probably have something in like 15 minutes