00:28:25 | <datechnoman> | I have 4 concurrent extractions running at the moment. Getting some hits but going to be a very low number compared to the massive dumps I find for our other projects |
00:34:49 | <datechnoman> | Grepping at 500mbps |
00:36:30 | <@JAA> | If we get few new hits, that's also a useful result, namely that we probably covered the vast majority of what exists. |
00:37:16 | <@JAA> | s/namely that/namely because it means that/ |
00:37:54 | | wickedplayer494 quits [Read error: Connection reset by peer] |
00:39:26 | | wickedplayer494 joins |
03:04:07 | <datechnoman> | And like you said, I reckon the website is pretty small |
03:06:40 | <datechnoman> | I'm just grepping "umass.edu" and whatever you need can be grabbed from there :) |
03:18:57 | | qwertyasdfuiopghjkl18 is now authenticated as qwertyasdfuiopghjkl |
03:19:07 | | qwertyasdfuiopghjkl18 is now known as qwertyasdfuiopghjkl |
05:55:18 | <datechnoman> | JAA - You scrapped all of the urls that the WBM CDX had for people.umass.edu and courses.umass.edu yeah? |
05:57:01 | <@JAA> | datechnoman: I extracted the directories/usernames from them, yeah. |
05:59:01 | <datechnoman> | Smick. That means that a huge chunk of my collection I can skip through as all .edu domains were already pushed through / processed in #// |
05:59:13 | <datechnoman> | The 4 batches im doing atm have not gone through extraction for all of our various projects |
05:59:25 | <datechnoman> | The rest I can skip scanning as they should already be in the WBM |
06:00:05 | <datechnoman> | So in a day or two I will have a list to provide and that will be everything I can provide at this time |
06:00:28 | <@JAA> | Ah, nice |
08:34:40 | | Maturion joins |
10:04:05 | | imer quits [Ping timeout: 260 seconds] |
10:39:28 | | imer (imer) joins |
12:11:41 | | root joins |
12:12:57 | | root quits [Client Quit] |
13:54:11 | | decky_e joins |
13:55:58 | | decky quits [Ping timeout: 240 seconds] |
14:53:57 | | Chris50108 (Chris5010) joins |
14:55:45 | | Chris5010 quits [Ping timeout: 260 seconds] |
14:55:45 | | Chris50108 is now known as Chris5010 |
15:36:16 | | Chris50104 (Chris5010) joins |
15:37:45 | | Chris5010 quits [Ping timeout: 260 seconds] |
15:37:45 | | Chris50104 is now known as Chris5010 |
15:42:16 | | Chris5010 quits [Client Quit] |
15:43:07 | | Chris5010 (Chris5010) joins |
19:01:20 | | wickedplayer494 quits [Ping timeout: 260 seconds] |
19:03:06 | | wickedplayer494 joins |
19:03:21 | | wickedplayer494 is now authenticated as wickedplayer494 |
21:09:38 | | wickedplayer494 quits [Ping timeout: 240 seconds] |
21:11:25 | | wickedplayer494 joins |
21:11:41 | | wickedplayer494 is now authenticated as wickedplayer494 |
22:08:38 | | Sluggs quits [Excess Flood] |
22:24:22 | | Sluggs joins |
22:26:50 | | that_lurker quits [Remote host closed the connection] |
22:27:51 | | that_lurker (that_lurker) joins |
22:41:09 | | Maturion quits [Remote host closed the connection] |