00:08:59bradp quits [Client Quit]
00:09:31madcarbs joins
00:32:18yawkat quits [Ping timeout: 258 seconds]
00:35:19yawkat (yawkat) joins
01:02:29dm4v_ joins
01:03:21dm4v quits [Ping timeout: 258 seconds]
01:03:21dm4v_ is now known as dm4v
01:03:21dm4v quits [Changing host]
01:03:21dm4v (dm4v) joins
01:07:57onetruth joins
01:09:23madcarbs quits [Client Quit]
01:22:52britmob quits [Read error: Connection reset by peer]
01:26:05britmob joins
01:26:37britmob quits [Read error: Connection reset by peer]
01:27:25madcarbs_ (madcarbs) joins
01:27:48Mineroboter joins
01:28:08madcarbs joins
01:29:02Mineroboter_ quits [Ping timeout: 258 seconds]
01:30:37madcarbs_ quits [Client Quit]
01:31:30madcarbs_ (madcarbs) joins
01:31:46madcarbs_ quits [Client Quit]
01:44:53madcarbs_ (madcarbs) joins
01:46:29madcarbs_ quits [Changing host]
01:46:29madcarbs_ (madcarbs) joins
01:47:28madcarbs_ quits [Client Quit]
01:56:20BlueMaxima joins
02:01:36madcarbs quits [Changing host]
02:01:36madcarbs (madcarbs) joins
02:02:12Mineroboter_ joins
02:04:08Mineroboter quits [Ping timeout: 250 seconds]
02:25:55madcarbs_ (madcarbs) joins
02:26:44madcarbs_ quits [Client Quit]
03:17:24<Somebody2>Looks like the Bandcamp index grab has basically finished and is uploading.
03:18:07madcarbs_ (madcarbs) joins
03:18:33<Somebody2>7.5 GB covering 1.7 million artist/band names and (100-pixel square) images.
03:19:12<Somebody2>We should do an updated (partial) grab in six months or so.
03:19:28<Somebody2>The index is ordered by added-date, so that should be easy.
03:20:22<Somebody2>And consider whether or not to do a (probably Warrior) job to grab all the album & track info (i.e. everything *but* the actual music).
03:21:52<Wayward>can we go back to just 1 or 2 dozen composers?
03:22:19<Somebody2>Interestingly, the WBM has a copy of the front page of the index from back in 2013, when there were only 958 pages: https://web.archive.org/web/20130809124034/https://bandcamp.com/artist_index
03:22:38<Somebody2>rather than the 3583 now
03:24:44monoxane quits [Ping timeout: 250 seconds]
03:24:48jazza quits [Ping timeout: 258 seconds]
03:27:18britmob joins
03:37:13DogsRNice quits [Read error: Connection reset by peer]
03:46:05hilda quits [Read error: Connection reset by peer]
03:50:09hilda joins
03:50:18madcarbs_ quits [Client Quit]
03:51:06jazza joins
03:55:46etnguyen03 quits [Client Quit]
03:57:16qw3rty_ joins
03:58:12monoxane (monoxane) joins
04:01:13qw3rty quits [Ping timeout: 258 seconds]
04:05:26Ryz quits [Ping timeout: 258 seconds]
04:14:25Sylirana quits [Ping timeout: 244 seconds]
04:37:55madcarbs_ (madcarbs) joins
04:38:57madcarbs_ quits [Client Quit]
04:48:17Sylirana (Sylirana) joins
04:52:12tzt quits [Ping timeout: 258 seconds]
05:01:21lennier2 joins
05:04:05lennier1 quits [Ping timeout: 258 seconds]
05:04:13lennier2 is now known as lennier1
06:38:34rsn quits [Client Quit]
07:19:06<masterX244>Gah... something looks off on the tm-x crawl... file count doesn't look right (roughly double what the initial file had, but it got replays, so it didn't catch all track files...)
07:19:44<masterX244>Got to crunch logs and update my discovery crawler code for a second run with an alternative approach
07:24:07LeighR (LeighR) joins
07:31:18HP_Archivist quits [Ping timeout: 250 seconds]
07:35:54hooway joins
07:51:00HP_Archivist (HP_Archivist) joins
07:57:47LeGoupil joins
08:02:04fuzzy8021 quits [Ping timeout: 250 seconds]
08:04:21fuzzy8021 joins
08:04:21fuzzy8021 quits [Changing host]
08:04:21fuzzy8021 (fuzzy8021) joins
08:04:25fuzzy8021 quits [Excess Flood]
08:04:43fuzzy8021 joins
08:04:43fuzzy8021 quits [Changing host]
08:04:43fuzzy8021 (fuzzy8021) joins
08:04:46fuzzy8021 quits [Excess Flood]
08:05:03fuzzy8021 joins
08:05:04fuzzy8021 quits [Changing host]
08:05:04fuzzy8021 (fuzzy8021) joins
08:43:57Arcorann__ joins
09:07:30fuzzy8021 quits [Ping timeout: 258 seconds]
09:07:54fuzzy8021 joins
09:07:54fuzzy8021 quits [Changing host]
09:07:54fuzzy8021 (fuzzy8021) joins
09:07:56fuzzy8021 quits [Excess Flood]
09:08:17fuzzy8021 joins
09:08:17fuzzy8021 quits [Changing host]
09:08:17fuzzy8021 (fuzzy8021) joins
09:08:22fuzzy8021 quits [Excess Flood]
09:08:39fuzzy8021 joins
09:08:39fuzzy8021 quits [Changing host]
09:08:39fuzzy8021 (fuzzy8021) joins
09:08:43fuzzy8021 quits [Excess Flood]
09:09:01fuzzy8021 joins
09:09:02fuzzy8021 quits [Changing host]
09:09:02fuzzy8021 (fuzzy8021) joins
09:09:05fuzzy8021 quits [Excess Flood]
09:09:19fuzzy8021 joins
09:09:19fuzzy8021 quits [Changing host]
09:09:19fuzzy8021 (fuzzy8021) joins
09:09:23fuzzy8021 quits [Excess Flood]
09:09:38fuzzy8021 joins
09:09:38fuzzy8021 quits [Changing host]
09:09:38fuzzy8021 (fuzzy8021) joins
09:09:41fuzzy8021 quits [Excess Flood]
09:09:58fuzzy8021 joins
09:09:59fuzzy8021 quits [Changing host]
09:09:59fuzzy8021 (fuzzy8021) joins
09:10:05fuzzy8021 quits [Excess Flood]
09:10:19fuzzy8021 joins
09:10:19fuzzy8021 quits [Changing host]
09:10:19fuzzy8021 (fuzzy8021) joins
09:10:22fuzzy8021 quits [Excess Flood]
10:29:44jtagcat quits [Quit: Ping timeout (120 seconds)]
10:29:56jtagcat (jtagcat) joins
11:23:45madcarbs_ (madcarbs) joins
11:24:03madcarbs_ quits [Client Quit]
11:44:37tzt joins
12:09:12HP_Archivist quits [Ping timeout: 258 seconds]
13:12:55Iki joins
13:36:21etnguyen03 (etnguyen03) joins
13:48:52hilda quits [Ping timeout: 258 seconds]
13:51:32Iki quits [Remote host closed the connection]
14:03:16Ryz (Ryz) joins
14:03:28Ryz quits [Remote host closed the connection]
14:07:13Ryz (Ryz) joins
14:08:58fuzzy8021 (fuzzy8021) joins
14:17:35voltagex_ joins
14:17:46voltagex quits [Ping timeout: 250 seconds]
14:30:25Iki joins
14:42:11mutantmonkey quits [Remote host closed the connection]
14:42:33mutantmonkey (mutantmonkey) joins
14:45:53hilda joins
15:14:32Guest69 joins
15:36:17Guest69 quits [Client Quit]
16:03:35<Ryz>Just a thought, is there a specific/niche web search engine that searches keywords? For example, back then, checking the view-source of say http://www.historywiz.org/ - I recall there used to be a bunch of keywords in something like meta name="Keywords" - but it's something that search engines in general don't take into account anymore because of abuse?
16:09:10Arcorann__ quits [Ping timeout: 258 seconds]
16:40:07rsn joins
16:40:13onetruth quits [Ping timeout: 258 seconds]
18:01:25ragu joins
18:04:33mutantmonkey quits [Remote host closed the connection]
18:04:55mutantmonkey (mutantmonkey) joins
18:22:32godane1 joins
18:22:47Iki quits [Ping timeout: 244 seconds]
18:23:52godane2 joins
18:24:06godane quits [Ping timeout: 258 seconds]
18:27:10godane1 quits [Ping timeout: 258 seconds]
18:28:01DogsRNice (Webuser299) joins
19:07:37Iki joins
19:11:28BlueMaxima quits [Read error: Connection reset by peer]
19:11:41BlueMaxima joins
19:12:35lun4 quits [Quit: Ping timeout (120 seconds)]
19:12:49lun4 (lun4) joins
20:06:15Matthww4 joins
20:06:36Matthww quits [Ping timeout: 250 seconds]
20:06:36Matthww4 is now known as Matthww
20:16:17Sylirana quits [Read error: Connection reset by peer]
20:17:16Sylirana (Sylirana) joins
20:19:50Matthww3 joins
20:21:46Matthww quits [Ping timeout: 251 seconds]
20:21:46Matthww3 is now known as Matthww
20:22:50HP_Archivist (HP_Archivist) joins
20:51:14LeGoupil quits [Client Quit]
20:57:31lennier1 quits [Client Quit]
21:08:19lennier1 (lennier1) joins
21:40:04HP_Archivist quits [Remote host closed the connection]
21:40:26HP_Archivist (HP_Archivist) joins
22:23:26qw3rty_ quits [Client Quit]
22:30:49<masterX244>dangit... TMX devs abusing a form field for storing data sent to the client that's parsed by JavaScript and then inserted into the page...
22:53:47<masterX244>second-stage discovery crawler running. Got to cross-ref the results with the partial first crawl and then do another run in grab-site
22:53:48qw3rty joins
22:56:45<masterX244>code pushed to my github, too
23:24:22<@arkiver>masterX244: where is that
23:24:26<@arkiver>your github
23:38:55<@JAA>arkiver: https://github.com/masterX244/TMExchange-Enumerator
23:46:43hooway quits [Client Quit]
23:59:22<@arkiver>thanks