00:13:00 | <GLaDOS> | Remember: You cannot trust them. |
05:29:00 | <SketchCow> | Aw man, Brewster's tearing up on stage |
05:30:00 | <Cameron_D> | This is being recorded, yes? |
05:37:00 | <balrog_> | I sure hope so. |
05:38:00 | <SketchCow> | I assume so |
05:50:00 | <bsmith094> | 2 things, whats being recorded and where ?, and , I hate to beat a dead horse, but is anyone still working on ffnet grab, warrior -izing that would be helpful |
05:57:00 | <SketchCow> | Virgil! |
06:09:00 | <SketchCow> | Ted Nelson! |
08:31:00 | <xk_id> | alard: sorry, I was afk. My research is in network science. I wish to crawl an online social network to extract the social graph and measure its topological features (degree distribution, etc.) |
09:09:00 | <alard> | xk_id: Ah, interesting. |
09:14:00 | <xk_id> | thanks |
09:26:00 | <Nemo_bis> | xk_id: could also do it for pseudo-social networs like Wikipedia's userbase? :) |
09:27:00 | <Nemo_bis> | *networks |
09:31:00 | <xk_id> | yes. |
09:31:00 | <xk_id> | How are the wikipedia users linked? |
09:54:00 | <godane> | your not going to believe this |
09:56:00 | <godane> | someone uploaded 'TechTV Music Wars' to MySpleen |
10:15:00 | <godane> | i also found a mcdonalds vhs training tape from 1972 |
10:18:00 | <Nemo_bis> | xk_id: I think the most likely way to link them is checking their respective user talk edits |
10:21:00 | <Nemo_bis> | xk_id: for instance, https://toolserver.org/~tparis/pcount/index.php?name=Nemo+bis&lang=meta&wiki=wikimedia will immediately disclose that I like users Sj and Pathoschild (see "user talk" top edits) :) |
10:22:00 | <Nemo_bis> | You could also check userpages cross-links but those are more rare. Anyway, all this is available in dumps for Wikimedia wikis. (And perhaps also Wikia soon? :/ ) |
11:11:00 | <omf__> | Did we ever finish backing up the halflife collection from planetphillip.com ? |
11:12:00 | <omf__> | I think schbiridi was working on it last July |
11:12:00 | <omf__> | the site owner took it offline, then put it back up |
11:42:00 | <xk_id> | Nemo_bis: aha, yes. I was thinking it might be something like that. indeed, it's a good way to infer connections between users |
11:44:00 | <xk_id> | Nemo_bis: by the way, there's a "conversation" in network science going on atm, which focuses on contrasting the friendship graphs as given by the friends lists, and the interaction graphs of the actual interactions between users |
11:44:00 | <xk_id> | Nemo_bis: some studies showed they tend to differ a great deal :) |
11:44:00 | <xk_id> | s/and the interaction/with the interaction |
11:45:00 | <xk_id> | but yeah, that's just tangential |
11:48:00 | <schbiridi> | omf__: nope, i ran into trouble and stopped. can't remember |
15:02:00 | <omf__> | how much did you get? |
15:03:00 | <omf__> | I am looking at doing a backup |
15:03:00 | <omf__> | now that steam is in full swing on linux there is a ton of content I need to find again for older games |
15:35:00 | <schbiridi> | omf__: i do not remember and i did not keep the files, sorry. |
15:36:00 | <schbiridi> | i'd estimate ~50GB but that is a guess really. maybe 100 or more |
15:48:00 | <Ymgve> | schbiridi: what is 50gb? |
15:53:00 | <schbiridi> | planetphillip maybe |
16:22:00 | <omf__> | Did you premap the site or just download files as you went? |
16:44:00 | <schbiridi> | cant remember but i remember that you better premap |
17:03:00 | <xk_id> | "More generally, Google said it best that ârunning a web crawler generates a fair amount of phone calls.â" |
17:09:00 | <xk_id> | gives a good impression of the scale of Google crawls: "To download a billion pages in one year, a crawler must sustain a rate of 32 pages/second. However, search engines must also recrawl pages to obtain the most recent version, which amplifies the need for speed." |
17:22:00 | <xk_id> | this looks alright, but... tickets cost £27.50 http://www.youtube.com/watch?v=FNGm3ZL43gs |
17:25:00 | <xk_id> | I am so sorry |
17:25:00 | <xk_id> | wrong channel |
17:25:00 | <xk_id> | :P |
17:53:00 | <omf__> | I should have something partial later today. I will need to add warc data though. |
18:34:00 | <bsmith094> | godane1: holy crap, you found it!?! link, please? |
19:28:00 | <godane1> | bsmith094: i will link it when i get it uploaded |
19:29:00 | <godane1> | ftp is being very slow |
19:41:00 | <Schbirid> | german gaming streaming site http://www.own3d.tv/ is out of money and will be closing shortly |
19:42:00 | <balrog_> | :( |
19:42:00 | <Schbirid> | http://www.cadred.org/News/Article/191800/ |