03:48:00 | <SketchCow> | Supernap |
03:55:00 | <SketchCow> | alard: |
03:55:00 | <SketchCow> | While we'd like nothing more than to ride out the storm by transcribing Waldorf-Astoria menus, it looks like that's no longer an option. |
03:55:00 | <SketchCow> | Wait |
03:55:00 | <SketchCow> | I lost hope of retrieving my tabblo pictures when I found the Tabblo lifeboat thread. |
03:55:00 | <SketchCow> | The lifeboat does not work for me so myabe you can help |
03:55:00 | <SketchCow> | My username is teodorapopa |
04:48:00 | <SketchCow> | -------------- |
04:48:00 | <SketchCow> | So, just to prepare. Chance my power might go out (Hurricaine) |
04:49:00 | <SketchCow> | I'll use my cell to say hi and check on mail, but I might be iffy for the next few days. |
04:50:00 | <chronomex> | noted |
04:50:00 | <chronomex> | maybe the problem will be solved when you come back |
04:50:00 | | chronomex ducks |
04:53:00 | <underscor> | hahaha |
04:54:00 | | BlueMax throws chronomex out a window |
05:14:00 | <godane> | SketchCow: make sure the cube is wet proof |
05:14:00 | <godane> | maybe if space bag some of the stuff |
05:18:00 | <SketchCow> | Yeah, already on that. |
05:19:00 | <SketchCow> | The cube itself is not, the stuff inside is a foot higher or even higher than than as required |
05:21:00 | <godane> | i was thinking the space bag thing cause there suppose to keep water out |
05:22:00 | <godane> | i'm in nh so we may lose power too |
05:22:00 | <godane> | i backed up most the gbtv stuff last night |
05:46:00 | <SketchCow> | http://sphotos-b.xx.fbcdn.net/hphotos-prn1/547044_430246950357496_951372999_n.jpg |
05:47:00 | <chronomex> | hahaha |
05:47:00 | <underscor> | bahahaha |
05:53:00 | <SketchCow> | I'm proposing an Internet Archive kickstarter. Let's see how that flies. |
05:53:00 | <SketchCow> | Done right, instant $500k |
05:53:00 | <SketchCow> | That would be good |
06:26:00 | <SketchCow> | http://justsolve.archiveteam.org/index.php/FAQ |
10:48:00 | <alard> | SketchCow: http://ia601202.us.archive.org/3/items/test-memac-index-test/tabblo.html#teodorapopa |
10:53:00 | <SketchCow> | Thanks much. |
15:15:00 | <dragondon> | Just started my warrior, getting nothing but "Tracker rate limiting is in effect. Retrying after 30 seconds..." |
15:18:00 | <ersi> | dragondon: It's okay. It's intended. We're slowing down/pausing the Webshots archival project for the moment |
15:18:00 | <dragondon> | ah, guess I'll switch to something else. |
15:19:00 | <alard> | Change it to "ArchiveTeam's Choice"! |
15:19:00 | <ersi> | You can leave it on if you'd like, it'll get work to do - just not as often for the time being. Or you might switch to one of the other projects, like AT's choice |
15:19:00 | <alard> | I've just pointed that to the URLTeam project and will switch it back to Webshots when we continue that. |
15:19:00 | <dragondon> | yeah, just switched to AT Choice. |
15:19:00 | <flaushy> | \o/ my nas is dominating the recent stats ... slowest thing to turn in work late ;) |
15:20:00 | <alard> | dragondon: Great. |
15:21:00 | <flaushy> | alard: after about 15 mins / 800 pages wikipediareview gives me HTTP 400s |
15:21:00 | <flaushy> | could a fresh cookie help at that point? |
15:21:00 | <alard> | flaushy: Ah, yes, I saw your message. |
15:22:00 | <alard> | I don't know. You could try, or you could try with more time between requests. |
15:22:00 | <alard> | (Problem is: how do you get Wget to ask for a fresh cookie?) |
15:23:00 | <flaushy> | overwrite it in a second process? |
15:23:00 | <flaushy> | but i am probably too naive, i ll try :) |
15:24:00 | <alard> | I'm not sure if it reads the cookie file. |
15:24:00 | <alard> | So it isn't an IP-based block? |
15:25:00 | <flaushy> | it wasn't |
15:25:00 | <flaushy> | at least i could browse the forums |
15:25:00 | <alard> | Does it give any browsable error messages? |
15:25:00 | <alard> | (Error messages you could search for on Google, I mean.) |
15:26:00 | <flaushy> | checking |
15:30:00 | <flaushy> | google suggest using sane user agents |
15:31:00 | <alard> | I think just going slower might help. Invision power board seems to have a lot of ways to limit the number of X per second. |
15:32:00 | <flaushy> | ok slowly crawling := |
15:32:00 | <alard> | It's not going away soon, is it? |
15:33:00 | <flaushy> | na, it was more a "we should get it sometime" i think |
15:47:00 | <flaushy> | ok running with with wait 10 and random-wait |
16:44:00 | <soultcer> | That's a lot of URLTeam users: http://tracker.tinyarchive.org/v1/ |
17:01:00 | <ersi> | I stopped my workers when the tracker kept resetting every few days |
17:15:00 | <soultcer> | Oh, it's not resetting, I'm just draining it |
17:17:00 | <soultcer> | I think I'll have to add an all-time leaderboard that saves the number of tasks done by each user, even when I remove the finished tasks |
17:26:00 | <flaushy> | oh 2 of my workers stopped -.- |
17:27:00 | <flaushy> | soultcer: running into no buffer space available on my vps |
17:27:00 | <soultcer> | Can you paste the exact error message? |
18:48:00 | <SketchCow> | > x-archive-meta-title:Mirror of SAMPLES.MPLAYERHQ.HU - Multimedia Samples Archive |
18:48:00 | <SketchCow> | > Content-Length: 57073367040 |
18:48:00 | <SketchCow> | So that's happening. |
18:55:00 | <SmileyG> | soultcer: plz do, i asked for that long ago |
18:56:00 | <SmileyG> | statswhore me! |
19:14:00 | <bsmith094> | any other projects i could help with? webshots is rate limited apparently |
19:14:00 | <bsmith094> | remote server so not earrior |
19:14:00 | <bsmith094> | warrior |
19:16:00 | <flaushy> | urlteam :) |
19:21:00 | <SketchCow> | just solve the problem |
19:22:00 | <SketchCow> | archiveteam wiki |
19:27:00 | <ersi> | bsmith094: Yeah, like flaushy and SketchCow said: 1) help add content to http://justsolve.archiveteam.org 2) help update and pretty up http://archiveteam.org 3) urlteam or AT's choice |
19:28:00 | <SketchCow> | We're dealing with a small slowdown, please be patient about that. |
19:29:00 | <ersi> | ie. take the ADHD meds |
19:29:00 | <ersi> | and possibly a beer |
19:29:00 | <SketchCow> | At the same time? |
19:33:00 | <ersi> | mayhapples |
19:33:00 | <ersi> | most likely; no |
21:48:00 | <bsmith094> | does urlteam have a script? |
21:50:00 | <ersi> | Do you mean a pipeline script? Yes |
21:51:00 | <bsmith094> | where? |
21:52:00 | <bsmith094> | i dont think i can run the warrior on cli, so i need a pipeline script |
21:57:00 | <ersi> | I don't know where. But soultcer does, I think. Or you run the GUI |
21:57:00 | <ersi> | s/GUI/Warrior/ |
21:57:00 | <ersi> | the warrior has an API, you can HTTP commands to it |
21:59:00 | <alard> | bsmith094: https://github.com/soult/tinyback/ |
21:59:00 | <alard> | But if you want to run it yourself, you might be better of running ./run.py directly. |
22:04:00 | <bsmith094> | alard: how, the instructions are vague |
22:04:00 | <alard> | bsmith094: I have not tried it, but the pipeline.py gives an example. |
22:05:00 | <alard> | ./run.py -h |
22:08:00 | <bsmith094> | alard: well i feel stupid for not realizing that, thank:) |
22:12:00 | <alard> | bsmith094: But I have to agree with you that the readme instructions under "How to run TinyBack" aren't exactly helpful. :) Perhaps, once you figure out what to do, you should send soultcer a patch. |
22:13:00 | <bsmith094> | alard: run this screen -SL grab ./run.py --tracker=http://tracker.tinyarchive.org/v1/ --sleep=20 --one-task --temp-dir=./data --username=bsmith093 -d -c |
22:14:00 | <alard> | Thank you (I have a warrior). |