00:35:00<kanzure>anyone have thoughts on removing watermarks from pdfs
00:35:00<kanzure>oops, forgot my question mark :( https://groups.google.com/group/science-liberation-front/t/c68964cf55d8f6fa
00:42:00<bsmith094>i hate ruby, it's probably not ruby's fault but it's the most nitpicky install of a language ever!!!
00:43:00<bsmith094>i'm just trying to run one script, and it wants to compile itself all over again!?!?
01:04:00<beardicus>hiya. i'm fixin' to update the reject-regex for yahooblogs grab, but the current regex seems off to me: [\\\\"\']
01:04:00<beardicus>am i wrong in thinking there's an extra backslash there?
01:05:00<beardicus>if, indeed, the character set is \ and " and '
01:32:00<nitro2k01>Four backslashes sounds like two are to get past the language's escaping, which gives you another two which end up in the expression
01:42:00<beardicus>aye. but there are only three unaccounted for. one is escaping the double quote.
01:42:00<beardicus>well. if it's not a mistake, alard can verbally abuse me :) pull request is pending, btw.
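
A minimal sketch of the two escaping layers discussed above, written in Python. The log does not say which language or quoting style the grab script uses, so the double-quoted Python literals below are an assumption for illustration, not the actual yahooblogs pattern. The helper `accepted_chars` is hypothetical.

```python
import re

# Assumption: the pattern is written inside a double-quoted string literal.
# Layer 1 (string literal): \\\\ becomes two backslashes, \" becomes ".
five_backslashes = "[\\\\\"']"
# Layer 2 (regex engine) receives: [ \ \ " ' ]
assert list(five_backslashes) == ["[", "\\", "\\", '"', "'", "]"]

# With only three backslashes, no literal backslash survives in the set:
# the regex engine receives [ \ " ' ] and reads \" as an escaped quote.
three_backslashes = "[\\\"']"
assert list(three_backslashes) == ["[", "\\", '"', "'", "]"]


def accepted_chars(pattern_text, candidates="\\\"'a"):
    """Return the candidate characters that the character class matches."""
    char_class = re.compile(pattern_text)
    return [ch for ch in candidates if char_class.fullmatch(ch)]


print(accepted_chars(five_backslashes))
print(accepted_chars(three_backslashes))
```

Under these assumptions, a set containing \ and " and ' needs four backslashes for the literal backslash plus one more escaping the double quote; with fewer, the backslash is no longer part of the set.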
02:20:00<tef>kanzure: yeah, download it twice, keep the content that is the same
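
The "download it twice" idea can be sketched roughly as follows, in Python. This assumes the watermark is a per-download stamp and that the two copies have already been reduced to comparable units (here, lists of extracted text lines); the function name `common_content` and the sample lines are hypothetical. Real PDFs have compressed streams and byte offsets, so a raw byte-level version of this would likely not yield a valid file.

```python
from difflib import SequenceMatcher


def common_content(first, second):
    """Return the items that appear, in order, in both sequences."""
    matcher = SequenceMatcher(a=first, b=second, autojunk=False)
    kept = []
    for block in matcher.get_matching_blocks():
        # Each block is a run of identical items; the final block has size 0.
        kept.extend(first[block.a:block.a + block.size])
    return kept


# Hypothetical extracted lines from two downloads of the same document.
copy_one = ["Title", "Downloaded by 10.0.0.1 on 2012-01-01", "Body text", "Page 1"]
copy_two = ["Title", "Downloaded by 10.0.0.2 on 2012-01-02", "Body text", "Page 1"]

print(common_content(copy_one, copy_two))
```

Only the lines that differ between the two copies are dropped; a watermark that is identical in every download would survive this comparison.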
09:11:00<Nemo_bis>ARGH
09:11:00<Nemo_bis>alard: if I upload a txt file to s3 with a .7z extension, will s3 delete an existing 7z file?
09:12:00<Nemo_bis>I just did something as stupid as that and now I fear I'm going to lose my 49 GiB 7zip :( https://www.us.archive.org/log_show.php?task_id=140173799
09:15:00<alard>Nemo_bis: I think it overwrites files, yes. But since the task has apparently failed, you have a little time to download the original file. (Or ask someone with a fast connection to do that.)
09:15:00<Nemo_bis>Ooh, relief. I found the "interrupt" button.
09:15:00<Nemo_bis>alard: I killed it. :)
09:15:00<Nemo_bis>Hopefully this shouldn't break anything.
09:16:00<alard>Is there an interrupt button?
09:17:00<alard>Perhaps you should rename the file on archive.org, just to be sure. (Perhaps the task is automatically restarted.)
09:17:00<Nemo_bis>It's waiting for admin.
09:17:00<Nemo_bis>There's an interrupt button on the history page, for admins.
09:17:00<Nemo_bis>As far as I know one should never use it. :p
09:19:00<Nemo_bis>And download is awfully slow even from a USA server, doesn't go above 2.4 MiB/s and averages at 1, meh.
09:20:00<Nemo_bis>hmm
09:20:00<Nemo_bis>SKIPPING UPDATE to ftp-ftp.rta.nato.int_archive.torrent IN /35/items/ftp-ftp.rta.nato.int... item (50155 MB) exceeds maximum size (25600 MB)
09:21:00<Nemo_bis>The rename worked, but waited for the other task anyway, I had to also pass that one.
09:43:00<kanzure>does anyone have a copy of the jstor charter/constitution?
14:23:00<alard>The wiki says: Warning: file_get_contents(/home/archivet/public_html/extensions/SpamBlacklist/wikimedia_blacklist) [function.file-get-contents]: failed to open stream: No such file or directory in /home/archivet/public_html/extensions/SpamBlacklist/SpamBlacklist_body.php on line 123
14:30:00<Nemo_bis>Maybe it was configured for a local file BL?
14:30:00<Nemo_bis>It should only be configured to fetch Meta-Wiki's blacklist and [[MediaWiki:Spamblacklist]]
14:31:00<alard>Perhaps I should add that everything still works, it's just a warning. It's not a very urgent problem.
14:32:00<Nemo_bis>alard: I took this screenshot and forgot to share it: http://imgur.com/Fnst3
14:32:00<Nemo_bis>IPs are not behaving http://archiveteam.org/index.php?title=GeoCities&curid=78&diff=9132&oldid=9131
14:35:00<Nemo_bis>And rollbacking is a pain, with captchas :)
14:36:00<alard>What's the word that rhymes with hiccups?
14:36:00<SketchCow>backups
14:36:00<Nemo_bis>Ah, I always have to skip that too
14:36:00<alard>Ah.
14:36:00<Nemo_bis>oh right :(
14:36:00<SketchCow>I'll replace that
14:36:00<alard>I'm going to make a captcha-bookmarklet.
14:37:00<Nemo_bis>SketchCow: you should perhaps disable anonymous editing and at the same time try removing the captcha for new links.
14:38:00<alard>Nemo_bis: That's a nice screenshot, but what browser is it? It doesn't look familiar.
14:38:00<SketchCow>Give me the LocalSettings.php
14:38:00<SketchCow>entries
14:39:00<SketchCow>I thought it was no anonymous editing
14:52:00<Nemo_bis>alard: firefox
14:53:00<Nemo_bis>alard: IIRC; but there are only the scrollbars, what are you looking at? ^^'
14:54:00<Nemo_bis>SketchCow: $wgGroupPermissions['*']['edit'] = false;
14:55:00<Nemo_bis>SketchCow: and after that, for captcha, if you wish $wgCaptchaTriggersOnNamespace[NS_MAIN]['addurl'] = false;
15:06:00<godane>good news
15:06:00<godane>i got the external images from thebox.bz
15:08:00<alard>https://gist.github.com/44e4e20da5777688dbe3
15:30:00<godane>SketchCow: this item is a typo: http://archive.org/details/cnetbuzz_120106_
15:31:00<godane>i kill that upload and reupload as cnetbuzz_120106
18:17:00<godane>i'm uploading all my thebox.bz forums warc.gz into one item
18:17:00<godane>most are very small so i decided to do it this way
21:37:00<Nemo_bis>aww, still anonymous edits http://archiveteam.org/index.php?title=Special:ListGroupRights