00:39:04AlsoHP_Archivist quits [Client Quit]
00:39:21HP_Archivist (HP_Archivist) joins
01:02:50dm4v quits [Ping timeout: 250 seconds]
01:03:38dm4v joins
01:03:40dm4v quits [Changing host]
01:03:40dm4v (dm4v) joins
01:09:21rewby quits [Ping timeout: 258 seconds]
01:22:54rewby (rewby) joins
01:27:06rsn joins
01:29:16rsn_ quits [Ping timeout: 250 seconds]
01:30:33rsn_ joins
01:31:54rsn_ quits [Client Quit]
01:33:07rsn quits [Ping timeout: 258 seconds]
01:40:42Arcorann__ joins
01:41:13Mineroboter_ joins
01:43:34Mineroboter quits [Ping timeout: 250 seconds]
01:57:21crispyalice2 (crispyalice2) joins
01:57:25crispyalice2 leaves
01:57:34crispyalice2 (crispyalice2) joins
02:17:15lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)]
02:17:33lennier1 (lennier1) joins
02:18:26<purplebot>Google Poly edited by Ajay (+1216, Add info about retrieving data …) just now -- https://www.archiveteam.org/?diff=46522&oldid=46512
02:27:10ThreeHeadedMonkey quits [Ping timeout: 258 seconds]
02:29:08ThreeHeadedMonkey (ThreeHeadedMonkey) joins
02:32:30BlueMaxima joins
03:01:08gfhj joins
03:02:11gfhj leaves
03:12:24nathan quits [Ping timeout: 250 seconds]
03:18:19nathan joins
03:38:41Eighty (Eighty) joins
03:46:05Wayward (wayward) joins
03:50:25atphoenix_ is now known as atphoenix
03:50:58Arcorann__ quits [Ping timeout: 250 seconds]
03:51:52DogsRNice_ quits [Read error: Connection reset by peer]
03:53:50etnguyen03 quits [Client Quit]
04:05:24<atphoenix>OrIdow6, I know that sites have been put back online by original, sympathetic webmasters in cooperation with AT in a way that *only* AT could still access the site in order to complete an archiving effort. The opposite has also happened, like with Yahoo and Nintendo
05:30:12aleph quits [Ping timeout: 250 seconds]
05:34:44aleph joins
05:46:14aleph quits [Ping timeout: 250 seconds]
05:51:42aleph joins
06:16:44wickedplayer494 quits [Remote host closed the connection]
06:22:26wickedplayer494 (wickedplayer494) joins
06:22:26wickedplayer494 quits [Excess Flood]
06:22:47wickedplayer494 joins
06:27:35Arcorann__ joins
06:27:54Hyenadae joins
06:27:59Hyenadae leaves
06:32:10hooway joins
06:52:49Arcorann__ quits [Ping timeout: 258 seconds]
07:23:51LeighR (LeighR) joins
07:27:00Viniter (Viniter) joins
07:29:14Viniter quits [Client Quit]
08:07:00atphoenix quits [Remote host closed the connection]
08:07:43atphoenix (atphoenix) joins
08:09:40yawkat quits [Ping timeout: 250 seconds]
08:13:49Arcorann__ joins
08:21:04yawkat (yawkat) joins
08:31:36Arcorann_ joins
08:35:14Arcorann__ quits [Ping timeout: 250 seconds]
09:10:41<cadence>@jodizzle: Hi, I'm one of the people who spearheaded the youtube annotations archive project, and while the chat logs are lost to time, I believe the trust system worked very well for us.
09:12:23<cadence>@AK: in the annotations archive, when a worker submitted correct data its trust would increase, and higher trust means less (but never zero) chance to redo existing work for verification.
09:13:11<AK>Ooh that's clever
09:13:42<cadence>though here we have the problem that the annotation data is always the same when requested multiple times - it's an XML - but Y!A's page almost certainly will change if people submit new answers to the question, or even if there's a "recommended" section that's somewhat randomly generated.
09:14:21<AK>I think that's gonna be the issue, static vs potentially dynamic content
09:14:24<cadence>the calculation was probably something like...
09:14:35<AK>For an api (or annotations), the trust works well
09:14:56<cadence>1/($trust+1) chance to redo existing work
09:15:01<cadence>if work is correct, $trust++
09:15:21<cadence>if work is incorrect, $trust-=10
09:15:34<cadence>negative trust means you can't do work anymore, kill the access token
09:15:44<cadence>and then only allow a certain number of access tokens to be generated per IP address
09:15:59<cadence>that's how we did it. though this was mainly to stop faulty workers rather than maliciously edited data.
09:16:38<cadence>might want to do something like $trust = min($trust/2, $trust-10) so that it drops off sharply even when high, to prevent somebody jacking up their trust ridiculously high before submitting spam
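The trust rules cadence outlines above can be sketched in Python. This is a minimal illustration, not the annotations project's actual code: every function name is hypothetical, integer division for `$trust/2` is an assumption, and the `sharp` flag selects the suggested `min($trust/2, $trust-10)` penalty instead of the plain `-10`.

```python
import random


def recheck_probability(trust):
    """Chance of redoing a worker's existing work: 1/(trust+1)."""
    if trust < 0:
        # Negative trust means the access token should already be dead.
        raise ValueError("negative trust: token should be revoked")
    return 1.0 / (trust + 1)


def should_recheck(trust, rng=random):
    """Randomly decide whether to hand this worker's item out again."""
    return rng.random() < recheck_probability(trust)


def update_trust(trust, correct, sharp=False):
    """Return the new trust value after one verified submission."""
    if correct:
        return trust + 1
    if sharp:
        # Suggested variant: drops off sharply even when trust is high.
        # Integer division is an assumption of this sketch.
        return min(trust // 2, trust - 10)
    return trust - 10


def is_revoked(trust):
    """Negative trust means the worker can't do work anymore."""
    return trust < 0
```

With trust 0 every item is rechecked, and the chance shrinks as trust grows but never reaches zero.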
09:16:59<cadence>/shrug
09:17:39<cadence>well, the issue isn't that the submissions need to be the _same,_ they just need to be _equivalent_
09:18:01<cadence>if you can afford some kind of server-side validation that checks the basic page structure, and extracts the main question body for comparison... that might work?
09:20:38hooway_ joins
09:21:55hooway quits [Client Quit]
09:21:55<cadence>a much simpler thing to do would be - assuming you already have a list of questions to scrape - hand them all out once, and after you reach the end, hand them all out again. only if there is a difference between the 1st and 2nd attempt of a question do you send it out a 3rd time. so if anybody tries to alter a question, they'd have to get really lucky and alter it twice somehow to not be caught.
09:21:55<cadence>this one assumes that we have enough time to scrape all questions twice, but it will at least try to go through all the questions a single time first.
09:21:55<cadence>food for thought!
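The simpler two-pass idea above could look roughly like this in Python. It is a sketch under stated assumptions: the function names, the dict-of-results shape, and the pluggable `equivalent` check (standing in for the "equivalent, not same" comparison mentioned earlier) are all hypothetical.

```python
def handout_order(questions):
    """Yield (question, attempt): the whole list once, then all of it again."""
    for attempt in (1, 2):
        for question in questions:
            yield question, attempt


def needs_third_attempt(first, second, equivalent=lambda a, b: a == b):
    """Return questions whose 1st and 2nd results differ.

    `first` and `second` map question id -> submitted result.
    A question missing from `second` is skipped here, since it has
    not had its second attempt yet.
    """
    return [
        q for q in first
        if q in second and not equivalent(first[q], second[q])
    ]
```

Passing a looser `equivalent` function (for example one that compares only the extracted question body) would let pages with changing "recommended" sections still count as matching.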
09:22:11hooway_ quits [Client Quit]
09:23:14LeGoupil joins
09:24:38hooway joins
09:44:10HP_Archivist quits [Ping timeout: 258 seconds]
09:46:15Arcorann__ joins
09:49:20Arcorann_ quits [Ping timeout: 250 seconds]
09:59:24BlueMaxima quits [Client Quit]
10:10:14<@Kaz>cadence: how do you authenticate workers
10:10:38<cadence>on first run, before they can do any work, they ask the central server for an access token.
10:11:03<cadence>if the requesting IP has not already generated too many tokens, the server will respond with a new access token with trust 0 that can be used
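A rough Python sketch of the token handout described above. The class name, the per-IP limit of 3, and the in-memory storage are assumptions for illustration only; the log does not say how the real server stored tokens or what the limit was.

```python
import secrets
from collections import defaultdict


class TokenServer:
    """Hands out access tokens, capped per requesting IP address."""

    def __init__(self, max_per_ip=3):  # limit value is an assumption
        self.max_per_ip = max_per_ip
        self.issued = defaultdict(int)  # ip -> tokens generated so far
        self.trust = {}                 # token -> current trust

    def request_token(self, ip):
        """Return a new token with trust 0, or None if the IP hit its cap."""
        if self.issued[ip] >= self.max_per_ip:
            return None
        token = secrets.token_hex(16)
        self.issued[ip] += 1
        self.trust[token] = 0
        return token

    def kill_token(self, token):
        """Revoke a token; the IP's generated-token count is not refunded."""
        self.trust.pop(token, None)
```

Because revoking a token does not free up the IP's quota, a faulty worker cannot simply keep requesting fresh tokens from the same address.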
10:12:03<@Kaz>ah ok, so it's actually per-instance rather than like, 'per human operator' or anything like that
10:13:03<cadence>yah
10:13:03<cadence>there's no real way to verify humans
10:13:03<cadence>(fuck you stripe)
10:27:02LeighR quits [Ping timeout: 244 seconds]
10:33:30<AK>I suppose another option would be per human operator
10:33:42<AK>Then we could remember people's trust levels for future projects
10:34:29<AK>(As in a token per operator that they use on all their instances)
10:34:48Viniter (Viniter) joins
10:40:43Mateon1 quits [Remote host closed the connection]
10:41:35Mateon2 joins
10:41:36Mateon2 is now known as Mateon1
11:26:42<flashfire42>https://twitter.com/RoyalFamily/status/1380475865323212800?s=20
11:34:21<cadence>claims to be pro-life; dies anyway
11:37:56<Wayward>:o
12:57:38Sylirana quits [Ping timeout: 244 seconds]
12:58:05Sylirana (Sylirana) joins
13:24:01brgtt joins
13:43:11etnguyen03 (etnguyen03) joins
13:44:54Doranwen quits [Ping timeout: 258 seconds]
13:49:05Doranwen (Doranwen) joins
14:27:06etnguyen03 quits [Ping timeout: 250 seconds]
14:32:10<@OrIdow6>atphoenix: Could work here, don't know what the specifics of Zopolis's (who has gone offline) situation are
14:44:48DopefishJustin quits [Remote host closed the connection]
14:45:10DopefishJustin joins
14:49:43lennier1 quits [Client Quit]
14:50:19lennier1 (lennier1) joins
14:57:45spirit quits [Client Quit]
15:07:25<purplebot>Google Poly edited by Ajay (+312) just now -- https://www.archiveteam.org/?diff=46523&oldid=46522
15:18:31brgtt2 joins
15:20:03Daloader_ joins
15:21:42brgtt quits [Ping timeout: 250 seconds]
15:32:46etnguyen03 (etnguyen03) joins
16:05:02Matthww joins
16:22:57LeGoupil quits [Client Quit]
16:25:56brgtt2 quits [Client Quit]
16:26:17Arcorann__ quits [Ping timeout: 258 seconds]
16:33:40Viniter quits [Client Quit]
16:35:59Viniter joins
16:48:25Viniter leaves
16:56:54Viniter (Viniter) joins
17:01:33Jonboy345 quits [Ping timeout: 258 seconds]
17:08:09<jodizzle>cadence: Thanks for this. I didn't realize that the point of the trust system was to stop faulty workers, rather than malicious actors, but that makes sense.
17:16:10Jonboy345 joins
17:19:58Viniter quits [Client Quit]
17:20:21Viniter (Viniter) joins
17:50:37x9fff00 quits [Ping timeout: 258 seconds]
17:52:16x9fff00 joins
17:57:13Hackerpcs quits [Quit: Hackerpcs]
17:59:09Hackerpcs (Hackerpcs) joins
18:21:56jacob joins
18:23:35aleph quits [Ping timeout: 258 seconds]
18:24:57DogsRNice (Webuser299) joins
18:27:11aleph joins
18:30:41Mateon1 quits [Remote host closed the connection]
18:30:49Mateon1 joins
18:31:19godane (godane) joins
18:43:38Daloader_ quits [Ping timeout: 250 seconds]
18:57:39<atphoenix>cadence: what is the reference to stripe? was that a human who tried to make trouble?
19:05:09Viniter quits [Client Quit]
19:05:53Viniter (Viniter) joins
19:44:54crispyalice2 quits [Client Quit]
20:03:38jacob quits [Ping timeout: 258 seconds]
20:35:07jacob joins
21:04:07Viniter quits [Client Quit]
21:08:22godane quits [Ping timeout: 250 seconds]
21:34:24Lord_Nightmare quits [Quit: ZNC - http://znc.in]
21:39:05luckcolors quits [Read error: Connection reset by peer]
21:39:29luckcolors (luckcolors) joins
21:40:42Lord_Nightmare (Lord_Nightmare) joins
21:41:53luckcolors quits [Client Quit]
21:45:56luckcolors (luckcolors) joins
21:48:52Nessie joins
21:52:17britmob25 quits [Quit: britmob25]
21:57:34britm0b quits [Read error: Connection reset by peer]
22:08:21<tech234a>perhaps Stripe the payment processor?
22:33:23hooway quits [Client Quit]
22:59:10luckcolors quits [Read error: Connection reset by peer]
22:59:22luckcolors (luckcolors) joins
23:04:42qw3rty joins
23:16:52Nessie quits [Remote host closed the connection]
23:29:33<billy549>when is archiveteam warrior shirts? as cool as the stickers are, iirc it's not handled by a main AT person (though by them being linked on the wiki i trust them ;p)
23:29:40<billy549>would be cool to do a shirt ;)
23:32:09<@hook54321>billy549: i'm not sure but i think the person selling the stickers might be the person that designed it
23:32:51<billy549>ahh oki
23:33:57<@arkiver>chfoo: you know anything on that? ^
23:46:00<@chfoo>i don't remember what license things were uploaded to the wiki, but it should have been one of the licenses that allows sharing and commercial use
23:47:29<@hook54321>no license
23:47:51<billy549>it's a very pretty logo
23:47:57<billy549>but no, i don't think ajh is the original designer
23:50:06nerdguy1138 quits [Quit: Leaving.]
23:52:03<@chfoo>oh, i think the logos were commissioned and licensing might be implied for reuse, but it was done before my time here
23:53:05BlueMaxima joins
23:53:45<@hook54321>I think I found the artist, I can ask. Either way I'm not sure a random person's redbubble store should be advertised on the wiki though.
23:55:21Sylirana quits [Ping timeout: 244 seconds]
23:55:39Sylirana (Sylirana) joins
23:59:11<billy549>yeah