00:01:19icedice quits [Quit: Leaving]
00:02:02cascode quits [Read error: Connection reset by peer]
00:02:17cascode joins
00:17:49hyenatown quits [Client Quit]
00:29:07etnguyen03 quits [Client Quit]
00:35:53BearFortress quits []
00:56:44sec^nd quits [Remote host closed the connection]
00:57:02sec^nd (second) joins
01:09:51etnguyen03 (etnguyen03) joins
01:11:28BornOn420 quits [Remote host closed the connection]
01:11:59BornOn420 (BornOn420) joins
01:17:06BlueMaxima joins
01:21:24beastbg8 (beastbg8) joins
01:22:35utulien joins
01:23:33beastbg8__ quits [Ping timeout: 260 seconds]
01:49:14threedeeitguy quits [Ping timeout: 250 seconds]
02:11:19tmob joins
02:44:28riteo (riteo) joins
02:54:40mls quits [Ping timeout: 250 seconds]
02:56:08Webuser283951 joins
02:57:34Webuser283951 quits [Client Quit]
02:59:17some_body3 joins
02:59:48some_body quits [Read error: Connection reset by peer]
02:59:48some_body3 is now known as some_body
03:08:28emcapi joins
03:10:42cascode quits [Ping timeout: 250 seconds]
03:12:51cascode joins
03:15:36tmob quits [Read error: Connection reset by peer]
03:16:40mls (mls) joins
03:16:48tmob joins
03:31:45<Hans5958>> It has an OAuth flow even for anonymous access.
03:31:45<Hans5958>Microsoft sites be like redirecting to god knows how many auth endpoints just to get me logged out (because I'm not logged in)
03:33:03jinn6 quits [Ping timeout: 260 seconds]
03:34:45<nicolas17>Hans5958: https://transfer.archivete.am/inline/S0jkc/sso-be-like.mp4
03:35:03<nulldata>Yeah the answers site was infuriating when you followed from a Google search. Going back would just put you in a loop back to the same page
03:35:28<@JAA>Still is, but was, too.
03:42:39etnguyen03 quits [Client Quit]
03:44:36seab joins
03:45:57Webuser609773 joins
03:46:42<Webuser609773>hi. Has anyone tried to extract 3D files from clara.io WARC?
03:48:42etnguyen03 (etnguyen03) joins
03:53:05yarnover|m joins
03:53:19seab quits [Client Quit]
03:55:30jinn6 joins
04:03:34<@JAA>Webuser705947: #clarity is the right channel for that.
04:04:00etnguyen03 quits [Remote host closed the connection]
04:05:01<LunarianBunny1147>nicolas17: never seen a site redirect **that** much before, that's way too many redirects
04:07:28tmob quits [Ping timeout: 260 seconds]
04:08:14Webuser609773 quits [Client Quit]
04:13:01driib9 quits [Read error: Connection reset by peer]
04:13:14driib9 (driib) joins
04:15:27BlueMaxima quits [Read error: Connection reset by peer]
04:39:58cascode quits [Ping timeout: 250 seconds]
04:44:37cascode joins
05:24:14DogsRNice quits [Read error: Connection reset by peer]
05:31:36moth_ joins
05:38:28cascode quits [Ping timeout: 260 seconds]
05:39:22cascode joins
05:44:16<Vokun>Some day in like 137 years when there is a small break with no emergency projects, it'll be nice to see mediafire get put as AT-Choice and watch it go brrr for once.
05:51:43Mateon1 quits [Quit: Mateon1]
05:52:48Mateon1 joins
06:08:52Overlordz quits [Quit: Leaving]
06:12:12alexlehm quits [Quit: ZNC - https://znc.in]
06:14:48<Hans5958><nicolas17> "Hans5958: https://transfer...." <- Reminds me of https://transfer.archivete.am/inline/12YlLL/2025-02-07_13-13-00_firefox.mp4
06:15:48<h2ibot>PaulWise edited Mailing Lists (+31, alpine sourcehut instance): https://wiki.archiveteam.org/?diff=54365&oldid=54361
06:21:33caylin quits [Quit: eaten by the grongler]
06:21:49caylin (caylin) joins
06:22:07Wohlstand (Wohlstand) joins
06:33:04SootBector quits [Remote host closed the connection]
06:33:25SootBector (SootBector) joins
07:01:25Webuser146348 joins
07:02:10Jonboy3451 joins
07:03:41Jonboy3452 joins
07:05:34Jonboy345 quits [Ping timeout: 250 seconds]
07:07:08Jonboy3451 quits [Ping timeout: 260 seconds]
07:28:06utulien quits [Ping timeout: 250 seconds]
08:07:51nomead joins
08:08:56Mateon1 quits [Remote host closed the connection]
08:11:08Mateon1 joins
08:17:01Webuser146348 quits [Client Quit]
08:25:24emcapi quits [Quit: Ooops, wrong browser tab.]
09:03:00Jonboy3452 quits [Ping timeout: 250 seconds]
09:05:12notarobot1 quits [Quit: Ping timeout (120 seconds)]
09:05:26notarobot1 joins
09:06:50qw3rty__ joins
09:10:22qw3rty_ quits [Ping timeout: 250 seconds]
09:11:02BearFortress joins
09:15:24nulldata quits [Quit: So long and thanks for all the fish!]
09:15:56nulldata (nulldata) joins
09:21:01territoryjazz quits [Quit: Ooops, wrong browser tab.]
09:27:22loug8318142 joins
09:30:04Island quits [Read error: Connection reset by peer]
09:34:51threedeeitguy (threedeeitguy) joins
09:51:14sausa joins
09:56:44Church quits [Ping timeout: 250 seconds]
10:06:32sausa quits [Client Quit]
10:13:24Webuser441414 joins
10:22:15Church (Church) joins
10:25:47TastyWiener956 (TastyWiener95) joins
10:25:52cow_2001 quits [Quit: ✡]
10:26:12TastyWiener95 quits [Ping timeout: 250 seconds]
10:26:13TastyWiener956 is now known as TastyWiener95
10:30:40cow_2001 joins
10:31:24TastyWiener95 quits [Ping timeout: 250 seconds]
10:35:09TastyWiener95 (TastyWiener95) joins
10:39:11nomead quits [Read error: Connection reset by peer]
10:41:12izolight joins
10:46:15linuxgemini6 (linuxgemini) joins
10:46:25linuxgemini quits [Read error: Connection reset by peer]
10:46:25linuxgemini6 is now known as linuxgemini
11:01:01LunarianBunny1147 quits [Quit: The Lounge - https://thelounge.chat]
11:01:55LunarianBunny1147 (LunarianBunny1147) joins
11:15:20izolight quits [Client Quit]
11:28:43Webuser441414 quits [Client Quit]
11:28:52Webuser160280 joins
11:43:48<@OrIdow6>Have lost messages from the last ~2 days, doubt there was anything for me, but if there was you'll need to notify me again
11:49:47Jake quits [Quit: Ping timeout (120 seconds)]
11:50:02Jake (Jake) joins
11:51:33<@OrIdow6>nicolas17: Suggestion to run foro3djuegos in AB anyway for now, we'll only get a subset but if e.g. site has limited capacity might be a net gain
12:00:06Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
12:02:48Bleo18260072271962345 joins
12:20:05arch quits [Remote host closed the connection]
12:20:14arch joins
12:35:10SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
12:35:42SkilledAlpaca418962 joins
12:44:26Wohlstand quits [Quit: Wohlstand]
12:48:40<Hans5958>Wish there was VisualEditor on the wiki
12:55:17Wohlstand (Wohlstand) joins
13:08:47<hexa->all non-eol versions have it *cough*
13:16:00<arch>what is the difference between todo and "out" in the tracker? https://tracker.archiveteam.org/usgovernment/
13:16:26<@arkiver>arch: out are items that have been handed out to workers
13:16:35<@arkiver>they are moved from there and marked as done if workers return them
13:16:48<@arkiver>if they're not returned, the items are reclaimed at some point
13:19:31<arch>aaaaah gotcha okay, so still eligible to be picked up at some point but being worked on
13:19:35<arch>makes sense
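
The todo/out/done life cycle arkiver describes can be sketched as a toy model. This is only an illustration, not the tracker's actual code: the class name `ToyTracker`, the `reclaim_after` timeout, and the choice to put reclaimed items back at the end of todo are all assumptions.

```python
import time
from collections import deque


class ToyTracker:
    """Toy model of the todo -> out -> done flow (hypothetical, not the real tracker)."""

    def __init__(self, items, reclaim_after=3600.0):
        self.todo = deque(items)            # items waiting to be handed out
        self.out = {}                       # item -> time it was handed to a worker
        self.done = set()                   # items a worker has returned
        self.reclaim_after = reclaim_after  # assumed timeout in seconds

    def request_item(self, now=None):
        """Hand the next todo item to a worker and move it to 'out'."""
        now = time.time() if now is None else now
        if not self.todo:
            return None
        item = self.todo.popleft()
        self.out[item] = now
        return item

    def return_item(self, item):
        """A worker returned the item: move it from 'out' to 'done'."""
        if self.out.pop(item, None) is not None:
            self.done.add(item)

    def reclaim(self, now=None):
        """Move items that were never returned back into todo (assumed behaviour)."""
        now = time.time() if now is None else now
        stale = [i for i, t in self.out.items() if now - t >= self.reclaim_after]
        for item in stale:
            del self.out[item]
            self.todo.append(item)
        return stale
```

In this model an item in "out" is still eligible to be picked up again, but only after a reclaim pass has moved it back.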
13:21:02beastbg8_ joins
13:25:08beastbg8 quits [Ping timeout: 260 seconds]
13:30:32yasomi quits [Quit: ZNC 1.9.1 - https://znc.in]
13:35:30yasomi (yasomi) joins
13:37:31Wohlstand quits [Client Quit]
13:41:28katocala quits [Ping timeout: 260 seconds]
13:41:42katocala joins
14:00:57Webuser692740 quits [Quit: Ooops, wrong browser tab.]
14:02:28katocala quits [Ping timeout: 260 seconds]
14:03:05katocala joins
14:31:06<arch>yay i can now automatically scale how many warriors i have running https://git.gmem.ca/arch/infra/commit/d53edb7a56d18983d09b5f94c30b73833512ae09
14:31:10<arch>...really badly but it works
14:32:31<arch>would've preferred to run it in the same place as the webserver, but python threading
14:34:33Webuser219093 joins
14:40:54<kiska>arch: If you want you can query influxdb for the other stats :D
14:42:35<arch>kiska: i was wondering about that, is there a wiki page or something?
14:43:31<kiska>No, but maybe I am misunderstanding what you're asking for. I have been awake for 18 hours at this point
14:46:36<arch>basically i just want to pull stats to autoscale my own warriors :p the http api works fine in this case
14:47:00<kiska>Yeah :D
14:50:47nomead joins
14:56:14<nicolas17>OrIdow6: good point
15:03:02<h2ibot>Nicolas17v2 edited Foro 3DJuegos (+71, Archivebot project started): https://wiki.archiveteam.org/?diff=54366&oldid=54346
15:10:00Mateon1 quits [Remote host closed the connection]
15:10:01Wohlstand (Wohlstand) joins
15:11:15Mateon1 joins
15:16:06<AK>Ran https://support.captureone.com/hc/en-us/community/topics through AB with useragent firefox and we got an instant 403, (8w03stgm0ihey2y0p3dx0six) so not sure we'll be able to AB it
15:16:10<AK>(It's behind CF)
15:19:40Wohlstand quits [Client Quit]
15:21:56caylin quits [Read error: Connection reset by peer]
15:22:15caylin (caylin) joins
15:31:09<h2ibot>Nulldata edited Deathwatch (+220, Added Capture One): https://wiki.archiveteam.org/?diff=54367&oldid=54363
15:42:04<@OrIdow6>nicolas17: :thumbsup:
15:44:42cascode quits [Ping timeout: 250 seconds]
15:45:23<myself>How often do individual warriors check what AT's-choice is? Could we time-modulate different projects into the AT's choice, a few seconds at a time, to get different fractions of warriors working on different projects instead of a single thundering herd all on one?
15:47:30cascode joins
15:51:09<masterx244|m>or "shard" it based on the IP address of the warrior request like last octet of IP%eligible_projects (could be weighted on the projects array)
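
A minimal sketch of the sharding idea above, assuming the weights are small positive integers and that weighting is done by repeating a project in the array. The function name `pick_project` and the example project names are hypothetical; nothing here reflects how warrior-hq actually works.

```python
import ipaddress


def pick_project(ip, weighted_projects):
    """Map a warrior's IP to a project using last octet % number of slots.

    weighted_projects is a list of (name, weight) pairs with positive
    integer weights; a project with weight 3 fills three slots.
    """
    slots = [name for name, weight in weighted_projects for _ in range(weight)]
    if not slots:
        raise ValueError("no eligible projects")
    # Last byte of the packed address: the last octet for IPv4.
    last_octet = ipaddress.ip_address(ip).packed[-1]
    return slots[last_octet % len(slots)]


if __name__ == "__main__":
    projects = [("project-a", 1), ("project-b", 3)]  # hypothetical names
    for ip in ("192.0.2.8", "192.0.2.9", "192.0.2.10"):
        print(ip, pick_project(ip, projects))
```

The split is deterministic per IP, so a given warrior keeps getting the same answer until the project list or the weights change.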
15:58:34midou quits [Ping timeout: 250 seconds]
15:59:08cascode quits [Ping timeout: 260 seconds]
15:59:45cascode joins
16:00:36midou joins
16:13:43midou quits [Ping timeout: 260 seconds]
16:18:34<@OrIdow6>I've thought we could check the irsr and divert them based on that
16:20:37<@OrIdow6>As for how it selects them I believe it's this https://github.com/ArchiveTeam/warrior-hq
16:20:46<@OrIdow6>And some stuff in seesaw.py that's an API client
16:20:50<@OrIdow6>*warrior.py
16:29:39lflare (lflare) joins
16:36:42cascode quits [Ping timeout: 250 seconds]
16:37:21cascode joins
16:53:37<@arkiver>OrIdow6: what is irsr in this case?
16:55:10<myself>item request serve rate, I had assumed? to tell which projects are more/less busy than others and need more/fewer workers?
16:55:24<@arkiver>i'm horrible with abbreviations
16:55:26<@arkiver>thanks myself :)
16:55:35<@arkiver>(funny to write that)
16:55:39myself bows
16:56:06<@arkiver>even when abbreviations are very obvious, it just doesn't ring a bell with me most of the time
17:00:57<myself>Ah, I'm fueled by abbreviations, I absorb them and fit them to the world around me. I used to be able to walk through a room of telecom equipment I'd never seen before, glance at the labels on the circuit packs and what sort of cabling came in/out, make assumptions about their abbreviations, then infer what each machine must be doing.
17:02:01<myself>This was great on Lucent and Fujitsu and DSC and Pulsecom equipment, but completely fell flat on Nortel whose abbreviations I assume were made French-first or something, I couldn't make heads or tails of that stuff without concerted study of the manuals.
17:03:17<nicolas17>OrIdow6: foro3djuegos started returning 403 and had to be slowed down :/
17:03:21<nicolas17>on AB
17:09:02sec^nd quits [Remote host closed the connection]
17:09:22sec^nd (second) joins
17:11:10<TheTechRobo>Is the order of queues todo, todo:backfeed, todo:redo, todo:secondary?
17:11:54midou joins
17:16:43midou quits [Ping timeout: 260 seconds]
17:18:19<TheTechRobo>Also, is reclaim rate the percentage of items in the tracker that have been reclaimed at least once, or is it something else?
17:18:58<TheTechRobo>(Not to be confused with "reclaim serve rate", which I'm guessing means the percentage of item requests that result in an item being reclaimed)
17:24:07breadbrix quits [Quit: Ooops, wrong browser tab.]
17:29:08benjins2_ quits [Ping timeout: 250 seconds]
17:31:40<h2ibot>TheTechRobo edited Tracker (+1731, More info, mostly on stats): https://wiki.archiveteam.org/?diff=54368&oldid=54031
17:31:41<TheTechRobo>^ please correct that if it is wrong
17:35:36midou joins
17:35:41<h2ibot>TheTechRobo edited Archiveteam:Acronyms (+143, Add irsr and rtt): https://wiki.archiveteam.org/?diff=54369&oldid=53989
17:36:35benjins2 joins
17:37:52Webuser587589 joins
17:38:37<Webuser587589>Hello everyone, can someone help me find the image address for the archive team choice project? I'd like for my containers to automatically contribute to the archive team choice, whatever that might be. Thank you!
17:39:51<that_lurker>Webuser587589: The warrior image has that option.
17:40:34<that_lurker>https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior
17:48:06<@arkiver>myself: hah, yeah, that is the exact opposite of me :P
17:48:21<@arkiver>TheTechRobo: almost, switch todo:secondary and todo:redo
17:48:33<@arkiver>todo, todo:backfeed, todo:secondary, todo:redo
17:48:41lennier2 joins
17:49:44<h2ibot>TheTechRobo edited Tracker (+0, Fix queue order): https://wiki.archiveteam.org/?diff=54370&oldid=54368
17:49:53<TheTechRobo>arkiver: Thanks!
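
The queue order arkiver gives can be sketched as follows, assuming that "order" means earlier queues are drained before later ones. That reading, and the `next_item` helper, are assumptions made for illustration and not the tracker's code.

```python
# Queue order as stated by arkiver in the log.
QUEUE_ORDER = ["todo", "todo:backfeed", "todo:secondary", "todo:redo"]


def next_item(queues):
    """Return (queue_name, item) from the first non-empty queue, or None.

    queues maps a queue name to a list of items; missing queues count as empty.
    """
    for name in QUEUE_ORDER:
        queue = queues.get(name)
        if queue:
            return name, queue.pop(0)
    return None
```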
17:50:22lennier2_ quits [Ping timeout: 250 seconds]
18:05:47Ashurbinary joins
18:12:55utulien joins
18:16:42seab joins
18:16:43<Webuser587589>that_lurker I'm not seeing the image address for the warrior image. I've tried combinations of archiveteam-grab, warrior-grab etc and haven't figured it out. Can you please give me some more direction?
18:17:17<nicolas17>I'm not sure if the warrior is in docker
18:17:23<nicolas17>if you want to use docker, use a specific project
18:18:36<TheTechRobo>the Warrior is in Docker, that's what the VM uses
18:18:46<TheTechRobo>I believe the image address is atdr.meo.ws/archiveteam/warrior-dockerfile
18:20:12<TheTechRobo>indeed: https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior#Advanced_usage_(container_only)
18:23:20Webuser841483 joins
18:25:52seab quits [Client Quit]
18:26:02Webuser841483 quits [Client Quit]
18:31:03<@OrIdow6>nicolas17: I assume those are should-be-429s 403s and not "site is globally overloaded" 403s?
18:31:16<@OrIdow6>Yes item request serve rate
18:34:01<nicolas17>OrIdow6: hard to tell, we slowed down and it's working again
18:35:25<h2ibot>TheTechRobo edited Tracker (+254, Add typical queue usages): https://wiki.archiveteam.org/?diff=54371&oldid=54370
18:35:26<h2ibot>TheTechRobo edited ArchiveTeam Warrior (+69, /* I'm looking at the leaderboard. What do the…): https://wiki.archiveteam.org/?diff=54372&oldid=54364
18:35:28<@arkiver>using item request serve rate could be a solution
18:36:03<@arkiver>what I'd rather see though, since it gives more control and is doable when there's not a huge number of projects, is a factor per project
18:37:29<@arkiver>from those factors a total can be calculated, and the percentage each factor is of this total. then the warrior on the default project evaluates this every hour or half hour, and uses this percentage as the probability of switching to a different project
18:37:47<@arkiver>might have to put something in place to not cause switches without need, which may be more difficult
18:38:58<@arkiver>the probability for a worker to run a project could be adjusted according to the number of workers that actually run that project
18:39:55<@arkiver>so we'd have "admin set fraction", and "live situation adjusted fraction" (probably bad name)
18:40:13<@arkiver>maybe the difference between those can be used to regulate switches, so a warrior does not switch from project to project every hour
18:40:28<@arkiver>(not sure if i'm being entirely clear)
18:40:48<@arkiver>this would be set in the JSON file pulled by the warriors
18:43:59<@arkiver>a separate process pings an endpoint every minute noting which project they're running, and that is how we keep track of what percentage is on what project, and adjust accordingly
18:45:27<@arkiver>opinions very welcome ^
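
One way to read the factor proposal above, as a rough sketch. The function names, the `tolerance` parameter, and the rule "only switch away from an over-staffed project, and only towards under-staffed ones" are assumptions added here to illustrate the "admin set fraction" versus "live situation adjusted fraction" idea; the log does not specify them.

```python
import random


def target_fractions(factors):
    """Admin-set fractions: each project's factor divided by the total."""
    total = sum(factors.values())
    if total <= 0:
        raise ValueError("factors must sum to a positive number")
    return {name: f / total for name, f in factors.items()}


def adjusted_fractions(factors, live_counts):
    """Live-adjusted fractions: weight projects by how under-staffed they are."""
    targets = target_fractions(factors)
    total_live = sum(live_counts.values())
    if total_live == 0:
        return targets
    deficit = {
        name: max(t - live_counts.get(name, 0) / total_live, 0.0)
        for name, t in targets.items()
    }
    total_deficit = sum(deficit.values())
    if total_deficit == 0:
        return targets
    return {name: d / total_deficit for name, d in deficit.items()}


def choose_project(current, factors, live_counts, tolerance=0.05, rng=random):
    """Stay put unless the current project is over-staffed beyond the tolerance."""
    targets = target_fractions(factors)
    total_live = sum(live_counts.values())
    if current in targets and total_live:
        live_share = live_counts.get(current, 0) / total_live
        if live_share <= targets[current] + tolerance:
            return current  # avoids switching without need
    adjusted = adjusted_fractions(factors, live_counts)
    names = list(adjusted)
    return rng.choices(names, weights=[adjusted[n] for n in names], k=1)[0]
```

Here `factors` would come from the JSON file the warriors pull and `live_counts` from the per-minute pings. The tolerance is one possible way to keep a warrior from hopping between projects on every evaluation.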
18:54:22HP_Archivist quits [Quit: Leaving]
19:02:16Larsenv quits [Quit: The Lounge - https://thelounge.chat]
19:03:30Larsenv (Larsenv) joins
19:05:31Larsenv quits [Client Quit]
19:06:46Larsenv (Larsenv) joins
19:07:13notarobot1 quits [Quit: The Lounge - https://thelounge.chat]
19:07:33PredatorIWD25 quits [Ping timeout: 260 seconds]
19:07:42notarobot1 joins
19:19:43TastyWiener95 quits [Read error: Connection reset by peer]
19:20:20TastyWiener95 (TastyWiener95) joins
19:40:13TastyWiener95 quits [Ping timeout: 260 seconds]
19:42:30Webuser705947 quits [Quit: Ooops, wrong browser tab.]
19:50:08cascode quits [Ping timeout: 260 seconds]
19:50:58cascode joins
19:54:37cascode quits [Read error: Connection reset by peer]
19:54:50cascode joins
19:58:18TastyWiener95 (TastyWiener95) joins
20:01:24lennier2_ joins
20:03:24lennier2 quits [Ping timeout: 250 seconds]
20:03:46tmob joins
20:10:39scurvy_duck joins
20:13:03Webuser587589 quits [Client Quit]
20:18:35anarcat quits [Quit: rebooting]
20:31:25anarcat (anarcat) joins
20:37:23TastyWiener95 quits [Ping timeout: 260 seconds]
20:38:20TastyWiener95 (TastyWiener95) joins
20:44:58scurvy_duck quits [Ping timeout: 260 seconds]
20:45:06anarcat quits [Client Quit]
20:47:47anarcat (anarcat) joins
20:49:32BornOn420 quits [Excess Flood]
20:50:20<AK>I guess one option is warriors won't change more often than say once per hour. Unless a project is then marked as "disabled" in which case they'll switch to default regardless of their switching timeout
20:51:11<AK>It's almost some kind of "sticky" project, that also avoids suddenly swarming a project with thousands of warriors that have all finished their check at the same time
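
AK's "sticky" project idea might look something like this. It is a sketch under assumptions: the class name `StickyChoice`, the one-hour `min_interval`, the random `jitter` used to spread out checks, and the literal project name `"default"` are all made up for illustration.

```python
import random


class StickyChoice:
    """Keep a warrior on its project for at least min_interval seconds."""

    def __init__(self, min_interval=3600.0, jitter=600.0, rng=random):
        self.min_interval = min_interval  # minimum time between switches
        self.jitter = jitter              # extra random delay to avoid herds
        self.rng = rng
        self.current = None
        self.next_switch_at = 0.0

    def update(self, now, recommended, disabled=frozenset(), default="default"):
        """Return the project to run at time `now`."""
        if self.current is not None and self.current in disabled:
            # Disabled projects are left immediately, ignoring the timeout.
            self.current = default
            return self.current
        target = default if recommended in disabled else recommended
        due = now >= self.next_switch_at and target != self.current
        if self.current is None or due:
            self.current = target
            self.next_switch_at = (
                now + self.min_interval + self.rng.uniform(0, self.jitter)
            )
        return self.current
```

Because each warrior draws its own jitter, warriors that started a check at the same time do not all become eligible to switch at the same moment.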
20:51:17BornOn420 (BornOn420) joins
20:58:23BlueMaxima joins
21:09:58scurvy_duck joins
21:28:58BoxcarVillain6 joins
21:29:10BoxcarVillain6 quits [Client Quit]
21:29:22BoxcarVillain joins
21:44:33theozzone joins
21:45:34nomead quits [Quit: Leaving]
21:53:34BoxcarVillain quits [Client Quit]
22:00:57dfish joins
22:08:23scurvy_duck quits [Ping timeout: 260 seconds]
22:16:26utulien quits [Ping timeout: 250 seconds]
22:20:20Snivy quits [Ping timeout: 250 seconds]
22:21:16Snivy (Snivy) joins
22:23:52<h2ibot>Cooljeanius edited YouTube (+27, /* Archive Team project */ link to…): https://wiki.archiveteam.org/?diff=54373&oldid=54320
22:25:53<h2ibot>Cooljeanius edited Government Backup (+4, minor wikification): https://wiki.archiveteam.org/?diff=54374&oldid=28964
22:29:35DogsRNice joins
22:31:54<h2ibot>Cooljeanius edited NOAA (+21, minor wikification and update): https://wiki.archiveteam.org/?diff=54375&oldid=53690
22:33:54<h2ibot>Cooljeanius edited US Government (+108, add "See Also" section): https://wiki.archiveteam.org/?diff=54376&oldid=54351
22:35:55<h2ibot>Cooljeanius edited US Government (+33, separate section for references): https://wiki.archiveteam.org/?diff=54377&oldid=54376
22:36:55<h2ibot>Cooljeanius edited Ftp-gov (+4, minor wikification): https://wiki.archiveteam.org/?diff=54378&oldid=49299
22:39:56<h2ibot>Cooljeanius edited USA-Gov (+20, minor wikification and update): https://wiki.archiveteam.org/?diff=54379&oldid=30034
22:43:58tmob quits [Ping timeout: 260 seconds]
22:45:57useretail quits [Remote host closed the connection]
22:48:17theozzone quits [Client Quit]
22:50:56etnguyen03 (etnguyen03) joins
23:01:42dfish quits [Client Quit]
23:09:44lennier2_ quits [Quit: Going offline, see ya! (www.adiirc.com)]
23:11:08scurvy_duck joins
23:18:24cascode quits [Ping timeout: 250 seconds]
23:19:07cascode joins
23:21:20sec^nd quits [Remote host closed the connection]
23:21:37sec^nd (second) joins
23:24:48Webuser160280 quits [Quit: Ooops, wrong browser tab.]
23:25:04Island joins
23:30:06cascode quits [Ping timeout: 250 seconds]
23:30:23cascode joins
23:33:33utulien joins
23:48:21loug8318142 quits [Quit: The Lounge - https://thelounge.chat]
23:52:16HP_Archivist (HP_Archivist) joins
23:53:07tmob joins
23:53:51HP_Archivist quits [Client Quit]