00:08:34<Washuu>Thank you for the answer. Cheers!
00:08:39Washuu leaves [Ooops, wrong browser tab.]
00:10:05Dada quits [Remote host closed the connection]
00:23:43etnguyen03 quits [Client Quit]
00:43:35<kiska>szczot3k I use https://github.com/nicholas-fedor/watchtower
00:55:31SootBector quits [Remote host closed the connection]
00:56:41SootBector (SootBector) joins
01:03:54Shard111 (Shard) joins
01:04:40Shard11 quits [Ping timeout: 256 seconds]
01:04:40Shard111 is now known as Shard11
01:09:06SootBector quits [Remote host closed the connection]
01:09:14etnguyen03 (etnguyen03) joins
01:10:15Webuser919909 joins
01:14:38SootBector (SootBector) joins
01:18:24Webuser919909 quits [Client Quit]
01:33:43xkey quits [Quit: WeeChat 4.8.1]
01:34:47xkey (xkey) joins
01:36:47<h2ibot>Cruller edited Deathwatch (+257, /* 2026-02 */ Add 47CLUB): https://wiki.archiveteam.org/?diff=60449&oldid=60429
02:33:55<h2ibot>PaulWise edited Steam (-2456, drop old how-can-i-help section): https://wiki.archiveteam.org/?diff=60450&oldid=59758
02:42:14CYBERDEV quits [Quit: Leaving]
02:56:26nexussfan quits [Read error: Connection reset by peer]
04:04:04theexit8 joins
04:04:39theexit8 quits [Client Quit]
04:14:38LddPotato quits [Read error: Connection reset by peer]
04:15:22LddPotato (LddPotato) joins
04:25:14LddPotato quits [Read error: Connection reset by peer]
04:25:58LddPotato (LddPotato) joins
04:28:25<triplecamera|m>justauser: Oh I see, you were replying to my previous message: "What does SFDW stand for?"
04:28:40<triplecamera|m>https://irclogs.archivete.am/archiveteam-bs/2026-02-05#l4faaf306
04:35:53LddPotato quits [Read error: Connection reset by peer]
04:36:41LddPotato (LddPotato) joins
04:38:39DogsRNice quits [Read error: Connection reset by peer]
04:46:40Island quits [Read error: Connection reset by peer]
04:54:58LddPotato quits [Read error: Connection reset by peer]
04:55:36LddPotato (LddPotato) joins
04:58:29etnguyen03 quits [Quit: Konversation terminated!]
04:59:16etnguyen03 (etnguyen03) joins
05:04:15n9nes quits [Ping timeout: 272 seconds]
05:05:28LddPotato quits [Read error: Connection reset by peer]
05:05:50n9nes joins
05:06:19LddPotato (LddPotato) joins
05:14:07etnguyen03 quits [Remote host closed the connection]
06:04:03nine quits [Quit: See ya!]
06:04:16nine joins
06:04:16nine quits [Changing host]
06:04:16nine (nine) joins
08:33:45<h2ibot>Bzc6p edited Talk:Wayback Machine (+251, /* Status as redirect */ be bold): https://wiki.archiveteam.org/?diff=60452&oldid=60402
09:24:24hexagonwin (hexagonwin) joins
09:54:33LddPotato quits [Read error: Connection reset by peer]
09:55:14LddPotato (LddPotato) joins
10:04:36Dada joins
10:05:03LddPotato quits [Read error: Connection reset by peer]
10:05:54LddPotato (LddPotato) joins
10:09:52iPwnedYourIOTSmartdog7 joins
10:12:04iPwnedYourIOTSmartdog quits [Ping timeout: 256 seconds]
10:12:04iPwnedYourIOTSmartdog7 is now known as iPwnedYourIOTSmartdog
10:15:59LddPotato quits [Read error: Connection reset by peer]
10:16:26ducky quits [Remote host closed the connection]
10:16:37ducky (ducky) joins
10:16:37LddPotato (LddPotato) joins
10:34:57LddPotato quits [Read error: Connection reset by peer]
10:35:34LddPotato (LddPotato) joins
10:37:48theexit86 joins
10:40:24<theexit86>Hello all, and many thanks to you all for your hard work on ArchiveTeam and its warriors. Firstly, my apologies if this is the wrong place or channel to ask, as the #archivebot channel seems busy. If so, please direct me to the correct place.
10:40:42<theexit86>I wish to apply to run an ArchiveBot pipeline on one of my servers. For the record, I've run several warriors since June of last year under the tracker username "theexit8", and sometimes joined their respective IRC channels to report issues.
10:41:03<theexit86>I have some logistics questions I'll ask later, but first and most important, does the ArchiveBot project truly need more hosts, or is it already at a good enough capacity? Asking as it seems to me the US Government warrior is fairly well connected and has few allocations to spare, so was wondering if this is/isn't the case with ArchiveBot too.
10:41:21<theexit86>Many thanks for reading, and if you're open to having me volunteer my computer, I'll be happy to install the pipeline and get registered to process jobs. Any issues afterwards I'll of course communicate them here.
10:45:31LddPotato quits [Read error: Connection reset by peer]
10:46:16LddPotato (LddPotato) joins
11:00:11ducky quits [Ping timeout: 272 seconds]
11:02:32NatTheCat quits [Quit: Ping timeout (120 seconds)]
11:02:59NatTheCat (NatTheCat) joins
11:15:12ducky (ducky) joins
11:16:22Sk1d joins
11:35:18Wohlstand (Wohlstand) joins
11:48:25<BennyOtt>OpenDiary redirects me to the page https://opendiaryhelp.zendesk.com/hc/en-us/articles/360041783451-Why-am-I-being-blocked-from-Open-Diary even at my home IP address, where I hadn't even run the crawler.
11:58:02Sk1d quits [Ping timeout: 256 seconds]
12:00:03Bleo1826007227196234552220 quits [Quit: The Lounge - https://thelounge.chat]
12:02:25Sk1d joins
12:02:48Bleo1826007227196234552220 joins
12:35:29Sk1d quits [Client Quit]
12:39:51sg72 quits [Quit: Leaving]
12:45:48sg72 joins
13:10:25Arcorann quits [Changing host]
13:10:25Arcorann (Arcorann) joins
13:11:38Webuser295588 joins
13:11:50Webuser295588 quits [Client Quit]
13:12:46Arcorann_ (Arcorann) joins
13:16:14Arcorann quits [Ping timeout: 256 seconds]
13:22:52Arcorann__ (Arcorann) joins
13:26:26Arcorann_ quits [Ping timeout: 256 seconds]
13:31:33Arcorann__ quits [Ping timeout: 272 seconds]
14:10:17Wohlstand quits [Client Quit]
14:10:29Wohlstand (Wohlstand) joins
14:11:24etnguyen03 (etnguyen03) joins
14:12:12etnguyen03 quits [Remote host closed the connection]
14:13:13etnguyen03 (etnguyen03) joins
14:33:56Webuser439278 joins
14:34:06Webuser439278 quits [Client Quit]
15:02:38roverinexile joins
15:04:39rover quits [Ping timeout: 272 seconds]
15:06:45nexussfan (nexussfan) joins
15:29:33etnguyen03 quits [Client Quit]
15:29:48<klea>It might be neat to have a project to archive ACM, since they moved to Open Access recently (2026-01-XX) as stated in: https://dl.acm.org/openaccess Sample link for pdf: https://dl.acm.org/doi/pdf/10.1145/2901739.2903508 sadly behind Buttflare.
15:29:51<klea>ButtFlare--
15:29:52<eggdrop>[karma] 'ButtFlare' now has -247 karma!
15:30:58<justauser>This needs to be checked, but quite probably everything is already available in... another archive.
15:31:10<klea>oh
15:59:45<h2ibot>Justauser edited Distributed recursive crawls (+42, /* Candidates */ Added mmnt.net, dropped…): https://wiki.archiveteam.org/?diff=60453&oldid=60414
16:01:28DogsRNice joins
16:08:58etnguyen03 (etnguyen03) joins
17:28:01DogsRNice_ joins
17:28:18cyanbox_ joins
17:31:14DogsRNice quits [Ping timeout: 256 seconds]
17:31:48cyanbox quits [Ping timeout: 256 seconds]
17:46:29anonandon joins
17:49:18<pokechu22>They might be available in https://scholar.archive.org/ (which probably isn't the archive justauser is referring to for what it's worth, but is a highly useful resource)
17:49:57<anonandon>Hi, I've archived a particular set of content that (whilst not immediately at risk, needs guarding against future corporate pivots. I wanted to upload it to archive.org but it consists of work from hundreds of individual artists so getting permission isn't feasible. The content is currently freely available via a specific app so I'm not cheating
17:49:57<anonandon>any individuals out of revenue.
17:49:57<anonandon>My question is this - should I upload it to IA to keep it safe? I'd rather not store it locally in case I lose it. And if the answer is "yes" - what metadata should I give IA about ownership and license?
17:49:58<klea>Yeah, the team that runs that is undervalued in IA apparently.
17:50:39<klea>anonandon: Probably more suitable to ask in #internetarchive.
17:51:18<justauser>anonandon: IA is fairly lax here.
17:51:46<justauser>You can even upload with no details, and it will stay here unless someone complains.
17:52:13<justauser>App name will probably make a better metadata than any individual artist,
17:52:27<anonandon>each file has attribution so there's that at least
17:58:20<pokechu22>I do wonder if archiveteam crawls end up on https://scholar.archive.org/ or if it's only a subset of IA's own crawls (e.g. archive-it crawls for universities)
17:59:39wavz joins
17:59:47<wavz>wsg
18:00:46<justauser>?
18:01:15<wavz>==what's good
18:01:25<justauser>Where?
18:01:42<wavz>it's just a greeting lol
18:05:57<wavz>does warrior still work?
18:06:08<justauser>Yes.
18:06:35<wavz>thx
18:06:46<justauser>You may need to update if you haven't touched it for a while, but the general idea is still there.
18:07:06nexussfan quits [Quit: Konversation terminated!]
18:09:19<wavz>on my old workstation i just set it up a couple of years ago and never touched it again lol
18:09:37<wavz>i don't even think it was running
18:10:53<justauser>-> #warrior for future questions, https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior for docs.
18:14:51etnguyen03 quits [Client Quit]
18:15:29<wavz>thanks. opendiary still on warrior?
18:16:51<justauser>Still?
18:17:01<justauser>Was added a week ago.
18:17:21<wavz>oh true it ends on the 28th
18:17:58<justauser>Not considered urgent and seems to be server-bound, but still welcome.
18:19:22<justauser>Huh, paused?
18:19:32<wavz>on the wiki it's still reported as urgent and short-term
18:19:58<wavz>is it down right now?
18:20:16<justauser>No, not paused.
18:20:32<wavz>it is for me
18:20:34<wavz>"Server returned bad response"
18:20:46<justauser>Probably you are banned for some reason.
18:21:14<justauser>Lots of people are without an apparent reason.
18:21:16<wavz>lovely
18:21:31<wavz>i didn't even know this website was a thing until yesterday
18:21:37<justauser>Probably an attempt to curb spam - if you have a spare proxy, look at the homepage.
18:21:47<wavz>it loads but very slowly
18:21:49<wavz>no proxy
18:21:54<justauser>Hint: consists of the said spam entirely.
18:22:20<wavz>?
18:26:24wavz quits [Client Quit]
18:28:19etnguyen03 (etnguyen03) joins
18:28:41<justauser>!tell wavz https://web.archive.org/web/20260129003520/http://opendiary.com/
18:28:43<eggdrop>[tell] ok, I'll tell wavz when they join next
18:49:01cyanbox_ quits [Read error: Connection reset by peer]
18:54:19Island joins
18:54:56DogsRNice_ quits [Client Quit]
18:55:11DogsRNice_ joins
18:55:11DogsRNice_ quits [Remote host closed the connection]
18:55:23DogsRNice joins
19:00:09<justauser>Opendiary is unstable and times out a lot /cc imer ?
19:00:30<@imer>:( do we need to lower the limit even more?
19:00:34<@imer>pausing for a bit
19:01:12<justauser>Warriors seem to be getting through, but I wanted to !ao the / and couldn't.
19:03:08<@imer>!remindme 15min has opendiary recovered?
19:03:10<eggdrop>[remind] ok, i'll remind you at 2026-02-08T19:18:08Z
19:18:09<eggdrop>[remind] imer: has opendiary recovered?
19:25:11nepeat quits [Quit: ZNC - https://znc.in]
19:25:40nepeat (nepeat) joins
19:29:44<@imer>nay
19:29:48<@imer>!remindme 15min has opendiary recovered?
19:29:49<eggdrop>[remind] ok, i'll remind you at 2026-02-08T19:44:49Z
19:32:04<@imer>mh, i might be going insane - apparently didnt limit it?
19:33:24<@imer>no i did. nvm
19:33:36<@imer>just taking that long to go through worker backlog
19:44:49<eggdrop>[remind] imer: has opendiary recovered?
19:50:18<nicolas17>consider disabling multiitems in opendiary, that reduces RTT and lets you adjust rate limits more accurately
20:14:20<h2ibot>IDKhowToEdit edited YouTube (+24, Updated bot permission: Queueing video now…): https://wiki.archiveteam.org/?diff=60454&oldid=59182
20:15:22<nicolas17>https://data.nicolas17.xyz/samsung-grab/
20:19:32<nicolas17>9 files pending
20:24:04<@imer>nicolas17: yep. did that
20:24:30<IDK>Trying but its downloading at 0.2mb/s😂
20:24:48<@imer>back to 1s response times, so resuming at 1/s
20:25:22<@imer>was 2/s
20:25:59<nicolas17>IDK: yeah samsung seems to throttle heavily
21:04:14<nicolas17>opendiary success ratio 86%
21:06:16cyanbox joins
21:16:42<@Fusl>nicolas17: wouldnt it be easier for everyone involved to just crowdsource the captcha solving?
21:19:29etnguyen03 quits [Client Quit]
21:25:58chunkynutz60 quits [Read error: Connection reset by peer]
21:26:16chunkynutz60 joins
21:28:58Wohlstand quits [Quit: Wohlstand]
21:38:46<theexit86>Hi all. Sorry, just restating my previous question: Would the ArchiveBot project benefit from running additional pipeline hosts? Many thanks in advance!
21:41:09<nicolas17>Fusl: how would you do that?
21:42:03<nicolas17>pretty sure if I serve you a captcha from my website using samsung's API key, it won't work because the origin isn't opensource.samsung.com
21:43:08v01d joins
21:49:54<nicolas17>I'd also need to bypass TLS fingerprinting which they introduced last month
21:58:07<klea>huh, could Samsung be sued for not making their files downloadable, and putting them behind a captcha?
22:02:02<nicolas17>maybe
22:02:54<nicolas17>throttling download speeds to 200KB/s, banning IPs based on opaque limits, having a captcha (which is proprietary software), all could maybe get them into trouble
22:03:02<klea>Oh yeah I forgot lawsuits cost money.
22:03:03<nicolas17>plus apparently they're missing many files
22:03:39<klea>fun
22:37:35<hexa->https://infosec.exchange/@bucketchallenge/116036543449145849
22:38:47<klea>pokechu22: IIRC you handled s3 buckets?, could you check https://wp-stat.s3.us-east-1.amazonaws.com/ ?
22:40:05<pokechu22>listing now
22:40:38<klea>ack
22:41:41<pokechu22>https://gitea.arpa.li/JustAnotherArchivist/little-things/src/branch/master/s3-bucket-list for reference (I'm locally using an older and hacked up version because I've been too lazy to update, but current version should work too)
22:46:09Arcorann__ (Arcorann) joins
22:58:31khaoohs joins
23:00:28khaoohs__ quits [Ping timeout: 256 seconds]
23:07:07sec^nd quits [Remote host closed the connection]
23:07:14<@Fusl>nicolas17: just like how captcha solver services work, you grab the captcha and then forward that to someone else to solve
23:07:34sec^nd (second) joins
23:32:31<nicolas17>Fusl: "grab the captcha" seems hard
23:33:19<nicolas17>some captchas are an image where you have to click multiple things, some are a 9x9 grid of separate images, some involve drag and drop, and there's probably highly obfuscated JS code involved
23:33:42cicadia71 joins
23:33:49<cicadia71>bs engaged
23:42:41<TheTechRobo>The one I ran into today was an animation where you had to select the object with a different motion pattern than the others, so you'd have to capture video (or reverse-engineer the data structures that define it)
23:46:59<klea>ok, so running a headfull instance and letting people over vnc solve captchas.
23:47:11<klea>IIRC There's also VNC for the web, for people not needing to install VNC clients.
23:58:47Dada quits [Remote host closed the connection]