| 00:08:34 | <Washuu> | Thank you for the answer. Cheers! |
| 00:08:39 | | Washuu leaves [Ooops, wrong browser tab.] |
| 00:10:05 | | Dada quits [Remote host closed the connection] |
| 00:23:43 | | etnguyen03 quits [Client Quit] |
| 00:43:35 | <kiska> | szczot3k I use https://github.com/nicholas-fedor/watchtower |
| 00:55:31 | | SootBector quits [Remote host closed the connection] |
| 00:56:41 | | SootBector (SootBector) joins |
| 01:03:54 | | Shard111 (Shard) joins |
| 01:04:40 | | Shard11 quits [Ping timeout: 256 seconds] |
| 01:04:40 | | Shard111 is now known as Shard11 |
| 01:09:06 | | SootBector quits [Remote host closed the connection] |
| 01:09:14 | | etnguyen03 (etnguyen03) joins |
| 01:10:15 | | Webuser919909 joins |
| 01:14:38 | | SootBector (SootBector) joins |
| 01:18:24 | | Webuser919909 quits [Client Quit] |
| 01:33:43 | | xkey quits [Quit: WeeChat 4.8.1] |
| 01:34:47 | | xkey (xkey) joins |
| 01:36:47 | <h2ibot> | Cruller edited Deathwatch (+257, /* 2026-02 */ Add 47CLUB): https://wiki.archiveteam.org/?diff=60449&oldid=60429 |
| 02:33:55 | <h2ibot> | PaulWise edited Steam (-2456, drop old how-can-i-help section): https://wiki.archiveteam.org/?diff=60450&oldid=59758 |
| 02:42:14 | | CYBERDEV quits [Quit: Leaving] |
| 02:56:26 | | nexussfan quits [Read error: Connection reset by peer] |
| 04:04:04 | | theexit8 joins |
| 04:04:39 | | theexit8 quits [Client Quit] |
| 04:14:38 | | LddPotato quits [Read error: Connection reset by peer] |
| 04:15:22 | | LddPotato (LddPotato) joins |
| 04:25:14 | | LddPotato quits [Read error: Connection reset by peer] |
| 04:25:58 | | LddPotato (LddPotato) joins |
| 04:28:25 | <triplecamera|m> | justauser: Oh I see, you were replying to my previous message: "What does SFDW stand for?" |
| 04:28:40 | <triplecamera|m> | https://irclogs.archivete.am/archiveteam-bs/2026-02-05#l4faaf306 |
| 04:35:53 | | LddPotato quits [Read error: Connection reset by peer] |
| 04:36:41 | | LddPotato (LddPotato) joins |
| 04:38:39 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:46:40 | | Island quits [Read error: Connection reset by peer] |
| 04:54:58 | | LddPotato quits [Read error: Connection reset by peer] |
| 04:55:36 | | LddPotato (LddPotato) joins |
| 04:58:29 | | etnguyen03 quits [Quit: Konversation terminated!] |
| 04:59:16 | | etnguyen03 (etnguyen03) joins |
| 05:04:15 | | n9nes quits [Ping timeout: 272 seconds] |
| 05:05:28 | | LddPotato quits [Read error: Connection reset by peer] |
| 05:05:50 | | n9nes joins |
| 05:06:19 | | LddPotato (LddPotato) joins |
| 05:14:07 | | etnguyen03 quits [Remote host closed the connection] |
| 06:04:03 | | nine quits [Quit: See ya!] |
| 06:04:16 | | nine joins |
| 06:04:16 | | nine is now authenticated as nine |
| 06:04:16 | | nine quits [Changing host] |
| 06:04:16 | | nine (nine) joins |
| 08:33:45 | <h2ibot> | Bzc6p edited Talk:Wayback Machine (+251, /* Status as redirect */ be bold): https://wiki.archiveteam.org/?diff=60452&oldid=60402 |
| 09:24:24 | | hexagonwin (hexagonwin) joins |
| 09:54:33 | | LddPotato quits [Read error: Connection reset by peer] |
| 09:55:14 | | LddPotato (LddPotato) joins |
| 10:04:36 | | Dada joins |
| 10:05:03 | | LddPotato quits [Read error: Connection reset by peer] |
| 10:05:54 | | LddPotato (LddPotato) joins |
| 10:09:52 | | iPwnedYourIOTSmartdog7 joins |
| 10:12:04 | | iPwnedYourIOTSmartdog quits [Ping timeout: 256 seconds] |
| 10:12:04 | | iPwnedYourIOTSmartdog7 is now known as iPwnedYourIOTSmartdog |
| 10:15:59 | | LddPotato quits [Read error: Connection reset by peer] |
| 10:16:26 | | ducky quits [Remote host closed the connection] |
| 10:16:37 | | ducky (ducky) joins |
| 10:16:37 | | LddPotato (LddPotato) joins |
| 10:34:57 | | LddPotato quits [Read error: Connection reset by peer] |
| 10:35:34 | | LddPotato (LddPotato) joins |
| 10:37:48 | | theexit86 joins |
| 10:40:24 | <theexit86> | Hello all, and many thanks to you all for your hard work on ArchiveTeam and its warriors. Firstly, my apologies if this is the wrong place or channel to ask, as the #archivebot channel seems busy. If so, please direct me to the correct place. |
| 10:40:42 | <theexit86> | I wish to apply to run an ArchiveBot pipeline on one of my servers. For the record, I've run several warriors since June of last year under the tracker username "theexit8", and sometimes joined their respective IRC channels to report issues. |
| 10:41:03 | <theexit86> | I have some logistics questions I'll ask later, but first and most important: does the ArchiveBot project truly need more hosts, or is it already at good enough capacity? I'm asking because it seems to me the US Government warrior is fairly well connected and has few allocations to spare, so I was wondering whether this is also the case with ArchiveBot. |
| 10:41:21 | <theexit86> | Many thanks for reading, and if you're open to having me volunteer my computer, I'll be happy to install the pipeline and get registered to process jobs. I'll of course report any issues afterwards here. |
| 10:45:31 | | LddPotato quits [Read error: Connection reset by peer] |
| 10:46:16 | | LddPotato (LddPotato) joins |
| 11:00:11 | | ducky quits [Ping timeout: 272 seconds] |
| 11:02:32 | | NatTheCat quits [Quit: Ping timeout (120 seconds)] |
| 11:02:59 | | NatTheCat (NatTheCat) joins |
| 11:15:12 | | ducky (ducky) joins |
| 11:16:22 | | Sk1d joins |
| 11:35:18 | | Wohlstand (Wohlstand) joins |
| 11:48:25 | <BennyOtt> | OpenDiary redirects me to the page https://opendiaryhelp.zendesk.com/hc/en-us/articles/360041783451-Why-am-I-being-blocked-from-Open-Diary even at my home IP address, where I hadn't even run the crawler. |
| 11:58:02 | | Sk1d quits [Ping timeout: 256 seconds] |
| 12:00:03 | | Bleo1826007227196234552220 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:02:25 | | Sk1d joins |
| 12:02:48 | | Bleo1826007227196234552220 joins |
| 12:35:29 | | Sk1d quits [Client Quit] |
| 12:39:51 | | sg72 quits [Quit: Leaving] |
| 12:45:48 | | sg72 joins |
| 13:09:04 | | Arcorann is now authenticated as Arcorann |
| 13:10:25 | | Arcorann quits [Changing host] |
| 13:10:25 | | Arcorann (Arcorann) joins |
| 13:11:38 | | Webuser295588 joins |
| 13:11:50 | | Webuser295588 quits [Client Quit] |
| 13:12:46 | | Arcorann_ (Arcorann) joins |
| 13:16:14 | | Arcorann quits [Ping timeout: 256 seconds] |
| 13:22:52 | | Arcorann__ (Arcorann) joins |
| 13:26:26 | | Arcorann_ quits [Ping timeout: 256 seconds] |
| 13:31:33 | | Arcorann__ quits [Ping timeout: 272 seconds] |
| 14:10:17 | | Wohlstand quits [Client Quit] |
| 14:10:29 | | Wohlstand (Wohlstand) joins |
| 14:11:24 | | etnguyen03 (etnguyen03) joins |
| 14:12:12 | | etnguyen03 quits [Remote host closed the connection] |
| 14:13:13 | | etnguyen03 (etnguyen03) joins |
| 14:33:56 | | Webuser439278 joins |
| 14:34:06 | | Webuser439278 quits [Client Quit] |
| 15:02:38 | | roverinexile joins |
| 15:04:39 | | rover quits [Ping timeout: 272 seconds] |
| 15:06:45 | | nexussfan (nexussfan) joins |
| 15:29:33 | | etnguyen03 quits [Client Quit] |
| 15:29:48 | <klea> | It might be neat to have a project to archive ACM, since they moved to Open Access recently (2026-01-XX) as stated in: https://dl.acm.org/openaccess Sample link for pdf: https://dl.acm.org/doi/pdf/10.1145/2901739.2903508 sadly behind Buttflare. |
| 15:29:51 | <klea> | ButtFlare-- |
| 15:29:52 | <eggdrop> | [karma] 'ButtFlare' now has -247 karma! |
| 15:30:58 | <justauser> | This needs to be checked, but quite probably everything is already available in... another archive. |
| 15:31:10 | <klea> | oh |
| 15:59:45 | <h2ibot> | Justauser edited Distributed recursive crawls (+42, /* Candidates */ Added mmnt.net, dropped…): https://wiki.archiveteam.org/?diff=60453&oldid=60414 |
| 16:01:28 | | DogsRNice joins |
| 16:08:58 | | etnguyen03 (etnguyen03) joins |
| 17:28:01 | | DogsRNice_ joins |
| 17:28:18 | | cyanbox_ joins |
| 17:31:14 | | DogsRNice quits [Ping timeout: 256 seconds] |
| 17:31:48 | | cyanbox quits [Ping timeout: 256 seconds] |
| 17:46:29 | | anonandon joins |
| 17:49:18 | <pokechu22> | They might be available in https://scholar.archive.org/ (which probably isn't the archive justauser is referring to for what it's worth, but is a highly useful resource) |
| 17:49:57 | <anonandon> | Hi, I've archived a particular set of content that, whilst not immediately at risk, needs guarding against future corporate pivots. I wanted to upload it to archive.org but it consists of work from hundreds of individual artists so getting permission isn't feasible. The content is currently freely available via a specific app so I'm not cheating any individuals out of revenue. |
| 17:49:57 | <anonandon> | My question is this - should I upload it to IA to keep it safe? I'd rather not store it locally in case I lose it. And if the answer is "yes" - what metadata should I give IA about ownership and license? |
| 17:49:58 | <klea> | Yeah, the team that runs that is undervalued in IA apparently. |
| 17:50:39 | <klea> | anonandon: Probably more suitable to ask in #internetarchive. |
| 17:51:18 | <justauser> | anonandon: IA is fairly lax here. |
| 17:51:46 | <justauser> | You can even upload with no details, and it will stay there unless someone complains. |
| 17:52:13 | <justauser> | The app name will probably make better metadata than any individual artist. |
| 17:52:27 | <anonandon> | each file has attribution so there's that at least |
| 17:58:20 | <pokechu22> | I do wonder if archiveteam crawls end up on https://scholar.archive.org/ or if it's only a subset of IA's own crawls (e.g. archive-it crawls for universities) |
| 17:59:39 | | wavz joins |
| 17:59:47 | <wavz> | wsg |
| 18:00:46 | <justauser> | ? |
| 18:01:15 | <wavz> | ==what's good |
| 18:01:25 | <justauser> | Where? |
| 18:01:42 | <wavz> | it's just a greeting lol |
| 18:05:57 | <wavz> | does warrior still work? |
| 18:06:08 | <justauser> | Yes. |
| 18:06:35 | <wavz> | thx |
| 18:06:46 | <justauser> | You may need to update if you haven't touched it for a while, but the general idea is still there. |
| 18:07:06 | | nexussfan quits [Quit: Konversation terminated!] |
| 18:09:19 | <wavz> | on my old workstation i just set it up a couple of years ago and never touched it again lol |
| 18:09:37 | <wavz> | i don't even think it was running |
| 18:10:53 | <justauser> | -> #warrior for future questions, https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior for docs. |
| 18:14:51 | | etnguyen03 quits [Client Quit] |
| 18:15:29 | <wavz> | thanks. opendiary still on warrior? |
| 18:16:51 | <justauser> | Still? |
| 18:17:01 | <justauser> | Was added a week ago. |
| 18:17:21 | <wavz> | oh true it ends on the 28th |
| 18:17:58 | <justauser> | Not considered urgent and seems to be server-bound, but still welcome. |
| 18:19:22 | <justauser> | Huh, paused? |
| 18:19:32 | <wavz> | on the wiki it's still reported as urgent and short-term |
| 18:19:58 | <wavz> | is it down right now? |
| 18:20:16 | <justauser> | No, not paused. |
| 18:20:32 | <wavz> | it is for me |
| 18:20:34 | <wavz> | "Server returned bad response" |
| 18:20:46 | <justauser> | Probably you are banned for some reason. |
| 18:21:14 | <justauser> | Lots of people are without an apparent reason. |
| 18:21:16 | <wavz> | lovely |
| 18:21:31 | <wavz> | i didn't even know this website was a thing until yesterday |
| 18:21:37 | <justauser> | Probably an attempt to curb spam - if you have a spare proxy, look at the homepage. |
| 18:21:47 | <wavz> | it loads but very slowly |
| 18:21:49 | <wavz> | no proxy |
| 18:21:54 | <justauser> | Hint: it consists entirely of said spam. |
| 18:22:20 | <wavz> | ? |
| 18:26:24 | | wavz quits [Client Quit] |
| 18:28:19 | | etnguyen03 (etnguyen03) joins |
| 18:28:41 | <justauser> | !tell wavz https://web.archive.org/web/20260129003520/http://opendiary.com/ |
| 18:28:43 | <eggdrop> | [tell] ok, I'll tell wavz when they join next |
| 18:49:01 | | cyanbox_ quits [Read error: Connection reset by peer] |
| 18:54:19 | | Island joins |
| 18:54:56 | | DogsRNice_ quits [Client Quit] |
| 18:55:11 | | DogsRNice_ joins |
| 18:55:11 | | DogsRNice_ quits [Remote host closed the connection] |
| 18:55:23 | | DogsRNice joins |
| 19:00:09 | <justauser> | Opendiary is unstable and times out a lot /cc imer ? |
| 19:00:30 | <@imer> | :( do we need to lower the limit even more? |
| 19:00:34 | <@imer> | pausing for a bit |
| 19:01:12 | <justauser> | Warriors seem to be getting through, but I wanted to !ao the / and couldn't. |
| 19:03:08 | <@imer> | !remindme 15min has opendiary recovered? |
| 19:03:10 | <eggdrop> | [remind] ok, i'll remind you at 2026-02-08T19:18:08Z |
| 19:18:09 | <eggdrop> | [remind] imer: has opendiary recovered? |
| 19:25:11 | | nepeat quits [Quit: ZNC - https://znc.in] |
| 19:25:40 | | nepeat (nepeat) joins |
| 19:29:44 | <@imer> | nay |
| 19:29:48 | <@imer> | !remindme 15min has opendiary recovered? |
| 19:29:49 | <eggdrop> | [remind] ok, i'll remind you at 2026-02-08T19:44:49Z |
| 19:32:04 | <@imer> | mh, i might be going insane - apparently didnt limit it? |
| 19:33:24 | <@imer> | no i did. nvm |
| 19:33:36 | <@imer> | just taking that long to go through worker backlog |
| 19:44:49 | <eggdrop> | [remind] imer: has opendiary recovered? |
| 19:50:18 | <nicolas17> | consider disabling multiitems in opendiary, that reduces RTT and lets you adjust rate limits more accurately |
| 20:14:20 | <h2ibot> | IDKhowToEdit edited YouTube (+24, Updated bot permission: Queueing video now…): https://wiki.archiveteam.org/?diff=60454&oldid=59182 |
| 20:15:22 | <nicolas17> | https://data.nicolas17.xyz/samsung-grab/ |
| 20:19:32 | <nicolas17> | 9 files pending |
| 20:24:04 | <@imer> | nicolas17: yep. did that |
| 20:24:30 | <IDK> | Trying but it's downloading at 0.2 MB/s😂 |
| 20:24:48 | <@imer> | back to 1s response times, so resuming at 1/s |
| 20:25:22 | <@imer> | was 2/s |
| 20:25:59 | <nicolas17> | IDK: yeah samsung seems to throttle heavily |
| 21:04:14 | <nicolas17> | opendiary success ratio 86% |
| 21:06:16 | | cyanbox joins |
| 21:16:42 | <@Fusl> | nicolas17: wouldn't it be easier for everyone involved to just crowdsource the captcha solving? |
| 21:19:29 | | etnguyen03 quits [Client Quit] |
| 21:25:58 | | chunkynutz60 quits [Read error: Connection reset by peer] |
| 21:26:16 | | chunkynutz60 joins |
| 21:28:58 | | Wohlstand quits [Quit: Wohlstand] |
| 21:38:46 | <theexit86> | Hi all. Sorry, just restating my previous question: Would the ArchiveBot project benefit from running additional pipeline hosts? Many thanks in advance! |
| 21:41:09 | <nicolas17> | Fusl: how would you do that? |
| 21:42:03 | <nicolas17> | pretty sure if I serve you a captcha from my website using samsung's API key, it won't work because the origin isn't opensource.samsung.com |
| 21:43:08 | | v01d joins |
| 21:44:04 | | v01d is now authenticated as v01d |
| 21:49:54 | <nicolas17> | I'd also need to bypass TLS fingerprinting which they introduced last month |
| 21:58:07 | <klea> | huh, could Samsung be sued for not making their files downloadable, and putting them behind a captcha? |
| 22:02:02 | <nicolas17> | maybe |
| 22:02:54 | <nicolas17> | throttling download speeds to 200KB/s, banning IPs based on opaque limits, having a captcha (which is proprietary software), all could maybe get them into trouble |
| 22:03:02 | <klea> | Oh yeah I forgot lawsuits cost money. |
| 22:03:03 | <nicolas17> | plus apparently they're missing many files |
| 22:03:39 | <klea> | fun |
| 22:37:35 | <hexa-> | https://infosec.exchange/@bucketchallenge/116036543449145849 |
| 22:38:47 | <klea> | pokechu22: IIRC you handled S3 buckets? Could you check https://wp-stat.s3.us-east-1.amazonaws.com/ ? |
| 22:40:05 | <pokechu22> | listing now |
| 22:40:38 | <klea> | ack |
| 22:41:41 | <pokechu22> | https://gitea.arpa.li/JustAnotherArchivist/little-things/src/branch/master/s3-bucket-list for reference (I'm locally using an older and hacked up version because I've been too lazy to update, but current version should work too) |
| 22:46:09 | | Arcorann__ (Arcorann) joins |
| 22:58:31 | | khaoohs joins |
| 23:00:28 | | khaoohs__ quits [Ping timeout: 256 seconds] |
| 23:07:07 | | sec^nd quits [Remote host closed the connection] |
| 23:07:14 | <@Fusl> | nicolas17: just like how captcha solver services work, you grab the captcha and then forward that to someone else to solve |
| 23:07:34 | | sec^nd (second) joins |
| 23:32:31 | <nicolas17> | Fusl: "grab the captcha" seems hard |
| 23:33:19 | <nicolas17> | some captchas are an image where you have to click multiple things, some are a 9x9 grid of separate images, some involve drag and drop, and there's probably highly obfuscated JS code involved |
| 23:33:42 | | cicadia71 joins |
| 23:33:49 | <cicadia71> | bs engaged |
| 23:42:41 | <TheTechRobo> | The one I ran into today was an animation where you had to select the object with a different motion pattern than the others, so you'd have to capture video (or reverse-engineer the data structures that define it) |
| 23:46:59 | <klea> | ok, so running a headful instance and letting people solve captchas over VNC. |
| 23:47:11 | <klea> | IIRC there's also VNC for the web, so people don't need to install VNC clients. |
| 23:58:47 | | Dada quits [Remote host closed the connection] |
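
For the S3 bucket listing discussed at 22:38-22:41, here is a minimal sketch of how paginated listing of a publicly listable bucket can work. It is not the `s3-bucket-list` script linked in the log; the function names (`build_list_url`, `parse_listing`, `list_bucket`) are hypothetical, and it assumes the bucket allows anonymous listing and answers with the standard S3 ListObjectsV2 XML format.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# XML namespace used by S3 list responses.
S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}


def build_list_url(bucket_url, token=None):
    """Build a ListObjectsV2 URL, optionally resuming from a continuation token."""
    params = {"list-type": "2"}
    if token:
        params["continuation-token"] = token
    return bucket_url.rstrip("/") + "/?" + urllib.parse.urlencode(params)


def parse_listing(xml_text):
    """Return ([(key, size), ...], next_token_or_None) for one response page."""
    root = ET.fromstring(xml_text)
    items = [
        (
            c.findtext("s3:Key", namespaces=S3_NS),
            int(c.findtext("s3:Size", namespaces=S3_NS)),
        )
        for c in root.findall("s3:Contents", S3_NS)
    ]
    truncated = (
        root.findtext("s3:IsTruncated", default="false", namespaces=S3_NS) == "true"
    )
    token = (
        root.findtext("s3:NextContinuationToken", namespaces=S3_NS)
        if truncated
        else None
    )
    return items, token


def list_bucket(bucket_url):
    """Yield (key, size) for every object, following pagination (needs network)."""
    token = None
    while True:
        with urllib.request.urlopen(build_list_url(bucket_url, token)) as resp:
            items, token = parse_listing(resp.read())
        yield from items
        if token is None:
            break
```

Calling `list_bucket(...)` with a bucket URL would yield one `(key, size)` pair per object. Keeping the parsing separate from the network loop makes it possible to check the pagination logic against a saved response without touching the bucket.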