00:03:43lunik1 quits [Quit: :x]
00:04:07lunik1 joins
01:07:16nexussfan quits [Client Quit]
01:11:47<h2ibot>Hans5958 edited In The Media (-206, Merge multilingual entries): https://wiki.archiveteam.org/?diff=60318&oldid=59332
01:23:15Wohlstand1 (Wohlstand) joins
01:28:15Wohlstand1 quits [Ping timeout: 272 seconds]
01:41:13cyanbox joins
01:49:15etnguyen03 (etnguyen03) joins
01:52:21nexussfan (nexussfan) joins
02:03:36chrismeller3 quits [Quit: chrismeller3]
02:04:14chrismeller3 (chrismeller) joins
02:11:35chrismeller3 quits [Client Quit]
02:13:42chrismeller3 (chrismeller) joins
02:45:40LddPotato quits [Read error: Connection reset by peer]
02:46:54Wohlstand1 (Wohlstand) joins
02:47:05LddPotato (LddPotato) joins
02:48:14HP_Archivist (HP_Archivist) joins
02:57:16LddPotato quits [Read error: Connection reset by peer]
02:58:43LddPotato (LddPotato) joins
02:59:37nexussfan quits [Client Quit]
03:01:04nexussfan (nexussfan) joins
03:18:24Wohlstand1 quits [Client Quit]
03:25:54Guest58_ joins
03:27:57Guest58 quits [Ping timeout: 272 seconds]
03:38:08Webuser870958 joins
03:42:45<Webuser870958>Not that it matters but it looks like this should be on the largest projects: https://wiki.archiveteam.org/index.php/Google_Drive Also interesting that 7 of the largest projects are in 2025. I wonder if there will be as many large projects in 2026...
03:42:54Webuser870958 quits [Client Quit]
04:02:26etnguyen03 quits [Remote host closed the connection]
04:13:57jason joins
04:13:58jason quits [Remote host closed the connection]
04:22:44Island quits [Read error: Connection reset by peer]
04:49:33Guest58 joins
04:53:17sec^nd quits [Remote host closed the connection]
04:53:27Guest58_ quits [Ping timeout: 272 seconds]
04:53:54sec^nd (second) joins
05:04:50n9nes quits [Ping timeout: 256 seconds]
05:05:09n9nes joins
05:07:02DogsRNice quits [Read error: Connection reset by peer]
05:13:48lennier2_ joins
05:16:53lennier2 quits [Ping timeout: 272 seconds]
05:34:46<pokechu22>justauser: https://wormbase.org/ got a cloudflare challenge for me
05:36:01<pokechu22>https://blog.wormbase.org/ works though
05:43:20Guest58 quits [Read error: Connection reset by peer]
05:48:20Guest58 joins
05:49:46Guest58_ joins
05:53:37Guest58 quits [Ping timeout: 272 seconds]
06:17:15nexussfan quits [Quit: Konversation terminated!]
06:24:16ArchivalEfforts quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
06:24:25ArchivalEfforts joins
06:28:36<h2ibot>KleaBot made 2 bot changes: https://wiki.archiveteam.org/index.php?title=Special:Contributions/KleaBot&offset=20260123062819&limit=2&namespace=2&wpfilters[]=nsInvert&wpfilters[]=associated
06:32:09<@JAA>Eh, not sure about that [[In The Media]] edit.
06:33:11<klea>JAA: did you update the code to run my version?
06:33:46<klea>JAA: should i revert it?
06:34:13<klea>well, i guess you can revert it.
06:37:06sec^nd quits [Ping timeout: 256 seconds]
06:37:49sec^nd (second) joins
06:39:08<@JAA>klea: I obviously didn't deploy it given the [].
06:39:19<klea>sorr.
06:40:05<@JAA>And to be clear, I mean Hans5958's edit of [[In The Media]], not what your bot did with it. That's just how I found the earlier edit.
06:40:26<klea>yeah i understand.
06:41:22<@JAA>I feel like we should list both language versions fully. But if we want to collapse it, I'd rather see the original as the main entry.
06:41:37gosc joins
06:41:48<Hans5958>I guess that can be reverted (or I will later)
06:42:01<Hans5958>mb gng
06:42:23lennier2_ quits [Ping timeout: 272 seconds]
06:43:11lennier2_ joins
07:32:49Guest58 joins
07:34:57Guest58_ quits [Ping timeout: 272 seconds]
07:46:59gosc quits [Client Quit]
08:15:48ducky quits [Ping timeout: 256 seconds]
08:52:26midou quits [Read error: Connection reset by peer]
09:02:06midou joins
09:19:27Jake quits [Ping timeout: 272 seconds]
09:19:39Jake (Jake) joins
09:28:03Jake8 (Jake) joins
09:28:57Jake quits [Ping timeout: 272 seconds]
09:28:57Jake8 is now known as Jake
09:37:45ducky (ducky) joins
10:12:23Dada joins
10:26:53irisfreckles13 joins
11:40:58APOLLO03a joins
11:41:19irisfreckles13 quits [Ping timeout: 272 seconds]
11:44:41irisfreckles13 joins
11:44:46APOLLO03- joins
11:44:54APOLLO03 quits [Ping timeout: 256 seconds]
11:47:44APOLLO03a quits [Ping timeout: 256 seconds]
11:51:27irisfreckles13 quits [Ping timeout: 272 seconds]
12:00:03Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat]
12:02:53Bleo182600722719623455222 joins
12:16:48<justauser>pokechu22: Cloudflare all over the place indeed. Maybe they will be approachable; otherwise, we are left with grab-site for smaller sites.
12:16:50<justauser>Fortunately, https://downloads.wormbase.org/ is seemingly intended to be kept indefinitely.
12:16:58<justauser>kline: FTP didn't work when I tested a few months ago. I suspect Cloudflare can't forward it through, and they don't want to expose directly.
12:19:41<justauser>kline: Regarding hostile ISP, we have some projects involving manual download then upload.
12:20:24<justauser>They are mostly over HTTPS, but with supervision you can simply try another one if you see your ISP interfering.
12:21:36<justauser>That's at least #wikiteam and #discard.
12:25:01<justauser>Warrior set to a static project *might* be safe - do they block Telegram? #telegrab is overloaded and always needs more workers.
12:27:17<justauser>DPI probably looks at TLS SNI extension, so encrypted DNS alone can't stop it.
13:01:07Sluggs quits [Ping timeout: 272 seconds]
13:08:46Wohlstand (Wohlstand) joins
13:54:57tzt quits [Ping timeout: 272 seconds]
14:32:19<klea>i wonder how hard it'd be to have warriors only for CF
14:32:53<klea>justauser: do you know if wormbase has some endpoint where it will do a outwards request for some purpose (to get the IP for the server)?
14:40:54<justauser>Not found any so far.
14:41:53<justauser>FTR, there is a thing called FlareSolverrr to deal with CF CAPTCHAs automatically, and it's possible in theory to have it inside the Warrior container.
14:41:56<klea>Is there's a easy way to put all redirects from websites like https://code.rocketnine.space/tslocum/cview/issues/85 which seem to always redirect to a codeberg URL onto AB?, since i don't know all urls, but I guess I could make up my own list of urls based on the projects tslocum has on codeberg, and extract common issue IDs from it too?
14:42:08<justauser>Not that I ever managed to get it to work.
14:53:21calisto joins
15:01:08calisto quits [Client Quit]
15:04:55liest joins
15:31:58<liest>Hi, i'm about to archive soundcloud.org, i already scraped metadata for all of their tracks - 321302446 with combined duration of 151426320204960 ms
15:32:02<liest>By my calculations storing all of them in opus 64k would take roughly 1.1PB of storage, but by something that is not dependent on me i have to compress it down to about 500TB
15:32:06<liest>Initially i thought about using opus at 32k, but i've heard that xhe-aac is more efficient. I think i'll be able to deduplicate tracks from soundcloud with spotify scrape from Anna's Archive to save space, i might also categorize longer tracks into podcasts and maybe somehow remove ai slop from recent years. So i was thinking about doing 40k VBR
15:32:06<liest>for music and 22k CBR for podcasts.
15:32:09<liest>I really need help with this since i don't have much experience with audio
15:34:42<justauser>We don't either.
15:36:20<justauser>Our past experience with Soundcloud is complicated, cf. https://wiki.archiveteam.org/index.php/SoundCloud
15:36:58<justauser>You are probably better off asking around at AA.
15:37:27<justauser>However, please don't run away right now. Someone else might have a different opinion.
15:39:38<justauser>In particular, we probably can't provide the required storage for the reasons above.
15:45:00<liest>no problem
15:45:03<liest>the most important thing for me is feedback on encoding strategy
15:45:45<@arkiver>yes on that we cannot provide the storage
16:02:47Webuser590046 joins
16:14:12liest quits [Client Quit]
16:27:28liest joins
17:08:43<kline><justauser> [12:19:41] kline: Regarding hostile ISP, we have some projects involving manual download then upload.
17:08:45<kline>nice
17:14:31Webuser137440 joins
17:14:38Webuser137440 quits [Client Quit]
17:25:11DogDisco joins
17:26:55Dada quits [Remote host closed the connection]
17:27:04sec^nd quits [Ping timeout: 256 seconds]
17:28:45Dada joins
17:32:47sec^nd (second) joins
17:33:58<kline><justauser> [12:27:17] DPI probably looks at TLS SNI extension, so encrypted DNS alone can't stop it.
17:34:02<kline>that also makes sense, thanks
17:41:52Webuser590046 quits [Client Quit]
17:56:10Wohlstand quits [Remote host closed the connection]
18:04:51<@arkiver>let's keep a very close eye at vimeo
18:05:13<@arkiver>i hope more news comes out or leaks about what these layoffs reported in #archiveteam will mean
18:26:55DogDisco leaves [Leaving]
19:01:58MrMcNuggets quits [Quit: WeeChat 4.3.2]
19:14:44<egallager>#vimeoff
19:17:32Wohlstand1 (Wohlstand) joins
19:19:54Wohlstand1 is now known as Wohlstand
19:22:34Max_G quits [Quit: Bye Bye]
19:23:28Max_G joins
19:25:17<@arkiver>nice
19:25:32<@arkiver>good one egallager
19:25:53DogsRNice joins
19:36:19egallager quits [Ping timeout: 272 seconds]
19:40:37xarph joins
19:41:48<xarph>I got a text from a friend who is a dod web content guy that the next wave of scrubs appears to be anything related to the us withdrawal from afghanistan, with the request "please ask the internet archivists to work fast"
19:43:13Wohlstand quits [Client Quit]