00:02:04<imer>good to hear
00:02:51<Misty>personally i don't think videos should go to WBM either
00:03:33<yzqzss|m>go to IA item?
00:03:45<flashfire42>what does it mean when an item has a strike through on the tracker leaderboard?
00:04:02<imer>a single dupe I think?
00:09:56Perk quits [Read error: Connection reset by peer]
00:11:02<fireonlive>Misty: have you been hit by google's read only yet?
00:11:26<fireonlive>also google's read only will end at some point from what I understand, but they're going to give people time to sort things out
00:11:39<fireonlive>(there's no defined point for 'some point' at this time)
00:12:21Perk joins
00:14:11BlueMaxima joins
00:14:15Twisty quits [Remote host closed the connection]
00:16:08<Misty>@fireonlive I'm not yet. For RO's end, currently neither clear ETA are given, nor do Google tells how they will do.
00:16:37<Misty>google didn't clearly say what they will do if a user ALWAYS goes over capacity, but keeps paying
00:16:54<fireonlive>ah ok, it's a bit random who they hit it seems
00:17:44<fireonlive>they've been updating FAQs etc as time passes so it's all as clear as mud :D
00:18:25<Misty>I believe Google won't do this for Business like they did for Education before, because NPO & EDU are usually free, and having no critical data, but for corporations Google may get lawsuits if they simply choose to delete data.
00:18:35<Misty>yeah, we can just wait lol
00:21:09<fireonlive>ye we'll see haha
00:21:21<fireonlive>the pricing is quite something per extra 10TB
00:23:54Ruthalas5 quits [Client Quit]
00:24:28Ruthalas5 (Ruthalas) joins
00:33:57pabs quits [Client Quit]
00:35:58pabs (pabs) joins
01:02:42AlbertLarsan68 quits [Client Quit]
02:01:16BigBrain quits [Ping timeout: 245 seconds]
02:03:52BigBrain (bigbrain) joins
02:05:53justmolamola joins
02:06:07<h2ibot>Yts98 uploaded File:Banciyuan-logo.png: https://wiki.archiveteam.org/?title=File%3ABanciyuan-logo.png
02:10:07<h2ibot>Yts98 edited Banciyuan (+3202, Clarify old URL rules and describe Magic…): https://wiki.archiveteam.org/?diff=49979&oldid=49967
02:30:49PredatorIWD joins
02:53:00<nicolas17>JAA rewby: targets are a bit more busy now https://paste.debian.net/plain/1283571
03:00:33<@JAA>nicolas17: More accurately, IA is more busy now, probably because it's evening in the US.
03:01:42<nicolas17>okay I caught hel1 working a few times
03:01:54<nicolas17>so we're fine, nothing is permanently stuck
03:04:34nulldata quits [Quit: The Lounge - https://thelounge.chat]
03:04:50nulldata joins
03:13:45<fireonlive>oh that's a 1 not an l
03:19:24<nicolas17>Hetzner datacenter in Helsinki I think
03:20:16sonick (sonick) joins
03:21:42<fireonlive>ahh
03:25:13<fireonlive>uh, just putting it out there but yesterday or so I learned that xvideos publishes a dump of sorts: https://info.xvideos.net/db
03:25:15<fireonlive>"It contains about 7,000,000 videos with video URL, thumb URL, tags, duration, title, and embed code"
03:25:32<fireonlive>if anyone finds that useful (also oops, shoulda been in -ot)
03:50:57DLoader_ joins
03:51:36franga2000 quits [Quit: Ping timeout (120 seconds)]
03:51:47franga2000 joins
03:52:04Letur0 joins
03:52:33mikael quits [Quit: ZNC - http://znc.in]
03:52:53Xesxen quits [Quit: No Ping reply in 180 seconds.]
03:52:58Xesxen (Xesxen) joins
03:53:20Dj-Wawa_ quits [Remote host closed the connection]
03:53:21luckcolors quits [Quit: No Ping reply in 180 seconds.]
03:53:29raxxy-137409 quits [Read error: Connection reset by peer]
03:53:37Ketchup901 quits [Remote host closed the connection]
03:53:47DLoader quits [Ping timeout: 252 seconds]
03:53:47Letur quits [Ping timeout: 252 seconds]
03:53:47Letur0 is now known as Letur
03:53:49DLoader_ is now known as DLoader
03:54:18raxxy-137409 joins
03:54:21Dj-Wawa joins
03:54:23Ketchup901 (Ketchup901) joins
03:54:25luckcolors (luckcolors) joins
03:54:42mikael joins
04:02:02emberquill quits [Quit: The Lounge - https://thelounge.chat]
04:02:22emberquill (emberquill) joins
04:03:43dumbgoy__ quits [Ping timeout: 258 seconds]
04:05:35Ketchup901 quits [Remote host closed the connection]
04:05:53Ketchup901 (Ketchup901) joins
04:06:29lennier1 quits [Client Quit]
04:10:56AmAnd0A quits [Read error: Connection reset by peer]
04:11:08AmAnd0A joins
04:13:20Island quits [Read error: Connection reset by peer]
04:28:14lennier1 (lennier1) joins
04:37:44dumbgoy__ joins
04:53:33Megame quits [Client Quit]
04:57:40Jake quits [Ping timeout: 265 seconds]
05:20:13Misty|m joins
05:25:33<nicolas17>what is the .cdx.idx file?
05:56:21hitgrr8 joins
06:06:33infiliotech joins
06:08:23infiliotech quits [Remote host closed the connection]
06:23:29BlueMaxima quits [Client Quit]
06:38:00AlbertLarsan68 (AlbertLarsan68) joins
06:49:59<h2ibot>PaulWise edited Mailman2 (+65, lug-owl.de lists): https://wiki.archiveteam.org/?diff=49980&oldid=49971
07:02:02<h2ibot>PaulWise edited Mailman2 (+38, tuhs lists): https://wiki.archiveteam.org/?diff=49981&oldid=49980
07:18:55Jake (Jake) joins
07:23:48Irenes quits [Quit: WeeChat 3.7.1]
07:32:49pabs quits [Ping timeout: 265 seconds]
07:36:49fishingforsoup_ joins
07:38:23fishingforsoup quits [Ping timeout: 258 seconds]
07:51:35jtagcat quits [Quit: Ping timeout (120 seconds)]
07:51:45jtagcat (jtagcat) joins
08:03:43Irenes (ireneista) joins
08:55:17Chris5010 quits [Quit: ]
09:06:38Chris5010 (Chris5010) joins
09:07:06BigBrain quits [Ping timeout: 245 seconds]
09:20:55Chris50106 (Chris5010) joins
09:23:01Chris5010 quits [Ping timeout: 265 seconds]
09:23:01Chris50106 is now known as Chris5010
09:31:32BigBrain (bigbrain) joins
09:49:43pabs (pabs) joins
09:53:43W7RFa6AbNFz quits [Read error: Connection reset by peer]
09:53:43W7RFa6AbNFz_ joins
10:00:01railen63 quits [Remote host closed the connection]
10:00:19railen63 joins
10:08:53Ruthalas5 quits [Ping timeout: 252 seconds]
10:10:06Hajdar quits [Client Quit]
10:10:48Ruthalas5 (Ruthalas) joins
10:18:09icedice (icedice) joins
10:28:21sec^nd quits [Ping timeout: 245 seconds]
10:29:25sec^nd (second) joins
10:38:40Paro joins
10:39:31Paro leaves
10:51:49Arcorann (Arcorann) joins
11:01:17Hajdar (Hajdar) joins
11:05:10sec^nd quits [Remote host closed the connection]
11:06:00sec^nd (second) joins
11:42:50<Iki1>IA-excluded sites: https://twitter.com/LexerLux https://lexerlux.substack.com/
11:56:00decky_e quits [Remote host closed the connection]
12:21:18qwertyasdfuiopghjkl is now known as qwertyasdfuiopghjkl_
12:21:18yts98 leaves
12:21:26yts98 joins
12:21:49qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
12:22:52sonick quits [Client Quit]
12:25:06justmolamola quits [Remote host closed the connection]
12:26:39qwertyasdfuiopghjkl_ quits [Client Quit]
12:54:36Unholy23613 (Unholy2361) joins
12:58:17Unholy2361 quits [Ping timeout: 252 seconds]
12:58:17Unholy23613 is now known as Unholy2361
14:02:38Arcorann quits [Ping timeout: 252 seconds]
15:12:13<nicolas17>what targets are being used on lineblog, imgur, and reddit?
15:27:29<h2ibot>Hexspark edited Mailman2 (-2, +ce): https://wiki.archiveteam.org/?diff=49982&oldid=49981
15:27:30<h2ibot>Hexspark edited List of websites excluded from the Wayback Machine (+33, + https://lexerlux.substack.com/): https://wiki.archiveteam.org/?diff=49983&oldid=49918
15:27:31<h2ibot>Hexspark edited List of websites excluded from the Wayback Machine/Partial exclusions/Twitter accounts (+30, + https://twitter.com/LexerLux): https://wiki.archiveteam.org/?diff=49984&oldid=49914
15:35:01<nicolas17>JAA: imgur is progressing faster than lineblog, we may want to change what targets they use or pause imgur...
15:35:42railen63 quits [Remote host closed the connection]
15:35:59railen63 joins
15:36:54<nicolas17>at the current speed, lineblog won't meet the deadline, so I hope IA gets less busy soon
15:42:11<nicolas17>but I wasn't sure if lineblog and imgur even used overlapping sets of targets
15:42:51<@JAA>They use the same targets.
15:43:06<@JAA>Well, target, singular.
15:43:13<nicolas17>there's optane9 and hel1
15:43:25<@JAA>Not on imgur or lineblog.
15:43:28<nicolas17>and I think one of lineblog,reddit,imgur was using only optane9 and another was using both
15:43:42<@JAA>Egloos is also only optane9.
15:43:50<@JAA>Reddit has hel1 and optane9.
15:45:55<nicolas17>egloos is totally out of my radar atm :)
15:46:25za3k quits [Client Quit]
15:46:27<nicolas17>hel1 is only being used on reddit now? it seems pretty damn busy despite that
15:47:23<masterX244>probably time for the autotargets again since we got burning sites... not sure if the donation stuff for that got figured out yet
15:49:27<nicolas17>masterX244: I think the donation stuff got complicated because rewby has a hetzner invoice with both his archiveteam stuff and his personal stuff, and it was messy to ask for reimbursement for the AT part alone, the website only allowed entering the invoice total or something?
15:49:48<@JAA>Yeah, that's been sorted out.
15:49:51<nicolas17>ah good
15:50:16<nicolas17>if that's sorted out then yeah, maybe he can start incurring costs again :P
15:56:49Misty quits [Remote host closed the connection]
15:58:40<@arkiver>I believe Album Archive (from Google) is not publicly accessible right?
16:00:35<h2ibot>JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=49985&oldid=49983
16:01:42<nicolas17>no, I think that's personal photos
16:01:54<@arkiver>yeah
16:28:02CanRabbit joins
17:18:27<rktk>I'm not sure where to put this, but https://vhscollector.com is not necessarily at risk of disappearing, but appears abandoned in a sense of updates and admin maintenance
17:18:47<rktk>contains a lot of information on VHS and other older formats like that .similar to LDDB and Discogs
17:19:12<rktk>their admin contact email gives a bounceback...
17:19:38<rktk>er well, there was an update in may to bring the site back to life, but... TBD
17:27:39<@JAA>rktk: Thanks, it's running through ArchiveBot now.
17:42:01<nstrom|m>I'm getting super slow (~50KB/sec) speeds to hel1 target from some locations
17:44:26AlbertLarsan68 quits [Client Quit]
17:44:43<@kaz>out of interest, which locations
17:44:43AmAnd0A quits [Ping timeout: 265 seconds]
17:45:26AmAnd0A joins
17:45:59<nstrom|m>hil hetzner , buf chi colocrossing
17:46:58<kiska>You have BBR enabled since hel is quite far away from those locations
17:47:00<nstrom|m>maybe a transatlantic cable is busted
17:47:23<nstrom|m>bbr?
17:47:43<kiska>tcp bbr
17:50:51BigBrain quits [Ping timeout: 245 seconds]
17:52:46BigBrain (bigbrain) joins
17:55:31DLoader_ joins
17:56:08<nstrom|m>interesting. I probably never noticed before but spun up some workers for mediafire since looked like that could use some help and it's sending big files that are taking a while :)
17:56:20<nstrom|m>eh as long as nothing's broken I'll just let it do its thing
17:57:05DLoader quits [Ping timeout: 258 seconds]
17:57:12DLoader_ is now known as DLoader
18:05:45<imer>seeing about 250kib/s there as well if it lets me through ^
18:06:03<imer>within europe
18:07:09<nstrom|m>of course after I posted that things sped up /shrug
18:07:30<nstrom|m> 660,733,952 13% 4.03MB/s 0:16:57
18:07:31<nstrom|m>big boi
18:30:55<mikolaj|m>fireonlive: Forum-dlthe problems with Tom's Hardware
18:30:57<mikolaj|m>aw
18:31:16<mikolaj|m>fireonlive: Forum-dl's problems with Tom's Hardware should be fixed, please let me know if there are still any issues
18:35:00<fireonlive>mikolaj|m: thank you ^_^
19:05:42AmAnd0A quits [Ping timeout: 258 seconds]
19:11:31AmAnd0A joins
19:16:04AmAnd0A quits [Ping timeout: 265 seconds]
19:17:05AmAnd0A joins
19:24:17tiger_millionaire quits [Ping timeout: 265 seconds]
19:52:35decky_e (decky_e) joins
19:55:55AmAnd0A quits [Ping timeout: 258 seconds]
19:55:56AmAnd0A joins
20:35:49Perk quits [Read error: Connection reset by peer]
20:38:58Perk joins
20:44:16Megame (Megame) joins
20:47:18BigBrain quits [Remote host closed the connection]
20:47:45BigBrain (bigbrain) joins
20:48:58<fireonlive>pabs: i found this: https://www.spinics.net/lists/ while looking to fix an issue I had but it looks like maybe it's just mirrors of other mailman lists?
20:50:24<fireonlive>or lists in general. not sure what to make of it; but I thought i'd throw it your way if you havne't seen ti before; not sure how much use it is
21:00:05skyrock3t quits [Ping timeout: 252 seconds]
21:01:34skyrocket joins
21:10:39jacksonchen666 (jacksonchen666) joins
21:14:01Unholy2361 quits [Remote host closed the connection]
21:17:40Island joins
21:18:16Unholy23613 (Unholy2361) joins
21:20:09Unholy23613 quits [Remote host closed the connection]
21:20:37Unholy23613 (Unholy2361) joins
21:40:08hitgrr8 quits [Client Quit]
21:57:49AmAnd0A quits [Ping timeout: 258 seconds]
21:57:52AmAnd0A joins
22:00:03Hajdar quits [Remote host closed the connection]
22:00:21Hajdar (Hajdar) joins
22:02:25AmAnd0A quits [Ping timeout: 258 seconds]
22:08:33@Fusl quits [Excess Flood]
22:08:51Fusl (Fusl) joins
22:08:51@ChanServ sets mode: +o Fusl
22:08:53AmAnd0A joins
22:15:52AmAnd0A quits [Ping timeout: 265 seconds]
22:17:04AmAnd0A joins
22:33:05AmAnd0A quits [Ping timeout: 258 seconds]
22:35:02AmAnd0A joins
22:48:42AmAnd0A quits [Read error: Connection reset by peer]
22:49:02AmAnd0A joins
23:05:29Gereon quits [Ping timeout: 252 seconds]
23:10:23AmAnd0A quits [Read error: Connection reset by peer]
23:10:31AmAnd0A joins
23:11:00AmAnd0A quits [Read error: Connection reset by peer]
23:11:32AmAnd0A joins
23:17:18<@JAA>Knowledge Adventure S3 bucket will be 50% done in the next hour or two. 16.3 TiB downloaded into 1.6 TiB WARCs so far.
23:17:53<fireonlive>:D
23:17:54Gereon (Gereon) joins
23:17:56<nicolas17>neat.jpg
23:18:07<fireonlive>the wonders of compression
23:18:29<@JAA>Mostly dedupe, since I'm downloading everything four times.
23:18:43<@JAA>Plus all the dupes between different files.
23:20:33<fireonlive>indeed
23:21:02<fireonlive>i wonder if any of the 'big cloud' providers do dedupe at scale
23:21:18<@arkiver>they sure do
23:21:28<fireonlive>surely they don't hold on to 10k users' copy of otters.mp4
23:21:32<fireonlive>ah good lol
23:21:39<@arkiver>in a much better way us
23:21:49<@arkiver>we just write a duplicate if a single record is a duplicate in a single session
23:22:08<@arkiver>i bet google and such do deduplication for all files, and perhaps even parts of files
23:23:18<fireonlive>ahh i see
23:23:40<fireonlive>i read somewhere dropbox didn't but hmm companies are very secretive about that it seems lol
23:23:55<fireonlive>that or it's buried in $whitepaper or $contalk
23:24:37<nicolas17>I think Apple deduplicates *despite encryption*, by deriving the encryption key from a hash of the file
23:24:56<fireonlive>(and otters has no double meaning don't look it up 😇)
23:25:26<imer>why did you do this to me
23:25:32<fireonlive>xD
23:25:42<fireonlive>nicolas17: oh hmmm
23:25:52<fireonlive>they do have their security whitepaper but I can't remembr mucha bout it at this pt
23:25:56<nicolas17>https://en.wikipedia.org/wiki/Convergent_encryption
23:26:20<fireonlive>oh which is now available as both pdf and website; nice
23:26:22<fireonlive>https://support.apple.com/en-ca/guide/security/welcome/web
23:26:49<fireonlive>(pdf: https://help.apple.com/pdf/security/en_US/apple-platform-security-guide.pdf)
23:27:25<fireonlive>kinda cool they'd spend the resources (and now design resources) making such documentation
23:31:34AmAnd0A quits [Read error: Connection reset by peer]
23:31:51AmAnd0A joins
23:41:25AmAnd0A quits [Ping timeout: 265 seconds]
23:43:37AmAnd0A joins
23:48:12BlueMaxima joins
23:54:38<mikolaj|m>I'd like to throw http://www.racjonalista.pl/ to ArchiveBot, the site doesn't look too healthy but has a still-active (custom software) Polish-language forum with posts dating to 2006