00:19:49Naruyoko joins
00:41:16SootBector quits [Remote host closed the connection]
01:09:56lennier2 quits [Read error: Connection reset by peer]
01:10:12lennier2 joins
01:37:56<flashfire42>Hel1 is still clogged atm but we have other projects running on optane so it's fine
02:02:53ave5 (ave) joins
02:03:12nepeat quits [Client Quit]
02:04:38nepeat (nepeat) joins
02:05:25ave quits [Ping timeout: 272 seconds]
02:05:25ave5 is now known as ave
02:35:27Island joins
03:01:23pseudorizer quits [Quit: ZNC 1.8.2 - https://znc.in]
03:02:08pseudorizer (pseudorizer) joins
03:11:49sec^nd quits [Remote host closed the connection]
04:20:00ctag quits [Read error: Connection reset by peer]
04:20:36ctag (ctag) joins
04:28:08lexikiq quits [Client Quit]
05:00:00<nicolas17>progress with samsung-opensource: I found a few cases of multiple files in the same "item" with the same filename
05:01:02<nicolas17>there's two cases where an "upload" has two source file "attachments" with the same filename, and there's one case where an "upload" has a .zip file in the list of sources and another .zip file in the list of "announcements"
05:01:57<nicolas17>then I verified these are actually identical (except for the 'attachment ID'), so we're fine, I can keep using {upload_id}/{filename} and don't need to disambiguate with {attach_id}
05:21:09DogsRNice quits [Read error: Connection reset by peer]
05:56:20GhostyTongue joins
05:57:30<GhostyTongue>I found a cloudfront bucket of an Nvidia tech demo https://dcb18d6mfegct.cloudfront.net/
05:59:46<GhostyTongue>I feel like it should be archived
06:02:30GhostyTongue quits [Remote host closed the connection]
06:08:14<Terbium>time to increase Nvidia's AWS fees
06:14:24Doranwen quits [Remote host closed the connection]
06:14:44Doranwen (Doranwen) joins
06:26:30Naruyoko5 joins
06:30:47Naruyoko quits [Ping timeout: 272 seconds]
06:31:41Naruyoko joins
06:32:03Naruyoko5 quits [Ping timeout: 272 seconds]
07:21:20nulldata quits [Ping timeout: 240 seconds]
07:23:03Ruthalas59 quits [Client Quit]
07:28:52nulldata (nulldata) joins
07:33:52Wohlstand (Wohlstand) joins
07:35:30Wohlstand quits [Client Quit]
07:40:18Ruthalas59 (Ruthalas) joins
09:12:46c3manu (c3manu) joins
10:00:03Bleo18260 quits [Client Quit]
10:01:20Bleo18260 joins
10:10:08Naruyoko quits [Remote host closed the connection]
10:10:29Naruyoko joins
10:11:09Island quits [Read error: Connection reset by peer]
11:42:59BlueMaxima quits [Client Quit]
12:19:11hitgrr8 joins
13:09:50<h2ibot>Bzc6p edited List of website hosts (-30, /* E */ eOldal is gone): https://wiki.archiveteam.org/?diff=51506&oldid=51496
13:09:51<h2ibot>Bzc6p edited List of website hosts (+30, Undo revision 51506 by…): https://wiki.archiveteam.org/?diff=51507&oldid=51506
13:16:51<h2ibot>Bzc6p edited List of website hosts (+303, Add Webnode, uCoz, eStránky and ini.hu, cross…): https://wiki.archiveteam.org/?diff=51508&oldid=51507
13:21:20Arcorann quits [Ping timeout: 240 seconds]
13:23:19Megame (Megame) joins
14:00:28eroc1990 quits [Read error: Connection reset by peer]
14:02:28eroc1990 (eroc1990) joins
14:19:32<angenieux>Does anybody else here running project-specific docker containers use docker volumes or a tmpfs mount for the /grab/data directory?
14:21:26nfriedly quits [Remote host closed the connection]
14:22:44<angenieux>The docker documentation says that docker volumes are higher performance than the writable layer of the container
14:24:21<angenieux>"Storage drivers are optimized for space efficiency, but (depending on the storage driver) write speeds are lower than native file system performance, especially for storage drivers that use a copy-on-write filesystem. Write-intensive applications, such as database storage, are impacted by a performance overhead, particularly if pre-existing data
14:24:22<angenieux>exists in the read-only layer."
14:26:01<angenieux>I'm wondering if this would be applicable to this use case
14:30:14<angenieux>In that same vein, I am also wondering if using --network host would be beneficial
14:31:05katia quits [Remote host closed the connection]
14:31:56katia (katia) joins
14:33:48<angenieux>I've searched around a bit, but I can't seem to find a clear answer
14:36:04<angenieux>Oh and I recently read this article: https://pythonspeed.com/articles/docker-performance-overhead/
14:42:28<Terbium>I sometimes mount to tmpfs to reduce disk writes for the projects, although I haven't gauged the performance difference
14:42:51<Terbium>Writing to RAM is still likely an order of magnitude faster than whatever you'd gain from Docker volumes (non-bind mount)
14:47:39<angenieux>When you say you mount to tmpfs, you are using something like "--mount type=tmpfs,destination=/grab/data" as a part of the docker run command?
14:50:10<angenieux>I was thinking of using docker volumes for projects with large item size like mediafire and youtube, where using tmpfs isn't an option
14:54:34Megame quits [Client Quit]
15:30:40<Terbium>yep, I also use /dev/shm
15:31:00<Terbium>really depends on how much RAM you have
15:33:50nic9070 quits [Ping timeout: 240 seconds]
15:34:01nic9070 (nic) joins
15:39:20Ruthalas59 quits [Ping timeout: 240 seconds]
15:43:27Guest42 joins
15:44:08Guest42 quits [Remote host closed the connection]
15:46:43<@JAA>We should probably document the tmpfs thing on the wiki if it isn't there yet.
16:00:41<@JAA>762 GiB in that bucket
16:03:38<@JAA>373k files
16:04:31<@JAA>There are a bunch of dupes in it.
16:20:12pedantic-darwin2 joins
16:20:12ctag quits [Read error: Connection reset by peer]
16:20:50ctag (ctag) joins
16:22:19pedantic-darwin quits [Ping timeout: 272 seconds]
16:22:19pedantic-darwin2 is now known as pedantic-darwin
16:51:23n joins
16:51:36n quits [Remote host closed the connection]
17:01:20aninternettroll quits [Remote host closed the connection]
17:01:20aninternettroll_ is now known as aninternettroll
17:03:22aninternettroll_ (aninternettroll) joins
17:10:39<betamax>Sorry for (yet more) HDD recovery questions:
17:10:45<betamax>ddrescue has now finished, with 99.99% OK (so hopefully I've lost very little)
17:10:51<betamax>fsck reports some bad files, but annoyingly won't tell me which ones (or at least I can't find an option to do so)
17:11:09<betamax>Should I just run fsck -p and hope for the best? Or is there a better approach that tells me what data has been lost (it may be that I have other copies of those bad files)
17:12:57<betamax>Oh, and I made a backup of the .img before running any fsck commands :)
18:10:54<@JAA>Unique size of the bucket is only 299 GiB.
18:59:14<@JAA>There are some fun files in this bucket. I've found a 134 MB PNG.
19:00:27<pedantic-darwin>satellite images?
19:01:57<@JAA>No, video game textures
19:05:40DogsRNice joins
19:11:50Ruthalas59 (Ruthalas) joins
19:22:54lennier2_ joins
19:25:59lennier2 quits [Ping timeout: 272 seconds]
19:38:06<h2ibot>Pokechu22 edited Jira (+34, https://jira.automotivelinux.org/): https://wiki.archiveteam.org/?diff=51509&oldid=51441
19:49:53<Vokun>Any idea what we might do for the runescape forum?
19:52:23<@JAA>The site's a mess with multiple URLs for the same pages etc. Just for the homepage, there's https://secure.runescape.com/m=forum/forums and https://secure.runescape.com/m=forum/sl=0/forums depending on where you click on the 'forums' link...
19:55:01adaz joins
19:55:02adaz quits [K-Lined]
20:23:17katia quits [Remote host closed the connection]
20:24:02katia (katia) joins
20:25:53Megame (Megame) joins
20:40:43itachi1706 quits [Ping timeout: 272 seconds]
20:42:13itachi1706 (itachi1706) joins
20:44:33c3manu quits [Client Quit]
20:46:42Megame quits [Client Quit]
20:58:41<pokechu22>archive.fart.website doesn't seem to be indexing some items properly, e.g. 37zavntdvg9vdssqduopqvehn <gazeta.grodno.net> is blank at https://archive.fart.website/archivebot/viewer/job/37zav but exists at https://archive.org/download/archiveteam_archivebot_go_20240111182359_0b544b6e (furthermore
20:58:43<pokechu22>https://archive.fart.website/archivebot/viewer/item/archiveteam_archivebot_go_20240111182359_0b544b6e is broken), and the same applies to grodno.net (https://archive.org/download/archiveteam_archivebot_go_20240111185002_6059879a), love.grodno.net (https://archive.org/download/archiveteam_archivebot_go_20240111192308_3224e5cd),
20:58:45<pokechu22>urls-transfer.archivete.am-chinesepainting.seattleartmuseum.org_new_url_list.txt (https://archive.org/download/archiveteam_archivebot_go_20240113050043_17b7c526). Not sure what might be causing this (all of the jobs were on different pipelines and they've all been derived on archive.org)
20:59:26<@JAA>> Last update: 2024-01-10 13:12:20.556265956 UTC
21:00:01<@JAA>So it wouldn't have seen those items yet, obviously.
21:00:04<@JAA>chfoo: ^
21:00:13<pokechu22>oh, that's a lot simpler of a pattern than I was thinking
21:00:34<pokechu22>For some reason I was under the impression that some newer stuff had been indexed, but I don't think I actually saw that; it was just my poor organization that made it seem that way
21:00:50<pokechu22>though also I don't think I ever noticed that update date; good to know for the future
21:10:54c3manu (c3manu) joins
21:11:34<ScenarioPlanet>What do you think about archiving old audio donations on DonationAlerts (~150kb each, a lot of them) ?
21:14:38aninternettroll_ quits [Remote host closed the connection]
21:14:51aninternettroll_ (aninternettroll) joins
21:14:56aninternettroll_ quits [Remote host closed the connection]
21:15:06aninternettroll_ (aninternettroll) joins
21:16:40aninternettroll_ quits [Remote host closed the connection]
21:18:07lennier2 joins
21:18:32aninternettroll_ (aninternettroll) joins
21:21:15lennier2_ quits [Ping timeout: 272 seconds]
21:21:32leo60228 quits [Quit: ZNC 1.8.2 - https://znc.in]
21:21:50leo60228 (leo60228) joins
21:24:53BlueMaxima joins
21:26:17aninternettroll quits [Remote host closed the connection]
21:27:24aninternettroll_ is now known as aninternettroll
21:33:28<h2ibot>Megame edited Deathwatch (+72, mix.tamatv.com - March 29): https://wiki.archiveteam.org/?diff=51510&oldid=51505
21:50:50andrew quits [Quit: ]
21:51:29andrew (andrew) joins
21:59:33<h2ibot>JustAnotherArchivist edited The WARC Ecosystem (+177, /* Tools */ Add Common Crawl's Nutch fork): https://wiki.archiveteam.org/?diff=51511&oldid=51481
22:11:34Wohlstand (Wohlstand) joins
22:28:58fireonlive quits [Quit: Connection gently closed by peer]
22:30:27mgrytbak quits [Quit: Ping timeout (120 seconds)]
22:30:37mgrytbak joins
22:35:01mgrytbak quits [Client Quit]
22:35:08mgrytbak joins
22:37:11toss (toss) joins
22:45:09<Terbium>Re: tachiyomi, I think they nuked their subreddit
22:45:51<Pedrosso>Was there any consensus on my question of whether a new DPoS of furaffinity should be done?
22:52:52fireonlive (fireonlive) joins
23:00:59Ruthalas59 quits [Client Quit]
23:01:19Ruthalas59 (Ruthalas) joins
23:08:11sec^nd (second) joins
23:09:52SootBector (SootBector) joins
23:10:57jacksonchen666 (jacksonchen666) joins
23:11:20fireonlive quits [Ping timeout: 240 seconds]
23:11:52bilboed quits [Quit: Ping timeout (120 seconds)]
23:12:07bilboed joins
23:13:01SootBector quits [Remote host closed the connection]
23:13:21fireonlive (fireonlive) joins
23:13:37SootBector (SootBector) joins
23:15:24sec^nd quits [Remote host closed the connection]
23:16:02sec^nd (second) joins
23:22:00Arcorann (Arcorann) joins
23:36:21jasmin joins
23:36:35jasmin is now known as Hallo
23:37:02Hallo quits [Remote host closed the connection]
23:41:18qwertyasdfuiopghjkl quits [Remote host closed the connection]
23:45:39fireonlive quits [Client Quit]
23:50:45fireonlive (fireonlive) joins
23:56:47Island joins
23:59:34hitgrr8 quits [Client Quit]