00:12:01nepeat quits [Ping timeout: 272 seconds]
00:13:32nepeat (nepeat) joins
00:17:21ATinySpaceMarine quits [Client Quit]
00:20:15nepeat quits [Ping timeout: 272 seconds]
00:21:06etnguyen03 quits [Client Quit]
00:23:30nepeat (nepeat) joins
00:32:33<klea>btw, JAA if you give dir-to-ia a folder with a file that has no content it will fail as dirty because IA doesn't allow files with no content-length
00:33:09<klea>A request of the requested method PUT requires a valid Content-length.
00:38:54<katia>We archiving empty files now ig
00:39:54<klea>no, but i noticed that it marked it as dirty when i accidentally left an empty file from test.
00:39:54<@JAA>Reasonable
00:40:13<@JAA>I mean, I could make it fail locally instead.
00:40:27<klea>you could make it not mark it as dirty, or is that a bad idea?
00:40:35<katia>Doesn’t seem better
00:40:39<katia>Fix your empty files
00:41:01<klea>yeah, that is also a good idea
00:41:13<klea>and having dir-to-ia add a marker when it has successfully finished uploading would be possible?
00:41:17<@JAA>This might actually be a bug in ia-upload-stream though. You can upload empty files to IA.
00:41:23<klea>oh.
00:41:37<katia>klea: have a patch?
00:41:48<klea>katia: i can create patch if JAA wants.
00:41:53<@JAA>A significant percentage of DPoS items have an empty .tar.
00:42:10<@JAA>Random example: https://archive.org/download/archiveteam_urls_20260125001525_d9e94fa7
00:42:17<klea>JAA: we can't see.
00:42:22<klea>oh wait what
00:42:27<@JAA>Of course you can.
00:42:52<klea>huh
00:45:26<c3manu>bookdown.org is still running, but by now i queued all the books listed on https://www.bookdown.org/home/archive/ (i don't think i missed any, i went through them manually)
00:46:07<c3manu>i mean the ones that are hosted on domains other than bookdown.org
00:59:01ATinySpaceMarine joins
01:03:47Shard116 (Shard) joins
01:04:35Shard11 quits [Ping timeout: 272 seconds]
01:04:35Shard116 is now known as Shard11
01:04:59etnguyen03 (etnguyen03) joins
01:20:01midou joins
01:24:11<klea>i wonder why JAA choose the hextable approach rather than strtol or strtoul: https://stackoverflow.com/questions/10324/convert-a-hexadecimal-string-to-an-integer-efficiently-in-c
01:24:24<klea>from #archiveteam-ot (i was told that was off topic there)
01:24:33Sk1d quits [Read error: Connection reset by peer]
01:25:10<@JAA>I didn't write that code. But I suspect a static array lookup is faster than a function call.
01:25:35<steering>especially when that function call is gonna do a bunch of math on it
01:25:40<klea>oh.
01:25:50<steering>(and a ton of branches)
01:26:00<steering>https://sourceware.org/git/?p=glibc.git;a=blob;f=stdlib/strtol_l.c;h=ac53312ba87f48a86ca7a6e494ea8ae0838e0b6e;hb=HEAD#l236
01:26:13<@JAA>Anubis--
01:26:13<eggdrop>[karma] 'Anubis' now has -14 karma!
01:26:15<klea>anubis--
01:26:15<eggdrop>[karma] 'anubis' now has -15 karma!
01:27:03<klea>i was too far away from keyboard to type a uppercase a (also was going to type it with the wrong hand)
01:27:23midou quits [Ping timeout: 272 seconds]
01:27:25<@JAA>Yeah, fun function
01:30:19<PC>is #photosucket dead? i've got some image URLs that aren't saving (and since it may be a free account, they might not be around forever, but i think they'll last a few months), but the channel seems pretty quiet
01:31:03<@JAA>Channels for inactive projects tend to be quiet. It's the right channel.
01:32:14<PC>nods. so inactive currently? wondering what the best way to get those images on the WBM would be. (i'm also curious if that'll fix them where they're embedded, since i've got a livejournal post with a bunch of them that are just showing as broken images)
01:33:45<klea>JAA: how bad of an idea is running this script in a loop somewhere as long as i won't have dir-to-ia folders in the dir-to-ia directory that will be getting new files in a incremental manner? <https://transfer.archivete.am/inline/vAP6T/clean-dir-to-ia.sh>
01:40:11<@JAA>klea: I can't be bothered to think about the interactions with the current version of dir-to-ia, but regardless, you'd be relying on implementaiton details that might change at any time.
01:40:34<klea>could you expose things that aren't implementation details that won't change?
01:40:49<@JAA>Those are in the config file.
01:41:00<@JAA>And in the --help text.
01:41:11PC quits [Quit: PC]
01:41:33<klea>that doesn't allow me to hook into jobs failing, or jobs completing succesfully
01:41:53<klea>i wanted to delete just the .dir-to-ia metadata files and then the directory they're in.
01:41:53<@JAA>Correct
01:42:48PC joins
01:58:44Wohlstand quits [Quit: Wohlstand]
02:12:19<@Fusl>klea justauser: update just in case youre wondering, jason pointed me to a discord server with people that archive old gaming magazines and similar stuff, im talks with them now
02:12:33<klea>ah
02:12:56<klea>Fusl: if you could, and the discord seems to not be too small, could you consider giving a invite to #discard for archival?
02:15:41<@Fusl>klea: sure, just did
02:15:47<klea>thanks
02:15:53<@Fusl>ofc!
02:20:18PC quits [Ping timeout: 256 seconds]
02:33:35midou joins
02:42:01<kiska>Fusl as someone who works in freight, do not send those magazines internationally. They will experience the most horrific things that happen when a worker has to move up to 1.2k cartons per hour
02:51:46<nicolas17>many years ago I had a subscription to a weekly electronics magazine, which came with components
02:52:25<nicolas17>in a little bag together attached to the magazine
02:52:32<nicolas17>all the way from Spain to Argentina
02:52:35<nicolas17>there wasn't a single issue where the integrated circuits came intact, I always had to un-bend the pins
02:54:04<klea>i wonder what company did that.
02:54:51<nicolas17>klea: well a few years ago I scanned every single of the 1200 pages so here you go https://archive.org/details/fg-electronica-modular/
02:55:01<klea>huh
02:55:04<klea>how did you do those scans?
02:55:30<nicolas17>flatbed scanner
02:55:53<nicolas17>the pages were pretty much designed to be detached and put into a binder
02:56:01<klea>oh
02:56:50<that_lurker>IA has some interesting magazines stored
02:57:04<klea>btw, how did you load it to archive?
02:57:28<klea>oh simply electronica-modular-NN_images.zip files?
02:57:31<nicolas17>yes
02:57:36<klea>nice
02:58:00<nicolas17>the format is simply a zip with image files, which should have filenames that sort correctly
02:58:20<that_lurker>nicolas17: you should add the scanner to the item metadata :-)
02:58:37<@Fusl>nicolas17: flatbed scanner is probably also how im going to archive mine if i cant find anyone who will do it for me but its going to be a lot of work to go through all of them :/
02:59:42<klea>nicolas17 apparently went trough 1200 pages :p
02:59:42<nicolas17>that_lurker: it's unclear to me what the 'scanner' metadata property is supposed to mean
03:00:18<klea><https://archive.org/developers/metadata-schema/index.html#scanner>
03:00:43<nicolas17>given that the uploader likes to stick "Internet Archive HTML5 Uploader 1.7.0" there
03:01:18<klea>i suppose it should be the physical flatbed scanner that you used.
03:07:20Wohlstand1 (Wohlstand) joins
03:09:48Wohlstand1 is now known as Wohlstand
03:27:44oxtyped quits [Ping timeout: 256 seconds]
03:29:18<h2ibot>Klea edited List of websites excluded from the Wayback Machine/Partial exclusions (+246, Add…): https://wiki.archiveteam.org/?diff=60331&oldid=60321
03:35:04<cruller>That reminds me, is [[ArchiveCorps]] still alive?
03:36:46<klea>given the last actual edit with actual changes that aren't gramatical, automated changes, or changes in style was in 2015-08-30, i'd say no. <https://wiki.archiveteam.org/index.php?title=ArchiveCorps&oldid=24132>
03:36:50oxtyped joins
03:37:07Webuser516269 quits [Quit: Ooops, wrong browser tab.]
03:37:13<klea>i should've used the Special: revision link format.
03:39:39DogsRNice quits [Read error: Connection reset by peer]
03:44:23<cruller>Their site was accessible as of last August, but was not being actively updated. https://web.archive.org/web/20250101000000*/http://www.archivecorps.org/
03:47:44<klea>domain got parked 2025-09-08 so likely expired. <view-source:https://web.archive.org/web/20250908154038if_/http://www.archivecorps.org/>
03:47:57<klea>smh
03:48:01<klea>(*) <https://web.archive.org/web/20250908154038if_/http://www.archivecorps.org/>
03:53:11Webuser692881 joins
03:53:22Webuser692881 quits [Client Quit]
04:03:52Wohlstand quits [Client Quit]
04:20:03etnguyen03 quits [Quit: Konversation terminated!]