00:01:03loug8318142 quits [Quit: The Lounge - https://thelounge.chat]
00:07:26etnguyen03 (etnguyen03) joins
00:26:43etnguyen03 quits [Client Quit]
00:27:48beardicus quits [Ping timeout: 260 seconds]
00:43:10<pabs>do we have a way of archiving M3U files that also downloads the files they link to? AFAICT ArchiveBot does not do that
00:44:28<pabs>https://pig.observer/ links to some live video cameras with M3U files, and I'm a bit worried that if I do the link extraction here, by the time AB gets to the linked chunks, they might have been deleted
00:50:44<pabs>hmm, its too JavaScripty
00:58:13beardicus (beardicus) joins
01:10:59etnguyen03 (etnguyen03) joins
01:12:07<@OrIdow6>pabs: Uh, you could do it with wget-at though you might need to do some "clever" stuff to make it work in time, maybe qwarc would work?
01:12:11<@OrIdow6>Testing how long they last
01:12:47<@OrIdow6>I will say traffic cameras feel weird archving-wise
01:12:53<@OrIdow6>Since they're going continuously
01:13:01<@OrIdow6>And ephemeral by nature
01:13:06<pabs>they don't use <video> so it won't work in the WBM anyway, too much JS
01:13:21<pabs>yeah, just wanted a couple snapshots to preserve the feel of the project
01:14:10<@OrIdow6>Hm, one ts expired after 4 minutes
01:19:51<@JAA>Yeah, wouldn't be hard with qwarc.
01:20:26<@JAA>They probably only keep a few segments on the server. That's how most live stream things work.
01:30:10Webuser363958 joins
01:30:48Webuser3639580 joins
01:32:59Webuser363958 quits [Client Quit]
01:32:59Webuser3639580 quits [Client Quit]
01:35:58<h2ibot>PaulWise edited Mailing Lists (+94, add Simplelists): https://wiki.archiveteam.org/?diff=54251&oldid=54182
01:37:58<h2ibot>PaulWise edited Mailing Lists (+5, better link for scheme lists): https://wiki.archiveteam.org/?diff=54252&oldid=54251
02:27:48<nicolas17>sometimes ts files are deleted from the origin server after like a minute, but the CDN keeps them cached for much longer
02:37:46<@OrIdow6>nicolas17: I didn't do anything funny with the URL I tried again, just got it with curl, so can't be that logn
02:37:50<@OrIdow6>*long
02:50:59PredatorIWD2 joins
03:01:57<eggdrop>[remind] OrIdow6: add thing to dw
03:05:53<@OrIdow6>!remindme 2h add thing to dw
03:05:53<eggdrop>[remind] ok, i'll remind you at 2025-01-18T05:05:53Z
03:14:38pabs quits [Ping timeout: 260 seconds]
03:15:23<h2ibot>Cooljeanius edited TikTok (+211, /* Vital Signs */ mention RedNote): https://wiki.archiveteam.org/?diff=54253&oldid=54248
04:26:21dontwashyourhands (dontwashyourhands) joins
04:26:30SootBector quits [Remote host closed the connection]
04:26:38<dontwashyourhands>Hello! I have a request for ArchiveBot if there is capacity to spare: https://oncinematimeline.com/
04:26:48SootBector (SootBector) joins
04:27:55<dontwashyourhands>I am not worried about the site itself going down. Rather, it's the painstakingly collected tweets that accompany each episode. A large number are gone already because people have deleted their Twitter accounts.
04:29:36dontwashyourhands quits [Client Quit]
04:31:42dontwashyourhands (dontwashyourhands) joins
04:40:12etnguyen03 quits [Remote host closed the connection]
04:59:16beardicus quits [Ping timeout: 250 seconds]
04:59:52DogsRNice quits [Read error: Connection reset by peer]
05:05:53<eggdrop>[remind] OrIdow6: add thing to dw
05:18:44ymgve_ quits [Read error: Connection reset by peer]
05:25:22<@OrIdow6>!remindme 2h add thing to dw
05:25:22<eggdrop>[remind] ok, i'll remind you at 2025-01-18T07:25:22Z
05:27:55<dontwashyourhands>thank you :)
05:28:00dontwashyourhands quits [Client Quit]
05:58:16<pokechu22>Interesting, some substacks don't have sitemaps: https://integ.substack.com/sitemap.xml - not sure why that'd be the case
06:16:18pabs (pabs) joins
06:21:18wickedplayer494 quits [Ping timeout: 260 seconds]
06:21:39wickedplayer494 joins
06:25:56PredatorIWD2 quits [Ping timeout: 250 seconds]
06:26:20notarobot1 quits [Read error: Connection reset by peer]
06:26:20pabs quits [Read error: Connection reset by peer]
06:26:50notarobot1 joins
06:27:33PredatorIWD2 joins
06:31:25SootBector quits [Remote host closed the connection]
06:31:43SootBector (SootBector) joins
06:56:52pabs (pabs) joins
07:01:14<h2ibot>Cooljeanius edited TikTok (+84, /* Archiving tools */ switch links to deleted…): https://wiki.archiveteam.org/?diff=54254&oldid=54253
07:06:15<h2ibot>Cooljeanius edited TikTok (+62, /* Archiving tools */ cleanup): https://wiki.archiveteam.org/?diff=54255&oldid=54254
07:25:22<eggdrop>[remind] OrIdow6: add thing to dw
07:35:49tek_dmn quits [Quit: ZNC - https://znc.in]
07:36:28ymgve joins
09:49:09Island quits [Read error: Connection reset by peer]
10:17:54midou joins
10:33:43loug8318142 joins
10:51:39tek_dmn (tek_dmn) joins
11:31:13pixel leaves [Disconnected: Replaced by new connection]
11:31:19pixel (pixel) joins
11:45:20MrMcNuggets (MrMcNuggets) joins
11:47:38szczot3k|m quits [Client Quit]
11:47:42szczot3k|m joins
12:00:06Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
12:01:34benjins3 quits [Remote host closed the connection]
12:01:52benjins3 joins
12:02:51Bleo18260072271962345 joins
12:26:31<kpcyrd>is there something like https://pypi.org/project/internetarchive/ but for Rust?
12:26:53Matthww quits [Remote host closed the connection]
12:46:10APOLLO03 joins
12:46:13APOLLO03 quits [Remote host closed the connection]
12:46:35APOLLO03 joins
12:46:42APOLLO03 quits [Client Quit]
12:56:24SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
12:57:38SkilledAlpaca418962 joins
13:06:34beardicus (beardicus) joins
13:28:52chrismrtn quits [Quit: leaving]
13:33:24chrismrtn (chrismrtn) joins
13:33:55BouncerServ is now known as `
13:50:15PredatorIWD2 quits [Read error: Connection reset by peer]
13:53:24PredatorIWD2 joins
14:07:58beardicus quits [Ping timeout: 260 seconds]
14:33:07<TheTechRobo>kpcyrd: What are you trying to do?
14:58:23kansei (kansei) joins
15:26:17beardicus (beardicus) joins
15:40:44Radzig quits [Remote host closed the connection]
15:43:53Radzig joins
16:03:56MrMcNuggets quits [Quit: WeeChat 4.3.2]
16:26:18qinplus_phone joins
16:31:22<kpcyrd>TheTechRobo: https://gitlab.archlinux.org/archlinux/arch-historical-archive/-/blob/master/upload_pkg_internetarchive.py?ref_type=heads
16:31:42<kpcyrd>essentially this, but for more distros
17:00:00<@imer>the s3-like upload api isnt too complicated to operate with just http requests: https://archive.org/developers/ias3.html
17:00:00<@imer>e.g. curl command: https://gitea.arpa.li/ArchiveTeam/archiveteam-megawarc-factory/src/commit/e183dd0578cd4db55b805d7cdd4abe564c1ea82d/upload-one#L75-L89
17:00:42Blueacid joins
17:01:11<szczot3k>Isn't it s3-compatible? So any lib for s3 should work?
17:01:46ymgve quits [Quit: Leaving]
17:03:09<@imer>no idea.
17:04:07<szczot3k>"How this is different from normal S3" section looks like it would be possible, just without some features
17:19:21Naruyoko joins
17:25:00ymgve joins
17:28:54sec^nd quits [Remote host closed the connection]
17:28:54BornOn420 quits [Remote host closed the connection]
17:29:19sec^nd (second) joins
17:29:29BornOn420 (BornOn420) joins
17:42:03beardicus quits [Ping timeout: 260 seconds]
17:46:53beardicus (beardicus) joins
17:51:28beardicus quits [Ping timeout: 250 seconds]
18:02:58beardicus (beardicus) joins
18:36:04qinplus_phone quits [Client Quit]
19:04:16beardicus quits [Ping timeout: 250 seconds]
19:20:33beardicus (beardicus) joins
19:40:28<nicolas17>szczot3k: it can be a bit fragile https://status.digitalocean.com/incidents/zbrpd3j7hrrd
19:41:12<nicolas17>the latest official AWS SDK suddenly stopped working with DigitalOcean's supposedly "S3-compatible" storage, who knows if it would work with archive.org
20:06:57etnguyen03 (etnguyen03) joins
20:11:57Island joins
20:24:13beardicus quits [Ping timeout: 260 seconds]
20:36:00tertu quits [Quit: so long...]
20:37:42beardicus (beardicus) joins
21:00:39ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
21:02:40ThetaDev joins
21:25:01etnguyen03 quits [Client Quit]
21:47:38Doranwen quits [Ping timeout: 250 seconds]
21:53:17BlueMaxima joins
21:55:14Doranwen (Doranwen) joins
22:00:56tertu (tertu) joins
22:15:51etnguyen03 (etnguyen03) joins
22:41:48beardicus quits [Ping timeout: 250 seconds]
22:45:29th3z0l4 joins
22:58:30holbrooke joins
22:58:32holbrooke quits [Client Quit]
22:58:49holbrooke joins
23:01:40etnguyen03 quits [Client Quit]
23:06:59beardicus (beardicus) joins
23:36:31<h2ibot>PaulWise edited Blogger (+297, document custom blogger domains archiving): https://wiki.archiveteam.org/?diff=54257&oldid=51279