| 00:01:03 | | loug8318142 quits [Quit: The Lounge - https://thelounge.chat] |
| 00:07:26 | | etnguyen03 (etnguyen03) joins |
| 00:26:43 | | etnguyen03 quits [Client Quit] |
| 00:27:48 | | beardicus quits [Ping timeout: 260 seconds] |
| 00:43:10 | <pabs> | do we have a way of archiving M3U files that also downloads the files they link to? AFAICT ArchiveBot does not do that |
| 00:44:28 | <pabs> | https://pig.observer/ links to some live video cameras with M3U files, and I'm a bit worried that if I do the link extraction here, by the time AB gets to the linked chunks, they might have been deleted |
| 00:50:44 | <pabs> | hmm, its too JavaScripty |
| 00:58:13 | | beardicus (beardicus) joins |
| 01:10:59 | | etnguyen03 (etnguyen03) joins |
| 01:12:07 | <@OrIdow6> | pabs: Uh, you could do it with wget-at though you might need to do some "clever" stuff to make it work in time, maybe qwarc would work? |
| 01:12:11 | <@OrIdow6> | Testing how long they last |
| 01:12:47 | <@OrIdow6> | I will say traffic cameras feel weird archving-wise |
| 01:12:53 | <@OrIdow6> | Since they're going continuously |
| 01:13:01 | <@OrIdow6> | And ephemeral by nature |
| 01:13:06 | <pabs> | they don't use <video> so it won't work in the WBM anyway, too much JS |
| 01:13:21 | <pabs> | yeah, just wanted a couple snapshots to preserve the feel of the project |
| 01:14:10 | <@OrIdow6> | Hm, one ts expired after 4 minutes |
| 01:19:51 | <@JAA> | Yeah, wouldn't be hard with qwarc. |
| 01:20:26 | <@JAA> | They probably only keep a few segments on the server. That's how most live stream things work. |
| 01:30:10 | | Webuser363958 joins |
| 01:30:48 | | Webuser3639580 joins |
| 01:32:59 | | Webuser363958 quits [Client Quit] |
| 01:32:59 | | Webuser3639580 quits [Client Quit] |
| 01:35:58 | <h2ibot> | PaulWise edited Mailing Lists (+94, add Simplelists): https://wiki.archiveteam.org/?diff=54251&oldid=54182 |
| 01:37:58 | <h2ibot> | PaulWise edited Mailing Lists (+5, better link for scheme lists): https://wiki.archiveteam.org/?diff=54252&oldid=54251 |
| 02:27:48 | <nicolas17> | sometimes ts files are deleted from the origin server after like a minute, but the CDN keeps them cached for much longer |
| 02:37:46 | <@OrIdow6> | nicolas17: I didn't do anything funny with the URL I tried again, just got it with curl, so can't be that logn |
| 02:37:50 | <@OrIdow6> | *long |
| 02:50:59 | | PredatorIWD2 joins |
| 03:01:57 | <eggdrop> | [remind] OrIdow6: add thing to dw |
| 03:05:53 | <@OrIdow6> | !remindme 2h add thing to dw |
| 03:05:53 | <eggdrop> | [remind] ok, i'll remind you at 2025-01-18T05:05:53Z |
| 03:14:38 | | pabs quits [Ping timeout: 260 seconds] |
| 03:15:23 | <h2ibot> | Cooljeanius edited TikTok (+211, /* Vital Signs */ mention RedNote): https://wiki.archiveteam.org/?diff=54253&oldid=54248 |
| 04:26:21 | | dontwashyourhands (dontwashyourhands) joins |
| 04:26:30 | | SootBector quits [Remote host closed the connection] |
| 04:26:38 | <dontwashyourhands> | Hello! I have a request for ArchiveBot if there is capacity to spare: https://oncinematimeline.com/ |
| 04:26:48 | | SootBector (SootBector) joins |
| 04:27:55 | <dontwashyourhands> | I am not worried about the site itself going down. Rather, it's the painstakingly collected tweets that accompany each episode. A large number are gone already because people have deleted their Twitter accounts. |
| 04:29:36 | | dontwashyourhands quits [Client Quit] |
| 04:31:42 | | dontwashyourhands (dontwashyourhands) joins |
| 04:40:12 | | etnguyen03 quits [Remote host closed the connection] |
| 04:59:16 | | beardicus quits [Ping timeout: 250 seconds] |
| 04:59:52 | | DogsRNice quits [Read error: Connection reset by peer] |
| 05:05:53 | <eggdrop> | [remind] OrIdow6: add thing to dw |
| 05:18:44 | | ymgve_ quits [Read error: Connection reset by peer] |
| 05:25:22 | <@OrIdow6> | !remindme 2h add thing to dw |
| 05:25:22 | <eggdrop> | [remind] ok, i'll remind you at 2025-01-18T07:25:22Z |
| 05:27:55 | <dontwashyourhands> | thank you :) |
| 05:28:00 | | dontwashyourhands quits [Client Quit] |
| 05:58:16 | <pokechu22> | Interesting, some substacks don't have sitemaps: https://integ.substack.com/sitemap.xml - not sure why that'd be the case |
| 06:16:18 | | pabs (pabs) joins |
| 06:21:18 | | wickedplayer494 quits [Ping timeout: 260 seconds] |
| 06:21:39 | | wickedplayer494 joins |
| 06:22:02 | | wickedplayer494 is now authenticated as wickedplayer494 |
| 06:25:56 | | PredatorIWD2 quits [Ping timeout: 250 seconds] |
| 06:26:20 | | notarobot1 quits [Read error: Connection reset by peer] |
| 06:26:20 | | pabs quits [Read error: Connection reset by peer] |
| 06:26:50 | | notarobot1 joins |
| 06:27:33 | | PredatorIWD2 joins |
| 06:31:25 | | SootBector quits [Remote host closed the connection] |
| 06:31:43 | | SootBector (SootBector) joins |
| 06:56:52 | | pabs (pabs) joins |
| 07:01:14 | <h2ibot> | Cooljeanius edited TikTok (+84, /* Archiving tools */ switch links to deleted…): https://wiki.archiveteam.org/?diff=54254&oldid=54253 |
| 07:06:15 | <h2ibot> | Cooljeanius edited TikTok (+62, /* Archiving tools */ cleanup): https://wiki.archiveteam.org/?diff=54255&oldid=54254 |
| 07:25:22 | <eggdrop> | [remind] OrIdow6: add thing to dw |
| 07:35:49 | | tek_dmn quits [Quit: ZNC - https://znc.in] |
| 07:36:28 | | ymgve joins |
| 09:49:09 | | Island quits [Read error: Connection reset by peer] |
| 10:17:54 | | midou joins |
| 10:33:43 | | loug8318142 joins |
| 10:51:39 | | tek_dmn (tek_dmn) joins |
| 11:31:13 | | pixel leaves [Disconnected: Replaced by new connection] |
| 11:31:19 | | pixel (pixel) joins |
| 11:45:20 | | MrMcNuggets (MrMcNuggets) joins |
| 11:47:38 | | szczot3k|m quits [Client Quit] |
| 11:47:42 | | szczot3k|m joins |
| 12:00:06 | | Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:01:34 | | benjins3 quits [Remote host closed the connection] |
| 12:01:52 | | benjins3 joins |
| 12:02:51 | | Bleo18260072271962345 joins |
| 12:26:31 | <kpcyrd> | is there something like https://pypi.org/project/internetarchive/ but for Rust? |
| 12:26:53 | | Matthww quits [Remote host closed the connection] |
| 12:46:10 | | APOLLO03 joins |
| 12:46:13 | | APOLLO03 quits [Remote host closed the connection] |
| 12:46:35 | | APOLLO03 joins |
| 12:46:42 | | APOLLO03 quits [Client Quit] |
| 12:56:24 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
| 12:57:38 | | SkilledAlpaca418962 joins |
| 13:06:34 | | beardicus (beardicus) joins |
| 13:28:52 | | chrismrtn quits [Quit: leaving] |
| 13:33:24 | | chrismrtn (chrismrtn) joins |
| 13:33:55 | | BouncerServ is now known as ` |
| 13:50:15 | | PredatorIWD2 quits [Read error: Connection reset by peer] |
| 13:53:24 | | PredatorIWD2 joins |
| 14:07:58 | | beardicus quits [Ping timeout: 260 seconds] |
| 14:33:07 | <TheTechRobo> | kpcyrd: What are you trying to do? |
| 14:58:23 | | kansei (kansei) joins |
| 15:26:17 | | beardicus (beardicus) joins |
| 15:40:44 | | Radzig quits [Remote host closed the connection] |
| 15:43:53 | | Radzig joins |
| 16:03:56 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
| 16:26:18 | | qinplus_phone joins |
| 16:31:22 | <kpcyrd> | TheTechRobo: https://gitlab.archlinux.org/archlinux/arch-historical-archive/-/blob/master/upload_pkg_internetarchive.py?ref_type=heads |
| 16:31:42 | <kpcyrd> | essentially this, but for more distros |
| 17:00:00 | <@imer> | the s3-like upload api isnt too complicated to operate with just http requests: https://archive.org/developers/ias3.html |
| 17:00:00 | <@imer> | e.g. curl command: https://gitea.arpa.li/ArchiveTeam/archiveteam-megawarc-factory/src/commit/e183dd0578cd4db55b805d7cdd4abe564c1ea82d/upload-one#L75-L89 |
| 17:00:42 | | Blueacid joins |
| 17:01:11 | <szczot3k> | Isn't it s3-compatible? So any lib for s3 should work? |
| 17:01:46 | | ymgve quits [Quit: Leaving] |
| 17:03:09 | <@imer> | no idea. |
| 17:04:07 | <szczot3k> | "How this is different from normal S3" section looks like it would be possible, just without some features |
| 17:19:21 | | Naruyoko joins |
| 17:25:00 | | ymgve joins |
| 17:28:54 | | sec^nd quits [Remote host closed the connection] |
| 17:28:54 | | BornOn420 quits [Remote host closed the connection] |
| 17:29:19 | | sec^nd (second) joins |
| 17:29:29 | | BornOn420 (BornOn420) joins |
| 17:42:03 | | beardicus quits [Ping timeout: 260 seconds] |
| 17:46:53 | | beardicus (beardicus) joins |
| 17:51:28 | | beardicus quits [Ping timeout: 250 seconds] |
| 18:02:58 | | beardicus (beardicus) joins |
| 18:36:04 | | qinplus_phone quits [Client Quit] |
| 19:04:16 | | beardicus quits [Ping timeout: 250 seconds] |
| 19:20:33 | | beardicus (beardicus) joins |
| 19:40:28 | <nicolas17> | szczot3k: it can be a bit fragile https://status.digitalocean.com/incidents/zbrpd3j7hrrd |
| 19:41:12 | <nicolas17> | the latest official AWS SDK suddenly stopped working with DigitalOcean's supposedly "S3-compatible" storage, who knows if it would work with archive.org |
| 20:06:57 | | etnguyen03 (etnguyen03) joins |
| 20:11:57 | | Island joins |
| 20:24:13 | | beardicus quits [Ping timeout: 260 seconds] |
| 20:36:00 | | tertu quits [Quit: so long...] |
| 20:37:42 | | beardicus (beardicus) joins |
| 21:00:39 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 21:02:40 | | ThetaDev joins |
| 21:25:01 | | etnguyen03 quits [Client Quit] |
| 21:47:38 | | Doranwen quits [Ping timeout: 250 seconds] |
| 21:53:17 | | BlueMaxima joins |
| 21:55:14 | | Doranwen (Doranwen) joins |
| 22:00:56 | | tertu (tertu) joins |
| 22:15:51 | | etnguyen03 (etnguyen03) joins |
| 22:41:48 | | beardicus quits [Ping timeout: 250 seconds] |
| 22:45:29 | | th3z0l4 joins |
| 22:58:30 | | holbrooke joins |
| 22:58:32 | | holbrooke quits [Client Quit] |
| 22:58:49 | | holbrooke joins |
| 23:01:40 | | etnguyen03 quits [Client Quit] |
| 23:06:59 | | beardicus (beardicus) joins |
| 23:36:31 | <h2ibot> | PaulWise edited Blogger (+297, document custom blogger domains archiving): https://wiki.archiveteam.org/?diff=54257&oldid=51279 |