00:01:03 | | loug8318142 quits [Quit: The Lounge - https://thelounge.chat] |
00:07:26 | | etnguyen03 (etnguyen03) joins |
00:26:43 | | etnguyen03 quits [Client Quit] |
00:27:48 | | beardicus quits [Ping timeout: 260 seconds] |
00:43:10 | <pabs> | do we have a way of archiving M3U files that also downloads the files they link to? AFAICT ArchiveBot does not do that |
00:44:28 | <pabs> | https://pig.observer/ links to some live video cameras with M3U files, and I'm a bit worried that if I do the link extraction here, by the time AB gets to the linked chunks, they might have been deleted |
00:50:44 | <pabs> | hmm, it's too JavaScripty |
00:58:13 | | beardicus (beardicus) joins |
01:10:59 | | etnguyen03 (etnguyen03) joins |
01:12:07 | <@OrIdow6> | pabs: Uh, you could do it with wget-at though you might need to do some "clever" stuff to make it work in time, maybe qwarc would work? |
01:12:11 | <@OrIdow6> | Testing how long they last |
01:12:47 | <@OrIdow6> | I will say traffic cameras feel weird archiving-wise |
01:12:53 | <@OrIdow6> | Since they're going continuously |
01:13:01 | <@OrIdow6> | And ephemeral by nature |
01:13:06 | <pabs> | they don't use <video> so it won't work in the WBM anyway, too much JS |
01:13:21 | <pabs> | yeah, just wanted a couple snapshots to preserve the feel of the project |
01:14:10 | <@OrIdow6> | Hm, one ts expired after 4 minutes |
01:19:51 | <@JAA> | Yeah, wouldn't be hard with qwarc. |
01:20:26 | <@JAA> | They probably only keep a few segments on the server. That's how most live stream things work. |
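
The link-extraction step discussed above could be sketched roughly as follows. This is only an illustration of pulling segment URLs out of a media playlist and tracking which ones are new between polls; it does not fetch anything or write WARCs, and it is not how ArchiveBot, wget-at, or qwarc work internally. The function names `segment_urls` and `new_segments` and the example URL are hypothetical. It also assumes a plain media playlist; a master playlist would list further playlists rather than segments.

```python
from urllib.parse import urljoin


def segment_urls(playlist_url: str, playlist_text: str) -> list[str]:
    """Return absolute URLs for every URI line in an M3U/M3U8 playlist."""
    urls = []
    for line in playlist_text.splitlines():
        line = line.strip()
        # Blank lines and lines starting with '#' are tags or comments, not URIs.
        if not line or line.startswith("#"):
            continue
        # Segment URIs are often relative to the playlist location.
        urls.append(urljoin(playlist_url, line))
    return urls


def new_segments(seen: set[str], urls: list[str]) -> list[str]:
    """Return the URLs not seen in earlier polls and remember them."""
    fresh = [u for u in urls if u not in seen]
    seen.update(fresh)
    return fresh


if __name__ == "__main__":
    # Hypothetical playlist content, for illustration only.
    sample = "#EXTM3U\n#EXTINF:4.0,\nseg1.ts\n#EXTINF:4.0,\nseg2.ts\n"
    seen: set[str] = set()
    for url in new_segments(seen, segment_urls("https://example.org/cam/live.m3u8", sample)):
        print(url)
```

Since segments may expire within minutes, a real grab would presumably poll the playlist at least as often as the segment duration and fetch each new segment immediately.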
01:30:10 | | Webuser363958 joins |
01:30:48 | | Webuser3639580 joins |
01:32:59 | | Webuser363958 quits [Client Quit] |
01:32:59 | | Webuser3639580 quits [Client Quit] |
01:35:58 | <h2ibot> | PaulWise edited Mailing Lists (+94, add Simplelists): https://wiki.archiveteam.org/?diff=54251&oldid=54182 |
01:37:58 | <h2ibot> | PaulWise edited Mailing Lists (+5, better link for scheme lists): https://wiki.archiveteam.org/?diff=54252&oldid=54251 |
02:27:48 | <nicolas17> | sometimes ts files are deleted from the origin server after like a minute, but the CDN keeps them cached for much longer |
02:37:46 | <@OrIdow6> | nicolas17: I didn't do anything funny with the URL when I tried again, just got it with curl, so can't be that long |
02:50:59 | | PredatorIWD2 joins |
03:01:57 | <eggdrop> | [remind] OrIdow6: add thing to dw |
03:05:53 | <@OrIdow6> | !remindme 2h add thing to dw |
03:05:53 | <eggdrop> | [remind] ok, i'll remind you at 2025-01-18T05:05:53Z |
03:14:38 | | pabs quits [Ping timeout: 260 seconds] |
03:15:23 | <h2ibot> | Cooljeanius edited TikTok (+211, /* Vital Signs */ mention RedNote): https://wiki.archiveteam.org/?diff=54253&oldid=54248 |
04:26:21 | | dontwashyourhands (dontwashyourhands) joins |
04:26:30 | | SootBector quits [Remote host closed the connection] |
04:26:38 | <dontwashyourhands> | Hello! I have a request for ArchiveBot if there is capacity to spare: https://oncinematimeline.com/ |
04:26:48 | | SootBector (SootBector) joins |
04:27:55 | <dontwashyourhands> | I am not worried about the site itself going down. Rather, it's the painstakingly collected tweets that accompany each episode. A large number are gone already because people have deleted their Twitter accounts. |
04:29:36 | | dontwashyourhands quits [Client Quit] |
04:31:42 | | dontwashyourhands (dontwashyourhands) joins |
04:40:12 | | etnguyen03 quits [Remote host closed the connection] |
04:59:16 | | beardicus quits [Ping timeout: 250 seconds] |
04:59:52 | | DogsRNice quits [Read error: Connection reset by peer] |
05:05:53 | <eggdrop> | [remind] OrIdow6: add thing to dw |
05:18:44 | | ymgve_ quits [Read error: Connection reset by peer] |
05:25:22 | <@OrIdow6> | !remindme 2h add thing to dw |
05:25:22 | <eggdrop> | [remind] ok, i'll remind you at 2025-01-18T07:25:22Z |
05:27:55 | <dontwashyourhands> | thank you :) |
05:28:00 | | dontwashyourhands quits [Client Quit] |
05:58:16 | <pokechu22> | Interesting, some substacks don't have sitemaps: https://integ.substack.com/sitemap.xml - not sure why that'd be the case |
06:16:18 | | pabs (pabs) joins |
06:21:18 | | wickedplayer494 quits [Ping timeout: 260 seconds] |
06:21:39 | | wickedplayer494 joins |
06:22:02 | | wickedplayer494 is now authenticated as wickedplayer494 |
06:25:56 | | PredatorIWD2 quits [Ping timeout: 250 seconds] |
06:26:20 | | notarobot1 quits [Read error: Connection reset by peer] |
06:26:20 | | pabs quits [Read error: Connection reset by peer] |
06:26:50 | | notarobot1 joins |
06:27:33 | | PredatorIWD2 joins |
06:31:25 | | SootBector quits [Remote host closed the connection] |
06:31:43 | | SootBector (SootBector) joins |
06:56:52 | | pabs (pabs) joins |
07:01:14 | <h2ibot> | Cooljeanius edited TikTok (+84, /* Archiving tools */ switch links to deleted…): https://wiki.archiveteam.org/?diff=54254&oldid=54253 |
07:06:15 | <h2ibot> | Cooljeanius edited TikTok (+62, /* Archiving tools */ cleanup): https://wiki.archiveteam.org/?diff=54255&oldid=54254 |
07:25:22 | <eggdrop> | [remind] OrIdow6: add thing to dw |
07:35:49 | | tek_dmn quits [Quit: ZNC - https://znc.in] |
07:36:28 | | ymgve joins |
09:49:09 | | Island quits [Read error: Connection reset by peer] |
10:17:54 | | midou joins |
10:33:43 | | loug8318142 joins |
10:51:39 | | tek_dmn (tek_dmn) joins |
11:31:13 | | pixel leaves [Disconnected: Replaced by new connection] |
11:31:19 | | pixel (pixel) joins |
11:45:20 | | MrMcNuggets (MrMcNuggets) joins |
11:47:38 | | szczot3k|m quits [Client Quit] |
11:47:42 | | szczot3k|m joins |
12:00:06 | | Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat] |
12:01:34 | | benjins3 quits [Remote host closed the connection] |
12:01:52 | | benjins3 joins |
12:02:51 | | Bleo18260072271962345 joins |
12:26:31 | <kpcyrd> | is there something like https://pypi.org/project/internetarchive/ but for Rust? |
12:26:53 | | Matthww quits [Remote host closed the connection] |
12:46:10 | | APOLLO03 joins |
12:46:13 | | APOLLO03 quits [Remote host closed the connection] |
12:46:35 | | APOLLO03 joins |
12:46:42 | | APOLLO03 quits [Client Quit] |
12:56:24 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
12:57:38 | | SkilledAlpaca418962 joins |
13:06:34 | | beardicus (beardicus) joins |
13:28:52 | | chrismrtn quits [Quit: leaving] |
13:33:24 | | chrismrtn (chrismrtn) joins |
13:33:55 | | BouncerServ is now known as ` |
13:50:15 | | PredatorIWD2 quits [Read error: Connection reset by peer] |
13:53:24 | | PredatorIWD2 joins |
14:07:58 | | beardicus quits [Ping timeout: 260 seconds] |
14:33:07 | <TheTechRobo> | kpcyrd: What are you trying to do? |
14:58:23 | | kansei (kansei) joins |
15:26:17 | | beardicus (beardicus) joins |
15:40:44 | | Radzig quits [Remote host closed the connection] |
15:43:53 | | Radzig joins |
16:03:56 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
16:26:18 | | qinplus_phone joins |
16:31:22 | <kpcyrd> | TheTechRobo: https://gitlab.archlinux.org/archlinux/arch-historical-archive/-/blob/master/upload_pkg_internetarchive.py?ref_type=heads |
16:31:42 | <kpcyrd> | essentially this, but for more distros |
17:00:00 | <@imer> | the s3-like upload api isn't too complicated to operate with just http requests: https://archive.org/developers/ias3.html |
17:00:00 | <@imer> | e.g. curl command: https://gitea.arpa.li/ArchiveTeam/archiveteam-megawarc-factory/src/commit/e183dd0578cd4db55b805d7cdd4abe564c1ea82d/upload-one#L75-L89 |
17:00:42 | | Blueacid joins |
17:01:11 | <szczot3k> | Isn't it s3-compatible? So any lib for s3 should work? |
17:01:46 | | ymgve quits [Quit: Leaving] |
17:03:09 | <@imer> | no idea. |
17:04:07 | <szczot3k> | "How this is different from normal S3" section looks like it would be possible, just without some features |
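
A minimal sketch of the plain-HTTP upload mentioned above, based on my reading of the linked ias3 documentation: a PUT to `https://s3.us.archive.org/<identifier>/<filename>` with a `LOW accesskey:secret` authorization header, `x-amz-auto-make-bucket` to create the item, and `x-archive-meta-*` headers for metadata. The helper name `build_upload_request` and all example values are hypothetical. The example only builds the request and does not send it.

```python
import urllib.request
from urllib.parse import quote

# Endpoint as given in the ias3 documentation.
IAS3_ENDPOINT = "https://s3.us.archive.org"


def build_upload_request(identifier, filename, data, access_key, secret_key, metadata=None):
    """Build (but do not send) a PUT request that uploads one file to an item."""
    url = f"{IAS3_ENDPOINT}/{quote(identifier)}/{quote(filename)}"
    headers = {
        # IA's S3-like API uses a simple "LOW key:secret" scheme, not AWS signatures.
        "Authorization": f"LOW {access_key}:{secret_key}",
        # Create the item (bucket) if it does not exist yet.
        "x-amz-auto-make-bucket": "1",
    }
    # Item metadata is passed as x-archive-meta-<field> headers.
    for key, value in (metadata or {}).items():
        headers[f"x-archive-meta-{key}"] = value
    return urllib.request.Request(url, data=data, headers=headers, method="PUT")


if __name__ == "__main__":
    # Placeholder identifier, file name, and credentials, for illustration only.
    req = build_upload_request(
        "example-item", "pkg.tar.zst", b"example bytes",
        "ACCESSKEY", "SECRET", {"mediatype": "software"},
    )
    print(req.get_method(), req.full_url)
    # To actually upload, pass req to urllib.request.urlopen(req).
```

The same request shape should translate to any HTTP client, including Rust ones, which sidesteps the question of whether a given S3 SDK's signing and checksum behaviour is accepted by archive.org.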
17:19:21 | | Naruyoko joins |
17:25:00 | | ymgve joins |
17:28:54 | | sec^nd quits [Remote host closed the connection] |
17:28:54 | | BornOn420 quits [Remote host closed the connection] |
17:29:19 | | sec^nd (second) joins |
17:29:29 | | BornOn420 (BornOn420) joins |
17:42:03 | | beardicus quits [Ping timeout: 260 seconds] |
17:46:53 | | beardicus (beardicus) joins |
17:51:28 | | beardicus quits [Ping timeout: 250 seconds] |
18:02:58 | | beardicus (beardicus) joins |
18:36:04 | | qinplus_phone quits [Client Quit] |
19:04:16 | | beardicus quits [Ping timeout: 250 seconds] |
19:20:33 | | beardicus (beardicus) joins |
19:40:28 | <nicolas17> | szczot3k: it can be a bit fragile https://status.digitalocean.com/incidents/zbrpd3j7hrrd |
19:41:12 | <nicolas17> | the latest official AWS SDK suddenly stopped working with DigitalOcean's supposedly "S3-compatible" storage, who knows if it would work with archive.org |
20:06:57 | | etnguyen03 (etnguyen03) joins |
20:11:57 | | Island joins |
20:24:13 | | beardicus quits [Ping timeout: 260 seconds] |
20:36:00 | | tertu quits [Quit: so long...] |
20:37:42 | | beardicus (beardicus) joins |
21:00:39 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
21:02:40 | | ThetaDev joins |
21:25:01 | | etnguyen03 quits [Client Quit] |
21:47:38 | | Doranwen quits [Ping timeout: 250 seconds] |
21:53:17 | | BlueMaxima joins |
21:55:14 | | Doranwen (Doranwen) joins |
22:00:56 | | tertu (tertu) joins |
22:15:51 | | etnguyen03 (etnguyen03) joins |
22:41:48 | | beardicus quits [Ping timeout: 250 seconds] |
22:45:29 | | th3z0l4 joins |
22:58:30 | | holbrooke joins |
22:58:32 | | holbrooke quits [Client Quit] |
22:58:49 | | holbrooke joins |
23:01:40 | | etnguyen03 quits [Client Quit] |
23:06:59 | | beardicus (beardicus) joins |
23:36:31 | <h2ibot> | PaulWise edited Blogger (+297, document custom blogger domains archiving): https://wiki.archiveteam.org/?diff=54257&oldid=51279 |