00:01:13 | | Arcorann_ joins |
00:05:36 | | adamus1red quits [Client Quit] |
00:08:12 | | adamus1red (adamus1red) joins |
00:25:12 | | Guest joins |
00:28:12 | | imer quits [Client Quit] |
00:29:16 | | Notrealname1234 (Notrealname1234) joins |
00:30:08 | <Notrealname1234> | JAA: you active right now? I want to ask a question. |
00:32:36 | | imer (imer) joins |
00:33:38 | <thuban> | Notrealname1234: https://dontasktoask.com/ |
00:34:32 | <Notrealname1234> | thuban: only knew about https://nohello.net/ |
00:34:50 | <thuban> | same principle |
00:35:10 | <Notrealname1234> | JAA: literally, is ISIS websites allowed to be scraped by ArchiveBot? |
00:38:36 | <@JAA> | Notrealname1234: I replied to this earlier in #archivebot, and I don't want to repeat it in a publicly logged channel. |
00:38:56 | <Notrealname1234> | Dang, i disconnected too early |
00:39:45 | <Notrealname1234> | "Pokechu22" already quoted it to me |
00:40:09 | | Wohlstand quits [Client Quit] |
00:44:30 | <@JAA> | Xe: Your HLS playlists are perfectly fine. The 'only' problem here is that ArchiveBot (which is really the primary tool we use for small to medium sites) currently doesn't even attempt to support HLS in any way. So there's not much you could do, unless you had a single video file on the server side and used Range requests in the playlist and also referenced the video file directly elsewhere, e.g. in |
00:44:36 | <@JAA> | the <video> tag. But yeah, this is purely a lack of support on our side. |
00:44:52 | <@JAA> | My tooling simply collects the segment URLs, and then I queue those directly for archival. |
00:45:13 | | etnguyen03 quits [Client Quit] |
00:48:42 | | Notrealname1234 quits [Client Quit] |
01:12:32 | | nulldata quits [Client Quit] |
01:13:00 | | nulldata (nulldata) joins |
01:20:24 | | nulldata quits [Client Quit] |
01:20:51 | | nulldata (nulldata) joins |
01:32:14 | <Xe> | JAA: would it help if i somehow mechanically recreated the origin video as part of my upload process? |
01:32:21 | <Xe> | like muxing it from HLS to a mkv |
01:32:32 | <Xe> | i'm more than happy to change how I do uploads |
01:35:05 | <pokechu22> | I don't think it's worth worrying about changing how your site is structured for archivebot if archivebot's only going to run into it a few times a year at most |
01:36:19 | <@JAA> | Agreed, though a simple link to download the whole video as a single file would likely also be useful to some people. |
01:38:02 | <@JAA> | Looks like we hit the site at least once every other month or so since 2022. |
01:38:47 | <pokechu22> | True, but probably we wouldn't be downloading videos in those cases, right? |
01:39:00 | <@JAA> | Yeah, probably not. |
01:39:11 | <@JAA> | Unless they were directly in a <video> src. |
01:40:05 | <@JAA> | I suppose WebM would be preferable for that. |
01:40:30 | <Xe> | pokechu22: people have asked for it before |
01:40:42 | <Xe> | isn't webm a mkv file with extra spice? |
01:41:10 | <@JAA> | Yeah, they're closely related. Regular mkv files don't work across browsers though, I believe. |
01:41:45 | <Xe> | would mp4 video and aac audio in a webm container be overly fucked from a compat standpoint? |
01:42:37 | <@JAA> | I don't believe WebM supports either of those codecs. |
01:42:48 | <@JAA> | VP8, VP9, Vorbis, etc. |
01:43:13 | <@JAA> | Ah, AV1 and Opus are the remaining ones. |
01:43:38 | <@JAA> | Royalty-free codecs and all that. |
01:44:09 | <Xe> | ah, i chose mp4 and aac for my video stuff because it's cross platform universal |
01:44:36 | <@JAA> | Right |
01:44:58 | <Xe> | (and iOS likes it) |
01:45:09 | <Xe> | most of my mobile viewers are iOS |
01:45:11 | <@JAA> | Even iOS can play WebM these days, I think. |
01:45:42 | <@JAA> | Yeah, added in iOS 15 in 2021. |
01:46:05 | <Xe> | i'll throw a script together that scans my S3 bucket for index.m3u8 files, fabricates the right URL, muxes it into an MKV container, and then uploads that next to the target as `foldername.mkv` |
01:48:54 | <@JAA> | :-) |
01:49:36 | | etnguyen03 (etnguyen03) joins |
02:00:55 | | DogsRNice quits [Read error: Connection reset by peer] |
02:19:41 | | xamgu joins |
02:29:11 | | adamus1red_ (adamus1red) joins |
02:30:31 | | adamus1red quits [Ping timeout: 255 seconds] |
02:30:31 | | adamus1red_ is now known as adamus1red |
02:44:26 | | TheTechRobo quits [Remote host closed the connection] |
02:44:26 | | ScenarioPlanet quits [Remote host closed the connection] |
02:44:26 | | Pedrosso quits [Remote host closed the connection] |
02:44:52 | | Pedrosso joins |
02:44:56 | | ScenarioPlanet (ScenarioPlanet) joins |
02:45:11 | | TheTechRobo (TheTechRobo) joins |
03:26:52 | | xamgu quits [Client Quit] |
04:16:34 | | BlueMaxima joins |
04:34:14 | | ArcticCircleSys joins |
04:46:00 | | ArcticCircleSys quits [Ping timeout: 265 seconds] |
05:06:46 | | etnguyen03 quits [Client Quit] |
05:11:15 | | etnguyen03 (etnguyen03) joins |
05:13:54 | | etnguyen03 quits [Remote host closed the connection] |
05:20:43 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:45:49 | | _Dango360 quits [Ping timeout: 255 seconds] |
05:46:33 | | tapos joins |
05:54:26 | | Dango360 (Dango360) joins |
07:05:02 | | Unholy236192 quits [Remote host closed the connection] |
07:06:06 | | Unholy236192 (Unholy2361) joins |
08:42:02 | <h2ibot> | Flashfire42 edited URLTeam/Warrior (+75, /* Warrior projects */): https://wiki.archiveteam.org/?diff=52109&oldid=51651 |
08:42:40 | | parfait_ quits [Ping timeout: 255 seconds] |
09:00:02 | | Bleo1826007 quits [Client Quit] |
09:01:21 | | Bleo1826007 joins |
09:15:50 | | Island quits [Read error: Connection reset by peer] |
09:25:26 | | f_ (funderscore) joins |
09:37:36 | | pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat] |
09:38:59 | | f_ quits [Ping timeout: 250 seconds] |
09:41:22 | | f_ (funderscore) joins |
09:42:53 | | SootBector quits [Ping timeout: 250 seconds] |
09:43:00 | | f_ quits [Remote host closed the connection] |
09:43:33 | | f_ (funderscore) joins |
09:45:02 | | SootBector (SootBector) joins |
10:20:52 | <@arkiver> | pokechu22: thanks for covering womenwhocode.com and .dev, i was about to put it in |
11:21:25 | | f_ quits [Remote host closed the connection] |
11:21:59 | | f_ (funderscore) joins |
11:43:18 | | decky_e joins |
11:45:49 | | decky quits [Ping timeout: 255 seconds] |
11:58:40 | | albertlarsan68 quits [Client Quit] |
12:09:17 | | albertlarsan68 (AlbertLarsan68) joins |
12:17:57 | | etnguyen03 (etnguyen03) joins |
12:18:13 | | @Sanqui_ quits [Ping timeout: 255 seconds] |
12:28:13 | <michaelblob_> | why is pulling from atdr.meo.ws so slow? always takes more than three minutes to pull a 100MB chunk |
12:37:05 | | etnguyen03 quits [Client Quit] |
12:37:46 | | etnguyen03 (etnguyen03) joins |
12:38:25 | <wickerz> | Could someone AB/crawl https://samvirke.dk/ (non-urgent currently)? Their owners was recently sold and the new owner will try to fix their financial situation. Might be good proatively to archive these articles etc |
12:47:33 | | etnguyen03 quits [Client Quit] |
13:01:52 | | BearFortress_ quits [Ping timeout: 255 seconds] |
13:11:03 | | Sanqui joins |
13:13:36 | | Sanqui is now authenticated as Sanqui |
13:13:36 | | Sanqui quits [Changing host] |
13:13:36 | | Sanqui (Sanqui) joins |
13:13:36 | | @ChanServ sets mode: +o Sanqui |
13:14:54 | | etnguyen03 (etnguyen03) joins |
13:26:48 | | BearFortress joins |
13:41:51 | | pixel leaves |
13:41:51 | | pixel (pixel) joins |
13:54:35 | | Arcorann_ quits [Ping timeout: 265 seconds] |
14:13:52 | | tapos quits [Client Quit] |
14:38:22 | <c3manu> | wickerz: sure, i'm gonna run it in #archivebot |
14:40:23 | <katia> | could https://wiki.znc.in/ get AB'd? low coverage, TLS cert expired on ipv6 |
14:47:11 | <wickerz> | Ty c3manu |
14:57:37 | | Guest quits [Client Quit] |
14:58:02 | | AlsoHP_Archivist quits [Client Quit] |
14:58:23 | | HP_Archivist (HP_Archivist) joins |
15:11:31 | | Larsenv quits [Excess Flood] |
15:11:52 | | Larsenv (Larsenv) joins |
15:31:15 | | etnguyen03 quits [Client Quit] |
15:31:56 | | etnguyen03 (etnguyen03) joins |
16:31:11 | | etnguyen03 quits [Client Quit] |
16:31:52 | | etnguyen03 (etnguyen03) joins |
16:52:17 | <pokechu22> | done via #wikibot: https://archive.org/details/wiki-wiki.znc.in-20240428 |
16:54:47 | | Guest joins |
16:54:49 | <katia> | pokechu22, thanks |
17:46:35 | | thalia joins |
17:47:36 | | thalia is now authenticated as thalia |
18:07:56 | | etnguyen03 quits [Client Quit] |
18:11:55 | | leo60228- quits [Ping timeout: 255 seconds] |
18:12:12 | | leo60228 (leo60228) joins |
18:42:12 | | wyatt8740 quits [Remote host closed the connection] |
18:45:16 | | wyatt8740 joins |
19:04:58 | <h2ibot> | Wickedplayer494 edited Bazaar.tf (+2853, Bazaar is unfortunately dead for real this time): https://wiki.archiveteam.org/?diff=52110&oldid=31538 |
19:15:43 | | pedantic-darwin joins |
19:17:03 | | f_ quits [Ping timeout: 250 seconds] |
19:28:43 | | tapos joins |
19:39:41 | | f_ (funderscore) joins |
19:44:36 | | f_ quits [Remote host closed the connection] |
19:53:25 | | etnguyen03 (etnguyen03) joins |
19:57:12 | | kiwiirc joins |
19:57:27 | <kiwiirc> | Probably worth revisiting this: https://wiki.archiveteam.org/index.php/Tapatalk |
19:58:20 | <thuban> | kiwiirc: any particular news? |
19:59:53 | | kiwiirc quits [Client Quit] |
20:02:11 | | kiwiirc joins |
20:02:53 | <kiwiirc> | thuban the news is already in the article. It's from late last year. Tapatalk hosts many old forums from 15+ years that will be lost otherwise |
20:02:57 | | kiwiirc quits [Client Quit] |
20:14:36 | | etnguyen03 quits [Client Quit] |
20:27:32 | <that_lurker> | Could someone throw https://met.refeds.org/ to AB. Could use a complete grab. Mainly for the offsite links that it contains |
20:30:26 | <that_lurker> | Though with a little work adding all the exports to #// could also be a good idea maybe |
20:39:13 | | etnguyen03 (etnguyen03) joins |
20:43:34 | | Island joins |
21:00:02 | <thuban> | that_lurker: running |
21:00:11 | <that_lurker> | thanks <3 |
21:23:59 | <@JAA> | michaelblob_: Sounds like you might have bad routing to Hetzner (Germany, FSN1)? I haven't seen anything anywhere near that slow, even when pulling from the other end of the world (literally, NZ). |
21:41:06 | | ArcticCircleSys joins |
21:44:41 | <nulldata> | arkiver - just a reminder about Post.News. You asked for channel suggestions but I don't think a decision was made. Will there be a project? The shutdown post said within the next few weeks and this coming week would be the second week. The same post does say for users to export their data before May 31st so maybe there's still time? |
21:51:38 | | wyatt8740 quits [Ping timeout: 265 seconds] |
21:52:00 | | wyatt8740 joins |
21:55:59 | | ArcticCircleSys quits [Ping timeout: 265 seconds] |
21:57:29 | | BlueMaxima joins |
22:03:40 | <michaelblob_> | JAA: hm strange, i'm on the US east coast so i was expecting better speeds |
22:15:46 | <that_lurker> | Could someone also throw https://people.nwtime.org/ to AB. Commoncrawl got a lot of the links, but the files are not grabbed. |
22:25:37 | | etnguyen03 quits [Client Quit] |
22:29:36 | | etnguyen03 (etnguyen03) joins |
22:46:25 | | systwi quits [Ping timeout: 255 seconds] |
23:03:45 | | systwi (systwi) joins |
23:34:57 | | Wohlstand (Wohlstand) joins |