00:01:13Arcorann_ joins
00:05:36adamus1red quits [Client Quit]
00:08:12adamus1red (adamus1red) joins
00:25:12Guest joins
00:28:12imer quits [Client Quit]
00:29:16Notrealname1234 (Notrealname1234) joins
00:30:08<Notrealname1234>JAA: you active right now? I want to ask a question.
00:32:36imer (imer) joins
00:33:38<thuban>Notrealname1234: https://dontasktoask.com/
00:34:32<Notrealname1234>thuban: only knew about https://nohello.net/
00:34:50<thuban>same principle
00:35:10<Notrealname1234>JAA: literally, is ISIS websites allowed to be scraped by ArchiveBot?
00:38:36<@JAA>Notrealname1234: I replied to this earlier in #archivebot, and I don't want to repeat it in a publicly logged channel.
00:38:56<Notrealname1234>Dang, i disconnected too early
00:39:45<Notrealname1234>"Pokechu22" already quoted it to me
00:40:09Wohlstand quits [Client Quit]
00:44:30<@JAA>Xe: Your HLS playlists are perfectly fine. The 'only' problem here is that ArchiveBot (which is really the primary tool we use for small to medium sites) currently doesn't even attempt to support HLS in any way. So there's not much you could do, unless you had a single video file on the server side and used Range requests in the playlist and also referenced the video file directly elsewhere, e.g. in
00:44:36<@JAA>the <video> tag. But yeah, this is purely a lack of support on our side.
00:44:52<@JAA>My tooling simply collects the segment URLs, and then I queue those directly for archival.
00:45:13etnguyen03 quits [Client Quit]
00:48:42Notrealname1234 quits [Client Quit]
01:12:32nulldata quits [Client Quit]
01:13:00nulldata (nulldata) joins
01:20:24nulldata quits [Client Quit]
01:20:51nulldata (nulldata) joins
01:32:14<Xe>JAA: would it help if i somehow mechanically recreated the origin video as part of my upload process?
01:32:21<Xe>like muxing it from HLS to a mkv
01:32:32<Xe>i'm more than happy to change how I do uploads
01:35:05<pokechu22>I don't think it's worth worrying about changing how your site is structured for archivebot if archivebot's only going to run into it a few times a year at most
01:36:19<@JAA>Agreed, though a simple link to download the whole video as a single file would likely also be useful to some people.
01:38:02<@JAA>Looks like we hit the site at least once every other month or so since 2022.
01:38:47<pokechu22>True, but probably we wouldn't be downloading videos in those cases, right?
01:39:00<@JAA>Yeah, probably not.
01:39:11<@JAA>Unless they were directly in a <video> src.
01:40:05<@JAA>I suppose WebM would be preferable for that.
01:40:30<Xe>pokechu22: people have asked for it before
01:40:42<Xe>isn't webm a mkv file with extra spice?
01:41:10<@JAA>Yeah, they're closely related. Regular mkv files don't work across browsers though, I believe.
01:41:45<Xe>would mp4 video and aac audio in a webm container be overly fucked from a compat standpoint?
01:42:37<@JAA>I don't believe WebM supports either of those codecs.
01:42:48<@JAA>VP8, VP9, Vorbis, etc.
01:43:13<@JAA>Ah, AV1 and Opus are the remaining ones.
01:43:38<@JAA>Royalty-free codecs and all that.
01:44:09<Xe>ah, i chose mp4 and aac for my video stuff because it's cross platform universal
01:44:36<@JAA>Right
01:44:58<Xe>(and iOS likes it)
01:45:09<Xe>most of my mobile viewers are iOS
01:45:11<@JAA>Even iOS can play WebM these days, I think.
01:45:42<@JAA>Yeah, added in iOS 15 in 2021.
01:46:05<Xe>i'll throw a script together that scans my S3 bucket for index.m3u8 files, fabricates the right URL, muxes it into an MKV container, and then uploads that next to the target as `foldername.mkv`
01:48:54<@JAA>:-)
01:49:36etnguyen03 (etnguyen03) joins
02:00:55DogsRNice quits [Read error: Connection reset by peer]
02:19:41xamgu joins
02:29:11adamus1red_ (adamus1red) joins
02:30:31adamus1red quits [Ping timeout: 255 seconds]
02:30:31adamus1red_ is now known as adamus1red
02:44:26TheTechRobo quits [Remote host closed the connection]
02:44:26ScenarioPlanet quits [Remote host closed the connection]
02:44:26Pedrosso quits [Remote host closed the connection]
02:44:52Pedrosso joins
02:44:56ScenarioPlanet (ScenarioPlanet) joins
02:45:11TheTechRobo (TheTechRobo) joins
03:26:52xamgu quits [Client Quit]
04:16:34BlueMaxima joins
04:34:14ArcticCircleSys joins
04:46:00ArcticCircleSys quits [Ping timeout: 265 seconds]
05:06:46etnguyen03 quits [Client Quit]
05:11:15etnguyen03 (etnguyen03) joins
05:13:54etnguyen03 quits [Remote host closed the connection]
05:20:43BlueMaxima quits [Read error: Connection reset by peer]
05:45:49_Dango360 quits [Ping timeout: 255 seconds]
05:46:33tapos joins
05:54:26Dango360 (Dango360) joins
07:05:02Unholy236192 quits [Remote host closed the connection]
07:06:06Unholy236192 (Unholy2361) joins
08:42:02<h2ibot>Flashfire42 edited URLTeam/Warrior (+75, /* Warrior projects */): https://wiki.archiveteam.org/?diff=52109&oldid=51651
08:42:40parfait_ quits [Ping timeout: 255 seconds]
09:00:02Bleo1826007 quits [Client Quit]
09:01:21Bleo1826007 joins
09:15:50Island quits [Read error: Connection reset by peer]
09:25:26f_ (funderscore) joins
09:37:36pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat]
09:38:59f_ quits [Ping timeout: 250 seconds]
09:41:22f_ (funderscore) joins
09:42:53SootBector quits [Ping timeout: 250 seconds]
09:43:00f_ quits [Remote host closed the connection]
09:43:33f_ (funderscore) joins
09:45:02SootBector (SootBector) joins
10:20:52<@arkiver>pokechu22: thanks for covering womenwhocode.com and .dev, i was about to put it in
11:21:25f_ quits [Remote host closed the connection]
11:21:59f_ (funderscore) joins
11:43:18decky_e joins
11:45:49decky quits [Ping timeout: 255 seconds]
11:58:40albertlarsan68 quits [Client Quit]
12:09:17albertlarsan68 (AlbertLarsan68) joins
12:17:57etnguyen03 (etnguyen03) joins
12:18:13@Sanqui_ quits [Ping timeout: 255 seconds]
12:28:13<michaelblob_>why is pulling from atdr.meo.ws so slow? always takes more than three minutes to pull a 100MB chunk
12:37:05etnguyen03 quits [Client Quit]
12:37:46etnguyen03 (etnguyen03) joins
12:38:25<wickerz>Could someone AB/crawl https://samvirke.dk/ (non-urgent currently)? Their owners was recently sold and the new owner will try to fix their financial situation. Might be good proatively to archive these articles etc
12:47:33etnguyen03 quits [Client Quit]
13:01:52BearFortress_ quits [Ping timeout: 255 seconds]
13:11:03Sanqui joins
13:13:36Sanqui quits [Changing host]
13:13:36Sanqui (Sanqui) joins
13:13:36@ChanServ sets mode: +o Sanqui
13:14:54etnguyen03 (etnguyen03) joins
13:26:48BearFortress joins
13:41:51pixel leaves
13:41:51pixel (pixel) joins
13:54:35Arcorann_ quits [Ping timeout: 265 seconds]
14:13:52tapos quits [Client Quit]
14:38:22<c3manu>wickerz: sure, i'm gonna run it in #archivebot
14:40:23<katia>could https://wiki.znc.in/ get AB'd? low coverage, TLS cert expired on ipv6
14:47:11<wickerz>Ty c3manu
14:57:37Guest quits [Client Quit]
14:58:02AlsoHP_Archivist quits [Client Quit]
14:58:23HP_Archivist (HP_Archivist) joins
15:11:31Larsenv quits [Excess Flood]
15:11:52Larsenv (Larsenv) joins
15:31:15etnguyen03 quits [Client Quit]
15:31:56etnguyen03 (etnguyen03) joins
16:31:11etnguyen03 quits [Client Quit]
16:31:52etnguyen03 (etnguyen03) joins
16:52:17<pokechu22>done via #wikibot: https://archive.org/details/wiki-wiki.znc.in-20240428
16:54:47Guest joins
16:54:49<katia>pokechu22, thanks
17:46:35thalia joins
18:07:56etnguyen03 quits [Client Quit]
18:11:55leo60228- quits [Ping timeout: 255 seconds]
18:12:12leo60228 (leo60228) joins
18:42:12wyatt8740 quits [Remote host closed the connection]
18:45:16wyatt8740 joins
19:04:58<h2ibot>Wickedplayer494 edited Bazaar.tf (+2853, Bazaar is unfortunately dead for real this time): https://wiki.archiveteam.org/?diff=52110&oldid=31538
19:15:43pedantic-darwin joins
19:17:03f_ quits [Ping timeout: 250 seconds]
19:28:43tapos joins
19:39:41f_ (funderscore) joins
19:44:36f_ quits [Remote host closed the connection]
19:53:25etnguyen03 (etnguyen03) joins
19:57:12kiwiirc joins
19:57:27<kiwiirc>Probably worth revisiting this: https://wiki.archiveteam.org/index.php/Tapatalk
19:58:20<thuban>kiwiirc: any particular news?
19:59:53kiwiirc quits [Client Quit]
20:02:11kiwiirc joins
20:02:53<kiwiirc>thuban the news is already in the article. It's from late last year. Tapatalk hosts many old forums from 15+ years that will be lost otherwise
20:02:57kiwiirc quits [Client Quit]
20:14:36etnguyen03 quits [Client Quit]
20:27:32<that_lurker>Could someone throw https://met.refeds.org/ to AB. Could use a complete grab. Mainly for the offsite links that it contains
20:30:26<that_lurker>Though with a little work adding all the exports to #// could also be a good idea maybe
20:39:13etnguyen03 (etnguyen03) joins
20:43:34Island joins
21:00:02<thuban>that_lurker: running
21:00:11<that_lurker>thanks <3
21:23:59<@JAA>michaelblob_: Sounds like you might have bad routing to Hetzner (Germany, FSN1)? I haven't seen anything anywhere near that slow, even when pulling from the other end of the world (literally, NZ).
21:41:06ArcticCircleSys joins
21:44:41<nulldata>arkiver - just a reminder about Post.News. You asked for channel suggestions but I don't think a decision was made. Will there be a project? The shutdown post said within the next few weeks and this coming week would be the second week. The same post does say for users to export their data before May 31st so maybe there's still time?
21:51:38wyatt8740 quits [Ping timeout: 265 seconds]
21:52:00wyatt8740 joins
21:55:59ArcticCircleSys quits [Ping timeout: 265 seconds]
21:57:29BlueMaxima joins
22:03:40<michaelblob_>JAA: hm strange, i'm on the US east coast so i was expecting better speeds
22:15:46<that_lurker>Could someone also throw https://people.nwtime.org/ to AB. Commoncrawl got a lot of the links, but the files are not grabbed.
22:25:37etnguyen03 quits [Client Quit]
22:29:36etnguyen03 (etnguyen03) joins
22:46:25systwi quits [Ping timeout: 255 seconds]
23:03:45systwi (systwi) joins
23:34:57Wohlstand (Wohlstand) joins