00:00:48dm4v quits [Read error: Connection reset by peer]
00:01:15dm4v joins
00:01:17dm4v quits [Changing host]
00:01:17dm4v (dm4v) joins
00:01:25<abccc>Ahh okay thanks.
00:01:45Arcorann__ joins
00:07:54nuroten joins
00:11:51<nuroten>AK: fwiw the ones that were mentioned in the past few days including orly's list (as far as I'm aware) are hkcnews, stand news, inmedia, factwire, hongkongfp, d100
00:12:08<nuroten>the others should probably be queued just in case
00:14:00<nuroten>for d100, the main content from their public radio channel is on youtube (https://www.youtube.com/user/D100HK), I don't know if someone already started on it
00:15:52<@JAA>17k videos in #youtubearchive, FWIW.
00:22:28<nuroten>nice!
00:24:45<nuroten>abccc: there is a playlist for some tvmost videos, only a few hundred, are these worth saving? https://www.youtube.com/channel/UCiJnCs2K5gP-DXnMxlstC9A
00:24:59<abccc>Yes, those are among the more important ones
00:26:12<abccc>nuroten the ones that I put in my list, that aren't in the list you just mentioned (hkcnews, stand news, etc) are also at high risk of deletion so I think we should work on them too if possible
00:27:36<nuroten>abccc: agree, it's what I was suggesting to AK to help them cross-check between the list orly sent and ones we might not have covered yet
00:27:53<abccc>great
00:36:52BlueMaxima joins
00:37:23<@JAA>The AB jobs for Virtual Teen (one for everything, one for just the low-bandwidth version of the forums) both finished, so I'll skip an archival with qwarc. The main job retrieved 154k threads and 221k thread pages according to some simple grepping on the log file, and the low-bandwidth one got 146k unique thread IDs. The homepage lists 219k threads, but a significant part is in restricted forums, so
00:37:29<@JAA>that seems alright.
00:42:57<Jake>wow that went really fast!
00:51:30c00k13 quits [Ping timeout: 250 seconds]
00:52:17c00k13 joins
00:58:10Ryz quits [Read error: Connection reset by peer]
00:58:51Ryz (Ryz) joins
01:01:15dm4v quits [Read error: Connection reset by peer]
01:02:53dm4v joins
01:02:55dm4v quits [Changing host]
01:02:55dm4v (dm4v) joins
01:07:51<nuroten>@JAA: any possibility we could archive the yt videos from the respective media sites based off abccc's list as well please? :)
01:08:02<nuroten>https://ttm.sh/FaN.txt
01:09:09<nuroten>brb in a while, I'll look up the links for the "others (still high priority)" list later
01:10:31<nuroten>abccc: do you know if Now TV has a separate playlist for news videos? I only found the financial news and their regular channel which has different things mixed in
01:12:16<@JAA>nuroten: Looks like a good chunk of it is already covered, but far from complete.
01:30:30leo60228 quits [Ping timeout: 250 seconds]
01:31:54leo60228 (leo60228) joins
01:34:37benjins joins
01:54:31<abccc>nuroten no I don't think so, all their sections (local, entertainment, international, etc) are all mixed together. If you manage to get a dump of news.now.com .m3u8 / .ts links, let me know
02:14:22Iki1 joins
02:18:04Iki quits [Ping timeout: 258 seconds]
02:35:02<nuroten>JAA: thanks :)
02:41:30AntiLiberal quits [Client Quit]
02:41:42AntiLiberal joins
02:51:06ThreeHM quits [Ping timeout: 250 seconds]
02:53:17ThreeHM (ThreeHeadedMonkey) joins
03:11:33AntiLiberal hi
03:15:26DogsRNice quits [Read error: Connection reset by peer]
03:17:59<AntiLiberal>hello
03:44:21qw3rty__ joins
03:48:09qw3rty_ quits [Ping timeout: 258 seconds]
04:11:25BlueMaxima quits [Read error: Connection reset by peer]
04:15:31AntiLiberal quits [Client Quit]
04:17:03AntiLiberal joins
05:32:03Justin[home] is now known as DopefishJustin
06:11:41nertzy__ joins
06:12:36nertzy_ quits [Ping timeout: 250 seconds]
06:27:55shoghicp (shoghicp) joins
07:35:48sonick quits [Quit: Connection closed for inactivity]
07:54:27sonick (sonick) joins
09:01:37Ryz quits [Remote host closed the connection]
09:02:33Ryz (Ryz) joins
09:25:06<AK>Hi AntiLiberal
09:26:08<AK>Wasn't there an archiveteam host site a bit like google docs at one point? Is that still around? (I want to use it while I look at some of the HK media sites and catalogue what's already done)
09:29:19HP_Archivist quits [Ping timeout: 258 seconds]
09:55:48sonick quits [Client Quit]
10:27:44HackMii quits [Remote host closed the connection]
10:29:39HackMii (hacktheplanet) joins
10:54:03<Jake>AK: Yes, I believe kiska ran an Etherpad instance for us: https://pad.notkiska.pw/
10:57:09<AK>That's the one, thanks!
11:26:57nuroten quits [Remote host closed the connection]
11:53:15<nyany>Don't know if anyone heard, but from that guy that was here yesterday asking after Near/byuu
11:53:20<nyany>They're gone :(
12:01:10achivarin quits [Remote host closed the connection]
12:01:11<Jake>:'(
12:04:18Specular joins
12:11:31<Specular>Would it be possible for someone with op permissions to archive the following forum? Has had many accounts deleted over the past year and would be nice to crawl it before more occur. http://www.thebore.com/forum/
12:11:40<Specular>*via archivebot
12:12:58<AK>If it's not been run by anyone else I'll do it this afternoon when i finish work Specular (Want to make sure someone can keep an eye on it while it runs)
12:13:55<Specular>AK, appreciated. The forum has three main sub-forums and the pagination goes back all the way to the beginning for each.
12:24:57achivarin (achivarin) joins
12:28:13KRG joins
12:28:13KRG quits [Changing host]
12:28:13KRG (KRG) joins
12:30:38KRG` quits [Ping timeout: 258 seconds]
12:38:06LeGoupil joins
12:43:02benjinsmith joins
12:45:58benjins quits [Ping timeout: 258 seconds]
12:46:27<@OrIdow6>AK: It's usually desirable, and more common, to put a list of links on a single site into AB rather than #//
12:46:54<@OrIdow6>Since that groups them together, allows for better monitoring, etc., at the cost of being slower
12:48:19<AK>Makes sense
12:58:45<AK>So OrIdow6, reckon it's worth throwing http://deepdream.psychic-vr-lab.com/ into AB then? I can't see a source on it shutting down, but it has stopped accepting uploads
13:13:43benjinsmith is now known as benjins
13:22:26<@OrIdow6>AK: I think so, unless it's in the TBs or something
13:26:01achivarin quits [Remote host closed the connection]
13:45:16yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/]
13:45:24yanome quits [Quit: The Lounge - https://thelounge.chat]
13:46:28yanome (yano) joins
13:47:22yano (yano) joins
14:03:36<billy549>"WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm/v7) and no specific platform was requested
14:03:36<billy549>" is there a reason ARM32 builds of Warrior aren't being made? my Pi 4 isn't running 64-bit Raspbian and I'd take a bet that most people's aren't ;p
14:05:01<AK>Not all components have been tested on ARM, and until everything is tested and confirmed to be working without any issues we don't want to publish builds for it
14:05:07<billy549>ah oki, no worries
14:08:24lorwp quits [Ping timeout: 250 seconds]
14:27:40achivarin (achivarin) joins
14:32:40Arcorann__ quits [Ping timeout: 250 seconds]
15:24:55britmob joins
15:26:58britm0b quits [Ping timeout: 258 seconds]
15:28:30AntiLiberal quits [Ping timeout: 258 seconds]
15:44:44llacb47 joins
15:50:07Specular quits [Client Quit]
16:04:37llacb47 quits [Remote host closed the connection]
16:12:10<kiska8>Jake AK It's still running :D
16:12:15kiska8 is now known as kiska
16:12:38<Jake>:)
16:14:01<kiska>It might be a bit slow given the thing is only 1vCPU + 1G of memory
16:44:07lennier2 joins
16:45:42achivarin quits [Remote host closed the connection]
16:46:08lennier1 quits [Ping timeout: 250 seconds]
16:46:09lennier2 is now known as lennier1
16:53:10achivarin (achivarin) joins
17:04:07britm0b joins
17:05:35<@JAA>Does anyone want to attempt to grab Stand News's Facebook videos? https://pastebin.com/9hMpxr2W (from /u/ChicagoDataHoarder in https://old.reddit.com/r/Archiveteam/comments/o90rgr/another_prodemocracy_newspaper_in_hong_kong/ )
17:05:49<@JAA>Facebook's an arse, and AB doesn't grab the actual video.
17:06:15britmob quits [Ping timeout: 258 seconds]
17:07:26AntiLiberal joins
17:08:00Daloader joins
17:08:55LeGoupil quits [Client Quit]
17:09:16LeGoupil joins
17:10:36<@EggplantN>OrIdow6 any projects from you coming up? :)
17:10:49<@EggplantN>I've got a night spare today, so just doing some cleaning up/planning
17:10:51<@OrIdow6>EggplantN: Um, I think I need to look into that French thing
17:11:15<@OrIdow6>Which is small enough for AB, but unfortunately it uses subdomains in a way that makes it incompatible
17:11:37<@EggplantN>Are you doing #enjinxed or is that arkiver?
17:12:48<@OrIdow6>I might do it in the next few days if he has nothing
17:12:51<@OrIdow6>But nothing absolute
17:13:10<@EggplantN>Ah okie, I didn't know whose that was. So nothing from you other than maybe framasoft
17:13:52<@OrIdow6>Looking at Deathwatch, I think it's just that, Brilliant.org if we want to do a WBM pass as someone suggested, and Sony stores
17:14:34<@EggplantN>Aight nice, if you want to do any, shout up, I'm cleaning targets etc today
17:16:02AntiLiberal quits [Ping timeout: 250 seconds]
17:16:13<@OrIdow6>Ok
17:36:24Daloader quits [Ping timeout: 250 seconds]
17:39:17lorwp (lorwp) joins
17:45:30lorwp quits [Ping timeout: 250 seconds]
17:45:39abccc quits [Remote host closed the connection]
17:51:58<Jake>I'd be happy to try and grab Stand News's videos when I get home in a little bit.
18:13:45LeGoupil quits [Client Quit]
18:17:41nuroten joins
18:30:07DogsRNice (Webuser299) joins
18:37:36ddd joins
18:42:16<nuroten>JAA: sorry for the delay, the rest of the "others (still high priority)" yt video channels based on abccc's list of websites https://ttm.sh/FOp.txt
18:43:49<nuroten>socrec.org (the website) was on orly's list, but we didn't add the yt channels before I think. they seem to be split up by individuals, the first 2 socrec channels have the most videos
18:46:40<nuroten>AK: is there a pad started for the HK media/websites yet? I can help organise / sort things, cross-check lists, etc. if that might be helpful
18:51:07<AK>I started making one, then struggled to make a table so I went back to onenote locally until I came up with something better
18:51:16<@HCross>good choice
18:52:45<nuroten>another thought, would it be worth archiving websites of political parties? a few of them have recently been disbanded, might be bad news for the sites lifespan
18:53:38<AK>That sounds like something worth archiving to me
18:54:07<nuroten>table ... ethercalc? haha
18:54:37<nuroten>okay, thanks, I'll collect up some links for those
18:54:48<AK>https://pad.notkiska.pw/p/ATArchiveHongKongMedia
18:55:21<AK>There's the list of "High but not highest priority" media sites I need to throw into AB next, so feel free to add a tonne of stuff onto there and I'll just keep getting them through AB
18:55:41<nuroten>awesome, will do, thanks
18:56:40<@JAA>Please set your nickname in the top right corner before making edits so we know who changed what. :-)
18:57:43<nuroten>okay :)
19:06:02x9fff00 quits [Quit: leaving]
19:07:04x9fff00 (x9fff00) joins
19:53:11<Jake>(I've started on the list from reddit)
19:56:16Daloader joins
20:02:42ddd quits [Remote host closed the connection]
20:09:52HP_Archivist (HP_Archivist) joins
20:26:11jacobk joins
20:33:00<nuroten>is hkchronicles.com already (occasionally) archived? not sure about AT's policy on sites with doxxing overtones (name and phone numbers, though that isn't the primary feature of the site)
20:36:31<nuroten>the listing of pro-democracy businesses and shops might be useful
21:20:02<thuban>i'm free this afternoon. would it be helpful if i were to make a general hong kong media wiki page and copy current state to a table there?
21:20:49<AK>Yes!
21:20:53<AK>https://pad.notkiska.pw/p/ATArchiveHongKongMedia
21:21:42<thuban>sounds good, will get started in about an hour :)
21:22:12<AK>Awesome, ping me if you need anything from me :)
21:49:15lorwp (lorwp) joins
22:16:02<Jake>Wow. Facebook ratelimiting is quite annoying.
22:24:17abccc joins
23:01:26<thuban>AK: is the AB job for cablenews.i-cable.com done? i don't see it on the dash but it's not in the viewer (yet)
23:01:38<thuban>& if so could you paste the job id?
23:05:03<AK>e3uw2mq9ewhp3jnsajodw5oi
23:05:53<thuban>thanks! it is done, then?
23:14:37<AK>Yep it's done, sorry
23:29:47ddd joins
23:34:47ddd quits [Remote host closed the connection]
23:37:22Daloader quits [Ping timeout: 250 seconds]
23:41:55<thuban>how's this, everybody? https://wiki.archiveteam.org/index.php/Hong_Kong_media
23:42:59<thuban>(i would appreciate corrections, if anyone has them, on the en and/or zh title of each site)
23:44:58<thuban>so far it's just the highest-priority news sites; i will start adding other news, political parties, & orgs in a bit
23:47:25lorwp quits [Client Quit]
23:47:56nuroten quits [Remote host closed the connection]
23:48:30Arcorann__ joins
23:48:37lorwp (lorwp) joins
23:48:46nuroten joins
23:49:01<nuroten>thuban: looking really good! thanks for collecting them up :) name for tvmost.com.hk - TVMost (100毛)
23:49:26<nuroten>d100.net is just D100
23:49:52<thuban>thanks!
23:50:34<thuban>hm... i think i'll swap the job ids and status indicators in the archivebot column so it actually sorts usefully
23:57:24BlueMaxima joins
23:58:43HP_Archivist quits [Ping timeout: 258 seconds]
23:59:47<nuroten>actually, this may be more accurate - TVMost (毛記電視) / 100Most (100毛)