00:08:23lennier1 quits [Client Quit]
00:08:48lennier1 (lennier1) joins
01:52:47<@JAA>Looks like Apple Daily is back on the menu, everyone. Specifically, the Taiwanese version ceased publication on 31 Aug: https://www.taipeitimes.com/News/taiwan/archives/2022/08/31/2003784483
01:53:31<@JAA>That's https://tw.appledaily.com/ and https://www.appledaily.com.tw/
01:56:13MoeLarryShemp (MoeLarryShemp) joins
01:58:28Arcorann (Arcorann) joins
02:02:04<thuban>hmm, no offline date given
02:10:05pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
02:14:05pabs (pabs) joins
02:41:57xkey quits [Quit: xkey]
02:55:33MoeLarryShemp quits [Remote host closed the connection]
03:10:36<h2ibot>Switchnode edited Apple Daily (+1429, add taiwan): https://wiki.archiveteam.org/?diff=48907&oldid=48849
03:12:36<h2ibot>Switchnode edited Apple Daily (+97, add appledaily.com.tw job): https://wiki.archiveteam.org/?diff=48908&oldid=48907
03:13:56<thuban>main site and major social media saved or in progress. two things:
03:15:40<thuban>- not sure what's up with the 'readline timed out' error on the tw.appledaily.com job. the site looks almost identical to www.appledaily.com.tw, but ab reports finding at least a gig less; is this legit?
03:18:24<thuban>- can/should we apply all the techniques we used on apple daily hk (subdomain scans, video extraction, original article image extraction) on the tw sites? i don't know to what extent these apply (other than subdomains for the tw top-level)
03:29:15<@JAA>I highly doubt that AB is able to get everything on these sites.
03:29:43<@JAA>I think the TW sites were very different from the HK ones, but I'm not sure about that.
03:30:26<thuban>^ probably, yeah. are you suggesting a seesaw project?
03:30:44<@JAA>I'm suggesting it needs to be looked at closely to decide what we can do. :-)
03:31:23<thuban>ugh, you and your reasonable nuanced responses. :)
03:34:43<thuban>we can definitely start looking for subdomains, though, yeah? what's out usual methodology there--just cdx?
03:34:49<thuban>*our
04:00:00treora quits [Quit: blub blub.]
04:01:12treora joins
04:37:42<Jake>https://twitter.com/textfiles/status/1565921616378445824
04:39:09<Jake>(Looks like we already got the Twitter and site)
05:18:26sec^nd quits [Remote host closed the connection]
05:18:52sec^nd (second) joins
05:29:25tbc1887 (tbc1887) joins
05:42:11SketchCow joins
06:04:10tbc1887 quits [Read error: Connection reset by peer]
07:07:26nostrum-tango quits [Remote host closed the connection]
07:08:30nostrum-tango joins
07:45:52yawkat quits [Ping timeout: 265 seconds]
07:54:46sec^nd quits [Ping timeout: 240 seconds]
07:56:22sec^nd (second) joins
08:04:54yawkat (yawkat) joins
08:15:22xkey (xkey) joins
08:20:32Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
08:21:18Shjosan (Shjosan) joins
08:31:48mutantmnky quits [Remote host closed the connection]
08:32:44mutantmnky (mutantmonkey) joins
09:00:01Shjosan quits [Client Quit]
09:20:45<h2ibot>Usernam edited List of websites excluded from the Wayback Machine (+33): https://wiki.archiveteam.org/?diff=48909&oldid=48904
10:27:23T31M quits [Quit: ZNC - https://znc.in]
10:27:49T31M joins
10:35:46march_happy quits [Ping timeout: 240 seconds]
10:36:20march_happy (march_happy) joins
11:19:34T31M quits [Client Quit]
11:19:53T31M joins
11:48:47benjinsmith joins
11:50:46benjins quits [Ping timeout: 240 seconds]
11:58:44benjinsmith is now known as benjins
12:13:21sec^nd quits [Remote host closed the connection]
12:13:52sec^nd (second) joins
12:17:13sec^nd quits [Remote host closed the connection]
12:17:46sec^nd (second) joins
12:57:46march_happy quits [Ping timeout: 240 seconds]
12:58:23march_happy (march_happy) joins
13:05:57BlueMaxima quits [Read error: Connection reset by peer]
13:42:29Mateon2 joins
13:43:03omglolbah quits [Quit: ZNC - https://znc.in]
13:43:03HP_Archivist quits [Remote host closed the connection]
13:43:03Mateon1 quits [Remote host closed the connection]
13:43:03drexler quits [Remote host closed the connection]
13:43:03Mateon2 is now known as Mateon1
13:43:10HP_Archivist (HP_Archivist) joins
13:43:14drexler joins
13:43:52omglolbah joins
14:02:23march_happy quits [Ping timeout: 265 seconds]
14:02:35march_happy (march_happy) joins
14:08:04Arcorann quits [Ping timeout: 240 seconds]
14:36:30sec^nd quits [Remote host closed the connection]
14:37:06sec^nd (second) joins
14:37:41sec^nd quits [Remote host closed the connection]
14:38:17sec^nd (second) joins
14:42:50Minkafighter quits [Quit: The Lounge - https://thelounge.chat]
14:43:57Minkafighter joins
15:06:20vitzli (vitzli) joins
15:08:36vitzli quits [Client Quit]
15:49:49dm4v quits [Client Quit]
15:50:26dm4v joins
16:34:27fishingforpie joins
16:34:58<fishingforpie>Is anyone familiar with now defunct site named mf247.jp?
16:35:25<fishingforpie>It was a Japanese music download site in the mid-2000s
16:36:35<fishingforpie>I am going to assume the answer is no because no English mention of it exists on the web.
16:36:51<fishingforpie>There was a song released on it and I am trying to find the song
16:37:26<fishingforpie>https://www.reddit.com/r/Lostwave/comments/wj0jc9/a_lost_japanese_idol_song_power_age_my_dear_friend/ I compiled all info of it into this Reddit post.
16:39:47sec^nd quits [Remote host closed the connection]
16:40:12sec^nd (second) joins
16:46:01fishingforpie quits [Remote host closed the connection]
16:48:16Minkafighter quits [Client Quit]
16:53:54Minkafighter joins
16:54:48drexler_ joins
16:54:58CraftByte quits [Quit: Ping timeout (120 seconds)]
16:54:58Matthww1 quits [Client Quit]
16:54:58AK quits [Quit: Ping timeout (120 seconds)]
16:54:58eroc1990 quits [Quit: Ping timeout (120 seconds)]
16:54:58drexler quits [Remote host closed the connection]
16:54:59coderobe quits [Quit: Ping timeout (120 seconds)]
16:55:00Matthww16 joins
16:55:03CraftByte9 (DragonSec|CraftByte) joins
16:55:07eroc1990 (eroc1990) joins
16:55:16coderobe9 (coderobe) joins
16:55:22AK8 (AK) joins
17:41:16march_happy quits [Ping timeout: 240 seconds]
17:43:39coderobe9 is now known as coderobe
18:12:03tech_exorcist (tech_exorcist) joins
18:15:02tech_exorcist quits [Client Quit]
18:20:26tech_exorcist (tech_exorcist) joins
18:56:03sec^nd quits [Remote host closed the connection]
19:01:42sec^nd (second) joins
20:23:46sec^nd quits [Ping timeout: 240 seconds]
20:24:28mutantmnky quits [Remote host closed the connection]
20:24:29tech_exorcist quits [Write error: Broken pipe]
20:24:54mutantmnky (mutantmonkey) joins
20:25:31tech_exorcist (tech_exorcist) joins
20:28:34sec^nd (second) joins
21:00:46sec^nd quits [Ping timeout: 240 seconds]
21:05:34sec^nd (second) joins
21:10:58AK8 is now known as AK
21:29:16mutantmnky quits [Ping timeout: 240 seconds]
21:36:38mutantmnky (mutantmonkey) joins
22:02:02march_happy (march_happy) joins
22:08:54tech_exorcist quits [Client Quit]
22:09:16march_happy quits [Read error: Connection reset by peer]
22:09:27march_happy (march_happy) joins
22:45:13<systwi_>fishingforpie: https://archive.fart.website/archivebot/viewer/?q=http%3A%2F%2Fmf247.jp%2F - "No search results."
22:45:54<@JAA>ArchiveBot (and in fact ArchiveTeam) did not exist in the mid-2000s, so uh yeah.
22:46:21<systwi_>Didn't know the site died back then.
22:48:42<@JAA>Yeah, they shut down in 2008 it seems.
22:56:03<@JAA>Maybe a CDX listing would unearth something.
22:56:34fishingforpie joins
23:00:54<systwi_>fishingforpie: From earlier:
23:00:58<systwi_>< systwi_> fishingforpie: https://archive.fart.website/archivebot/viewer/?q=http%3A%2F%2Fmf247.jp%2F - "No search results."
23:01:03<systwi_><@JAA> ArchiveBot (and in fact ArchiveTeam) did not exist in the mid-2000s, so uh yeah.
23:01:06<systwi_>< systwi_> Didn't know the site died back then.
23:01:11<systwi_><@JAA> Yeah, they shut down in 2008 it seems.
23:01:17<systwi_><@JAA> Maybe a CDX listing would unearth something.
23:01:21<fishingforpie>CDX?
23:01:40<fishingforpie>My client has been disconnected for the past 6 hours, my bad.
23:01:50<systwi_>"Content inDeX" I believe; an index for WARC files if I'm not wrong.
23:01:53<@JAA>Or, to avoid spamming the channel: https://hackint.logs.kiska.pw/archiveteam-bs/20220903
23:02:03<systwi_>That too, sorry <_>;
23:02:43<fishingforpie>Is there a link to CDX?
23:03:42<@JAA>CDX is a file format for WARC indices, yeah, but I was referring to the Wayback Machine's CDX API, which provides an index of all URLs in the WBM.
23:03:53<@JAA>It's not pleasant to use.
23:04:06<fishingforpie>I checked mf247.jp on wayback machine and got some info
23:04:43<fishingforpie>Specifically the song in question's page wasn't even archived, but the artist's was and the song has at least 2,021 downloads
23:05:35<fishingforpie>There's also a YouTube link that did contain the song at one point
23:05:42<fishingforpie>however the channel got terminate
23:05:58<fishingforpie>http://t.co/uIaUHbMwjg
23:06:06<fishingforpie>https://www.youtube.com/watch?v=x_sAUj2vkxQ
23:06:29<@JAA>Those links would be useful.
23:06:42<@JAA>The song and artist page, I mean.
23:06:59<fishingforpie>The artist page I could probably find
23:07:14<fishingforpie>May take a few days
23:07:28<fishingforpie>Or a few hours
23:08:53<fishingforpie>https://discord.com/channels/941154988300828744/941154988300828749/962821011177357402
23:09:01<fishingforpie>https://web.archive.org/web/20080805100900/http://www.mf247.jp/jp/artist.php?aid=58085
23:10:07<fishingforpie>Or a few minutes, that works too
23:10:20<@JAA>Ok yeah, only 17k unique URLs from mf247.jp in the WBM...
23:10:42<fishingforpie>mf247 did make CDs for indies at one point
23:11:11<fishingforpie>A CD for the song exists if you read my reddit post.
23:11:22<@JAA>https://web.archive.org/web/20080531154623/http://download.mf247.jp:80/dl.php?m=0058085001 Welp...
23:12:19<fishingforpie>That blog post you sent was made by someone who very likely has several types of media including the song
23:12:30<@JAA>Artist page was also captured in English, but that's about it: https://web.archive.org/web/20080506100122/http://www.mf247.jp/en/artist.php?aid=58085
23:12:41<fishingforpie>A DVD, the CD, even the original download possibly
23:13:16<@JAA>Or at least I can't find anything else in the CDX data using the artist and song identifiers, for whatever that's worth.
23:13:20<fishingforpie>I don't think you sent it if I recall correctly though
23:13:23<fishingforpie>Oh.
23:14:26<fishingforpie>It was also available on an ancient smartphone app called Mobage.
23:15:27<fishingforpie>Which doesn't exist anymore.
23:15:31<@JAA>Maybe try Megalodon, but I think it was still in its early days at the time, so probably unlikely that there's anything there.
23:15:38<fishingforpie>Megalodon?
23:15:39<@JAA>https://megalodon.jp/
23:15:43<jamesp>just heard about kiwifarms blocked
23:16:03<fishingforpie>What is that?
23:16:11<fishingforpie>megalodon, not kiwifarms
23:16:17<fishingforpie>Kiwifarms killed Near.
23:16:22<@JAA>Can you search the web?
23:16:24<jamesp>-> #archiveteam-ot for kiwifarms
23:16:26<fishingforpie>Yeah.
23:16:45<fishingforpie>Translate isn't working well
23:16:59<@JAA>There's also a web archival thingy from the National Diet Library in Japan, but that's even less likely to have anything I guess.
23:18:54<fishingforpie>megalodon has nothing
23:19:51<fishingforpie>I checked all of the YouTube archives and no uploads of the song are archived.
23:20:17<fishingforpie>It's sad how many dead ends there are.
23:21:15<fishingforpie>https://www.discogs.com/release/23640539-P-APower-Age-%E7%B4%84%E6%9D%9F-My-Dear-Friend
23:22:08<@JAA>systwi_: I have no idea what CDX stands for exactly. Might be what you say or something involving 'compression' perhaps. You'd have to ask whoever invented it at IA about 20 years ago. Even the earliest evidence of it just calls it 'CDX': https://web.archive.org/web/20031226073353/http://www.archive.org/web/researcher/cdx_file_format.php
23:23:04<@JAA>Although it wasn't only used for compressed contents, so yeah, 'content' is more likely.
23:24:40<systwi_>Thank you for the info. :-)
23:24:54<@JAA>Or maybe someone at the IIPC knows.
23:25:00<systwi_>Maybe it's one of those "it means C-index, C means nothing. Like GNU
23:25:04<systwi_>Oops...
23:25:08<systwi_>Maybe it's one of those "it means C-index, C means nothing. Like GNU's Not Unix."
23:26:33<@JAA>CDX index :-)
23:26:51<@OrIdow6>I thought I saw a video from IA where it was said to stand for "Content Index", I think, though what I remember was that no one knew what the letters stood for
23:28:01<@OrIdow6>No
23:31:23BlueMaxima joins
23:31:50<fishingforpie>helloooo
23:32:23<@OrIdow6>"known in the [unclear] literature as 'Capture Index' or "Crawl Index'", PM me if you want the source for that
23:32:57<@JAA>Ah yes, that would make a lot of sense as well.
23:33:13<@JAA>fishingforpie: The label still exists. Have you considered trying to contact them? https://stardustrecords.jp/sdr
23:33:35<fishingforpie>Is that a joke?
23:33:44<@JAA>... No?
23:33:51<fishingforpie>They leave a automated message saying the group has disbanded.
23:34:03<fishingforpie>I contacted the producer for Promise (B-Side on the CD)
23:34:15<fishingforpie>Never got back
23:34:34<fishingforpie>JASRAC only has Promise registered so I cannot request them for the song or any info
23:35:03<fishingforpie>JASRAC is a database of all songs registered by RIAJ, RIAA's stricter cousin.
23:35:59<fishingforpie>I contacted some of the members too on Instagram. They liked my comment but never responded.
23:38:29<@JAA>I can't know what you tried if all I get is your Reddit post which doesn't mention any of that.
23:38:48<fishingforpie>Oh yeah, my bad.
23:39:00<fishingforpie>I really should edit it...
23:39:27Zerote joins
23:39:28<systwi_>Thanks for solving that mystery, OrIdow6. :-)
23:39:30<@JAA>Have you tried talking to the Discogs entry contributors?
23:39:37<fishingforpie>I am one of them
23:39:44Zerote quits [Remote host closed the connection]
23:39:55<@JAA>You know you're not supposed to contribute without having a copy of the release, right?
23:39:56<fishingforpie>A friend in the lostwave Discord servers, Evan, helped me make the page
23:40:09<fishingforpie>He said it was okay...
23:40:16<@JAA>He was wrong then.
23:40:39<fishingforpie>Can you show me the rules?
23:40:40<@JAA>Unless their policies changed since a few years ago when I was active there.
23:40:49<fishingforpie>There's several pages where that's not the case
23:41:08<@JAA>Nope, rule one is still 'get a copy of the release in front of you': https://support.discogs.com/hc/en-us/articles/360004051893
23:41:36<fishingforpie>That's just a guide...
23:41:38<@JAA>Also https://support.discogs.com/hc/en-us/articles/360004016474-Overview-of-Submission-Guidelines-for-Releases
23:41:56<@JAA>Which is linked as the 'Database Guidelines' in the footer.
23:42:16<@JAA>I remember them making it quite clear in the interface as well at the time, but they redesigned the site since, so I don't know what it looks like now.
23:42:39<fishingforpie>https://www.discogs.com/release/23458226-The-Gallery-Little-Islands it's not the case here
23:43:05pcr quits [Ping timeout: 276 seconds]
23:43:28<fishingforpie>I was told to make the DIscogs page to help look for it
23:43:29<@JAA>I'm not going to argue with you about the Discogs rules here. Ask in their forums if you have any doubt.
23:43:39<fishingforpie>and I am not trying to.
23:43:50<fishingforpie>I just simply didn't know. I will go to their forums in a bit
23:43:56<@JAA>Submissions of fake or inexistent releases were a major problem at the time I was active there.
23:44:04<@JAA>Which is why that rule is a thing.
23:44:07<fishingforpie>Oh.
23:44:33<fishingforpie>I did have a sold listing and photos from a ganbaremomoka ameblo post to work with
23:45:17<fishingforpie>Do you have any ideas on how to find the song?
23:46:01<fishingforpie>ganbaremomoka is an ameblo blogger who has shown several Power Age items and has mentioned the song before and has the CD
23:46:27<@JAA>Any way to contact them?
23:46:46<fishingforpie>he has a twitter but I am too worried he won't be exactly nice to me due to our previous messages..
23:47:12<systwi_>fishingforpie: If you're on EFnet, there's a music channel I frequent, #cassette , which some of the others there might be able to help.
23:47:33<systwi_>In regards to finding unknown music.
23:47:40<systwi_>Lost music, etc.
23:48:03<fishingforpie>https://twitter.com/hfkaren2
23:48:20<fishingforpie>I am not on EFNet.
23:48:25<fishingforpie>How do I get on there?
23:49:06<systwi_>Add irc.efnet.net:9999 to your IRC client.
23:49:21<systwi_>Then /join #cassette
23:49:40<fishingforpie>How do I do that? I am new to IRC
23:50:18<systwi_>Minor warning, it's your typical IRC channel. Expect long delays between answers, if people happen to see it.
23:50:25<fishingforpie>Okay.
23:50:34<systwi_>Continuing on #archiveteam-ot
23:50:45<@JAA>Yeah, we're well beyond web stuff now.
23:51:02<fishingforpie>Yeah