| 00:08:23 | | lennier1 quits [Client Quit] |
| 00:08:48 | | lennier1 (lennier1) joins |
| 01:52:47 | <@JAA> | Looks like Apple Daily is back on the menu, everyone. Specifically, the Taiwanese version ceased publication on 31 Aug: https://www.taipeitimes.com/News/taiwan/archives/2022/08/31/2003784483 |
| 01:53:31 | <@JAA> | That's https://tw.appledaily.com/ and https://www.appledaily.com.tw/ |
| 01:56:13 | | MoeLarryShemp (MoeLarryShemp) joins |
| 01:58:28 | | Arcorann (Arcorann) joins |
| 02:02:04 | <thuban> | hmm, no offline date given |
| 02:10:05 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
| 02:14:05 | | pabs (pabs) joins |
| 02:41:57 | | xkey quits [Quit: xkey] |
| 02:55:33 | | MoeLarryShemp quits [Remote host closed the connection] |
| 03:10:36 | <h2ibot> | Switchnode edited Apple Daily (+1429, add taiwan): https://wiki.archiveteam.org/?diff=48907&oldid=48849 |
| 03:12:36 | <h2ibot> | Switchnode edited Apple Daily (+97, add appledaily.com.tw job): https://wiki.archiveteam.org/?diff=48908&oldid=48907 |
| 03:13:56 | <thuban> | main site and major social media saved or in progress. two things: |
| 03:15:40 | <thuban> | - not sure what's up with the 'readline timed out' error on the tw.appledaily.com job. the site looks almost identical to www.appledaily.com.tw, but ab reports finding at least a gig less; is this legit? |
| 03:18:24 | <thuban> | - can/should we apply all the techniques we used on apple daily hk (subdomain scans, video extraction, original article image extraction) on the tw sites? i don't know to what extent these apply (other than subdomains for the tw top-level) |
| 03:29:15 | <@JAA> | I highly doubt that AB is able to get everything on these sites. |
| 03:29:43 | <@JAA> | I think the TW sites were very different from the HK ones, but I'm not sure about that. |
| 03:30:26 | <thuban> | ^ probably, yeah. are you suggesting a seesaw project? |
| 03:30:44 | <@JAA> | I'm suggesting it needs to be looked at closely to decide what we can do. :-) |
| 03:31:23 | <thuban> | ugh, you and your reasonable nuanced responses. :) |
| 03:34:43 | <thuban> | we can definitely start looking for subdomains, though, yeah? what's out usual methodology there--just cdx? |
| 03:34:49 | <thuban> | *our |
| 04:00:00 | | treora quits [Quit: blub blub.] |
| 04:01:12 | | treora joins |
| 04:37:42 | <Jake> | https://twitter.com/textfiles/status/1565921616378445824 |
| 04:39:09 | <Jake> | (Looks like we already got the Twitter and site) |
| 05:18:26 | | sec^nd quits [Remote host closed the connection] |
| 05:18:52 | | sec^nd (second) joins |
| 05:29:25 | | tbc1887 (tbc1887) joins |
| 05:42:11 | | SketchCow joins |
| 06:04:10 | | tbc1887 quits [Read error: Connection reset by peer] |
| 07:07:26 | | nostrum-tango quits [Remote host closed the connection] |
| 07:08:30 | | nostrum-tango joins |
| 07:45:52 | | yawkat quits [Ping timeout: 265 seconds] |
| 07:54:46 | | sec^nd quits [Ping timeout: 240 seconds] |
| 07:56:22 | | sec^nd (second) joins |
| 08:04:54 | | yawkat (yawkat) joins |
| 08:15:22 | | xkey (xkey) joins |
| 08:20:32 | | Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ] |
| 08:21:18 | | Shjosan (Shjosan) joins |
| 08:31:48 | | mutantmnky quits [Remote host closed the connection] |
| 08:32:44 | | mutantmnky (mutantmonkey) joins |
| 09:00:01 | | Shjosan quits [Client Quit] |
| 09:20:45 | <h2ibot> | Usernam edited List of websites excluded from the Wayback Machine (+33): https://wiki.archiveteam.org/?diff=48909&oldid=48904 |
| 10:27:23 | | T31M quits [Quit: ZNC - https://znc.in] |
| 10:27:49 | | T31M joins |
| 10:35:46 | | march_happy quits [Ping timeout: 240 seconds] |
| 10:36:20 | | march_happy (march_happy) joins |
| 11:19:34 | | T31M quits [Client Quit] |
| 11:19:53 | | T31M joins |
| 11:48:47 | | benjinsmith joins |
| 11:50:46 | | benjins quits [Ping timeout: 240 seconds] |
| 11:58:44 | | benjinsmith is now known as benjins |
| 11:58:45 | | benjins is now authenticated as benjins |
| 12:13:21 | | sec^nd quits [Remote host closed the connection] |
| 12:13:52 | | sec^nd (second) joins |
| 12:17:13 | | sec^nd quits [Remote host closed the connection] |
| 12:17:46 | | sec^nd (second) joins |
| 12:57:46 | | march_happy quits [Ping timeout: 240 seconds] |
| 12:58:23 | | march_happy (march_happy) joins |
| 13:05:57 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 13:42:29 | | Mateon2 joins |
| 13:43:03 | | omglolbah quits [Quit: ZNC - https://znc.in] |
| 13:43:03 | | HP_Archivist quits [Remote host closed the connection] |
| 13:43:03 | | Mateon1 quits [Remote host closed the connection] |
| 13:43:03 | | drexler quits [Remote host closed the connection] |
| 13:43:03 | | Mateon2 is now known as Mateon1 |
| 13:43:10 | | HP_Archivist (HP_Archivist) joins |
| 13:43:14 | | drexler joins |
| 13:43:52 | | omglolbah joins |
| 14:02:23 | | march_happy quits [Ping timeout: 265 seconds] |
| 14:02:35 | | march_happy (march_happy) joins |
| 14:08:04 | | Arcorann quits [Ping timeout: 240 seconds] |
| 14:36:30 | | sec^nd quits [Remote host closed the connection] |
| 14:37:06 | | sec^nd (second) joins |
| 14:37:41 | | sec^nd quits [Remote host closed the connection] |
| 14:38:17 | | sec^nd (second) joins |
| 14:42:50 | | Minkafighter quits [Quit: The Lounge - https://thelounge.chat] |
| 14:43:57 | | Minkafighter joins |
| 15:06:20 | | vitzli (vitzli) joins |
| 15:08:36 | | vitzli quits [Client Quit] |
| 15:37:27 | | T31M is now authenticated as T31M |
| 15:49:49 | | dm4v quits [Client Quit] |
| 15:50:26 | | dm4v joins |
| 16:34:27 | | fishingforpie joins |
| 16:34:58 | <fishingforpie> | Is anyone familiar with now defunct site named mf247.jp? |
| 16:35:25 | <fishingforpie> | It was a Japanese music download site in the mid-2000s |
| 16:36:35 | <fishingforpie> | I am going to assume the answer is no because no English mention of it exists on the web. |
| 16:36:51 | <fishingforpie> | There was a song released on it and I am trying to find the song |
| 16:37:26 | <fishingforpie> | https://www.reddit.com/r/Lostwave/comments/wj0jc9/a_lost_japanese_idol_song_power_age_my_dear_friend/ I compiled all info of it into this Reddit post. |
| 16:39:47 | | sec^nd quits [Remote host closed the connection] |
| 16:40:12 | | sec^nd (second) joins |
| 16:46:01 | | fishingforpie quits [Remote host closed the connection] |
| 16:48:16 | | Minkafighter quits [Client Quit] |
| 16:53:54 | | Minkafighter joins |
| 16:54:48 | | drexler_ joins |
| 16:54:58 | | CraftByte quits [Quit: Ping timeout (120 seconds)] |
| 16:54:58 | | Matthww1 quits [Client Quit] |
| 16:54:58 | | AK quits [Quit: Ping timeout (120 seconds)] |
| 16:54:58 | | eroc1990 quits [Quit: Ping timeout (120 seconds)] |
| 16:54:58 | | drexler quits [Remote host closed the connection] |
| 16:54:59 | | coderobe quits [Quit: Ping timeout (120 seconds)] |
| 16:55:00 | | Matthww16 joins |
| 16:55:03 | | CraftByte9 (DragonSec|CraftByte) joins |
| 16:55:07 | | eroc1990 (eroc1990) joins |
| 16:55:16 | | coderobe9 (coderobe) joins |
| 16:55:22 | | AK8 (AK) joins |
| 17:41:16 | | march_happy quits [Ping timeout: 240 seconds] |
| 17:43:39 | | coderobe9 is now known as coderobe |
| 18:12:03 | | tech_exorcist (tech_exorcist) joins |
| 18:15:02 | | tech_exorcist quits [Client Quit] |
| 18:20:26 | | tech_exorcist (tech_exorcist) joins |
| 18:56:03 | | sec^nd quits [Remote host closed the connection] |
| 19:01:42 | | sec^nd (second) joins |
| 20:23:46 | | sec^nd quits [Ping timeout: 240 seconds] |
| 20:24:28 | | mutantmnky quits [Remote host closed the connection] |
| 20:24:29 | | tech_exorcist quits [Write error: Broken pipe] |
| 20:24:54 | | mutantmnky (mutantmonkey) joins |
| 20:25:31 | | tech_exorcist (tech_exorcist) joins |
| 20:28:34 | | sec^nd (second) joins |
| 21:00:46 | | sec^nd quits [Ping timeout: 240 seconds] |
| 21:05:34 | | sec^nd (second) joins |
| 21:10:58 | | AK8 is now known as AK |
| 21:29:16 | | mutantmnky quits [Ping timeout: 240 seconds] |
| 21:36:38 | | mutantmnky (mutantmonkey) joins |
| 22:02:02 | | march_happy (march_happy) joins |
| 22:08:54 | | tech_exorcist quits [Client Quit] |
| 22:09:16 | | march_happy quits [Read error: Connection reset by peer] |
| 22:09:27 | | march_happy (march_happy) joins |
| 22:45:13 | <systwi_> | fishingforpie: https://archive.fart.website/archivebot/viewer/?q=http%3A%2F%2Fmf247.jp%2F - "No search results." |
| 22:45:54 | <@JAA> | ArchiveBot (and in fact ArchiveTeam) did not exist in the mid-2000s, so uh yeah. |
| 22:46:21 | <systwi_> | Didn't know the site died back then. |
| 22:48:42 | <@JAA> | Yeah, they shut down in 2008 it seems. |
| 22:56:03 | <@JAA> | Maybe a CDX listing would unearth something. |
| 22:56:34 | | fishingforpie joins |
| 23:00:54 | <systwi_> | fishingforpie: From earlier: |
| 23:00:58 | <systwi_> | < systwi_> fishingforpie: https://archive.fart.website/archivebot/viewer/?q=http%3A%2F%2Fmf247.jp%2F - "No search results." |
| 23:01:03 | <systwi_> | <@JAA> ArchiveBot (and in fact ArchiveTeam) did not exist in the mid-2000s, so uh yeah. |
| 23:01:06 | <systwi_> | < systwi_> Didn't know the site died back then. |
| 23:01:11 | <systwi_> | <@JAA> Yeah, they shut down in 2008 it seems. |
| 23:01:17 | <systwi_> | <@JAA> Maybe a CDX listing would unearth something. |
| 23:01:21 | <fishingforpie> | CDX? |
| 23:01:40 | <fishingforpie> | My client has been disconnected for the past 6 hours, my bad. |
| 23:01:50 | <systwi_> | "Content inDeX" I believe; an index for WARC files if I'm not wrong. |
| 23:01:53 | <@JAA> | Or, to avoid spamming the channel: https://hackint.logs.kiska.pw/archiveteam-bs/20220903 |
| 23:02:03 | <systwi_> | That too, sorry <_>; |
| 23:02:43 | <fishingforpie> | Is there a link to CDX? |
| 23:03:42 | <@JAA> | CDX is a file format for WARC indices, yeah, but I was referring to the Wayback Machine's CDX API, which provides an index of all URLs in the WBM. |
| 23:03:53 | <@JAA> | It's not pleasant to use. |
| 23:04:06 | <fishingforpie> | I checked mf247.jp on wayback machine and got some info |
| 23:04:43 | <fishingforpie> | Specifically the song in question's page wasn't even archived, but the artist's was and the song has at least 2,021 downloads |
| 23:05:35 | <fishingforpie> | There's also a YouTube link that did contain the song at one point |
| 23:05:42 | <fishingforpie> | however the channel got terminate |
| 23:05:58 | <fishingforpie> | http://t.co/uIaUHbMwjg |
| 23:06:06 | <fishingforpie> | https://www.youtube.com/watch?v=x_sAUj2vkxQ |
| 23:06:29 | <@JAA> | Those links would be useful. |
| 23:06:42 | <@JAA> | The song and artist page, I mean. |
| 23:06:59 | <fishingforpie> | The artist page I could probably find |
| 23:07:14 | <fishingforpie> | May take a few days |
| 23:07:28 | <fishingforpie> | Or a few hours |
| 23:08:53 | <fishingforpie> | https://discord.com/channels/941154988300828744/941154988300828749/962821011177357402 |
| 23:09:01 | <fishingforpie> | https://web.archive.org/web/20080805100900/http://www.mf247.jp/jp/artist.php?aid=58085 |
| 23:10:07 | <fishingforpie> | Or a few minutes, that works too |
| 23:10:20 | <@JAA> | Ok yeah, only 17k unique URLs from mf247.jp in the WBM... |
| 23:10:42 | <fishingforpie> | mf247 did make CDs for indies at one point |
| 23:11:11 | <fishingforpie> | A CD for the song exists if you read my reddit post. |
| 23:11:22 | <@JAA> | https://web.archive.org/web/20080531154623/http://download.mf247.jp:80/dl.php?m=0058085001 Welp... |
| 23:12:19 | <fishingforpie> | That blog post you sent was made by someone who very likely has several types of media including the song |
| 23:12:30 | <@JAA> | Artist page was also captured in English, but that's about it: https://web.archive.org/web/20080506100122/http://www.mf247.jp/en/artist.php?aid=58085 |
| 23:12:41 | <fishingforpie> | A DVD, the CD, even the original download possibly |
| 23:13:16 | <@JAA> | Or at least I can't find anything else in the CDX data using the artist and song identifiers, for whatever that's worth. |
| 23:13:20 | <fishingforpie> | I don't think you sent it if I recall correctly though |
| 23:13:23 | <fishingforpie> | Oh. |
| 23:14:26 | <fishingforpie> | It was also available on an ancient smartphone app called Mobage. |
| 23:15:27 | <fishingforpie> | Which doesn't exist anymore. |
| 23:15:31 | <@JAA> | Maybe try Megalodon, but I think it was still in its early days at the time, so probably unlikely that there's anything there. |
| 23:15:38 | <fishingforpie> | Megalodon? |
| 23:15:39 | <@JAA> | https://megalodon.jp/ |
| 23:15:43 | <jamesp> | just heard about kiwifarms blocked |
| 23:16:03 | <fishingforpie> | What is that? |
| 23:16:11 | <fishingforpie> | megalodon, not kiwifarms |
| 23:16:17 | <fishingforpie> | Kiwifarms killed Near. |
| 23:16:22 | <@JAA> | Can you search the web? |
| 23:16:24 | <jamesp> | -> #archiveteam-ot for kiwifarms |
| 23:16:26 | <fishingforpie> | Yeah. |
| 23:16:45 | <fishingforpie> | Translate isn't working well |
| 23:16:59 | <@JAA> | There's also a web archival thingy from the National Diet Library in Japan, but that's even less likely to have anything I guess. |
| 23:18:54 | <fishingforpie> | megalodon has nothing |
| 23:19:51 | <fishingforpie> | I checked all of the YouTube archives and no uploads of the song are archived. |
| 23:20:17 | <fishingforpie> | It's sad how many dead ends there are. |
| 23:21:15 | <fishingforpie> | https://www.discogs.com/release/23640539-P-APower-Age-%E7%B4%84%E6%9D%9F-My-Dear-Friend |
| 23:22:08 | <@JAA> | systwi_: I have no idea what CDX stands for exactly. Might be what you say or something involving 'compression' perhaps. You'd have to ask whoever invented it at IA about 20 years ago. Even the earliest evidence of it just calls it 'CDX': https://web.archive.org/web/20031226073353/http://www.archive.org/web/researcher/cdx_file_format.php |
| 23:23:04 | <@JAA> | Although it wasn't only used for compressed contents, so yeah, 'content' is more likely. |
| 23:24:40 | <systwi_> | Thank you for the info. :-) |
| 23:24:54 | <@JAA> | Or maybe someone at the IIPC knows. |
| 23:25:00 | <systwi_> | Maybe it's one of those "it means C-index, C means nothing. Like GNU |
| 23:25:04 | <systwi_> | Oops... |
| 23:25:08 | <systwi_> | Maybe it's one of those "it means C-index, C means nothing. Like GNU's Not Unix." |
| 23:26:33 | <@JAA> | CDX index :-) |
| 23:26:51 | <@OrIdow6> | I thought I saw a video from IA where it was said to stand for "Content Index", I think, though what I remember was that no one knew what the letters stood for |
| 23:28:01 | <@OrIdow6> | No |
| 23:31:23 | | BlueMaxima joins |
| 23:31:50 | <fishingforpie> | helloooo |
| 23:32:23 | <@OrIdow6> | "known in the [unclear] literature as 'Capture Index' or "Crawl Index'", PM me if you want the source for that |
| 23:32:57 | <@JAA> | Ah yes, that would make a lot of sense as well. |
| 23:33:13 | <@JAA> | fishingforpie: The label still exists. Have you considered trying to contact them? https://stardustrecords.jp/sdr |
| 23:33:35 | <fishingforpie> | Is that a joke? |
| 23:33:44 | <@JAA> | ... No? |
| 23:33:51 | <fishingforpie> | They leave a automated message saying the group has disbanded. |
| 23:34:03 | <fishingforpie> | I contacted the producer for Promise (B-Side on the CD) |
| 23:34:15 | <fishingforpie> | Never got back |
| 23:34:34 | <fishingforpie> | JASRAC only has Promise registered so I cannot request them for the song or any info |
| 23:35:03 | <fishingforpie> | JASRAC is a database of all songs registered by RIAJ, RIAA's stricter cousin. |
| 23:35:59 | <fishingforpie> | I contacted some of the members too on Instagram. They liked my comment but never responded. |
| 23:38:29 | <@JAA> | I can't know what you tried if all I get is your Reddit post which doesn't mention any of that. |
| 23:38:48 | <fishingforpie> | Oh yeah, my bad. |
| 23:39:00 | <fishingforpie> | I really should edit it... |
| 23:39:27 | | Zerote joins |
| 23:39:28 | <systwi_> | Thanks for solving that mystery, OrIdow6. :-) |
| 23:39:30 | <@JAA> | Have you tried talking to the Discogs entry contributors? |
| 23:39:37 | <fishingforpie> | I am one of them |
| 23:39:44 | | Zerote quits [Remote host closed the connection] |
| 23:39:55 | <@JAA> | You know you're not supposed to contribute without having a copy of the release, right? |
| 23:39:56 | <fishingforpie> | A friend in the lostwave Discord servers, Evan, helped me make the page |
| 23:40:09 | <fishingforpie> | He said it was okay... |
| 23:40:16 | <@JAA> | He was wrong then. |
| 23:40:39 | <fishingforpie> | Can you show me the rules? |
| 23:40:40 | <@JAA> | Unless their policies changed since a few years ago when I was active there. |
| 23:40:49 | <fishingforpie> | There's several pages where that's not the case |
| 23:41:08 | <@JAA> | Nope, rule one is still 'get a copy of the release in front of you': https://support.discogs.com/hc/en-us/articles/360004051893 |
| 23:41:36 | <fishingforpie> | That's just a guide... |
| 23:41:38 | <@JAA> | Also https://support.discogs.com/hc/en-us/articles/360004016474-Overview-of-Submission-Guidelines-for-Releases |
| 23:41:56 | <@JAA> | Which is linked as the 'Database Guidelines' in the footer. |
| 23:42:16 | <@JAA> | I remember them making it quite clear in the interface as well at the time, but they redesigned the site since, so I don't know what it looks like now. |
| 23:42:39 | <fishingforpie> | https://www.discogs.com/release/23458226-The-Gallery-Little-Islands it's not the case here |
| 23:43:05 | | pcr quits [Ping timeout: 276 seconds] |
| 23:43:28 | <fishingforpie> | I was told to make the DIscogs page to help look for it |
| 23:43:29 | <@JAA> | I'm not going to argue with you about the Discogs rules here. Ask in their forums if you have any doubt. |
| 23:43:39 | <fishingforpie> | and I am not trying to. |
| 23:43:50 | <fishingforpie> | I just simply didn't know. I will go to their forums in a bit |
| 23:43:56 | <@JAA> | Submissions of fake or inexistent releases were a major problem at the time I was active there. |
| 23:44:04 | <@JAA> | Which is why that rule is a thing. |
| 23:44:07 | <fishingforpie> | Oh. |
| 23:44:33 | <fishingforpie> | I did have a sold listing and photos from a ganbaremomoka ameblo post to work with |
| 23:45:17 | <fishingforpie> | Do you have any ideas on how to find the song? |
| 23:46:01 | <fishingforpie> | ganbaremomoka is an ameblo blogger who has shown several Power Age items and has mentioned the song before and has the CD |
| 23:46:27 | <@JAA> | Any way to contact them? |
| 23:46:46 | <fishingforpie> | he has a twitter but I am too worried he won't be exactly nice to me due to our previous messages.. |
| 23:47:12 | <systwi_> | fishingforpie: If you're on EFnet, there's a music channel I frequent, #cassette , which some of the others there might be able to help. |
| 23:47:33 | <systwi_> | In regards to finding unknown music. |
| 23:47:40 | <systwi_> | Lost music, etc. |
| 23:48:03 | <fishingforpie> | https://twitter.com/hfkaren2 |
| 23:48:20 | <fishingforpie> | I am not on EFNet. |
| 23:48:25 | <fishingforpie> | How do I get on there? |
| 23:49:06 | <systwi_> | Add irc.efnet.net:9999 to your IRC client. |
| 23:49:21 | <systwi_> | Then /join #cassette |
| 23:49:40 | <fishingforpie> | How do I do that? I am new to IRC |
| 23:50:18 | <systwi_> | Minor warning, it's your typical IRC channel. Expect long delays between answers, if people happen to see it. |
| 23:50:25 | <fishingforpie> | Okay. |
| 23:50:34 | <systwi_> | Continuing on #archiveteam-ot |
| 23:50:45 | <@JAA> | Yeah, we're well beyond web stuff now. |
| 23:51:02 | <fishingforpie> | Yeah |