00:01:18 | | qwertyasdfuiopghjkl quits [Client Quit] |
00:02:26 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
00:15:54 | | etnguyen03 (etnguyen03) joins |
00:27:31 | | BigBrain quits [Ping timeout: 245 seconds] |
00:29:46 | | BigBrain (bigbrain) joins |
00:43:55 | | Mateon2 joins |
00:45:35 | | Mateon1 quits [Ping timeout: 252 seconds] |
00:45:35 | | Mateon2 is now known as Mateon1 |
00:46:08 | | katocala quits [Ping timeout: 252 seconds] |
00:46:52 | | katocala joins |
00:50:03 | | BlueMaxima joins |
00:57:43 | | etnguyen03 quits [Ping timeout: 265 seconds] |
01:13:15 | | etnguyen03 (etnguyen03) joins |
01:13:31 | | qwertyasdfuiopghjkl quits [Client Quit] |
01:19:41 | | katocala quits [Ping timeout: 252 seconds] |
01:19:46 | | katocala joins |
01:20:53 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
01:47:59 | | etnguyen03 quits [Ping timeout: 265 seconds] |
01:50:53 | | katocala quits [Ping timeout: 265 seconds] |
01:51:13 | | AlsoHP_Archivist quits [Client Quit] |
01:51:33 | | HP_Archivist (HP_Archivist) joins |
01:51:35 | | katocala joins |
01:58:43 | | etnguyen03 (etnguyen03) joins |
02:33:40 | | katocala is now authenticated as katocala |
02:36:37 | <pabs> | https://www.wired.com/story/epic-games-sale-bandcamp-music-platform-limbo/ |
02:36:50 | <pabs> | "Bandcamp workers say they are unable to do their jobs after being locked out of critical systems. They’re also expecting layoffs." |
02:37:04 | <pabs> | /cc arkiver JAA :) |
02:38:53 | | katocala quits [Ping timeout: 252 seconds] |
02:39:25 | | katocala joins |
02:43:50 | | katocala quits [Ping timeout: 252 seconds] |
02:44:03 | | katocala joins |
02:44:29 | | katocala is now authenticated as katocala |
02:46:02 | | dumbgoy__ joins |
02:49:53 | | dumbgoy_ quits [Ping timeout: 252 seconds] |
02:51:32 | | dumbgoy__ quits [Ping timeout: 252 seconds] |
03:02:37 | <audrooku|m> | So when are we grabbing bandcamp boys |
03:02:47 | <audrooku|m> | don't let em off easy like soundcloud |
03:03:02 | <fireonlive> | didn't like soundcloud threaten archiveteam |
03:03:18 | <audrooku|m> | IA, so basically yes |
03:03:29 | <audrooku|m> | a 128k mp3 BC grab would certainly be less than 1PB |
03:24:04 | <audrooku|m> | nevermind probably more like 1.5 |
03:24:39 | | katocala quits [Ping timeout: 265 seconds] |
03:26:28 | | katocala joins |
03:35:17 | | katocala quits [Ping timeout: 265 seconds] |
03:36:21 | | katocala joins |
03:48:41 | <mgrandi> | re Xentax: is there a way to get the files if the main page is not working? that XML dump that i request that they thankfully posted doesn't have the files |
03:53:22 | | lukash9 quits [Quit: The Lounge - https://thelounge.chat] |
03:57:23 | | Megame quits [Client Quit] |
04:01:06 | | lukash9 joins |
04:04:41 | | wyatt8740 quits [Ping timeout: 252 seconds] |
04:08:05 | | DogsRNice quits [Read error: Connection reset by peer] |
04:10:56 | | wyatt8740 joins |
04:20:37 | <pokechu22> | mgrandi: https://archive.org/details/wiki-wikixentaxcom_202305 and https://archive.org/details/wiki-wikixentaxcom-20230811 both contain files. Looks like https://wiki.xentax.com/images/8/83/File_stripper_01.png etc still works (from |
04:20:40 | <pokechu22> | https://ia802609.us.archive.org/view_archive.php?archive=/16/items/wiki-wikixentaxcom_202305/wikixentaxcom-20230513-wikidump.7z&file=wikixentaxcom-20230513-images.txt) but there isn't an easy way to list files with the index not working (and the api not supporting json)... but it looks like it supports JSON now so hmm |
04:28:07 | | kiryu_ joins |
04:31:38 | | kiryu quits [Ping timeout: 252 seconds] |
04:33:17 | | katocala quits [Ping timeout: 265 seconds] |
04:34:19 | | katocala joins |
04:41:46 | <pokechu22> | mgrandi: ok, I got an image only dump: https://archive.org/details/wiki-wiki.xentax.com-20231008 (it doesn't seem like wikibot wants to dump the non-image content now, so that's fun) |
04:45:31 | | kiryu_ quits [Client Quit] |
04:46:12 | <thuban> | bandcamp band and item ids appear to be nonsequential 10-digit numbers, but there's a "full artist index": https://bandcamp.com/artist_index |
04:48:21 | | kiryu joins |
04:48:21 | | kiryu is now authenticated as kiryu |
04:48:21 | | kiryu quits [Changing host] |
04:48:21 | | kiryu (kiryu) joins |
04:49:12 | <thuban> | (other potential discovery sources include the "discover" endpoint at https://bandcamp.com/api/discover/3/get_web, although each query is limited to ~4.3k results, and the in-html recommendations on each item page--but i doubt either would include anything somehow absent from the index) |
04:58:02 | | katocala quits [Ping timeout: 252 seconds] |
04:58:11 | | katocala joins |
05:02:46 | | katocala quits [Ping timeout: 265 seconds] |
05:03:46 | | katocala joins |
05:14:41 | | etnguyen03 quits [Client Quit] |
05:18:10 | | Overlordz joins |
05:21:08 | | Island quits [Read error: Connection reset by peer] |
05:21:12 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:28:52 | | katocala quits [Ping timeout: 265 seconds] |
05:29:03 | | katocala joins |
05:41:49 | | katocala is now authenticated as katocala |
05:46:44 | <mgrandi> | @pokechu22 thats good that at least we got something from this year, but i was meaning that the main page of the wiki apparently fails to render and i think they siad that the PHP version is out of date or something so i'm not sure we can get anything from the latest version |
05:47:08 | <pokechu22> | mgrandi: https://archive.org/details/wiki-wiki.xentax.com-20231008 is from today |
05:47:25 | <pokechu22> | it was done using https://wiki.xentax.com/api.php |
05:47:38 | <pokechu22> | err, http://wiki.xentax.com/api.php |
05:47:44 | <mgrandi> | huh, i guess if that api.php page works then i guess the wikibot tools still work, neat! |
05:48:06 | <pokechu22> | Well, kinda - I couldn't get it to export page history, only images, but we already got a separate page history dump so good enough |
05:48:09 | <mgrandi> | 14mb seems low, maybe most of the files are on the forum? |
05:49:06 | <pokechu22> | That sounds possible at least |
05:49:44 | | katocala quits [Ping timeout: 252 seconds] |
06:01:47 | | Arcorann (Arcorann) joins |
07:00:12 | | nfriedly quits [Remote host closed the connection] |
07:53:54 | | pabs quits [Client Quit] |
07:57:06 | | Wohlstand (Wohlstand) joins |
08:03:15 | | nic9 (nic) joins |
08:04:30 | | nic quits [Ping timeout: 265 seconds] |
08:04:30 | | nic9 is now known as nic |
08:16:48 | | pabs (pabs) joins |
09:00:08 | | railen63 joins |
09:25:23 | | icedice (icedice) joins |
09:39:02 | | railen64 joins |
09:41:10 | | railen63 quits [Ping timeout: 265 seconds] |
09:42:17 | | zu8899999 joins |
09:44:01 | | zu8899999 quits [Remote host closed the connection] |
10:00:01 | | railen64 quits [Remote host closed the connection] |
10:00:19 | | railen64 joins |
10:02:00 | | igloo22225 quits [Client Quit] |
10:02:26 | | igloo22225 (igloo22225) joins |
10:17:28 | | nfriedly joins |
10:54:20 | | BigBrain quits [Remote host closed the connection] |
10:55:11 | | BigBrain (bigbrain) joins |
12:07:01 | <audrooku|m> | thuban: re: bandcamp: band, album, and track ids are random 32 bit uints, if you want to get a list of tracks to grab I'd definitely suggest crawling the artists listed in the index |
12:14:36 | | BigBrain quits [Ping timeout: 245 seconds] |
12:15:09 | | BigBrain (bigbrain) joins |
12:30:19 | <@JAA> | Eh, what's 4.3 billion requests between friends? :-) |
12:41:11 | <kiryu> | Not sure where to ask this but do I try to archive a Cloudflared site with Selenium and Playwright? |
12:42:37 | <kiryu> | Or is that a very *tough* process? |
12:46:04 | <kiryu> | I found the origin IP but they seems to block every way of archivng (accessing it returns 302 to the cloudflared main domain) |
12:48:30 | <kiryu> | CDN links seems to be loaded only one time then it gets 403'd |
12:52:05 | <@JAA> | You could try something browser-based with warcprox, yeah. With the origin IP, perhaps you could also send the relevant headers so the origin thinks the request comes from Buttflare. But if it's implemented by a half-competent sysadmin, that shouldn't work. https://developers.cloudflare.com/fundamentals/reference/http-request-headers/ |
13:01:45 | | Peroniko quits [Ping timeout: 265 seconds] |
13:02:13 | | Peroniko (Peroniko) joins |
13:06:23 | | geezabiscuit leaves [The Lounge - https://thelounge.chat] |
13:14:57 | <audrooku|m> | JAA: I agree that 4.3BN isn't that bad, I've done nearly double that with soundcloud.. I just think crawling the artist index and WARCing all the pages would be useful for discovering the content |
13:15:43 | <@JAA> | audrooku|m: No disagreement there. At least it'd be a good first pass. |
13:22:32 | | Arcorann quits [Ping timeout: 265 seconds] |
13:34:42 | | eroc19903 is now known as eroc1990 |
13:39:29 | | etnguyen03 (etnguyen03) joins |
14:19:04 | <@arkiver> | pabs: ouch, thanks :/ |
14:19:21 | | katocala joins |
14:19:40 | | katocala is now authenticated as katocala |
14:21:59 | | AmAnd0A quits [Ping timeout: 265 seconds] |
14:22:11 | | AmAnd0A joins |
14:31:10 | | katocala quits [Ping timeout: 265 seconds] |
14:31:29 | | katocala joins |
14:31:45 | | katocala is now authenticated as katocala |
14:40:25 | | katocala quits [Read error: Connection reset by peer] |
14:42:29 | | katocala joins |
14:48:51 | | dumbgoy__ joins |
15:01:11 | | Island joins |
15:09:12 | | Overlordz quits [Client Quit] |
15:10:45 | | katocala quits [Read error: Connection reset by peer] |
15:11:38 | | katocala joins |
15:11:58 | | wrnines joins |
15:13:00 | <wrnines> | Hey, so I've never participated in Archive Team and have more just been admiring it from afar for a long while, but I figured I should pop into the IRC because that's what the FAQ says to do to let the team know about sites that are dying |
15:13:51 | <joepie91|m> | 👋 |
15:13:56 | <wrnines> | It just got announced today that the online writing/literature magazine/writing workshop site LitReactor is shutting its doors, and after December 31 2023 the site is going to be gone |
15:14:08 | <wrnines> | https://litreactor.com/news/litreactor-the-end-of-an-era |
15:18:51 | <wrnines> | I'm not sure if the site is small enough for the ArchiveBot since the site has been running since 2011, but I thought it was probably worth informing archive team about. I guess from here I should go to the archivebot IRC channel to let the folks there know about running it for LitReactor?? |
15:23:08 | | wrnines quits [Remote host closed the connection] |
15:30:22 | | katocala is now authenticated as katocala |
15:48:26 | | VerifiedJ quits [Remote host closed the connection] |
15:51:20 | | VerifiedJ (VerifiedJ) joins |
15:55:24 | | VerifiedJ quits [Client Quit] |
15:56:14 | | HP_Archivist quits [Ping timeout: 265 seconds] |
15:56:17 | | VerifiedJ (VerifiedJ) joins |
16:01:20 | | katocala quits [Remote host closed the connection] |
16:01:38 | | katocala joins |
16:06:17 | | katocala quits [Ping timeout: 252 seconds] |
16:06:20 | | Partition_of_bengal78 joins |
16:07:10 | | katocala joins |
16:07:11 | | VerifiedJ quits [Client Quit] |
16:07:34 | | Partition_of_bengal78 quits [Remote host closed the connection] |
16:07:37 | | VerifiedJ (VerifiedJ) joins |
16:11:35 | | petrichor (petrichor) joins |
16:12:01 | | dumbgoy__ quits [Remote host closed the connection] |
16:14:10 | | dumbgoy joins |
16:45:47 | | HP_Archivist (HP_Archivist) joins |
17:25:44 | | katocala is now authenticated as katocala |
17:46:23 | | etnguyen03 quits [Ping timeout: 252 seconds] |
17:46:55 | | Peroniko quits [Ping timeout: 265 seconds] |
17:47:19 | | Peroniko (Peroniko) joins |
18:00:41 | | wickedplayer494 quits [Ping timeout: 252 seconds] |
18:05:36 | | wickedplayer494 joins |
18:05:51 | | wickedplayer494 is now authenticated as wickedplayer494 |
18:25:35 | | HP_Archivist quits [Ping timeout: 265 seconds] |
18:50:42 | | Wohlstand quits [Client Quit] |
18:55:06 | | parfait (kdqep) joins |
19:17:22 | | HP_Archivist (HP_Archivist) joins |
19:19:43 | | wickedplayer494 quits [Ping timeout: 265 seconds] |
19:20:45 | | wickedplayer494 joins |
19:20:54 | | wickedplayer494 is now authenticated as wickedplayer494 |
19:22:13 | | razul quits [Remote host closed the connection] |
19:24:21 | | Megame (Megame) joins |
19:34:11 | | katocala quits [Ping timeout: 252 seconds] |
19:35:02 | | katocala joins |
19:45:15 | | razul joins |
19:55:34 | | razul quits [Remote host closed the connection] |
20:13:14 | | petrichor quits [Ping timeout: 252 seconds] |
20:15:59 | | razul joins |
20:21:46 | | Dalek quits [Quit: Dalek] |
20:23:41 | | Dalek (Dalek) joins |
20:25:32 | | Dalek quits [Client Quit] |
20:27:11 | | Dalek (Dalek) joins |
20:36:54 | | etnguyen03 (etnguyen03) joins |
20:41:47 | | Dalek quits [Client Quit] |
20:44:23 | | Dalek (Dalek) joins |
20:50:23 | | Ryz263 quits [Quit: Ping timeout (120 seconds)] |
20:50:36 | | Ryz263 (Ryz) joins |
21:00:26 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
21:00:33 | | ThetaDev joins |
21:08:58 | <@arkiver> | while we track the situation, let's make a bandcamp channel |
21:09:03 | <@arkiver> | any ideas for a channel name? |
21:10:45 | <kpcyrd> | #tapecamp |
21:13:09 | | AntiLiberal joins |
21:13:44 | | wickedplayer494 quits [Ping timeout: 252 seconds] |
21:14:57 | <that_lurker> | #concen.... nevermind |
21:15:16 | <@JAA> | lol, my brain just took the same turn. :-) |
21:15:30 | <project10> | #bandaid |
21:16:08 | <flashfire42> | #flute |
21:16:10 | <@JAA> | #bandgulag |
21:16:24 | | BlueMaxima joins |
21:16:25 | <flashfire42> | cause you know this one time. at bandcamp |
21:16:57 | <project10> | #bandcramp |
21:17:13 | <@JAA> | Hah, nice one. |
21:17:16 | <@arkiver> | bandcramp is a nice one |
21:17:17 | <@arkiver> | yeah |
21:17:23 | <@arkiver> | #bandcramp i guess :P |
21:17:51 | <that_lurker> | sounds good |
21:18:17 | <project10> | never been at the ground floor for a channel christening :P |
21:26:07 | | wickedplayer494 joins |
21:26:19 | | wickedplayer494 is now authenticated as wickedplayer494 |
21:37:23 | <FireFly> | "so that's how it's done huh" |
21:49:04 | | katocala quits [Ping timeout: 265 seconds] |
21:50:06 | | AmAnd0A quits [Read error: Connection reset by peer] |
21:50:19 | | AmAnd0A joins |
21:57:35 | <@HCross> | I believe that was a witnessing of democracy |
22:21:56 | <magmaus3> | yeah |
22:21:58 | <magmaus3> | :3 |
22:29:16 | | nic quits [Client Quit] |
22:29:36 | | nic (nic) joins |
22:36:33 | <@arkiver> | the Telegram project has been restarted in #telegrab |
22:37:03 | <audrooku|m> | good stuff :*) |
22:40:18 | | wickedplayer494 quits [Ping timeout: 265 seconds] |
22:53:12 | | systwi__ (systwi) joins |
22:53:21 | | AntiLiberal2 joins |
22:53:24 | | dumbgoy_ joins |
22:53:25 | | AlsoHP_Archivist joins |
22:53:46 | | Carnildo quits [Remote host closed the connection] |
22:53:46 | | AntiLiberal quits [Read error: Connection reset by peer] |
22:53:54 | | Carnildo joins |
22:54:12 | | Naruyoko joins |
22:55:29 | | systwi quits [Ping timeout: 252 seconds] |
22:56:02 | | HP_Archivist quits [Ping timeout: 252 seconds] |
22:56:02 | | dumbgoy quits [Ping timeout: 252 seconds] |
22:56:35 | | Naruyoko5 quits [Ping timeout: 252 seconds] |
23:00:14 | <h2ibot> | JAABot edited CurrentWarriorProject (-1): https://wiki.archiveteam.org/?diff=50958&oldid=50938 |
23:03:58 | | wickedplayer494 joins |
23:03:58 | | wickedplayer494 quits [Excess Flood] |
23:04:24 | | wickedplayer494 joins |
23:04:33 | | wickedplayer494 is now authenticated as wickedplayer494 |
23:10:04 | | AlsoHP_Archivist quits [Client Quit] |
23:10:24 | | HP_Archivist (HP_Archivist) joins |
23:30:21 | <h2ibot> | JustAnotherArchivist edited Bandcamp (-11, Add IRC channel): https://wiki.archiveteam.org/?diff=50959&oldid=50294 |
23:35:05 | | Matthww11 quits [Ping timeout: 252 seconds] |
23:36:00 | | Matthww11 joins |
23:47:41 | | ymgve joins |
23:48:17 | | ymgve_ quits [Ping timeout: 252 seconds] |
23:48:25 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |