00:01:25umgr036 joins
00:09:25<tomodachi94>I'm not really sure how to approach archiving this.
00:13:35<fishingforsoup>A bunch of the links are cited here.
00:13:36<fishingforsoup>https://justdance.fandom.com/wiki/Just_Dance_Now/Beta_Elements
00:19:09<@OrIdow6>Anyone else getting nothing but "Access denied" there?
00:19:52<tomodachi94>OrIdow6: Check the References section https://justdance.fandom.com/wiki/Just_Dance_Now/Beta_Elements#References
00:20:19<@OrIdow6>Oh
00:20:22wickedplayer494 quits [Ping timeout: 252 seconds]
00:20:37<@OrIdow6>A few questions then
00:20:47<@OrIdow6>How do you know it's going to shut down soon? How soon?
00:21:59<@OrIdow6>What exactly is the relationship of this site to the game? Is this a "resource" server? And importantly, how would we be able to discover all the URLs on it?
00:24:54<tomodachi94>> Is this a "resource" server?
00:24:54<tomodachi94>I think it's some sort of CDN. Grabbing an archive might be a good idea in case it ever dies.
00:25:00<tomodachi94> * > Is this a "resource" server?
00:25:00<tomodachi94>I think it's some sort of CDN. Grabbing an archive might be a good idea in case it ever dies.
00:25:11pabs quits [Client Quit]
00:26:46fishingforsoup_ joins
00:27:17pabs (pabs) joins
00:27:48wickedplayer494 joins
00:28:24<fishingforsoup_>1. I don't know when, it just seems like they are starting to forget about it.
00:28:48<fishingforsoup_>2. Yes, it's a resource server, and I don't know. There are people with the buckets but they hoard it.
00:28:54<fishingforsoup_>Well yeah, a CDN.
00:29:32fishingforsoup quits [Ping timeout: 252 seconds]
00:30:14<fishingforsoup_>The CDN has a bunch of content they never ended up using.
00:35:43<tomodachi94>Question: How do I get access to ArchiveBot's !a?
00:56:18wickedplayer494 quits [Ping timeout: 252 seconds]
00:56:47wickedplayer494 joins
01:06:30wickedplayer494 quits [Ping timeout: 265 seconds]
01:07:54wickedplayer494 joins
01:47:37<TheTechRobo>tomodachi94: ask and wait :-)
01:47:58<TheTechRobo>ideally tell people what you're planning on archiving too
02:13:03<@JAA>Well, the first step would be to join the #archivebot channel. But you won't get access to the bot immediately. Make suggestions for things to be archived there (or here if it gets lost in the bot noise), pay attention to the jobs (errors, rate limits, URLs to be ignored, etc.), and once it's clear that you know how it all works, you can get access.
02:37:43BlueMaxima quits [Read error: Connection reset by peer]
02:50:45<@JAA>I'm at 3.76M Issuu users, queue is shrinking rapidly. They claim on https://issuu.com/about that there are 1M users with content on the platform; that number seems to have been added to the page in mid-2022. I don't have this number from my crawl handy, but it can be extracted I think.
03:08:08<@JAA>I'm averaging over 250 req/s now with 60% of a i3-2130 core in use. Haha qwarc goes brrrrrrr...
03:17:33<TheTechRobo>I wonder how much slower it would get if you just parsed the HTML. Not do anything with it, just build a DOM.
03:17:53<TheTechRobo>Obviously a lot slower, but I'd be curious to see just how slow modern HTML parsers are.
03:37:32<@JAA>I'm retrieving and parsing JSON, FWIW, and there's still room for improvement there since I'm pretty sure it just uses Python's stdlib json module, which is implemented purely in Python.
03:37:54<@JAA>But I'm sure HTML would be slower, especially if you build an element tree, yeah.
03:41:41<@JAA>It finished at 3.80M users.
03:46:15tbc1887 (tbc1887) joins
03:54:17tbc1887 quits [Client Quit]
03:59:21Ketchup901 quits [Remote host closed the connection]
03:59:47Ketchup901 (Ketchup901) joins
04:43:31<@OrIdow6>fishingforsoup_: What do you mean, "There are people with the buckets but they hoard it"?
04:44:20<h2ibot>Tomodachi94 edited Formats (-36, /* External links */ Remove…): https://wiki.archiveteam.org/?diff=49459&oldid=48800
04:59:11<fishingforsoup_>People have access to all links, I don't know how, and they don't tell me.
04:59:44tbc1887 (tbc1887) joins
05:37:51tbc1887 quits [Read error: Connection reset by peer]
07:17:03tbc1887 (tbc1887) joins
07:39:05hitgrr8 joins
10:05:01umgr036 quits [Remote host closed the connection]
10:05:15umgr036 joins
10:16:13spirit quits [Client Quit]
10:21:13Minkafighter722 quits [Quit: The Lounge - https://thelounge.chat]
10:21:52Minkafighter722 joins
10:24:09Minkafighter722 quits [Client Quit]
10:26:05Minkafighter722 joins
10:26:53fishingforsoup__ joins
10:30:52fishingforsoup_ quits [Ping timeout: 252 seconds]
10:32:27tbc1887 quits [Read error: Connection reset by peer]
10:39:18jtagcat quits [Quit: Bye!]
10:42:30spirit joins
10:43:22jtagcat (jtagcat) joins
11:09:07Island quits [Read error: Connection reset by peer]
13:04:31benjins2__ joins
13:04:39jacksonchen666 (jacksonchen666) joins
13:05:42Arcorann quits [Ping timeout: 265 seconds]
13:06:40benjins2_ quits [Ping timeout: 265 seconds]
13:15:20jacksonchen666 quits [Client Quit]
13:58:34qwertyasdfuiopghjkl joins
14:45:16ukhardnhorny joins
14:45:25ukhardnhorny quits [Remote host closed the connection]
14:46:04fuzzy8021 quits [Ping timeout: 252 seconds]
14:52:13jacksonchen666 (jacksonchen666) joins
14:59:05fuzzy8021 (fuzzy8021) joins
15:00:21pie_ quits []
15:00:30pie_ joins
15:01:50lennier1 quits [Ping timeout: 252 seconds]
15:02:07lennier2 joins
15:02:07lennier2 is now known as lennier1
15:15:16jacksonchen666 quits [Client Quit]
15:15:30jacksonchen666 (jacksonchen666) joins
15:20:02jacksonchen666 quits [Client Quit]
15:23:16Ketchup901 quits [Remote host closed the connection]
15:23:44Ketchup901 (Ketchup901) joins
15:31:56pie_ quits [Client Quit]
15:32:27pie_ joins
15:38:46lennier2_ joins
15:40:31lennier1 quits [Ping timeout: 252 seconds]
15:41:19lennier1 joins
15:43:45lennier2_ quits [Ping timeout: 265 seconds]
15:55:02lennier2_ joins
15:57:01lennier1 quits [Ping timeout: 252 seconds]
15:57:07lennier2_ is now known as lennier1
16:04:23pie_ quits [Client Quit]
16:04:47lennier2 joins
16:05:08pie_ joins
16:07:28lennier1 quits [Ping timeout: 252 seconds]
16:08:45lennier2_ joins
16:08:46lennier2_ is now known as lennier1
16:10:02lennier2 quits [Ping timeout: 252 seconds]
16:27:58user_ joins
16:31:36umgr036 quits [Ping timeout: 265 seconds]
16:36:12jacksonchen666 (jacksonchen666) joins
16:43:50jacksonchen666 quits [Client Quit]
17:11:49luna quits [Ping timeout: 252 seconds]
17:11:55luckcolors quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
17:12:15luckcolors (luckcolors) joins
18:33:42wyatt8750 joins
18:35:20wyatt8740 quits [Ping timeout: 265 seconds]
18:51:39LeGoupil joins
19:53:50Ketchup901 quits [Ping timeout: 276 seconds]
19:54:43Ketchup901 (Ketchup901) joins
20:51:06LeGoupil quits [Client Quit]
20:57:38fishingforsoup__ quits [Client Quit]
20:57:53fishingforsoup joins
21:15:26BlueMaxima joins
21:28:47benjinsm joins
21:29:24benjins quits [Ping timeout: 252 seconds]
21:29:45benjinsm is now known as benjins
22:02:41tzt quits [Ping timeout: 265 seconds]
22:06:25Island joins
22:42:19fishingforsoup quits [Ping timeout: 265 seconds]
22:50:29balrog quits [Quit: Bye]
22:51:17Ketchup901 quits [Ping timeout: 276 seconds]
22:56:41balrog (balrog) joins
23:02:22Ketchup901 (Ketchup901) joins
23:18:03hitgrr8 quits [Client Quit]
23:22:08Ketchup901 quits [Remote host closed the connection]
23:22:31superkuh quits [Ping timeout: 252 seconds]
23:22:39Ketchup901 (Ketchup901) joins
23:22:48superkuh joins
23:47:04<tomodachi94>Does anyone know how to tag multilingual works in the Internet Archive?
23:49:16<pokechu22>I'm not aware of a way of doing it other than selecting "multiple languages" (which is one of the options in the language selector)
23:50:06<@JAA>The 'language' metadata field is repeatable, but maybe the web interface doesn't allow that?
23:50:12<@JAA>In any case, #internetarchive for questions about IA.