04:27:16 | | Bedivere quits [Ping timeout: 240 seconds] |
05:41:54 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
05:42:23 | | qwertyasdfuiopghjkl joins |
06:23:43 | | michaelblob (michaelblob) joins |
06:27:18 | | michaelblob_ quits [Ping timeout: 255 seconds] |
08:14:17 | | michaelblob_ (michaelblob) joins |
08:18:04 | | michaelblob quits [Ping timeout: 240 seconds] |
09:22:52 | | tech_exorcist (tech_exorcist) joins |
11:25:08 | | tech_exorcist_ (tech_exorcist) joins |
11:27:40 | | tech_exorcist quits [Remote host closed the connection] |
11:34:22 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
11:46:06 | | qwertyasdfuiopghjkl joins |
11:53:14 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
11:54:00 | | qwertyasdfuiopghjkl93 joins |
11:54:25 | | qwertyasdfuiopghjkl93 is now known as qwertyasdfuiopghjkl |
12:03:49 | | qwertyasdfuiopghjkl quits [Client Quit] |
12:04:36 | | qwertyasdfuiopghjkl joins |
14:31:30 | | Iki quits [Ping timeout: 255 seconds] |
14:33:21 | | Nemo_bis (Nemo_bis) joins |
15:32:40 | | Bedivere joins |
15:44:24 | | HackMii quits [Ping timeout: 255 seconds] |
15:46:37 | | HackMii (hacktheplanet) joins |
15:56:34 | | tech_exorcist_ quits [Client Quit] |
15:59:44 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
16:04:26 | | tech_exorcist (tech_exorcist) joins |
19:29:18 | <pokechu22> | Not sure if it's possible to fix these, but I messed up the identifier for https://archive.org/details/familysearchorg_de_wiki-20221021-wikidump.7z (I wanted wiki-familysearch.org_de_wiki) and the collection for https://archive.org/details/wiki-familysearch.org_it_wiki (I wanted community texts, but it's going to be moved to wikicollections eventually so that probably doesn't |
19:29:20 | <pokechu22> | matter) |
19:32:37 | <Nemo_bis> | pokechu22: yes it will probably get moved |
19:32:51 | <@JAA> | Changing the identifier isn't possible, but you could move the file to a new item with the correct identifier. |
19:33:41 | <pokechu22> | Might as well since it's only about 35MB so deleting+recreating shouldn't be a problem (unlike https://archive.org/details/wiki-familysearch.org_en_wiki which is a 22GB monster that fortunately I got everything right on) |
19:34:06 | <Nemo_bis> | neither is in our standard format though, so it doesn't really matter |
19:34:14 | <Nemo_bis> | I've been asked about wikitravel.org dumps |
19:35:31 | <pokechu22> | Didn't that mostly get merged into wikivoyage? I know there was something about that in the past (but probably like a decade ago now?) |
19:36:14 | <pokechu22> | And, eh, I'll just leave the bad identifier alone - it includes the pertinent information more or less |
20:32:42 | | michaelblob (michaelblob) joins |
20:36:04 | | michaelblob_ quits [Ping timeout: 240 seconds] |
20:43:14 | <pokechu22> | If you *do* want to scrape wikitravel.org, https://wikitravel.org/en/Wikitravel:Terms_of_use#Spiders says that you need to do some other things or you'll get blocked |
20:45:12 | <pokechu22> | (though the talk page says those are less enforced) |
20:50:43 | | tech_exorcist quits [Client Quit] |
21:32:08 | <@JAA> | Someone edited the wiki page on Miraheze claiming that 'Miraheze backups all their public wikis every few months and publishes an XML dump of all of them on archive.org' but helpfully forgot to include a link/identifier. I assume they're referring to https://archive.org/details/@reception123 but that seems incomplete (several items are 'all but top 20 wikis' etc.) and quite irregular. Does anyone know more? |
21:34:44 | <@JAA> | Looks like the items from April 2021 might cover all wikis, but yeah, 'everything gets dumped every few months' seems too bold a claim to me. |
23:24:13 | | michaelblob_ (michaelblob) joins |
23:28:21 | | michaelblob quits [Ping timeout: 255 seconds] |