00:07:23 | | threedeeitguy quits [Ping timeout: 252 seconds] |
00:46:04 | | threedeeitguy (threedeeitguy) joins |
00:46:57 | | Exorcism quits [Client Quit] |
00:47:06 | | Exorcism (exorcism) joins |
00:53:42 | | qwertyasdfuiopghjkl quits [Client Quit] |
02:13:09 | | Sir_Bedivere joins |
02:13:20 | | Bedivere quits [Ping timeout: 252 seconds] |
04:56:30 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
06:16:45 | | rktk quits [Ping timeout: 258 seconds] |
06:27:52 | | rktk (rktk) joins |
07:21:26 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
07:21:37 | | Exorcism1 (exorcism) joins |
07:23:54 | | Exorcism1 quits [Client Quit] |
07:24:14 | | Exorcism|phone quits [Ping timeout: 244 seconds] |
07:28:37 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
07:31:24 | | Exorcism|phone (exorcism) joins |
07:49:04 | | Exorcism|phone quits [Client Quit] |
07:49:24 | | Exorcism|phone (exorcism) joins |
08:36:00 | | albertlarsan68 (AlbertLarsan68) joins |
08:42:03 | | Exorcism|phone quits [Remote host closed the connection] |
09:31:31 | | qwertyasdfuiopghjkl quits [Client Quit] |
10:22:24 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
11:46:46 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
12:53:22 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
12:54:16 | | eroc1990 (eroc1990) joins |
13:06:22 | | eroc1990 quits [Remote host closed the connection] |
15:33:53 | <rktk> | I know there was some kind of link to find archive downloads for wiki pages (MediaWiki etc.). What's the special link? |
15:33:57 | <rktk> | it was literally a Special: link |
15:35:45 | <rktk> | https://www.avid.wiki/ here's a wiki that may be of interest to archive, distribution and film/movie images and logos |
15:35:52 | <rktk> | and company history often not found on Wikipedia or elsewhere |
15:36:24 | <rktk> | can't find the similar Special or Meta link for Miraheze |
15:38:02 | <rktk> | Special:DataDump , ah but limited to Admins |
15:38:31 | <rktk> | https://wiki.multimedia.cx another one of interest to dump |
16:01:41 | <Exorcism> | rktk you can use a mediawiki bot to dump : #wikibot |
16:03:02 | <rktk> | ah I need + |
16:06:19 | | pabs quits [Ping timeout: 258 seconds] |
16:07:01 | <Exorcism> | Yeah, you can still request a dump and we'll do it |
16:17:25 | | pabs (pabs) joins |
16:22:20 | <rktk> | Exorcism, sorry not to muck up bot channel so much, is this the correct repo? https://github.com/yzqzss/wikiteam3 |
16:22:32 | | pabs quits [Ping timeout: 252 seconds] |
16:22:35 | <rktk> | This branch is 1 commit ahead, 63 commits behind mediawiki-client-tools:python3. :thinking: |
16:24:50 | <Exorcism> | watch in #wikibot, i send you the url |
16:24:59 | <rktk> | Thank you! |
16:25:00 | <rktk> | :) |
16:25:03 | <Exorcism> | yw! |
16:29:15 | <rktk> | For avid.wiki, miraheze, "In attempt 1, XML for "Main_Page" is wrong. Waiting 20 seconds and reloading..." |
16:29:19 | <rktk> | odd... |
16:30:45 | <pokechu22> | I think I did avid at one point, but I don't remember if I did anything special |
16:30:59 | <pokechu22> | oh wait |
16:31:04 | <rktk> | --xml --xmlapiexport --curonly --images |
16:31:08 | <pokechu22> | right, that's the one with way too many images |
16:31:08 | <rktk> | seems to be fine. im OK with current pages |
16:31:13 | <rktk> | yeah about 50000 |
16:31:16 | <rktk> | https://www.avid.wiki/Special:MediaStatistics |
16:31:25 | <rktk> | given it's a site for media company logos, not surprised :) |
16:31:35 | <pokechu22> | https://archive.org/download/wiki-avidmirahezeorg_w |
16:31:41 | <pokechu22> | 40GB in images |
16:31:47 | <pokechu22> | probably more now |
16:32:20 | <pokechu22> | I was able to do things just fine with Special:Export back then (--xml --images on python2 tools) |
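For readers unfamiliar with Special:Export, the sketch below shows roughly what a request for the current revision of one page can look like. It is only an illustration and not how the wikiteam tools are confirmed to build their requests. The `index.php` location (`wiki.example.org`) and the helper name `special_export_url` are hypothetical; `pages` and `curonly` are assumed to be the standard MediaWiki Special:Export parameters.

```python
from urllib.parse import urlencode


def special_export_url(index_php: str, title: str, current_only: bool = True) -> str:
    """Build a Special:Export URL for a single page (illustrative helper).

    index_php is the wiki's index.php endpoint; the value used below is a
    placeholder, not the real endpoint of any wiki mentioned in the log.
    """
    params = {"title": "Special:Export", "pages": title}
    if current_only:
        # Assumed standard parameter: ask for the latest revision only.
        params["curonly"] = "1"
    return f"{index_php}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical endpoint, for illustration only.
    print(special_export_url("https://wiki.example.org/index.php", "Main_Page"))
```

Fetching that URL on a wiki that allows exports should return an XML document for the page; full-history exports may need different parameters or a POST request.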
16:33:14 | <rktk> | pokechu22, yeah about 50GB ish total but thank you I grabbed that :) |
16:33:58 | <rktk> | I'll leave it curonly at the moment just to dump the images |
16:37:22 | | Iki1 quits [Ping timeout: 258 seconds] |
16:40:22 | | pabs (pabs) joins |
16:41:14 | <rktk> | just wondering, are wikiteam or wikiteam3 tools tied to any sort of api or remote connection by archiveteam members? Or could it be run entirely on its own? |
16:41:24 | <rktk> | as in, connecting to get names or something? |
16:42:00 | <Exorcism> | it could be run entirely on its own :) |
16:46:03 | <rktk> | ok :) |
16:46:38 | <rktk> | pokechu22, "110961 page titles loaded" avid.wiki 2023-08-01 ... 0_0 |
16:46:44 | <rktk> | good lord |
16:47:28 | <pokechu22> | Yeah, you just plug in a wiki's main page (or --index <index.php> --api <api.php>, in some cases skipping one of those) and it chugs away and hopefully doesn't explode |
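The invocation described above can be sketched as a small helper that assembles the argument list. This is a rough illustration only: the executable name `wikiteam3dumpgenerator` is an assumption not stated in the log, and the helper `build_dump_command` is hypothetical. The only flags used are those quoted in this conversation (`--xml`, `--curonly`, `--images`, `--index`, `--api`).

```python
import shlex
from typing import List, Optional


def build_dump_command(
    wiki_url: Optional[str] = None,
    index: Optional[str] = None,
    api: Optional[str] = None,
    current_only: bool = True,
    images: bool = True,
    executable: str = "wikiteam3dumpgenerator",  # assumed name, not from the log
) -> List[str]:
    """Assemble an argv list from the flags mentioned in the chat."""
    if not (wiki_url or index or api):
        raise ValueError("give the wiki's main page URL, or --index and/or --api")
    cmd = [executable]
    if wiki_url:
        cmd.append(wiki_url)
    if index:
        cmd += ["--index", index]
    if api:
        cmd += ["--api", api]
    cmd.append("--xml")
    if current_only:
        cmd.append("--curonly")  # current revisions only
    if images:
        cmd.append("--images")
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so nothing is downloaded.
    print(shlex.join(build_dump_command(wiki_url="https://www.avid.wiki/")))
```

The helper only builds and prints the command; passing the list to `subprocess.run` would start a real dump, which for a wiki with tens of gigabytes of images needs plenty of disk space.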
16:47:48 | <pokechu22> | There is a list-of-wikis system but I don't know what exactly that does |
16:48:22 | <pokechu22> | and I did https://archive.org/details/wiki-wikimultimediacx_202301 too in the past |
16:50:39 | <pokechu22> | I also did https://archive.org/details/wiki-rationalwikiorg_w - and that ended up with 90GB of XML (and dying due to no disk space and then having to modify the tools to properly resume because the wiki was weird). Hopefully most of the ones you try are not jank like that :) |
17:04:55 | <rktk> | oh good pokechu22 :) i guess not just me that uses multimedia cx often |
17:23:59 | <rktk> | pokechu22, have you by chance done a backup of https://prepaid-data-sim-card.fandom.com/ recently? |
17:24:19 | <rktk> | https://archive.org/details/wiki-prepaid_data_sim_cardfandomcom this one is from 2020 |
17:24:54 | <pokechu22> | I don't think I have |
17:25:45 | <pokechu22> | I guess that's one that probably could be done via #wikibot? |
17:25:56 | <rktk> | Yup |
21:52:56 | | BigBrain quits [Ping timeout: 245 seconds] |
21:59:45 | | BigBrain (bigbrain) joins |
22:04:02 | | imer quits [Quit: Oh no] |
22:06:33 | | imer (imer) joins |