01:52:46 | <CoreParadox> | yzqzss|m you rule! |
02:13:14 | | Megame quits [Ping timeout: 258 seconds] |
02:52:49 | | CoreParadox quits [Remote host closed the connection] |
03:05:29 | | pabs quits [Ping timeout: 252 seconds] |
03:12:23 | | pabs (pabs) joins |
03:30:30 | | Megame (Megame) joins |
04:06:27 | | TastyWiener95 quits [Client Quit] |
04:21:29 | | TastyWiener95 (TastyWiener95) joins |
04:26:42 | <pokechu22> | pabs: seems like all of the ccc wikis are static/archived already (including https://events.ccc.de/camp/2007/wiki which you didn't mention) |
04:27:58 | <pabs> | ah ok |
04:29:01 | <pabs> | conference websites are another thing that tends to go bust at some point; we probably need a concerted effort to save them regularly |
04:29:07 | <pabs> | so many conferences though |
04:34:08 | <fireonlive> | (fun fact: we're on an IRC network run by ccc.de :) |
05:12:08 | | Sir_Bedivere quits [Read error: Connection reset by peer] |
05:14:03 | | pabs quits [Ping timeout: 265 seconds] |
05:15:48 | | pabs (pabs) joins |
06:24:21 | | hitgrr8 joins |
07:42:52 | | Megame quits [Client Quit] |
07:44:41 | <yzqzss|m> | Can someone point me to some IA items containing MediaWiki wikidumps with millions or even tens of millions of media files? I would like to test whether *.us.archive.org/view_archive.php?archive={wikidump.7z} can list all the files. |
07:45:40 | <yzqzss|m> | If it can, maybe we could use it to do incremental image archiving. 🫠 |
08:32:13 | | pabs quits [Ping timeout: 265 seconds] |
08:35:13 | | pabs (pabs) joins |
09:48:41 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
09:58:46 | | Craigle (Craigle) joins |
11:53:53 | | Megame (Megame) joins |
14:29:54 | | andrew quits [Client Quit] |
14:30:56 | | andrew (andrew) joins |
15:34:42 | | Megame quits [Client Quit] |
16:20:12 | | Craigle quits [Remote host closed the connection] |
16:23:12 | | Craigle (Craigle) joins |
16:33:31 | <pokechu22> | If your goal is to extract a list of images, you only need to extract the images.txt file |
16:34:56 | <pokechu22> | Here's one with ~400k media files (so ~800k directory entries): https://ia801607.us.archive.org/view_archive.php?archive=/17/items/wiki-tcrfnet-20230322/tcrfnet-20230322-wikidump.7z |
16:35:25 | <pokechu22> | note that last I checked view_archive.php interacts poorly with solid archives though (although extracting individual items from them is fine). This one's not solid but some others are. |
16:35:51 | <pokechu22> | (I haven't actually waited for it to load completely there since it's taking a while, but it does seem to be trying to list everything) |
17:54:22 | | Bedivere joins |
20:47:01 | | Craigle quits [Read error: Connection reset by peer] |
20:58:23 | | Craigle (Craigle) joins |
21:08:03 | | hitgrr8 quits [Client Quit] |
21:28:47 | | pabs quits [Ping timeout: 252 seconds] |
21:29:23 | | pabs (pabs) joins |
21:30:25 | | nulldata quits [Client Quit] |
21:31:10 | | nulldata joins |
21:38:11 | | nulldata is now authenticated as nulldata |
22:01:14 | | pabs quits [Ping timeout: 252 seconds] |
22:04:04 | | pabs (pabs) joins |
22:10:46 | | pabs quits [Ping timeout: 258 seconds] |
22:11:37 | | pabs (pabs) joins |
22:32:14 | | pabs quits [Ping timeout: 258 seconds] |
22:32:51 | | pabs (pabs) joins |