| 00:00:35 | | Megame quits [Client Quit] |
| 00:12:41 | <SketchTheCow> | FOS is down to 74% |
| 00:12:53 | <SketchTheCow> | Which means either we have a handle on it, or you're all disappointing failure. |
| 00:18:19 | <@JAA> | Disappointing failure it is. We paused uploads from some of the pipelines which are also stopping, both due to an unrelated issue. Don't worry, we'll start hammering FOS again before long. |
| 00:25:10 | | HP_Archivist (HP_Archivist) joins |
| 00:43:28 | <SketchTheCow> | I think only half of FOS's 13tb is free. |
| 00:43:37 | <SketchTheCow> | In theory, teamarchive1 is available for pipelines as well. |
| 00:54:16 | | AlsoHP_Archivist joins |
| 00:57:02 | | TheTechRobo joins |
| 00:57:35 | | HP_Archivist quits [Ping timeout: 244 seconds] |
| 01:00:15 | | AlsoHP_Archivist quits [Client Quit] |
| 01:10:05 | | bonga quits [Ping timeout: 265 seconds] |
| 01:10:29 | | bonga joins |
| 01:17:22 | | bonga quits [Read error: Connection reset by peer] |
| 01:18:09 | | bonga joins |
| 01:25:04 | | bonga quits [Ping timeout: 265 seconds] |
| 01:25:30 | | bonga joins |
| 01:35:50 | | bonga quits [Read error: Connection reset by peer] |
| 01:37:10 | | bonga joins |
| 02:02:46 | | dm4v quits [Ping timeout: 240 seconds] |
| 02:02:52 | | dm4v_ joins |
| 02:03:17 | | dm4v_ is now known as dm4v |
| 02:03:18 | | dm4v is now authenticated as dm4v |
| 02:03:18 | | dm4v quits [Changing host] |
| 02:03:18 | | dm4v (dm4v) joins |
| 02:03:37 | | wickedplayer494 quits [Ping timeout: 252 seconds] |
| 02:04:42 | | wickedplayer494 joins |
| 02:07:44 | | wickedplayer494 is now authenticated as wickedplayer494 |
| 02:18:38 | | tbc1887 (tbc1887) joins |
| 02:24:11 | | nicolas17 quits [Client Quit] |
| 02:24:28 | | nicolas17 joins |
| 02:31:22 | | tbc1887 quits [Read error: Connection reset by peer] |
| 03:04:10 | | dvd (dvd) joins |
| 03:09:52 | | march_happy quits [Remote host closed the connection] |
| 03:17:51 | | march_happy (march_happy) joins |
| 03:27:28 | | HP_Archivist (HP_Archivist) joins |
| 03:30:10 | | AlsoHP_Archivist joins |
| 03:33:37 | | HP_Archivist quits [Ping timeout: 244 seconds] |
| 03:33:51 | | tzt quits [Remote host closed the connection] |
| 03:34:14 | | tzt (tzt) joins |
| 03:42:52 | | jacobk joins |
| 03:51:54 | | AlsoHP_Archivist quits [Client Quit] |
| 03:52:17 | <AnotherIki> | Looks like https://nadiaeghbal.com is a wayback-excluded site not mentioned in https://wiki.archiveteam.org/index.php/List_of_websites_excluded_from_the_Wayback_Machine |
| 03:52:36 | <AnotherIki> | Oops, probably better for ot |
| 03:53:40 | <@JAA> | Sure is. That list is never going to be complete obviously, but please do add any you can find! :-) |
| 04:23:45 | | confusedMoose joins |
| 04:23:55 | <confusedMoose> | is this active? |
| 04:24:03 | <confusedMoose> | i have never used irc before |
| 04:24:21 | <Jake> | hi! |
| 04:24:24 | <confusedMoose> | hello! |
| 04:25:04 | <confusedMoose> | i have come to ask about an xml dump i got from the archive team wiki, i don't know how to make it a readable wiki. |
| 04:25:30 | <confusedMoose> | i tried to get an irc client working on linux, but i switched over onto windows because i could not get it to work |
| 04:27:01 | <confusedMoose> | i also came to better understand irc |
| 04:32:46 | <Jake> | Hmm, XML dump? Are you talking about one of our Wikiteam dumps? |
| 04:33:13 | | lennier1 quits [Ping timeout: 252 seconds] |
| 04:33:18 | <confusedMoose> | if i have an archived wiki, how do i access it? it's laid out like this: |
| 04:33:53 | <confusedMoose> | article lists, images, template-xml, wikix-src (all folders) |
| 04:33:59 | | lennier1 (lennier1) joins |
| 04:35:10 | <confusedMoose> | and then a 171 mb xml file |
| 04:35:36 | <Jake> | I believe the XML files can be reimported into any MediaWiki. https://www.mediawiki.org/wiki/Manual:Importing_XML_dumps and that's the main way those are intended to be consumed. |
| 04:36:15 | <confusedMoose> | I see. |
| 04:36:41 | <confusedMoose> | So I should just be able to download MediaWiki and then open the XML file? |
| 04:37:05 | <confusedMoose> | (sorry if this is really obvious, i still don't know alot about archiving) |
| 04:39:36 | <Jake> | You'd have to import it into their software, I believe the link I put has the correct instructions. The XML file should be viewable right now, it's just huge and not in a great format to view. |
| 04:40:47 | | nicolas17 quits [Ping timeout: 244 seconds] |
| 04:41:12 | <confusedMoose> | okay, i'm trying to install mediawiki right now |
| 04:41:42 | <confusedMoose> | it seems it wants me to set up alot of extra software |
| 04:42:14 | <confusedMoose> | i just want to browse the wiki offline |
| 04:42:43 | <@JAA> | There isn't a good way to do that, I think. These dumps are intended as backups to restore a fully functional wiki, basically. |
| 04:42:54 | <confusedMoose> | ohhhhh |
| 04:43:12 | <confusedMoose> | how do i browse an archived wiki offline then? |
| 04:56:14 | <confusedMoose> | i can open the xml file in a browser, but it's just very hard to read and navigate |
| 04:56:18 | <confusedMoose> | impossible* |
| 04:57:52 | | confusedMoose quits [Remote host closed the connection] |
| 06:18:56 | | dvd quits [Ping timeout: 265 seconds] |
| 06:59:12 | | pebbles quits [Remote host closed the connection] |
| 07:41:19 | | qwertyasdfuiopghjkl joins |
| 08:23:28 | | march_happy quits [Ping timeout: 244 seconds] |
| 08:23:43 | | march_happy (march_happy) joins |
| 09:29:10 | | LetMeByte joins |
| 09:29:36 | <LetMeByte> | someone said paywall recently 12ft.io |
| 09:29:42 | | LetMeByte quits [Client Quit] |
| 09:39:44 | <nirv> | I hate to bring it up again, but I just redownloaded the "Uppercase" Geocities Patched Archive torrent entirely in Linux (Deluge) this time. All the files are there but I'm getting errors already extracting. |
| 09:39:46 | <nirv> | https://cdn.discordapp.com/attachments/197420215363960832/948514640495792128/unknown.png |
| 09:40:02 | <nirv> | I think the geocities collection needs a re-do |
| 09:52:57 | | mutantmonkey quits [Ping timeout: 252 seconds] |
| 09:55:07 | | mutantmonkey (mutantmonkey) joins |
| 10:29:15 | | sec^nd quits [Ping timeout: 252 seconds] |
| 10:42:44 | <nirv> | extraction failed after 71GB out of the 350GB archive in windows with 7zip. trying with ubuntu: 7z -y x "*.001" -o/media/sf_geocities/uppercase > /media/sf_geocities/uppercase/uppercaselinux.log |
| 10:46:36 | <nirv> | I don't know what's up with these files but they have one job: extract. and they can't even do it. I'm about out of patience. I've realized linux is the way to go on these damn archives but I will lose all faith in windows if this actually extracts |
| 10:49:07 | <nirv> | https://cdn.discordapp.com/attachments/872443255281844236/948532042298179584/unknown.png |
| 10:51:39 | | Roman joins |
| 10:58:57 | <Roman> | Hello. I want to help by running a Warrior docker image, but I am confused by this line in FAQ: "No censorship. If you believe your country implements censorship, do not run a warrior." I am from Ukraine and, apparently, a considerable number of Russian online resources are blocked here. Does it preclude me from helping? I should be better suited to crawl local websites because I am located here. I do not find it reasonable to buy |
| 10:58:57 | <Roman> | a server outside Ukraine for this. |
| 11:00:21 | <nirv> | it seems a lot of countries censor the internet in some way. I can't say for sure, but I have australian friends that can't even view some sites I send them |
| 11:00:46 | <nirv> | I'm surprised ukraine even has electricity at this point |
| 11:28:21 | <nirv> | okay. here come the errors. nope. the geocities archive doesn't work. https://cdn.discordapp.com/attachments/197420215363960832/948542190081175582/unknown.png |
| 11:31:15 | | Roman_ joins |
| 11:34:04 | | Roman quits [Ping timeout: 265 seconds] |
| 11:34:04 | | Roman_ is now known as Roman |
| 11:34:13 | <nirv> | Roman_, btw isn't it a bad time to help archive when you have bombs dropping all around? |
| 11:41:33 | <Roman> | I am a lawyer. I know how important web archives are. I used them in my work. I am also a former long-term BOINC and YaCy contributor. I just hope to load a dedicated machine for this while I can and forget about it. |
| 11:51:52 | <@rewby> | I'm not an expert on the subject of what connections should or shouldn't be used. But I think it somewhat depends on what kind of censorship you think is happening and how sites are blocked. |
| 11:53:04 | <@rewby> | For example, we want to avoid situations where you have one of those corporate webfilter things that'll serve you a 200 or something with a fake page that just says "site was blocked by policy blah" |
| 12:09:06 | <Roman> | My former ISP used to put such explanation page. My current ISP just drops the connection. |
| 12:10:23 | <Roman> | The browser just states that the website could not be found. |
| 12:11:03 | <Roman> | Though, it apparently exists via Tor. |
| 12:32:24 | | LeGoupil joins |
| 12:44:00 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 12:54:43 | | march_happy quits [Ping timeout: 244 seconds] |
| 12:55:22 | | march_happy (march_happy) joins |
| 13:14:49 | <@jrwr> | We hit hacker news again https://news.ycombinator.com/item?id=30524842 |
| 13:30:21 | | wessel15129 joins |
| 13:30:22 | | wessel1512 quits [Read error: Connection reset by peer] |
| 13:30:22 | | wessel15129 is now known as wessel1512 |
| 13:37:36 | | Roman quits [Ping timeout: 244 seconds] |
| 13:49:29 | | march_happy quits [Ping timeout: 244 seconds] |
| 13:49:46 | | march_happy (march_happy) joins |
| 13:52:15 | <nirv> | ok I'm just going to ask flat out. 7zip obviously isn't working in ubuntu to extract the Geocities Patched archive. has anyone extracted that archive and if so, what program did you use? winrar? I'm pretty frustrated with this. I've been extracting files for at least 21 years. Never had an issue with an archive like this |
| 13:52:18 | | Arcorann quits [Ping timeout: 265 seconds] |
| 14:07:26 | | Roman joins |
| 14:07:46 | | jacobk quits [Ping timeout: 265 seconds] |
| 14:08:08 | | wessel1512 is now authenticated as wessel1512 |
| 14:10:44 | | onetruth quits [Read error: Connection reset by peer] |
| 14:12:56 | | kiska (kiska) joins |
| 14:21:35 | <nirv> | here's my windows 7zip log trying to extract geocities UPPERCASE .001 files: https://pastebin.com/13pp71bJ |
| 14:21:44 | <nirv> | same attempt for linux: https://pastebin.com/NW3G6MZg |
| 14:22:07 | <nirv> | troubleshooted everything I can think of. nothing works |
| 14:26:10 | | lennier1 quits [Ping timeout: 244 seconds] |
| 14:27:00 | | lennier1 (lennier1) joins |
| 14:57:29 | | sec^nd (second) joins |
| 15:00:27 | <h2ibot> | JAABot edited CurrentWarriorProject (-2): https://wiki.archiveteam.org/?diff=48320&oldid=48319 |
| 15:20:24 | | Arcorann (Arcorann) joins |
| 15:26:17 | | sec^nd quits [Remote host closed the connection] |
| 15:27:02 | | Arcorann quits [Ping timeout: 265 seconds] |
| 15:27:24 | | sec^nd (second) joins |
| 15:39:04 | | Roman leaves |
| 15:43:31 | | Arcorann (Arcorann) joins |
| 15:57:01 | | bonga quits [Remote host closed the connection] |
| 15:58:28 | | bonga joins |
| 16:04:44 | | bonga quits [Ping timeout: 265 seconds] |
| 16:04:59 | | bonga joins |
| 16:11:42 | | pabs quits [Read error: Connection reset by peer] |
| 16:15:44 | | michaelblob quits [Read error: Connection reset by peer] |
| 16:18:20 | | sixregrets joins |
| 16:20:26 | | sixregrets quits [Remote host closed the connection] |
| 16:26:29 | | Arcorann quits [Ping timeout: 265 seconds] |
| 16:31:20 | | bonga quits [Read error: Connection reset by peer] |
| 16:32:47 | | Arcorann (Arcorann) joins |
| 16:34:16 | | jacobk joins |
| 16:35:48 | | HP_Archivist (HP_Archivist) joins |
| 16:37:23 | | HP_Archivist quits [Client Quit] |
| 16:39:28 | | michaelblob (michaelblob) joins |
| 16:41:27 | | Megame (Megame) joins |
| 16:46:30 | | Hackerpcs quits [Quit: Hackerpcs] |
| 16:51:50 | | Hackerpcs (Hackerpcs) joins |
| 17:00:14 | | bonga joins |
| 17:05:47 | <nirv> | so the solution to my geocities archive seems to be this: https://blog.geocities.institute/archives/3209 |
| 17:06:54 | <nirv> | quite a number of files in the torrent are corrupt but they don't say that in any torrent client I've tried. so I need to use this bash script to CRC check and then auto replace with files from archive.org. great |
| 17:11:28 | | painbow joins |
| 17:13:19 | | nirv quits [Ping timeout: 252 seconds] |
| 17:17:14 | | sepro joins |
| 17:21:57 | <@OrIdow6> | So thinking about doing something on Duolingo |
| 17:22:01 | <@OrIdow6> | Forums |
| 17:25:11 | | painbow is now known as nirv |
| 17:35:59 | | user_ (gazorpazorp) joins |
| 17:36:52 | | gazorpazorp quits [Read error: Connection reset by peer] |
| 17:38:30 | | sepro quits [Ping timeout: 265 seconds] |
| 17:39:04 | | sepro joins |
| 17:45:05 | | sepro quits [Ping timeout: 244 seconds] |
| 17:49:08 | | Arcorann quits [Ping timeout: 265 seconds] |
| 17:59:45 | | Megame quits [Client Quit] |
| 18:44:28 | | sepro joins |
| 18:47:53 | <VerifiedJ> | https://twitter.com/Bandcamp/status/1499068917947510788 Bandcamp is joining Epic Games |
| 18:50:56 | | LeGoupil quits [Client Quit] |
| 18:54:52 | | lennier1 quits [Ping timeout: 265 seconds] |
| 18:56:10 | | lennier2 joins |
| 18:59:31 | | lennier2 is now known as lennier1 |
| 19:36:46 | <@JAA> | SketchTheCow: FYI, there is more data coming FOS's way again now. |
| 19:41:11 | | @rewby may be an insane person who knows how to automate infrastructure a bit too well |
| 20:12:20 | | protondonor quits [Ping timeout: 244 seconds] |
| 20:17:25 | | geezabiscuit joins |
| 20:20:35 | | geezabiscuit quits [Remote host closed the connection] |
| 20:30:46 | | mutantmonkey quits [Remote host closed the connection] |
| 20:31:54 | | mutantmonkey (mutantmonkey) joins |
| 20:36:06 | | jacobk quits [Ping timeout: 244 seconds] |
| 20:42:28 | | jacobk joins |
| 20:44:12 | | geezabiscuit joins |
| 20:48:39 | | geezabiscuit is now authenticated as geezabiscuit |
| 20:49:49 | | geezabiscuit quits [Changing host] |
| 20:49:49 | | geezabiscuit (geezabiscuit) joins |
| 20:53:55 | <masterX244> | is there a way to tell archive.org to not derive from some files? Got a bunch of firmware.imgs that i pulled off a server and it tries to derive them as ISO files even though they are a custom format of the vendor |
| 20:55:08 | <@OrIdow6> | I think the --no-derive option on the ia tool worked as of last time I used it |
| 21:08:38 | <masterX244> | damnit... wish i knew that on the initial upload. First "derive" took more than 24 hours since it wasted its time on 5,8k files of a few MB each. (pulled all older versions of the server, too which are not visible unless you deeplink em, the updater of the manufacturer only downloads the one that it got in its index file. Some versions there are newer than the latest in the index, too (upload without setting it as "latest). found those by |
| 21:08:38 | <masterX244> | guessing around a bit |
| 21:08:43 | <masterX244> | ) |
| 21:50:19 | | jacobk quits [Ping timeout: 265 seconds] |
| 22:08:23 | <systwi> | Do we have any ongoing archival projects covering Bandcamp? |
| 22:09:49 | <@OrIdow6> | No |
| 22:11:15 | <@OrIdow6> | I don't think there have been any content removals scheduled yet? |
| 22:12:55 | <@arkiver> | we need to look into what will change exactly |
| 22:13:52 | <@OrIdow6> | The blog post linked in the Twitter post claims "Bandcamp will keep operating as a standalone marketplace... The products and services... aren’t going anywhere" |
| 22:14:21 | <systwi> | Hmmm, I hope they keep their word. |
| 22:14:30 | <@OrIdow6> | In the long term, I wouldn't count on it |
| 22:14:51 | | BlueMaxima joins |
| 22:16:39 | <@OrIdow6> | But in the short term it doesn't sound too bad to me |
| 22:17:20 | <systwi> | I agree. |
| 22:20:55 | | jacobk joins |
| 22:21:41 | <Jake> | They also bought ArtStation which I believe has been running fine? |
| 22:24:34 | | fe joins |
| 22:35:45 | | Arcorann (Arcorann) joins |
| 22:37:13 | | fe quits [Remote host closed the connection] |
| 23:02:21 | | bonga quits [Read error: Connection reset by peer] |
| 23:06:02 | | bonga joins |
| 23:07:09 | <@arkiver> | Jake: when was that bought? |
| 23:07:11 | | bonga quits [Read error: Connection reset by peer] |
| 23:07:23 | | bonga joins |
| 23:07:42 | <Jake> | Almost a year now. https://www.epicgames.com/site/en-US/news/artstation-is-now-part-of-epic-games |
| 23:19:22 | | qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds] |
| 23:31:15 | | march_happy quits [Ping timeout: 244 seconds] |
| 23:32:54 | | march_happy (march_happy) joins |
| 23:53:53 | | qwertyasdfuiopghjkl joins |
| 23:57:03 | | pabs (pabs) joins |