00:00:35Megame quits [Client Quit]
00:12:41<SketchTheCow>FOS is down to 74%
00:12:53<SketchTheCow>Which means either we have a handle on it, or you're all disappointing failures.
00:18:19<@JAA>Disappointing failure it is. We paused uploads from some of the pipelines which are also stopping, both due to an unrelated issue. Don't worry, we'll start hammering FOS again before long.
00:25:10HP_Archivist (HP_Archivist) joins
00:43:28<SketchTheCow>I think only half of FOS's 13 TB is free.
00:43:37<SketchTheCow>In theory, teamarchive1 is available for pipelines as well.
00:54:16AlsoHP_Archivist joins
00:57:02TheTechRobo joins
00:57:35HP_Archivist quits [Ping timeout: 244 seconds]
01:00:15AlsoHP_Archivist quits [Client Quit]
01:10:05bonga quits [Ping timeout: 265 seconds]
01:10:29bonga joins
01:17:22bonga quits [Read error: Connection reset by peer]
01:18:09bonga joins
01:25:04bonga quits [Ping timeout: 265 seconds]
01:25:30bonga joins
01:35:50bonga quits [Read error: Connection reset by peer]
01:37:10bonga joins
02:02:46dm4v quits [Ping timeout: 240 seconds]
02:02:52dm4v_ joins
02:03:17dm4v_ is now known as dm4v
02:03:18dm4v quits [Changing host]
02:03:18dm4v (dm4v) joins
02:03:37wickedplayer494 quits [Ping timeout: 252 seconds]
02:04:42wickedplayer494 joins
02:18:38tbc1887 (tbc1887) joins
02:24:11nicolas17 quits [Client Quit]
02:24:28nicolas17 joins
02:31:22tbc1887 quits [Read error: Connection reset by peer]
03:04:10dvd (dvd) joins
03:09:52march_happy quits [Remote host closed the connection]
03:17:51march_happy (march_happy) joins
03:27:28HP_Archivist (HP_Archivist) joins
03:30:10AlsoHP_Archivist joins
03:33:37HP_Archivist quits [Ping timeout: 244 seconds]
03:33:51tzt quits [Remote host closed the connection]
03:34:14tzt (tzt) joins
03:42:52jacobk joins
03:51:54AlsoHP_Archivist quits [Client Quit]
03:52:17<AnotherIki>Looks like https://nadiaeghbal.com is a wayback-excluded site not mentioned in https://wiki.archiveteam.org/index.php/List_of_websites_excluded_from_the_Wayback_Machine
03:52:36<AnotherIki>Oops, probably better for ot
03:53:40<@JAA>Sure is. That list is never going to be complete obviously, but please do add any you can find! :-)
04:23:45confusedMoose joins
04:23:55<confusedMoose>is this active?
04:24:03<confusedMoose>i have never used irc before
04:24:21<Jake>hi!
04:24:24<confusedMoose>hello!
04:25:04<confusedMoose>i have come to ask about an xml dump i got from the archive team wiki, i don't know how to make it a readable wiki.
04:25:30<confusedMoose>i tried to get an irc client working on linux, but i switched over onto windows because i could not get it to work
04:27:01<confusedMoose>i also came to better understand irc
04:32:46<Jake>Hmm, XML dump? Are you talking about one of our Wikiteam dumps?
04:33:13lennier1 quits [Ping timeout: 252 seconds]
04:33:18<confusedMoose>if i have an archived wiki, how do i access it? it's laid out like this:
04:33:53<confusedMoose>article lists, images, template-xml, wikix-src (all folders)
04:33:59lennier1 (lennier1) joins
04:35:10<confusedMoose>and then a 171 mb xml file
04:35:36<Jake>I believe the XML files can be reimported into any MediaWiki. https://www.mediawiki.org/wiki/Manual:Importing_XML_dumps and that's the main way those are intended to be consumed.
04:36:15<confusedMoose>I see.
04:36:41<confusedMoose>So I should just be able to download MediaWiki and then open the XML file?
04:37:05<confusedMoose>(sorry if this is really obvious, i still don't know a lot about archiving)
04:39:36<Jake>You'd have to import it into their software, I believe the link I put has the correct instructions. The XML file should be viewable right now, it's just huge and not in a great format to view.
04:40:47nicolas17 quits [Ping timeout: 244 seconds]
04:41:12<confusedMoose>okay, i'm trying to install mediawiki right now
04:41:42<confusedMoose>it seems it wants me to set up a lot of extra software
04:42:14<confusedMoose>i just want to browse the wiki offline
04:42:43<@JAA>There isn't a good way to do that, I think. These dumps are intended as backups to restore a fully functional wiki, basically.
04:42:54<confusedMoose>ohhhhh
04:43:12<confusedMoose>how do i browse an archived wiki offline then?
04:56:14<confusedMoose>i can open the xml file in a browser, but it's just very hard to read and navigate
04:56:18<confusedMoose>impossible*
04:57:52confusedMoose quits [Remote host closed the connection]
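For skimming a dump like the one described above without installing MediaWiki, a minimal Python sketch follows. It assumes the file is a standard MediaWiki XML export with `<page>`, `<title>` and `<text>` elements; `iter_pages`, `local_name` and the `SAMPLE` data are hypothetical names made up for illustration, not part of any WikiTeam or MediaWiki tool. It only lists titles and raw wikitext; it does not render pages, templates or images.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each <page> in a MediaWiki XML export.

    `source` is a filename or a binary file object. iterparse streams the
    file, so a dump of a few hundred MB does not have to fit in memory.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title, text = None, ""
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                # With full-history dumps this keeps the last revision listed.
                text = child.text or ""
        yield title, text
        elem.clear()  # release the finished page's children


# Tiny made-up export used only to exercise the function.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Main Page</title>
    <revision><text>Hello '''world'''</text></revision>
  </page>
  <page>
    <title>Empty</title>
    <revision><text /></revision>
  </page>
</mediawiki>"""

if __name__ == "__main__":
    for title, text in iter_pages(io.BytesIO(SAMPLE)):
        print(title, len(text))
```

Passing the path of the real XML file instead of the `io.BytesIO(SAMPLE)` object should work the same way, as long as the dump follows the export schema assumed here.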
06:18:56dvd quits [Ping timeout: 265 seconds]
06:59:12pebbles quits [Remote host closed the connection]
07:41:19qwertyasdfuiopghjkl joins
08:23:28march_happy quits [Ping timeout: 244 seconds]
08:23:43march_happy (march_happy) joins
09:29:10LetMeByte joins
09:29:36<LetMeByte>someone said paywall recently 12ft.io
09:29:42LetMeByte quits [Client Quit]
09:39:44<nirv>I hate to bring it up again, but I just redownloaded the "Uppercase" Geocities Patched Archive torrent entirely in Linux (Deluge) this time. All the files are there but I'm getting errors already extracting.
09:39:46<nirv>https://cdn.discordapp.com/attachments/197420215363960832/948514640495792128/unknown.png
09:40:02<nirv>I think the geocities collection needs a re-do
09:52:57mutantmonkey quits [Ping timeout: 252 seconds]
09:55:07mutantmonkey (mutantmonkey) joins
10:29:15sec^nd quits [Ping timeout: 252 seconds]
10:42:44<nirv>extraction failed after 71GB out of the 350GB archive in windows with 7zip. trying with ubuntu: 7z -y x "*.001" -o/media/sf_geocities/uppercase > /media/sf_geocities/uppercase/uppercaselinux.log
10:46:36<nirv>I don't know what's up with these files but they have one job: extract. and they can't even do it. I'm about out of patience. I've realized linux is the way to go on these damn archives but I will lose all faith in windows if this actually extracts
10:49:07<nirv>https://cdn.discordapp.com/attachments/872443255281844236/948532042298179584/unknown.png
10:51:39Roman joins
10:58:57<Roman>Hello. I want to help by running a Warrior docker image, but I am confused by this line in FAQ: "No censorship. If you believe your country implements censorship, do not run a warrior." I am from Ukraine and, apparently, a considerable number of Russian online resources are blocked here. Does it preclude me from helping? I should be better suited to crawl local websites because I am located here. I do not find it reasonable to buy a server outside Ukraine for this.
11:00:21<nirv>it seems a lot of countries censor the internet in some way. I can't say for sure, but I have australian friends that can't even view some sites I send them
11:00:46<nirv>I'm surprised ukraine even has electricity at this point
11:28:21<nirv>okay. here come the errors. nope. the geocities archive doesn't work. https://cdn.discordapp.com/attachments/197420215363960832/948542190081175582/unknown.png
11:31:15Roman_ joins
11:34:04Roman quits [Ping timeout: 265 seconds]
11:34:04Roman_ is now known as Roman
11:34:13<nirv>Roman_, btw isn't it a bad time to help archive when you have bombs dropping all around?
11:41:33<Roman>I am a lawyer. I know how important web archives are. I used them in my work. I am also a former long-term BOINC and YaCy contributor. I just hope to load a dedicated machine for this while I can and forget about it.
11:51:52<@rewby>I'm not an expert on the subject of what connections should or shouldn't be used. But I think it somewhat depends on what kind of censorship you think is happening and how sites are blocked.
11:53:04<@rewby>For example, we want to avoid situations where you have one of those corporate webfilter things that'll serve you a 200 or something with a fake page that just says "site was blocked by policy blah"
12:09:06<Roman>My former ISP used to show such an explanation page. My current ISP just drops the connection.
12:10:23<Roman>The browser just states that the website could not be found.
12:11:03<Roman>Though, it apparently exists via Tor.
12:32:24LeGoupil joins
12:44:00BlueMaxima quits [Read error: Connection reset by peer]
12:54:43march_happy quits [Ping timeout: 244 seconds]
12:55:22march_happy (march_happy) joins
13:14:49<@jrwr>We hit hacker news again https://news.ycombinator.com/item?id=30524842
13:30:21wessel15129 joins
13:30:22wessel1512 quits [Read error: Connection reset by peer]
13:30:22wessel15129 is now known as wessel1512
13:37:36Roman quits [Ping timeout: 244 seconds]
13:49:29march_happy quits [Ping timeout: 244 seconds]
13:49:46march_happy (march_happy) joins
13:52:15<nirv>ok I'm just going to ask flat out. 7zip obviously isn't working in ubuntu to extract the Geocities Patched archive. has anyone extracted that archive and if so, what program did you use? winrar? I'm pretty frustrated with this. I've been extracting files for at least 21 years. Never had an issue with an archive like this
13:52:18Arcorann quits [Ping timeout: 265 seconds]
14:07:26Roman joins
14:07:46jacobk quits [Ping timeout: 265 seconds]
14:10:44onetruth quits [Read error: Connection reset by peer]
14:12:56kiska (kiska) joins
14:21:35<nirv>here's my windows 7zip log trying to extract geocities UPPERCASE .001 files: https://pastebin.com/13pp71bJ
14:21:44<nirv>same attempt for linux: https://pastebin.com/NW3G6MZg
14:22:07<nirv>troubleshot everything I can think of. nothing works
14:26:10lennier1 quits [Ping timeout: 244 seconds]
14:27:00lennier1 (lennier1) joins
14:57:29sec^nd (second) joins
15:00:27<h2ibot>JAABot edited CurrentWarriorProject (-2): https://wiki.archiveteam.org/?diff=48320&oldid=48319
15:20:24Arcorann (Arcorann) joins
15:26:17sec^nd quits [Remote host closed the connection]
15:27:02Arcorann quits [Ping timeout: 265 seconds]
15:27:24sec^nd (second) joins
15:39:04Roman leaves
15:43:31Arcorann (Arcorann) joins
15:57:01bonga quits [Remote host closed the connection]
15:58:28bonga joins
16:04:44bonga quits [Ping timeout: 265 seconds]
16:04:59bonga joins
16:11:42pabs quits [Read error: Connection reset by peer]
16:15:44michaelblob quits [Read error: Connection reset by peer]
16:18:20sixregrets joins
16:20:26sixregrets quits [Remote host closed the connection]
16:26:29Arcorann quits [Ping timeout: 265 seconds]
16:31:20bonga quits [Read error: Connection reset by peer]
16:32:47Arcorann (Arcorann) joins
16:34:16jacobk joins
16:35:48HP_Archivist (HP_Archivist) joins
16:37:23HP_Archivist quits [Client Quit]
16:39:28michaelblob (michaelblob) joins
16:41:27Megame (Megame) joins
16:46:30Hackerpcs quits [Quit: Hackerpcs]
16:51:50Hackerpcs (Hackerpcs) joins
17:00:14bonga joins
17:05:47<nirv>so the solution to my geocities archive seems to be this: https://blog.geocities.institute/archives/3209
17:06:54<nirv>quite a number of files in the torrent are corrupt but they don't say that in any torrent client I've tried. so I need to use this bash script to CRC check and then auto replace with files from archive.org. great
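The CRC-check step described above can be sketched in Python as follows. This is not the bash script from the linked blog post; it is a minimal illustration that assumes a manifest of expected CRC-32 values is available from somewhere. `crc32_of`, `find_bad_files` and the manifest layout are hypothetical, and fetching replacement files from archive.org is left out.

```python
import os
import zlib


def crc32_of(path, chunk_size=1 << 20):
    """Return the CRC-32 of a file as 8 lowercase hex digits, read in chunks."""
    crc = 0
    with open(path, "rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                break
            crc = zlib.crc32(chunk, crc)
    return format(crc & 0xFFFFFFFF, "08x")


def find_bad_files(directory, expected):
    """Return sorted names whose CRC-32 is wrong or whose file is missing.

    `expected` maps file name -> expected CRC-32 hex string (a hypothetical
    manifest; where the real values come from is outside this sketch).
    """
    bad = []
    for name, want in expected.items():
        path = os.path.join(directory, name)
        if not os.path.isfile(path) or crc32_of(path) != want.lower():
            bad.append(name)
    return sorted(bad)
```

Reading in chunks keeps memory use flat even for multi-gigabyte archive parts. The names returned by `find_bad_files` would be the ones to re-download before retrying the extraction.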
17:11:28painbow joins
17:13:19nirv quits [Ping timeout: 252 seconds]
17:17:14sepro joins
17:21:57<@OrIdow6>So thinking about doing something on Duolingo
17:22:01<@OrIdow6>Forums
17:25:11painbow is now known as nirv
17:35:59user_ (gazorpazorp) joins
17:36:52gazorpazorp quits [Read error: Connection reset by peer]
17:38:30sepro quits [Ping timeout: 265 seconds]
17:39:04sepro joins
17:45:05sepro quits [Ping timeout: 244 seconds]
17:49:08Arcorann quits [Ping timeout: 265 seconds]
17:59:45Megame quits [Client Quit]
18:44:28sepro joins
18:47:53<VerifiedJ>https://twitter.com/Bandcamp/status/1499068917947510788 Bandcamp is joining Epic Games
18:50:56LeGoupil quits [Client Quit]
18:54:52lennier1 quits [Ping timeout: 265 seconds]
18:56:10lennier2 joins
18:59:31lennier2 is now known as lennier1
19:36:46<@JAA>SketchTheCow: FYI, there is more data coming FOS's way again now.
19:41:11@rewby may be an insane person who knows how to automate infrastructure a bit too well
20:12:20protondonor quits [Ping timeout: 244 seconds]
20:17:25geezabiscuit joins
20:20:35geezabiscuit quits [Remote host closed the connection]
20:30:46mutantmonkey quits [Remote host closed the connection]
20:31:54mutantmonkey (mutantmonkey) joins
20:36:06jacobk quits [Ping timeout: 244 seconds]
20:42:28jacobk joins
20:44:12geezabiscuit joins
20:49:49geezabiscuit quits [Changing host]
20:49:49geezabiscuit (geezabiscuit) joins
20:53:55<masterX244>is there a way to tell archive.org to not derive from some files? Got a bunch of firmware.imgs that i pulled off a server and it tries to derive them as ISO files even though they are a custom format of the vendor
20:55:08<@OrIdow6>I think the --no-derive option on the ia tool worked as of last time I used it
21:08:38<masterX244>damnit... wish i knew that on the initial upload. First "derive" took more than 24 hours since it wasted its time on 5.8k files of a few MB each. (pulled all older versions of the server, too, which are not visible unless you deeplink em; the updater of the manufacturer only downloads the one that it got in its index file. Some versions there are newer than the latest in the index, too (upload without setting it as "latest"). found those by guessing around a bit)
21:50:19jacobk quits [Ping timeout: 265 seconds]
22:08:23<systwi>Do we have any ongoing archival projects covering Bandcamp?
22:09:49<@OrIdow6>No
22:11:15<@OrIdow6>I don't think there have been any content removals scheduled yet?
22:12:55<@arkiver>we need to look into what will change exactly
22:13:52<@OrIdow6>The blog post linked in the Twitter post claims "Bandcamp will keep operating as a standalone marketplace... The products and services... aren’t going anywhere"
22:14:21<systwi>Hmmm, I hope they keep their word.
22:14:30<@OrIdow6>In the long term, I wouldn't count on it
22:14:51BlueMaxima joins
22:16:39<@OrIdow6>But in the short term it doesn't sound too bad to me
22:17:20<systwi>I agree.
22:20:55jacobk joins
22:21:41<Jake>They also bought ArtStation which I believe has been running fine?
22:24:34fe joins
22:35:45Arcorann (Arcorann) joins
22:37:13fe quits [Remote host closed the connection]
23:02:21bonga quits [Read error: Connection reset by peer]
23:06:02bonga joins
23:07:09<@arkiver>Jake: when was that bought?
23:07:11bonga quits [Read error: Connection reset by peer]
23:07:23bonga joins
23:07:42<Jake>Almost a year now. https://www.epicgames.com/site/en-US/news/artstation-is-now-part-of-epic-games
23:19:22qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
23:31:15march_happy quits [Ping timeout: 244 seconds]
23:32:54march_happy (march_happy) joins
23:53:53qwertyasdfuiopghjkl joins
23:57:03pabs (pabs) joins