| 00:38:20 | | immibis quits [Read error: Connection reset by peer] |
| 00:38:30 | | immibis joins |
| 00:42:31 | | jacobk joins |
| 00:49:56 | | lennier1 quits [Client Quit] |
| 00:51:48 | | qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds] |
| 00:54:11 | | lennier1 (lennier1) joins |
| 00:58:53 | | mutantmonkey quits [Ping timeout: 252 seconds] |
| 01:40:26 | | audy quits [Ping timeout: 240 seconds] |
| 01:40:34 | | audy joins |
| 01:49:06 | | immibis_ joins |
| 01:51:50 | | immibis quits [Ping timeout: 265 seconds] |
| 02:03:06 | | dm4v quits [Ping timeout: 244 seconds] |
| 02:05:00 | | dm4v joins |
| 02:05:03 | | dm4v is now authenticated as dm4v |
| 02:05:03 | | dm4v quits [Changing host] |
| 02:05:03 | | dm4v (dm4v) joins |
| 02:24:17 | | tbc1887 (tbc1887) joins |
| 02:48:15 | | tbc1887 quits [Read error: Connection reset by peer] |
| 02:57:37 | | benjins quits [Read error: Connection reset by peer] |
| 02:58:43 | | benjins joins |
| 03:02:01 | | AnotherIki quits [Read error: Connection reset by peer] |
| 03:02:02 | | mrfooooo quits [Quit: Ping timeout (120 seconds)] |
| 03:02:17 | | mrfooooo joins |
| 03:02:32 | | s-crypt6 (s-crypt) joins |
| 03:02:49 | | sec^nd quits [Ping timeout: 252 seconds] |
| 03:03:38 | | T31M_ joins |
| 03:05:03 | | T31M quits [Quit: ZNC - https://znc.in] |
| 03:05:03 | | T31M_ is now known as T31M |
| 03:05:11 | | VerifiedJ9 (VerifiedJ) joins |
| 03:05:38 | | katocala quits [Read error: Connection reset by peer] |
| 03:05:38 | | VerifiedJ quits [Quit: Ping timeout (120 seconds)] |
| 03:05:38 | | s-crypt quits [Ping timeout: 252 seconds] |
| 03:05:38 | | VerifiedJ9 is now known as VerifiedJ |
| 03:05:38 | | s-crypt6 is now known as s-crypt |
| 03:05:55 | | Iki joins |
| 03:05:57 | | katocala joins |
| 03:06:20 | | endrift|ZNC joins |
| 03:06:39 | | endrift quits [Read error: Connection reset by peer] |
| 03:06:40 | | katocala is now authenticated as katocala |
| 03:07:26 | | sec^nd (second) joins |
| 03:23:49 | | lennier2 joins |
| 03:24:09 | | lennier1 quits [Ping timeout: 265 seconds] |
| 03:24:19 | | lennier2 is now known as lennier1 |
| 03:29:06 | | benjins is now authenticated as benjins |
| 03:30:21 | | march_happy quits [Remote host closed the connection] |
| 03:34:41 | | march_happy (march_happy) joins |
| 03:38:42 | | jacobk_ joins |
| 03:38:44 | | jacobk_ quits [Client Quit] |
| 03:59:26 | | Eighty quits [Ping timeout: 265 seconds] |
| 04:05:08 | | Eighty (Eighty) joins |
| 04:07:04 | | dvd (dvd) joins |
| 04:25:53 | | dvd quits [Read error: Connection reset by peer] |
| 04:40:10 | | hackbug quits [Remote host closed the connection] |
| 04:41:50 | | hackbug (hackbug) joins |
| 05:02:18 | | sonick quits [Client Quit] |
| 05:58:49 | | Eighty quits [Ping timeout: 265 seconds] |
| 06:05:47 | <nirv> | I've spent weeks on this archive. could someone just give me the answer as to how to extract the 7z geocities files? I already extracted the .001 files. it's not letting me extract these. https://cdn.discordapp.com/attachments/872443255281844236/949185672613330984/unknown.png |
| 06:06:02 | <nirv> | I'm using: 7zz -mmt=18 -y x "*.7z" -o/media/nirv/linux2tb > /media/nirv/linux2tb/uppercase02.log |
| 06:06:17 | <nirv> | Extracting archive: geocities-A-a.7z WARNING: geocities-A-a.7z Cannot open the file as [7z] archive The file is open as [tar] archive |
| 06:08:37 | <nirv> | this is the log showing all of the 7z files extracted fine from the .001 archive. https://pastebin.com/8cX5gZHP |
| 06:09:12 | <nirv> | I'm at a loss of what the hell to do to extract these 459GB set of 7z files now. |
| 06:17:11 | | katocala quits [Ping timeout: 265 seconds] |
| 06:20:12 | | katocala joins |
| 06:24:32 | | katocala quits [Ping timeout: 244 seconds] |
| 06:31:35 | <nirv> | I'm thinking it's a problem with the file names? I see the .7z extension on some of the files, but none of the other files have any extension on them. |
| 06:31:38 | <nirv> | I wonder if there is a way to show expected file names of an archive, so I can rename them to that. |
| 06:31:42 | <nirv> | but it begs the question: why in the hell were they named wrong to begin with? (if this is the case) |
| 06:34:34 | | Eighty (Eighty) joins |
| 06:37:27 | | nicolas17 quits [Ping timeout: 244 seconds] |
| 06:39:03 | | sonick (sonick) joins |
| 06:51:01 | | lukash7 quits [Ping timeout: 265 seconds] |
| 06:58:16 | | Eighty quits [Ping timeout: 265 seconds] |
| 07:13:57 | | Eighty (Eighty) joins |
| 07:33:03 | | nico_32 quits [Ping timeout: 252 seconds] |
| 07:55:17 | | nico_32 (nico) joins |
| 07:57:51 | | qwertyasdfuiopghjkl joins |
| 07:58:12 | | Eighty quits [Ping timeout: 265 seconds] |
| 08:10:18 | <nirv> | if I can find the solution to this shit I'm making a video on youtube. nobody should have to suffer like I have to sift through the geocities archive |
| 08:10:49 | <nirv> | tards are people too. and we should be able to extract the geocities archive and sift through the material |
| 08:27:29 | <appledash> | I recall when I downloaded it like 5 years ago I had absolutely no issues |
| 08:27:46 | <nirv> | do you recall how you extracted these files? https://cdn.discordapp.com/attachments/872443255281844236/949185672613330984/unknown.png |
| 08:28:08 | <nirv> | because either I'm a tard, or the files are screwed up. most likely I'm of the tard |
| 08:28:59 | <appledash> | I'm just gonna download the torrent again and see what I can make of it |
| 08:29:11 | <appledash> | You are using https://thepiratebay.org/description.php?id=6353395 correct? |
| 08:29:14 | <appledash> | That is the one I recall using |
| 08:29:26 | <nirv> | the files from the torrent are .001 which extracted fine for me. 100%. but the 7z contained inside refuse to |
| 08:29:33 | <nirv> | yes, that's the one |
| 08:30:14 | <appledash> | I think something is screwed up with your download. |
| 08:30:36 | <nirv> | I did a CRC check even |
| 08:31:10 | <appledash> | All the files I am seeing in the torrent are .7z.xxx where xxx is 3 numbers starting at 001 and ending at however many parts each file has. |
| 08:31:40 | <appledash> | So if you have foo.7z.001 foo.7z.002 foo.7z.003 that's a 3-part 7z archive and doing 7z x foo.7z.001 will extract all files contained across those 3 parts |
| 08:32:02 | <appledash> | I'm not seeing any files that are just .7z and none with no extensions |
| 08:32:40 | <nirv> | I didn't trust the torrent, so what I did was ran a script that CRC checked the files and would do this if a file failed CRC: https://cdn.discordapp.com/attachments/872443255281844236/949222669734248468/unknown.png |
| 08:33:02 | <nirv> | so it created a file you could run to grab the corrupt/missing files from archive.org as you can see |
| 08:33:12 | <nirv> | so that's what I did. now all of the torrent files pass CRC. |
| 08:33:21 | <nirv> | Great. I extracted those no problem. |
| 08:33:31 | <nirv> | The files extracted though are more .7z files and THOSE won't extract |
| 08:33:35 | <appledash> | Ahhh! |
| 08:33:46 | <appledash> | Alright, I will have to wait for the download to complete |
| 08:33:54 | <appledash> | ETA for me is unfortunately 20 hours so I may be able to help you tomorrow |
| 08:34:06 | <appledash> | OK, it just dropped to 11 hours :p |
| 08:34:09 | <nirv> | if you downloaded the torrent and it passed CRC out of the gate, you'd still have the same files I ended up with. okay |
| 08:34:54 | <nirv> | i appreciate that man. I'm loading up deluge to help seed |
| 08:35:40 | <appledash> | Thanks! |
| 08:36:35 | <nirv> | I came across this weird extract script but I have no idea how to analyize it. I don't know enough. https://gist.github.com/gjtorikian/29fbc764d6221363bf4aadd4cf59c22e |
| 08:37:53 | <appledash> | Hmm! |
| 08:37:56 | <appledash> | I think I understand |
| 08:38:17 | <appledash> | To me it implies that 7z has mis-named the second-level of files; I think you should try to extract one of the ".7z" files with tar -xzvf <file> |
| 08:38:22 | <appledash> | And see how that works out |
| 08:38:25 | <nirv> | that's what I'm thinking!! |
| 08:38:43 | <nirv> | okay! |
| 08:38:46 | | froschgrosch_ quits [Ping timeout: 240 seconds] |
| 08:38:50 | <nirv> | the last thing i tried before I got pissed was |
| 08:38:51 | <nirv> | tar -xvf geocities-A-a.7z |
| 08:39:03 | <nirv> | but I'll try yours |
| 08:40:01 | | froschgrosch_ joins |
| 08:40:18 | <nirv> | irv@nirv-VirtualBox:/media/nirv/linuxssd/extracted$ tar -xzvf geocities-A-a.7z gzip: stdin: not in gzip format tar: Child returned status 1 tar: Error is not recoverable: exiting now |
| 08:40:28 | <appledash> | Wtf |
| 08:40:38 | <appledash> | What does `file geocities-A-a.7z` say? |
| 08:40:51 | <nirv> | so just file geocities-A-a.7z from terminal? |
| 08:41:00 | <nirv> | geocities-A-a.7z: POSIX tar archive (GNU) |
| 08:41:17 | <appledash> | Yeah |
| 08:41:30 | <appledash> | Hm, try without the z in the tar command? |
| 08:41:32 | <appledash> | That SHOULD work |
| 08:41:39 | <nirv> | the readme.txt says it's a GNU tar archive so that's why I thought tar command would work |
| 08:41:48 | <nirv> | I did try without the z before but let me try again |
| 08:42:37 | <nirv> | https://cdn.discordapp.com/attachments/872443255281844236/949225311143362650/unknown.png |
| 08:42:45 | <nirv> | it only extracts 17K worth of files |
| 08:42:57 | <appledash> | What the heck! |
| 08:43:00 | <nirv> | which is less than the .7z alone |
| 08:43:04 | <nirv> | I KNOW |
| 08:43:09 | <appledash> | Alright, I'll probably need to have a look at the files when I finish downloading them |
| 08:43:12 | <nirv> | all right man |
| 08:43:23 | <nirv> | good to know I exhausted all I can though :P |
| 08:43:26 | <appledash> | :p |
| 08:43:55 | <nirv> | so going to make a youtube video so nobody suffers like me |
| 08:45:05 | <appledash> | Sharing knowledge is excellent but I often end up finding that I'm just so pleased I finally solved the problem that I want to get on with what I was doing instead of taking a break to do something like that |
| 08:45:07 | <appledash> | But I hope you do |
| 09:15:57 | | Ruthalas quits [Read error: Connection reset by peer] |
| 09:16:27 | | Ruthalas (Ruthalas) joins |
| 09:31:54 | | driib quits [Client Quit] |
| 09:32:07 | | driib (driib) joins |
| 10:14:27 | | march_happy quits [Ping timeout: 244 seconds] |
| 10:14:35 | | march_happy (march_happy) joins |
| 10:31:30 | | march_happy quits [Ping timeout: 244 seconds] |
| 10:31:51 | | march_happy (march_happy) joins |
| 10:38:14 | | march_happy quits [Read error: Connection reset by peer] |
| 10:39:00 | | march_happy (march_happy) joins |
| 10:43:44 | | immibis joins |
| 10:45:55 | | immibis_ quits [Ping timeout: 265 seconds] |
| 11:03:59 | | Eighty (Eighty) joins |
| 11:22:51 | | BlueMaxima quits [Client Quit] |
| 11:57:16 | | march_happy quits [Ping timeout: 244 seconds] |
| 11:57:36 | | march_happy (march_happy) joins |
| 11:57:47 | | Eighty quits [Ping timeout: 244 seconds] |
| 12:16:23 | <nirv> | appledash, could you let me know if the torrent rars are even good? I think they're corrupt out of the gate |
| 12:21:33 | | march_happy quits [Ping timeout: 244 seconds] |
| 12:22:23 | | march_happy (march_happy) joins |
| 12:32:51 | <appledash> | I shall! |
| 12:33:01 | <appledash> | Seem to be at 32% right now |
| 12:37:17 | <nirv> | this is the script I used to create .sh files of files that were missing or failed crc from the torrent. https://blog.geocities.institute/archives/3209 |
| 12:59:01 | | katocala joins |
| 12:59:47 | | katocala is now authenticated as katocala |
| 13:12:30 | | Megame (Megame) joins |
| 13:44:45 | | Arcorann quits [Ping timeout: 265 seconds] |
| 13:58:35 | | onetruth joins |
| 14:38:07 | | Megame quits [Client Quit] |
| 14:55:34 | | Niklink joins |
| 15:02:33 | <Niklink> | how would one go about running a big batch of URLs into the wayback machine? say something around 4-5k |
| 15:14:20 | <Iki> | #archivebot can handle that, I think. You'll need to upload your list to some site |
| 15:14:34 | <Iki> | There is also: https://archive.org/services/wayback-gsheets/ |
| 15:14:37 | <Iki> | Once IA is back online |
| 15:14:59 | <Iki> | But I think it is not preferable to run thousands of URLs through that, even though you can-- archivebot is better for that purpose |
| 15:15:00 | <Iki> | Niklink |
| 15:15:54 | <Iki> | FYI a lot of archiving tool here are paused because IA is temporarily down |
| 15:16:21 | <Niklink> | yeah that's why I came here to ask |
| 15:17:16 | <Niklink> | #archivebot says urgent jobs only though, mine aren't urgent really |
| 15:24:36 | | vukky (Vukky) joins |
| 15:40:04 | | vukky quits [Client Quit] |
| 15:44:36 | | march_happy quits [Ping timeout: 244 seconds] |
| 15:45:30 | | march_happy (march_happy) joins |
| 15:50:52 | | dvd (dvd) joins |
| 15:50:53 | | dvd_ (dvd) joins |
| 15:54:05 | | dvd quits [Client Quit] |
| 15:54:12 | | dvd_ quits [Client Quit] |
| 15:54:24 | | dvd (dvd) joins |
| 16:02:38 | <nirv> | this has been taking hours to do. I'm creating a 5TB VDI file just for linux so I can do all I need to do to fully extract this crap. c:\Program Files\Oracle\VirtualBox>VBoxManage createmedium disk --filename "H:\linux5tb.vdi" --size 5145729 --format VDI --variant Fixed |
| 16:02:38 | <nirv> | 0%...10%...20%...30%...40%...50%...60%...70%... |
| 16:03:24 | <nirv> | there's probably a better way to do this but I'm pretty dumb with linux and virtualbox |
| 16:07:49 | | bonga quits [Ping timeout: 265 seconds] |
| 16:12:12 | | bonga joins |
| 16:13:52 | | Eighty (Eighty) joins |
| 16:14:12 | <@JAA> | Niklink: What kind of content? |
| 16:20:48 | | immibis quits [Remote host closed the connection] |
| 16:20:54 | | immibis joins |
| 16:30:20 | | LeGoupil joins |
| 16:42:17 | | nicolas17 joins |
| 16:52:51 | | daxxy_ (daxxy) joins |
| 16:55:52 | | Megame (Megame) joins |
| 16:56:26 | | daxxy quits [Ping timeout: 240 seconds] |
| 16:58:29 | | Eighty quits [Ping timeout: 244 seconds] |
| 16:59:34 | <Niklink> | JAA: bandcamp album and track pages |
| 17:02:22 | <Niklink> | many of these pages have per-track art and attribution data that's not included in the album download proper, and I'm afraid some of these artists might get it in their head to take their music down in protest of the epic acquisition |
| 17:04:41 | | march_happy quits [Ping timeout: 244 seconds] |
| 17:04:58 | | march_happy (march_happy) joins |
| 17:11:24 | | LeGoupil quits [Ping timeout: 244 seconds] |
| 17:13:04 | | LeGoupil joins |
| 17:13:28 | | march_happy quits [Ping timeout: 244 seconds] |
| 17:22:31 | <@JAA> | Niklink: I see. I wouldn't mind running that through AB, but not right now as we're trying to grab as much of Ukraine as possible. There should be room on the weekend though. If you give me a list, I can throw it in then. Note that it won't grab the audio. |
| 17:32:12 | <Niklink> | JAA: I'm aware, dw & thanks. should have the list ready by the end of today but it's not high prio |
| 17:39:11 | | Niklink71 joins |
| 17:39:49 | | Niklink quits [Ping timeout: 244 seconds] |
| 17:43:09 | | Niklink71 is now known as Niklink |
| 17:48:36 | | Niklink quits [Ping timeout: 244 seconds] |
| 17:49:27 | | lennier1 quits [Client Quit] |
| 17:51:16 | | lennier1 (lennier1) joins |
| 17:58:56 | | bonga quits [Ping timeout: 244 seconds] |
| 18:02:38 | | Niklink joins |
| 18:04:35 | | Eighty (Eighty) joins |
| 18:05:34 | | T31M is now authenticated as T31M |
| 18:11:38 | | bonga joins |
| 18:25:00 | | insane_alien_ quits [Remote host closed the connection] |
| 18:33:01 | | insane_alien (insane_alien) joins |
| 18:50:42 | | insane_alien_ (insane_alien) joins |
| 18:53:36 | | insane_alien quits [Ping timeout: 265 seconds] |
| 19:10:11 | | vukky (Vukky) joins |
| 19:27:03 | <SketchTheCow> | Is there anything that needs my attention? |
| 19:40:11 | <tech234a> | FYI Reddit is blocking .ru link submissions https://old.reddit.com/r/ModSupport/comments/t66l5f/reddit_blocked_all_domains_under_russian_cctld_ru/ |
| 19:49:14 | | immibis_ joins |
| 19:52:05 | | immibis quits [Ping timeout: 265 seconds] |
| 20:05:21 | <wessel1512> | this is bad |
| 20:08:52 | <wessel1512> | this is just limiting the freedom of speech |
| 20:10:04 | <@JAA> | lol no, but also this isn't the place for that discussion. |
| 20:10:43 | <wessel1512> | if someone wanted to share misinformation they could just use another tld for that |
| 20:11:17 | <anarcat> | holy crap |
| 20:11:17 | <anarcat> | https://www.washingtonpost.com/technology/2022/03/04/russia-ukraine-internet-cogent-cutoff/ |
| 20:11:27 | <anarcat> | sorry, wrong channel |
| 20:11:51 | <@JAA> | Nah, that's fine, this is relevant to archival. :-) |
| 20:12:04 | <@JAA> | Although I propose we keep everything Ukraine/Russia-related in #ucryne. |
| 20:12:05 | <anarcat> | heh |
| 20:12:07 | <anarcat> | yeah |
| 20:14:06 | | Stiletto quits [Ping timeout: 240 seconds] |
| 20:14:58 | | Stiletto joins |
| 20:34:41 | | ThreeHM quits [Quit: WeeChat 3.3] |
| 20:48:14 | | ThreeHM (ThreeHeadedMonkey) joins |
| 20:51:32 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 20:58:00 | | qwertyasdfuiopghjkl joins |
| 21:13:07 | | ThreeHM_ (ThreeHeadedMonkey) joins |
| 21:15:42 | | ThreeHM quits [Ping timeout: 265 seconds] |
| 21:16:27 | <@JAA> | SketchTheCow: Are you sure the FOS AB uploads are working? Don't see anything uploaded to IA since the power outage and free disk space is shrinking a bit more rapidly than I'd like. |
| 21:32:18 | | sonick quits [Client Quit] |
| 21:38:55 | | LeGoupil quits [Client Quit] |
| 21:58:41 | | sonick (sonick) joins |
| 22:05:02 | | BlueMaxima joins |
| 22:14:31 | | march_happy (march_happy) joins |
| 22:36:10 | <@OrIdow6> | GOing to try doing some estimates for radikal.ru |
| 22:40:00 | | bonga quits [Ping timeout: 244 seconds] |
| 22:42:28 | | bonga joins |
| 22:48:22 | <@arkiver> | OrIdow6: are you getting 301s to itself as well? |
| 22:48:27 | <@arkiver> | for radikal.ru |
| 22:49:04 | <@OrIdow6> | arkiver: I can access the site fine |
| 22:49:26 | <@arkiver> | hmm |
| 22:49:49 | <@OrIdow6> | Clean Firefox profile on Debian unstable, US IP address |
| 22:50:25 | <@arkiver> | those aid's seem sequential somewhat |
| 22:50:43 | <@arkiver> | that's over 6 billion |
| 22:51:40 | <@OrIdow6> | The site claims 800m IIRC and that's what the IdLong field of the API data seems to show |
| 22:51:51 | <@OrIdow6> | DOing the redirect thing for me now |
| 22:52:12 | <@arkiver> | yeah site seems unstable (those redirects) |
| 23:03:57 | <SketchTheCow> | Tried again |
| 23:07:15 | | Cherri-ArcticCircleSystem joins |
| 23:07:38 | <Cherri-ArcticCircleSystem> | C: Oh hey, is there an easy way to transfer things from OneDrive to archive.org? |
| 23:08:51 | <Cherri-ArcticCircleSystem> | C: Because I was watching VOMS stuff and found out that one of the former members got fired and her videos were privated. So I tried looking up information about it and apparently someone on 4chan made a 400+ GB archive of all of her videos. |
| 23:08:53 | <Cherri-ArcticCircleSystem> | https://onedrive.live.com/?authkey=%21ADpyEVkDJt3bXqI&id=F42A4587F0EA46EC%21963&cid=F42A4587F0EA46EC |
| 23:08:58 | <Cherri-ArcticCircleSystem> | C: All 1080p. |
| 23:11:51 | <Cherri-ArcticCircleSystem> | C: Okay it seems the announcement didn't say she was outright fired, it's possible that after the discussion mentioned in the announcement she stepped down voluntarily, but that's beside the point. |
| 23:14:41 | | dm4v quits [Client Quit] |
| 23:18:15 | | dm4v joins |
| 23:18:17 | | dm4v is now authenticated as dm4v |
| 23:18:17 | | dm4v quits [Changing host] |
| 23:18:17 | | dm4v (dm4v) joins |
| 23:21:26 | <@JAA> | SketchTheCow: Huh, looks like something was stuck, now the tasks and files are appearing. Thanks. :-) |
| 23:34:47 | | bonga quits [Read error: Connection reset by peer] |
| 23:35:07 | | bonga joins |
| 23:37:27 | | Arcorann (Arcorann) joins |
| 23:47:52 | | Megame quits [Client Quit] |
| 23:53:35 | | Cherri-ArcticCircleSystem quits [Remote host closed the connection] |
| 23:58:48 | | insane_alien_ quits [Client Quit] |