00:38:20immibis quits [Read error: Connection reset by peer]
00:38:30immibis joins
00:42:31jacobk joins
00:49:56lennier1 quits [Client Quit]
00:51:48qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
00:54:11lennier1 (lennier1) joins
00:58:53mutantmonkey quits [Ping timeout: 252 seconds]
01:40:26audy quits [Ping timeout: 240 seconds]
01:40:34audy joins
01:49:06immibis_ joins
01:51:50immibis quits [Ping timeout: 265 seconds]
02:03:06dm4v quits [Ping timeout: 244 seconds]
02:05:00dm4v joins
02:05:03dm4v quits [Changing host]
02:05:03dm4v (dm4v) joins
02:24:17tbc1887 (tbc1887) joins
02:48:15tbc1887 quits [Read error: Connection reset by peer]
02:57:37benjins quits [Read error: Connection reset by peer]
02:58:43benjins joins
03:02:01AnotherIki quits [Read error: Connection reset by peer]
03:02:02mrfooooo quits [Quit: Ping timeout (120 seconds)]
03:02:17mrfooooo joins
03:02:32s-crypt6 (s-crypt) joins
03:02:49sec^nd quits [Ping timeout: 252 seconds]
03:03:38T31M_ joins
03:05:03T31M quits [Quit: ZNC - https://znc.in]
03:05:03T31M_ is now known as T31M
03:05:11VerifiedJ9 (VerifiedJ) joins
03:05:38katocala quits [Read error: Connection reset by peer]
03:05:38VerifiedJ quits [Quit: Ping timeout (120 seconds)]
03:05:38s-crypt quits [Ping timeout: 252 seconds]
03:05:38VerifiedJ9 is now known as VerifiedJ
03:05:38s-crypt6 is now known as s-crypt
03:05:55Iki joins
03:05:57katocala joins
03:06:20endrift|ZNC joins
03:06:39endrift quits [Read error: Connection reset by peer]
03:07:26sec^nd (second) joins
03:23:49lennier2 joins
03:24:09lennier1 quits [Ping timeout: 265 seconds]
03:24:19lennier2 is now known as lennier1
03:30:21march_happy quits [Remote host closed the connection]
03:34:41march_happy (march_happy) joins
03:38:42jacobk_ joins
03:38:44jacobk_ quits [Client Quit]
03:59:26Eighty quits [Ping timeout: 265 seconds]
04:05:08Eighty (Eighty) joins
04:07:04dvd (dvd) joins
04:25:53dvd quits [Read error: Connection reset by peer]
04:40:10hackbug quits [Remote host closed the connection]
04:41:50hackbug (hackbug) joins
05:02:18sonick quits [Client Quit]
05:58:49Eighty quits [Ping timeout: 265 seconds]
06:05:47<nirv>I've spent weeks on this archive. could someone just give me the answer as to how to extract the 7z geocities files? I already extracted the .001 files. it's not letting me extract these. https://cdn.discordapp.com/attachments/872443255281844236/949185672613330984/unknown.png
06:06:02<nirv>I'm using: 7zz -mmt=18 -y x "*.7z" -o/media/nirv/linux2tb > /media/nirv/linux2tb/uppercase02.log
06:06:17<nirv>Extracting archive: geocities-A-a.7z WARNING: geocities-A-a.7z Cannot open the file as [7z] archive The file is open as [tar] archive
06:08:37<nirv>this is the log showing all of the 7z files extracted fine from the .001 archive. https://pastebin.com/8cX5gZHP
06:09:12<nirv>I'm at a loss as to what the hell to do to extract this 459GB set of 7z files now.
06:17:11katocala quits [Ping timeout: 265 seconds]
06:20:12katocala joins
06:24:32katocala quits [Ping timeout: 244 seconds]
06:31:35<nirv>I'm thinking it's a problem with the file names? I see the .7z extension on some of the files, but none of the other files have any extension on them.
06:31:38<nirv>I wonder if there is a way to show expected file names of an archive, so I can rename them to that.
06:31:42<nirv>but it begs the question: why in the hell were they named wrong to begin with? (if this is the case)
06:34:34Eighty (Eighty) joins
06:37:27nicolas17 quits [Ping timeout: 244 seconds]
06:39:03sonick (sonick) joins
06:51:01lukash7 quits [Ping timeout: 265 seconds]
06:58:16Eighty quits [Ping timeout: 265 seconds]
07:13:57Eighty (Eighty) joins
07:33:03nico_32 quits [Ping timeout: 252 seconds]
07:55:17nico_32 (nico) joins
07:57:51qwertyasdfuiopghjkl joins
07:58:12Eighty quits [Ping timeout: 265 seconds]
08:10:18<nirv>if I can find the solution to this shit I'm making a video on youtube. nobody should have to suffer like I have just to sift through the geocities archive
08:10:49<nirv>tards are people too. and we should be able to extract the geocities archive and sift through the material
08:27:29<appledash>I recall when I downloaded it like 5 years ago I had absolutely no issues
08:27:46<nirv>do you recall how you extracted these files? https://cdn.discordapp.com/attachments/872443255281844236/949185672613330984/unknown.png
08:28:08<nirv>because either I'm a tard, or the files are screwed up. most likely I'm of the tard
08:28:59<appledash>I'm just gonna download the torrent again and see what I can make of it
08:29:11<appledash>You are using https://thepiratebay.org/description.php?id=6353395 correct?
08:29:14<appledash>That is the one I recall using
08:29:26<nirv>the files from the torrent are .001 which extracted fine for me. 100%. but the 7z contained inside refuse to
08:29:33<nirv>yes, that's the one
08:30:14<appledash>I think something is screwed up with your download.
08:30:36<nirv>I did a CRC check even
08:31:10<appledash>All the files I am seeing in the torrent are .7z.xxx where xxx is 3 numbers starting at 001 and ending at however many parts each file has.
08:31:40<appledash>So if you have foo.7z.001 foo.7z.002 foo.7z.003 that's a 3-part 7z archive and doing 7z x foo.7z.001 will extract all files contained across those 3 parts
08:32:02<appledash>I'm not seeing any files that are just .7z and none with no extensions
08:32:40<nirv>I didn't trust the torrent, so what I did was ran a script that CRC checked the files and would do this if a file failed CRC: https://cdn.discordapp.com/attachments/872443255281844236/949222669734248468/unknown.png
08:33:02<nirv>so it created a file you could run to grab the corrupt/missing files from archive.org as you can see
08:33:12<nirv>so that's what I did. now all of the torrent files pass CRC.
08:33:21<nirv>Great. I extracted those no problem.
08:33:31<nirv>The files extracted though are more .7z files and THOSE won't extract
08:33:35<appledash>Ahhh!
08:33:46<appledash>Alright, I will have to wait for the download to complete
08:33:54<appledash>ETA for me is unfortunately 20 hours so I may be able to help you tomorrow
08:34:06<appledash>OK, it just dropped to 11 hours :p
08:34:09<nirv>if you downloaded the torrent and it passed CRC out of the gate, you'd still have the same files I ended up with. okay
08:34:54<nirv>i appreciate that man. I'm loading up deluge to help seed
08:35:40<appledash>Thanks!
08:36:35<nirv>I came across this weird extract script but I have no idea how to analyze it. I don't know enough. https://gist.github.com/gjtorikian/29fbc764d6221363bf4aadd4cf59c22e
08:37:53<appledash>Hmm!
08:37:56<appledash>I think I understand
08:38:17<appledash>To me it implies that 7z has mis-named the second-level of files; I think you should try to extract one of the ".7z" files with tar -xzvf <file>
08:38:22<appledash>And see how that works out
08:38:25<nirv>that's what I'm thinking!!
08:38:43<nirv>okay!
08:38:46froschgrosch_ quits [Ping timeout: 240 seconds]
08:38:50<nirv>the last thing i tried before I got pissed was
08:38:51<nirv>tar -xvf geocities-A-a.7z
08:39:03<nirv>but I'll try yours
08:40:01froschgrosch_ joins
08:40:18<nirv>irv@nirv-VirtualBox:/media/nirv/linuxssd/extracted$ tar -xzvf geocities-A-a.7z gzip: stdin: not in gzip format tar: Child returned status 1 tar: Error is not recoverable: exiting now
08:40:28<appledash>Wtf
08:40:38<appledash>What does `file geocities-A-a.7z` say?
08:40:51<nirv>so just file geocities-A-a.7z from terminal?
08:41:00<nirv>geocities-A-a.7z: POSIX tar archive (GNU)
08:41:17<appledash>Yeah
08:41:30<appledash>Hm, try without the z in the tar command?
08:41:32<appledash>That SHOULD work
08:41:39<nirv>the readme.txt says it's a GNU tar archive so that's why I thought tar command would work
08:41:48<nirv>I did try without the z before but let me try again
08:42:37<nirv>https://cdn.discordapp.com/attachments/872443255281844236/949225311143362650/unknown.png
08:42:45<nirv>it only extracts 17K worth of files
08:42:57<appledash>What the heck!
08:43:00<nirv>which is less than the .7z alone
08:43:04<nirv>I KNOW
08:43:09<appledash>Alright, I'll probably need to have a look at the files when I finish downloading them
08:43:12<nirv>all right man
08:43:23<nirv>good to know I exhausted all I can though :P
08:43:26<appledash>:p
08:43:55<nirv>so going to make a youtube video so nobody suffers like me
08:45:05<appledash>Sharing knowledge is excellent but I often end up finding that I'm just so pleased I finally solved the problem that I want to get on with what I was doing instead of taking a break to do something like that
08:45:07<appledash>But I hope you do
09:15:57Ruthalas quits [Read error: Connection reset by peer]
09:16:27Ruthalas (Ruthalas) joins
09:31:54driib quits [Client Quit]
09:32:07driib (driib) joins
10:14:27march_happy quits [Ping timeout: 244 seconds]
10:14:35march_happy (march_happy) joins
10:31:30march_happy quits [Ping timeout: 244 seconds]
10:31:51march_happy (march_happy) joins
10:38:14march_happy quits [Read error: Connection reset by peer]
10:39:00march_happy (march_happy) joins
10:43:44immibis joins
10:45:55immibis_ quits [Ping timeout: 265 seconds]
11:03:59Eighty (Eighty) joins
11:22:51BlueMaxima quits [Client Quit]
11:57:16march_happy quits [Ping timeout: 244 seconds]
11:57:36march_happy (march_happy) joins
11:57:47Eighty quits [Ping timeout: 244 seconds]
12:16:23<nirv>appledash, could you let me know if the torrent rars are even good? I think they're corrupt out of the gate
12:21:33march_happy quits [Ping timeout: 244 seconds]
12:22:23march_happy (march_happy) joins
12:32:51<appledash>I shall!
12:33:01<appledash>Seem to be at 32% right now
12:37:17<nirv>this is the script I used to create .sh files of files that were missing or failed crc from the torrent. https://blog.geocities.institute/archives/3209
12:59:01katocala joins
13:12:30Megame (Megame) joins
13:44:45Arcorann quits [Ping timeout: 265 seconds]
13:58:35onetruth joins
14:38:07Megame quits [Client Quit]
14:55:34Niklink joins
15:02:33<Niklink>how would one go about running a big batch of URLs into the wayback machine? say something around 4-5k
15:14:20<Iki>#archivebot can handle that, I think. You'll need to upload your list to some site
15:14:34<Iki>There is also: https://archive.org/services/wayback-gsheets/
15:14:37<Iki>Once IA is back online
15:14:59<Iki>But I think it is not preferable to run thousands of URLs through that, even though you can-- archivebot is better for that purpose
15:15:00<Iki>Niklink
15:15:54<Iki>FYI a lot of archiving tools here are paused because IA is temporarily down
15:16:21<Niklink>yeah that's why I came here to ask
15:17:16<Niklink>#archivebot says urgent jobs only though, mine aren't urgent really
15:24:36vukky (Vukky) joins
15:40:04vukky quits [Client Quit]
15:44:36march_happy quits [Ping timeout: 244 seconds]
15:45:30march_happy (march_happy) joins
15:50:52dvd (dvd) joins
15:50:53dvd_ (dvd) joins
15:54:05dvd quits [Client Quit]
15:54:12dvd_ quits [Client Quit]
15:54:24dvd (dvd) joins
16:02:38<nirv>this has been taking hours to do. I'm creating a 5TB VDI file just for linux so I can do all I need to do to fully extract this crap. c:\Program Files\Oracle\VirtualBox>VBoxManage createmedium disk --filename "H:\linux5tb.vdi" --size 5145729 --format VDI --variant Fixed
16:02:38<nirv>0%...10%...20%...30%...40%...50%...60%...70%...
16:03:24<nirv>there's probably a better way to do this but I'm pretty dumb with linux and virtualbox
16:07:49bonga quits [Ping timeout: 265 seconds]
16:12:12bonga joins
16:13:52Eighty (Eighty) joins
16:14:12<@JAA>Niklink: What kind of content?
16:20:48immibis quits [Remote host closed the connection]
16:20:54immibis joins
16:30:20LeGoupil joins
16:42:17nicolas17 joins
16:52:51daxxy_ (daxxy) joins
16:55:52Megame (Megame) joins
16:56:26daxxy quits [Ping timeout: 240 seconds]
16:58:29Eighty quits [Ping timeout: 244 seconds]
16:59:34<Niklink>JAA: bandcamp album and track pages
17:02:22<Niklink>many of these pages have per-track art and attribution data that's not included in the album download proper, and I'm afraid some of these artists might get it in their head to take their music down in protest of the epic acquisition
17:04:41march_happy quits [Ping timeout: 244 seconds]
17:04:58march_happy (march_happy) joins
17:11:24LeGoupil quits [Ping timeout: 244 seconds]
17:13:04LeGoupil joins
17:13:28march_happy quits [Ping timeout: 244 seconds]
17:22:31<@JAA>Niklink: I see. I wouldn't mind running that through AB, but not right now as we're trying to grab as much of Ukraine as possible. There should be room on the weekend though. If you give me a list, I can throw it in then. Note that it won't grab the audio.
17:32:12<Niklink>JAA: I'm aware, dw & thanks. should have the list ready by the end of today but it's not high prio
17:39:11Niklink71 joins
17:39:49Niklink quits [Ping timeout: 244 seconds]
17:43:09Niklink71 is now known as Niklink
17:48:36Niklink quits [Ping timeout: 244 seconds]
17:49:27lennier1 quits [Client Quit]
17:51:16lennier1 (lennier1) joins
17:58:56bonga quits [Ping timeout: 244 seconds]
18:02:38Niklink joins
18:04:35Eighty (Eighty) joins
18:11:38bonga joins
18:25:00insane_alien_ quits [Remote host closed the connection]
18:33:01insane_alien (insane_alien) joins
18:50:42insane_alien_ (insane_alien) joins
18:53:36insane_alien quits [Ping timeout: 265 seconds]
19:10:11vukky (Vukky) joins
19:27:03<SketchTheCow>Is there anything that needs my attention?
19:40:11<tech234a>FYI Reddit is blocking .ru link submissions https://old.reddit.com/r/ModSupport/comments/t66l5f/reddit_blocked_all_domains_under_russian_cctld_ru/
19:49:14immibis_ joins
19:52:05immibis quits [Ping timeout: 265 seconds]
20:05:21<wessel1512>this is bad
20:08:52<wessel1512>this is just limiting the freedom of speech
20:10:04<@JAA>lol no, but also this isn't the place for that discussion.
20:10:43<wessel1512>if someone wanted to share misinformation they could just use another tld for that
20:11:17<anarcat>holy crap
20:11:17<anarcat>https://www.washingtonpost.com/technology/2022/03/04/russia-ukraine-internet-cogent-cutoff/
20:11:27<anarcat>sorry, wrong channel
20:11:51<@JAA>Nah, that's fine, this is relevant to archival. :-)
20:12:04<@JAA>Although I propose we keep everything Ukraine/Russia-related in #ucryne.
20:12:05<anarcat>heh
20:12:07<anarcat>yeah
20:14:06Stiletto quits [Ping timeout: 240 seconds]
20:14:58Stiletto joins
20:34:41ThreeHM quits [Quit: WeeChat 3.3]
20:48:14ThreeHM (ThreeHeadedMonkey) joins
20:51:32qwertyasdfuiopghjkl quits [Client Quit]
20:58:00qwertyasdfuiopghjkl joins
21:13:07ThreeHM_ (ThreeHeadedMonkey) joins
21:15:42ThreeHM quits [Ping timeout: 265 seconds]
21:16:27<@JAA>SketchTheCow: Are you sure the FOS AB uploads are working? Don't see anything uploaded to IA since the power outage and free disk space is shrinking a bit more rapidly than I'd like.
21:32:18sonick quits [Client Quit]
21:38:55LeGoupil quits [Client Quit]
21:58:41sonick (sonick) joins
22:05:02BlueMaxima joins
22:14:31march_happy (march_happy) joins
22:36:10<@OrIdow6>Going to try doing some estimates for radikal.ru
22:40:00bonga quits [Ping timeout: 244 seconds]
22:42:28bonga joins
22:48:22<@arkiver>OrIdow6: are you getting 301s to itself as well?
22:48:27<@arkiver>for radikal.ru
22:49:04<@OrIdow6>arkiver: I can access the site fine
22:49:26<@arkiver>hmm
22:49:49<@OrIdow6>Clean Firefox profile on Debian unstable, US IP address
22:50:25<@arkiver>those aids seem somewhat sequential
22:50:43<@arkiver>that's over 6 billion
22:51:40<@OrIdow6>The site claims 800m IIRC and that's what the IdLong field of the API data seems to show
22:51:51<@OrIdow6>Doing the redirect thing for me now
22:52:12<@arkiver>yeah site seems unstable (those redirects)
23:03:57<SketchTheCow>Tried again
23:07:15Cherri-ArcticCircleSystem joins
23:07:38<Cherri-ArcticCircleSystem>C: Oh hey, is there an easy way to transfer things from OneDrive to archive.org?
23:08:51<Cherri-ArcticCircleSystem>C: Because I was watching VOMS stuff and found out that one of the former members got fired and her videos were privated. So I tried looking up information about it and apparently someone on 4chan made a 400+ GB archive of all of her videos.
23:08:53<Cherri-ArcticCircleSystem>https://onedrive.live.com/?authkey=%21ADpyEVkDJt3bXqI&id=F42A4587F0EA46EC%21963&cid=F42A4587F0EA46EC
23:08:58<Cherri-ArcticCircleSystem>C: All 1080p.
23:11:51<Cherri-ArcticCircleSystem>C: Okay it seems the announcement didn't say she was outright fired, it's possible that after the discussion mentioned in the announcement she stepped down voluntarily, but that's beside the point.
23:14:41dm4v quits [Client Quit]
23:18:15dm4v joins
23:18:17dm4v quits [Changing host]
23:18:17dm4v (dm4v) joins
23:21:26<@JAA>SketchTheCow: Huh, looks like something was stuck, now the tasks and files are appearing. Thanks. :-)
23:34:47bonga quits [Read error: Connection reset by peer]
23:35:07bonga joins
23:37:27Arcorann (Arcorann) joins
23:47:52Megame quits [Client Quit]
23:53:35Cherri-ArcticCircleSystem quits [Remote host closed the connection]
23:58:48insane_alien_ quits [Client Quit]