00:00:13<h2ibot>JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=52020&oldid=52018
00:04:28etnguyen03 quits [Client Quit]
00:05:09etnguyen03 (etnguyen03) joins
00:09:47lennier2_ joins
00:12:22lennier2 quits [Ping timeout: 255 seconds]
00:14:56etnguyen03 quits [Client Quit]
00:15:37etnguyen03 (etnguyen03) joins
00:16:52tzt quits [Ping timeout: 255 seconds]
00:23:58tzt (tzt) joins
00:55:09etnguyen03 quits [Client Quit]
00:55:51etnguyen03 (etnguyen03) joins
01:03:19Hackerpcs quits [Client Quit]
01:05:03Hackerpcs (Hackerpcs) joins
01:05:37etnguyen03 quits [Client Quit]
01:06:20etnguyen03 (etnguyen03) joins
01:16:06etnguyen03 quits [Client Quit]
01:16:49etnguyen03 (etnguyen03) joins
01:26:35etnguyen03 quits [Client Quit]
01:27:18etnguyen03 (etnguyen03) joins
01:36:58wickedplayer494 quits [Ping timeout: 255 seconds]
01:37:04etnguyen03 quits [Client Quit]
01:53:52etnguyen03 (etnguyen03) joins
02:02:49datechnoman quits [Quit: The Lounge - https://thelounge.chat]
02:03:24datechnoman (datechnoman) joins
02:11:37Doranwen (Doranwen) joins
02:13:59tzt quits [Remote host closed the connection]
02:14:22tzt (tzt) joins
02:17:59eightthree quits [Ping timeout: 272 seconds]
02:21:15eightthree joins
02:30:01eightthree quits [Ping timeout: 272 seconds]
02:32:25eightthree joins
02:34:15jacksonchen666 quits [Remote host closed the connection]
02:34:43jacksonchen666 (jacksonchen666) joins
02:35:22Wohlstand (Wohlstand) joins
03:02:19DogsRNice joins
03:25:54wickedplayer494 joins
03:26:48<h2ibot>Xz edited List of websites excluded from the Wayback Machine/Partial exclusions (+33, UdoNet.com/circumcision): https://wiki.archiveteam.org/?diff=52021&oldid=52015
03:35:38superkuh joins
03:38:57BlueMaxima quits [Read error: Connection reset by peer]
04:19:20JaffaCakes118 quits [Remote host closed the connection]
04:19:43JaffaCakes118 (JaffaCakes118) joins
04:39:19<@OrIdow6>Guest: Is this a private WARC? If not can you send us a link or (perhaps the first few MB of) the file itself?
04:41:52Craigle quits [Quit: The Lounge - https://thelounge.chat]
04:42:23Craigle (Craigle) joins
04:50:37archivist99 quits [Ping timeout: 272 seconds]
04:55:52midou quits [Ping timeout: 255 seconds]
04:56:18etnguyen03 quits [Client Quit]
04:57:02etnguyen03 (etnguyen03) joins
05:06:49etnguyen03 quits [Client Quit]
05:07:31etnguyen03 (etnguyen03) joins
05:17:19etnguyen03 quits [Client Quit]
05:18:02etnguyen03 (etnguyen03) joins
05:21:38<Guest>OrIdow6 i have the warc file from archiveteam's roblox forums collection on https://archive.org/details/archiveteam_roblox . heres a mediafire link to the first 50mb of the warc file: https://www.mediafire.com/file/jem2wrwpbfrp49w/12_roblox_20171212222627_first_50mb.megawarc.warc.gz/file .
05:26:38etnguyen03 quits [Remote host closed the connection]
05:32:20DogsRNice quits [Read error: Connection reset by peer]
05:32:59BearFortress joins
05:33:09<@OrIdow6>Guest: If I
05:33:24<@OrIdow6>'m understanding this right it's been double-gzipped
05:34:20<@OrIdow6>IDK how proficient you are with the UNIX command line but if you are, cat [file] | zcat | zcat | less gives me the raw text
05:36:30<@OrIdow6>For those following along in IRC: AFAICT the item on the IA is only gzipped once, this was applied later
05:52:34Guest|Mobile joins
05:53:54Guest|Mobile quits [Client Quit]
06:11:05nicolas17 quits [Remote host closed the connection]
06:11:38midou joins
06:13:12<fireonlive>-+rss- Best Buy Geek Squad employees report mass layoffs: https://www.theverge.com/2024/4/5/24122542/best-buy-geek-squad-layoffs-ai-restructuring https://news.ycombinator.com/item?id=39958321
06:30:11eroc1990 quits [Client Quit]
06:30:34eroc1990 (eroc1990) joins
06:49:03michaelblob (michaelblob) joins
06:53:58zhongfu quits [Client Quit]
06:55:34Ryz20 (Ryz) joins
06:55:34Ryz2 quits [Read error: Connection reset by peer]
06:55:34Ryz20 is now known as Ryz2
06:55:35s-crypt3 (s-crypt) joins
06:55:44kiska7 (kiska) joins
06:55:48flashfire429 joins
06:56:13s-crypt quits [Read error: Connection reset by peer]
06:56:13flashfire42 quits [Read error: Connection reset by peer]
06:56:13kiska quits [Read error: Connection reset by peer]
06:56:13s-crypt3 is now known as s-crypt
06:56:13flashfire429 is now known as flashfire42
06:56:14kiska7 is now known as kiska
06:58:13zhongfu (zhongfu) joins
07:04:51nicolas17 joins
07:05:02Unholy2361 quits [Remote host closed the connection]
07:06:25Unholy2361 (Unholy2361) joins
07:17:06Wohlstand quits [Client Quit]
07:21:24Wohlstand (Wohlstand) joins
07:26:25nicolas17 quits [Ping timeout: 272 seconds]
07:33:56Wohlstand quits [Client Quit]
07:36:58midou quits [Ping timeout: 255 seconds]
07:43:46zhongfu quits [Client Quit]
07:46:21zhongfu (zhongfu) joins
07:51:26Island quits [Read error: Connection reset by peer]
08:00:04qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds]
08:03:27qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
08:21:51midou joins
09:00:01Bleo182600 quits [Client Quit]
09:01:15Bleo182600 joins
09:06:31decky quits [Ping timeout: 255 seconds]
09:19:27pabs quits [Remote host closed the connection]
09:30:56@kaz quits [Quit: Killed]
09:31:15kaz (Kaz) joins
09:31:15@ChanServ sets mode: +o kaz
09:34:20<@kaz>ping
09:46:04pabs (pabs) joins
09:46:45pabs quits [Remote host closed the connection]
09:49:41pabs (pabs) joins
09:49:59<katia>pong
09:51:36pabs quits [Remote host closed the connection]
09:55:56pabs (pabs) joins
09:56:23Dango360_ joins
09:59:41Dango360 quits [Ping timeout: 272 seconds]
10:21:13benjins3 quits [Ping timeout: 255 seconds]
10:21:45cas joins
10:22:45<cas>Hello, I'd like to report about a site called Japanator. Its main site appears to have been dead, and its twitter is no longer updating posts https://twitter.com/Japanator
10:23:30<cas>This is its main site fyi http://www.japanator.com/ As you can see it's no longer accessible, so I hope to archive what remains of it while it's still there
10:26:10blue_0000ff quits [Read error: Connection reset by peer]
10:26:47blue_0000ff joins
10:49:55decky_e joins
10:55:19cas quits [Client Quit]
12:00:01Chris5010 quits [Ping timeout: 272 seconds]
12:39:10cas joins
12:39:10cas quits [Client Quit]
13:07:32jacksonchen666 quits [Client Quit]
13:30:35Arcorann quits [Ping timeout: 272 seconds]
13:47:30pixel leaves [Disconnected: Replaced by new connection]
13:47:31pixel (pixel) joins
14:10:25SootBect1 (SootBector) joins
14:11:48SootBector quits [Ping timeout: 255 seconds]
14:15:23JaffaCakes118 quits [Remote host closed the connection]
14:18:28icedice (icedice) joins
14:26:14<icedice>thuban: Hi, how is it going with the scanlation sites?
14:28:18itachi1706 quits [Client Quit]
14:28:46itachi1706 (itachi1706) joins
14:48:57<that_lurker>-+rss- ElephantSQL Is Shutting Down: https://www.elephantsql.com/blog/end-of-life-announcement.html https://news.ycombinator.com/item?id=39958701
14:52:21Dango360_ quits [Client Quit]
14:52:41Dango360 (Dango360) joins
15:10:33etnguyen03 (etnguyen03) joins
15:16:37nicolas17 joins
15:25:52ymgve quits [Ping timeout: 255 seconds]
15:38:10ymgve joins
15:58:07JaffaCakes118 (JaffaCakes118) joins
16:02:55andrew quits [Quit: ]
16:03:18andrew (andrew) joins
16:13:55etnguyen03 quits [Client Quit]
16:14:37etnguyen03 (etnguyen03) joins
16:29:35<thuban>icedice: mangaupdates scrape is done, i'll upload results in a bit
16:30:03<thuban>have finished with the work stuff, so i should get vatoto done pretty soon too
16:31:13<icedice>Nice!
17:03:08Chris5010 (Chris5010) joins
17:04:47Guest quits [Ping timeout: 265 seconds]
17:18:40<qwertyasdfuiopghjkl>wattpad.com is apparently doing some sort of major content removal without prior notice: https://old.reddit.com/r/FanFiction/comments/1bvmchq/is_wattpad_going_nuclear_on_fanfics/
17:21:25Barto (Barto) joins
17:23:54Guest joins
17:24:11etnguyen03 quits [Client Quit]
17:24:53etnguyen03 (etnguyen03) joins
17:26:17nic8 quits [Read error: Connection reset by peer]
17:30:16<Guest>OrIdow6 sorry i gzipped it twice. do you know how i can get the contents though? i was pulling my hairs out yesterday since it was completely fine with replayweb.page . the full file is at https://archive.org/download/archiveteam_roblox_20171212222627/roblox_20171212222627.megawarc.warc.gz (39gb). i can find another warc if youd like, it looks like
17:30:16<Guest>i forgot to decompress them in that one.
17:31:22<Guest>im also on windows, so the only time i ever really "use linux" is with an rpi or ubuntu on WSL
17:36:35nic8 (nic) joins
17:56:03etnguyen03 quits [Client Quit]
17:56:44etnguyen03 (etnguyen03) joins
18:06:31etnguyen03 quits [Client Quit]
18:07:13etnguyen03 (etnguyen03) joins
18:16:58etnguyen03 quits [Client Quit]
18:17:39etnguyen03 (etnguyen03) joins
18:27:26etnguyen03 quits [Client Quit]
18:28:07etnguyen03 (etnguyen03) joins
18:36:28Guest quits [Client Quit]
18:37:54etnguyen03 quits [Client Quit]
18:38:36etnguyen03 (etnguyen03) joins
18:56:47benjins3 joins
18:57:56Guest joins
19:01:36icedice quits [Client Quit]
19:03:26icedice (icedice) joins
19:06:09<@OrIdow6>Guest: What specific issue are you having? For instance https://transfer.archivete.am/pxsLb/warcio.py works fine for me
19:06:10<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/pxsLb/warcio.py
19:19:44Island joins
19:25:41etnguyen03 quits [Client Quit]
19:26:23etnguyen03 (etnguyen03) joins
19:36:55<mikolaj|m>I found some ancient IRC logs from c. 1996 here https://trog.qgl.org/browse.php/docs/irclogs, they weren't in the Wayback Machine for some reason, threw them in via savepagenow
19:43:16benjins3 quits [Ping timeout: 255 seconds]
19:46:30<Guest>OrIdow6 tested that on the file i linked above (full version), and that only works for records with the "warcinfo" type. heres the data from one of those records: https://pastebin.com/1zXTH4N1 . it still doesnt give any http content though like seen in replayweb.page .
19:47:35<h2ibot>That lurker edited IRC/Logs (+45, https://trog.qgl.org/browse.php/docs/irclogs…): https://wiki.archiveteam.org/?diff=52022&oldid=50957
19:47:50<@OrIdow6>Guest: `if record.rec_type == 'response'` gets you warcinfos?
19:48:05<Guest>no i changed it to the warcinfo type
19:48:38<Guest>i can only see the html content when i read the bytes from the warc file
19:57:11<@OrIdow6>Guest: Is it still true that "record.content_stream().read() does not return anything"? What are you getting, and what do you want to get?
20:04:17<Guest>yes. when the record type is "response" it does not return anything and when the record type is "warcinfo" it returns something similar to what was in the pastebin.
20:05:09<Guest>i need the html contents and not http data which is what the warcinfo record seemed to be giving me
20:06:42hogchips quits [Quit: My znc bouncer found a childhood friend and left me all alone, how will I survive now? Again?!]
20:07:10<kiska>Try record.raw_stream.read()
20:10:41<wickerz>Anyone using docker desktop (windows) for running containers? The last few days I can spin up some containers for different projects and have them running smoothly for maybe 1-3 hours, and then it suddenly crashes. I get things like "Server returned 0 (CONERROR). Sleeping.", "10671=0" and "Cannot assign requested addressnil". Those are from 3
20:10:41<wickerz>different projects, which I guess is why the errors are different
20:11:06<wickerz>Seems like it could be a connection issue (due to the CONERROR) (?) but I'm just curious if anyone know why this happens. Never had that happen previously. Internet work fine in browser and otherwise
20:11:58<Guest>printing that results in the same output, nothing.
20:15:41<@OrIdow6>Guest: Is the issue seeing the response records, or just getting their bodies?
20:19:34<Guest>just getting their bodies
20:22:48<nstrom|m>I get that on docker desktop for windows too, after a while. I think its networking stack is a bit janky
20:23:19<nstrom|m>Win 11 seems a little better than win 10 for it. It's probably something with WSL
20:25:00<wickerz>nstrom|m at least good to hear it's not necessarily an issue on my end (on Win 11 though). Any tips on how to avoid having to babysit my PC to check for when it has happened?
20:25:29<wickerz>Usually it's not enough to simply restart the containers
20:26:11<nstrom|m>No idea, sorry. I ended up getting a standalone Linux box for home to run 24/7 and just chip in on windows occasionally
20:26:44<wickerz>Yeah, that's the solution I was considering as well..
20:27:11<wickerz>Thanks though!
20:27:49<nstrom|m>2-3 hours seems short for me, mine is usually fine for around 8 and then starts to get flaky
20:28:17<nulldata>Are you using WSL1 or 2, and are you running the latest WSL runtime?
20:29:05<kiska>I seem to get the output just fine from your file: https://paste.kiska.pw/EmplaceKilowatts
20:31:08<nulldata>wickerz - run wsl -l -v and make sure it's using WSL2. And try wsl --update to make sure the runtime is up to date
20:31:59<wickerz>nulldata WSL2
20:32:10<wickerz>Based on output from that command
20:32:38<wickerz>Got the newest one it seems
20:37:38<Guest>kiska oddly enough, it seems to be working now. thanks for the help. ill give updates if i have any issues.
20:40:23<Guest>i believe the issue was happening because i was accessing other properties of the warc record before accessing the content
20:41:30<Guest>i did a test on it right now and i cant read the content after ive already read other properties of the warc, but i can read the content before i read the other properties of the warc
20:48:11<nulldata>Have you considered foregoing Docker Desktop and creating a Debian WSL2 instance and setting up Docker manually? I don't run AT projects on WSL but I do have a Plex Docker running on WSL for GPU passthrough and it's pretty stable. Another thing to consider might be network bridging instead of the NAT stuff WSL does OOB - don't know if you can do
20:48:11<nulldata>that with Docker Desktop but can be done when configuring manually.
20:50:18BlueMaxima joins
20:52:04<wickerz>nulldata no, but that might be something to look into! Thanks. I'll look at that tomorrow after work :)
21:00:32hogchips (shoghicp) joins
21:39:53<h2ibot>Inti83 edited Argentina (+46, /* Public Media and Communication */): https://wiki.archiveteam.org/?diff=52023&oldid=51824
22:28:55icedice quits [Ping timeout: 272 seconds]
22:34:40Guest quits [Client Quit]
23:18:24Guest joins
23:53:55etnguyen03 quits [Client Quit]
23:53:56benjins3 joins
23:54:35etnguyen03 (etnguyen03) joins