00:00:13 | <h2ibot> | JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=52020&oldid=52018 |
00:04:28 | | etnguyen03 quits [Client Quit] |
00:05:09 | | etnguyen03 (etnguyen03) joins |
00:09:47 | | lennier2_ joins |
00:12:22 | | lennier2 quits [Ping timeout: 255 seconds] |
00:14:56 | | etnguyen03 quits [Client Quit] |
00:15:37 | | etnguyen03 (etnguyen03) joins |
00:16:52 | | tzt quits [Ping timeout: 255 seconds] |
00:23:58 | | tzt (tzt) joins |
00:55:09 | | etnguyen03 quits [Client Quit] |
00:55:51 | | etnguyen03 (etnguyen03) joins |
01:03:19 | | Hackerpcs quits [Client Quit] |
01:05:03 | | Hackerpcs (Hackerpcs) joins |
01:05:37 | | etnguyen03 quits [Client Quit] |
01:06:20 | | etnguyen03 (etnguyen03) joins |
01:16:06 | | etnguyen03 quits [Client Quit] |
01:16:49 | | etnguyen03 (etnguyen03) joins |
01:26:35 | | etnguyen03 quits [Client Quit] |
01:27:18 | | etnguyen03 (etnguyen03) joins |
01:36:58 | | wickedplayer494 quits [Ping timeout: 255 seconds] |
01:37:04 | | etnguyen03 quits [Client Quit] |
01:53:52 | | etnguyen03 (etnguyen03) joins |
02:02:49 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
02:03:24 | | datechnoman (datechnoman) joins |
02:11:37 | | Doranwen (Doranwen) joins |
02:13:59 | | tzt quits [Remote host closed the connection] |
02:14:22 | | tzt (tzt) joins |
02:17:59 | | eightthree quits [Ping timeout: 272 seconds] |
02:21:15 | | eightthree joins |
02:30:01 | | eightthree quits [Ping timeout: 272 seconds] |
02:32:25 | | eightthree joins |
02:34:15 | | jacksonchen666 quits [Remote host closed the connection] |
02:34:43 | | jacksonchen666 (jacksonchen666) joins |
02:35:22 | | Wohlstand (Wohlstand) joins |
03:02:19 | | DogsRNice joins |
03:25:54 | | wickedplayer494 joins |
03:26:04 | | wickedplayer494 is now authenticated as wickedplayer494 |
03:26:48 | <h2ibot> | Xz edited List of websites excluded from the Wayback Machine/Partial exclusions (+33, UdoNet.com/circumcision): https://wiki.archiveteam.org/?diff=52021&oldid=52015 |
03:35:38 | | superkuh joins |
03:38:57 | | BlueMaxima quits [Read error: Connection reset by peer] |
04:19:20 | | JaffaCakes118 quits [Remote host closed the connection] |
04:19:43 | | JaffaCakes118 (JaffaCakes118) joins |
04:39:19 | <@OrIdow6> | Guest: Is this a private WARC? If not can you send us a link or (perhaps the first few MB of) the file itself? |
04:41:52 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
04:42:23 | | Craigle (Craigle) joins |
04:50:37 | | archivist99 quits [Ping timeout: 272 seconds] |
04:55:52 | | midou quits [Ping timeout: 255 seconds] |
04:56:18 | | etnguyen03 quits [Client Quit] |
04:57:02 | | etnguyen03 (etnguyen03) joins |
05:06:49 | | etnguyen03 quits [Client Quit] |
05:07:31 | | etnguyen03 (etnguyen03) joins |
05:17:19 | | etnguyen03 quits [Client Quit] |
05:18:02 | | etnguyen03 (etnguyen03) joins |
05:21:38 | <Guest> | OrIdow6 i have the warc file from archiveteam's roblox forums collection on https://archive.org/details/archiveteam_roblox . heres a mediafire link to the first 50mb of the warc file: https://www.mediafire.com/file/jem2wrwpbfrp49w/12_roblox_20171212222627_first_50mb.megawarc.warc.gz/file . |
05:26:38 | | etnguyen03 quits [Remote host closed the connection] |
05:32:20 | | DogsRNice quits [Read error: Connection reset by peer] |
05:32:59 | | BearFortress joins |
05:33:09 | <@OrIdow6> | Guest: If I |
05:33:24 | <@OrIdow6> | 'm understanding this right it's been double-gzipped |
05:34:20 | <@OrIdow6> | IDK how proficient you are with the UNIX command line but if you are, cat [file] | zcat | zcat | less gives me the raw text |
05:36:30 | <@OrIdow6> | For those following along in IRC: AFAICT the item on the IA is only gzipped once, this was applied later |
05:52:34 | | Guest|Mobile joins |
05:53:54 | | Guest|Mobile quits [Client Quit] |
06:11:05 | | nicolas17 quits [Remote host closed the connection] |
06:11:38 | | midou joins |
06:13:12 | <fireonlive> | -+rss- Best Buy Geek Squad employees report mass layoffs: https://www.theverge.com/2024/4/5/24122542/best-buy-geek-squad-layoffs-ai-restructuring https://news.ycombinator.com/item?id=39958321 |
06:30:11 | | eroc1990 quits [Client Quit] |
06:30:34 | | eroc1990 (eroc1990) joins |
06:49:03 | | michaelblob (michaelblob) joins |
06:53:58 | | zhongfu quits [Client Quit] |
06:55:34 | | Ryz20 (Ryz) joins |
06:55:34 | | Ryz2 quits [Read error: Connection reset by peer] |
06:55:34 | | Ryz20 is now known as Ryz2 |
06:55:35 | | s-crypt3 (s-crypt) joins |
06:55:44 | | kiska7 (kiska) joins |
06:55:48 | | flashfire429 joins |
06:56:13 | | s-crypt quits [Read error: Connection reset by peer] |
06:56:13 | | flashfire42 quits [Read error: Connection reset by peer] |
06:56:13 | | kiska quits [Read error: Connection reset by peer] |
06:56:13 | | s-crypt3 is now known as s-crypt |
06:56:13 | | flashfire429 is now known as flashfire42 |
06:56:14 | | kiska7 is now known as kiska |
06:58:13 | | zhongfu (zhongfu) joins |
07:04:51 | | nicolas17 joins |
07:05:02 | | Unholy2361 quits [Remote host closed the connection] |
07:06:25 | | Unholy2361 (Unholy2361) joins |
07:17:06 | | Wohlstand quits [Client Quit] |
07:21:24 | | Wohlstand (Wohlstand) joins |
07:26:25 | | nicolas17 quits [Ping timeout: 272 seconds] |
07:33:56 | | Wohlstand quits [Client Quit] |
07:36:58 | | midou quits [Ping timeout: 255 seconds] |
07:43:46 | | zhongfu quits [Client Quit] |
07:46:21 | | zhongfu (zhongfu) joins |
07:51:26 | | Island quits [Read error: Connection reset by peer] |
08:00:04 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
08:03:27 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
08:21:51 | | midou joins |
09:00:01 | | Bleo182600 quits [Client Quit] |
09:01:15 | | Bleo182600 joins |
09:06:31 | | decky quits [Ping timeout: 255 seconds] |
09:19:27 | | pabs quits [Remote host closed the connection] |
09:30:56 | | @kaz quits [Quit: Killed] |
09:31:15 | | kaz (Kaz) joins |
09:31:15 | | @ChanServ sets mode: +o kaz |
09:34:20 | <@kaz> | ping |
09:46:04 | | pabs (pabs) joins |
09:46:45 | | pabs quits [Remote host closed the connection] |
09:49:41 | | pabs (pabs) joins |
09:49:59 | <katia> | pong |
09:51:36 | | pabs quits [Remote host closed the connection] |
09:55:56 | | pabs (pabs) joins |
09:56:23 | | Dango360_ joins |
09:59:41 | | Dango360 quits [Ping timeout: 272 seconds] |
10:21:13 | | benjins3 quits [Ping timeout: 255 seconds] |
10:21:45 | | cas joins |
10:22:45 | <cas> | Hello, I'd like to report about a site called Japanator. Its main site appears to have been dead, and its twitter is no longer updating posts https://twitter.com/Japanator |
10:23:30 | <cas> | This is its main site fyi http://www.japanator.com/ As you can see it's no longer accessible, so I hope to archive what remains of it while it's still there |
10:26:10 | | blue_0000ff quits [Read error: Connection reset by peer] |
10:26:47 | | blue_0000ff joins |
10:49:55 | | decky_e joins |
10:55:19 | | cas quits [Client Quit] |
12:00:01 | | Chris5010 quits [Ping timeout: 272 seconds] |
12:39:10 | | cas joins |
12:39:10 | | cas quits [Client Quit] |
13:07:32 | | jacksonchen666 quits [Client Quit] |
13:30:35 | | Arcorann quits [Ping timeout: 272 seconds] |
13:47:30 | | pixel leaves [Disconnected: Replaced by new connection] |
13:47:31 | | pixel (pixel) joins |
13:49:34 | | blue_0000ff is now authenticated as blue_0000ff |
14:10:25 | | SootBect1 (SootBector) joins |
14:11:48 | | SootBector quits [Ping timeout: 255 seconds] |
14:15:23 | | JaffaCakes118 quits [Remote host closed the connection] |
14:18:28 | | icedice (icedice) joins |
14:26:14 | <icedice> | thuban: Hi, how is it going with the scanlation sites? |
14:28:18 | | itachi1706 quits [Client Quit] |
14:28:46 | | itachi1706 (itachi1706) joins |
14:48:57 | <that_lurker> | -+rss- ElephantSQL Is Shutting Down: https://www.elephantsql.com/blog/end-of-life-announcement.html https://news.ycombinator.com/item?id=39958701 |
14:52:21 | | Dango360_ quits [Client Quit] |
14:52:41 | | Dango360 (Dango360) joins |
15:10:33 | | etnguyen03 (etnguyen03) joins |
15:16:37 | | nicolas17 joins |
15:25:52 | | ymgve quits [Ping timeout: 255 seconds] |
15:38:10 | | ymgve joins |
15:58:07 | | JaffaCakes118 (JaffaCakes118) joins |
16:02:55 | | andrew quits [Quit: ] |
16:03:18 | | andrew (andrew) joins |
16:13:55 | | etnguyen03 quits [Client Quit] |
16:14:37 | | etnguyen03 (etnguyen03) joins |
16:29:35 | <thuban> | icedice: mangaupdates scrape is done, i'll upload results in a bit |
16:30:03 | <thuban> | have finished with the work stuff, so i should get vatoto done pretty soon too |
16:31:13 | <icedice> | Nice! |
17:03:08 | | Chris5010 (Chris5010) joins |
17:04:47 | | Guest quits [Ping timeout: 265 seconds] |
17:18:40 | <qwertyasdfuiopghjkl> | wattpad.com is apparently doing some sort of major content removal without prior notice: https://old.reddit.com/r/FanFiction/comments/1bvmchq/is_wattpad_going_nuclear_on_fanfics/ |
17:21:25 | | Barto (Barto) joins |
17:23:54 | | Guest joins |
17:24:11 | | etnguyen03 quits [Client Quit] |
17:24:53 | | etnguyen03 (etnguyen03) joins |
17:26:17 | | nic8 quits [Read error: Connection reset by peer] |
17:30:16 | <Guest> | OrIdow6 sorry i gzipped it twice. do you know how i can get the contents though? i was pulling my hairs out yesterday since it was completely fine with replayweb.page . the full file is at https://archive.org/download/archiveteam_roblox_20171212222627/roblox_20171212222627.megawarc.warc.gz (39gb). i can find another warc if youd like, it looks like |
17:30:16 | <Guest> | i forgot to decompress them in that one. |
17:31:22 | <Guest> | im also on windows, so the only time i ever really "use linux" is with an rpi or ubuntu on WSL |
17:36:35 | | nic8 (nic) joins |
17:56:03 | | etnguyen03 quits [Client Quit] |
17:56:44 | | etnguyen03 (etnguyen03) joins |
18:06:31 | | etnguyen03 quits [Client Quit] |
18:07:13 | | etnguyen03 (etnguyen03) joins |
18:16:58 | | etnguyen03 quits [Client Quit] |
18:17:39 | | etnguyen03 (etnguyen03) joins |
18:27:26 | | etnguyen03 quits [Client Quit] |
18:28:07 | | etnguyen03 (etnguyen03) joins |
18:36:28 | | Guest quits [Client Quit] |
18:37:54 | | etnguyen03 quits [Client Quit] |
18:38:36 | | etnguyen03 (etnguyen03) joins |
18:56:47 | | benjins3 joins |
18:57:56 | | Guest joins |
19:01:36 | | icedice quits [Client Quit] |
19:03:26 | | icedice (icedice) joins |
19:06:09 | <@OrIdow6> | Guest: What specific issue are you having? For instance https://transfer.archivete.am/pxsLb/warcio.py works fine for me |
19:06:10 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/pxsLb/warcio.py |
19:19:44 | | Island joins |
19:25:41 | | etnguyen03 quits [Client Quit] |
19:26:23 | | etnguyen03 (etnguyen03) joins |
19:36:55 | <mikolaj|m> | I found some ancient IRC logs from c. 1996 here https://trog.qgl.org/browse.php/docs/irclogs, they weren't in the Wayback Machine for some reason, threw them in via savepagenow |
19:43:16 | | benjins3 quits [Ping timeout: 255 seconds] |
19:46:30 | <Guest> | OrIdow6 tested that on the file i linked above (full version), and that only works for records with the "warcinfo" type. heres the data from one of those records: https://pastebin.com/1zXTH4N1 . it still doesnt give any http content though like seen in replayweb.page . |
19:47:35 | <h2ibot> | That lurker edited IRC/Logs (+45, https://trog.qgl.org/browse.php/docs/irclogs…): https://wiki.archiveteam.org/?diff=52022&oldid=50957 |
19:47:50 | <@OrIdow6> | Guest: `if record.rec_type == 'response'` gets you warcinfos? |
19:48:05 | <Guest> | no i changed it to the warcinfo type |
19:48:38 | <Guest> | i can only see the html content when i read the bytes from the warc file |
19:57:11 | <@OrIdow6> | Guest: Is it still true that "record.content_stream().read() does not return anything"? What are you getting, and what do you want to get? |
20:04:17 | <Guest> | yes. when the record type is "response" it does not return anything and when the record type is "warcinfo" it returns something similar to what was in the pastebin. |
20:05:09 | <Guest> | i need the html contents and not http data which is what the warcinfo record seemed to be giving me |
20:06:42 | | hogchips quits [Quit: My znc bouncer found a childhood friend and left me all alone, how will I survive now? Again?!] |
20:07:10 | <kiska> | Try record.raw_stream.read() |
20:10:41 | <wickerz> | Anyone using docker desktop (windows) for running containers? The last few days I can spin up some containers for different projects and have them running smoothly for maybe 1-3 hours, and then it suddenly crashes. I get things like "Server returned 0 (CONERROR). Sleeping.", "10671=0" and "Cannot assign requested addressnil". Those are from 3 |
20:10:41 | <wickerz> | different projects, which I guess is why the errors are different |
20:11:06 | <wickerz> | Seems like it could be a connection issue (due to the CONERROR) (?) but I'm just curious if anyone know why this happens. Never had that happen previously. Internet work fine in browser and otherwise |
20:11:58 | <Guest> | printing that results in the same output, nothing. |
20:15:41 | <@OrIdow6> | Guest: Is the issue seeing the response records, or just getting their bodies? |
20:19:34 | <Guest> | just getting their bodies |
20:22:48 | <nstrom|m> | I get that on docker desktop for windows too, after a while. I think its networking stack is a bit janky |
20:23:19 | <nstrom|m> | Win 11 seems a little better than win 10 for it. It's probably something with WSL |
20:25:00 | <wickerz> | nstrom|m at least good to hear it's not necessarily an issue on my end (on Win 11 though). Any tips on how to avoid having to babysit my PC to check for when it has happened? |
20:25:29 | <wickerz> | Usually it's not enough to simply restart the containers |
20:26:11 | <nstrom|m> | No idea, sorry. I ended up getting a standalone Linux box for home to run 24/7 and just chip in on windows occasionally |
20:26:44 | <wickerz> | Yeah, that's the solution I was considering as well.. |
20:27:11 | <wickerz> | Thanks though! |
20:27:49 | <nstrom|m> | 2-3 hours seems short for me, mine is usually fine for around 8 and then starts to get flaky |
20:28:17 | <nulldata> | Are you using WSL1 or 2, and are you running the latest WSL runtime? |
20:29:05 | <kiska> | I seem to get the output just fine from your file: https://paste.kiska.pw/EmplaceKilowatts |
20:31:08 | <nulldata> | wickerz - run wsl -l -v and make sure it's using WSL2. And try wsl --update to make sure the runtime is up to date |
20:31:59 | <wickerz> | nulldata WSL2 |
20:32:10 | <wickerz> | Based on output from that command |
20:32:38 | <wickerz> | Got the newest one it seems |
20:37:38 | <Guest> | kiska oddly enough, it seems to be working now. thanks for the help. ill give updates if i have any issues. |
20:40:23 | <Guest> | i believe the issue was happening because i was accessing other properties of the warc record before accessing the content |
20:41:30 | <Guest> | i did a test on it right now and i cant read the content after ive already read other properties of the warc, but i can read the content before i read the other properties of the warc |
20:48:11 | <nulldata> | Have you considered foregoing Docker Desktop and creating a Debian WSL2 instance and setting up Docker manually? I don't run AT projects on WSL but I do have a Plex Docker running on WSL for GPU passthrough and it's pretty stable. Another thing to consider might be network bridging instead of the NAT stuff WSL does OOB - don't know if you can do |
20:48:11 | <nulldata> | that with Docker Desktop but can be done when configuring manually. |
20:50:18 | | BlueMaxima joins |
20:52:04 | <wickerz> | nulldata no, but that might be something to look into! Thanks. I'll look at that tomorrow after work :) |
21:00:32 | | hogchips (shoghicp) joins |
21:39:53 | <h2ibot> | Inti83 edited Argentina (+46, /* Public Media and Communication */): https://wiki.archiveteam.org/?diff=52023&oldid=51824 |
22:28:55 | | icedice quits [Ping timeout: 272 seconds] |
22:34:40 | | Guest quits [Client Quit] |
23:18:24 | | Guest joins |
23:53:55 | | etnguyen03 quits [Client Quit] |
23:53:56 | | benjins3 joins |
23:54:35 | | etnguyen03 (etnguyen03) joins |