00:01:50etnguyen03 quits [Client Quit]
00:10:08retrograde quits [Remote host closed the connection]
00:10:35retrograde (retrograde) joins
00:15:40retrograde quits [Remote host closed the connection]
00:16:03retrograde (retrograde) joins
00:27:50<nicolas17>https://www.youtube.com/watch?v=0dwagg5wYY4 about Mr Beast
00:42:59etnguyen03 (etnguyen03) joins
01:07:02Exorcism quits [Quit: Ping timeout (120 seconds)]
01:07:05DigitalDragons quits [Quit: Ping timeout (120 seconds)]
01:07:15DigitalDragons (DigitalDragons) joins
01:07:16Exorcism (exorcism) joins
01:15:27Nekroschizofrenetyk joins
01:23:31Nekroschizofrenetyk quits [Client Quit]
02:37:27<Doranwen>Is there a quick way to modify this code to match not only the " but a ' instead of a " here (in both places)? `src="\K[^"?]+` I've got a case where it does img src='PICLINK.EXT' and I want it to grab that pic link there just as it would grab a standard img src="PICLINK.EXT"
02:38:05<@JAA>Only (reasonably) by duplicating it: src="\K[^"?]+|src='\K[^'?]+
02:38:40<@JAA>If you need to put that in a shell string, it gets a bit ugly. :-)
02:39:08<Doranwen>Lol, that's fine, it's going into a script so I don't have to manually fuss with it.
02:39:46<Doranwen>I'm also going to have to add on this to match as well: `{background: url(PICLINKHERE)` and get the PICLINKHERE out of it.
02:39:56<@JAA>How I would write it: grep -Po 'src="\K[^"?]+|src='\''\K[^'\''?]+'
02:40:24<Doranwen>Yeah, the grep -Po is already there, I just didn't copy the full command here.
02:40:54<@JAA>It's a concatenation of this: 'src="\K[^"?]+|src=' \' '\K[^' \' '?]+'
02:41:13<@JAA>So the actual argument passed to grep becomes: src="\K[^"?]+|src='\K[^'?]+
02:41:23<Doranwen>Ahh.
02:41:40<@JAA>There are other versions of writing it, but this is the least ugly variant in my opinion.
02:42:32<@JAA>And: background: url\(\K[^)]*
02:43:46<Doranwen>So altogether it would be: src="\K[^"?]+|src='\K[^'?]+|background: url\(\K[^)]* ? (I probably have it very wrong as I barely understand regex and this is beyond my basic skills there)
02:44:20<Doranwen>It has to do the search all at once and dump all the results into one place, so I'm trying to get them all together in one command. Ugly or not, lol.
02:44:25<steering>hmm
02:44:25<@JAA>Sure, and then putting it as a shell command argument like above.
02:44:52<Doranwen>Right, the original line was: grep -oP 'src="\K[^"?]+' "$f" > ImageLinks/"$post".txt
02:44:59<Doranwen>As part of a for loop and all that.
02:45:00Webuser291943 quits [Quit: Ooops, wrong browser tab.]
02:45:24<Doranwen>I actually stitched them together correctly?
02:45:40Doranwen was sure she missed *something*, lol.
02:46:04<steering>something like this should work in grep -P i think btw: src=(["'])\K.+?(?=\1)
02:47:21<@JAA>Yeah, you can use backrefs and a lookahead with PCRE.
02:47:50<@JAA>Teeeechnically not a regex anymore, but... :-P
02:49:14<steering>if you could use capturing groups instead of needing the match to only include what you're interested in `src=(["'])(.+?)\1` would be better but alas :P
02:52:09<Doranwen>Weird, it's working great on the pics with the {background: url( bit - but it's not finding this one, for instance: <img src='https://l-userpic.livejournal.com/122062562/45330521' width='100' height='100' alt='' border='0'/>
02:54:22<Doranwen>Possibly has something to do with the whole thing being enclosed in ' ? The code for the src with the single quotes has a few characters that aren't the correct color, in my editor.
02:54:32<Doranwen>Which usually means something isn't being quoted correctly.
02:54:45<Doranwen>The line currently is: `grep -oP 'src="\K[^"?]+|src='\K[^'?]+|background: url\(\K[^)]*' "$f" > ImageLinks/"$post".txt`
02:55:56etnguyen03 quits [Remote host closed the connection]
02:57:55<@JAA>You didn't handle the quoting as I described above.
02:58:00<@JAA>'\''
03:01:11<steering>your grep sees the bit in the middle like this: `src=\K[^?]+`
03:01:23<steering>(or something)
03:02:13<Doranwen>I thought I copied from your message but maybe I didn't.
03:02:16<@JAA>src=K[^?]+ actually, I think.
03:02:19<steering>yeah
03:02:30<Doranwen>Oh, I copied from the wrong message.
03:02:41<Doranwen>That's why.
03:02:45SootBector quits [Remote host closed the connection]
03:03:55SootBector (SootBector) joins
03:04:45<steering>also if you don't have any in your dataset great but you may want to add uhh... `|src=\K[^'" ][^ ]*` or so
03:04:54<steering>to get unquoted ones
03:05:43<steering>(*technically* [^ ] needs a lot more characters in it but i don't know exactly what characters will terminate an unquoted html attribute and it probably varies anyway :D)
03:06:11<steering>oh
03:06:18<@JAA>At least >
03:06:21<steering>`|src=\K[^'" >][^ >]*`
03:06:22<steering>yeah
03:06:29<Doranwen>Well, I'm trying to get all the embedded pics, and it turns out these are two other ways they appear besides the standard.
03:06:50<steering>yeah, there's a few more ways.
03:07:00<steering>you can have <img src=/foo/bar.png>
03:07:00<@JAA>The lengths one goes to to avoid having to parse HTML. :-)
03:07:11<steering>you can have background-image:url(...)
03:07:24<steering>(i think its -image, idr, background-something)
03:08:52<Doranwen>Hmm, I thought I copied from the right message this time but this is still not working: `grep -oP 'src="\K[^"?]+|src='\''\K[^'\''?]+|background: url\(\K[^)]*' "$f" > ImageLinks/"$post".txt`
03:09:02<Doranwen>Now a couple of the \ in the middle are the wrong color.
03:09:35<@JAA>> grep -oP 'src="\K[^"?]+|src='\''\K[^'\''?]+|background: url\(\K[^)]*' <<<"<img src='https://l-userpic.livejournal.com/122062562/45330521' width='100' height='100' alt='' border='0'/>"
03:09:39<@JAA>https://l-userpic.livejournal.com/122062562/45330521
03:09:40<@JAA>Works for me.
03:10:40<Doranwen>I'm going off this page: https://www.livejournal.com/allpics.bml?user=ahs_exchange
03:10:43<Doranwen>Saved as an html.
03:11:10<Doranwen>Can't get the userpic links - which I took that one from - to get pulled.
03:12:05<Doranwen>The script creates a folder for every page that it finds embedded pics for, and dumps those pics into it. That part it handles fine. But it's not creating a folder for this page whatsoever. It simply isn't finding the links. :/
03:14:41Doranwen has the whole script up on GitHub but doubts the rest of it is relevant to this bit as it's working flawlessly for everything except discovering the urls in the first place.
03:18:35<Doranwen>It's finding ordinary embedded links fine, as well as the banners that were embedded with `{background: url()` - so there's something about how these userpics are linked that's the challenge.
03:23:59<@JAA>> curl -s https://www.livejournal.com/allpics.bml?user=ahs_exchange | grep -oP 'src="\K[^"?]+|src='\''\K[^'\''?]+|background: url\(\K[^)]*' | grep -F https://l-userpic.livejournal.com/122062562/45330521
03:24:03<@JAA>https://l-userpic.livejournal.com/122062562/45330521
03:24:04<@JAA>Works for me.
03:25:36<Doranwen>Ohhhhhhh, I know exactly why it's not working. At least, I think I do.
03:25:52<Doranwen>Has to do with the order and way I have the script handling all the image downloads.
03:26:08<Doranwen>Anyway, thank you VERY much! I will work on the script some more.
03:26:23<Doranwen>It was not designed to download userpics per post, and that's where the issue lies, I strongly suspect.
03:29:20<Doranwen>They were being downloaded into a separate folder - which Maggie is fine with - so we'll leave it at that. They're saved, at least.
03:29:21<Doranwen>Whew!
04:39:47fluke quits [Ping timeout: 268 seconds]
05:16:57atphoenix__ (atphoenix) joins
05:19:20atphoenix_ quits [Ping timeout: 268 seconds]
05:28:20HackMii quits [Remote host closed the connection]
05:28:46HackMii (hacktheplanet) joins
05:33:15<klea>If you're evil you culd write the full unquoted version into a file, and use grep -oPf that_file :P
05:37:14Starchives_ (Starchives) joins
05:38:47systwi_ quits [Quit: systwi_]
05:40:50Starchives__ quits [Ping timeout: 268 seconds]
05:43:18nothere quits [Ping timeout: 268 seconds]
05:43:18ivan quits [Ping timeout: 268 seconds]
06:02:29ivan joins
06:25:07SootBector quits [Remote host closed the connection]
06:26:16SootBector (SootBector) joins
06:28:20retrograde quits [Remote host closed the connection]
06:28:44retrograde (retrograde) joins
06:30:36nothere joins
06:31:59SootBector quits [Ping timeout: 260 seconds]
06:31:59HackMii quits [Ping timeout: 260 seconds]
06:32:23HackMii (hacktheplanet) joins
06:50:20SootBector (SootBector) joins
07:05:44lemuria quits [Read error: Connection reset by peer]
07:17:54pabs quits [Read error: Connection reset by peer]
07:18:28pabs (pabs) joins
07:24:00nine quits [Quit: See ya!]
07:24:13nine joins
07:59:11ducky_ (ducky) joins
08:00:54ducky quits [Ping timeout: 268 seconds]
08:02:43<steering>you know what i should do? i should make a mirc bot. in honor of fireonlive
08:02:46<steering>fireonlive++
08:02:46<eggdrop>[karma] 'fireonlive' now has 1192 karma!
08:03:00<steering>i'm gonna get it to 1337 dangit
08:03:54ducky_ quits [Ping timeout: 268 seconds]
08:16:02ducky (ducky) joins
08:35:26Dango360 quits [Ping timeout: 268 seconds]
08:36:31Dango360 (Dango360) joins
10:00:12<@JAA>Spot the difference: https://web.archive.org/web/diff/20180326073014/20260220155052/https://www.ipcs.org/archive.php
11:00:02Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat]
11:02:50Bleo1826007227196234552220110 joins
11:15:47<ivan>TIL there's a diff feature
11:18:26steering wonders what changed about the fb and twitter icons
11:18:45<steering>oh
11:19:23<steering>they suffixed the class with a 4 i.e. ico-facebook4
11:23:44datechnoman quits [Quit: Ping timeout (120 seconds)]
11:24:06datechnoman (datechnoman) joins
12:23:25SootBector quits [Remote host closed the connection]
12:24:38SootBector (SootBector) joins
12:47:04SootBector quits [Ping timeout: 260 seconds]
12:48:45SootBector (SootBector) joins
13:00:12ducky_ (ducky) joins
13:01:08ducky quits [Ping timeout: 268 seconds]
13:01:08ducky_ is now known as ducky
13:07:37pabs quits [Remote host closed the connection]
13:08:04pabs (pabs) joins
13:16:57<klea>fireonlive++
13:16:57<eggdrop>[karma] 'fireonlive' now has 1193 karma!
13:17:50<klea>JAA: Shortly is more than 8 years?
13:17:55<klea>Also.
13:18:46<klea>Copyright © <?php echo date("Y"); ?>
14:04:38<klea>https://zacharykai.net/bookmarklets#word-counter seems useful.
14:07:03Nekroschizofrenetyk joins
14:19:27Nekroschizofrenetyk quits [Client Quit]
14:19:31Nekroschizofrenetyk joins
14:27:08<klea>https://www.youtube.com/watch?v=lvEO4IaEJlw This seems fun to do, I suppose they had a wonderful time reverse engineering the way the program backed up data.
14:45:21Dango360 quits [Ping timeout: 268 seconds]
14:45:31Dango3602 (Dango360) joins
14:53:03nine quits [Quit: See ya!]
14:53:16nine joins
14:55:02Nekroschizofrenetyk quits [Client Quit]
16:13:20Webuser572193 joins
16:45:35atphoenix_ (atphoenix) joins
16:48:46atphoenix__ quits [Ping timeout: 268 seconds]
17:06:45traxys quits [Quit: The Lounge - https://thelounge.chat]
18:18:24<@JAA>klea: Yep
19:13:16<@JAA>88x31 galore: https://files.leggi.es/8831/
21:23:46<Dango3602>JAA: only complaint is that they're not clickable (and thus idk where to find some of the things referenced). but it's pretty cool regardless
21:24:29<@JAA>Dango3602: Yeah, would be nice to know what exactly they referenced.
21:25:23<Dango3602>ugh my name is currently not the right one
21:26:24Dango3602 quits [Quit: The Lounge - https://thelounge.chat]
21:26:42Dango3602 (Dango360) joins
21:26:58Dango3602 is now known as Dango360
21:30:23<klea>btw, JAA did you upload the login.corp.google qwarc WARC?
22:09:21multisn8 quits [Ping timeout: 268 seconds]
22:28:39etnguyen03 (etnguyen03) joins
22:48:36Hackerpcs quits [Remote host closed the connection]
22:48:38<nulldata>https://techcrunch.com/2026/04/20/deezer-says-44-of-songs-uploaded-to-its-platform-daily-are-ai-generated/
22:49:28Hackerpcs (Hackerpcs) joins
22:53:15ph0rcys quits [*.net *.split]
22:53:15x9fff00 quits [*.net *.split]
22:53:15DigitalDragon quits [*.net *.split]
22:53:15Hans5958 quits [*.net *.split]
22:53:15yzqzss quits [*.net *.split]
22:53:15Vokun quits [*.net *.split]
22:53:15Exorcism|m quits [*.net *.split]
22:53:16M--mlv|m quits [*.net *.split]
22:53:17osiride|m quits [*.net *.split]
22:53:17ax|m quits [*.net *.split]
22:53:17spearcat|m quits [*.net *.split]
22:53:17supermariofan67|m quits [*.net *.split]
22:53:17NickS|m quits [*.net *.split]
22:53:17djasldjasldjalsdj|m quits [*.net *.split]
22:53:17EvanBoehs|m quits [*.net *.split]
22:53:17Fijxu|m quits [*.net *.split]
22:53:17its_notjack quits [*.net *.split]
22:53:17starg2|m quits [*.net *.split]
22:53:17that_lurker|m quits [*.net *.split]
22:53:17l0rd_enki|m quits [*.net *.split]
22:53:17mikolaj|m quits [*.net *.split]
22:53:17vics quits [*.net *.split]
22:53:17Adamvoltagex|m quits [*.net *.split]
22:53:17v1cs quits [*.net *.split]
22:53:17bogsen quits [*.net *.split]
22:53:17trumad|m quits [*.net *.split]
22:53:17akaibu|m quits [*.net *.split]
22:53:17haha-whered-it-go|m quits [*.net *.split]
22:53:17joepie91|m quits [*.net *.split]
22:53:17nosamu|m quits [*.net *.split]
22:53:17GRBaset quits [*.net *.split]
22:53:17s-crypt|m|m quits [*.net *.split]
22:53:17superusercode quits [*.net *.split]
22:53:17wrangle|m quits [*.net *.split]
22:53:17Cydog|m quits [*.net *.split]
22:53:17jwoglom|m quits [*.net *.split]
22:53:17CrispyAlice2 quits [*.net *.split]
22:53:17Explo quits [*.net *.split]
22:53:17thermospheric quits [*.net *.split]
22:53:17Video quits [*.net *.split]
22:53:17Roki_100|m quits [*.net *.split]
22:53:17hexagonwin|m quits [*.net *.split]
22:53:17iCesenberk|m quits [*.net *.split]
22:53:17phaeton quits [*.net *.split]
22:53:17Thibaultmol quits [*.net *.split]
22:53:17tech234a quits [*.net *.split]
22:53:17moe-a-m|m quits [*.net *.split]
22:53:17tech234a|m-backup quits [*.net *.split]
22:53:17schwarzkatz|m quits [*.net *.split]
22:53:17Ember|m quits [*.net *.split]
22:53:17th3z0l4|m quits [*.net *.split]
22:53:17aaq|m quits [*.net *.split]
22:53:17nstrom|m quits [*.net *.split]
22:53:17xxia|m quits [*.net *.split]
22:53:17Fletcher quits [*.net *.split]
22:53:17masterx244|m quits [*.net *.split]
22:53:17cruller quits [*.net *.split]
22:53:17mpeter|m quits [*.net *.split]
22:53:17Minkafighter|m quits [*.net *.split]
22:53:17tomodachi94 quits [*.net *.split]
22:53:17@rewby|m quits [*.net *.split]
22:53:17flashfire42|m quits [*.net *.split]
22:53:17theblazehen|m quits [*.net *.split]
22:53:17gamer191-1|m quits [*.net *.split]
22:53:17justauser|m quits [*.net *.split]
22:53:17igneousx quits [*.net *.split]
22:53:17audrooku|m quits [*.net *.split]
22:53:17britmob|m quits [*.net *.split]
22:53:17mind_combatant quits [*.net *.split]
22:53:17@Sanqui|m quits [*.net *.split]
22:53:17anon00001|m quits [*.net *.split]
22:53:17Ajay quits [*.net *.split]
22:55:45flashfire42|m (flashfire42) joins
23:00:24Sanqui|m (Sanqui) joins
23:00:24EvanBoehs|m joins
23:00:24theblazehen|m joins
23:00:24nstrom|m joins
23:00:24@ChanServ sets mode: +o Sanqui|m
23:00:24joepie91|m joins
23:00:24yzqzss (yzqzss) joins
23:00:24DigitalDragon joins
23:00:24britmob|m joins
23:00:24tech234a|m-backup (tech234a) joins
23:00:24audrooku|m joins
23:00:24igneousx (igneousx) joins
23:00:24mpeter|m joins
23:00:24Thibaultmol joins
23:00:24masterx244|m (masterx244|m) joins
23:00:24akaibu|m joins
23:00:24Fletcher (Fletcher) joins
23:00:24wrangle|m joins
23:00:24superusercode joins
23:00:24jwoglom|m joins
23:00:24Cydog|m joins
23:00:24Explo joins
23:00:24Roki_100|m joins
23:00:24phaeton (phaeton) joins
23:00:24nosamu|m joins
23:00:24aaq|m joins
23:00:24tech234a (tech234a) joins
23:00:24th3z0l4|m joins
23:00:24that_lurker|m joins
23:00:24hexagonwin|m joins
23:00:25justauser|m (justauser|m) joins
23:00:25ph0rcys (ph0rcys) joins
23:00:25rewby|m (rewby) joins
23:00:25xxia|m joins
23:00:25@ChanServ sets mode: +o rewby|m
23:00:25schwarzkatz|m joins
23:00:25Ajay joins
23:00:25GRBaset (GRBaset) joins
23:00:25Minkafighter|m joins
23:00:25CrispyAlice2 joins
23:00:25x9fff00 (x9fff00) joins
23:00:25moe-a-m|m joins
23:00:25thermospheric joins
23:00:25Vokun (Vokun) joins
23:00:25iCesenberk|m joins
23:00:25s-crypt|m|m joins
23:00:25mikolaj|m joins
23:00:25cruller joins
23:00:25l0rd_enki|m joins
23:00:25tomodachi94 (tomodachi94) joins
23:00:25anon00001|m joins
23:00:25Hans5958 joins
23:00:25mind_combatant (mind_combatant) joins
23:00:25Video joins
23:00:25gamer191-1|m joins
23:00:25Ember|m joins
23:00:25Exorcism|m (exorcism) joins
23:00:26haha-whered-it-go|m joins
23:00:26supermariofan67|m joins
23:00:26spearcat|m joins
23:00:26ax|m joins
23:00:26starg2|m joins
23:00:26djasldjasldjalsdj|m joins
23:00:26NickS|m joins
23:00:27trumad|m joins
23:00:27v1cs joins
23:00:27Adamvoltagex|m joins
23:00:27its_notjack (its_notjack) joins
23:00:27osiride|m joins
23:00:27Fijxu|m joins
23:00:29bogsen (bogsen) joins
23:00:29vics joins
23:03:37useretail__ joins
23:03:42wickedplayer494 quits [Ping timeout: 268 seconds]
23:04:06wickedplayer494 (wickedplayer494) joins
23:06:42useretail_ quits [Ping timeout: 268 seconds]
23:21:34etnguyen03 quits [Client Quit]
23:30:45BitByBit4 quits [Ping timeout: 268 seconds]