00:11:24dvd_ joins
00:14:37dvd__ quits [Ping timeout: 252 seconds]
01:13:39lunik173 quits [Quit: Ping timeout (120 seconds)]
01:15:44lunik173 joins
01:16:39Sanqui_ joins
01:16:42Sanqui_ quits [Changing host]
01:16:42Sanqui_ (Sanqui) joins
01:16:42@ChanServ sets mode: +o Sanqui_
01:17:52@Sanqui quits [Ping timeout: 252 seconds]
01:27:04Arcorann (Arcorann) joins
01:44:58BlueMaxima joins
01:46:13hitgrr8 quits [Client Quit]
02:25:19qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
02:45:50Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
02:46:17Terbium joins
03:59:28<pabs>https://www.abc.net.au/news/2023-04-16/melbourne-box-hill-history-photos-archive/102147004
04:37:38BlueMaxima quits [Read error: Connection reset by peer]
05:49:28nicolas17 quits [Client Quit]
07:01:43qwertyasdfuiopghjkl quits [Remote host closed the connection]
07:04:30hitgrr8 joins
07:04:42qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
07:30:41@Sanqui_ is now known as @Sanqui
09:25:33tbc1887_ quits [Read error: Connection reset by peer]
11:51:00DiscantX quits [Ping timeout: 265 seconds]
11:52:07sec^nd quits [Ping timeout: 245 seconds]
11:52:32HackMii_ quits [Ping timeout: 245 seconds]
11:54:33HackMii_ (hacktheplanet) joins
11:54:41DiscantX joins
11:54:46sec^nd (second) joins
12:50:56sec^nd quits [Remote host closed the connection]
12:51:23sec^nd (second) joins
14:03:50Arcorann quits [Ping timeout: 252 seconds]
14:04:20qwertyasdfuiopghjkl quits [Remote host closed the connection]
14:06:33qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
14:35:55pie_ quits [Ping timeout: 252 seconds]
14:40:41pie_ joins
15:12:58pie_ quits [Client Quit]
15:13:12pie_ joins
15:50:01sec^nd quits [Ping timeout: 245 seconds]
16:10:34<Doranwen>how does one generate all possible URLs? I'm trying to find the filenames for some pdfs which I know must exist on a server, but I don't know the exact names, and manually typing every possibility to check is definitely out of the question
16:11:15Doranwen figures this is semi-on-topic but she's doing this for a private project, not public archival, so...
16:11:40<Doranwen>The only things I can find are this ancient Windows software: http://www.spadixbd.com/freetools/urlgen.htm
16:11:47<Doranwen>or this which only does sequential numbers: http://the-mostly.ru/online_url_generator_with_sequential_numbers.html
16:12:17<Doranwen>or this which says it'll do letters as well as numbers but clicking on Generate just clears the field and gives no results ever: https://streamtest.in/tools/url-generator
16:13:55<Doranwen>(I'm a bit amused that the ancient Windows software's download link is dead - but WBM has the file, lol)
16:20:00<Doranwen>And unfortunately the ancient Windows software (while it does run on Wine) only does numeric sequences, not alphanumeric.
16:22:50<Doranwen>The one other thing I turn up is a payware newer Windows program: https://www.exisoftware.com/url_generator/
16:26:38sec^nd (second) joins
16:28:35Doranwen is quite sure this can be done with a simple script but doesn't know how to write one to do it
16:52:19tzt quits [Ping timeout: 252 seconds]
17:13:11pie_ quits [Client Quit]
17:13:18qwertyasdfuiopghjkl quits [Client Quit]
17:13:47pie_ joins
17:19:45qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
17:38:43<spirit>what does *all possible urls* mean?
17:40:51<@JAA>Doranwen: First step is figuring out the patterns. When there's an easy way to e.g. enumerate all content with an ID, then you just generate the URLs using the range of possible IDs. I usually do that with seq plus either sed or awk, but there are many ways.
17:41:18<@JAA>s/an ID/a numeric, sequential ID/
17:42:35<Doranwen>I did manage to guess at the ones I was trying to find and found them, but someday I need to learn how to generate them, lol.
17:43:20<@JAA>I have no idea what tools are available to sensibly do this on Windows. It's definitely one of the tasks where the typical Unixoid system tools shine.
17:43:28<Doranwen>It was sections of an ebook that were available as pdfs but there was no overall page with them linked (presumably they'd used some html5-based player to load each section), so I had to work off various searches and whatnot.
17:44:17<Doranwen>Yeah, I knew it'd be way easier on Linux than Windows, it just requires knowledge I don't have... yet. I'll get there eventually, lol. Too many other tasks pulling at my attention to devote to mastering everything at once. But I did work a bit at learning regex the other day so that was something.
17:47:51qwertyasdfuiopghjkl quits [Remote host closed the connection]
17:48:01qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
18:19:27BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
19:06:45<masterX244>got some examples of the URLs?
19:14:42BearFortress joins
19:33:01Craigle quits [Quit: The Lounge - https://thelounge.chat]
19:33:36Craigle (Craigle) joins
20:39:56spirit quits [Quit: Leaving]
20:58:58hitgrr8 quits [Client Quit]
21:06:02onetruth joins
21:24:44tzt (tzt) joins
22:10:10<thuban>Doranwen: i use https://github.com/OJ/gobuster, but like almost all tools of this type it requires a wordlist.
22:13:41<thuban>googling around will find you wordlists commonly used for directory guessing, but you'll probably want to use something custom; you may find tools like seq, exrex (https://github.com/asciimoo/exrex), or combinator (https://hashcat.net/wiki/doku.php?id=hashcat_utils) useful in generating it.
22:19:44<thuban>(you _can_ do true alphanumeric brute-forcing this way, but i don't recommend it for long strings or if you have any idea whatever of the pattern you're looking for)
22:20:28<Doranwen>thuban: yeah, I figured out the ones I was missing - or at least most of them - by guessing at the initial letters for the section (it was a math textbook again, lol)
22:20:42<Doranwen>And then I did find links to the chapters as a whole so got more of the book that way.
22:21:27<thuban>nice
22:22:08<Doranwen>I found in one spot they have the entire books available to browse online but I can't see any pdfs exposed - just get pngs of each page and I have to click to get each of those to load, so that's too much work. We'll see - they may be on IA already, or I may run across them somewhere else. I'm trying things here and there to see how much I can find.
22:22:50<thuban>pngs of each page? that sounds very scriptable. holler if you need me
22:25:34<Doranwen>thuban: will do, I hate to ask right away so I'll see if I can find anything elsewhere first, but otherwise... that site is *full* of stuff XD
22:42:41Iki1 joins
22:45:58AnotherIki quits [Ping timeout: 252 seconds]
23:08:37Craigle8 (Craigle) joins
23:09:43<Doranwen>Ha, I was trying to figure out the first part of the book - because I'd found the chapters, and stumbled across the end stuff - and then I noticed the end stuff was "em" - guessed that was "end material", so I tried "fm" for "front material" and that was it. Feeling rather satisfied right now. XD
23:10:58pie_[bnc] joins
23:10:59BearFortress_ joins
23:12:04tzt_ (tzt) joins
23:12:07jess_ (jess) joins
23:12:12BearFortress quits [Client Quit]
23:12:13Craigle quits [Client Quit]
23:12:13pie_ quits [Client Quit]
23:12:13qwertyasdfuiopghjkl quits [Client Quit]
23:12:13Hackerpcs quits [Remote host closed the connection]
23:12:13tzt quits [Remote host closed the connection]
23:12:13jess quits [Remote host closed the connection]
23:12:13SketchCow quits [Remote host closed the connection]
23:12:13Craigle8 is now known as Craigle
23:12:26SketchCow joins
23:12:28Hackerpcs (Hackerpcs) joins
23:12:37qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
23:23:36BlueMaxima joins
23:48:40<pabs>https://paulstamatiou.com/digital-clutter/ https://news.ycombinator.com/item?id=35584866
23:58:36nicolas17 joins