| 00:11:24 | | dvd_ joins |
| 00:14:37 | | dvd__ quits [Ping timeout: 252 seconds] |
| 01:13:39 | | lunik173 quits [Quit: Ping timeout (120 seconds)] |
| 01:15:44 | | lunik173 joins |
| 01:16:39 | | Sanqui_ joins |
| 01:16:41 | | Sanqui_ is now authenticated as Sanqui |
| 01:16:42 | | Sanqui_ quits [Changing host] |
| 01:16:42 | | Sanqui_ (Sanqui) joins |
| 01:16:42 | | @ChanServ sets mode: +o Sanqui_ |
| 01:17:52 | | @Sanqui quits [Ping timeout: 252 seconds] |
| 01:27:04 | | Arcorann (Arcorann) joins |
| 01:44:58 | | BlueMaxima joins |
| 01:46:13 | | hitgrr8 quits [Client Quit] |
| 02:25:19 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 02:45:50 | | Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.] |
| 02:46:17 | | Terbium joins |
| 03:59:28 | <pabs> | https://www.abc.net.au/news/2023-04-16/melbourne-box-hill-history-photos-archive/102147004 |
| 04:37:38 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 05:49:28 | | nicolas17 quits [Client Quit] |
| 07:01:43 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 07:04:30 | | hitgrr8 joins |
| 07:04:42 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 07:30:41 | | @Sanqui_ is now known as @Sanqui |
| 09:25:33 | | tbc1887_ quits [Read error: Connection reset by peer] |
| 11:51:00 | | DiscantX quits [Ping timeout: 265 seconds] |
| 11:52:07 | | sec^nd quits [Ping timeout: 245 seconds] |
| 11:52:32 | | HackMii_ quits [Ping timeout: 245 seconds] |
| 11:54:33 | | HackMii_ (hacktheplanet) joins |
| 11:54:41 | | DiscantX joins |
| 11:54:46 | | sec^nd (second) joins |
| 12:50:56 | | sec^nd quits [Remote host closed the connection] |
| 12:51:23 | | sec^nd (second) joins |
| 14:03:50 | | Arcorann quits [Ping timeout: 252 seconds] |
| 14:04:20 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 14:06:33 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 14:35:55 | | pie_ quits [Ping timeout: 252 seconds] |
| 14:40:41 | | pie_ joins |
| 15:12:58 | | pie_ quits [Client Quit] |
| 15:13:12 | | pie_ joins |
| 15:50:01 | | sec^nd quits [Ping timeout: 245 seconds] |
| 16:10:34 | <Doranwen> | how does one generate all possible URLs? I'm trying to find the filenames for some pdfs which I know must exist on a server, but I don't know the exact names, and manually typing every possibility to check is definitely out of the question |
| 16:11:15 | | Doranwen figures this is semi-on-topic but she's doing this for a private project, not public archival, so... |
| 16:11:40 | <Doranwen> | The only things I can find are this ancient Windows software: http://www.spadixbd.com/freetools/urlgen.htm |
| 16:11:47 | <Doranwen> | or this which only does sequential numbers: http://the-mostly.ru/online_url_generator_with_sequential_numbers.html |
| 16:12:17 | <Doranwen> | or this which says it'll do letters as well as numbers but clicking on Generate just clears the field and gives no results ever: https://streamtest.in/tools/url-generator |
| 16:13:55 | <Doranwen> | (I'm a bit amused that the ancient Windows software's download link is dead - but WBM has the file, lol) |
| 16:20:00 | <Doranwen> | And unfortunately the ancient Windows software (while it does run on Wine) only does numeric sequences, not alphanumeric. |
| 16:22:50 | <Doranwen> | The one other thing I turn up is a payware newer Windows program: https://www.exisoftware.com/url_generator/ |
| 16:26:38 | | sec^nd (second) joins |
| 16:28:35 | | Doranwen is quite sure this can be done with a simple script but doesn't know how to write one to do it |
| 16:52:19 | | tzt quits [Ping timeout: 252 seconds] |
| 17:13:11 | | pie_ quits [Client Quit] |
| 17:13:18 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 17:13:47 | | pie_ joins |
| 17:19:45 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 17:38:43 | <spirit> | what does *all possible urls* mean? |
| 17:40:51 | <@JAA> | Doranwen: First step is figuring out the patterns. When there's an easy way to e.g. enumerate all content with an ID, then you just generate the URLs using the range of possible IDs. I usually do that with seq plus either sed or awk, but there are many ways. |
| 17:41:18 | <@JAA> | s/an ID/a numeric, sequential ID/ |
| 17:42:35 | <Doranwen> | I did manage to guess at the ones I was trying to find and found them, but someday I need to learn how to generate them, lol. |
| 17:43:20 | <@JAA> | I have no idea what tools are available to sensibly do this on Windows. It's definitely one of the tasks where the typical Unixoid system tools shine. |
| 17:43:28 | <Doranwen> | It was sections of an ebook that were available as pdfs but there was no overall page with them linked (presumably they'd used some html5-based player to load each section), so I had to work off various searches and whatnot. |
| 17:44:17 | <Doranwen> | Yeah, I knew it'd be way easier on Linux than Windows, it just requires knowledge I don't have... yet. I'll get there eventually, lol. Too many other tasks pulling at my attention to devote to mastering everything at once. But I did work a bit at learning regex the other day so that was something. |
| 17:47:51 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 17:48:01 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 18:19:27 | | BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 19:06:45 | <masterX244> | got some examples of the URLs? |
| 19:14:42 | | BearFortress joins |
| 19:33:01 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
| 19:33:36 | | Craigle (Craigle) joins |
| 20:39:56 | | spirit quits [Quit: Leaving] |
| 20:58:58 | | hitgrr8 quits [Client Quit] |
| 21:06:02 | | onetruth joins |
| 21:24:44 | | tzt (tzt) joins |
| 22:10:10 | <thuban> | Doranwen: i use https://github.com/OJ/gobuster, but like almost all tools of this type it requires a wordlist. |
| 22:13:41 | <thuban> | googling around will find you wordlists commonly used for directory guessing, but you'll probably want to use something custom; you may find tools like seq, exrex (https://github.com/asciimoo/exrex), or combinator (https://hashcat.net/wiki/doku.php?id=hashcat_utils) useful in generating it. |
| 22:19:44 | <thuban> | (you _can_ do true alphanumeric brute-forcing this way, but i don't recommend it for long strings or if you have any idea whatever of the pattern you're looking for) |
| 22:20:28 | <Doranwen> | thuban: yeah, I figured out the ones I was missing - or at least most of them - by guessing at the initial letters for the section (it was a math textbook again, lol) |
| 22:20:42 | <Doranwen> | And then I did find links to the chapters as a whole so got more of the book that way. |
| 22:21:27 | <thuban> | nice |
| 22:22:08 | <Doranwen> | I found in one spot they have the entire books available to browse online but I can't see any pdfs exposed - just get pngs of each page and I have to click to get each of those to load, so that's too much work. We'll see - they may be on IA already, or I may run across them somewhere else. I'm trying things here and there to see how much I can find. |
| 22:22:50 | <thuban> | pngs of each page? that sounds very scriptable. holler if you need me |
| 22:25:34 | <Doranwen> | thuban: will do, I hate to ask right away so I'll see if I can find anything elsewhere first, but otherwise... that site is *full* of stuff XD |
| 22:42:41 | | Iki1 joins |
| 22:45:58 | | AnotherIki quits [Ping timeout: 252 seconds] |
| 23:08:37 | | Craigle8 (Craigle) joins |
| 23:09:43 | <Doranwen> | Ha, I was trying to figure out the first part of the book - because I'd found the chapters, and stumbled across the end stuff - and then I noticed the end stuff was "em" - guessed that was "end material", so I tried "fm" for "front material" and that was it. Feeling rather satisfied right now. XD |
| 23:10:58 | | pie_[bnc] joins |
| 23:10:59 | | BearFortress_ joins |
| 23:12:04 | | tzt_ (tzt) joins |
| 23:12:07 | | jess_ (jess) joins |
| 23:12:12 | | BearFortress quits [Client Quit] |
| 23:12:13 | | Craigle quits [Client Quit] |
| 23:12:13 | | pie_ quits [Client Quit] |
| 23:12:13 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 23:12:13 | | Hackerpcs quits [Remote host closed the connection] |
| 23:12:13 | | tzt quits [Remote host closed the connection] |
| 23:12:13 | | jess quits [Remote host closed the connection] |
| 23:12:13 | | SketchCow quits [Remote host closed the connection] |
| 23:12:13 | | Craigle8 is now known as Craigle |
| 23:12:26 | | SketchCow joins |
| 23:12:28 | | Hackerpcs (Hackerpcs) joins |
| 23:12:37 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 23:23:36 | | BlueMaxima joins |
| 23:48:40 | <pabs> | https://paulstamatiou.com/digital-clutter/ https://news.ycombinator.com/item?id=35584866 |
| 23:58:36 | | nicolas17 joins |