03:51:43 | | DogsRNice joins |
03:56:35 | | SootBector quits [Remote host closed the connection] |
03:57:45 | | SootBector (SootBector) joins |
04:07:17 | | DogsRNice quits [Read error: Connection reset by peer] |
05:10:29 | | Webuser910297 joins |
05:37:05 | <Webuser910297> | JAA then how I can archive such sites with broken HTTPS? |
05:37:33 | <@JAA> | (Context from #archivebot: 'do you also know how to force SPN to use HTTP? I found some weird site on which HTTPS connections always gives you 403') |
05:37:55 | <@JAA> | Webuser910297: I don't know of a way to do it with SPN. |
08:04:18 | | SootBector quits [Remote host closed the connection] |
08:05:26 | | SootBector (SootBector) joins |
08:19:29 | | nicolas17 quits [Ping timeout: 260 seconds] |
08:25:58 | | nicolas17 joins |
08:33:29 | | nicolas17 quits [Ping timeout: 260 seconds] |
08:35:40 | | nicolas17 joins |
09:21:36 | <pabs> | Webuser910297: maybe try the force_get option from https://docs.google.com/document/d/1Nsv52MvSjbLb2PCpHlat0gkzw0EvtSgpKHu4mk0MnrA/edit?tab=t.0#heading=h.uu61fictja6r |
10:00:12 | <Webuser910297> | It works! Thank you pabs. http://web.archive.org/web/20250921095635/http://185.236.24.241 |
10:38:48 | <Webuser910297> | pabs++ |
10:38:48 | <eggdrop> | [karma] 'pabs' now has 129 karma! |
13:17:45 | <pabs> | JAA: ^ |
13:39:44 | | BearFortress quits [Ping timeout: 260 seconds] |
13:40:16 | <Webuser910297> | Goodbye. |
13:40:24 | | Webuser910297 quits [Quit: Ooops, wrong browser tab.] |
13:43:10 | | BearFortress joins |
15:48:37 | | ThreeHM (ThreeHeadedMonkey) joins |
16:57:34 | <@JAA> | It'd be nice if this wasn't login-walled. |
16:58:25 | <@JAA> | But interesting; that would imply that it's the HEAD thing that 'upgrades' to HTTPS. |
17:02:18 | | DogsRNice joins |
17:48:47 | <TheTechRobo> | Might also be the browser doing it, I guess. |
17:50:59 | <@JAA> | The HEAD is done outside of the browser IIRC? So if force_get really just goes directly to the GET in browser, I don't think so. |
17:54:34 | | X-Scale quits [Ping timeout: 258 seconds] |
18:13:08 | <TheTechRobo> | Documentation suggests that force_get=1 skips the HEAD and does the simple GET request rather than the browser. |
18:13:18 | <TheTechRobo> | > Force the use of a simple HTTP GET request to capture the target URL. By default SPN2 does a HTTP HEAD on the target URL to decide whether to use a headless browser or a simple HTTP GET request. force_get overrides this behavior. |
20:42:56 | <@JAA> | Exactly |
20:43:17 | <@JAA> | Which suggests to me that the HTTPS rewriting must happen in the HEAD stage. |