00:13:40thuban joins
00:24:59Church (Church) joins
00:25:02lennier1 (lennier1) joins
01:44:40immibis quits [Ping timeout: 252 seconds]
01:45:35immibis (immibis) joins
01:46:19myself quits [Ping timeout: 252 seconds]
01:47:47myself joins
02:14:31Ketchup901 quits [Remote host closed the connection]
02:14:59Ketchup901 (Ketchup901) joins
02:17:44zander quits [Remote host closed the connection]
02:21:01tzt quits [Ping timeout: 265 seconds]
02:42:55tzt (tzt) joins
03:03:30sonick quits [Client Quit]
03:39:29<@JAA>I'm grabbing the Turner Classic Movies forums with qwarc since a few hours. As usual, just topic pages, nothing else. The site is geolocked to North America it seems, and it has pretty strict rate limits in its CloudFront configuration. I ran it through AB in December, but it looks like that probably missed some content due to the rate limit; I haven't analysed the impact in more detail. My qwarc run
03:39:35<@JAA>is easily on track to finishing in time before the shutdown on the 15th.
03:51:10katocala quits [Ping timeout: 252 seconds]
05:01:18nepeat quits [Quit: ZNC - https://znc.in]
05:01:24lun4 quits [Quit: Ping timeout (120 seconds)]
05:01:24ave quits [Quit: Ping timeout (120 seconds)]
05:01:43nepeat (nepeat) joins
05:01:44lun4 (lun4) joins
05:01:44ave (ave) joins
05:21:32qwertyasdfuiopghjkl joins
05:53:44Ketchup901 quits [Remote host closed the connection]
05:54:46Ketchup901 (Ketchup901) joins
06:04:16hackbug quits [Ping timeout: 252 seconds]
06:05:25<mgrandi>TCM has (had?) forums? Wild
06:30:37katocala joins
06:44:17umgr036 quits [Remote host closed the connection]
06:46:04immibis quits [Ping timeout: 252 seconds]
06:46:24jacksonchen667 (jacksonchen666) joins
06:47:10jacksonchen666 quits [Ping timeout: 252 seconds]
06:47:31umgr036 joins
06:52:27immibis (immibis) joins
06:57:12<@JAA>So what do we want to do for Issuu? I'm not sure there's a way to identify the content that will go down. There doesn't seem to be a way to enumerate users or documents. Best we could do is start with a list of users and then fan out by collecting followers. Unfortunately, the following list isn't public it seems, so the value of that may be limited.
07:02:16<@JAA>'Stacks' (collections of documents uploaded possibly by other users) can help I guess.
07:10:03<@JAA>Following list may actually be possible, and that should work reasonably well except for very obscure content, I suppose.
07:22:43<@JAA>Oh, they actually have useful sitemaps. Not sitemap.xml, but the ones listed on robots.txt.
07:24:39<@OrIdow6>Oh
07:26:31<@OrIdow6>https://e.issuu.com/config/2111115.json seems to enumerate *something*, the way the dates are nonincreasing with the IDs and that it's on a domain that seems to do embeds makes me wonder if it's not comprehensive
07:44:43sonick (sonick) joins
07:46:10Arachnophine quits [Quit: Arachnophine]
07:47:08Arachnophine (Arachnophine) joins
08:10:22<pabs>https://openports.se/ shut down
08:35:02sec^nd quits [Ping timeout: 276 seconds]
08:41:31sec^nd (second) joins
08:49:49immibis quits [Ping timeout: 252 seconds]
09:04:07@dxrt quits [Ping timeout: 252 seconds]
09:04:56sec^nd quits [Ping timeout: 276 seconds]
09:07:30Ryz quits [Ping timeout: 265 seconds]
09:10:21sec^nd (second) joins
09:11:41Ryz (Ryz) joins
09:13:14Megame (Megame) joins
09:16:21dxrt joins
09:16:23dxrt quits [Changing host]
09:16:23dxrt (dxrt) joins
09:16:23@ChanServ sets mode: +o dxrt
09:21:57hitgrr8 joins
09:23:22Ryz quits [Ping timeout: 252 seconds]
09:23:55@dxrt quits [Ping timeout: 252 seconds]
09:34:00dxrt joins
09:34:02dxrt quits [Changing host]
09:34:02dxrt (dxrt) joins
09:34:02@ChanServ sets mode: +o dxrt
09:34:35Ryz (Ryz) joins
09:39:24@dxrt quits [Ping timeout: 265 seconds]
09:40:58Ryz quits [Ping timeout: 252 seconds]
09:45:57dxrt joins
09:45:58dxrt quits [Changing host]
09:45:58dxrt (dxrt) joins
09:45:58@ChanServ sets mode: +o dxrt
09:46:05Ryz (Ryz) joins
09:54:19T31M quits [Quit: ZNC - https://znc.in]
09:54:36T31M joins
09:57:25@dxrt quits [Client Quit]
09:58:22dxrt joins
09:58:24dxrt quits [Changing host]
09:58:24dxrt (dxrt) joins
09:58:24@ChanServ sets mode: +o dxrt
10:05:01umgr036 quits [Remote host closed the connection]
10:05:14umgr036 joins
10:18:22Jon quits [Read error: Connection reset by peer]
10:31:45jmtd joins
10:37:37Megame quits [Client Quit]
10:43:20Island quits [Read error: Connection reset by peer]
12:42:56umgr036 quits [Remote host closed the connection]
12:43:09umgr036 joins
12:43:51umgr036 quits [Remote host closed the connection]
12:44:04umgr036 joins
12:54:34Arcorann_ quits [Ping timeout: 252 seconds]
12:57:31hackbug (hackbug) joins
14:09:02jacksonchen667 quits [Client Quit]
14:09:22jacksonchen666 (jacksonchen666) joins
14:25:49jacksonchen666 quits [Client Quit]
14:26:10jacksonchen666 (jacksonchen666) joins
14:43:02HP_Archivist (HP_Archivist) joins
15:04:09@dxrt quits [Client Quit]
15:06:09dxrt joins
15:06:11dxrt quits [Changing host]
15:06:11dxrt (dxrt) joins
15:06:11@ChanServ sets mode: +o dxrt
15:08:00HP_Archivist quits [Client Quit]
15:12:23@dxrt quits [Client Quit]
15:13:49dxrt joins
15:14:01dxrt quits [Changing host]
15:14:01dxrt (dxrt) joins
15:14:01@ChanServ sets mode: +o dxrt
15:20:32@dxrt quits [Client Quit]
15:21:02dxrt joins
15:21:02dxrt quits [Changing host]
15:21:02dxrt (dxrt) joins
15:21:02@ChanServ sets mode: +o dxrt
15:29:38@dxrt quits [Client Quit]
15:31:00dxrt joins
15:31:02dxrt quits [Changing host]
15:31:02dxrt (dxrt) joins
15:31:02@ChanServ sets mode: +o dxrt
16:39:07treora quits [Remote host closed the connection]
16:39:09treora joins
16:40:16treora quits [Remote host closed the connection]
16:40:19treora joins
16:48:07spirit quits [Read error: Connection reset by peer]
16:48:27spirit joins
17:23:09<spirit>has anyone made a 4players.de file archive?
17:23:14<spirit>files/downloads
17:23:56<spirit>https://archive.org/search?query=4players only shows a few warcs
17:34:06<spirit>yikes, http://counterstrike.4pforen.4players.de/index.php is getting spammed to death :/
19:03:39benjins2__ quits [Read error: Connection reset by peer]
19:14:12<@JAA>OrIdow6: Interesting find. The ID does not appear on https://issuu.com/carrefourromania/docs/national-16 FWIW.
19:14:35<@OrIdow6>JAA: Yeah
19:18:07<@JAA>spirit: IIRC, we ran a bunch of that through ArchiveBot. Don't remember how complete that was though.
19:35:07umgr036 quits [Remote host closed the connection]
19:38:38umgr036 joins
20:04:16qwertyasdfuiopghjkl quits [Remote host closed the connection]
20:10:34michaelblob_ quits [Read error: Connection reset by peer]
20:33:21Island joins
21:20:41BlueMaxima joins
21:28:28immibis (immibis) joins
21:32:08LeGoupil joins
21:42:46benjins2 joins
21:45:34TheTechRobo quits [Remote host closed the connection]
22:05:06hitgrr8 quits [Client Quit]
22:10:58TheTechRobo (TheTechRobo) joins
22:39:42LeGoupil quits [Client Quit]