00:00:47dm4v quits [Read error: Connection reset by peer]
00:01:40dm4v joins
00:01:43dm4v quits [Changing host]
00:01:43dm4v (dm4v) joins
00:06:23nertzy_ quits [Client Quit]
00:58:15britm0b quits [Ping timeout: 244 seconds]
01:03:49dm4v_ joins
01:04:19dm4v quits [Ping timeout: 252 seconds]
01:04:19dm4v_ is now known as dm4v
01:04:19dm4v quits [Changing host]
01:04:19dm4v (dm4v) joins
01:08:12tommyshinebox46 joins
01:10:39tommyshinebox quits [Ping timeout: 244 seconds]
01:11:24tommyshinebox joins
01:11:55tommyshinebox46 quits [Remote host closed the connection]
01:46:08<@JAA>I'm grabbing Gamasutra with qwarc now. Specifically, I'm grabbing the pagination of https://www.gamasutra.com/updates/ https://www.gamasutra.com/blogs/expert/ https://www.gamasutra.com/blogs/member/ , the articles linked on those, and the comments (which are the actual target of all this).
01:48:01<Ryz>On the user comments, does it cover the comments pagination too? oo;
01:48:13<@JAA>I haven't seen any pagination.
01:48:45<@JAA>Do you have an example?
01:51:26<Ryz>Mmpf, I had an example on hand at the time on watching the job... <#>;
01:51:37<Ryz>Hold on, it may take some digging...
01:51:49<@JAA>The AB job won't discover all news articles, by the way. The /updates/ pagination links break at page 800 (of ~3700).
01:52:15<Ryz>Ugh...
01:52:45<@JAA>For comments, I couldn't even find a good example with many comments. Biggest I found was only 50-ish.
01:53:15<Ryz>Well, there's this: https://gamasutra.com/blogs/RaminShokrizade/20171204/310918/The_Future_of_F2P_The_Force_Wars.php - 56 comments
01:53:41<Ryz>And there used to be pagination on the comments that's now replaced with '[View Older Comments]' link or button
01:54:16<@JAA>Ah, interesting.
01:54:46<Ryz>Here's one at 93 comments: https://gamasutra.com/blogs/RaminShokrizade/20131016/202489/Mastering_F2P_The_Titanic_Effect.php
01:55:28<Ryz>Look at the first version of this archived link; the comments content used to be part of the page and not dynamically loaded ;-; - https://web.archive.org/web/20131017235413/https://gamasutra.com/blogs/RaminShokrizade/20131016/202489/Mastering_F2P_The_Titanic_Effect.php
01:56:33<Ryz>I'm a bit more upset this was changed...
01:56:46<Ryz>Uuuuuuuuugh...
01:57:05<@JAA>If you have one with over 100, that'd be nice as another test case.
01:57:07<Ryz>Here's one at 123 comments: https://gamasutra.com/blogs/RaminShokrizade/20130626/194933/The_Top_F2P_Monetization_Tricks.php
01:57:11wizards joins
01:57:12<@JAA>Heh, perfect.
01:57:52<Ryz>Oh, it turns out there wasn't any comments pagination at all back then; that makes me more upset: https://web.archive.org/web/20131215175048/https://gamasutra.com/blogs/RaminShokrizade/20130626/194933/The_Top_F2P_Monetization_Tricks.php
01:59:36<Ryz>Is pagination on blogs broken too, JAA?
02:00:25wizards_ quits [Ping timeout: 252 seconds]
02:01:40<@JAA>Ryz: No, I think that one's fine. Didn't check every page though.
02:03:47<@JAA>No idea whether comments will replay in the WBM, by the way. There's a timestamp in the URL generated by the site's JS, so unless the WBM ignores that, I suspect it might not work.
02:04:38<Ryz>I used to archive these Gamasutra pages in the WBM via SPN until the stinking change to how comments are rendered; this is why I'm more personally upset by this
02:09:49<Ryz>Also because of personally witnessing a blog post disappearing or being deleted within 3 months of its lifespan D:
02:10:51<@JAA>Fun fact: the comments API endpoint returns the original content for banned users. 194933 has an example of that on the pagination.
02:11:07<Ryz>O#o;;;
02:11:09<@JAA>It's not displayed on the page though.
02:11:12<Ryz>Unhidden content?
02:11:44<@JAA>Anyway, comments pagination support added, restarting shortly.
02:15:05<Ryz>Loot, looooooooot
02:16:56<@JAA>I also just discovered that you can enumerate all comments via the API. Hmmmmm.... :-)
02:17:31<@JAA>I'll do that if there's still time. Don't want to overload their shoebox.
02:17:35<Ryz>Loot, loot, loot, loooooooooot! <#>;
02:18:11<Ryz>!ignore 2bt25n99mynh9vedkv6v58xya ^https?://www\.gamasutra\.com/blogs/.*/orzzzz\.com/
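A minimal sketch of what the ignore pattern above would and would not match, assuming the pattern is applied as a regular-expression search against each candidate URL. The sample URLs and the `is_ignored` helper are hypothetical and only serve as illustration; they are not taken from the job.

```python
import re

# The pattern from the !ignore command above, copied verbatim.
IGNORE_PATTERN = re.compile(r"^https?://www\.gamasutra\.com/blogs/.*/orzzzz\.com/")


def is_ignored(url: str) -> bool:
    """Return True if the URL matches the ignore pattern (hypothetical helper)."""
    return IGNORE_PATTERN.search(url) is not None


# Hypothetical URLs for illustration only.
samples = [
    "https://www.gamasutra.com/blogs/ExampleAuthor/20210101/1/orzzzz.com/x",
    "https://www.gamasutra.com/blogs/ExampleAuthor/20210101/1/Post.php",
    "https://orzzzz.com/",
]

for url in samples:
    print(is_ignored(url), url)
```

Only URLs under `/blogs/` on www.gamasutra.com that contain a later `/orzzzz.com/` path segment match, so ordinary blog posts and the off-site domain itself are left alone.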
02:18:39<@JAA>Might just run that through AB actually. It's only about 6k URLs.
02:24:39<@JAA>In other news, I've emailed E2BN about the Myths and Legends site. Let's see if they can fix it.
02:45:16<@JAA>Updates and blogs pagination is done, about 89k articles discovered.
02:46:37<Ryz>Hmm, do you cover press releases? Like https://www.gamasutra.com/pressreleases_index.php?page=117 ?
02:46:50<@JAA>No, because as far as I could see, there are no comments on them.
02:46:55<Ryz>May be valuable because GamePress content is only for logged-in users~
02:47:00<Ryz>I meant in terms of pagination~
02:47:16<@JAA>I'm doing this for the comments mostly, since those aren't grabbed by AB.
02:48:40<Ryz>Hmm, now I'm really curious if there was such a thing as a comment in a press release...I don't think so, but then again, I was reading more of the original content and less of the press releases (though they're useful for finding older content that's not covered anymore)
02:50:56<@JAA>Looks like this should take only about 4 hours now.
03:13:14qwertyasdfuiopghjkl joins
03:20:50Larsenv quits [Quit: ZNC 1.8.2+deb1+focal2 - https://znc.in]
03:29:22Larsenv (Larsenv) joins
03:41:04Viniter69 quits [Ping timeout: 252 seconds]
03:52:06qw3rty_ joins
03:54:26qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
03:55:55qw3rty__ quits [Ping timeout: 252 seconds]
03:57:57qwertyasdfuiopghjkl joins
04:09:02tech234a quits [Client Quit]
04:18:12nicolas17 quits [Client Quit]
04:41:34Earendil (Cobalt17) joins
04:48:15Earendil quits [Client Quit]
05:19:41tommyshinebox quits [Ping timeout: 244 seconds]
05:24:17qwertyasdfuiopghjkl91 joins
05:24:20qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
05:24:27qwertyasdfuiopghjkl91 is now known as qwertyasdfuiopghjkl
05:42:22BlueMaxima quits [Client Quit]
06:12:19tommyshinebox joins
06:27:03somerando3 quits [Remote host closed the connection]
07:16:19HackMii quits [Remote host closed the connection]
07:16:41HackMii (hacktheplanet) joins
07:34:57tzt joins
08:00:09britmob256 quits [Quit: britmob256]
09:14:50Ruthalas quits [Ping timeout: 250 seconds]
09:40:41Ruthalas (Ruthalas) joins
09:49:23Ruthalas quits [Ping timeout: 244 seconds]
09:53:05Ruthalas (Ruthalas) joins
09:56:06tommyshinebox quits [Ping timeout: 244 seconds]
10:30:47Megame quits [Client Quit]
11:45:04heart_ quits []
11:45:13heart_ joins
11:56:06sec^nd quits [Remote host closed the connection]
11:56:06mutantmonkey quits [Remote host closed the connection]
11:56:07HackMii quits [Write error: Broken pipe]
11:56:23mutantmonkey (mutantmonkey) joins
11:56:30sec^nd (second) joins
11:56:37HackMii (hacktheplanet) joins
11:59:40britmob256 joins
12:29:12C4K3 quits [Quit: leaving]
12:56:25nertzy_ joins
12:57:28march_happy joins
12:58:03<march_happy>Ah geez I used the wrong channel
12:59:29<march_happy>Long story short, UWP Autodesk Sketchbook was removed from the MS Store. I repacked an Appx, but it needs code signing so that others can click and install
13:03:41<march_happy>Does anyone here have a code signing certificate? The appx bundle is available on https://www.mediafire.com/file/n7jugwqjk0gp7v8/sketchbook.appx/file
13:04:22Hanan joins
13:04:46<march_happy>Also I wish to know if https://blogs.autodesk.com/sketchbookpro/ is fully covered by the URL project
13:08:20starship_86018 (starship_8601) joins
13:09:06Aoede_ (Aoede) joins
13:10:00simon8162 (simon816) joins
13:10:52atomicthumbs_ joins
13:10:58summerisle_ (summerisle) joins
13:12:03bleb joins
13:12:15archzz_ joins
13:12:35kiskaWee1 (kiska) joins
13:12:41avoozl1 joins
13:13:03Pingerfowder quits [Remote host closed the connection]
13:13:42Pingerfowder (Pingerfowder) joins
13:15:44HotSwap` joins
13:16:23HackMii quits [*.net *.split]
13:16:23sec^nd quits [*.net *.split]
13:16:23mutantmonkey quits [*.net *.split]
13:16:23programmerq quits [*.net *.split]
13:16:23simon816 quits [*.net *.split]
13:16:23kpcyrd quits [*.net *.split]
13:16:23HotSwap quits [*.net *.split]
13:16:23avoozl quits [*.net *.split]
13:16:23Aoede quits [*.net *.split]
13:16:23atomicthumbs quits [*.net *.split]
13:16:23summerisle quits [*.net *.split]
13:16:23starship_8601 quits [*.net *.split]
13:16:23cm quits [*.net *.split]
13:16:23kiskaWeebChat quits [*.net *.split]
13:16:23archzz quits [*.net *.split]
13:16:23starship_86018 is now known as starship_8601
13:18:47nertzy_ quits [Client Quit]
13:23:46kpcyrd (kpcyrd) joins
13:23:59programmerq (programmerq) joins
13:33:16Iki joins
13:38:55aleph quits [Ping timeout: 252 seconds]
13:39:25aleph joins
13:47:03ThreeHM quits [Ping timeout: 244 seconds]
13:48:55ThreeHM (ThreeHeadedMonkey) joins
13:53:46march_happy quits [Ping timeout: 244 seconds]
14:00:42Hanan quits [Client Quit]
14:10:10sec^nd (second) joins
14:18:32HackMii (hacktheplanet) joins
14:33:55jacobk quits [Ping timeout: 252 seconds]
14:36:46mutantmonkey (mutantmonkey) joins
14:41:06jtagcat quits [Quit: Bye!]
14:44:00jtagcat (jtagcat) joins
14:55:26Arcorann quits [Ping timeout: 250 seconds]
15:10:44tommyshinebox joins
15:14:51<Ryz>JAA, regarding https://www.gamasutra.com/ - are you also gonna grab the https://www.gamasutra.com/news pagination? This is just the same as https://www.gamasutra.com/updates but not linked anymore when checking WBM
16:11:12tommyshinebox quits [Ping timeout: 244 seconds]
16:22:49nicolas17 joins
16:38:35qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
16:57:19<@JAA>Ryz: If it's the same content, I won't waste time on it, no. But feel free to throw it into an !ao < job I guess.
17:01:30<@JAA>OnlyFans made a 540° and has suspended the plans to ban porn. lol
17:03:40<Ryz>...Time to keep the looting going?
17:08:19tzt quits [Changing host]
17:08:19tzt (tzt) joins
17:14:42<nicolas17>there's a conspiracy theory that a bank or payment processor that told them they had to ban porn, wants to buy them, and with this move they lowered their market value :P
17:55:04h3ndr1k quits [Quit: ]
17:55:27h3ndr1k (h3ndr1k) joins
18:13:57Megame (Megame) joins
18:14:23<@JAA>I got a reply from E2BN; they can't fix the technical issues with the Myths and Legends site, unfortunately.
18:16:27<@JAA>I also saw that several other sites of theirs are down: cookit.e2bn.org, historysheroes.e2bn.org, vcp.e2bn.org, abolition.e2bn.org. No clue when these disappeared though.
18:29:23h3ndr1k quits [Client Quit]
18:45:18<@JAA>http://gallery.nen.gov.uk/ (aka gallery.e2bn.org) appears to be owned/run by E2BN and may also be worth archiving sooner rather than later.
18:49:57nertzy_ joins
18:54:36Megame quits [Read error: Connection reset by peer]
18:55:00Megame (Megame) joins
19:04:41DogsRNice (Webuser299) joins
19:05:26nertzy_ quits [Client Quit]
19:06:35<@HCross>Kaz: probably has fond memories of E2BN
19:06:47<@Kaz>hello
19:06:53<@Kaz>I know the guys there - do we need a contact?
19:07:31<@Kaz>JAA: who'd you speak to?
19:33:11lennier2 joins
19:35:48lennier1 quits [Ping timeout: 244 seconds]
19:35:53lennier2 is now known as lennier1
19:44:05h3ndr1k (h3ndr1k) joins
19:54:52lennier1 quits [Ping timeout: 250 seconds]
19:57:46lennier1 (lennier1) joins
20:00:04lennier2 joins
20:02:09lennier1 quits [Ping timeout: 244 seconds]
20:02:16lennier2 is now known as lennier1
20:10:56sec^nd quits [Remote host closed the connection]
20:11:12sec^nd (second) joins
20:18:12mutantmonkey quits [Ping timeout: 258 seconds]
20:19:15mutantmonkey (mutantmonkey) joins
20:29:25mutantmonkey quits [Remote host closed the connection]
20:29:41mutantmonkey (mutantmonkey) joins
21:43:59<@JAA>rewby: Example for extracting ignored off-site links from a wpull DB: sqlite3 www.gamasutra.com-inf-20210823-172103-2bt25-wpull.db 'SELECT url FROM queued_urls JOIN url_strings ON url_string_id = url_strings.id WHERE status = "skipped" AND url NOT LIKE "%//www.gamasutra.com/%" AND inline_level is NULL AND url LIKE "http%"'
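The same extraction can be sketched in Python with the standard `sqlite3` module. The query is the one from the message above, with the string literals in single quotes (standard SQL) instead of double quotes. The `demo_connection` helper builds a reduced in-memory stand-in that contains only the columns the query touches; it is an assumption for illustration and not the full wpull schema, and its rows are made up.

```python
import sqlite3

# Query from the message above, rewritten with single-quoted string literals.
QUERY = """
SELECT url
FROM queued_urls
JOIN url_strings ON url_string_id = url_strings.id
WHERE status = 'skipped'
  AND url NOT LIKE '%//www.gamasutra.com/%'
  AND inline_level IS NULL
  AND url LIKE 'http%'
"""


def skipped_offsite_urls(conn: sqlite3.Connection) -> list[str]:
    """Return skipped, non-inline, off-site HTTP(S) URLs from a wpull-style DB."""
    return [row[0] for row in conn.execute(QUERY)]


def demo_connection() -> sqlite3.Connection:
    """Build a tiny in-memory DB (hypothetical data, reduced schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE url_strings (id INTEGER PRIMARY KEY, url TEXT);
        CREATE TABLE queued_urls (
            id INTEGER PRIMARY KEY,
            url_string_id INTEGER,
            status TEXT,
            inline_level INTEGER
        );
        """
    )
    conn.executemany(
        "INSERT INTO url_strings (id, url) VALUES (?, ?)",
        [
            (1, "https://www.gamasutra.com/blogs/"),
            (2, "https://example.org/offsite"),
            (3, "https://example.net/image.png"),
            (4, "ftp://example.org/file"),
            (5, "https://example.com/done"),
        ],
    )
    conn.executemany(
        "INSERT INTO queued_urls (url_string_id, status, inline_level) VALUES (?, ?, ?)",
        [
            (1, "done", None),      # on-site page, already fetched
            (2, "skipped", None),   # off-site link, skipped: this is what we want
            (3, "skipped", 1),      # off-site inline resource: excluded
            (4, "skipped", None),   # non-HTTP scheme: excluded
            (5, "done", None),      # off-site but not skipped: excluded
        ],
    )
    return conn


if __name__ == "__main__":
    for url in skipped_offsite_urls(demo_connection()):
        print(url)
```

To run it against a real job database, replace `demo_connection()` with `sqlite3.connect(path_to_db)`, where `path_to_db` points at the job's wpull `.db` file.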
22:19:49<h2ibot>JustAnotherArchivist edited Deathwatch (+137, /* 2021 */ Add DemoDrop): https://wiki.archiveteam.org/?diff=47068&oldid=47066
22:23:26<@JAA>DemoDrop seems to have quite strict rate limits. The AB job got blocked almost immediately.
22:26:38BlueMaxima joins
22:55:36melodykramer joins
23:00:12brsq (brsq) joins
23:04:43brsq quits [Remote host closed the connection]
23:16:56Iki quits [Ping timeout: 244 seconds]
23:34:31Iki joins
23:53:06Arcorann (Arcorann) joins
23:57:30lukash7 quits [Quit: The Lounge - https://thelounge.chat]