00:05:35Ivan226 joins
00:10:48andrew quits [Read error: Connection reset by peer]
00:10:56andrew (andrew) joins
00:11:26icedice quits [Client Quit]
00:13:23icedice (icedice) joins
00:20:57icedice quits [Client Quit]
00:42:58DopefishJustin quits [Ping timeout: 252 seconds]
00:44:41<Jake>I _believe_ they feed a version that doesn't require JS with some user-agents?
00:53:00umgr036 quits [Remote host closed the connection]
00:53:14umgr036 joins
00:55:49<pabs>JAA, qwertyasdfuiopghjkl: btw, the opensource.com download URLs are not gated behind a cookie. so a local script could do the forms, then all the URLs could be passed to AB !ao <
00:56:10<pabs>rewby, rewby|backup ^
00:56:42<@JAA>I suspected that might be the case but didn't have time to look at it yet. Nice!
00:57:24<@JAA>Although we'd want to somehow preserve the mapping between the page and the download URL, too.
00:58:42<pabs>there is no such mapping it seems, but I'll check what the form does
00:58:52<qwertyasdfuiopghjkl>pabs: I think you could just manually do one form and use the cookie for all the pages since it didn't look like it depended on the specific page (haven't checked though)
01:01:03Ophanim (Ophanim) joins
01:01:33nostalgebraist quits [Client Quit]
01:04:38AK quits [Remote host closed the connection]
01:05:25AK (AK) joins
01:06:27<pabs>indeed, same cookie for each page
01:07:06<pabs>just the value you gave works for me too :)
01:07:06AK quits [Remote host closed the connection]
01:13:47DopefishJustin joins
01:18:54<pabs>qwertyasdfuiopghjkl: any thoughts on how to preserve the mapping between the page and the download URL? I couldn't figure this out
01:20:43<@JAA>That sounds promising. I guess we could just WARC the download section with cookies. Will need to take a closer look to determine whether that can go into the WBM or not.
01:23:26<pabs>if we can get AB to set the cookie, then it would work fully. not sure if the WBM allows such manipulated content though?
01:24:24<@JAA>AB doesn't take cookies.
01:24:39<@JAA>For the WBM, there's a good amount of grey area.
01:27:41<pabs>the other thing is the pages with downloads are under /downloads but there are some under https://opensource.com/open-organization/resources/book-series with URLs under /open-organisation/resources/
01:27:58<pabs>also the /downloads URL is paginated
01:29:27<pabs>I'll get the AB job kicked off later today so we have the rest of the site
01:57:34vokunal|m joins
02:01:04HP_Archivist quits [Ping timeout: 252 seconds]
02:28:54Guest50 quits [Client Quit]
02:47:11BlueMaxima joins
03:14:50tbc1887 (tbc1887) joins
03:30:48dumbgoy quits [Ping timeout: 265 seconds]
03:36:42tbc1887 quits [Read error: Connection reset by peer]
04:06:03dumbgoy joins
04:15:13Campbell joins
04:37:11Island quits [Read error: Connection reset by peer]
04:40:28BlueMaxima quits [Read error: Connection reset by peer]
05:20:30Guest50 joins
05:41:25nicolas17 quits [Client Quit]
05:43:39retromouse (retromouse) joins
05:47:47retromouse quits [Client Quit]
05:52:27Campbell quits [Client Quit]
05:58:08Icyelut quits [Client Quit]
06:10:31sonick quits [Client Quit]
06:20:00Icyelut (Icyelut) joins
06:36:06Guest50 quits [Client Quit]
06:50:25Icyelut quits [Client Quit]
06:54:07BigBoris joins
07:01:47lexikiq quits [Client Quit]
07:14:06Icyelut (Icyelut) joins
07:26:02spirit joins
07:30:08Guest50 joins
07:58:09AK (AK) joins
08:04:48TastyWiener95 quits [Ping timeout: 252 seconds]
08:05:24<AK>Lounge crashed so I've lost anything between 1AM UTC and now. If anyone needed me they'll have to try again :)
08:16:02user_ joins
08:18:22umgr036 quits [Ping timeout: 252 seconds]
08:47:42DopefishJustin quits [Ping timeout: 252 seconds]
09:33:18spirit quits [Ping timeout: 265 seconds]
09:37:12spirit joins
09:45:23icedice (icedice) joins
09:46:38<icedice>Hi Sanqui / Sanqui|m
09:47:18<icedice>I reached out to a PokéCommunity admin about getting an ArchiveBot IP whitelisted and they're asking me if I can confirm the IP
09:47:33<icedice>Which IP would we be archiving it with?
10:04:31Ruthalas5 quits [Ping timeout: 252 seconds]
10:05:10Ruthalas5 (Ruthalas) joins
10:05:41Ivan226 quits [Ping timeout: 265 seconds]
10:07:12<threedeeitguy>!a https://transfer.archivete.am/zS98L/link6.txt
10:07:37<threedeeitguy>oops, ignore that!
10:12:17moe-a-m|m joins
10:35:22Jonboy3452 joins
10:38:48Jonboy3451 quits [Ping timeout: 252 seconds]
10:40:26Hae joins
11:03:00user_ quits [Remote host closed the connection]
11:03:35user_ joins
11:07:27CaldeiraG (CaldeiraG) joins
11:26:56Jonboy3452 quits [Read error: Connection reset by peer]
11:36:44Icyelut quits [Ping timeout: 252 seconds]
11:40:40MRX joins
11:40:48MRX quits [Remote host closed the connection]
11:42:58user_ quits [Ping timeout: 252 seconds]
11:42:58icedice quits [Ping timeout: 252 seconds]
11:45:03Icyelut (Icyelut) joins
12:22:14icedice (icedice) joins
12:23:28Icyelut quits [Client Quit]
12:54:35eroc19901 is now known as eroc1990
13:30:38Icyelut (Icyelut) joins
13:36:20HP_Archivist (HP_Archivist) joins
13:37:11AlsoHP_Archivist joins
13:41:13HP_Archivist quits [Ping timeout: 252 seconds]
13:53:03Wingy1 (Wingy) joins
13:54:58Wingy quits [Ping timeout: 252 seconds]
13:54:58Wingy1 is now known as Wingy
13:57:41<Hans5958|m>On the tracker web page, what's the difference between "claims" and "todo"?
14:02:39AlsoHP_Archivist quits [Client Quit]
14:03:01HP_Archivist (HP_Archivist) joins
14:08:55Icyelut|2 (Icyelut) joins
14:12:34Icyelut quits [Ping timeout: 252 seconds]
14:16:03sonick (sonick) joins
14:20:33HP_Archivist quits [Client Quit]
14:20:44DopefishJustin (DopefishJustin) joins
14:22:38HP_Archivist (HP_Archivist) joins
14:24:56<@kaz>Hans5958|m: todo is things that need doing
14:25:05<@kaz>claims are jobs that have been picked up by a worker but not yet marked as completed
14:25:09<joepie91|m>Hans5958: claims are those tasks which have already been picked up by someone's warrior, and which will eventually be completed (but may not be if the user disappears), and todo are the tasks which have *not* yet been picked up
14:25:14<joepie91|m>claims are reserved, essentially
14:25:33<joepie91|m>usually towards the end of the project, the claims all get cleared out to give other warriors a chance to pick up the ones that were abandoned
14:31:10Chris5010 quits [Quit: ]
14:37:56HP_Archivist quits [Client Quit]
14:38:01umgr036 joins
14:38:51umgr036 quits [Remote host closed the connection]
14:39:05umgr036 joins
15:03:54Arcorann quits [Ping timeout: 252 seconds]
15:26:04<icedice>Does anyone know when Sanqui usually gets online?
15:27:01<icedice>I have an admin asking which IP address to whitelist in CloudFlare so that we can archive The PokéCommunity site and I don't want to keep them waiting forever
15:27:08Guest50 quits [Client Quit]
15:27:28<icedice>I should probably have asked him that yesterday if I had had any foresight
15:27:29nicolas17 joins
15:30:00Guest50 joins
15:31:54Guest50 quits [Client Quit]
15:32:29Guest50 joins
15:34:10Chris5010 (Chris5010) joins
15:48:23<@Sanqui>JAA: can you please determine a pipeline fit for this purpose and give icedice the IP address?
15:48:32Ivan226 joins
15:48:45<@Sanqui>icedice: if you need me please PM, I can easily miss messages in chats otherwise
15:49:21jtagcat quits [Quit: Bye!]
15:52:47Island joins
15:57:16jtagcat (jtagcat) joins
15:57:39<Hans5958|m>icedice, thank you for the explanation.
16:02:37<icedice>Sanqui: Ah right, I'll keep that in mind in the future
16:03:05<icedice>Hans5958|m: I think you got the wrong guy :)
16:03:25<Hans5958|m>Oh damn
16:03:44<Hans5958|m>11pm didn't help much
16:03:59<Hans5958|m>kaz, joepie91 🏳️‍🌈, thank you for the explanation!
16:04:21<icedice>I know how it is
16:05:14<icedice>I misread a website name and caused some confusion when asking about errors in the wrong archival job here yesterday
16:05:25<icedice>My brain was pretty fried on three hours of sleep then
16:13:19<icedice>We could probably try reaching out to the admins of various forums that we don't have time to crawl and ask them for lists of Imgur links
16:18:36Emitewiki joins
16:19:49<Emitewiki>I'm sure this has already been discussed here, but are there any ongoing archival efforts surrounding the imminent wipe of data on Imgur?
16:19:56<icedice>#imgone
16:23:33<icedice>Looks like the bill that will kill YouTube is back from the dead yet again: https://www.congress.gov/bill/118th-congress/house-bill/2801
16:24:35<icedice>I heard that they're going to try and get it shoved into some budget bill so that it will get rammed through
16:33:00Chris5010 quits [Ping timeout: 252 seconds]
16:40:41Guest50 quits [Client Quit]
16:44:28threedeeitguy quits [Client Quit]
16:45:01threedeeitguy joins
16:57:44HiccupJul (HiccupJul) joins
16:57:54<HiccupJul>Is there a good tool for exporting a whole github repo, including github specific data? (not just doing `git clone --mirror`)
17:04:27Guest50 joins
17:15:36<icedice>I would imagine there'd have to be something by now after all the trouble youtube-dl had with lost issue tracker threads when they got C&D'd
17:16:44<@JAA>Last time I looked into it, there was nothing that covers *everything*. But see the wiki page on GitHub for some tools.
17:17:04<@JAA>This is part of why I started writing codearchiver. The GitHub module isn't ready yet though.
17:17:22<@JAA>(It will use the GraphQL API to retrieve everything that's accessible to a normal user.)
17:18:08<icedice>I'm guessing this is related to Lockpick_RCM getting DMCA'd by Nintendo before the end of today
17:18:39<icedice>Or they already got DMCA'd, but the removal happens a day later from that
17:19:01<@JAA>Would be a shame if someone had already archived it... :-)
17:19:11<icedice>Yeah, I saw
17:19:25<icedice>Well done
17:20:31nostalgebraist joins
17:23:21<icedice>Too bad git.rip and offshoregit are no longer a thing
17:23:43<icedice>Well, I guess git.rip had its issue with leaked stuff
17:24:13<icedice>Still though, DMCA ignored Git hosting seems to be something that will be needed in the future
17:24:47<icedice>Though, I guess GitHub's $1 million legal defense fund should be able to get Lockpick_RCM back
17:24:58<icedice>Assuming GitHub is willing to stand up for them
17:25:25<icedice>* issues
17:27:27BigBoris quits [Ping timeout: 265 seconds]
17:37:05Guest50 quits [Client Quit]
17:37:12that_lurker quits [Quit: ZNC 1.8.2+deb1+focal2 - https://znc.in]
17:37:54that_lurker (that_lurker) joins
17:40:59HiccupJul quits [Ping timeout: 265 seconds]
17:42:34Guest50 joins
17:42:41that_lurker quits [Client Quit]
17:43:00that_lurker (that_lurker) joins
17:43:29that_lurker quits [Client Quit]
17:43:48that_lurker (that_lurker) joins
17:45:01that_lurker quits [Client Quit]
17:45:45that_lurker (that_lurker) joins
17:49:19wrangle|m joins
18:44:10therubberduckie joins
18:48:22za4k joins
18:50:52za3k quits [Ping timeout: 252 seconds]
19:04:07Ivan226 quits [Ping timeout: 265 seconds]
19:09:01za4k quits [Ping timeout: 252 seconds]
19:27:06retromouse (retromouse) joins
19:27:24retromouse quits [Client Quit]
19:33:14Guest50 quits [Client Quit]
19:35:22Guest50 joins
19:56:06jacksonchen666 (jacksonchen666) joins
20:00:35<h2ibot>JAABot edited CurrentWarriorProject (+6): https://wiki.archiveteam.org/?diff=49738&oldid=49728
20:03:54jacksonchen666 quits [Remote host closed the connection]
20:04:15jacksonchen666 (jacksonchen666) joins
20:08:14pabs quits [Ping timeout: 252 seconds]
20:14:35pabs (pabs) joins
20:32:31Guest50 quits [Client Quit]
20:32:55Guest50 joins
20:38:29Guest50 quits [Client Quit]
20:38:46hitgrr8 joins
20:39:41lexikiq joins
20:40:00Guest50 joins
20:44:43Guest50 quits [Ping timeout: 252 seconds]
20:48:01andrew quits [Ping timeout: 252 seconds]
20:48:53andrew (andrew) joins
20:56:44Guest50 joins
21:12:23andrew3 (andrew) joins
21:12:46andrew quits [Ping timeout: 252 seconds]
21:12:46andrew3 is now known as andrew
21:31:59jacksonchen666 quits [Client Quit]
21:35:16andrew4 (andrew) joins
21:36:22andrew quits [Ping timeout: 265 seconds]
21:36:22andrew4 is now known as andrew
21:39:13Guest50 quits [Client Quit]
21:40:14Guest50 joins
21:48:02BigBoris joins
21:54:43hitgrr8 quits [Client Quit]
22:26:47Guest50 quits [Client Quit]
22:31:10Guest50 joins
22:45:10hackbug quits [Ping timeout: 252 seconds]
22:45:26hackbug (hackbug) joins
22:50:18hackbug quits [Ping timeout: 252 seconds]
23:11:23Jake quits [Client Quit]
23:11:37Jake (Jake) joins
23:15:53icedice quits [Client Quit]
23:17:09hackbug (hackbug) joins
23:30:49pabs quits [Ping timeout: 252 seconds]
23:58:27pabs (pabs) joins