| 00:05:35 | | Ivan226 joins |
| 00:10:48 | | andrew quits [Read error: Connection reset by peer] |
| 00:10:56 | | andrew (andrew) joins |
| 00:11:26 | | icedice quits [Client Quit] |
| 00:13:23 | | icedice (icedice) joins |
| 00:20:57 | | icedice quits [Client Quit] |
| 00:42:58 | | DopefishJustin quits [Ping timeout: 252 seconds] |
| 00:44:41 | <Jake> | I _believe_ they feed a version that doesn't require JS with some user-agents? |
| 00:53:00 | | umgr036 quits [Remote host closed the connection] |
| 00:53:14 | | umgr036 joins |
| 00:55:49 | <pabs> | JAA, qwertyasdfuiopghjkl: btw, the opensource.com download URLs are not gated behind a cookie. so a local script could do the forms, then all the URLs could be passed to AB !ao < |
| 00:56:10 | <pabs> | rewby, rewby|backup ^ |
| 00:56:42 | <@JAA> | I suspected that might be the case but didn't have time to look at it yet. Nice! |
| 00:57:24 | <@JAA> | Although we'd want to somehow preserve the mapping between the page and the download URL, too. |
| 00:58:42 | <pabs> | there is no such mapping it seems, but I'll check what the form does |
| 00:58:52 | <qwertyasdfuiopghjkl> | pabs: I think you could just manually do one form and use the cookie for all the pages since it didn't look like it depended on the specific page (haven't checked though) |
| 01:01:03 | | Ophanim (Ophanim) joins |
| 01:01:33 | | nostalgebraist quits [Client Quit] |
| 01:04:38 | | AK quits [Remote host closed the connection] |
| 01:05:25 | | AK (AK) joins |
| 01:06:27 | <pabs> | indeed, same cookie for each page |
| 01:07:06 | <pabs> | just the value you gave works for me too :) |
| 01:07:06 | | AK quits [Remote host closed the connection] |
| 01:13:47 | | DopefishJustin joins |
| 01:13:47 | | DopefishJustin is now authenticated as DopefishJustin |
| 01:18:54 | <pabs> | qwertyasdfuiopghjkl: any thoughts on how to preserve the mapping between the page and the download URL? I couldn't figure this out |
| 01:20:43 | <@JAA> | That sounds promising. I guess we could just WARC the download section with cookies. Will need to take a closer look to determine whether that can go into the WBM or not. |
| 01:23:26 | <pabs> | if we can get AB to set the cookie, then it would work fully. not sure if the WBM allows such manipulated content though? |
| 01:24:24 | <@JAA> | AB doesn't take cookies. |
| 01:24:39 | <@JAA> | For the WBM, there's a good amount of grey area. |
| 01:27:41 | <pabs> | the other thing is the pages with downloads are under /downloads but there are some under https://opensource.com/open-organization/resources/book-series with URLs under /open-organisation/resources/ |
| 01:27:58 | <pabs> | also the /downloads URL is paginated |
| 01:29:27 | <pabs> | I'll get the AB job kicked off later today so we have the rest of the site |
| 01:57:34 | | vokunal|m joins |
| 02:01:04 | | HP_Archivist quits [Ping timeout: 252 seconds] |
| 02:28:54 | | Guest50 quits [Client Quit] |
| 02:47:11 | | BlueMaxima joins |
| 03:14:50 | | tbc1887 (tbc1887) joins |
| 03:30:48 | | dumbgoy quits [Ping timeout: 265 seconds] |
| 03:36:42 | | tbc1887 quits [Read error: Connection reset by peer] |
| 04:06:03 | | dumbgoy joins |
| 04:15:13 | | Campbell joins |
| 04:37:11 | | Island quits [Read error: Connection reset by peer] |
| 04:40:28 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 05:20:30 | | Guest50 joins |
| 05:41:25 | | nicolas17 quits [Client Quit] |
| 05:43:39 | | retromouse (retromouse) joins |
| 05:47:47 | | retromouse quits [Client Quit] |
| 05:52:27 | | Campbell quits [Client Quit] |
| 05:58:08 | | Icyelut quits [Client Quit] |
| 06:10:31 | | sonick quits [Client Quit] |
| 06:20:00 | | Icyelut (Icyelut) joins |
| 06:36:06 | | Guest50 quits [Client Quit] |
| 06:50:25 | | Icyelut quits [Client Quit] |
| 06:54:07 | | BigBoris joins |
| 07:01:47 | | lexikiq quits [Client Quit] |
| 07:14:06 | | Icyelut (Icyelut) joins |
| 07:26:02 | | spirit joins |
| 07:30:08 | | Guest50 joins |
| 07:58:09 | | AK (AK) joins |
| 08:04:48 | | TastyWiener95 quits [Ping timeout: 252 seconds] |
| 08:05:24 | <AK> | Lounge crashed so I've lost anything between 1AM UTC and now. If anyone needed me they'll have to try again :) |
| 08:16:02 | | user_ joins |
| 08:18:22 | | umgr036 quits [Ping timeout: 252 seconds] |
| 08:47:42 | | DopefishJustin quits [Ping timeout: 252 seconds] |
| 09:33:18 | | spirit quits [Ping timeout: 265 seconds] |
| 09:37:12 | | spirit joins |
| 09:45:23 | | icedice (icedice) joins |
| 09:46:38 | <icedice> | Hi Sanqui / Sanqui|m |
| 09:47:18 | <icedice> | I reached out to a PokéCommunity admin about gettig an ArchiveBot IP whitelisted and they're asking me if I can confirm the IP |
| 09:47:33 | <icedice> | Which IP would we be archiving it with? |
| 10:04:31 | | Ruthalas5 quits [Ping timeout: 252 seconds] |
| 10:05:10 | | Ruthalas5 (Ruthalas) joins |
| 10:05:41 | | Ivan226 quits [Ping timeout: 265 seconds] |
| 10:07:12 | <threedeeitguy> | !a https://transfer.archivete.am/zS98L/link6.txt |
| 10:07:37 | <threedeeitguy> | oops, ignore that! |
| 10:12:17 | | moe-a-m|m joins |
| 10:35:22 | | Jonboy3452 joins |
| 10:38:48 | | Jonboy3451 quits [Ping timeout: 252 seconds] |
| 10:40:26 | | Hae joins |
| 10:56:44 | | Hae is now authenticated as Hae |
| 11:03:00 | | user_ quits [Remote host closed the connection] |
| 11:03:35 | | user_ joins |
| 11:07:27 | | CaldeiraG (CaldeiraG) joins |
| 11:26:56 | | Jonboy3452 quits [Read error: Connection reset by peer] |
| 11:36:44 | | Icyelut quits [Ping timeout: 252 seconds] |
| 11:40:40 | | MRX joins |
| 11:40:48 | | MRX quits [Remote host closed the connection] |
| 11:42:58 | | user_ quits [Ping timeout: 252 seconds] |
| 11:42:58 | | icedice quits [Ping timeout: 252 seconds] |
| 11:45:03 | | Icyelut (Icyelut) joins |
| 12:22:14 | | icedice (icedice) joins |
| 12:23:28 | | Icyelut quits [Client Quit] |
| 12:54:35 | | eroc19901 is now known as eroc1990 |
| 13:30:38 | | Icyelut (Icyelut) joins |
| 13:36:20 | | HP_Archivist (HP_Archivist) joins |
| 13:37:11 | | AlsoHP_Archivist joins |
| 13:41:13 | | HP_Archivist quits [Ping timeout: 252 seconds] |
| 13:53:03 | | Wingy1 (Wingy) joins |
| 13:54:58 | | Wingy quits [Ping timeout: 252 seconds] |
| 13:54:58 | | Wingy1 is now known as Wingy |
| 13:57:41 | <Hans5958|m> | On the tracker web page, what's the difference between "claims" and "todo"? |
| 14:02:39 | | AlsoHP_Archivist quits [Client Quit] |
| 14:03:01 | | HP_Archivist (HP_Archivist) joins |
| 14:08:55 | | Icyelut|2 (Icyelut) joins |
| 14:12:34 | | Icyelut quits [Ping timeout: 252 seconds] |
| 14:16:03 | | sonick (sonick) joins |
| 14:20:33 | | HP_Archivist quits [Client Quit] |
| 14:20:44 | | DopefishJustin (DopefishJustin) joins |
| 14:22:38 | | HP_Archivist (HP_Archivist) joins |
| 14:24:56 | <@kaz> | Hans5958|m: todo is things than need doing |
| 14:25:05 | <@kaz> | claims ar e jobs that have been picked up by a worker but not yet marked as completed] |
| 14:25:09 | <joepie91|m> | Hans5958: claims are those tasks which have already been picked up by someone's warrior, and which will eventually be completed (but may not be if the user disappears), and todo are the tasks which have *not* yet been picked up |
| 14:25:14 | <joepie91|m> | claims are reserved, essentially |
| 14:25:33 | <joepie91|m> | usually towards the end of the project, the claims all get cleared out to give other warriors a chance to pick up the ones that were abandoned |
| 14:31:10 | | Chris5010 quits [Quit: ] |
| 14:37:56 | | HP_Archivist quits [Client Quit] |
| 14:38:01 | | umgr036 joins |
| 14:38:51 | | umgr036 quits [Remote host closed the connection] |
| 14:39:05 | | umgr036 joins |
| 15:03:54 | | Arcorann quits [Ping timeout: 252 seconds] |
| 15:26:04 | <icedice> | Does anyone know when Sanqui usually gets online? |
| 15:27:01 | <icedice> | I have an admin asking which IP address to whitelist in CloudFlare so that we can archive The PokéCommunity site and I don't want to keep them waiting forever |
| 15:27:08 | | Guest50 quits [Client Quit] |
| 15:27:28 | <icedice> | I should probably have asked him that yesterday if I had had any foresight |
| 15:27:29 | | nicolas17 joins |
| 15:30:00 | | Guest50 joins |
| 15:31:54 | | Guest50 quits [Client Quit] |
| 15:32:29 | | Guest50 joins |
| 15:34:10 | | Chris5010 (Chris5010) joins |
| 15:48:23 | <@Sanqui> | JAA: can you please determine a pipeline fit for this purpose and give icedice the IP address? |
| 15:48:32 | | Ivan226 joins |
| 15:48:45 | <@Sanqui> | icedice: if you need me please PM, I can easily miss messages in chats otherwise |
| 15:49:21 | | jtagcat quits [Quit: Bye!] |
| 15:52:47 | | Island joins |
| 15:57:16 | | jtagcat (jtagcat) joins |
| 15:57:39 | <Hans5958|m> | icedice, thank you for the explanation. |
| 16:02:37 | <icedice> | Sanqui: Ah right, I'll keep that in mind in the future |
| 16:03:05 | <icedice> | Hans5958|m: I think you got the wrong guy :) |
| 16:03:25 | <Hans5958|m> | Oh damn |
| 16:03:44 | <Hans5958|m> | 11pm didn't help much |
| 16:03:59 | <Hans5958|m> | kaz, joepie91 🏳️🌈, thank you for the explanation! |
| 16:04:21 | <icedice> | I know how it is |
| 16:05:14 | <icedice> | I misread a website name and caused some confusion when asking about errors in the wrong archivation job here yesterday |
| 16:05:25 | <icedice> | My brain was pretty fried on three hours of sleep then |
| 16:13:19 | <icedice> | We could probably try reaching out to the admins of various forums that we don't have time to crawl and ask them for lists of Imgur links |
| 16:18:36 | | Emitewiki joins |
| 16:19:49 | <Emitewiki> | I'm sure this has already been discussed here, but are there any ongoing archival efforts surrounding the imminent wipe of data on Imgur? |
| 16:19:56 | <icedice> | #imgone |
| 16:23:33 | <icedice> | Looks like the bill that will kill YouTube is back from the dead yet again: https://www.congress.gov/bill/118th-congress/house-bill/2801 |
| 16:24:35 | <icedice> | I heard that they're going to try and get it showed into some budget bill so that it will get rammed through |
| 16:33:00 | | Chris5010 quits [Ping timeout: 252 seconds] |
| 16:40:41 | | Guest50 quits [Client Quit] |
| 16:44:28 | | threedeeitguy quits [Client Quit] |
| 16:45:01 | | threedeeitguy joins |
| 16:57:44 | | HiccupJul (HiccupJul) joins |
| 16:57:54 | <HiccupJul> | Is there a good tool for exporting a whole github repo, including github specific data? (not just doing `git clone --mirror`) |
| 17:04:27 | | Guest50 joins |
| 17:15:36 | <icedice> | I would imagine there'd have to be something by now after all the trouble youtube-dl had with lost issue tracker threads when they got C&D'd |
| 17:16:44 | <@JAA> | Last time I looked into it, there was nothing that covers *everything*. But see the wiki page on GitHub for some tools. |
| 17:17:04 | <@JAA> | This is part of why I started writing codearchiver. The GitHub module isn't ready yet though. |
| 17:17:22 | <@JAA> | (It will use the GraphQL API to retrieve everything that's accessible to a normal user.) |
| 17:18:08 | <icedice> | I'm guessing this is related to Lockpick_RCM getting DMCA'd by Nintendo before the end of today |
| 17:18:39 | <icedice> | Or they already got DMCA'd, but the removal happens a day later from that |
| 17:19:01 | <@JAA> | Would be a shame if someone had already archived it... :-) |
| 17:19:11 | <icedice> | Yeah, I saw |
| 17:19:25 | <icedice> | Well done |
| 17:20:31 | | nostalgebraist joins |
| 17:23:21 | <icedice> | Too bad git.rip and offshoregit are no longer a thing |
| 17:23:43 | <icedice> | Well, I guess git.rip had it's issue with leaked stuff |
| 17:24:13 | <icedice> | Still though, DMCA ignored Git hosting seems to be something that will be needed in the future |
| 17:24:47 | <icedice> | Though, I guess GitHub's $1 million legal defense fund should be able to get Lockpick_RCM back |
| 17:24:58 | <icedice> | Assuming GitHub is willing to stand up for them |
| 17:25:25 | <icedice> | * issues |
| 17:27:27 | | BigBoris quits [Ping timeout: 265 seconds] |
| 17:37:05 | | Guest50 quits [Client Quit] |
| 17:37:12 | | that_lurker quits [Quit: ZNC 1.8.2+deb1+focal2 - https://znc.in] |
| 17:37:54 | | that_lurker (that_lurker) joins |
| 17:40:59 | | HiccupJul quits [Ping timeout: 265 seconds] |
| 17:42:34 | | Guest50 joins |
| 17:42:41 | | that_lurker quits [Client Quit] |
| 17:43:00 | | that_lurker (that_lurker) joins |
| 17:43:29 | | that_lurker quits [Client Quit] |
| 17:43:48 | | that_lurker (that_lurker) joins |
| 17:45:01 | | that_lurker quits [Client Quit] |
| 17:45:45 | | that_lurker (that_lurker) joins |
| 17:49:19 | | wrangle|m joins |
| 18:44:10 | | therubberduckie joins |
| 18:48:22 | | za4k joins |
| 18:50:52 | | za3k quits [Ping timeout: 252 seconds] |
| 19:04:07 | | Ivan226 quits [Ping timeout: 265 seconds] |
| 19:09:01 | | za4k quits [Ping timeout: 252 seconds] |
| 19:27:06 | | retromouse (retromouse) joins |
| 19:27:24 | | retromouse quits [Client Quit] |
| 19:33:14 | | Guest50 quits [Client Quit] |
| 19:35:22 | | Guest50 joins |
| 19:56:06 | | jacksonchen666 (jacksonchen666) joins |
| 20:00:35 | <h2ibot> | JAABot edited CurrentWarriorProject (+6): https://wiki.archiveteam.org/?diff=49738&oldid=49728 |
| 20:03:54 | | jacksonchen666 quits [Remote host closed the connection] |
| 20:04:15 | | jacksonchen666 (jacksonchen666) joins |
| 20:08:14 | | pabs quits [Ping timeout: 252 seconds] |
| 20:14:35 | | pabs (pabs) joins |
| 20:32:31 | | Guest50 quits [Client Quit] |
| 20:32:55 | | Guest50 joins |
| 20:38:29 | | Guest50 quits [Client Quit] |
| 20:38:46 | | hitgrr8 joins |
| 20:39:41 | | lexikiq joins |
| 20:40:00 | | Guest50 joins |
| 20:44:43 | | Guest50 quits [Ping timeout: 252 seconds] |
| 20:48:01 | | andrew quits [Ping timeout: 252 seconds] |
| 20:48:53 | | andrew (andrew) joins |
| 20:56:44 | | Guest50 joins |
| 21:12:23 | | andrew3 (andrew) joins |
| 21:12:46 | | andrew quits [Ping timeout: 252 seconds] |
| 21:12:46 | | andrew3 is now known as andrew |
| 21:31:59 | | jacksonchen666 quits [Client Quit] |
| 21:35:16 | | andrew4 (andrew) joins |
| 21:36:22 | | andrew quits [Ping timeout: 265 seconds] |
| 21:36:22 | | andrew4 is now known as andrew |
| 21:39:13 | | Guest50 quits [Client Quit] |
| 21:40:14 | | Guest50 joins |
| 21:48:02 | | BigBoris joins |
| 21:54:43 | | hitgrr8 quits [Client Quit] |
| 22:26:47 | | Guest50 quits [Client Quit] |
| 22:31:10 | | Guest50 joins |
| 22:45:10 | | hackbug quits [Ping timeout: 252 seconds] |
| 22:45:26 | | hackbug (hackbug) joins |
| 22:50:18 | | hackbug quits [Ping timeout: 252 seconds] |
| 23:11:23 | | Jake quits [Client Quit] |
| 23:11:37 | | Jake (Jake) joins |
| 23:15:53 | | icedice quits [Client Quit] |
| 23:17:09 | | hackbug (hackbug) joins |
| 23:30:49 | | pabs quits [Ping timeout: 252 seconds] |
| 23:58:27 | | pabs (pabs) joins |