00:06:53lunik1 quits [Quit: Ping timeout (120 seconds)]
00:06:59lunik1 joins
00:40:01Arcorann quits [Ping timeout: 265 seconds]
00:47:16bonga quits [Ping timeout: 265 seconds]
00:48:15bonga joins
00:59:22bonga quits [Read error: Connection reset by peer]
01:00:15bonga joins
01:08:31GarveyPatrickD joins
01:14:19JackThompson joins
01:29:55GarveyPatrickD quits [Remote host closed the connection]
02:02:44GarveyPatrickD joins
02:03:13igloo222253 joins
02:03:38nepeat quits [Ping timeout: 265 seconds]
02:04:15dm4v quits [Ping timeout: 265 seconds]
02:04:36igloo22225 quits [Ping timeout: 265 seconds]
02:04:36igloo222253 is now known as igloo22225
02:04:53nepeat (nepeat) joins
02:22:23dm4v joins
02:22:25dm4v quits [Changing host]
02:22:25dm4v (dm4v) joins
02:28:46aamber joins
02:31:33<aamber>is anyone doing anything with the Duolingo forums? it's going down tomorrow. I'll try to grab what I can but I've never done this before
02:36:12<@OrIdow6>I believe that JAA has finished making their copy aamber
02:36:39<@OrIdow6>Or at any rate a "minimal" copy
02:40:03<@JAA>Yeah, I have 'something'. Just API data for the threads that I could find. Their servers are awful, so it's definitely incomplete, and I have no way really to estimate how incomplete it is.
02:40:38<@JAA>The images and avatars are running through ArchiveBot. Outlinks went through #// earlier.
02:40:50<aamber>how much? for which forums?
02:42:05<@JAA>11.9 GiB as gzipped WARCs
02:43:11<@JAA>I didn't list forums, just bruteforced comment IDs (via a 'secret' API endpoint to discover which comment IDs were discussion starters).
02:44:36<aamber>in ascending order?
02:44:48<@JAA>Random
02:46:30<@JAA>Looks like I got HTTP 200 responses for approximately 3.1 million threads. I have no idea what proportion of all threads that represents. Couldn't find global stats anywhere.
02:49:18<@JAA>I could also get a comments count, but that's pretty useless without something to compare it to.
02:50:55<aamber>Well, thank you for your efforts in any case.
02:51:01Arcorann (Arcorann) joins
02:57:40<aamber>Think I'll try for the top n in every forum.
02:58:31<@JAA>Trying to fix the 1% of IDs that weren't attempted at all before because of that filtering endpoint failing.
03:00:29<@JAA>Approximately 64k IDs that are supposed to exist according to that filtering failed in some way, by the way. Impossible to tell how many of those actually exist since the API is so broken. I might retry them though, maybe I'll get a couple dozen more threads from that.
03:09:40BlueMaxima quits [Read error: Connection reset by peer]
03:14:49Stiletto quits [Ping timeout: 265 seconds]
03:33:43sonick quits [Client Quit]
03:46:01GarveyPatrickD quits [Remote host closed the connection]
04:41:12<@OrIdow6>arkiver: Anything on that art site?
04:41:13<@OrIdow6>Buzzl
04:41:15<@OrIdow6>Buzzly
04:47:42Jake1 (Jake) joins
04:49:00<michaelblob>GarveyPatrickD: i'm grabbing rosetta wiki currently btw
04:49:54Jake quits [Ping timeout: 265 seconds]
04:49:54Jake1 is now known as Jake
05:06:06BlueMaxima joins
05:07:46Stiletto joins
05:46:57bonga quits [Read error: Connection reset by peer]
05:47:43bonga joins
06:48:47BlueMaxima quits [Read error: Connection reset by peer]
06:57:47march_happy quits [Remote host closed the connection]
07:06:27march_happy (march_happy) joins
07:20:05binzyboi quits [Remote host closed the connection]
07:23:11binzyboi joins
08:49:36qwertyasdfuiopghjkl joins
08:57:25sec^nd quits [Ping timeout: 252 seconds]
09:06:10sec^nd (second) joins
10:01:51Hackerpcs quits [Quit: Hackerpcs]
10:04:49Hackerpcs (Hackerpcs) joins
10:15:19march_happy quits [Ping timeout: 265 seconds]
10:16:06march_happy (march_happy) joins
10:20:38march_happy quits [Ping timeout: 265 seconds]
10:21:41march_happy (march_happy) joins
10:26:26avoozl quits [Ping timeout: 265 seconds]
10:26:55march_happy quits [Ping timeout: 265 seconds]
10:27:38march_happy (march_happy) joins
10:34:40march_happy quits [Read error: Connection reset by peer]
10:35:24march_happy (march_happy) joins
10:37:34march_happy quits [Read error: Connection reset by peer]
10:37:49march_happy (march_happy) joins
10:51:03aamber leaves
11:07:08user_ (gazorpazorp) joins
11:07:54gazorpazorp quits [Read error: Connection reset by peer]
11:08:31sonick (sonick) joins
11:20:44<sonick>On March 31, Japanese web hosting service 2Style.net will shut down. Announcement page: http://2style.net/support-end.html
11:21:38<@OrIdow6>Wow, there's a lot going down on the 31st
11:22:55<sonick>This hosting service runs on the below domains, and directories are assigned to users.
11:23:29<sonick>https://www.irccloud.com/pastebin/f5lMsYQy/
11:23:51<@OrIdow6>Looks like AB might be able to find everything?
11:24:08<@OrIdow6>Hm
11:24:39<sonick>No. Because they have no sitemap.
11:25:12<@OrIdow6>Well, they maybe kind of do for some hosts, but due to domain funkiness that wouldn't work
11:27:46<sonick>So I think the first thing we can do is collect the directories from CDX in WBM and throw them to AB.
11:29:16march_happy quits [Ping timeout: 265 seconds]
11:29:30march_happy (march_happy) joins
11:29:36lennier1 quits [Client Quit]
11:30:00lennier1 (lennier1) joins
11:33:07lunik1 quits [Read error: Connection reset by peer]
11:33:14lunik1 joins
11:40:11<h2ibot>OrIdow6 edited Deathwatch (+216, /* 2022 */ Add 2style): https://wiki.archiveteam.org/?diff=48404&oldid=48395
11:40:52<@OrIdow6>If you have any more details sonick (for instance the difference between plans β and γ, and α) please say them
11:40:55<@OrIdow6>I am going to sleep
11:45:34<sonick>The α plan is just the free plan with the ads eliminated, while the β and γ plans seem to be more enhanced plans with unlimited disk space, etc. The α and Free plans are the ones that will be closing this time.
11:53:26march_happy quits [Ping timeout: 265 seconds]
11:54:14march_happy (march_happy) joins
13:05:27Megame (Megame) joins
13:08:42Arcorann quits [Ping timeout: 265 seconds]
14:16:22JackThompson2 joins
14:18:55JackThompson quits [Ping timeout: 265 seconds]
14:18:55JackThompson2 is now known as JackThompson
14:20:24hackbug (hackbug) joins
14:43:34lunik1 quits [Client Quit]
14:43:43lunik1 joins
14:59:44GarveyPatrickD joins
15:29:06<michaelblob>GarveyPatrickD: https://archive.org/details/wiki-rosettacodeorg_mw
15:46:24lunik1 quits [Client Quit]
15:46:32lunik1 joins
16:14:35LeGoupil joins
16:26:34HackMii quits [Remote host closed the connection]
16:27:52HackMii (hacktheplanet) joins
16:36:58lunik1 quits [Client Quit]
16:37:03lunik1 joins
17:02:51LeGoupil1 joins
17:03:12anarchat quits [Read error: Connection reset by peer]
17:03:45sonick quits [Excess Flood]
17:03:52sonick (sonick) joins
17:04:34LeGoupil quits [Ping timeout: 265 seconds]
17:04:34LeGoupil1 is now known as LeGoupil
17:07:26Sluggs quits [Ping timeout: 240 seconds]
17:08:10anarcat (anarcat) joins
17:08:26Sluggs joins
17:16:19dvd (dvd) joins
17:17:01<GarveyPatrickD>michaelblob: Can you help me understand what that page is saying. There is a intermittently functioning mediawiki wiki at http://rosettacode.org that displays editable pages.
17:31:00<GarveyPatrickD>michaelblob: Is https://archive.org/details/wiki-rosettacodeorg_mw the means to clone or archive the contents of https://rosettacode.org/ ?
17:42:18<michaelblob>GarveyPatrickD: the archive.org page is a dump of the rosettacode.org mediawiki, simply a download of the pages and images from the wiki
17:43:33<michaelblob>i believe you can uncompress the archive and import it into mediawiki to create a clone
17:47:43march_happy quits [Ping timeout: 265 seconds]
17:57:34<GarveyPatrickD>michaelblob: I'm new here.
17:59:00<GarveyPatrickD>michaelblob: So, https://archive.org/details/wiki-rosettacodeorg_mw is a current archive copy of https://rosettacode.org
18:01:36<GarveyPatrickD>michaelblob: What do I need to do to pass that to https://archive.org/ so they can display the contents of https://archive.org/details/wiki-rosettacodeorg_mw as a set of pages?
18:11:51<michaelblob>GarveyPatrickD: let's move this to #wikiteam
18:12:52<GarveyPatrickD>michaelblob: OK
18:21:28LeGoupil quits [Client Quit]
18:21:47LeGoupil joins
18:26:01Megame quits [Client Quit]
19:08:26lunik1 quits [Ping timeout: 265 seconds]
19:09:04lunik1 joins
19:10:43dvd quits [Ping timeout: 265 seconds]
19:20:03<@JAA>GarveyPatrickD: Rosetta Code is running through ArchiveBot now.
19:23:53meowe quits [Quit: meow]
19:26:08meowe (meowe) joins
19:29:43anarcat quits [Remote host closed the connection]
19:31:16<@JAA>The Duolingo forums disappeared sometime in the past hour.
19:35:15<@JAA>It was throwing 500s about an hour ago. My last successful retrieval of a non-sentence discussion thread was at 17:53.
19:41:34lunik1 quits [Client Quit]
19:41:40lunik1 joins
19:42:45bonga quits [Ping timeout: 265 seconds]
19:42:59bonga joins
19:53:30<cptcobalt>is there an irc room for the current default Grab project?
19:54:51<@JAA>#Y
20:00:05lunik1 quits [Client Quit]
20:00:11lunik1 joins
20:07:34<h2ibot>JustAnotherArchivist created Distributed recursive crawls (+528, Created page with "{{Infobox project |…): https://wiki.archiveteam.org/?title=Distributed%20recursive%20crawls
20:08:34<h2ibot>JustAnotherArchivist created Grab (+42, Redirected page to [[Distributed recursive…): https://wiki.archiveteam.org/?title=Grab
20:09:25MrRadar quits [Quit: Rebooting]
20:12:20MrRadar (MrRadar) joins
20:30:10lunik1 quits [Client Quit]
20:30:17lunik1 joins
20:36:09lunik1 quits [Client Quit]
20:36:17lunik1 joins
20:39:56lunik1 quits [Client Quit]
20:40:02lunik1 joins
20:58:53SM joins
21:00:29dm4v quits [Client Quit]
21:02:53dm4v joins
21:02:56dm4v quits [Changing host]
21:02:56dm4v (dm4v) joins
21:04:55lunik1 quits [Ping timeout: 265 seconds]
21:09:18lunik1 joins
21:12:11jtagcat6 quits [Quit: Bye!]
21:13:56jtagcat6 (jtagcat) joins
21:20:01lennier1 quits [Client Quit]
21:21:42lunik1 quits [Ping timeout: 265 seconds]
21:21:47lennier1 (lennier1) joins
21:35:19lunik1 joins
21:38:37LeGoupil quits [Ping timeout: 265 seconds]
21:40:02Arcorann (Arcorann) joins
21:42:26lunik1 quits [Client Quit]
21:42:34lunik1 joins
21:46:21lunik1 quits [Client Quit]
21:46:27lunik1 joins
21:49:12lunik1 quits [Client Quit]
21:49:21lunik1 joins
21:50:18march_happy (march_happy) joins
21:52:47march_happy quits [Read error: Connection reset by peer]
21:53:06march_happy (march_happy) joins
21:54:34lunik1 quits [Ping timeout: 265 seconds]
22:00:54<h2ibot>GarveyPatrickD edited A Million Ways to Die on the Web (-1, /* Censorship */ the violating ->…): https://wiki.archiveteam.org/?diff=48408&oldid=46185
22:01:54<h2ibot>Nemo bis edited TechNet (+167, /* Archival */ torrent from…): https://wiki.archiveteam.org/?diff=48409&oldid=46440
22:01:55<h2ibot>Nemo bis edited Microsoft (+550, code; size): https://wiki.archiveteam.org/?diff=48410&oldid=28659
22:02:26user_ quits [Remote host closed the connection]
22:02:37user_ (gazorpazorp) joins
22:05:55<h2ibot>GarveyPatrickD edited ArchiveBot (+86, Added IRC and WARC expansions): https://wiki.archiveteam.org/?diff=48411&oldid=48018
22:05:56<h2ibot>GarveyPatrickD created Template:Alphabet index (+583, Initial input): https://wiki.archiveteam.org/?title=Template%3AAlphabet%20index
22:07:23<@JAA>What do we need that template for?
22:08:22LeGoupil joins
22:22:55lunik1 joins
22:25:24LeGoupil quits [Client Quit]
22:25:25bonga quits [Read error: Connection reset by peer]
22:25:45bonga joins
22:27:29GarveyPatrickD quits [Remote host closed the connection]
22:28:11lunik1 quits [Client Quit]
22:30:12lunik1 joins
23:04:19march_happy quits [Read error: Connection reset by peer]
23:05:03march_happy (march_happy) joins
23:19:20BlueMaxima joins
23:29:36march_happy quits [Read error: Connection reset by peer]
23:30:40march_happy (march_happy) joins
23:31:49meowe quits [Client Quit]
23:32:38meowe (meowe) joins