| 00:53:44 | | Arcorann (Arcorann) joins |
| 01:44:23 | | themadpro (themadpro) joins |
| 02:07:09 | | Wingy (Wingy) joins |
| 02:09:42 | | Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.] |
| 02:10:10 | | Terbium joins |
| 02:48:30 | <h2ibot> | TheTechRobo edited V Live (+61): https://wiki.archiveteam.org/?diff=49191&oldid=49187 |
| 02:49:19 | <@arkiver> | TheTechRobo: if you are asking if the code is opensource - then yes https://github.com/internetarchive/heritrix3 |
| 02:49:49 | <TheTechRobo> | arkiver: https://github.com/internetarchive/heritrix3/discussions/541 |
| 02:49:59 | <TheTechRobo> | I mean using the WARC-writing API in other programs |
| 02:50:07 | <TheTechRobo> | would be easiest if I could just use it as a library :-) |
| 02:50:31 | <@arkiver> | err right |
| 02:50:45 | <@arkiver> | why not use one of the existing solutions? |
| 02:51:02 | <TheTechRobo> | there are ones for java? |
| 02:51:10 | <@arkiver> | are you writing in java now? |
| 02:51:12 | <@arkiver> | but why |
| 02:51:20 | <TheTechRobo> | to learn :-) |
| 02:51:44 | <@arkiver> | not sure if java is the most useful language at the moment to learn but sure |
| 02:51:51 | <@arkiver> | well yeah you could look into heritrix |
| 02:52:02 | <TheTechRobo> | I don't actually hate Java |
| 02:52:02 | <@arkiver> | i don't have much experience with it or its code though |
| 02:52:10 | <@arkiver> | so not much help from me I'm afraid |
| 02:52:23 | <TheTechRobo> | Maven sucks, and the RAM usage is awful, but it's not as bad as some people make it out to be (imo) |
| 02:52:34 | | @arkiver also doesn't hate java |
| 02:56:10 | <TheTechRobo> | arkiver: out of curiosity, why don't you think it's useful? (and what languages do you think are more useful to learn?) |
| 02:59:20 | <@arkiver> | i dont see a ton of new software being written in it |
| 02:59:32 | <@arkiver> | it usually seems to be older software using that or PHP that needs to be maintained |
| 02:59:50 | <TheTechRobo> | that's why you need to use it ig :-) |
| 02:59:51 | <@arkiver> | (JAA may also have opinion on java and 'usefulness' of languages) |
| 02:59:58 | <@arkiver> | opinions* |
| 03:07:33 | <@JAA> | In my opinion, Java as a language isn't terrible, but it's tainted by the history with Sun and Oracle, and it's become a meme due to hilariously overengineered 'enterprise-grade' code. It's been a good few years since I last touched it, but the ecosystem was a mess at the time, and I'm not sure it's improved significantly since. |
| 03:49:37 | | michaelblob (michaelblob) joins |
| 04:00:47 | | mut4ntm0nkey quits [Ping timeout: 248 seconds] |
| 04:23:35 | | themadpro quits [Client Quit] |
| 04:32:17 | | mut4ntm0nkey (mutantmonkey) joins |
| 05:01:12 | | nepeat_ is now known as nepeat |
| 05:02:53 | <h2ibot> | JustAnotherArchivist edited V Live (+66, Add source): https://wiki.archiveteam.org/?diff=49192&oldid=49191 |
| 05:14:50 | <fishingforsoup> | In case anyone here was curious about that lost song of mine, it's partially found! https://www.youtube.com/watch?v=U-BqT6TQR7s |
| 05:43:33 | | Arcorann quits [Ping timeout: 265 seconds] |
| 05:45:22 | | farhan joins |
| 05:45:38 | | farhan quits [Remote host closed the connection] |
| 05:48:35 | | hackbug quits [Remote host closed the connection] |
| 06:05:21 | | hackbug (hackbug) joins |
| 06:10:13 | | hackbug quits [Client Quit] |
| 06:10:24 | | hackbug (hackbug) joins |
| 06:19:36 | | katocala quits [Ping timeout: 276 seconds] |
| 06:44:57 | | hackbug quits [Ping timeout: 276 seconds] |
| 06:45:30 | | hackbug (hackbug) joins |
| 07:01:43 | | jacobk quits [Ping timeout: 268 seconds] |
| 07:09:01 | | jacobk joins |
| 07:10:04 | | hackbug quits [Ping timeout: 265 seconds] |
| 07:44:53 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 07:46:20 | | Arcorann (Arcorann) joins |
| 07:56:55 | | sonick (sonick) joins |
| 08:37:21 | | hitgrr8 joins |
| 09:09:59 | | Island quits [Read error: Connection reset by peer] |
| 10:35:59 | | HackMii quits [Ping timeout: 248 seconds] |
| 10:38:36 | | HackMii (hacktheplanet) joins |
| 10:40:00 | | sepro quits [Quit: Ping timeout (120 seconds)] |
| 11:49:35 | | AK (AK) joins |
| 12:03:22 | | TheTechRobo quits [Remote host closed the connection] |
| 12:04:02 | | TheTechRobo (TheTechRobo) joins |
| 12:32:39 | | march_happy quits [Remote host closed the connection] |
| 12:39:13 | | march_happy (march_happy) joins |
| 12:56:39 | | march_happy quits [Remote host closed the connection] |
| 13:00:29 | | hackbug (hackbug) joins |
| 13:00:38 | | march_happy (march_happy) joins |
| 13:12:18 | | Iki joins |
| 13:29:46 | | themadpro (themadpro) joins |
| 13:46:31 | <audrooku|m> | Oh wow |
| 13:52:11 | <TheTechRobo> | Ooh https://github.com/iipc/jwarc exists |
| 13:59:12 | | Iki quits [Ping timeout: 268 seconds] |
| 14:25:06 | | Arcorann quits [Ping timeout: 268 seconds] |
| 14:59:59 | | HackMii quits [Ping timeout: 248 seconds] |
| 15:02:38 | | HackMii (hacktheplanet) joins |
| 15:03:29 | | katocala joins |
| 15:03:56 | | katocala is now authenticated as katocala |
| 15:46:08 | <Ryz> | Hmm, I'm trying to figure out if there's something that needs to be addressed scraping-wise before the end of November; someone was saying that something is closing down and I'm not sure if it has been addressed |
| 17:00:58 | <@arkiver> | looks like we have all the blogs shutting down on November 30th covered |
| 17:30:21 | | katocala quits [Remote host closed the connection] |
| 17:44:42 | | wyatt8750 joins |
| 17:47:06 | | wyatt8740 quits [Ping timeout: 265 seconds] |
| 17:52:54 | | katocala joins |
| 17:53:24 | | katocala is now authenticated as katocala |
| 18:29:25 | | themadpro quits [Client Quit] |
| 18:45:42 | | Ketchup901 quits [Remote host closed the connection] |
| 18:46:38 | | sepro (sepro) joins |
| 18:49:16 | | Ketchup901 (Ketchup901) joins |
| 18:55:10 | <@Sanqui> | probably the last set of sweb.cz domains (derived from outlinks in the WARCs of the previous sweb.cz ArchiveBot run) put in AB |
| 18:57:19 | | mut4ntm0nkey quits [Ping timeout: 248 seconds] |
| 18:59:11 | | HackMii quits [Remote host closed the connection] |
| 18:59:47 | | HackMii (hacktheplanet) joins |
| 19:04:48 | | mut4ntm0nkey (mutantmonkey) joins |
| 19:18:06 | | sepro7 (sepro) joins |
| 19:20:23 | | sepro quits [Ping timeout: 265 seconds] |
| 19:20:46 | | HackMii quits [Remote host closed the connection] |
| 19:21:50 | | HackMii (hacktheplanet) joins |
| 19:24:09 | | sepro7 quits [Ping timeout: 276 seconds] |
| 19:28:39 | <@OrIdow6> | Congratulations, I'll admit I was skeptical it would work in time |
| 19:51:40 | | sepro (sepro) joins |
| 19:59:11 | <@arkiver> | Sanqui: absolutely awesome |
| 19:59:26 | <@Sanqui> | I don't know how much I'm missing |
| 20:00:21 | <@Sanqui> | the total is 155k domains with some extra unreachable-from-/ urls |
| 20:02:04 | | sonick quits [Client Quit] |
| 20:02:22 | <h2ibot> | Sanqui edited Sweb.cz (+197, set 3): https://wiki.archiveteam.org/?diff=49193&oldid=49176 |
| 20:02:23 | <@arkiver> | sounds pretty good |
| 20:02:36 | <@Sanqui> | I will also note that maybe half or more of the domains are already dead |
| 20:05:12 | <@Sanqui> | likely a lot more new domains could be gathered by scraping seznam.cz search |
| 20:12:01 | <@Sanqui> | you know what, i'm looking into that... |
| 21:08:06 | | TheTechRobo quits [Remote host closed the connection] |
| 21:09:36 | | TheTechRobo (TheTechRobo) joins |
| 21:09:36 | | TheTechRobo quits [Remote host closed the connection] |
| 21:09:59 | | TheTechRobo (TheTechRobo) joins |
| 21:12:24 | | sonick (sonick) joins |
| 21:16:34 | | Island joins |
| 21:18:40 | | TheTechRobo quits [Read error: Connection reset by peer] |
| 21:19:10 | | TheTechRobo (TheTechRobo) joins |
| 21:50:15 | | Iki joins |
| 22:15:37 | | Iki quits [Ping timeout: 268 seconds] |
| 22:20:39 | | BlueMaxima joins |
| 23:00:22 | | sepro7 (sepro) joins |
| 23:01:15 | | sepro quits [Ping timeout: 276 seconds] |
| 23:01:15 | | sepro7 is now known as sepro |
| 23:06:57 | | sepro1 (sepro) joins |
| 23:09:03 | | sepro quits [Ping timeout: 276 seconds] |
| 23:09:03 | | sepro1 is now known as sepro |
| 23:14:48 | | march_happy quits [Ping timeout: 265 seconds] |
| 23:15:23 | | march_happy (march_happy) joins |
| 23:18:37 | | katocala quits [Remote host closed the connection] |
| 23:39:36 | | hitgrr8 quits [Client Quit] |
| 23:44:29 | | Arcorann (Arcorann) joins |
| 23:50:20 | | sonick quits [Client Quit] |