00:53:44Arcorann (Arcorann) joins
01:44:23themadpro (themadpro) joins
02:07:09Wingy (Wingy) joins
02:09:42Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
02:10:10Terbium joins
02:48:30<h2ibot>TheTechRobo edited V Live (+61): https://wiki.archiveteam.org/?diff=49191&oldid=49187
02:49:19<@arkiver>TheTechRobo: if you are asking if the code is opensource - then yes https://github.com/internetarchive/heritrix3
02:49:49<TheTechRobo>arkiver: https://github.com/internetarchive/heritrix3/discussions/541
02:49:59<TheTechRobo>I mean using the WARC-writing API in other programs
02:50:07<TheTechRobo>would be easiest if I could just use it as a library :-)
02:50:31<@arkiver>err right
02:50:45<@arkiver>why not use one of the existing solutions?
02:51:02<TheTechRobo>there are ones for java?
02:51:10<@arkiver>are you writing in java now?
02:51:12<@arkiver>but why
02:51:20<TheTechRobo>to learn :-)
02:51:44<@arkiver>not sure if java is the most useful language at the moment to learn but sure
02:51:51<@arkiver>well yeah you could look into heritrix
02:52:02<TheTechRobo>I don't actually hate Java
02:52:02<@arkiver>i don't have much experience with it or its code though
02:52:10<@arkiver>so not much help from me I'm afraid
02:52:23<TheTechRobo>Maven sucks, and the RAM usage is awful, but it's not as bad as some people make it out to be (imo)
02:52:34@arkiver also doesn't hate java
02:56:10<TheTechRobo>arkiver: out of curiosity, why don't you think it's useful? (and what languages do you think are more useful to learn?)
02:59:20<@arkiver>i dont see a ton of new software being written in it
02:59:32<@arkiver>it usually seems older software that uses that or php that needs to be maintained
02:59:50<TheTechRobo>that's why you need to use it ig :-)
02:59:51<@arkiver>( JAA may also have opinion on java and 'usefulness' of languages)
02:59:58<@arkiver>opinions*
03:07:33<@JAA>In my opinion, Java as a language isn't terrible, but it's tainted by the history with Sun and Oracle, and it's become a meme due to hilariously overengineered 'enterprise-grade' code. It's been a good few years since I last touched it, but the ecosystem was a mess at the time, and I'm not sure it's improved significantly since.
03:49:37michaelblob (michaelblob) joins
04:00:47mut4ntm0nkey quits [Ping timeout: 248 seconds]
04:23:35themadpro quits [Client Quit]
04:32:17mut4ntm0nkey (mutantmonkey) joins
05:01:12nepeat_ is now known as nepeat
05:02:53<h2ibot>JustAnotherArchivist edited V Live (+66, Add source): https://wiki.archiveteam.org/?diff=49192&oldid=49191
05:14:50<fishingforsoup>In case anyone here was curious about that lost song of mine, it's partially found! https://www.youtube.com/watch?v=U-BqT6TQR7s
05:43:33Arcorann quits [Ping timeout: 265 seconds]
05:45:22farhan joins
05:45:38farhan quits [Remote host closed the connection]
05:48:35hackbug quits [Remote host closed the connection]
06:05:21hackbug (hackbug) joins
06:10:13hackbug quits [Client Quit]
06:10:24hackbug (hackbug) joins
06:19:36katocala quits [Ping timeout: 276 seconds]
06:44:57hackbug quits [Ping timeout: 276 seconds]
06:45:30hackbug (hackbug) joins
07:01:43jacobk quits [Ping timeout: 268 seconds]
07:09:01jacobk joins
07:10:04hackbug quits [Ping timeout: 265 seconds]
07:44:53BlueMaxima quits [Read error: Connection reset by peer]
07:46:20Arcorann (Arcorann) joins
07:56:55sonick (sonick) joins
08:37:21hitgrr8 joins
09:09:59Island quits [Read error: Connection reset by peer]
10:35:59HackMii quits [Ping timeout: 248 seconds]
10:38:36HackMii (hacktheplanet) joins
10:40:00sepro quits [Quit: Ping timeout (120 seconds)]
11:49:35AK (AK) joins
12:03:22TheTechRobo quits [Remote host closed the connection]
12:04:02TheTechRobo (TheTechRobo) joins
12:32:39march_happy quits [Remote host closed the connection]
12:39:13march_happy (march_happy) joins
12:56:39march_happy quits [Remote host closed the connection]
13:00:29hackbug (hackbug) joins
13:00:38march_happy (march_happy) joins
13:12:18Iki joins
13:29:46themadpro (themadpro) joins
13:46:31<audrooku|m>Oh wow
13:52:11<TheTechRobo>Ooh https://github.com/iipc/jwarc exists
13:59:12Iki quits [Ping timeout: 268 seconds]
14:25:06Arcorann quits [Ping timeout: 268 seconds]
14:59:59HackMii quits [Ping timeout: 248 seconds]
15:02:38HackMii (hacktheplanet) joins
15:03:29katocala joins
15:46:08<Ryz>Hmm, I'm trying to figure out if there's something that needs to be addressed scraping wise before the end of November; someone was saying something that's closing down and I'm not sure if it has been addressed
17:00:58<@arkiver>looks like we have all the blogs shutting down on November 30th covered
17:30:21katocala quits [Remote host closed the connection]
17:44:42wyatt8750 joins
17:47:06wyatt8740 quits [Ping timeout: 265 seconds]
17:52:54katocala joins
18:29:25themadpro quits [Client Quit]
18:45:42Ketchup901 quits [Remote host closed the connection]
18:46:38sepro (sepro) joins
18:49:16Ketchup901 (Ketchup901) joins
18:55:10<@Sanqui>probably last set of sweb.cz domains (derived from outlinks from the previous sweb.cz archivebot run warcs) put in AB
18:57:19mut4ntm0nkey quits [Ping timeout: 248 seconds]
18:59:11HackMii quits [Remote host closed the connection]
18:59:47HackMii (hacktheplanet) joins
19:04:48mut4ntm0nkey (mutantmonkey) joins
19:18:06sepro7 (sepro) joins
19:20:23sepro quits [Ping timeout: 265 seconds]
19:20:46HackMii quits [Remote host closed the connection]
19:21:50HackMii (hacktheplanet) joins
19:24:09sepro7 quits [Ping timeout: 276 seconds]
19:28:39<@OrIdow6>Congratulations, I'll admit I was skeptical it would work in time
19:51:40sepro (sepro) joins
19:59:11<@arkiver>Sanqui: absolutely awesome
19:59:26<@Sanqui>I don't know how much I'm missing
20:00:21<@Sanqui>the total is 155k domains with some extra unreachable-from-/ urls
20:02:04sonick quits [Client Quit]
20:02:22<h2ibot>Sanqui edited Sweb.cz (+197, set 3): https://wiki.archiveteam.org/?diff=49193&oldid=49176
20:02:23<@arkiver>sounds pretty good
20:02:36<@Sanqui>I will also note that maybe half or more of the domains are already dead
20:05:12<@Sanqui>likely a lot more new domains could be gathered by scraping seznam.cz search
20:12:01<@Sanqui>you know what, i'm looking into that...
21:08:06TheTechRobo quits [Remote host closed the connection]
21:09:36TheTechRobo (TheTechRobo) joins
21:09:36TheTechRobo quits [Remote host closed the connection]
21:09:59TheTechRobo (TheTechRobo) joins
21:12:24sonick (sonick) joins
21:16:34Island joins
21:18:40TheTechRobo quits [Read error: Connection reset by peer]
21:19:10TheTechRobo (TheTechRobo) joins
21:50:15Iki joins
22:15:37Iki quits [Ping timeout: 268 seconds]
22:20:39BlueMaxima joins
23:00:22sepro7 (sepro) joins
23:01:15sepro quits [Ping timeout: 276 seconds]
23:01:15sepro7 is now known as sepro
23:06:57sepro1 (sepro) joins
23:09:03sepro quits [Ping timeout: 276 seconds]
23:09:03sepro1 is now known as sepro
23:14:48march_happy quits [Ping timeout: 265 seconds]
23:15:23march_happy (march_happy) joins
23:18:37katocala quits [Remote host closed the connection]
23:39:36hitgrr8 quits [Client Quit]
23:44:29Arcorann (Arcorann) joins
23:50:20sonick quits [Client Quit]