00:00:38chrismeller (chrismeller) joins
00:01:01chrismeller quits [Remote host closed the connection]
00:01:24chrismeller (chrismeller) joins
00:49:23<h2ibot>Cyrilio edited Recommended Reading (+206, /* Online Archives of Interest */ added Dutch…): https://wiki.archiveteam.org/?diff=48636&oldid=41342
00:51:56<cyrilio>good edit/addition?
00:52:13<cyrilio>found some amazing pics in there from before prohibition
00:53:43bonga joins
01:02:35dm4v_ joins
01:03:05dm4v quits [Ping timeout: 265 seconds]
01:03:05dm4v_ is now known as dm4v
01:03:06dm4v quits [Changing host]
01:03:06dm4v (dm4v) joins
01:11:47<cyrilio>perhaps completely off topic, but doe anyone know good NLP programs/methods?
01:23:05<TheTechRobo>cyrilio: NLP...?
01:23:45<thuban>'natural language processing', i assume
01:23:47<thuban>cyrilio: #archiveteam-ot
01:24:11<thuban>(i can recommend some textbooks, but it sounds like maybe you're trying to do something specific?)
01:41:54<TheTechRobo>At my current concurrency for Strawpoll, it'll "only" take me 3 years to finish!
01:42:57<cyrilio>exactly thuban
01:43:46<TheTechRobo>cyrilio: we should probably move to #archiveteam-ot
01:44:05<cyrilio>I'm actually colaborating with two profs and two PhD students on how to make a better bot/automod. I won't be doing any programming. I'm more the reddit and content expert
02:20:30march_happy (march_happy) joins
02:31:44BlueMaxima quits [Read error: Connection reset by peer]
02:31:58BlueMaxima joins
02:42:58cyrilio quits [Remote host closed the connection]
02:49:46march_happy quits [Ping timeout: 265 seconds]
02:50:43march_happy (march_happy) joins
02:55:08michaelblob (michaelblob) joins
04:20:17dm4v quits [Ping timeout: 265 seconds]
04:31:13dm4v joins
04:31:16dm4v quits [Changing host]
04:31:16dm4v (dm4v) joins
04:32:27Arcorann (Arcorann) joins
05:05:45lennier1 quits [Client Quit]
05:07:02G4te_Keep3r quits [Ping timeout: 265 seconds]
05:07:30lennier1 (lennier1) joins
05:56:00DiscantX joins
06:02:24wyatt8740 quits [Client Quit]
06:03:05wyatt8740 joins
06:33:30kn100 quits [Client Quit]
06:34:40kn100 joins
06:53:01chrismeller quits [Ping timeout: 265 seconds]
07:37:04HackMii quits [Remote host closed the connection]
07:39:37HackMii (hacktheplanet) joins
07:43:37HackMii quits [Remote host closed the connection]
07:44:55HackMii (hacktheplanet) joins
07:47:39HackMii quits [Remote host closed the connection]
07:48:52HackMii (hacktheplanet) joins
08:35:58march_happy quits [Ping timeout: 265 seconds]
08:36:33march_happy (march_happy) joins
08:46:08march_happy quits [Read error: Connection reset by peer]
08:46:35march_happy (march_happy) joins
08:58:41march_happy quits [Read error: Connection reset by peer]
08:59:03march_happy (march_happy) joins
09:01:28march_happy quits [Read error: Connection reset by peer]
09:02:04march_happy (march_happy) joins
09:09:12march_happy quits [Read error: Connection reset by peer]
09:10:10march_happy (march_happy) joins
09:14:26<AK>https://freenode.net/view/Network_Info "It was reddit last time let's do media wiki this time yeah suire let's keep the Excitement coming‌" lol
09:27:19BlueMaxima quits [Read error: Connection reset by peer]
09:31:33march_happy quits [Ping timeout: 265 seconds]
09:31:41march_happy (march_happy) joins
09:42:37march_happy quits [Remote host closed the connection]
09:48:28march_happy (march_happy) joins
09:53:10march_happy quits [Ping timeout: 265 seconds]
09:59:48march_happy (march_happy) joins
10:43:19qwertyasdfuiopghjkl is now known as qwertyasdfuiopghjkl_
10:43:29qwertyasdfuiopghjkl joins
10:44:04qwertyasdfuiopghjkl_ quits [Client Quit]
11:38:54marto_8 joins
11:39:07Stilett0 joins
11:39:11mgrytbak3 joins
11:39:12CraftByte1 (DragonSec|CraftByte) joins
11:39:15Justin[home] joins
11:39:26monika3 (boom) joins
11:39:27datechnoman quits [Client Quit]
11:39:27@Kaz quits [Client Quit]
11:39:28coderobe quits [Client Quit]
11:39:28qwertyasdfuiopghjkl quits [Remote host closed the connection]
11:39:28CraftByte quits [Client Quit]
11:39:28marto_ quits [Client Quit]
11:39:28seednode494 quits [Client Quit]
11:39:28mgrytbak quits [Client Quit]
11:39:28NIC007a83_ quits [Remote host closed the connection]
11:39:28VerifiedJ9 quits [Client Quit]
11:39:28jtagcat6 quits [Client Quit]
11:39:28notbasetwo quits [Quit: o/]
11:39:28Shjosan_ quits [Client Quit]
11:39:28monika quits [Client Quit]
11:39:28superkuh quits [Remote host closed the connection]
11:39:28Mateon1 quits [Remote host closed the connection]
11:39:28DopefishJustin quits [Remote host closed the connection]
11:39:28Stiletto quits [Remote host closed the connection]
11:39:28marto_8 is now known as marto_
11:39:28mgrytbak3 is now known as mgrytbak
11:39:28CraftByte1 is now known as CraftByte
11:39:28monika3 is now known as monika
11:39:30IDK_ quits [Client Quit]
11:39:30superkuh joins
11:39:30Ryz quits [Client Quit]
11:39:31Mateon1 joins
11:39:33seednode494 (seednode) joins
11:39:37IDK_ joins
11:39:41Ryz (Ryz) joins
11:39:51Kaz8 (Kaz) joins
11:39:51@ChanServ sets mode: +o Kaz8
11:39:52datechnoman1 (datechnoman) joins
11:40:03NIC007a83 joins
11:40:04coderobe4 (coderobe) joins
11:40:06Shjosan (Shjosan) joins
11:40:16jtagcat6 (jtagcat) joins
11:41:27notbasetwo joins
11:46:20@ChanServ sets mode: +o Sanqui
11:47:58coderobe4 is now known as coderobe
12:06:16HP_Archivist (HP_Archivist) joins
12:06:44qwertyasdfuiopghjkl joins
12:19:50VerifiedJ9 (VerifiedJ) joins
12:22:19eroc1990 quits [Client Quit]
12:22:50eroc1990 (eroc1990) joins
12:34:07DiscantX quits [Ping timeout: 265 seconds]
12:54:25Arcorann quits [Ping timeout: 265 seconds]
13:29:06qwertyasdfuiopghjkl quits [Remote host closed the connection]
13:45:10jacobk_ quits [Ping timeout: 265 seconds]
14:00:18bonga quits [Remote host closed the connection]
14:03:03bonga joins
14:10:39jtagcat6 quits [Client Quit]
14:14:57jtagcat6 (jtagcat) joins
14:34:39<@arkiver>Ryz: on http://mansionofe.comicgenesis.com/ and http://mansionofe.comicgen.com/ - so one is 2 GB, total would only be 2 GB of duplicated content?
14:34:42<@arkiver>i'd say get them both
14:35:32<@arkiver>lennier1: if you get those screenshot URLs as well, let me know!
14:44:15jacobk joins
14:52:00bonga quits [Ping timeout: 265 seconds]
14:52:21bonga joins
14:56:06<@arkiver>Ryz: JAA: I remember there were some HTML tags (and/or attributes) from which no URLs were extracted
14:56:37<@arkiver>I believe this was for swf, but there may have been other examples as well. what were these?
14:58:07<@arkiver>i'm expanding what Wget-AT extracts from HTML
15:03:27jacobk quits [Ping timeout: 245 seconds]
15:07:57<@Sanqui>this may sounds stupid, but are you extracting links in plain text, especially those without a protocol?
15:15:18qwertyasdfuiopghjkl joins
15:45:42HP_Archivist quits [Client Quit]
16:10:47michaelblob quits [Read error: Connection reset by peer]
16:12:58michaelblob (michaelblob) joins
16:14:09michaelblob quits [Read error: Connection reset by peer]
16:18:31michaelblob (michaelblob) joins
16:21:54dm4v quits [Ping timeout: 265 seconds]
16:24:32dm4v joins
16:24:34dm4v quits [Changing host]
16:24:34dm4v (dm4v) joins
16:33:18jacobk joins
16:38:02jacobk quits [Ping timeout: 245 seconds]
16:50:54march_happy quits [Ping timeout: 265 seconds]
17:09:51Matthww quits [Remote host closed the connection]
17:19:15<h2ibot>Qwerty0 edited Last.fm (+173, /* Listening History */ Update with some…): https://wiki.archiveteam.org/?diff=48637&oldid=47950
17:19:16<h2ibot>Hasional edited ArchiveBot/National Archives/list (+66): https://wiki.archiveteam.org/?diff=48638&oldid=37062
17:20:42jacobk joins
17:22:17driib (driib) joins
17:25:34jacobk quits [Ping timeout: 265 seconds]
17:26:17Matthww joins
17:52:59<@JAA>arkiver: I don't remember. :-/
17:57:40<@arkiver>JAA: already found it, it was the param tag
17:58:07<@arkiver>just added support for it, will probably push out an update to Wget-AT today or tomorrow
17:58:10<@JAA>Yeah, I assumed that was what you meant by 'for swf'.
17:58:22<@JAA>Can we fix the accesses to private IP addresses at the same time?
17:58:28<@JAA>It came up again somewhere the other day.
17:58:31<@arkiver>that will be a different update
17:58:46<Jake>(came up in #youtubearchive )
18:00:48Mateon1 quits [Remote host closed the connection]
18:01:45Mateon1 joins
18:18:07jacobk joins
18:56:26Mateon1 quits [Remote host closed the connection]
18:57:23Mateon1 joins
19:20:07jacobk quits [Ping timeout: 245 seconds]
19:20:46Craigle quits [Quit: The Lounge - https://thelounge.chat]
19:21:16Craigle (Craigle) joins
19:32:58Mateon1 quits [Remote host closed the connection]
19:33:16Mateon1 joins
19:34:57driib5 (driib) joins
19:38:08driib quits [Ping timeout: 265 seconds]
19:38:09driib (driib) joins
19:41:47driib5 quits [Ping timeout: 245 seconds]
19:46:38<h2ibot>Systwi edited Template:Wikis (+19, Added MoinMoin wiki software (https://moinmo.in/).): https://wiki.archiveteam.org/?diff=48639&oldid=48522
19:53:08jacobk joins
20:00:07jacobk quits [Ping timeout: 245 seconds]
20:15:50bonga quits [Ping timeout: 265 seconds]
20:21:09bonga joins
20:21:52<lennier1>arkiver: I need to finish scraping the metadata to get all the screenshot urls, but will do!
20:25:59bonga quits [Ping timeout: 265 seconds]
20:27:47bonga joins
20:29:31lennier1 quits [Client Quit]
20:31:02lennier1 (lennier1) joins
20:40:50bonga quits [Ping timeout: 265 seconds]
20:47:47@Sanqui quits [Changing host]
20:47:47Sanqui (Sanqui) joins
20:47:47ing.hackint.org sets mode: +o Sanqui
20:47:47Sanqui|m quits [Changing host]
20:47:47Sanqui|m (Sanqui) joins
20:47:47@ChanServ sets mode: +o Sanqui|m
20:50:30godane quits [Ping timeout: 265 seconds]
21:05:07wessel1512 quits [Read error: Connection reset by peer]
21:05:29wessel1512 joins
21:33:25<@arkiver>lennier1: sounds good. i'm assing support for extracting URLs from the srcset attribute in the source HTML tag
21:33:35<@arkiver>that one is used for storing some image on those app web pages
21:46:13march_happy (march_happy) joins
23:05:37BlueMaxima joins
23:27:14march_happy quits [Ping timeout: 265 seconds]
23:40:55<pabs>has the centos.org domain been archived, including https://forums.centos.org/ ?
23:42:06<pabs>(the main CentOS is defunct, RedHat shut it down in favour of CentOS Stream)
23:46:00<TheTechRobo>don't think so: https://archive.fart.website/archivebot/viewer/?q=centos
23:46:10<TheTechRobo>only vault. and lists. in archivebot
23:49:40jacobk joins
23:53:42<TheTechRobo>pabs: ^
23:54:12march_happy (march_happy) joins