| 00:00:38 | | chrismeller (chrismeller) joins |
| 00:01:01 | | chrismeller quits [Remote host closed the connection] |
| 00:01:24 | | chrismeller (chrismeller) joins |
| 00:49:23 | <h2ibot> | Cyrilio edited Recommended Reading (+206, /* Online Archives of Interest */ added Dutch…): https://wiki.archiveteam.org/?diff=48636&oldid=41342 |
| 00:51:56 | <cyrilio> | good edit/addition? |
| 00:52:13 | <cyrilio> | found some amazing pics in there from before prohibition |
| 00:53:43 | | bonga joins |
| 01:02:35 | | dm4v_ joins |
| 01:03:05 | | dm4v quits [Ping timeout: 265 seconds] |
| 01:03:05 | | dm4v_ is now known as dm4v |
| 01:03:06 | | dm4v is now authenticated as dm4v |
| 01:03:06 | | dm4v quits [Changing host] |
| 01:03:06 | | dm4v (dm4v) joins |
| 01:11:47 | <cyrilio> | perhaps completely off topic, but does anyone know good NLP programs/methods? |
| 01:23:05 | <TheTechRobo> | cyrilio: NLP...? |
| 01:23:45 | <thuban> | 'natural language processing', i assume |
| 01:23:47 | <thuban> | cyrilio: #archiveteam-ot |
| 01:24:11 | <thuban> | (i can recommend some textbooks, but it sounds like maybe you're trying to do something specific?) |
| 01:41:54 | <TheTechRobo> | At my current concurrency for Strawpoll, it'll "only" take me 3 years to finish! |
| 01:42:57 | <cyrilio> | exactly thuban |
| 01:43:46 | <TheTechRobo> | cyrilio: we should probably move to #archiveteam-ot |
| 01:44:05 | <cyrilio> | I'm actually collaborating with two profs and two PhD students on how to make a better bot/automod. I won't be doing any programming. I'm more the reddit and content expert |
| 02:20:30 | | march_happy (march_happy) joins |
| 02:31:44 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 02:31:58 | | BlueMaxima joins |
| 02:42:58 | | cyrilio quits [Remote host closed the connection] |
| 02:49:46 | | march_happy quits [Ping timeout: 265 seconds] |
| 02:50:43 | | march_happy (march_happy) joins |
| 02:55:08 | | michaelblob (michaelblob) joins |
| 04:20:17 | | dm4v quits [Ping timeout: 265 seconds] |
| 04:31:13 | | dm4v joins |
| 04:31:16 | | dm4v is now authenticated as dm4v |
| 04:31:16 | | dm4v quits [Changing host] |
| 04:31:16 | | dm4v (dm4v) joins |
| 04:32:27 | | Arcorann (Arcorann) joins |
| 05:05:45 | | lennier1 quits [Client Quit] |
| 05:07:02 | | G4te_Keep3r quits [Ping timeout: 265 seconds] |
| 05:07:30 | | lennier1 (lennier1) joins |
| 05:56:00 | | DiscantX joins |
| 06:02:24 | | wyatt8740 quits [Client Quit] |
| 06:03:05 | | wyatt8740 joins |
| 06:33:30 | | kn100 quits [Client Quit] |
| 06:34:40 | | kn100 joins |
| 06:53:01 | | chrismeller quits [Ping timeout: 265 seconds] |
| 07:37:04 | | HackMii quits [Remote host closed the connection] |
| 07:39:37 | | HackMii (hacktheplanet) joins |
| 07:43:37 | | HackMii quits [Remote host closed the connection] |
| 07:44:55 | | HackMii (hacktheplanet) joins |
| 07:47:39 | | HackMii quits [Remote host closed the connection] |
| 07:48:52 | | HackMii (hacktheplanet) joins |
| 08:35:58 | | march_happy quits [Ping timeout: 265 seconds] |
| 08:36:33 | | march_happy (march_happy) joins |
| 08:46:08 | | march_happy quits [Read error: Connection reset by peer] |
| 08:46:35 | | march_happy (march_happy) joins |
| 08:58:41 | | march_happy quits [Read error: Connection reset by peer] |
| 08:59:03 | | march_happy (march_happy) joins |
| 09:01:28 | | march_happy quits [Read error: Connection reset by peer] |
| 09:02:04 | | march_happy (march_happy) joins |
| 09:09:12 | | march_happy quits [Read error: Connection reset by peer] |
| 09:10:10 | | march_happy (march_happy) joins |
| 09:14:26 | <AK> | https://freenode.net/view/Network_Info "It was reddit last time let's do media wiki this time yeah suire let's keep the Excitement coming" lol |
| 09:27:19 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 09:31:33 | | march_happy quits [Ping timeout: 265 seconds] |
| 09:31:41 | | march_happy (march_happy) joins |
| 09:42:37 | | march_happy quits [Remote host closed the connection] |
| 09:48:28 | | march_happy (march_happy) joins |
| 09:53:10 | | march_happy quits [Ping timeout: 265 seconds] |
| 09:59:48 | | march_happy (march_happy) joins |
| 10:43:19 | | qwertyasdfuiopghjkl is now known as qwertyasdfuiopghjkl_ |
| 10:43:29 | | qwertyasdfuiopghjkl joins |
| 10:44:04 | | qwertyasdfuiopghjkl_ quits [Client Quit] |
| 11:38:54 | | marto_8 joins |
| 11:39:07 | | Stilett0 joins |
| 11:39:11 | | mgrytbak3 joins |
| 11:39:12 | | CraftByte1 (DragonSec|CraftByte) joins |
| 11:39:15 | | Justin[home] joins |
| 11:39:15 | | Justin[home] is now authenticated as DopefishJustin |
| 11:39:26 | | monika3 (boom) joins |
| 11:39:27 | | datechnoman quits [Client Quit] |
| 11:39:27 | | @Kaz quits [Client Quit] |
| 11:39:28 | | coderobe quits [Client Quit] |
| 11:39:28 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 11:39:28 | | CraftByte quits [Client Quit] |
| 11:39:28 | | marto_ quits [Client Quit] |
| 11:39:28 | | seednode494 quits [Client Quit] |
| 11:39:28 | | mgrytbak quits [Client Quit] |
| 11:39:28 | | NIC007a83_ quits [Remote host closed the connection] |
| 11:39:28 | | VerifiedJ9 quits [Client Quit] |
| 11:39:28 | | jtagcat6 quits [Client Quit] |
| 11:39:28 | | notbasetwo quits [Quit: o/] |
| 11:39:28 | | Shjosan_ quits [Client Quit] |
| 11:39:28 | | monika quits [Client Quit] |
| 11:39:28 | | superkuh quits [Remote host closed the connection] |
| 11:39:28 | | Mateon1 quits [Remote host closed the connection] |
| 11:39:28 | | DopefishJustin quits [Remote host closed the connection] |
| 11:39:28 | | Stiletto quits [Remote host closed the connection] |
| 11:39:28 | | marto_8 is now known as marto_ |
| 11:39:28 | | mgrytbak3 is now known as mgrytbak |
| 11:39:28 | | CraftByte1 is now known as CraftByte |
| 11:39:28 | | monika3 is now known as monika |
| 11:39:30 | | IDK_ quits [Client Quit] |
| 11:39:30 | | superkuh joins |
| 11:39:30 | | Ryz quits [Client Quit] |
| 11:39:31 | | Mateon1 joins |
| 11:39:33 | | seednode494 (seednode) joins |
| 11:39:37 | | IDK_ joins |
| 11:39:41 | | Ryz (Ryz) joins |
| 11:39:51 | | Kaz8 (Kaz) joins |
| 11:39:51 | | @ChanServ sets mode: +o Kaz8 |
| 11:39:52 | | datechnoman1 (datechnoman) joins |
| 11:40:03 | | NIC007a83 joins |
| 11:40:04 | | coderobe4 (coderobe) joins |
| 11:40:06 | | Shjosan (Shjosan) joins |
| 11:40:16 | | jtagcat6 (jtagcat) joins |
| 11:41:27 | | notbasetwo joins |
| 11:46:20 | | @ChanServ sets mode: +o Sanqui |
| 11:47:58 | | coderobe4 is now known as coderobe |
| 12:06:16 | | HP_Archivist (HP_Archivist) joins |
| 12:06:44 | | qwertyasdfuiopghjkl joins |
| 12:19:50 | | VerifiedJ9 (VerifiedJ) joins |
| 12:22:19 | | eroc1990 quits [Client Quit] |
| 12:22:50 | | eroc1990 (eroc1990) joins |
| 12:34:07 | | DiscantX quits [Ping timeout: 265 seconds] |
| 12:54:25 | | Arcorann quits [Ping timeout: 265 seconds] |
| 13:29:06 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 13:45:10 | | jacobk_ quits [Ping timeout: 265 seconds] |
| 14:00:18 | | bonga quits [Remote host closed the connection] |
| 14:03:03 | | bonga joins |
| 14:10:39 | | jtagcat6 quits [Client Quit] |
| 14:14:57 | | jtagcat6 (jtagcat) joins |
| 14:34:39 | <@arkiver> | Ryz: on http://mansionofe.comicgenesis.com/ and http://mansionofe.comicgen.com/ - so one is 2 GB, total would only be 2 GB of duplicated content? |
| 14:34:42 | <@arkiver> | i'd say get them both |
| 14:35:32 | <@arkiver> | lennier1: if you get those screenshot URLs as well, let me know! |
| 14:44:15 | | jacobk joins |
| 14:52:00 | | bonga quits [Ping timeout: 265 seconds] |
| 14:52:21 | | bonga joins |
| 14:56:06 | <@arkiver> | Ryz: JAA: I remember there were some HTML tags (and/or attributes) from which no URLs were extracted |
| 14:56:37 | <@arkiver> | I believe this was for swf, but there may have been other examples as well. what were these? |
| 14:58:07 | <@arkiver> | i'm expanding what Wget-AT extracts from HTML |
| 15:03:27 | | jacobk quits [Ping timeout: 245 seconds] |
| 15:07:57 | <@Sanqui> | this may sound stupid, but are you extracting links in plain text, especially those without a protocol? |
| 15:15:18 | | qwertyasdfuiopghjkl joins |
| 15:45:42 | | HP_Archivist quits [Client Quit] |
| 16:10:47 | | michaelblob quits [Read error: Connection reset by peer] |
| 16:12:58 | | michaelblob (michaelblob) joins |
| 16:14:09 | | michaelblob quits [Read error: Connection reset by peer] |
| 16:18:31 | | michaelblob (michaelblob) joins |
| 16:21:54 | | dm4v quits [Ping timeout: 265 seconds] |
| 16:24:32 | | dm4v joins |
| 16:24:34 | | dm4v is now authenticated as dm4v |
| 16:24:34 | | dm4v quits [Changing host] |
| 16:24:34 | | dm4v (dm4v) joins |
| 16:33:18 | | jacobk joins |
| 16:38:02 | | jacobk quits [Ping timeout: 245 seconds] |
| 16:50:54 | | march_happy quits [Ping timeout: 265 seconds] |
| 17:09:51 | | Matthww quits [Remote host closed the connection] |
| 17:19:15 | <h2ibot> | Qwerty0 edited Last.fm (+173, /* Listening History */ Update with some…): https://wiki.archiveteam.org/?diff=48637&oldid=47950 |
| 17:19:16 | <h2ibot> | Hasional edited ArchiveBot/National Archives/list (+66): https://wiki.archiveteam.org/?diff=48638&oldid=37062 |
| 17:20:42 | | jacobk joins |
| 17:22:17 | | driib (driib) joins |
| 17:25:34 | | jacobk quits [Ping timeout: 265 seconds] |
| 17:26:17 | | Matthww joins |
| 17:52:59 | <@JAA> | arkiver: I don't remember. :-/ |
| 17:57:40 | <@arkiver> | JAA: already found it, it was the param tag |
| 17:58:07 | <@arkiver> | just added support for it, will probably push out an update to Wget-AT today or tomorrow |
| 17:58:10 | <@JAA> | Yeah, I assumed that was what you meant by 'for swf'. |
| 17:58:22 | <@JAA> | Can we fix the accesses to private IP addresses at the same time? |
| 17:58:28 | <@JAA> | It came up again somewhere the other day. |
| 17:58:31 | <@arkiver> | that will be a different update |
| 17:58:46 | <Jake> | (came up in #youtubearchive ) |
| 18:00:48 | | Mateon1 quits [Remote host closed the connection] |
| 18:01:45 | | Mateon1 joins |
| 18:18:07 | | jacobk joins |
| 18:56:26 | | Mateon1 quits [Remote host closed the connection] |
| 18:57:23 | | Mateon1 joins |
| 19:20:07 | | jacobk quits [Ping timeout: 245 seconds] |
| 19:20:46 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
| 19:21:16 | | Craigle (Craigle) joins |
| 19:32:58 | | Mateon1 quits [Remote host closed the connection] |
| 19:33:16 | | Mateon1 joins |
| 19:34:57 | | driib5 (driib) joins |
| 19:38:08 | | driib quits [Ping timeout: 265 seconds] |
| 19:38:09 | | driib (driib) joins |
| 19:41:47 | | driib5 quits [Ping timeout: 245 seconds] |
| 19:46:38 | <h2ibot> | Systwi edited Template:Wikis (+19, Added MoinMoin wiki software (https://moinmo.in/).): https://wiki.archiveteam.org/?diff=48639&oldid=48522 |
| 19:53:08 | | jacobk joins |
| 20:00:07 | | jacobk quits [Ping timeout: 245 seconds] |
| 20:15:50 | | bonga quits [Ping timeout: 265 seconds] |
| 20:21:09 | | bonga joins |
| 20:21:52 | <lennier1> | arkiver: I need to finish scraping the metadata to get all the screenshot urls, but will do! |
| 20:25:59 | | bonga quits [Ping timeout: 265 seconds] |
| 20:27:47 | | bonga joins |
| 20:29:31 | | lennier1 quits [Client Quit] |
| 20:31:02 | | lennier1 (lennier1) joins |
| 20:40:50 | | bonga quits [Ping timeout: 265 seconds] |
| 20:47:47 | | @Sanqui quits [Changing host] |
| 20:47:47 | | Sanqui (Sanqui) joins |
| 20:47:47 | | ing.hackint.org sets mode: +o Sanqui |
| 20:47:47 | | Sanqui|m quits [Changing host] |
| 20:47:47 | | Sanqui|m (Sanqui) joins |
| 20:47:47 | | @ChanServ sets mode: +o Sanqui|m |
| 20:50:30 | | godane quits [Ping timeout: 265 seconds] |
| 21:05:07 | | wessel1512 quits [Read error: Connection reset by peer] |
| 21:05:29 | | wessel1512 joins |
| 21:33:25 | <@arkiver> | lennier1: sounds good. i'm adding support for extracting URLs from the srcset attribute in the source HTML tag |
| 21:33:35 | <@arkiver> | that one is used for storing some image on those app web pages |
| 21:46:13 | | march_happy (march_happy) joins |
| 23:05:37 | | BlueMaxima joins |
| 23:27:14 | | march_happy quits [Ping timeout: 265 seconds] |
| 23:40:55 | <pabs> | has the centos.org domain been archived, including https://forums.centos.org/ ? |
| 23:42:06 | <pabs> | (the main CentOS is defunct, RedHat shut it down in favour of CentOS Stream) |
| 23:46:00 | <TheTechRobo> | don't think so: https://archive.fart.website/archivebot/viewer/?q=centos |
| 23:46:10 | <TheTechRobo> | only vault. and lists. in archivebot |
| 23:49:40 | | jacobk joins |
| 23:53:42 | <TheTechRobo> | pabs: ^ |
| 23:54:12 | | march_happy (march_happy) joins |
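
Note on the 21:33 messages about extracting URLs from the `srcset` attribute of the `source` tag: the sketch below shows roughly what that extraction involves. It is not Wget-AT's code (Wget-AT is not written in Python); the names `parse_srcset` and `SrcsetExtractor` are hypothetical, and the parser is a simplification of the HTML specification's srcset algorithm (it does not handle parentheses inside descriptors).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


def parse_srcset(value):
    """Return the URLs from a srcset value, dropping the 1x/480w descriptors.

    Simplified: a URL is a run of non-whitespace characters, and anything
    after it up to the next comma is treated as a descriptor.
    """
    urls = []
    pos, n = 0, len(value)
    while pos < n:
        # Skip separators (whitespace and commas) before the next candidate.
        while pos < n and (value[pos].isspace() or value[pos] == ","):
            pos += 1
        if pos >= n:
            break
        start = pos
        while pos < n and not value[pos].isspace():
            pos += 1
        url = value[start:pos]
        if url.endswith(","):
            # Candidate without a descriptor, e.g. "a.png, b.png 2x".
            url = url.rstrip(",")
        else:
            # Skip the descriptor up to the comma that ends this candidate.
            while pos < n and value[pos] != ",":
                pos += 1
        if url:
            urls.append(url)
    return urls


class SrcsetExtractor(HTMLParser):
    """Collect absolute URLs from <source srcset="..."> tags (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "source":
            return
        for name, value in attrs:
            if name == "srcset" and value:
                for url in parse_srcset(value):
                    # Resolve relative references against the page URL.
                    self.urls.append(urljoin(self.base_url, url))
```

To use it, create `SrcsetExtractor(page_url)`, call `feed(html)`, and read the `urls` list. Scanning for whitespace before commas keeps URLs that themselves contain commas intact, which a plain `split(",")` would break.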
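
Note on JAA's 17:58 request to fix accesses to private IP addresses: the log does not say how this would be done in Wget-AT, so the following is only a sketch of one possible check, using Python's standard `ipaddress` module. The function name `is_non_public_address` is hypothetical, and the set of ranges treated as off-limits is an assumption.

```python
import ipaddress


def is_non_public_address(ip_text):
    """Return True if the IP literal is in a range a crawler should not fetch.

    Accepts IPv4 or IPv6 literals only; raises ValueError for anything else.
    The choice of ranges (private, loopback, link-local, multicast,
    reserved, unspecified) is an assumption for this sketch.
    """
    addr = ipaddress.ip_address(ip_text)
    return (
        addr.is_private
        or addr.is_loopback
        or addr.is_link_local
        or addr.is_multicast
        or addr.is_reserved
        or addr.is_unspecified
    )
```

A hostname would have to be resolved first, and the check applied to the address that is actually connected to; otherwise a public name that resolves to an internal address slips through.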