| 00:37:30 | | onetruth quits [Remote host closed the connection] |
| 00:37:42 | | onetruth joins |
| 00:41:20 | | Chris50105 (Chris5010) joins |
| 00:41:21 | | HP_Archivist quits [Remote host closed the connection] |
| 00:41:21 | | onetruth quits [Remote host closed the connection] |
| 00:41:21 | | Chris5010 quits [Client Quit] |
| 00:41:22 | | chrismeller quits [Remote host closed the connection] |
| 00:41:22 | | driib quits [Client Quit] |
| 00:41:22 | | Chris50105 is now known as Chris5010 |
| 00:41:31 | | onetruth joins |
| 00:41:36 | | HP_Archivist (HP_Archivist) joins |
| 00:41:38 | | rellem (chrismeller) joins |
| 00:42:01 | | rellem quits [Remote host closed the connection] |
| 00:42:09 | | driib (driib) joins |
| 00:43:08 | | rellem (chrismeller) joins |
| 00:43:31 | | rellem quits [Remote host closed the connection] |
| 00:44:38 | | rellem (chrismeller) joins |
| 00:45:01 | | rellem quits [Remote host closed the connection] |
| 00:45:58 | | march_happy quits [Ping timeout: 265 seconds] |
| 00:46:08 | | rellem (chrismeller) joins |
| 00:46:30 | | march_happy (march_happy) joins |
| 00:46:31 | | rellem quits [Remote host closed the connection] |
| 00:47:38 | | rellem (chrismeller) joins |
| 00:48:01 | | rellem quits [Remote host closed the connection] |
| 00:49:08 | | rellem (chrismeller) joins |
| 00:49:31 | | rellem quits [Remote host closed the connection] |
| 00:50:38 | | rellem (chrismeller) joins |
| 00:51:01 | | rellem quits [Remote host closed the connection] |
| 00:52:08 | | rellem (chrismeller) joins |
| 00:52:31 | | rellem quits [Remote host closed the connection] |
| 00:53:38 | | rellem (chrismeller) joins |
| 00:54:01 | | rellem quits [Remote host closed the connection] |
| 00:55:08 | | rellem (chrismeller) joins |
| 00:55:31 | | rellem quits [Remote host closed the connection] |
| 00:56:38 | | rellem (chrismeller) joins |
| 00:57:01 | | rellem quits [Remote host closed the connection] |
| 00:57:12 | | march_happy quits [Ping timeout: 245 seconds] |
| 00:57:20 | | march_happy (march_happy) joins |
| 00:58:08 | | rellem (chrismeller) joins |
| 00:58:31 | | rellem quits [Remote host closed the connection] |
| 00:58:54 | | rellem (chrismeller) joins |
| 01:01:22 | | bonga quits [Ping timeout: 245 seconds] |
| 01:02:46 | | dm4v_ joins |
| 01:02:48 | | bonga joins |
| 01:02:52 | | HP_Archivist quits [Client Quit] |
| 01:02:53 | | dm4v quits [Ping timeout: 265 seconds] |
| 01:02:58 | | dm4v_ is now known as dm4v |
| 01:03:01 | | dm4v is now authenticated as dm4v |
| 01:03:01 | | dm4v quits [Changing host] |
| 01:03:01 | | dm4v (dm4v) joins |
| 01:03:02 | | eroc1990 quits [Ping timeout: 245 seconds] |
| 01:03:28 | | eroc1990 (eroc1990) joins |
| 01:07:04 | | eroc1990 quits [Remote host closed the connection] |
| 01:36:20 | | Arcorann (Arcorann) joins |
| 01:45:50 | | Megame (Megame) joins |
| 02:01:54 | | Discant joins |
| 02:22:02 | | yay joins |
| 02:22:02 | | yay is now authenticated as yay |
| 02:22:47 | | DogsRNice (Webuser299) joins |
| 02:23:48 | <h2ibot> | Arkiver uploaded File:Scratch Logo.png: https://wiki.archiveteam.org/?title=File%3AScratch%20Logo.png |
| 02:59:53 | <h2ibot> | Wickedplayer494 uploaded File:Scratch1.4.png: https://wiki.archiveteam.org/?title=File%3AScratch1.4.png |
| 03:02:54 | <h2ibot> | Wickedplayer494 edited Scratch (+120, Imagery and navbox): https://wiki.archiveteam.org/?diff=48671&oldid=48651 |
| 03:15:14 | <pabs> | might be worth archiving this webOS archive? https://www.webosarchive.com/ https://news.ycombinator.com/item?id=31607318 |
| 03:15:52 | <pabs> | its spread over several domains and there are GitHub repos |
| 04:38:09 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:43:52 | | march_happy quits [Ping timeout: 245 seconds] |
| 04:44:32 | | march_happy (march_happy) joins |
| 05:12:00 | | Jake quits [Client Quit] |
| 05:36:01 | | drexler joins |
| 05:36:25 | <drexler> | So, is it possible to get a database index of OpenVerse? |
| 05:36:47 | <drexler> | https://rom1504.github.io/clip-retrieval/ |
| 05:36:57 | <drexler> | I want to put together a public domain version of LAION 400m |
| 05:37:02 | <drexler> | And that would save a lot of time |
| 06:44:45 | | march_happy quits [Remote host closed the connection] |
| 06:53:40 | | Jake (Jake) joins |
| 06:53:41 | | Jake quits [Client Quit] |
| 06:53:55 | | Jake (Jake) joins |
| 06:54:17 | | Jake quits [Client Quit] |
| 06:54:46 | | Jake (Jake) joins |
| 06:55:56 | | Megame quits [Client Quit] |
| 07:16:47 | | eroc1990 (eroc1990) joins |
| 07:41:07 | | eroc1990 quits [Client Quit] |
| 07:46:56 | | eroc1990 (eroc1990) joins |
| 07:54:14 | | jtagcat6 quits [Quit: Bye!] |
| 07:54:30 | | jtagcat6 (jtagcat) joins |
| 08:05:57 | | kiska quits [Quit: Ping timeout (120 seconds)] |
| 08:05:57 | | s-crypt quits [Quit: Ping timeout (120 seconds)] |
| 08:05:57 | | Ryz2 quits [Quit: Ping timeout (120 seconds)] |
| 08:06:58 | | kiska (kiska) joins |
| 08:07:07 | | s-crypt (s-crypt) joins |
| 08:07:08 | | Ryz2 (Ryz) joins |
| 08:08:42 | | ArchivalEfforts_ quits [Ping timeout: 265 seconds] |
| 08:09:13 | | ArchivalEfforts joins |
| 08:31:54 | | NotEggplant quits [Read error: Connection reset by peer] |
| 08:32:14 | | NotEggplant joins |
| 09:38:27 | | march_happy (march_happy) joins |
| 10:42:04 | | spirit quits [Quit: Leaving] |
| 11:22:23 | | march_happy quits [Remote host closed the connection] |
| 11:23:33 | | march_happy (march_happy) joins |
| 11:33:19 | | qwertyasdfuiopghjkl joins |
| 11:33:52 | | Discant quits [Ping timeout: 245 seconds] |
| 11:39:20 | | Discant joins |
| 12:17:12 | | Discant quits [Ping timeout: 245 seconds] |
| 12:53:53 | | Larsenv quits [Quit: ZNC 1.8.2+deb2build5 - https://znc.in] |
| 12:56:18 | | rellem quits [Remote host closed the connection] |
| 12:56:18 | | drexler quits [Remote host closed the connection] |
| 12:56:48 | | drexler joins |
| 12:57:08 | | rellem (chrismeller) joins |
| 12:57:31 | | rellem quits [Remote host closed the connection] |
| 12:58:38 | | rellem (chrismeller) joins |
| 12:59:01 | | rellem quits [Remote host closed the connection] |
| 13:00:08 | | rellem (chrismeller) joins |
| 13:00:12 | | BlueMaxima quits [Client Quit] |
| 13:00:31 | | rellem quits [Remote host closed the connection] |
| 13:01:38 | | rellem (chrismeller) joins |
| 13:02:01 | | rellem quits [Remote host closed the connection] |
| 13:02:23 | | rellem (chrismeller) joins |
| 13:03:54 | | HP_Archivist (HP_Archivist) joins |
| 13:42:18 | | tea joins |
| 13:46:04 | | rellem quits [Ping timeout: 265 seconds] |
| 13:57:37 | | Arcorann quits [Ping timeout: 245 seconds] |
| 15:15:00 | | march_happy quits [Ping timeout: 265 seconds] |
| 15:16:19 | | march_happy (march_happy) joins |
| 15:53:32 | | Larsenv (Larsenv) joins |
| 15:59:57 | | march_happy quits [Ping timeout: 265 seconds] |
| 16:10:32 | | Sluggs quits [Ping timeout: 245 seconds] |
| 16:11:02 | | Sluggs joins |
| 16:13:03 | | march_happy (march_happy) joins |
| 17:00:59 | <drexler> | SketchTheCow, speaking of AI art, you around? |
| 17:13:25 | | march_happy quits [Ping timeout: 265 seconds] |
| 17:13:47 | | march_happy (march_happy) joins |
| 17:24:59 | | mgwatts (mgwatts) joins |
| 17:40:00 | | march_happy quits [Ping timeout: 265 seconds] |
| 18:16:36 | | HP_Archivist quits [Ping timeout: 265 seconds] |
| 18:23:49 | <thuban> | i waited too long to archive something i knew might be time-sensitive. now it's gone and i have only myself to blame ._. |
| 18:23:59 | | tea quits [Ping timeout: 265 seconds] |
| 18:24:13 | <@JAA> | [x] I'm in this picture, and I don't like it. |
| 18:27:40 | <Ryz> | More archiving looty <#>; |
| 18:33:31 | | yay quits [Ping timeout: 265 seconds] |
| 19:29:24 | | yay joins |
| 19:29:24 | | yay is now authenticated as yay |
| 19:49:56 | | HP_Archivist (HP_Archivist) joins |
| 19:50:13 | | HP_Archivist quits [Remote host closed the connection] |
| 19:50:40 | | HP_Archivist (HP_Archivist) joins |
| 19:52:38 | <drexler> | thuban, many such cases lol |
| 19:54:02 | <thuban> | sad! |
| 20:00:45 | | Nulo quits [Remote host closed the connection] |
| 21:07:50 | <@arkiver> | thuban: don't beat yourself up too much about it! |
| 21:08:26 | <@arkiver> | it happens unfortunately (I missed blogos.com), let's move on, plenty of other archival jobs to run |
| 21:19:21 | | lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)] |
| 21:20:47 | | lennier1 (lennier1) joins |
| 21:24:07 | | mgwatts quits [Remote host closed the connection] |
| 21:53:41 | <Ryz> | More loot to find and archiveeeeeeee |
| 22:11:31 | <drexler> | I'm currently annoyed because WordPress doesn't publish the OpenVerse database index |
| 22:11:49 | <drexler> | Which, if I had and scraped it, would basically be my public domain LAION 400m right there |
| 22:12:14 | <drexler> | And I find this annoying because it's like...it's a Creative Commons backed project, WikiMedia publishes one, why doesn't OpenVerse lol |
| 22:12:27 | | sepro quits [Client Quit] |
| 22:13:50 | <drexler> | It's especially annoying because the biggest publisher/host of public domain imagery right now is...Flickr |
| 22:13:59 | <drexler> | And we all know Flickr isn't in great financial shape |
| 22:14:41 | <drexler> | If they go without making arrangements for stewardship of the data, we could easily lose one of the most valuable media collections in existence. |
| 22:17:04 | <drexler> | It's triply annoying because search methods were recently invented that are easy to implement and let us much more powerfully search these kinds of collections than we could before using deep learning, so it's not even like the old "eh, who would ever look through it?" argument holds water anymore. |
| 22:22:14 | <drexler> | Like, you can search by image and text using open source models to find things that aren't in a caption for the image at all. |
| 22:22:36 | <drexler> | And with some preprocessing it's lightweight enough to run on CPU |
| 22:23:50 | <drexler> | https://rom1504.github.io/clip-retrieval/ implements this |
| 22:28:55 | | sepro (sepro) joins |
| 22:35:29 | <drexler> | Then if you look at the OpenVerse API https://api.openverse.engineering/v1/#operation/image_search it says in the docs that they are trying to structurally prevent you from downloading the whole database |
| 22:36:06 | <@arkiver> | also ".engineering" |
| 22:36:35 | <drexler> | What? Why? *This is public domain data*, why is WordPress being allowed to build a moat around this, why is Creative Commons allowing it? It's not like Creative Commons needs WordPress to provide search from a technical perspective, you can see the link I just gave you gives you an index over 5 *billion* images. |
| 22:37:06 | <drexler> | In short, this is chickenshit, and I'm pissed. |
| 22:40:09 | <drexler> | arkiver, I didn't even notice that lol |
| 22:43:55 | <drexler> | By contrast, turning this stuff into an ML dataset is probably one of the easier ways to make sure it stays available to the public. Since ML guys will train on the whole 400 million images, raise holy hell if you try to silently wall off access to the full thing, stick the entire dataset up on academictorrents, etc |
| 22:45:17 | <drexler> | It's honestly just a better strategy than giving the whole thing to WordPress and having them pinky promise they won't follow their structural incentives to build a moat. |
| 22:48:37 | | march_happy (march_happy) joins |
| 22:58:27 | | sepro quits [Ping timeout: 245 seconds] |
| 23:01:27 | | Nulo joins |
| 23:30:51 | | BlueMaxima joins |
| 23:33:02 | | wickedplayer494 quits [Ping timeout: 245 seconds] |
| 23:33:24 | | wickedplayer494 joins |
| 23:58:07 | | sepro (sepro) joins |