00:01:29Eighty (Eighty) joins
00:08:36Eighty quits [Ping timeout: 250 seconds]
00:10:35Eighty (Eighty) joins
00:17:42mary quits [Ping timeout: 250 seconds]
00:48:33mary joins
01:03:53HP_Archivist quits [Ping timeout: 258 seconds]
01:16:24Jon quits [Quit: ZNC - http://znc.in]
01:44:27Hyenadae quits [Ping timeout: 244 seconds]
02:01:05<Ryz>Heya folks, I'm wondering if a potential archive project of Sourceforge would be doable, based on my chattering about it in #archivebot (basically jobs under https://sourceforge.net/projects/ and not subdomains that are basically project userpages are meaningless archives because JS);
02:01:13<Ryz>Reasons include past controversies mentioned in https://en.wikipedia.org/wiki/SourceForge#Controversies such as 'DevShare adware' and 'Project hijackings and bundled malware', both of these have been resolved since 2016; betting this drove a bunch of people and their projects elsewhere like GitHub and other websites
02:02:09<@JAA>++
02:04:24Mineroboter_ joins
02:05:02<OrIdow6>I don't see why not
02:05:13Mineroboter quits [Ping timeout: 258 seconds]
02:05:20<OrIdow6>I think they have some complicated system of download mirrors, could complicate reply there
02:05:57<OrIdow6>"I don't see..." technically, obviously I don't know what IA would accept etc
02:05:58<@JAA>Yeah, their entire interface is awful.
02:06:18<@JAA>But it's been dying for years, so it should definitely be archived.
02:10:47<@JAA>It's a pain though. There are files, issues, mailing lists, discussion forums, wikis, news, source code repos with every SCM imaginable (at *least* SVN, Git, Mercurial, CVS, and Bazaar), rsync servers, and more.
02:15:13<@JAA>The site also doesn't always make that obvious. For example, https://sourceforge.net/projects/allura/ doesn't say anything about a Git repo and https://sourceforge.net/p/allura/code/ is a 404, but it exists and can be cloned directly or via rsync: `git clone https://git.code.sf.net/p/allura/git` and `rsync -av git.code.sf.net::p/allura/git.git .`
02:15:22<Ryz>Additional signs of dying, there hasn't been a new post in https://sourceforge.net/blog/ since 2020 December; there's supposed to be recurring posts of 'Projects of the Week' along with "Project of the Month', which has also stopped since 2020 December: https://sourceforge.net/blog/category/potm/
02:15:55<Ryz>There was supposed to be 2021 January post as per https://sourceforge.net/blog/community-choice-project-month-vote-january-2021/ but it hasn't happened at all
02:16:21<Ryz>The last Twitter post on https://twitter.com/sourceforge is 2020 December 27
02:16:50<@JAA>That's just a couple months.
02:17:04<@JAA>Not particularly concerning.
02:17:38<@JAA>But the platform has been on the decline for the better part of a decade.
02:19:54<Ryz>The Twitter account, maybe not considered concerning, seeing that it has made made a post almost every day, sometimes more than 1; but the stopping of 'Projects of the Week' and 'Project of the Month' is further hints of their decline
02:23:26<OrIdow6>Yeah, that is a bad sign
02:23:56<Ryz>!ignore 94szmo521qbojl79lkzhqz6hq ^https?://app\.hi-george\.com/.*\.js$
02:23:59<Ryz>Oops
02:25:47Hyenadae joins
02:27:28<OrIdow6>Wonder how rsync and similar would be stored; I don't think there's a warc standard for that
02:28:22<@JAA>Probably tars? Yeah, doesn't really fit into WARCs.
02:34:19<@hook54321>https://wiki.archiveteam.org/index.php/SourceForge#Archiving
02:36:00<@JAA>'We attempted to contact them but got no reply.' Yeah, great.
02:39:32<@hook54321>not sure if that person even works there anymore https://sourceforge.net/u/burley/profile/
02:40:06<@hook54321>same with the other person https://sourceforge.net/u/rgaloppini/profile/
02:44:11<OrIdow6>"David Burley" is either working 11 jobs at once or a common name
02:44:29<OrIdow6>Here's the second one https://robertogaloppini.net/ - "After serving as Senior Director of Business Development at SourceForge for over 4 years, in 2016 he started a new company..."
02:46:33<OrIdow6>Last activity of both was in 4 days of each other, a few months before it was sold
02:49:07<atphoenix>if SourceForge dies then it becomes SourceForget
02:49:49<Ryz>I smell an IRC channel name :p
02:51:52<atphoenix>:D and the end of that could be pronounced as 'forget' or as 'forjhay' or even as 'For Get'
02:52:36<OrIdow6>Apparently the channel was called #coldstorage in 2015 - https://archive.fart.website/bin/irclogger_log/archiveteam-bs?date=2015-06-04,Thu&sel=47#l43
02:53:10<OrIdow6>Oh, that's on th ewiki anyway
02:53:34<@JAA>Where's the pun in that?
02:54:35<tech234a>Forges are hot?
02:54:41<OrIdow6>They were a lot more primitive back in 2015
03:09:25tzt quits [Quit: WeeChat 2.3]
03:11:15<@JAA>#sourceforget exists now.
03:11:17tzt joins
03:16:26<purplebot>SourceForge edited by Hook54321 (+25, updated irc channel) just now -- https://www.archiveteam.org/?diff=46443&oldid=28773
03:17:53HP_Archivist (HP_Archivist) joins
03:44:26<purplebot>SourceForge edited by Hook54321 (-30) 23 minutes ago -- https://www.archiveteam.org/?diff=46444&oldid=46443
03:47:01qw3rty_ joins
03:50:38qw3rty__ quits [Ping timeout: 258 seconds]
04:38:06Sylirana (Sylirana) joins
05:26:45vela quits [Quit: Ping timeout (120 seconds)]
05:26:48linuxgemini quits [Quit: Ping timeout (120 seconds)]
05:26:48lun4 quits [Quit: Ping timeout (120 seconds)]
05:27:05lun4 (lun4) joins
05:27:05linuxgemini (linuxgemini) joins
05:27:10igloo22225 quits [Quit: Ping timeout (120 seconds)]
05:28:43vela (vela) joins
05:44:24<OrIdow6>Ryz: Did you see whether tokyo.whatsin.jp can be archived? Shuts down end of month
05:51:15<Ryz>For those not in #archivebot - I have ran a job for this above link ^
06:11:25<purplebot>Deathwatch edited by JustAnotherArchivist (+60, /* 2021 */ Add ref for aimix-BBS) just now -- https://www.archiveteam.org/?diff=46446&oldid=46441
06:16:27Arcorann (Arcorann) joins
06:22:34godane quits [Ping timeout: 250 seconds]
06:30:29Arcorann quits [Ping timeout: 258 seconds]
07:22:08hooway joins
07:39:58Zopolis4 (Zopolis4) joins
07:44:31Arcorann (Arcorann) joins
07:55:14Jens quits [Killed (NickServ (GHOST command used by jens_!~jens@hackint/user/JENS))]
07:55:31JensRex (JensRex) joins
08:09:00HackMii quits [Ping timeout: 258 seconds]
08:09:21HackMii (hacktheplanet) joins
08:13:39Arcorann_ joins
08:16:06BlueMaxima quits [Read error: Connection reset by peer]
08:16:40Arcorann quits [Ping timeout: 258 seconds]
08:25:38Arcorann (Arcorann) joins
08:28:10Arcorann_ quits [Ping timeout: 258 seconds]
10:13:12FriarGiuseppe quits [Ping timeout: 258 seconds]
10:18:27Arcorann_ joins
10:21:15FriarGiuseppe joins
10:22:12Arcorann quits [Ping timeout: 250 seconds]
10:23:36Giuseppe joins
10:25:40FriarGiuseppe quits [Ping timeout: 250 seconds]
10:29:39Giuseppe quits [Remote host closed the connection]
10:29:51Giuseppe joins
10:32:09Giuseppe quits [Remote host closed the connection]
10:32:24Giuseppe joins
10:39:58aphitex2 quits [Ping timeout: 250 seconds]
10:54:04ragu joins
11:07:50igloo22225 (igloo22225) joins
11:10:09ragu quits [Client Quit]
11:38:01masterX244 (masterX244) joins
11:40:26<masterX244>noticed on the archivebot job of the defiance forum that it captures every single-post link on the threads, too. (didnt knew of the archivebot job when i saw the reddit post on r/archiveteam) had a grab-site crawl running in parallel which captured the threads without those single-post links which already finished
12:21:18yano1 quits [Quit: WeeChat, The Better IRC Client, https://weechat.org/]
12:21:27yanome quits [Quit: The Lounge - https://thelounge.chat]
12:23:13yanome (yano) joins
12:23:37yano (yano) joins
12:23:38yano quits [Client Quit]
12:24:10yano (yano) joins
12:24:50Arcorann_ quits [Ping timeout: 250 seconds]
12:26:43Arcorann_ joins
12:27:11masterX244 quits [Ping timeout: 244 seconds]
12:32:46masterX244 joins
12:58:36fuzzy8021 quits [Read error: Connection reset by peer]
13:00:51fuzzy8021 joins
13:00:52fuzzy8021 quits [Changing host]
13:00:52fuzzy8021 (fuzzy8021) joins
13:04:23Sylirana quits [Ping timeout: 244 seconds]
13:06:03Sylirana (Sylirana) joins
13:08:23HackMii quits [Ping timeout: 258 seconds]
13:20:13fuzzy802 joins
13:20:14fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))]
13:20:14fuzzy802 is now known as fuzzy8021
13:20:15fuzzy8021 quits [Changing host]
13:20:15fuzzy8021 (fuzzy8021) joins
13:20:23fuzzy8021 quits [Excess Flood]
13:20:47fuzzy8021 joins
13:20:47fuzzy8021 quits [Changing host]
13:20:47fuzzy8021 (fuzzy8021) joins
13:36:39HackMii (hacktheplanet) joins
13:52:26masterX24460 joins
13:52:26masterX244 quits [Ping timeout: 244 seconds]
13:53:11masterX24460 is now known as masterX244
13:58:45masterX244 quits [Remote host closed the connection]
14:01:40nerdguy1138 quits [Ping timeout: 258 seconds]
14:14:38ragu__ joins
14:17:14nerdguy1138 (nerdguy1138) joins
14:18:09ragu_ quits [Ping timeout: 258 seconds]
15:13:13godane joins
15:19:52Arcorann_ quits [Ping timeout: 258 seconds]
15:30:27mutantmonkey quits [Remote host closed the connection]
15:30:45mutantmonkey (mutantmonkey) joins
15:40:16godane quits [Ping timeout: 250 seconds]
15:53:21Wingy7 (Wingy) joins
15:54:22Wingy quits [Ping timeout: 258 seconds]
15:54:22Wingy7 is now known as Wingy
17:35:42LeGoupil joins
18:02:46DogsRNice (Webuser299) joins
18:08:19Daloader joins
18:17:52<tzt>http://www.opengrey.eu/ is shutting down "before the summer"
18:19:52<tzt>the sites sql is available
18:23:06HP_Archivist quits [Ping timeout: 258 seconds]
18:33:30HP_Archivist (HP_Archivist) joins
19:06:57LeGoupil quits [Client Quit]
19:34:45FriarGiuseppe joins
19:35:10Giuseppe quits [Ping timeout: 258 seconds]
20:01:01Atom joins
20:01:37Atom-- quits [Ping timeout: 258 seconds]
20:04:09spirit quits [Client Quit]
20:16:44HP_Archivist quits [Ping timeout: 250 seconds]
20:18:37HP_Archivist (HP_Archivist) joins
20:50:32Daloader quits [Ping timeout: 250 seconds]
21:04:50nertzy__ joins
21:07:56nertzy_ quits [Ping timeout: 258 seconds]
21:39:49nertzy_ joins
21:43:12nertzy__ quits [Ping timeout: 258 seconds]
22:14:26<purplebot>Deathwatch edited by JustAnotherArchivist (+502, /* 2021 */ Add PS3/PSP/PS Vita …) just now -- https://www.archiveteam.org/?diff=46447&oldid=46446
22:20:26useretail_ joins
22:20:44<useretail_>how much space would one need to mirror psn-store?
22:21:01Arcorann_ joins
22:23:18<@EggplantN>not sure at this time, but the way our system works we have plenty of storage
22:23:37<@EggplantN>We have over 100TB available, and thats purely for staging before sending to the IA
22:24:29<useretail_>is there any way to retrieve the data after it's being sent to archive.org?
22:24:59<useretail_>i'm asking because of copyrights n stuff
22:24:59<@JAA>Yeah, all our archives are publicly accessible. https://archive.org/details/archiveteam
22:26:02<useretail_>haven't you received any copyright claims?
22:26:44Sylirana quits [Read error: Connection reset by peer]
22:27:51Sylirana (Sylirana) joins
22:29:22hooway quits [Client Quit]
22:30:38<useretail_>i just don't to waste anyone's time here. these files are not going to last long. and i have to be sure that the data will be safe. because to be honest, i'm not sure that IA will allow open access to those files after a few copyright claims and if the situation does really look like this, i'll have to to grab them myself
22:31:44<useretail_>i'm thinking about 40-60TB
22:32:42<@JAA>If you want to be sure to always have access to everything, you'll need to keep a copy yourself, yes. That applies to everything, not just IA. Nobody knows what would happen with this dataset.
22:58:39FriarGiuseppe quits [Remote host closed the connection]
22:59:29FriarGiuseppe joins
23:23:30Arcorann_ quits [Ping timeout: 250 seconds]
23:34:35onetruth joins
23:41:45lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)]
23:43:32lennier1 (lennier1) joins
23:51:09BlueMaxima joins