00:01:09Arcorann (Arcorann) joins
00:06:18Ruthalas quits [Read error: Connection reset by peer]
00:06:20Ruthalas (Ruthalas) joins
00:43:00Mateon1 quits [Remote host closed the connection]
00:43:04Mateon1 joins
00:49:13TheTechRobo quits [Remote host closed the connection]
00:50:20TheTechRobo (TheTechRobo) joins
01:01:55march_happy quits [Remote host closed the connection]
01:03:22dm4v_ joins
01:05:39dm4v quits [Ping timeout: 265 seconds]
01:05:39dm4v_ is now known as dm4v
01:05:40dm4v quits [Changing host]
01:05:40dm4v (dm4v) joins
01:09:01march_happy (march_happy) joins
01:46:27Jake2 (Jake) joins
01:46:33Jake quits [Client Quit]
01:46:33coderobe quits [Quit: Ping timeout (120 seconds)]
01:46:33wyatt8740 quits [Client Quit]
01:46:33Ruthalas quits [Client Quit]
01:46:33igloo22225 quits [Client Quit]
01:46:33ave quits [Client Quit]
01:46:33JackThompson quits [Quit: Ping timeout (120 seconds)]
01:46:33Iki quits [Remote host closed the connection]
01:46:33swebb quits [Quit: ZNC 1.7.5+deb4 - https://znc.in]
01:46:33syntaxx quits [Client Quit]
01:46:33Jake2 is now known as Jake
01:46:43Iki joins
01:46:50JackThompson joins
01:47:15swebb joins
01:47:22igloo22225 (igloo22225) joins
01:47:22Ruthalas (Ruthalas) joins
01:47:25ave (ave) joins
01:47:27coderobe (coderobe) joins
01:47:38syntaxx (syntaxx) joins
01:48:54Iki quits [Remote host closed the connection]
01:49:04Iki joins
01:53:48wyatt8740 joins
01:56:54michaelblob_ quits [Read error: Connection reset by peer]
03:01:42bonga joins
03:01:45appstore joins
03:02:10<appstore>Remember the app store archival I talked about?
03:02:18<appstore>https://www.theverge.com/2022/4/23/23038870/apple-app-store-widely-remove-outdated-apps-developers
03:02:32<appstore>We need to archive these. Fast
03:02:54<appstore>They will be locked to people's purchases list
03:03:49<appstore>We need to create an IPA downloader and also download metadata like icon, reviews, description, and developer info
03:04:11<appstore>If we don't thousands of iOS apps will be nearly lost
03:04:18<appstore>And that includes games
03:04:33<appstore>And games exclusive to iOS that kids grew up on
03:04:59<nyany>it was discussed earlier and could probably use some additional commentary
03:05:13<appstore>They'll be crying in the future because only the console kids get to play their old games
03:05:51<appstore>iTunes old versions can download these but it's old
03:06:01<appstore>We need to code an archival tool
03:06:31<appstore>I remember it was me who talked about it
03:06:45<appstore>But now there's a cleanup of apps
03:06:49<appstore>https://www.theverge.com/2022/4/23/23038870/apple-app-store-widely-remove-outdated-apps-developers
03:07:18appstore quits [Remote host closed the connection]
03:16:04paul2520 quits [Remote host closed the connection]
03:24:28bonga quits [Remote host closed the connection]
03:27:09bonga joins
03:31:29<atphoenix>"They'll be crying in the future because only the console kids get to play their old games" not all consoles. Mostly just the old consoles that didn't depend on downloaded content or activation.
03:49:37march_happy quits [Remote host closed the connection]
03:55:13march_happy (march_happy) joins
03:57:36march_happy quits [Read error: Connection reset by peer]
03:57:45march_happy (march_happy) joins
04:00:01treora quits [Quit: blub blub.]
04:02:12treora joins
04:16:56march_happy quits [Read error: Connection reset by peer]
04:20:05march_happy (march_happy) joins
04:24:18pabs quits [Ping timeout: 265 seconds]
04:31:59bonga quits [Read error: Connection reset by peer]
04:32:02march_happy quits [Ping timeout: 265 seconds]
04:32:08march_happy (march_happy) joins
04:33:37bonga joins
04:37:52pabs (pabs) joins
04:39:03sec^nd quits [Remote host closed the connection]
04:40:08sec^nd (second) joins
05:12:26Arcorann quits [Ping timeout: 240 seconds]
05:25:23<lennier1>Hmmm, it does sound like a lot of apps may be removed in the near future. I wonder if there's a way to automate whatever the old version of iTunes does to download them. Not sure if there's any way around the need for an account, if that matters.
05:29:53UstreamHate joins
05:31:01<UstreamHate>Hey, I was wondering if anyone had any advice on archiving Ustream streams or recordings
05:39:04UstreamHate quits [Remote host closed the connection]
05:47:03pabs quits [Client Quit]
06:01:40michaelblob (michaelblob) joins
06:07:35<atphoenix>they left, but Open Broadcaster Software https://obsproject.com/ works well as a generic screenrecorder. Screenrecording isn't ideal but may be more accessible than some other methods that try to get the original datastream (especially if there aren't specialized tools to handle stream source). It is definitely better than falling back to wired analog output/analog recording. It is much much better than the actual last resort of
06:07:35<atphoenix>pointing a camera at a screen.
06:12:34sepro quits [Ping timeout: 265 seconds]
06:14:53JackThompson quits [Client Quit]
06:15:34sepro (sepro) joins
06:19:33pabs (pabs) joins
06:28:36AK quits [Remote host closed the connection]
06:35:08JackThompson joins
06:40:08march_happy quits [Read error: Connection reset by peer]
06:40:13march_happy (march_happy) joins
06:42:33march_happy quits [Read error: Connection reset by peer]
06:43:58march_happy (march_happy) joins
06:55:28BlueMaxima quits [Read error: Connection reset by peer]
07:03:09sepro quits [Read error: Connection reset by peer]
07:04:22sepro (sepro) joins
07:08:05<@JAA>yt-dlp would always be my first attempt, and it does appear to have an extractor for Ustream (aka IBM Watson Media).
07:09:57sepro3 (sepro) joins
07:10:05sepro quits [Ping timeout: 265 seconds]
07:10:05sepro3 is now known as sepro
07:13:12AK (AK) joins
07:13:14AK quits [Remote host closed the connection]
07:13:58test joins
07:14:11test quits [Remote host closed the connection]
07:16:22sepro quits [Ping timeout: 265 seconds]
07:22:30AK (AK) joins
07:22:33AK quits [Remote host closed the connection]
07:37:55AK (AK) joins
07:37:57AK quits [Remote host closed the connection]
08:11:26bonga quits [Remote host closed the connection]
08:11:39bonga joins
08:24:18Arcorann (Arcorann) joins
10:06:54NinCollin quits [Ping timeout: 252 seconds]
10:14:44Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
10:15:55Terbium joins
10:21:24march_happy quits [Remote host closed the connection]
10:26:15march_happy (march_happy) joins
10:40:03AK (AK) joins
11:09:35dm4v quits [Client Quit]
11:11:12dm4v joins
11:11:14dm4v quits [Changing host]
11:11:14dm4v (dm4v) joins
12:10:44Hackerpcs quits [Quit: Hackerpcs]
12:11:07Hackerpcs (Hackerpcs) joins
12:17:46Sluggs quits [Ping timeout: 240 seconds]
12:24:47Sluggs joins
12:30:46Sluggs quits [Ping timeout: 240 seconds]
12:31:10Sluggs joins
12:53:12<h2ibot>TheTechRobo edited Mobile Phone Applications (-70, Properly link to other wiki articles): https://wiki.archiveteam.org/?diff=48489&oldid=48486
12:53:13<h2ibot>TheTechRobo edited Telegram (+78, link to tracker; elaborate on snscrape bit): https://wiki.archiveteam.org/?diff=48490&oldid=48461
12:53:14<h2ibot>TheTechRobo edited Academic Earth (+270, 2022): https://wiki.archiveteam.org/?diff=48491&oldid=28783
12:55:17qwertyasdfuiopghjkl joins
13:02:13<h2ibot>Kore edited WebCite (+360, endangered – "DB Connection failed". Also, the…): https://wiki.archiveteam.org/?diff=48492&oldid=40992
13:03:13<h2ibot>Spaffel edited Discord (+171, Added information about Search-Cord): https://wiki.archiveteam.org/?diff=48493&oldid=47895
13:04:03bruce joins
13:09:14<h2ibot>JustAnotherArchivist edited WebCite (+229, Link to collection, note about changes in S3…): https://wiki.archiveteam.org/?diff=48494&oldid=48492
13:10:29<@JAA>Apparently WebCite's S3 bucket lost a bunch of files in the past 3 years. All of these are gone, for example: https://web.archive.org/web/*/https://s3-us-west-2.amazonaws.com/webcitation/00*
13:23:17<h2ibot>Barto created Elections/2022 May Swiss votes (+1435, Create page - 15-05-2022 swiss votations): https://wiki.archiveteam.org/?title=Elections/2022%20May%20Swiss%20votes
13:23:18<h2ibot>JustAnotherArchivist changed the user rights of User:Barto
13:23:26<Barto>:-)
13:24:54<Barto>not sure about some terms between initiative commitee and referendum commitee, i'm confident you'll sort it out :-)
13:26:56HP_Archivist (HP_Archivist) joins
13:31:18<h2ibot>JustAnotherArchivist edited Elections (+98, Add 2022-05-15 Swiss votes): https://wiki.archiveteam.org/?diff=48496&oldid=48438
13:41:44sepro (sepro) joins
13:57:21march_happy quits [Ping timeout: 252 seconds]
13:57:41march_happy (march_happy) joins
14:15:25march_happy quits [Ping timeout: 265 seconds]
14:16:06march_happy (march_happy) joins
14:25:57march_happy quits [Ping timeout: 252 seconds]
14:26:35march_happy (march_happy) joins
14:31:51HP_Archivist quits [Ping timeout: 265 seconds]
14:38:03Arcorann quits [Ping timeout: 252 seconds]
16:10:13JackThompson7 joins
16:11:33JackThompson quits [Ping timeout: 252 seconds]
16:11:34JackThompson7 is now known as JackThompson
16:32:24systwi quits [Read error: Connection reset by peer]
16:32:36fuzzy8021 quits [Read error: Connection reset by peer]
16:33:29eroc1990 quits [Client Quit]
16:33:40systwi (systwi) joins
16:33:50lun4 quits [Client Quit]
16:33:51eroc1990 (eroc1990) joins
16:34:11fuzzy8021 (fuzzy8021) joins
16:34:17lun4 (lun4) joins
16:35:51Dalek quits [Client Quit]
16:36:04Dalek (Dalek) joins
16:43:19nepeat quits [Ping timeout: 265 seconds]
16:45:26nepeat (nepeat) joins
17:06:24fuzzy8021 quits [Read error: Connection reset by peer]
17:07:23eroc1990 quits [Client Quit]
17:07:35lun4 quits [Client Quit]
17:08:19nepeat quits [Client Quit]
17:08:37lun4 (lun4) joins
17:21:23nepeat (nepeat) joins
17:21:36Megame (Megame) joins
17:22:30march_happy quits [Ping timeout: 252 seconds]
17:24:00eroc1990 (eroc1990) joins
18:06:19fuzzy8021 (fuzzy8021) joins
18:46:47evan joins
18:49:39<evan>Hello all! New to archiving, I have a website that I am interested to start archiving because It's unmaintained for 7 years and it's the sweetest place on earth. I'm doing research into how this community works and thought I would join the IRC as I keep learning
18:50:10<evan>Not asking for help at this point in time
18:53:35hackbug quits [Remote host closed the connection]
18:53:51<evan>Also the irony that the quote of the moment leads to a 404 page
18:55:56hackbug (hackbug) joins
19:01:15<fuzzy8021>if you list the website someone might be able to run it
19:03:48<evan>I'm very much a do-it-yourselfer, I mostly wanted to officiate myself as a member of this community first :)
19:04:50<fuzzy8021>the thing i see mentioned usually is grab-site but i dont do any of that kind of stuff myself
19:09:51Minkafighter quits [Quit: The Lounge - https://thelounge.chat]
19:10:31Minkafighter joins
19:25:06lennier1 quits [Ping timeout: 265 seconds]
19:27:06lennier1 (lennier1) joins
19:32:48<TheTechRobo>evan: Well, if you ever do need help, we're right here. :-)
19:33:26TheTechRobo quits [Remote host closed the connection]
19:34:32TheTechRobo (TheTechRobo) joins
19:36:23LeGoupil joins
19:46:12<@JAA>evan: grab-site for your personal archive, or ask us for a public archive that will be available in the Wayback Machine forever. :-)
19:49:15<h2ibot>JustAnotherArchivist edited Main Page (+4, Fix TIME article link): https://wiki.archiveteam.org/?diff=48497&oldid=47671
19:53:24<@JAA>content.time.com has sitemaps, but they're weird and somewhat broken. Might be a good idea to archive it.
19:53:42<TheTechRobo>#Y or ArchiveBot do you think?
19:54:18<@JAA>Probably needs something special due to how broken the links are.
19:54:40<@JAA>Or extraction + !ao <.
19:55:21<TheTechRobo>There's a sitemap? /, /sitemap.xml don't work, and /robots.txt doesn't have any
19:55:31<TheTechRobo>oh nvm
19:55:37<TheTechRobo> /html-sitemap/
19:56:42<TheTechRobo>Nevermind, that's time.com.
19:56:51<TheTechRobo>JAA: Where did you get the content.time.com sitemaps?
19:58:05<evan>If there is data that is always formatted in the same way, would it be better to store that data as JSON and then simply template it? Would be better for storage space I think
19:59:01<evan>IE a user page will always have class="description", so why not just extract that to { description: "" }
19:59:25<TheTechRobo>You can, although saving in WARC format is preferred for archival
20:01:51<Sanqui>evan: when archiving, you want to do minimal processing at capture time, and only derive the data into other formats at a later point
20:02:38<TheTechRobo>If you store in WARC or some other raw format, you can always change to another format later. But you can't change from processed to unprocessed, unless the processing is lossless (which in this case it isn't)
20:03:02<Sanqui>transforming the data while downloading it is almost always the wrong move. You can easily confuse matters by using different terminology, omit capturing exotic data types (resulting in silently lost data), and essentially risk losing being able to reconstruct original representation entirely
20:03:51<evan>thanks everyone, and thanks to Sanqui for explaining not only that it is a bad idea but why it is a bad idea
20:06:03<Sanqui>additionally, I'll note that storage space tends to be a smaller problem with proper compression techniques, which work especially well on data that can be "templated" :)
20:06:26<evan>ah right, I forgot compression is magic!
20:19:08<evan>Will wiki edits always need to be reviewed or is there a process like wikipedia's autoconfirm?
20:19:49<@JAA>TheTechRobo: http://content.time.com/time/static/sitemap/1.html but the index is a 404 etc.
20:20:10<@JAA>evan: Wiki edits from non-whitelisted accounts go to a moderation queue, yeah.
20:20:12<TheTechRobo>evan: Some accounts are manually approved to auto-accept.
20:20:45<TheTechRobo>JAA: Don't those links just point to other subdomains...?
20:21:04Megame quits [Client Quit]
20:21:18<@JAA>TheTechRobo: The ones for 2013/2014 do, but older ones don't. As I said, it's a weird sitemap.
20:21:44<@JAA>Well, some of the older ones don't.
20:21:48<TheTechRobo>Oh, I wasn't changing the number.
20:21:54<@JAA>Those sitemaps are how I found that GeoCities article again.
20:22:05<TheTechRobo>It was on the talk page, FWIW.
20:22:16<TheTechRobo>I just didn't know how to edit the "Quote of the moment"
20:23:00<@JAA>The main page is protected, only admins can edit it.
20:23:09<TheTechRobo>ah
20:23:20<h2ibot>Evan edited Reddit (+205): https://wiki.archiveteam.org/?diff=48498&oldid=48472
20:26:38<@JAA>Anyway, yeah, the sitemaps are majorly broken. http://content.time.com/time/static/sitemap/4.html is the index for 2011, but the first page for October 2011 exists under both http://content.time.com/time/static/sitemap/4_10_1.html (linked from there) and http://content.time.com/time/static/sitemap/1_10_1.html with different contents...
20:42:25kayvon2008 joins
20:42:51<kayvon2008>can you accept and edit a bit my revision on https://wiki.archiveteam.org/index.php?title=Current_Projects
20:43:16<kayvon2008>and https://wiki.archiveteam.org/index.php?title=Geekbench_Browser
20:43:28kayvon2008 quits [Remote host closed the connection]
20:53:54<@JAA>Nope, and if you keep disappearing within seconds, that won't change either.
21:06:23<Doranwen>LOL
21:06:51lennier2 joins
21:08:55LeGoupil quits [Remote host closed the connection]
21:09:06lennier1 quits [Ping timeout: 252 seconds]
21:09:12lennier2 is now known as lennier1
21:22:08Larsenv quits [Quit: ZNC 1.8.2+deb2build5 - https://znc.in]
21:24:49Larsenv (Larsenv) joins
21:51:21Megame (Megame) joins
21:51:32Megame quits [Remote host closed the connection]
21:52:38Megame (Megame) joins
21:53:02Megame quits [Remote host closed the connection]
21:57:31Megame (Megame) joins
22:00:27march_happy (march_happy) joins
22:10:42wyatt8740 quits [Ping timeout: 252 seconds]
22:10:53wyatt8740 joins
23:07:30HP_Archivist (HP_Archivist) joins
23:28:42BlueMaxima joins
23:29:11march_happy quits [Ping timeout: 265 seconds]
23:58:11HP_Archivist quits [Ping timeout: 265 seconds]