| 00:12:18 | | wyatt8740 joins |
| 00:12:51 | | wyatt8750 quits [Ping timeout: 265 seconds] |
| 00:14:29 | <lennier1> | With the recent news about Apple removing apps that don't get updated, how difficult would it be to archive https://apps.apple.com ? |
| 00:15:45 | <lennier1> | That would get a lot of metadata about the apps, but not the .ipa files themselves, though it would also be useful for identifying apps, seeing when they were last updated, whether they're free or paid, etc. |
| 00:16:26 | <TheTechRobo> | lennier1: The individual app pages at least don't seem to need JS |
| 00:17:24 | <TheTechRobo> | However, the review page only shows a few with no option that I can see to load more. |
| 00:17:42 | <TheTechRobo> | Not sure abt ratelimiting |
| 00:24:41 | <lennier1> | Yeah, I guess it just shows 10 reviews. They also have multiple regions. i.e. https://apps.apple.com/us/app/maximus-2/id1554432924 https://apps.apple.com/de/app/maximus-2/id1554432924 |
| 00:30:27 | <lennier1> | You can get to a page with just the app ID. https://apps.apple.com/us/app/id447188370 |
| 00:31:53 | <lennier1> | Is there a good way to discover the app IDs? They go up to at least the low billions, but I believe there are just about a couple of million apps. |
| 00:32:07 | <TheTechRobo> | search engines? |
| 00:32:14 | <TheTechRobo> | there's also probably a discovery page |
| 00:34:30 | <lennier1> | The main page has some links, and then each app has links to similar apps, apps by the same developer, etc. But I don't know if that would get to all the obscure ones. |
| 00:42:14 | | benjins is now authenticated as benjins |
| 00:49:39 | <thuban> | ok, my brute force scan of glencoe.mheducation.com sites is running |
| 00:50:31 | <thuban> | i did it in the maximally lazy way, so it'll take about 30 days, but with shutdown on 30 june that should leave plenty of time for archiving (there appear to be few sites and they appear to be small) |
| 00:52:04 | <thuban> | the sites themselves use javascript, but should be crawlable without it |
| 00:52:58 | | Mateon1 quits [Ping timeout: 265 seconds] |
| 00:53:17 | | Mateon1 joins |
| 00:54:11 | | Megame quits [Client Quit] |
| 00:56:48 | <thuban> | (some teachers' resources are "password-protected"... using client-side javascript. everything seems to be reachable through the sitemaps) |
| 00:59:37 | <thuban> | the shutdown announcement is for "glencoe.mheducation.com and all of its associated sites"; i'm not sure whether this implies there's content other than the sites associated with the domain, but i haven't been able to find any |
| 01:02:38 | | dm4v quits [Client Quit] |
| 01:02:46 | | dm4v joins |
| 01:02:49 | | dm4v is now authenticated as dm4v |
| 01:02:49 | | dm4v quits [Changing host] |
| 01:02:49 | | dm4v (dm4v) joins |
| 01:09:03 | | march_happy quits [Ping timeout: 265 seconds] |
| 01:10:34 | | march_happy (march_happy) joins |
| 01:12:47 | | DiscantX quits [Ping timeout: 265 seconds] |
| 01:16:35 | | HP_Archivist (HP_Archivist) joins |
| 01:20:45 | | nikow1 joins |
| 01:20:54 | | Mateon2 joins |
| 01:21:38 | | BlueMaxima_ joins |
| 01:21:41 | | Jake4 (Jake) joins |
| 01:22:01 | | BlueMaxima_ quits [Remote host closed the connection] |
| 01:22:07 | | IDK_ quits [Client Quit] |
| 01:22:07 | | BlueMaxima quits [Remote host closed the connection] |
| 01:22:07 | | nikow quits [Remote host closed the connection] |
| 01:22:07 | | adamus1red quits [Client Quit] |
| 01:22:07 | | le0n quits [Client Quit] |
| 01:22:07 | | Mateon1 quits [Remote host closed the connection] |
| 01:22:07 | | Jake quits [Client Quit] |
| 01:22:07 | | mikael quits [Client Quit] |
| 01:22:07 | | dm4v quits [Client Quit] |
| 01:22:07 | | Mateon2 is now known as Mateon1 |
| 01:22:08 | | Jake4 is now known as Jake |
| 01:22:10 | | dm4v joins |
| 01:22:11 | | msrn_ joins |
| 01:22:20 | | dm4v is now authenticated as dm4v |
| 01:22:20 | | dm4v quits [Changing host] |
| 01:22:20 | | dm4v (dm4v) joins |
| 01:22:43 | | le0n (le0n) joins |
| 01:22:43 | | adamus1red (adamus1red) joins |
| 01:22:53 | | IDK_ joins |
| 01:23:07 | | BlueMaxima_ joins |
| 01:23:31 | | BlueMaxima_ quits [Remote host closed the connection] |
| 01:24:37 | | BlueMaxima_ joins |
| 01:25:01 | | BlueMaxima_ quits [Remote host closed the connection] |
| 01:26:07 | | BlueMaxima_ joins |
| 01:26:31 | | BlueMaxima_ quits [Remote host closed the connection] |
| 01:26:45 | | BlueMaxima_ joins |
| 01:44:48 | | Arcorann quits [Ping timeout: 252 seconds] |
| 01:55:15 | | HP_Archivist quits [Client Quit] |
| 01:59:31 | <h2ibot> | Hoarderhank edited Alive... OR ARE THEY (+466, Added The Correspondent): https://wiki.archiveteam.org/?diff=48570&oldid=48444 |
| 02:36:48 | | tzt quits [Remote host closed the connection] |
| 02:37:12 | | tzt (tzt) joins |
| 03:04:34 | | jacobk quits [Ping timeout: 265 seconds] |
| 03:14:06 | | DiscantX joins |
| 03:19:55 | | BlueMaxima_ quits [Client Quit] |
| 03:22:13 | | benjinsmith joins |
| 03:24:44 | | benjins quits [Ping timeout: 265 seconds] |
| 03:28:13 | | jacobk joins |
| 03:38:39 | | march_happy quits [Ping timeout: 252 seconds] |
| 03:39:13 | | march_happy (march_happy) joins |
| 03:58:13 | | march_happy quits [Ping timeout: 265 seconds] |
| 03:59:02 | | march_happy (march_happy) joins |
| 04:06:46 | | Arcorann (Arcorann) joins |
| 04:22:33 | <Arcorann> | Question: is there a project related to the Philippines' current situation? |
| 04:28:00 | <Frogging101> | What situation is that? |
| 04:34:03 | | kn100 quits [Quit: https://kn100.me :)] |
| 04:34:26 | | kn100 joins |
| 04:45:07 | <Ryz> | Think it might be election related |
| 05:34:09 | | jacobk quits [Ping timeout: 252 seconds] |
| 05:58:17 | | jacobk joins |
| 06:04:44 | | Atom-- joins |
| 06:06:36 | | Atom quits [Ping timeout: 252 seconds] |
| 06:21:02 | | Atom joins |
| 06:23:18 | | michaelblob quits [Read error: Connection reset by peer] |
| 06:24:12 | | Atom-- quits [Ping timeout: 252 seconds] |
| 06:26:40 | | michaelblob (michaelblob) joins |
| 06:45:27 | | DiscantX quits [Ping timeout: 265 seconds] |
| 07:06:35 | | Shjosan quits [Ping timeout: 265 seconds] |
| 07:30:38 | | systwi__ (systwi) joins |
| 07:31:18 | | systwi quits [Ping timeout: 252 seconds] |
| 07:35:29 | <thuban> | whoops! those 10-digit identifiers are isbn-10s, which means (among other things) that the final 'digit' can be an "x": http://glencoe.mheducation.com/sites/007895312x/ http://glencoe.mheducation.com/sites/007874637x/ http://glencoe.mheducation.com/sites/007873830x/ |
| 07:36:05 | <thuban> | not a problem, just have to make sure i scan those as well |
| 08:30:20 | | benjinsmith quits [Ping timeout: 265 seconds] |
| 09:09:29 | | shoghicp joins |
| 09:09:29 | | shoghicp is now authenticated as shoghicp |
| 09:09:29 | | shoghicp quits [Changing host] |
| 09:09:29 | | shoghicp (shoghicp) joins |
| 09:51:00 | | shoghicp quits [Ping timeout: 252 seconds] |
| 09:52:10 | | shoghicp joins |
| 09:52:11 | | shoghicp is now authenticated as shoghicp |
| 09:52:11 | | shoghicp quits [Changing host] |
| 09:52:11 | | shoghicp (shoghicp) joins |
| 09:55:36 | | qwertyasdfuiopghjkl joins |
| 10:01:08 | | benjins joins |
| 10:01:43 | | benjins is now authenticated as benjins |
| 10:19:48 | | Megame (Megame) joins |
| 11:11:25 | | systwi__ is now known as systwi |
| 11:59:23 | | wyatt8750 joins |
| 11:59:42 | | wyatt8740 quits [Ping timeout: 252 seconds] |
| 12:10:53 | | eroc1990 quits [Client Quit] |
| 12:12:04 | | eroc1990 (eroc1990) joins |
| 12:47:00 | | march_happy quits [Ping timeout: 252 seconds] |
| 12:47:13 | | march_happy (march_happy) joins |
| 13:22:44 | | HP_Archivist (HP_Archivist) joins |
| 13:26:23 | | evan quits [Remote host closed the connection] |
| 13:26:23 | | jamesp quits [Remote host closed the connection] |
| 13:26:24 | | shreyasminocha quits [Remote host closed the connection] |
| 13:27:14 | | jamesp joins |
| 13:27:14 | | evan joins |
| 13:27:14 | | jamesp is now authenticated as jamesp |
| 13:27:14 | | jamesp quits [Changing host] |
| 13:27:14 | | jamesp (jamesp) joins |
| 13:27:39 | | shreyasminocha (shreyasminocha) joins |
| 13:53:54 | <Doranwen> | glad you're saving the Glencoe stuff - I've grabbed *tons* of their downloadable content (workbook pdfs and such) over the years, as some of it is super valuable to teachers, even when using a different company's books to teach from |
| 13:56:31 | <Doranwen> | thuban: ah yes, this is what I mean by the super useful downloadable stuff - I've used these workbooks personally in tutoring students at one point: http://glencoe.mheducation.com/sites/007873830x/student_view0/student_workbooks.html |
| 13:57:05 | <Doranwen> | a few years back I went hunting for every case like that I could find but that was only from browsing through the links they provided, wonder if there are more that weren't linked to there somehow |
| 14:17:14 | | Arcorann quits [Ping timeout: 265 seconds] |
| 14:23:10 | | march_happy quits [Ping timeout: 265 seconds] |
| 14:23:29 | | march_happy (march_happy) joins |
| 14:23:30 | | Megame quits [Client Quit] |
| 15:12:37 | | Shjosan (Shjosan) joins |
| 15:15:00 | | HP_Archivist quits [Client Quit] |
| 15:39:08 | <Ryz> | Weird, https://gamefaqs.gamespot.com/ started rendering all the URLs as 404s, and then finally it gave 503s to indicate it was down... oo; |
| 15:46:21 | | AramZS joins |
| 15:48:44 | <AramZS> | Hey folks @Chronotope on Twitter here. Wanted to note that I proposed adding The Believer's website to the 'Alive... but are they!' watchlist. They appear to have been purchased for their SEO juice by a sex toy company |
| 15:48:47 | <AramZS> | See: https://twitter.com/ST_Collective_/status/1523756595317927936 |
| 15:49:43 | <AramZS> | So while presumably they are well covered in the archives already, and the current owner is incentivized to keep them up, if there is a process for double checking their pages are indeed covered and archived it might be worthwhile to do. |
| 15:55:22 | <Ryz> | Hello AramZS, is this the one? https://believermag.com/ |
| 15:55:58 | | wyatt8750 quits [Ping timeout: 265 seconds] |
| 15:56:01 | <AramZS> | Yup, that's the site |
| 15:56:14 | | wyatt8740 joins |
| 15:57:49 | <Ryz> | I threw the website into ArchiveBot AramZS; just trying to figure out if the website has a Twitter account~ |
| 15:58:17 | <AramZS> | https://twitter.com/believermag |
| 15:58:22 | <AramZS> | Is their Twitter I think |
| 15:58:27 | <Ryz> | Grand o: |
| 15:58:32 | <Ryz> | Running it through ArchiveBot too~ |
| 15:58:55 | <Ryz> | AramZS, thanks for bringing it up to us, means a lot when websites like that could suddenly change how available their content is |
| 15:59:12 | <Ryz> | Please don't hesitate to come back and report websites, or even suggest websites that might be in danger |
| 15:59:35 | <AramZS> | Exactly and awesome. That's a relief. There is a lot of significant authorship who have contributed major essays over the years to that site and while it also has a print version I do believe there is significance to the online form and format and likely some online-only content. Will do! |
| 16:01:20 | <Ryz> | Gonna also archive the website from https://twitter.com/ST_Collective_ - being https://sextoycollective.com/ - just in case |
| 16:24:11 | | HP_Archivist (HP_Archivist) joins |
| 16:50:40 | <tech234a> | Might make sense to make sure iPod-related pages on Apple's site are archived, Apple appears to have announced that iPod Touches have been discontinued https://www.apple.com/newsroom/2022/05/the-music-lives-on/ |
| 17:19:12 | | HP_Archivist quits [Client Quit] |
| 17:39:50 | | sec^nd quits [Remote host closed the connection] |
| 17:41:03 | | sec^nd (second) joins |
| 18:28:34 | | Icyelut|3 quits [Ping timeout: 265 seconds] |
| 19:20:43 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
| 19:21:09 | | Craigle (Craigle) joins |
| 19:38:17 | | wyatt8750 joins |
| 19:38:18 | | wyatt8740 quits [Ping timeout: 265 seconds] |
| 19:57:59 | | Void0 (Void0) joins |
| 20:03:57 | <Void0> | Hey Ryz |
| 20:04:12 | <Ryz> | Hello Void0, what forum website are you trying to archive? |
| 20:04:18 | <Void0> | blackpearl.biz |
| 20:04:50 | <Void0> | piracy discussion/sharing forum. shutting down at the end of the month. i tried to back up what i could but didn't have much progress |
| 20:06:11 | <Ryz> | Unfortunately I can't toss it into ArchiveBot since the bot can't go through the website on its own; |
| 20:06:43 | <Ryz> | I recall some people might've been able to archive invite-only forums here, but unsure if they're present right now |
| 20:07:14 | <Void0> | ah okay, thanks and no worries! i'll try and lurk in here to see if anyone responds. |
| 20:07:52 | <Void0> | would it work if i invited you? |
| 20:11:38 | <Ryz> | Invite me? Uhh, I don't really have tools to manually archive websites on my computer; I just usually use ArchiveBot for archiving websites and links in general |
| 20:25:50 | <Void0> | ooh okay! thanks. so archivebox works for sites that are public? |
| 20:53:01 | <programmerq> | Void0: I wouldn't mind tinkering with trying to grab a backup. It wouldn't end up in web.archive.org if I just do the grab myself. |
| 21:01:02 | | Mateon1 quits [Remote host closed the connection] |
| 21:01:16 | | Mateon1 joins |
| 21:32:55 | | LeGoupil joins |
| 21:34:54 | | LeGoupil quits [Client Quit] |
| 21:37:15 | | LeGoupil joins |
| 21:40:43 | | LeGoupil quits [Client Quit] |
| 21:52:53 | | HP_Archivist (HP_Archivist) joins |
| 22:08:32 | <@arkiver> | Void0: feel free to PM me |
| 22:12:29 | | AramZS quits [Ping timeout: 265 seconds] |
| 22:31:26 | | BlueMaxima joins |
| 23:15:39 | | Void0 quits [Client Quit] |
| 23:16:37 | | HP_Archivist quits [Client Quit] |
| 23:19:52 | | HP_Archivist (HP_Archivist) joins |
| 23:29:56 | | phuzion quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 23:30:02 | | phuzion (phuzion) joins |
| 23:33:33 | | march_happy quits [Ping timeout: 265 seconds] |
| 23:34:17 | | march_happy (march_happy) joins |
| 23:42:21 | | Arcorann (Arcorann) joins |
| 23:46:28 | | HP_Archivist quits [Client Quit] |
| 23:50:00 | <TheTechRobo> | Void0: If you're reading logs, feel free to PM me with details |
| 23:51:57 | | march_happy quits [Ping timeout: 252 seconds] |
| 23:52:32 | | jacobk quits [Ping timeout: 265 seconds] |
| 23:52:48 | | march_happy (march_happy) joins |
| 23:57:22 | | march_happy quits [Ping timeout: 265 seconds] |
| 23:58:48 | | march_happy (march_happy) joins |
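
Note on thuban's 07:35 message about the Glencoe site identifiers being ISBN-10s: the last character of an ISBN-10 is a check character determined by the first nine digits, which is why it can be an "x". The sketch below is one possible way to use that in a scan, not thuban's actual method. It assumes every site identifier is a valid ISBN-10 (the log does not confirm this) and that the URL uses a lowercase "x", as in the three examples given. The function names are hypothetical.

```python
from typing import Iterable, Iterator


def isbn10_check_char(first9: str) -> str:
    """Return the ISBN-10 check character for a 9-digit prefix.

    The weighted sum over all ten positions (weights 10 down to 1)
    must be divisible by 11; a check value of 10 is written as 'x'.
    """
    if len(first9) != 9 or not first9.isdigit():
        raise ValueError("expected exactly nine decimal digits")
    total = sum((10 - i) * int(d) for i, d in enumerate(first9))
    check = (11 - total % 11) % 11
    return "x" if check == 10 else str(check)


def candidate_site_urls(prefixes: Iterable[str]) -> Iterator[str]:
    """Yield one candidate URL per 9-digit prefix (assumed URL pattern)."""
    for prefix in prefixes:
        yield (
            "http://glencoe.mheducation.com/sites/"
            f"{prefix}{isbn10_check_char(prefix)}/"
        )


if __name__ == "__main__":
    # Prefixes taken from the three example URLs in the log.
    for url in candidate_site_urls(["007895312", "007874637", "007873830"]):
        print(url)
```

If the assumption holds, each 9-digit prefix has exactly one valid final character, so a scan only needs one request per prefix instead of trying all eleven possible endings.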