00:04:17toss quits [Client Quit]
00:28:01Emitewiki joins
00:31:13Emitewiki quits [Remote host closed the connection]
01:07:20wickedplayer494 quits [Ping timeout: 240 seconds]
01:13:46sec^nd quits [Remote host closed the connection]
01:14:14sec^nd (second) joins
01:21:51wickedplayer494 joins
01:22:03wickedplayer494 quits [Remote host closed the connection]
01:24:25qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
01:26:59wickedplayer494 joins
01:29:31Wohlstand quits [Ping timeout: 272 seconds]
01:33:37nicolas17 quits [Quit: Konversation terminated!]
01:39:01MetaNova quits [Ping timeout: 272 seconds]
01:40:55riku quits [Ping timeout: 272 seconds]
01:43:41MetaNova (MetaNova) joins
01:45:53nicolas17 joins
02:22:15jacksonchen666 quits [Remote host closed the connection]
02:22:49jacksonchen666 (jacksonchen666) joins
02:43:37tzt quits [Ping timeout: 272 seconds]
02:44:43tzt (tzt) joins
02:59:34parfait (kdqep) joins
04:12:46lizitha joins
04:14:32lizitha quits [Remote host closed the connection]
04:42:20atphoenix quits [Ping timeout: 240 seconds]
04:43:57DogsRNice quits [Read error: Connection reset by peer]
04:44:15atphoenix (atphoenix) joins
04:59:25BlueMaxima quits [Read error: Connection reset by peer]
05:11:23kiryu quits [Quit: kiryu]
05:49:12Wohlstand (Wohlstand) joins
06:09:18<flashfire42>I presume still no ETA on Hel1 fix?
06:55:38parfait quits [Client Quit]
06:57:23<fireonlive>+rss- Posthog is closing their Slack community in favor of forum: https://posthog.com/blog/slack-closure https://news.ycombinator.com/item?id=38987383
06:57:28<fireonlive>finally, in the right direction
07:14:58c3manu quits [Remote host closed the connection]
07:32:48c3manu (c3manu) joins
07:50:11dentropy joins
07:58:01dentropy quits [Remote host closed the connection]
08:01:27aninternettroll_ (aninternettroll) joins
08:01:50aninternettroll quits [Ping timeout: 240 seconds]
08:01:50aninternettroll_ is now known as aninternettroll
08:04:50itachi1706 quits [Ping timeout: 240 seconds]
08:06:01neggles_ (neggles) joins
08:06:50neggles quits [Ping timeout: 240 seconds]
08:06:52neggles_ is now known as neggles
08:07:32itachi1706 (itachi1706) joins
08:11:59fireonlive quits [Client Quit]
08:12:43fireonlive (fireonlive) joins
08:24:05aninternettroll quits [Remote host closed the connection]
08:26:37aninternettroll (aninternettroll) joins
08:37:08riku (riku) joins
08:47:47systwi quits [Ping timeout: 272 seconds]
08:55:16systwi (systwi) joins
09:06:30hitgrr8 joins
09:13:38Wohlstand quits [Client Quit]
10:00:01Bleo18260 quits [Client Quit]
10:01:24Bleo18260 joins
10:21:12Island quits [Read error: Connection reset by peer]
10:21:13qwertyasdfuiopghjkl quits [Remote host closed the connection]
10:49:53qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
11:24:03jacksonchen666 quits [Client Quit]
11:32:20jacksonchen666 (jacksonchen666) joins
11:43:43mls quits [Quit: leaving]
12:45:17Arcorann quits [Ping timeout: 272 seconds]
13:40:44itachi1706 quits [Client Quit]
13:42:59itachi1706 (itachi1706) joins
14:10:48<h2ibot>JacksonChen666 edited Deathwatch (+196, ambrosia.moe): https://wiki.archiveteam.org/?diff=51512&oldid=51510
14:21:12lennier2_ joins
14:23:50lennier2 quits [Ping timeout: 240 seconds]
14:28:33lennier2 joins
14:31:03lennier2_ quits [Ping timeout: 272 seconds]
14:36:26lennier2_ joins
14:41:06<Terbium>Lots of Fediverse instances shutting down these days
14:41:11lennier2 quits [Ping timeout: 272 seconds]
14:42:20lennier2_ quits [Ping timeout: 240 seconds]
14:46:01lennier2_ joins
14:49:28danwellby quits [Quit: Watch out For sysops carrying carpet and quicklime]
15:03:19fangfufu_ quits [Read error: Connection reset by peer]
15:35:14<c3manu>!ig bgnzc3hv0l3ejju7m5khrc3g5 ^https?://www\.lostboysinteractive\.com.*/icon/icon
15:35:18<c3manu>-.-
15:39:56jacksonchen666 is now known as RJHacker86417
15:40:00jacksonchen666 (jacksonchen666) joins
15:43:36RJHacker86417 quits [Ping timeout: 255 seconds]
15:50:33qwertyasdfuiopghjkl quits [Remote host closed the connection]
15:59:08qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
16:02:47ymgve_ joins
16:04:39danwellby joins
16:06:41ymgve quits [Ping timeout: 272 seconds]
16:07:45danwellby quits [Client Quit]
16:19:20lflare quits [Ping timeout: 240 seconds]
16:45:40jacksonchen666 quits [Client Quit]
16:45:40ctag quits [Read error: Connection reset by peer]
16:46:03ctag (ctag) joins
16:59:47fangfufu joins
17:02:12<@JAA>https://dcb18d6mfegct.cloudfront.net/ is fully archived, for the record.
17:04:51<@JAA>Unique size turned out to only be 244 GB, 162 GiB with compression.
17:06:18<@JAA>AWS S3 sometimes does a weird thing where the ETag differs even though the contents are identical. I've seen that before.
17:21:58danwellby joins
17:57:22lunik173 quits [Client Quit]
17:59:27lunik173 joins
17:59:43lflare (lflare) joins
18:00:54DogsRNice joins
18:01:17lunik173 quits [Client Quit]
18:02:59lunik173 joins
18:15:52lunik173 quits [Client Quit]
18:16:18lunik173 joins
18:38:31<nicolas17>JAA: wait what were the ETags? did they have dashes?
18:39:16<@JAA>nicolas17: I didn't pay attention, but probably, yes. They almost always do with AWS S3.
18:39:31<nicolas17>files uploaded in 1 part have the file md5 as ETag
18:40:44<nicolas17>files uploaded in multiple parts have something like the md5 of the concatenation of the md5 of each part, so the ETag depends on how the upload tool decided to split the multipart upload
18:41:37<@JAA>I'm sure I've seen ETags of the form 'md5-nnn' with some decimal number appended.
18:41:46<nicolas17>yes, the decimal number is the number of parts
18:41:53<@JAA>Right
18:42:24<@JAA>Looks like most files have just the hash in this bucket.
18:43:24<nicolas17>so if there's no dash, it's an MD5 of the file, if there is a dash, you can't reliably calculate it yourself (and it could be different on files with the same contents) because you don't know how exactly it was split into parts
18:43:55<@JAA>I see.
18:44:50<@JAA>355721 of 372995 files have just the hash.
18:45:22<nicolas17>recently they added x-amz-meta-digest-sh1 and x-amz-meta-digest-sha256 headers, but I think that depends on the uploader sending it in the first place
18:45:28<@JAA>Split uploads would be skewed towards larger files, so I guess it's mostly useless.
18:47:18<nicolas17>like if you upload a file with x-amz-meta-digest-sh1 set to the sha1 of the whole file, S3 will validate that it actually matches the content when the upload is done, and will send the header back when you download the file
18:48:00<@JAA>Right, but not in the bucket listing.
18:50:52<nicolas17>looks like the bucket is also open btw https://omniverse-content-production.s3.amazonaws.com/
18:55:57<@JAA>That's the bucket behind that Cloudfront domain.
18:56:16<nicolas17>yep
18:56:44<nicolas17>but sometimes the cloudfront domain shows everything (including bucket listing) while the bucket itself is inaccessible, because the bucket is configured to only accept requests from cloudfront
18:56:49<nicolas17>seems this is not the case, the bucket is just open
18:57:44<@JAA>Yeah, and you can't paginate through Cloudfront.
18:57:50<@JAA>(In this case, anyway.)
18:57:55<nicolas17>oh
18:57:57<nicolas17>how annoying
19:05:08<nicolas17>any idea where I can find what Samsung device models mean? I have a long list of models like SM-F9460 and SM-G988B
19:05:18<nicolas17>searching SM-G988B on wikipedia I got redirected to "Samsung Galaxy S20" and the infobox says "SM-G988x (S20 Ultra LTE/5G) (Last letter varies by carrier and international models)"
19:05:38<nicolas17>while for "SM-F9460" I got nothing; and either way searching wikipedia won't scale for 700+ strings
19:06:21<@JAA>GSMArena is probably what I'd try, but not sure they have a nice list for it.
19:06:31<@JAA>> Galaxy Z Fold5 (SM-F9460)
19:07:03<katia>https://grep.app/search?q=SM-F9460
19:07:28<katia>https://github.com/KHwang9883/MobileModels/blob/master/scripts/models.csv
19:07:52<katia>296 samsungs there
19:09:17<nicolas17>https://transfer.archivete.am/inline/7HAJZ/opensource-samsung-list.ndjson
19:21:29Wohlstand (Wohlstand) joins
19:47:33HP_Archivist quits [Quit: Leaving]
19:55:14katia quits [Remote host closed the connection]
19:56:01katia (katia) joins
19:56:57katia quits [Remote host closed the connection]
19:57:47katia (katia) joins
19:58:32katia quits [Remote host closed the connection]
20:00:10katia (katia) joins
20:47:38<Vokun>Can the runescape forum be put into ab at least to grab what it'll grab? I don't want to annoy, but I also don't want this to be forgotten about
20:55:32<@JAA>Once it's read-only.
21:00:10<@JAA>There'll be two months between that and the shutdown, which should at least grab some meaningful part of the forums.
21:01:36BlueMaxima joins
21:02:05datechnoman quits [Quit: The Lounge - https://thelounge.chat]
21:03:11<aninternettroll>Is it resonable to compile a big list of all URLs from a site that is not gonna shut down any time soon and post it in #//?
21:03:50<fireonlive>a single site? no
21:03:51<@JAA>If you mean collecting outgoing links from a site, sure.
21:03:57<@JAA>If you mean URLs for the site itself, never.
21:04:07<@JAA>#// is not suitable for that kind of thing.
21:04:39<@JAA>Regardless of whether a site is shutting down or not, that is.
21:05:13<nicolas17>sounds like archivebot material
21:05:33<aninternettroll>What's an "outgoing link"?
21:06:29Island joins
21:06:46<@JAA>If https://example.org/ links to https://foobar.invalid/, that is an outgoing link or offsite link.
21:07:51<nicolas17>if you crawl all posts in a forum, the links on forum posts pointing to websites elsewhere are outlinks and maybe could be sent to #//, the URLs of the forum posts themselves are not
21:08:07<@JAA>Which site is this about?
21:08:28<aninternettroll>A goverment website, nothing important. https://lanekassen.no
21:08:45<aninternettroll>It offers an API with all of it's links, which I thought might be relevant
21:09:04datechnoman (datechnoman) joins
21:09:08<@JAA>That sounds like a potential ArchiveBot job, yeah.
21:09:47<aninternettroll>(link to the API https://lanekassen.no/api/mt1502/sitemap/lkno and static assets if relevant https://lanekassen.no/dist/manifest.json)
21:10:28<@JAA>Interesting
21:11:20<@JAA>I'll run a recursive crawl and then a separate job for anything from those lists that wasn't grabbed.
21:11:43<aninternettroll>Worth noting a lot of links are not accessible by normal users (anything with /dinesider/ really) since it's for customers
21:12:23<@JAA>Do you know what 'lkno' means there?
21:12:33<aninternettroll>abbreviation for LaneKassen.NO
21:12:39<@JAA>Ah, right, duh.
21:12:48<aninternettroll>And dine sider translates to "Your pages"
21:13:30<aninternettroll>Also API docs are here https://lanekassen.no/api/mt1502/swagger/v1/swagger.json but don't offer that much guidance into how to use it. Just a standard CMS really
21:13:35<@JAA>I was thinking something with the language, but it covers also English pages. :-)
21:15:40<@JAA>Looks like https://lanekassen.no/api/mt1502/SiteMap/LKNO is the proper capitalisation per the Swagger doc.
21:15:49<@JAA>(But the server doesn't care.)
21:16:26<@JAA>It's been thrown into AB.
21:16:38<aninternettroll>Thanks!
21:17:02<aninternettroll>Some new URLs should be added soon, so I'll come back when that happens
21:18:11<h2ibot>Ufarwisan created Talk:GitHub (+1803, Created page with "= Github mirrors = There are…): https://wiki.archiveteam.org/?title=Talk%3AGitHub
21:18:12<h2ibot>Ufarwisan edited GitHub (+145, /* External links */): https://wiki.archiveteam.org/?diff=51514&oldid=50956
21:18:58<@JAA>Sounds good. I've grabbed the API endpoint, so we can diff it then I guess.
21:19:11<h2ibot>JustAnotherArchivist changed the user rights of User:Ufarwisan
21:27:09hackbug quits [Ping timeout: 272 seconds]
21:28:55magmaus3 quits [Quit: :3]
21:33:40hackbug (hackbug) joins
21:56:38c3manu quits [Remote host closed the connection]
21:57:50xkey quits [Ping timeout: 240 seconds]
21:59:09xkey (xkey) joins
22:05:09xkey quits [Ping timeout: 272 seconds]
22:05:21xkey (xkey) joins
22:06:21<h2ibot>Ufarwisan edited Talk:GitHub (+110): https://wiki.archiveteam.org/?diff=51515&oldid=51513
22:10:50xkey quits [Ping timeout: 240 seconds]
22:11:41xkey (xkey) joins
22:12:22qwertyasdfuiopghjkl quits [Remote host closed the connection]
22:23:03jasons quits [Quit: The Lounge - https://thelounge.chat]
22:35:20xkey quits [Ping timeout: 240 seconds]
22:38:13xkey (xkey) joins
22:38:54jasons (jasons) joins
22:57:41xkey quits [Client Quit]
23:37:20jasons quits [Ping timeout: 240 seconds]
23:48:16Hackerpcs quits [Quit: Hackerpcs]
23:50:58Hackerpcs (Hackerpcs) joins