00:04:17 | | toss quits [Client Quit] |
00:28:01 | | Emitewiki joins |
00:31:13 | | Emitewiki quits [Remote host closed the connection] |
01:07:20 | | wickedplayer494 quits [Ping timeout: 240 seconds] |
01:13:46 | | sec^nd quits [Remote host closed the connection] |
01:14:14 | | sec^nd (second) joins |
01:21:51 | | wickedplayer494 joins |
01:22:03 | | wickedplayer494 quits [Remote host closed the connection] |
01:24:25 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
01:26:59 | | wickedplayer494 joins |
01:27:08 | | wickedplayer494 is now authenticated as wickedplayer494 |
01:29:31 | | Wohlstand quits [Ping timeout: 272 seconds] |
01:33:37 | | nicolas17 quits [Quit: Konversation terminated!] |
01:39:01 | | MetaNova quits [Ping timeout: 272 seconds] |
01:40:55 | | riku quits [Ping timeout: 272 seconds] |
01:43:41 | | MetaNova (MetaNova) joins |
01:45:53 | | nicolas17 joins |
02:22:15 | | jacksonchen666 quits [Remote host closed the connection] |
02:22:49 | | jacksonchen666 (jacksonchen666) joins |
02:43:37 | | tzt quits [Ping timeout: 272 seconds] |
02:44:43 | | tzt (tzt) joins |
02:59:34 | | parfait (kdqep) joins |
04:12:46 | | lizitha joins |
04:14:32 | | lizitha quits [Remote host closed the connection] |
04:42:20 | | atphoenix quits [Ping timeout: 240 seconds] |
04:43:57 | | DogsRNice quits [Read error: Connection reset by peer] |
04:44:15 | | atphoenix (atphoenix) joins |
04:59:25 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:11:23 | | kiryu quits [Quit: kiryu] |
05:49:12 | | Wohlstand (Wohlstand) joins |
06:09:18 | <flashfire42> | I presume still no ETA on Hel1 fix? |
06:55:38 | | parfait quits [Client Quit] |
06:57:23 | <fireonlive> | +rss- Posthog is closing their Slack community in favor of forum: https://posthog.com/blog/slack-closure https://news.ycombinator.com/item?id=38987383 |
06:57:28 | <fireonlive> | finally, in the right direction |
07:14:58 | | c3manu quits [Remote host closed the connection] |
07:32:48 | | c3manu (c3manu) joins |
07:50:11 | | dentropy joins |
07:58:01 | | dentropy quits [Remote host closed the connection] |
08:01:27 | | aninternettroll_ (aninternettroll) joins |
08:01:50 | | aninternettroll quits [Ping timeout: 240 seconds] |
08:01:50 | | aninternettroll_ is now known as aninternettroll |
08:04:50 | | itachi1706 quits [Ping timeout: 240 seconds] |
08:06:01 | | neggles_ (neggles) joins |
08:06:50 | | neggles quits [Ping timeout: 240 seconds] |
08:06:52 | | neggles_ is now known as neggles |
08:07:32 | | itachi1706 (itachi1706) joins |
08:11:59 | | fireonlive quits [Client Quit] |
08:12:43 | | fireonlive (fireonlive) joins |
08:24:05 | | aninternettroll quits [Remote host closed the connection] |
08:26:37 | | aninternettroll (aninternettroll) joins |
08:37:08 | | riku (riku) joins |
08:47:47 | | systwi quits [Ping timeout: 272 seconds] |
08:55:16 | | systwi (systwi) joins |
09:06:30 | | hitgrr8 joins |
09:13:38 | | Wohlstand quits [Client Quit] |
10:00:01 | | Bleo18260 quits [Client Quit] |
10:01:24 | | Bleo18260 joins |
10:21:12 | | Island quits [Read error: Connection reset by peer] |
10:21:13 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
10:49:53 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
11:24:03 | | jacksonchen666 quits [Client Quit] |
11:32:20 | | jacksonchen666 (jacksonchen666) joins |
11:43:43 | | mls quits [Quit: leaving] |
12:45:17 | | Arcorann quits [Ping timeout: 272 seconds] |
13:40:44 | | itachi1706 quits [Client Quit] |
13:42:59 | | itachi1706 (itachi1706) joins |
14:10:48 | <h2ibot> | JacksonChen666 edited Deathwatch (+196, ambrosia.moe): https://wiki.archiveteam.org/?diff=51512&oldid=51510 |
14:21:12 | | lennier2_ joins |
14:23:50 | | lennier2 quits [Ping timeout: 240 seconds] |
14:28:33 | | lennier2 joins |
14:31:03 | | lennier2_ quits [Ping timeout: 272 seconds] |
14:36:26 | | lennier2_ joins |
14:41:06 | <Terbium> | Lots of Fediverse instances shutting down these days |
14:41:11 | | lennier2 quits [Ping timeout: 272 seconds] |
14:42:20 | | lennier2_ quits [Ping timeout: 240 seconds] |
14:46:01 | | lennier2_ joins |
14:49:28 | | danwellby quits [Quit: Watch out For sysops carrying carpet and quicklime] |
15:03:19 | | fangfufu_ quits [Read error: Connection reset by peer] |
15:35:14 | <c3manu> | !ig bgnzc3hv0l3ejju7m5khrc3g5 ^https?://www\.lostboysinteractive\.com.*/icon/icon |
15:35:18 | <c3manu> | -.- |
15:39:56 | | jacksonchen666 is now authenticated as * |
15:39:56 | | jacksonchen666 is now known as RJHacker86417 |
15:40:00 | | jacksonchen666 (jacksonchen666) joins |
15:43:36 | | RJHacker86417 quits [Ping timeout: 255 seconds] |
15:50:33 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
15:59:08 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
16:02:47 | | ymgve_ joins |
16:04:39 | | danwellby joins |
16:06:41 | | ymgve quits [Ping timeout: 272 seconds] |
16:07:45 | | danwellby quits [Client Quit] |
16:19:20 | | lflare quits [Ping timeout: 240 seconds] |
16:45:40 | | jacksonchen666 quits [Client Quit] |
16:45:40 | | ctag quits [Read error: Connection reset by peer] |
16:46:03 | | ctag (ctag) joins |
16:59:47 | | fangfufu joins |
17:00:07 | | fangfufu is now authenticated as fangfufu |
17:02:12 | <@JAA> | https://dcb18d6mfegct.cloudfront.net/ is fully archived, for the record. |
17:04:51 | <@JAA> | Unique size turned out to only be 244 GB, 162 GiB with compression. |
17:06:18 | <@JAA> | AWS S3 sometimes does a weird thing where the ETag differs even though the contents are identical. I've seen that before. |
17:21:58 | | danwellby joins |
17:57:22 | | lunik173 quits [Client Quit] |
17:59:27 | | lunik173 joins |
17:59:43 | | lflare (lflare) joins |
18:00:54 | | DogsRNice joins |
18:01:17 | | lunik173 quits [Client Quit] |
18:02:59 | | lunik173 joins |
18:15:52 | | lunik173 quits [Client Quit] |
18:16:18 | | lunik173 joins |
18:38:31 | <nicolas17> | JAA: wait what were the ETags? did they have dashes? |
18:39:16 | <@JAA> | nicolas17: I didn't pay attention, but probably, yes. They almost always do with AWS S3. |
18:39:31 | <nicolas17> | files uploaded in 1 part have the file md5 as ETag |
18:40:44 | <nicolas17> | files uploaded in multiple parts have something like the md5 of the concatenation of the md5 of each part, so the ETag depends on how the upload tool decided to split the multipart upload |
18:41:37 | <@JAA> | I'm sure I've seen ETags of the form 'md5-nnn' with some decimal number appended. |
18:41:46 | <nicolas17> | yes, the decimal number is the number of parts |
18:41:53 | <@JAA> | Right |
18:42:24 | <@JAA> | Looks like most files have just the hash in this bucket. |
18:43:24 | <nicolas17> | so if there's no dash, it's an MD5 of the file, if there is a dash, you can't reliably calculate it yourself (and it could be different on files with the same contents) because you don't know how exactly it was split into parts |
18:43:55 | <@JAA> | I see. |
18:44:50 | <@JAA> | 355721 of 372995 files have just the hash. |
18:45:22 | <nicolas17> | recently they added x-amz-meta-digest-sh1 and x-amz-meta-digest-sha256 headers, but I think that depends on the uploader sending it in the first place |
18:45:28 | <@JAA> | Split uploads would be skewed towards larger files, so I guess it's mostly useless. |
18:47:18 | <nicolas17> | like if you upload a file with x-amz-meta-digest-sh1 set to the sha1 of the whole file, S3 will validate that it actually matches the content when the upload is done, and will send the header back when you download the file |
18:48:00 | <@JAA> | Right, but not in the bucket listing. |
18:50:52 | <nicolas17> | looks like the bucket is also open btw https://omniverse-content-production.s3.amazonaws.com/ |
18:55:57 | <@JAA> | That's the bucket behind that Cloudfront domain. |
18:56:16 | <nicolas17> | yep |
18:56:44 | <nicolas17> | but sometimes the cloudfront domain shows everything (including bucket listing) while the bucket itself is inaccessible, because the bucket is configured to only accept requests from cloudfront |
18:56:49 | <nicolas17> | seems this is not the case, the bucket is just open |
18:57:44 | <@JAA> | Yeah, and you can't paginate through Cloudfront. |
18:57:50 | <@JAA> | (In this case, anyway.) |
18:57:55 | <nicolas17> | oh |
18:57:57 | <nicolas17> | how annoying |
19:05:08 | <nicolas17> | any idea where I can find what Samsung device models mean? I have a long list of models like SM-F9460 and SM-G988B |
19:05:18 | <nicolas17> | searching SM-G988B on wikipedia I got redirected to "Samsung Galaxy S20" and the infobox says "SM-G988x (S20 Ultra LTE/5G) (Last letter varies by carrier and international models)" |
19:05:38 | <nicolas17> | while for "SM-F9460" I got nothing; and either way searching wikipedia won't scale for 700+ strings |
19:06:21 | <@JAA> | GSMArena is probably what I'd try, but not sure they have a nice list for it. |
19:06:31 | <@JAA> | > Galaxy Z Fold5 (SM-F9460) |
19:07:03 | <katia> | https://grep.app/search?q=SM-F9460 |
19:07:28 | <katia> | https://github.com/KHwang9883/MobileModels/blob/master/scripts/models.csv |
19:07:52 | <katia> | 296 samsungs there |
19:09:17 | <nicolas17> | https://transfer.archivete.am/inline/7HAJZ/opensource-samsung-list.ndjson |
19:21:29 | | Wohlstand (Wohlstand) joins |
19:47:33 | | HP_Archivist quits [Quit: Leaving] |
19:55:14 | | katia quits [Remote host closed the connection] |
19:56:01 | | katia (katia) joins |
19:56:57 | | katia quits [Remote host closed the connection] |
19:57:47 | | katia (katia) joins |
19:58:32 | | katia quits [Remote host closed the connection] |
20:00:10 | | katia (katia) joins |
20:47:38 | <Vokun> | Can the runescape forum be put into ab at least to grab what it'll grab? I don't want to annoy, but I also don't want this to be forgotten about |
20:55:32 | <@JAA> | Once it's read-only. |
21:00:10 | <@JAA> | There'll be two months between that and the shutdown, which should at least grab some meaningful part of the forums. |
21:01:36 | | BlueMaxima joins |
21:02:05 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
21:03:11 | <aninternettroll> | Is it resonable to compile a big list of all URLs from a site that is not gonna shut down any time soon and post it in #//? |
21:03:50 | <fireonlive> | a single site? no |
21:03:51 | <@JAA> | If you mean collecting outgoing links from a site, sure. |
21:03:57 | <@JAA> | If you mean URLs for the site itself, never. |
21:04:07 | <@JAA> | #// is not suitable for that kind of thing. |
21:04:39 | <@JAA> | Regardless of whether a site is shutting down or not, that is. |
21:05:13 | <nicolas17> | sounds like archivebot material |
21:05:33 | <aninternettroll> | What's an "outgoing link"? |
21:06:29 | | Island joins |
21:06:46 | <@JAA> | If https://example.org/ links to https://foobar.invalid/, that is an outgoing link or offsite link. |
21:07:51 | <nicolas17> | if you crawl all posts in a forum, the links on forum posts pointing to websites elsewhere are outlinks and maybe could be sent to #//, the URLs of the forum posts themselves are not |
21:08:07 | <@JAA> | Which site is this about? |
21:08:28 | <aninternettroll> | A goverment website, nothing important. https://lanekassen.no |
21:08:45 | <aninternettroll> | It offers an API with all of it's links, which I thought might be relevant |
21:09:04 | | datechnoman (datechnoman) joins |
21:09:08 | <@JAA> | That sounds like a potential ArchiveBot job, yeah. |
21:09:47 | <aninternettroll> | (link to the API https://lanekassen.no/api/mt1502/sitemap/lkno and static assets if relevant https://lanekassen.no/dist/manifest.json) |
21:10:28 | <@JAA> | Interesting |
21:11:20 | <@JAA> | I'll run a recursive crawl and then a separate job for anything from those lists that wasn't grabbed. |
21:11:43 | <aninternettroll> | Worth noting a lot of links are not accessible by normal users (anything with /dinesider/ really) since it's for customers |
21:12:23 | <@JAA> | Do you know what 'lkno' means there? |
21:12:33 | <aninternettroll> | abbreviation for LaneKassen.NO |
21:12:39 | <@JAA> | Ah, right, duh. |
21:12:48 | <aninternettroll> | And dine sider translates to "Your pages" |
21:13:30 | <aninternettroll> | Also API docs are here https://lanekassen.no/api/mt1502/swagger/v1/swagger.json but don't offer that much guidance into how to use it. Just a standard CMS really |
21:13:35 | <@JAA> | I was thinking something with the language, but it covers also English pages. :-) |
21:15:40 | <@JAA> | Looks like https://lanekassen.no/api/mt1502/SiteMap/LKNO is the proper capitalisation per the Swagger doc. |
21:15:49 | <@JAA> | (But the server doesn't care.) |
21:16:26 | <@JAA> | It's been thrown into AB. |
21:16:38 | <aninternettroll> | Thanks! |
21:17:02 | <aninternettroll> | Some new URLs should be added soon, so I'll come back when that happens |
21:18:11 | <h2ibot> | Ufarwisan created Talk:GitHub (+1803, Created page with "= Github mirrors = There areā¦): https://wiki.archiveteam.org/?title=Talk%3AGitHub |
21:18:12 | <h2ibot> | Ufarwisan edited GitHub (+145, /* External links */): https://wiki.archiveteam.org/?diff=51514&oldid=50956 |
21:18:58 | <@JAA> | Sounds good. I've grabbed the API endpoint, so we can diff it then I guess. |
21:19:11 | <h2ibot> | JustAnotherArchivist changed the user rights of User:Ufarwisan |
21:27:09 | | hackbug quits [Ping timeout: 272 seconds] |
21:28:55 | | magmaus3 quits [Quit: :3] |
21:33:40 | | hackbug (hackbug) joins |
21:56:38 | | c3manu quits [Remote host closed the connection] |
21:57:50 | | xkey quits [Ping timeout: 240 seconds] |
21:59:09 | | xkey (xkey) joins |
22:05:09 | | xkey quits [Ping timeout: 272 seconds] |
22:05:21 | | xkey (xkey) joins |
22:06:21 | <h2ibot> | Ufarwisan edited Talk:GitHub (+110): https://wiki.archiveteam.org/?diff=51515&oldid=51513 |
22:10:50 | | xkey quits [Ping timeout: 240 seconds] |
22:11:41 | | xkey (xkey) joins |
22:12:22 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
22:23:03 | | jasons quits [Quit: The Lounge - https://thelounge.chat] |
22:35:20 | | xkey quits [Ping timeout: 240 seconds] |
22:38:13 | | xkey (xkey) joins |
22:38:54 | | jasons (jasons) joins |
22:57:41 | | xkey quits [Client Quit] |
23:37:20 | | jasons quits [Ping timeout: 240 seconds] |
23:48:16 | | Hackerpcs quits [Quit: Hackerpcs] |
23:50:58 | | Hackerpcs (Hackerpcs) joins |