00:02:12 | | cascode quits [Read error: Connection reset by peer] |
00:02:26 | | cascode joins |
00:06:25 | | Wohlstand quits [Quit: Wohlstand] |
00:19:49 | | etnguyen03 (etnguyen03) joins |
00:28:11 | | Webuser369573 joins |
00:28:20 | | Webuser369573 quits [Client Quit] |
00:37:39 | | dabs quits [Quit: Leaving] |
00:40:40 | | nine quits [Client Quit] |
00:40:52 | | nine joins |
00:40:52 | | nine is now authenticated as nine |
00:40:52 | | nine quits [Changing host] |
00:40:52 | | nine (nine) joins |
00:46:55 | | xkey quits [Quit: WeeChat 4.6.3] |
00:47:07 | | xkey (xkey) joins |
00:57:59 | | hexagonwin joins |
00:59:39 | | hexagonwin_ quits [Ping timeout: 260 seconds] |
01:00:31 | <xarph> | Can an s3 > archivebot boffin check that assets.shopcats.app s3 bucket now has file listings on? |
01:01:12 | <xarph> | The site operator threw the switch for us but didn't have time to check and I'm fighting the costco eastern front |
01:01:30 | <pabs> | loading that page in a browser gives a file listing yeah |
01:02:32 | <pabs> | hmm, little-things s3-bucket-list crashes with: Marker loop (empty marker in response despite providing one) |
01:04:30 | <pokechu22> | Often when it's on a subdomain, ?marker= won't work on the subdomain but will work on the original bucket at https://shopcats-prod-bucket.s3.amazonaws.com/ |
01:05:00 | <pokechu22> | in that case, I generate a list using the s3.amazonaws.com URL but then modify it to be on their subdomain (other than the marker URLs) |
01:05:52 | <pokechu22> | compare https://assets.shopcats.app/?marker=protected/us-east-1:017b3273-680c-4e68-82ab-d40508ca7b49/8abac6b2-741e-4993-b9f8-1cb835250b63ms.jpg and https://shopcats-prod-bucket.s3.amazonaws.com/?marker=protected/us-east-1:017b3273-680c-4e68-82ab-d40508ca7b49/8abac6b2-741e-4993-b9f8-1cb835250b63ms.jpg |
01:07:02 | <pabs> | https://shopcats-prod-bucket.s3.amazonaws.com/ indeed doesn't crash |
01:07:16 | <pokechu22> | I'm currently listing it and will run it |
01:12:43 | <pokechu22> | https://transfer.archivete.am/BG5eS/assets.shopcats.app_shopcats-prod-bucket.s3.amazonaws.com_urls.txt.zst |
01:14:28 | | nicolas17 clicks a random URL and goes "awwww" already |
01:19:16 | | Guest58 joins |
01:27:23 | <xarph> | wahoo 17,000 bodega cat images |
01:33:45 | <nicolas17> | xarph: why do I see 31462? |
01:35:54 | <xarph> | because I misread a field |
01:36:24 | <nicolas17> | hm was the website shut down already? |
01:36:27 | <xarph> | yes |
01:36:44 | <xarph> | the frontend went down this morning |
01:36:51 | <xarph> | the owner is leaving the s3 bucket up for us to mirror |
01:37:04 | <xarph> | still negotiating about the postgres database |
01:40:56 | <nicolas17> | hm what's this? user profile pictures? https://assets.shopcats.app/protected/us-east-1%3Aabb8265f-b3ec-4429-846d-1b31d07e73fe/3f678254-3980-4d6c-8453-c0df50692a22l.jpg https://assets.shopcats.app/protected/us-east-1%3A2af9aa03-8217-4541-b8e3-eb8b6f452cd0/517d7cc8-4615-4e9a-b055-cd70abc3a772l.jpg |
01:41:45 | <xarph> | yeah it did have profile pics |
01:42:01 | <xarph> | s3cmd sync shows 160648 items |
01:42:40 | <nicolas17> | yeah every image has several downscaled versions |
01:42:47 | <xarph> | I was about to write that |
01:43:01 | <xarph> | idgaf I'll go to the best buy down the street and buy a giant hard drive if I need it |
01:43:23 | <xarph> | not one cat image shall die on our watch |
01:44:27 | <nicolas17> | xarph: http://archivebot.com/?initialFilter=shopcats |
01:44:55 | | nicolas17 is now authenticated as nicolas17 |
01:45:42 | | malteeez joins |
01:49:01 | | GradientCat quits [Client Quit] |
02:13:31 | | malteeez quits [Client Quit] |
02:22:18 | | BornOn420 quits [Remote host closed the connection] |
02:25:55 | | nicolas17 quits [Quit: Konversation terminated!] |
02:26:22 | | nicolas17 joins |
02:26:45 | | nicolas17 is now authenticated as nicolas17 |
02:29:34 | <anonymoususer852> | There's a typo on the Current_Projects page for the AT wiki. I've made applied them, but am awaiting moderation. If anyone has access to moderation, please also undo the Talk page that I previously contributed to main page, as I have found the correct place. |
02:44:19 | | cuphead2527480 quits [Quit: Connection closed for inactivity] |
02:55:34 | | cascode quits [Ping timeout: 240 seconds] |
02:56:24 | | cascode joins |
03:03:01 | | etnguyen03 quits [Client Quit] |
03:06:06 | | cascode quits [Read error: Connection reset by peer] |
03:07:07 | | cascode joins |
03:09:21 | | etnguyen03 (etnguyen03) joins |
03:14:27 | | Webuser335071 joins |
03:15:04 | | Webuser335071 quits [Client Quit] |
03:40:33 | | etnguyen03 quits [Remote host closed the connection] |
03:44:44 | | @imer quits [Ping timeout: 260 seconds] |
03:57:44 | | imer (imer) joins |
03:57:44 | | @ChanServ sets mode: +o imer |
05:02:32 | | egallager quits [Quit: This computer has gone to sleep] |
05:23:22 | | egallager joins |
05:47:54 | | awauwa (awauwa) joins |
06:54:33 | | Island quits [Read error: Connection reset by peer] |
07:35:09 | | nothere quits [Ping timeout: 260 seconds] |
07:43:34 | | Radzig quits [Ping timeout: 240 seconds] |
08:15:39 | | Radzig joins |
08:16:29 | | nothere_ joins |
08:25:13 | | Webuser105032 joins |
08:25:17 | | Webuser105032 quits [Client Quit] |
08:37:01 | | archiveDrill quits [Quit: The Lounge - https://thelounge.chat] |
08:56:51 | | Webuser581666 joins |
08:57:00 | | Webuser581666 quits [Client Quit] |
09:23:50 | | Dada joins |
09:24:38 | <@OrIdow6> | On Itch.io embeds, I'm thinking of either |
09:24:49 | <@OrIdow6> | - THrowing it all into SPN (probably would encounter IA difficulties) |
09:25:17 | <@OrIdow6> | - Throwing it into JSEater and pushing to have those accepted to the WBM |
09:25:53 | <@OrIdow6> | - Setting up some little setup with a headless browser, maybe just running on my own/a few volunteers' machines, which will record the URLs of the resources, then we'll throw those into AB or somewhere |
09:26:02 | <@OrIdow6> | - Considering using API keys |
09:26:12 | <@OrIdow6> | Third is what I'm leaning towards |
09:28:54 | | archiveDrill joins |
10:02:54 | | Radzig quits [Ping timeout: 240 seconds] |
10:10:38 | | Radzig joins |
10:15:10 | | T31M quits [Quit: ZNC - https://znc.in] |
10:15:32 | | T31M joins |
10:16:23 | | T31M is now authenticated as T31M |
10:22:34 | | chrismrtn quits [Ping timeout: 260 seconds] |
10:23:09 | | jspiros quits [Ping timeout: 260 seconds] |
10:43:22 | | Wohlstand (Wohlstand) joins |
11:00:04 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:47 | | Bleo182600722719623455222 joins |
11:29:08 | <BlankEclair> | i _think_ wikiforge is now dead? see https://wm-bot.wmcloud.org/browser/index.php?start=07%2F27%2F2025&end=07%2F27%2F2025&display=%23miraheze-offtopic (convo starts at 2025-07-27 02:42:48) |
12:55:23 | | etnguyen03 (etnguyen03) joins |
13:25:47 | | GradientCat (GradientCat) joins |
13:35:06 | | Wohlstand quits [Remote host closed the connection] |
13:40:28 | | Wohlstand (Wohlstand) joins |
13:43:17 | | etnguyen03 quits [Client Quit] |
13:46:26 | | etnguyen03 (etnguyen03) joins |
14:02:32 | | jspiros (jspiros) joins |
14:04:27 | | chrismrtn (chrismrtn) joins |
15:07:01 | | etnguyen03 quits [Client Quit] |
15:45:08 | <h2ibot> | Dragon789 created Talk:Banned from youtube (+519, Youtube Project "Your're likely banned"): https://wiki.archiveteam.org/?title=Talk%3ABanned%20from%20youtube |
15:45:09 | <h2ibot> | JustAGrook edited Alive... OR ARE THEY (+210, /* Endangered */): https://wiki.archiveteam.org/?diff=56599&oldid=55903 |
15:45:10 | <h2ibot> | KamafaDelgato edited Itch.io (+4): https://wiki.archiveteam.org/?diff=56600&oldid=56593 |
15:45:11 | <h2ibot> | R74n edited Deathwatch (+153, /* 2025 */ Note about Firebase Dynamic Links…): https://wiki.archiveteam.org/?diff=56602&oldid=56585 |
15:45:12 | <h2ibot> | Legowerewolf created Talk:Goo.gl (+214, /* Memory leak? */ new section): https://wiki.archiveteam.org/?title=Talk%3AGoo.gl |
15:45:13 | <h2ibot> | Anonymoususer852 edited Talk:Main Page (+228, /* Typo under the section Meta */ new section): https://wiki.archiveteam.org/?diff=56604&oldid=51271 |
15:45:14 | <h2ibot> | Anonymoususer852 edited Main Page/Current Projects (+0, /* Medium-term projects */ Fixed typo with Meta…): https://wiki.archiveteam.org/?diff=56605&oldid=56493 |
15:46:09 | <h2ibot> | Arkiver changed the user rights of User:Anonymoususer852 |
15:46:10 | <h2ibot> | Arkiver changed the user rights of User:R74n |
16:03:49 | <@arkiver> | OrIdow6: on the three options. 1. i would not trust SPN too much, especially when using some form of automatic submission. 2. JSEater would work i guess. 3. this would work as well, unless certain headers are needed, information that is not preserved by listing URLs alone. 4. what are these keys for, are they a type of login? |
16:09:37 | | Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…] |
16:10:01 | | Guest58 joins |
16:10:18 | | Guest58 quits [Client Quit] |
16:10:49 | <@OrIdow6> | arkiver: Yes, keys derived from an account. But technically the only way to be 100% sure we have everything, as the embeds in question are just little static sites hosted by itch.io, and technically the JS embedded in them can load resources dynamically (though it *looks* like most load all the resources up front) |
16:13:23 | <@OrIdow6> | Anyway like I say I am going with 3 |
16:13:32 | <@OrIdow6> | Probably |
16:15:35 | | grill (grill) joins |
16:16:39 | | jspiros quits [Client Quit] |
16:17:05 | | linuxgemini quits [Quit: Ping timeout (120 seconds)] |
16:17:55 | | jspiros (jspiros) joins |
16:21:34 | | grill quits [Ping timeout: 240 seconds] |
16:23:17 | | dabs joins |
16:27:13 | | linuxgemini (linuxgemini) joins |
16:27:33 | | grill (grill) joins |
16:43:15 | | linuxgemini quits [Client Quit] |
16:46:18 | <h2ibot> | Anonymoususer852 edited Talk:Main Page (-228, Undo revision 56604 by…): https://wiki.archiveteam.org/?diff=56606&oldid=56604 |
17:03:20 | <h2ibot> | OrIdow6 edited Shutdown rumors, hoaxes, and scares (+507, Amino): https://wiki.archiveteam.org/?diff=56607&oldid=52757 |
17:14:22 | <h2ibot> | Anonymoususer852 edited Talk:Banned from youtube (+986, /* Re: Banned from youtube */ new section): https://wiki.archiveteam.org/?diff=56608&oldid=56598 |
17:37:56 | <egallager> | Tom Lehrer died: https://tomlehrersongs.com/ |
17:38:46 | <pokechu22> | We've saved it several times in the past (https://archive.fart.website/archivebot/viewer/domain/tomlehrersongs.com) so it might be best to wait for a notice on the site |
17:39:01 | <pokechu22> | though it's only 5gb so probably not too big of an issue |
17:40:20 | <@OrIdow6> | RIP |
17:40:49 | <@OrIdow6> | That thing was on Deathwatch for what, 5 years? |
17:41:15 | | etnguyen03 (etnguyen03) joins |
17:43:26 | <h2ibot> | Cooljeanius edited Shutdown rumors, hoaxes, and scares (+32, use URL template): https://wiki.archiveteam.org/?diff=56609&oldid=56607 |
17:44:50 | | feed quits [Quit: Limnoria 2024.12.20] |
17:45:30 | | feed (feed) joins |
17:48:14 | | grill quits [Ping timeout: 260 seconds] |
17:50:02 | | hexagonwin_ joins |
17:50:22 | | DartRetaliator_ joins |
17:51:44 | | hexagonwin quits [Ping timeout: 260 seconds] |
17:53:29 | | DartRetaliator__ quits [Ping timeout: 260 seconds] |
17:53:39 | | etnguyen03 quits [Client Quit] |
17:55:07 | | etnguyen03 (etnguyen03) joins |
18:03:18 | | grill (grill) joins |
18:12:03 | | dabs quits [Client Quit] |
18:12:13 | | dabs joins |
18:12:30 | <h2ibot> | Anonymoususer852 edited Data compression algorithms and tools (+2427, /* TAR/SHAR */ New section - created primarily…): https://wiki.archiveteam.org/?diff=56610&oldid=56537 |
18:14:31 | <h2ibot> | Anonymoususer852 edited Formats (+85, /* Compression */ Add missing links, especially…): https://wiki.archiveteam.org/?diff=56611&oldid=49459 |
18:16:54 | | atphoenix_ quits [Read error: Connection reset by peer] |
18:17:29 | | lennier2_ quits [Read error: Connection reset by peer] |
18:17:51 | | atphoenix_ (atphoenix) joins |
18:18:09 | | lennier2_ joins |
18:24:32 | <h2ibot> | Anonymoususer852 edited Data compression algorithms and tools (+108, /* TAR/SHAR */ Add reference to Tape Archive…): https://wiki.archiveteam.org/?diff=56612&oldid=56610 |
18:40:09 | | grill quits [Ping timeout: 260 seconds] |
18:45:44 | | grill (grill) joins |
18:49:13 | <@OrIdow6> | pokechu22: On itch,io, how are you doing on identification? IDK what their rates look like but a fallback approach if we can't reliably identify which ones are NSFW might be to cross reference with search results |
18:49:30 | <@OrIdow6> | Famous last words but I think I'm nearly done with the script |
18:49:56 | <pokechu22> | OrIdow6: the data.json files, which I downloaded with archivebot at con=3, d=100 (I know that at con=6, d=100 you get 429s, so there is a limit) |
18:50:40 | <pokechu22> | https://transfer.archivete.am/R5zMT/itch.io_find_nsfw_games.py is what I used |
18:50:41 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/R5zMT/itch.io_find_nsfw_games.py |
18:51:35 | <pokechu22> | data.json should also be usable to identify games tagged html5 (which might not exactly match games that have html5 embeds), and free/name your price games (that should be exact since data.json has price information) |
18:52:54 | | midou quits [Ping timeout: 240 seconds] |
19:01:57 | | midou joins |
19:04:32 | | awauwa quits [Quit: awauwa] |
19:20:33 | | szczot3k quits [Remote host closed the connection] |
19:20:42 | | szczot3k (szczot3k) joins |
19:27:08 | | lemuria quits [Read error: Connection reset by peer] |
19:27:54 | | lemuria (lemuria) joins |
19:31:48 | | etnguyen03 quits [Client Quit] |
19:32:09 | | etnguyen03 (etnguyen03) joins |
19:36:34 | | grill quits [Ping timeout: 240 seconds] |
19:41:59 | | etnguyen03 quits [Client Quit] |
19:51:58 | | dabs quits [Client Quit] |
20:19:19 | | dabs joins |
20:42:36 | | Island joins |
20:46:23 | | mls (mls) joins |
21:57:27 | | Webuser090327 quits [Quit: Ooops, wrong browser tab.] |
22:00:38 | | etnguyen03 (etnguyen03) joins |
22:09:51 | | pokechu22 quits [Quit: Network maintenance] |
22:14:49 | | cascode quits [Ping timeout: 260 seconds] |
22:15:01 | | cascode joins |
22:35:49 | | cascode quits [Ping timeout: 260 seconds] |
22:35:55 | | simon816 quits [Remote host closed the connection] |
22:36:01 | | cascode joins |
22:40:14 | | cascode quits [Ping timeout: 240 seconds] |
22:40:26 | | cascode joins |
22:40:51 | | simon816 (simon816) joins |
22:41:49 | | pokechu22 (pokechu22) joins |
22:44:01 | | Guest58 joins |
22:47:19 | | Webuser172778 joins |
22:48:03 | | Webuser172778 quits [Client Quit] |
22:49:14 | | cascode quits [Ping timeout: 260 seconds] |
22:53:54 | | pokechu22 quits [Ping timeout: 240 seconds] |
22:56:16 | | pokechu22 (pokechu22) joins |
22:59:02 | | Dada quits [Remote host closed the connection] |
23:12:31 | | hyenatown joins |
23:23:13 | | cascode joins |
23:29:56 | | Hackerpcs quits [Quit: Hackerpcs] |
23:35:27 | | cascode quits [Read error: Connection reset by peer] |
23:35:39 | | cascode joins |
23:36:39 | | Hackerpcs (Hackerpcs) joins |
23:42:11 | | etnguyen03 quits [Client Quit] |