00:08:41dumbgoy__ joins
00:22:00BlueMaxima joins
00:23:07BlueMaxima quits [Read error: Connection reset by peer]
00:23:17BlueMaxima joins
00:26:38icedice quits [Client Quit]
00:27:18beario quits [Ping timeout: 258 seconds]
00:37:19sarge (sarge) joins
00:40:43mr_sarge quits [Ping timeout: 258 seconds]
00:41:49sonick (sonick) joins
00:45:55Mateon2 joins
00:47:35Mateon1 quits [Ping timeout: 252 seconds]
00:47:35Mateon2 is now known as Mateon1
00:55:54AmAnd0A quits [Read error: Connection reset by peer]
00:56:11AmAnd0A joins
01:07:54benjins quits [Remote host closed the connection]
01:08:05benjins joins
01:08:44fireonlive quits [Quit: Connection gently closed by peer]
01:09:16Naruyoko quits [Read error: Connection reset by peer]
01:09:49fireonlive (fireonlive) joins
01:22:51Naruyoko joins
01:36:06benjins2_ joins
01:38:13benjins2 quits [Ping timeout: 258 seconds]
01:51:15benjins quits [Remote host closed the connection]
01:51:30benjins joins
02:03:02justmolamola joins
02:05:49fishingforsoup quits [Ping timeout: 258 seconds]
02:08:04railen69 quits [Remote host closed the connection]
02:11:05railen63 joins
02:28:03dumbgoy__ quits [Ping timeout: 258 seconds]
03:18:03qwertyasdfuiopghjkl quits [Remote host closed the connection]
03:31:23qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
03:40:58gfhh1 quits [Remote host closed the connection]
03:41:18gfhh1 joins
03:46:54Jake quits [Quit: Ping timeout (120 seconds)]
03:47:08Jake (Jake) joins
03:49:01Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
03:49:38Shjosan (Shjosan) joins
04:02:14shreyasminocha quits [Remote host closed the connection]
04:02:14thehedgeh0g quits [Remote host closed the connection]
04:02:14evan quits [Remote host closed the connection]
04:02:18evan joins
04:02:21shreyasminocha (shreyasminocha) joins
04:02:21thehedgeh0g (mrHedgehog0) joins
04:04:48Island quits [Read error: Connection reset by peer]
04:19:21BearFortress quits [Read error: Connection reset by peer]
04:19:33BearFortress joins
04:22:06jacksonchen666 quits [Ping timeout: 245 seconds]
04:22:50jacksonchen666 (jacksonchen666) joins
04:24:32HP_Archivist quits [Read error: Connection reset by peer]
04:24:59HP_Archivist (HP_Archivist) joins
04:37:02PredatorIWD joins
04:48:59dumbgoy__ joins
04:50:34Hans5958 quits [Quit: Reconnecting]
04:50:43Hans5958 (Hans5958) joins
04:55:02HP_Archivist quits [Read error: Connection reset by peer]
04:55:29HP_Archivist (HP_Archivist) joins
05:29:19sonick quits [Client Quit]
05:30:32HP_Archivist quits [Read error: Connection reset by peer]
05:30:59HP_Archivist (HP_Archivist) joins
05:55:40justmolamola quits [Remote host closed the connection]
06:05:26BigBrain quits [Ping timeout: 245 seconds]
06:22:50BigBrain (bigbrain) joins
06:44:46<AK>https://twitter.com/AdamRackis/status/1671923984835788803?t=7YyAlkFaB0iS7N05CQrNCQ&s=19
06:45:19<AK>Not sure if anyone's seen this, think there's a case to be made that css-tricks is at risk
06:54:24BlueMaxima quits [Client Quit]
07:02:24Earendil7 quits [Quit: Leaving]
07:27:44Minkafighter quits [Quit: The Lounge - https://thelounge.chat]
07:28:51Minkafighter joins
07:50:51<razul>Sure looks like it.
07:53:17Earendil7 (Earendil7) joins
08:11:20null joins
08:12:17rktk quits [Ping timeout: 258 seconds]
08:12:47beario joins
08:27:28appledash quits [Ping timeout: 265 seconds]
08:27:53appledash joins
08:30:36<@OrIdow6>Anyone already looked at Wysp (August 1)? Thinking of taking something far away, so I stop with these last-minute projects
08:54:32yasomi quits [Ping timeout: 265 seconds]
08:55:06yasomi (yasomi) joins
09:00:12yasomi quits [Ping timeout: 258 seconds]
09:04:03yasomi (yasomi) joins
09:23:32Chris5010 quits [Ping timeout: 265 seconds]
09:28:48Minkafighter quits [Client Quit]
09:29:05Minkafighter joins
09:29:20Church quits [Ping timeout: 265 seconds]
09:29:50Church (Church) joins
09:48:49Craigle quits [Quit: The Lounge - https://thelounge.chat]
09:49:18Craigle (Craigle) joins
09:55:09birdjj quits [Client Quit]
09:55:33birdjj joins
09:55:46AmAnd0A quits [Remote host closed the connection]
09:55:59AmAnd0A joins
09:58:52birdjj quits [Client Quit]
09:59:15birdjj joins
10:00:01railen63 quits [Remote host closed the connection]
10:00:19railen63 joins
10:02:18dumbgoy__ quits [Ping timeout: 258 seconds]
10:10:37driib quits [Quit: The Lounge - https://thelounge.chat]
10:11:00driib (driib) joins
10:16:30birdjj quits [Client Quit]
10:30:30pie_ quits []
10:30:38pie_ joins
11:20:59jacksonc1 (jacksonchen666) joins
11:21:16jacksonchen666 quits [Ping timeout: 245 seconds]
11:23:07jacksonc1 quits [Client Quit]
12:06:37Minkafighter quits [Client Quit]
12:08:16Minkafighter joins
12:13:55Iki quits [Read error: Connection reset by peer]
12:40:01that_lurker quits [Quit: Clowning around is not the same as fooling around...I am a clown, not a fool]
12:40:09that_lurker (that_lurker) joins
12:46:29AmAnd0A quits [Read error: Connection reset by peer]
12:47:07AmAnd0A joins
12:51:44AmAnd0A quits [Ping timeout: 258 seconds]
12:52:24AmAnd0A joins
13:10:56emberquill quits [Quit: The Lounge - https://thelounge.chat]
13:11:15emberquill (emberquill) joins
13:37:20<@arkiver>OrIdow6: no yet here! go ahead :)
13:37:25<@arkiver>more than a month left!
14:06:02Island joins
14:12:47Arcorann quits [Ping timeout: 252 seconds]
14:30:14<yts98>Has anyone archived Game Atsumaru? If not, I suggest archiving with ArchiveBot first, and I'll inspect how to find game assets in a few days.
14:39:57null quits [Client Quit]
14:41:05rktk (rktk) joins
14:44:40thenes (thenes) joins
15:25:50<@arkiver>yts98: it's actually not that difficult. the actual 'game' seems to be some HTML with references to the assets
15:27:23<@arkiver>i see some games though that heavily use js to generate asset URLs
15:30:52thenes quits [Remote host closed the connection]
15:31:49<yts98>there's in-game API https://atsumaru.github.io/api-references/ related to comments and score boards,
15:32:44thenes (thenes) joins
15:33:18<yts98>and the version number 1147 of https://resource.game.nicovideo.jp/games/gm9482/1147/index.html should be parsed from #stateSerialized of https://game.nicovideo.jp/atsumaru/games/gm9482
15:35:42<yts98>parsing 29754 games and search for the string "RPGAtsumaru" would be easy without the warrior
15:40:28<joepie91|m>AK: fwiw, anything DigitalOcean touches content-wise is always at risk. they have a long history of shoddy content marketing practices, pretty much from the first moment they started doing their 'guides' thing
15:41:08<joepie91|m>they talk a big game, but it's been obvious from the poor quality of their content (and the refusal to issue corrections) that they don't actually care about it in any way other than to draw in more customers by looking hip and helpful
15:44:25rktk quits [Client Quit]
15:54:29rktk (rktk) joins
15:54:56<fireonlive>i think it’s one of those like seo boosting things, get 30 articles out about roughly the same topic changing things slightly so they appear more in search results and are more in people’s minds / call to action to register for digitalocean. iirc they allow anyone to write for them and pay out a few bucks per article
15:56:30<fireonlive>oh lordy imagine a site stuffed with chatgpt instructionals
16:08:26yts98 leaves
16:08:29yts98 joins
16:24:59SF quits [Remote host closed the connection]
16:26:47SF joins
16:42:01LeGoupil joins
17:03:50yasomi quits [Ping timeout: 252 seconds]
17:03:57<@JAA>I believe we already archived css-tricks.com and some other things a few months ago when DO was doing layoffs.
17:08:59thenes quits [Client Quit]
17:15:03thenes (thenes) joins
17:35:07<masterX244>re the announcement in archiveteam: clearly too large for AB
17:36:36<nicolas17>...is this frequency of shutdowns normal?
17:37:42yasomi (yasomi) joins
17:38:57<@JAA>They have a URL shortener at sk.mu, but the codes are far too long to be bruteforcable. E.g. http://sk.mu/a90soh3wEbti for one of the most recent posts.
17:39:52<fireonlive>here's deathwatch, some years look busier than others: https://wiki.archiveteam.org/index.php/Deathwatch
17:40:01<fireonlive>but i imagine there's stuff that didn't make it cc nicolas17
17:40:53<@JAA>Yeah, stuff lands on Deathwatch if someone adds it, and I think some of the smaller things used to not be added as often.
17:45:54<@JAA>Ok, blogs can be enumerated via e.g. https://www.skyrock.com/common/r/skynautes/card/75598933
17:46:29<@JAA>Blog IDs go to around 124 million, so that's fun enough.
17:47:57<albertlarsan68>Skyblog (https://www.skyrock.com/blog/) announced that they will shut down the 21st August.
17:47:57<albertlarsan68>It was a pre-facebook social media, especially popular in France.
17:47:57<albertlarsan68>Has it been archived? Would it be archivable?
17:47:57<albertlarsan68>Can it be added to the Deathwatch page please?
17:48:12<@JAA>See above, we're already discussing it.
17:48:22<@JAA>And please do add it to Deathwatch. It's a wiki. :-)
17:48:59dumbgoy__ joins
17:49:57<@JAA>Server seems very stable, I'm already getting timeouts after just clicking around a bit.
17:51:14<albertlarsan68>It has been said that anonymized data will be saved to the INA and BNF (French authorities that archive mainly the TV and radio broadcast and books respectively)
17:54:23<albertlarsan68>They offer ways to save blogs, but using third-party tools (e.g. Cyotek WebCopy, A1 Website Download, HTTrack are methods on the official page)
17:57:03<pokechu22>Exorcism|m: re https://www.wysp.ws/, it seems like archivebot won't work well with it due to javascript. The "new" tab on the front page uses https://www.wysp.ws/timeline/load/?tlid=wysp-main&start=-1&rg=32&nb_col=3&order=antichronological&term_string=newest (and that progresses onwards). I also tried
17:57:05<pokechu22>https://www.wysp.ws/timeline/load/?tlid=wysp-main&start=-1&rg=32&nb_col=3&order=chronological&term_string=oldest and that seems like the oldest post it gives is https://www.wysp.ws/post/866261001/ which isn't the oldest (the "hall of fame" tab gives https://www.wysp.ws/post/8492023/ from 2013, while that post is 2017). IDs don't seem to be incremental so I'm not sure how to
17:57:07<pokechu22>go about saving everything.
18:00:54<thuban>do we have a source on the august 21 date for skyrock? i don't see it in the linked announcement
18:02:28<thuban>it's given in the news article, but no mention of where they got that information
18:04:42<thuban>LeGoupil, albertlarsan68: ^ any info?
18:06:26<LeGoupil>thuban: on https://www.skyrock.com/blog/ it's in small on the banner next to ICI T LIBRE
18:07:14<thuban>LeGoupil: so it is, thank you
18:09:01<h2ibot>Switchnode edited Deathwatch (+235, /* 2023 */ add skyrock): https://wiki.archiveteam.org/?diff=49997&oldid=49993
18:11:01<h2ibot>Switchnode edited Deathwatch (+8, /* 2023 */): https://wiki.archiveteam.org/?diff=49998&oldid=49997
18:17:41andrew quits [Client Quit]
18:48:20LeGoupil quits [Ping timeout: 252 seconds]
18:50:35andrew (andrew) joins
18:51:36<albertlarsan68>Official english statement: https://the-skyrock-team.skyrock.com/
18:52:29<albertlarsan68>Shared link: http://sk.mu/a3XJM6hYvHNo
18:53:05<albertlarsan68>Unshortened link: https://the-skyrock-team.skyrock.com/3356796874-posted-on-2023-06-22.html
18:54:09<h2ibot>Switchnode edited Deathwatch (+92, /* 2023 */ add english-language skyrock…): https://wiki.archiveteam.org/?diff=49999&oldid=49998
19:03:26<albertlarsan68>What should the IRC stream be named?
19:04:08<albertlarsan68>I can propose #downblog, #thunderblog
19:04:25andrew quits [Client Quit]
19:08:40<albertlarsan68>I quite like the second one, being a reference to French history (kinda): it is known (in France) that the Gaulois (IDK the English for that) were supposedly afraid of the sky falling on their head, this being the thunder.
19:08:56<fireonlive>with the new emoji policy arkiver +1'd i recommend #⛈️📝
19:08:57<fireonlive>:P
19:10:55<fireonlive>s/the new/my proposed/
19:13:55appledash quits [Ping timeout: 258 seconds]
19:17:23andrew (andrew) joins
19:21:30sarayalth quits [Client Quit]
19:24:17<albertlarsan68>It seems like each blog (subdomain in "<blog-id>.skyrock.com") has a sitemap.xml, so maybe would be a warrior project?
19:24:32<fireonlive>ooh that's useful
19:25:30<masterX244>depending on average size of a blog it might blow up on VM warriors
19:25:52appledash joins
19:27:44<fireonlive>hmmm. sitemap links could be reported back to the trackers I guess? or pre-scraped?
19:28:39<albertlarsan68>It seems like the "canonical" URL is composed of an ID example URL: https://lequipe-skyrock.skyrock.com/3356709252-Comment-sauvegarder-ton-blog.html, that seems like it is sequential and unique across the network. However, I'm not sure of its usefulness.
19:30:59andrew quits [Client Quit]
19:31:47<nstrom|m>They have an api https://www.skyrock.com/developer/documentation/api/
19:31:59LeGoupil joins
19:32:02<albertlarsan68>The ID is the only part that identifies a post within a blog, and is the only needed part to find the post. If the name is not correct of absent, it will redirect to the correct url.
19:34:04<masterX244>gah, we still need to know which blog it is on, can't buzz out the blog where the article lives on via a redirect
19:34:27<@JAA>#bowlofpetunias
19:34:33<albertlarsan68>Seems like it
19:34:47<albertlarsan68>JAA Why?
19:35:08<@JAA>Just my channel name suggestion.
19:36:59<masterX244>since you didnt write "proposed" it looked like a channel announcement
19:38:27<albertlarsan68>And what would be the pun, as required by the rules?
19:39:17LeGoupil quits [Client Quit]
19:40:47<fireonlive>#25732e07-b4e7-42c0-ad8a-a9bad8716b9b
19:40:48<fireonlive>:3
19:43:15<@JAA>albertlarsan68: Looks like someone needs to listen to/read/watch The Hitchhiker's Guide To The Galaxy.
19:43:24<@JAA>:-)
19:45:00<albertlarsan68>Something that is not contained in my French culture...
19:45:02<fireonlive>or; if you prefer; UUIDv7: #06495f63-2f1c-74f3-8000-0efab13ab36e
19:45:03<fireonlive>:D
19:45:16<fireonlive>but nah lmao
19:45:27<fireonlive>imagine trying to manage a channel list full of those
19:45:37andrew (andrew) joins
19:46:40<albertlarsan68>It seems like the API can't help too much doing basic post retrieving/discovering, the sitemap.xml file contains what we need.
19:46:57AmAnd0A quits [Read error: Connection reset by peer]
19:47:18AmAnd0A joins
19:48:33<masterX244>probably a 2-stage project needed, buzzing out the posts/articles via sitemap hunting and then the core retrieval
19:50:50<albertlarsan68>Maybe get a first pool, then grow it via backfeed?
19:51:04andrew quits [Client Quit]
19:52:02<albertlarsan68>There is also an atom feed: example https://lequipe-skyrock.skyrock.com/atom.xml
19:54:29<albertlarsan68>or having someone create an account and maybe subscribe to everyone discovered, then maybe (not verified) there can be a feed/api callback (webhook type) to stay up to date on new posts?
19:59:12andrew (andrew) joins
19:59:53<albertlarsan68>Also, I have tried to create a wiki page for Skyblog (my user account is sensibly the same as on IRC).
20:00:53yts98 leaves
20:00:56yts98 joins
20:03:31<@arkiver>let's see
20:03:35<@arkiver>what is this
20:04:23<masterX244>french blogging site, really old. see on #archivteteam
20:04:36<@arkiver>yep
20:04:43<@arkiver>we have a channel! #bowlofpetunias thanks JAA
20:09:01<albertlarsan68>What would be the strategy?
20:56:31lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)]
21:01:12Unholy23613 quits [Client Quit]
21:01:33Unholy23613 (Unholy2361) joins
21:07:05lennier1 (lennier1) joins
21:09:21Matthww1 quits [Quit: The Lounge - https://thelounge.chat]
21:21:10Matthww1 joins
21:22:02PredatorIWD quits [Client Quit]
21:22:17PredatorIWD joins
21:42:32savethestuffyo quits [Quit: WeeChat 3.8]
21:45:11icedice (icedice) joins
22:00:04Hajdar quits [Remote host closed the connection]
22:00:19Hajdar (Hajdar) joins
22:07:25Dallas (Dallas) joins
22:09:20<imer>JAA: is the ia repo not receptive to fixes then? seems like there's a lot of issues with it
22:09:28<imer>or just too much work fixing it?
22:09:53<imer>seeing as you implemented an entirely new uploader
22:10:21<@JAA>imer: Jake is receptive to fixes, and I've fixed a bunch of things myself.
22:10:43<@JAA>I implemented ia-upload-stream separately because adding multi-part uploads and parallelism to ia would've been a pain.
22:10:51<Jake>(just to be extra clear, different Jake :)
22:11:09<@JAA>Oh right :-)
22:11:31<fireonlive>aw darn was just about to PM you with my list of wants :3
22:12:31<@JAA>After I implemented multi-part uploads, I realised they were ... not exactly great on IA's side of things. There's a bunch of copying parts around, which adds significant overhead.
22:12:42<@JAA>So I ended up adding the single-part uploading as well.
22:13:40<imer>alright, makes sense. might look at the ia stuff then, if its easy enough to sort out I might
22:15:19<fireonlive>imer: source seems to be here: https://github.com/jjjake/internetarchive
22:15:27<imer>yup yup
22:15:36<fireonlive>:)
22:15:58<fireonlive>oh there's a whole how to contribute page!
22:16:02<imer>just asking since the stuff I ran into seemed to have been known for a while, never know if a project is on life support or something
22:16:09<fireonlive>i didn't have to like s/download/details/ and poke around lol
22:16:16<fireonlive>🤦
22:17:17<fireonlive>i do like that official releases are on the archive itself =]
22:25:13<@arkiver>a lot is happening in Russia, and most of it is happening through Telegram. we archive Telegram. if you haven't done so, please join and run the telegram project #telegrab
22:30:09<@OrIdow6>pokechu22: See above, I'll probably write a warrior project or similar for wysp.ys
22:35:22wyatt8750 quits [Remote host closed the connection]
22:43:12<myself>Am running a telegrab container, but the channel was low signal-to-noise. Perhaps it's time to rejoin..
22:45:59<nicolas17>arkiver: do telegram and imgur use the same targets? I could slow down the bruteforce queueing if we need target capacity
22:58:02Hackerpcs quits [Quit: Hackerpcs]
23:00:04Hackerpcs (Hackerpcs) joins
23:00:30<@arkiver>nicolas17: if needed i'll slow down the imgur project
23:00:40<nicolas17>okay
23:01:01<nicolas17>I'm now enqueueing bruteforce lists when todo reaches a threshold, rather than every N minutes
23:01:23<nicolas17>so if you reduce the rate limit, it will adapt to that
23:02:45<fireonlive>i love the jank → fancier route things always take over time
23:02:53<fireonlive>even if there is still jank
23:03:25<fireonlive>is that what having children is like?
23:03:27<fireonlive>lol
23:12:28<@arkiver>any youtube videos or channels related to what is happening in Russia now can be archived at #down-the-tube
23:43:58BlueMaxima joins