00:00:36Wohlstand quits [Client Quit]
00:01:12gosc joins
00:04:01Shard0 (Shard) joins
00:04:48Shard quits [Ping timeout: 268 seconds]
00:04:48Shard0 is now known as Shard
00:07:58itachi1706 quits [Ping timeout: 268 seconds]
00:15:09<klea>nicolas17: I've noticed that https://opensource.samsung.com/uploadSearch?searchValue=M146BXXU8CXK6 has more than one item, I downloaded both for good measure (the big one, the one you want is still downloading), what should I do with the other zip?
00:15:43<nicolas17>klea: yes, usually in those cases I already downloaded the smaller file
00:15:51<klea>Ah.
00:15:56<klea>so rm it?
00:16:12<klea>9864f97e375e578b2654e6019ba29c16 SM-M146B_14_Opensource_M146BXXU8CYA1_M146BXXS8CYB2.zip
00:16:17<klea>(md5)
00:16:36<nicolas17>yep I have it
00:16:37<nicolas17>9864f97e375e578b2654e6019ba29c16 13768/SM-M146B_14_Opensource_M146BXXU8CYA1_M146BXXS8CYB2.zip
00:17:02<klea>Ack, removed locally.
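The checksum comparison above can also be done without the `md5sum` tool. A minimal Python sketch follows; the helper name `md5_of_stream` is hypothetical and not part of any tool mentioned in the channel, and the demonstration hashes an in-memory stream rather than the real zip.

```python
import hashlib
import io


def md5_of_stream(stream, chunk_size=1 << 20):
    """Return the hex MD5 digest of a binary stream, read in chunks.

    Reading in chunks keeps memory use flat even for multi-gigabyte zips.
    """
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# Demonstration on an in-memory stream (stand-in for a real file).
demo_digest = md5_of_stream(io.BytesIO(b"hello"))
print(demo_digest)
```

For a real download, open the file in binary mode (`open(path, "rb")`), pass the handle to the function, and compare the result with the digest posted in the channel.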
00:17:48itachi1706 (itachi1706) joins
00:33:35etnguyen03 (etnguyen03) joins
00:43:30<klea>Also, https://opensource.samsung.com/uploadSearch?searchValue=S731BXXU3AYJ9 seems to have more than one result, or at least more than one item you might care about.
00:44:29<klea>oh nvm, almost every other file except a few are in kilobyte ranges.
01:03:01Arcorann_ (Arcorann) joins
01:04:29SootBector quits [Ping timeout: 260 seconds]
01:07:14SootBector (SootBector) joins
01:21:33benjins3_ quits [Remote host closed the connection]
01:21:50benjins3_ joins
01:26:51dabs joins
01:27:47dabs quits [Remote host closed the connection]
01:28:00dabs joins
01:28:56nukke (nukke) joins
01:37:29polypeptide (polypeptide) joins
01:41:49polypept1 quits [Ping timeout: 260 seconds]
01:56:34SootBector quits [Remote host closed the connection]
01:57:42SootBector (SootBector) joins
01:58:58pabs is now known as RJHacker92344
01:59:16pabs (pabs) joins
01:59:41<klea>kline: If you want, you can help manually download Samsung files, via https://data.nicolas17.xyz/samsung-grab/ (IIRC you were looking for ways to contribute)
01:59:51RJHacker92344 quits [Read error: Connection reset by peer]
02:04:12pabs quits [Excess Flood]
02:04:58pabs (pabs) joins
02:07:07pabs quits [Read error: Connection reset by peer]
02:11:31dabs quits [Read error: Connection reset by peer]
02:27:52TheEnbyperor_ quits [Ping timeout: 268 seconds]
02:29:11TheEnbyperor quits [Ping timeout: 268 seconds]
02:31:58TheEnbyperor joins
02:41:02pabs (pabs) joins
02:42:19TheEnbyperor_ (TheEnbyperor) joins
03:16:06polypeptide quits [Remote host closed the connection]
03:16:21polypeptide (polypeptide) joins
03:30:26polypeptide quits [Remote host closed the connection]
03:30:39polypeptide (polypeptide) joins
03:31:08etnguyen03 quits [Client Quit]
03:33:15etnguyen03 (etnguyen03) joins
03:36:28DogsRNice quits [Read error: Connection reset by peer]
03:38:55TheEnbyperor quits [Read error: Connection reset by peer]
03:39:01TheEnbyperor_ quits [Read error: Connection reset by peer]
03:45:35<h2ibot>TripleCamera edited Template:Wikis (+21, +WikiKeeper): https://wiki.archiveteam.org/?diff=61007&oldid=59984
03:46:24etnguyen03 quits [Remote host closed the connection]
03:49:36TheEnbyperor (TheEnbyperor) joins
03:50:08TheEnbyperor_ joins
04:00:27n9nes quits [Ping timeout: 268 seconds]
04:11:43n9nes joins
04:15:47pabs quits [Ping timeout: 268 seconds]
04:15:54polypeptide quits [Remote host closed the connection]
04:16:14polypeptide (polypeptide) joins
04:30:09pabs (pabs) joins
05:24:44nexussfan quits [Quit: Konversation terminated!]
05:53:16chipmunk joins
05:53:27<chipmunk>hi
05:53:30<chipmunk>so
05:54:39<chipmunk>as you may know a while ago scratch.mit.edu stopped allowing users to get unshared projects via api
05:55:00<chipmunk>(should be documented on the wiki)
05:55:31<chipmunk>however somewhat recently some new endpoints for getting projects appeared
05:56:19<chipmunk>i believe, https://scratch-projects.scratch.org/123, https://scratch-projects-v2.scratch.org/123, https://scratch-projects-v3.scratch.org/1
05:57:08<chipmunk>i mentioned this in #fleshwound and github issues a while ago but i suppose those are not checked
05:58:09<chipmunk>but recently the first and last ones seemed to go down and i realized i should have probably mentioned this in a channel where it's more likely to be seen
05:58:15<chipmunk>sorry i neglected to do so sooner
06:00:14<chipmunk>but the v2 one is still up in case it is of interest
06:01:00<chipmunk>i'll sleep now but will check the logs later if i don't forget to
06:01:03chipmunk quits [Client Quit]
06:13:58Nekroschizofrenetyk joins
06:29:21Island quits [Read error: Connection reset by peer]
06:33:35SootBector quits [Remote host closed the connection]
06:34:49SootBector (SootBector) joins
07:11:01TastyWiener95 quits [Quit: So long, farewell, auf wiedersehen, good night]
07:17:20TastyWiener95 (TastyWiener95) joins
08:08:18<Nekroschizofrenetyk>Hmmm, interesting. https://my.mail.ru/mail/ - Moy Mir, it's a social network belonging to VK (along with VKontakte and Odnoklassniki). From what I've seemed to have experienced, VKontakte itself uses some rather aggressive bot-protection mechanism. Is it true for MM, though? I SPNd one profile and it seemed to archive well. No job to any extent
08:08:18<Nekroschizofrenetyk>done in AB, I see. Obviously, large. In July 2019 there were 5m users active in a month. 57.6m people had mail.ru email addresses back in February 2021, though definitely only a fraction would be active on MM.
08:21:49michaelblob76 quits [Quit: yoop]
08:22:30michaelblob764 joins
08:25:08michaelblob764 quits [Client Quit]
08:28:01michaelblob764 joins
08:57:52Webuser830606 joins
08:57:59Webuser830606 quits [Client Quit]
09:03:46TheEnbyperor quits [Ping timeout: 268 seconds]
09:03:51TheEnbyperor_ quits [Ping timeout: 268 seconds]
09:08:02TheEnbyperor joins
09:17:57sg72 quits [Ping timeout: 268 seconds]
09:18:29TheEnbyperor_ (TheEnbyperor) joins
09:19:06sg72 joins
10:02:44Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.]
10:02:56equinoxe joins
10:03:22Nekroschizofrenetyk joins
10:22:27FiTheArchiver joins
10:23:27FiTheArchiver quits [Read error: Connection reset by peer]
10:23:59SootBector quits [Remote host closed the connection]
10:24:24FiTheArchiver joins
10:25:08SootBector (SootBector) joins
10:26:23FiTheArchiver quits [Client Quit]
10:34:29croissant_ joins
10:37:30croissant quits [Ping timeout: 268 seconds]
10:37:56lun4 quits [Quit: Ping timeout (120 seconds)]
10:38:12lun4 joins
10:46:34<h2ibot>Hans5958 edited Main Page/Current Projects (+127, Create wrapper to prepare of list display for…): https://wiki.archiveteam.org/?diff=61008&oldid=60884
10:46:35<h2ibot>Hans5958 edited Template:CurrentWarrior (-191, Make as list): https://wiki.archiveteam.org/?diff=61009&oldid=60709
10:46:40myself quits [Read error: Connection reset by peer]
10:46:50myself joins
10:48:35<h2ibot>Hans5958 edited Main Page/Current Projects (+11, Adjust margin a bit): https://wiki.archiveteam.org/?diff=61010&oldid=61008
10:51:39SootBector quits [Remote host closed the connection]
10:52:47SootBector (SootBector) joins
10:53:35<h2ibot>Hans5958 edited MoinMoin (+11, Add {{wikis}}): https://wiki.archiveteam.org/?diff=61011&oldid=58717
11:00:01Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat]
11:00:37polypeptide quits [Remote host closed the connection]
11:01:21polypeptide (polypeptide) joins
11:02:45Bleo1826007227196234552220110 joins
11:06:37<h2ibot>Hans5958 edited Running Archive Team Projects with Docker (-5): https://wiki.archiveteam.org/?diff=61012&oldid=60849
11:07:37<h2ibot>Hans5958 edited Running Archive Team Projects with Docker (+6): https://wiki.archiveteam.org/?diff=61013&oldid=61012
11:17:30<equinoxe>Hi All! In page https://wiki.archiveteam.org/index.php/Deaths_in_2025 is not record about Vasili Golovachyov daed in 2025.09.07 https://en.wikipedia.org/wiki/Vasili_Golovachyov His official site http://www.golovachev.ru/ Add this please.
11:17:56<equinoxe>*died
11:19:22<myself>The top of the page explains where the data comes from.
11:21:35<equinoxe>Yes, i know. But today is 2026 and record still not there.
11:22:14<klea>The bot account HadeanEon is not always active.
11:23:23<equinoxe>Ok, thank you for information.
11:34:19equinoxe quits [Client Quit]
11:38:01Wohlstand (Wohlstand) joins
11:41:46etnguyen03 (etnguyen03) joins
11:46:06qmastery joins
11:47:28<qmastery>Good day, dear friends!
11:48:10croissant joins
11:49:11<qmastery>klea : thank you so much for your kind guidance and kind help, I successfully captured a test website using "grab-site" and then "opened it" for local browsing with webrecorder/pywb
11:49:29<Yakov>klea: Pretty sure VoynichCr maintains the bot, they're not really active though
11:49:45<klea>Yeah, having the source code be available would make it possible for others to run it.
11:51:35croissant_ quits [Ping timeout: 268 seconds]
11:53:46<qmastery>Now I am capturing a huge website (~50 GB) with "--warc-max-size=4294967296" (4 GB) so that it can be written to a set of DVDs. But one thing bothers me: I got about 100 ERRORs because of internet connection problems. I know that the URLs will be retried at the end. But it means that, e.g., a certain page is inside the first WARC archive, and the material
11:53:46<qmastery>embedded into this webpage that it wanted to download and failed - inside the last WARC archive
11:54:43<qmastery>Is there any way to resolve this fragmentation? I.e. by somehow "uniting" the WARCs after the whole capture ends / reindexing / splitting again ?
11:55:10Nekroschizofrenetyk quits [Client Quit]
11:56:54Wohlstand quits [Remote host closed the connection]
11:57:14Nekroschizofrenetyk joins
11:59:09<qmastery>So far I found https://github.com/maturban/WARCMerge but it seems to be really old
12:01:51Shard quits [Quit: Im doing something rq. Il brb]
12:03:52etnguyen03 quits [Client Quit]
12:04:06Shard (Shard) joins
12:04:23<qmastery>many people merge their WARCs just by " cat *.warc.gz > ./combined/all.warc.gz ", however it does not resolve this fragmentation issue (some page in 1st warc but its needed file is in last warc because of capture-time ERROR and delayed retry). Please tell, do you have any ideas how this could be fixed?
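The `cat` approach quoted above yields a readable file because concatenated gzip members form valid gzip data. The Python sketch below illustrates only that property, using made-up byte strings in place of real WARC records. As it shows, the data comes back in the original order, so concatenation alone leaves the fragmentation described above unchanged.

```python
import gzip


def concat_gzip(parts):
    """Join already-compressed gzip blobs, like `cat a.gz b.gz > all.gz`."""
    return b"".join(parts)


# Placeholder payloads standing in for the contents of two WARC files.
first = gzip.compress(b"record from first WARC\n")
last = gzip.compress(b"record from last WARC\n")

combined = concat_gzip([first, last])

# gzip.decompress handles multi-member data and preserves member order.
text = gzip.decompress(combined)
print(text.decode())
```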
12:07:25Nekroschizofrenetyk quits [Excess Flood]
12:07:35Shard quits [Client Quit]
12:07:46Nekroschizofrenetyk joins
12:08:35Shard (Shard) joins
12:15:38pabs quits [Ping timeout: 268 seconds]
12:17:07pabs (pabs) joins
12:36:44<klea>https://waf.moe/fediverse/post/9b8172f4-cebb-49e2-ac92-5ea5e9b427f7
12:36:58<klea>> Hey wafflers! Me and the rest of the team has decided that due to the stress, internal issues, and other issues, the WAF.moe project is officially shutting down on June 2, 2026 at 12:00 AM (GMT+8).
13:00:08qmastery quits [Client Quit]
13:08:09qmastery9 joins
13:08:32qmastery1 joins
13:08:38Webuser557222 joins
13:09:03Webuser557222 quits [Client Quit]
13:11:08scotrod2 quits [Quit: The Lounge - https://thelounge.chat]
13:13:37klea quits [Remote host closed the connection]
13:13:37alexlehm quits [Remote host closed the connection]
13:17:50TheEnbyperor_ quits [Ping timeout: 268 seconds]
13:17:55TheEnbyperor quits [Ping timeout: 268 seconds]
13:19:14alexlehm (alexlehm) joins
13:19:17klea (jmjl) joins
13:20:18alexlehm quits [Remote host closed the connection]
13:21:20klea quits [Remote host closed the connection]
13:21:23alexlehm (alexlehm) joins
13:21:26klea (jmjl) joins
13:27:59myself quits [Read error: Connection reset by peer]
13:28:11myself joins
13:31:00TheEnbyperor joins
13:31:21TheEnbyperor_ (TheEnbyperor) joins
13:53:24nexussfan (nexussfan) joins
13:59:58qmastery9 quits [Client Quit]
13:59:58qmastery1 quits [Client Quit]
14:07:22myself quits [Read error: Connection reset by peer]
14:07:34myself joins
14:24:56scotrod2 joins
14:31:50Arcorann_ quits [Ping timeout: 268 seconds]
14:35:45<cruller>Why is #fedisperse invite-only? For historical reasons?
14:38:21<justauser>Perhaps something to do with the berries.space problems?
14:39:34<justauser>Looks like it was like that since forever: https://irclogs.archivete.am/efnet_archiveteam-ot/2019-06-24#l4e4f1ae1
14:39:48multisn8 quits [Quit: WeeChat 4.8.1]
14:39:54multisn8 (multisn8) joins
14:42:01<cruller>I suspect that too.
14:50:12<h2ibot>Klea edited List of websites excluded from the Wayback Machine (+292, Add berries.space): https://wiki.archiveteam.org/?diff=61014&oldid=60933
14:51:12<h2ibot>KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites): https://wiki.archiveteam.org/?diff=61015&oldid=61014
14:54:36ATinySpaceMarine quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
14:55:46etnguyen03 (etnguyen03) joins
14:57:24ATinySpaceMarine joins
15:04:10ATinySpaceMarine quits [Client Quit]
15:04:39ATinySpaceMarine joins
15:36:40TheEnbyperor quits [Ping timeout: 268 seconds]
15:37:12TheEnbyperor_ quits [Ping timeout: 268 seconds]
15:37:49TheEnbyperor joins
15:39:29TheEnbyperor_ (TheEnbyperor) joins
15:49:55DogsRNice joins
15:59:20<h2ibot>Calmevening edited List of websites excluded from the Wayback Machine (+32, add ads.wonder-wonder.com): https://wiki.archiveteam.org/?diff=61016&oldid=61015
16:03:53myself quits [Read error: Connection reset by peer]
16:04:06myself joins
16:22:23<h2ibot>KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites and/or updated count.): https://wiki.archiveteam.org/?diff=61017&oldid=61016
16:37:25<h2ibot>Nekroschizofrenetyk edited URLTeam (+254, /* Alive */ shorturl.fm): https://wiki.archiveteam.org/?diff=61018&oldid=60983
16:44:49SootBector quits [Ping timeout: 260 seconds]
16:45:58qmastery2 joins
16:47:00Webuser641496 joins
16:47:42SootBector (SootBector) joins
16:47:44Webuser641496 quits [Client Quit]
16:53:37that_lurker quits [Remote host closed the connection]
16:53:41that_lurker (that_lurker) joins
16:56:50nukke quits [Ping timeout: 268 seconds]
17:05:52retrograde quits [Remote host closed the connection]
17:06:15retrograde (retrograde) joins
17:11:28<h2ibot>Brad edited Deathwatch (+509, Added Outlook Lite): https://wiki.archiveteam.org/?diff=61019&oldid=60990
17:30:30nukke (nukke) joins
17:31:53<gosc>forgot to ask about this (or maybe I did already?) this page needs a japanese IP to work, how do you even go about saving it? https://edith.co.jp/lp/nagoya.kotsu-madoka/oshitabi/
17:33:34etnguyen03 quits [Client Quit]
17:39:41Webuser051960 quits [Quit: Ooops, wrong browser tab.]
17:42:08cyanbox_ joins
17:42:09cyanbox quits [Read error: Connection reset by peer]
17:54:43rohvani quits [Ping timeout: 268 seconds]
17:57:09etnguyen03 (etnguyen03) joins
18:00:51Island joins
18:03:58ducky quits [Ping timeout: 268 seconds]
18:04:35SootBector quits [Remote host closed the connection]
18:05:42SootBector (SootBector) joins
18:08:35<h2ibot>Nekroschizofrenetyk uploaded File:Moy mir.png (My World@Mail.Ru logo): https://wiki.archiveteam.org/?title=File%3AMoy%20mir.png
18:09:35<h2ibot>Nekroschizofrenetyk uploaded File:Moymir 12 04 2026.png (My World@Mail.Ru screenshot 12.04.2026): https://wiki.archiveteam.org/?title=File%3AMoymir%2012%2004%202026.png
18:10:13nukke quits [Ping timeout: 268 seconds]
18:10:42etnguyen03 quits [Client Quit]
18:18:58ducky (ducky) joins
18:36:18nukke (nukke) joins
18:39:13gosc quits [Quit: Leaving]
18:47:39<h2ibot>Nekroschizofrenetyk created My World@Mail.Ru (+1285, Created page with "{{Infobox project | logo =…): https://wiki.archiveteam.org/?oldid=61022
19:05:01nukke quits [Ping timeout: 268 seconds]
19:10:49nukke (nukke) joins
19:11:32Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.]
19:18:39rohvani joins
19:19:43Nekroschizofrenetyk joins
19:20:31nukke quits [Ping timeout: 268 seconds]
19:21:55Nekroschizofrenetyk quits [Client Quit]
19:22:36DogsRNice_ joins
19:27:13DogsRNice quits [Ping timeout: 268 seconds]
19:27:46qmastery2 quits [Client Quit]
19:38:19nexussfan quits [Ping timeout: 268 seconds]
19:42:32dabs joins
19:43:28dabs quits [Remote host closed the connection]
19:43:41dabs joins
19:48:50Webuser406909 joins
19:50:02Webuser406909 quits [Client Quit]
19:50:44TheEnbyperor quits [Ping timeout: 268 seconds]
19:51:16TheEnbyperor_ quits [Ping timeout: 268 seconds]
19:53:10TheEnbyperor joins
19:54:50nukke (nukke) joins
19:57:53TheEnbyperor_ (TheEnbyperor) joins
20:00:31nukke quits [Ping timeout: 268 seconds]
20:01:37nexussfan (nexussfan) joins
20:08:00bigfren joins
20:08:05<bigfren>hi there ^_^
20:10:15<bigfren>i was running grab-site and had a power outage. Luckily, all the files are there (ls -lh wpull.db wpull.log *.warc.gz) and the wpull.db database is fine (verified by sqlite3 wpull.db "PRAGMA integrity_check;" ). Things seem good
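The same integrity check can be scripted. A minimal sketch using Python's standard `sqlite3` module follows; the helper name `integrity_ok` is hypothetical. It assumes the database path already exists, since `sqlite3.connect` would otherwise create an empty file, which trivially passes the check.

```python
import sqlite3


def integrity_ok(db_path):
    """Return True if SQLite's integrity_check reports a single 'ok' row."""
    conn = sqlite3.connect(db_path)
    try:
        rows = conn.execute("PRAGMA integrity_check;").fetchall()
    finally:
        conn.close()
    # A healthy database yields exactly one row containing the string 'ok'.
    return rows == [("ok",)]


# Demonstration on a throwaway in-memory database.
print(integrity_ok(":memory:"))
```

For the case described above, the call would be `integrity_ok("wpull.db")`, run from the grab-site directory.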
20:10:26dabs quits [Read error: Connection reset by peer]
20:11:04<bigfren>Please tell, how do I resume the grab-site download ? when I re-run the grab-site command, it tries to create a new dir and start from scratch
20:12:07<bigfren>and if I try adding "grab-site --id "ID"" (where ID is a trailing ID of my existing directory), it simply tells "this dir already exists" :P
20:12:51TheEnbyperor_ quits [Ping timeout: 268 seconds]
20:12:56TheEnbyperor quits [Ping timeout: 268 seconds]
20:13:13<klea>https://github.com/ArchiveTeam/grab-site/issues/58
20:14:43<bigfren>klea: you are genius, thank you so much ^_^
20:16:12TheEnbyperor joins
20:16:58<bigfren>this pull request seems small enough, wonder why it's not merged yet https://github.com/ArchiveTeam/grab-site/pull/247/
20:20:57TheEnbyperor quits [Ping timeout: 268 seconds]
20:21:56<klea>(a) It begins with "WIP:" and is marked as draft, (b) The pull request's initial message says "Needs more testing and I think it may have issues with sites that require cookies", (c) I believe the test harness wasn't fully finished and no system to run CI on changes.
20:24:12bigfren quits [Client Quit]
20:24:47TheEnbyperor joins
20:25:53nukke (nukke) joins
20:26:32TheEnbyperor_ (TheEnbyperor) joins
20:28:15<pokechu22>Right, wpull currently stores cookies in memory, so those will be lost. I know someone was working on storing cookies in the DB instead (more for performance reasons, but it would help with resuming), but I don't know what the status is
20:33:10<klea>https://odysee.com/copilot-is-for-entertainment-purposes:1c7b0382db599a22fb16d50f92f1cbbe3e4dc370 claims Anthropic's TOS is different in some countries, (US, Australia one, UK/EU another)
20:33:28<klea>https://www.anthropic.com/legal/consumer-terms
20:35:28<klea>https://www.difchecker.com/BtqVrR9p/
20:46:46TheEnbyperor_ quits [Ping timeout: 268 seconds]
20:46:51TheEnbyperor quits [Ping timeout: 268 seconds]
20:48:44kansei quits [Quit: ZNC 1.10.1 - https://znc.in]
20:49:57TheEnbyperor joins
20:50:02TheEnbyperor_ (TheEnbyperor) joins
20:52:30etnguyen03 (etnguyen03) joins
20:55:14andrewnyr quits [Quit: The Lounge - https://thelounge.chat]
20:56:29bigfren joins
20:57:29<bigfren>Thank you. I rebased this "resume" patch series on top of master and it builds OK, will backup my dump and test really soon
20:57:54andrewnyr joins
20:58:17<bigfren>the author of this patch seems to have a lot of GitHub activity, I hope this patch is solid and he just didn't have time to get it merged
20:58:51<bigfren>I haven't used any custom cookies during my dump so won't be affected by their loss
20:59:45klea points out she shared a few possible reasons for it to not have been merged.
21:00:05Shard quits [Quit: Im doing something rq. Il brb]
21:02:21Shard (Shard) joins
21:08:24n9nes quits [Remote host closed the connection]
21:09:52n9nes joins
21:32:04retrograde quits [Remote host closed the connection]
21:32:45retrograde (retrograde) joins
21:50:15hamouda joins
22:10:37<bigfren>klea: thank you for your precautions. I have looked through it and also ran VS qwen and deepseek ai's. So far the current problems of the "resume" patch (aside from the mentioned "cookies" problem): 1) it does not remove the files at ./site-directory/temp/ before starting ; 2) it does not honor the custom --warc-file names ; 3) in a multi-parted situation
22:10:37<bigfren>like --warc-max-size=4294967296 ( 4 GB , I split it this way in order for my WARC parts to be writable to DVDs) - instead of appending to the last WARC (1.5GB in my case), it created a new part, which is inconvenient . 4) not sure what happens if the user changes the other-than "--resume" / "--dir" command line parameters (i.e. what if we
22:10:37<bigfren>erroneously supply a different URL ? ) 5) no cleanup of stop file after graceful stop
22:12:14<bigfren>However, based on what I see by wpull.db contents (old "todo" vs new "todo", etc) , it seems to successfully resume my download, which is a real savior in my situation (like 9 GB was dumped already and I did not want to start from scratch)
22:15:36<bigfren>Also luckily I don't see any duplicates at wpull.db --> done table , so far... (verified by commands like "gs-dump-urls ./path_to/wpull.db done > ~/done-new.txt", "sort ./done-new.txt > ./done-new-srt.txt", "uniq -d")
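The `sort` plus `uniq -d` check above can be sketched in Python as well. This assumes the dumped file holds one URL per line, as the shell pipeline implies. The helper name `duplicate_lines` and the sample URLs are hypothetical.

```python
from collections import Counter


def duplicate_lines(lines):
    """Return the sorted list of lines that occur more than once."""
    counts = Counter(line.rstrip("\n") for line in lines)
    return sorted(url for url, seen in counts.items() if seen > 1)


# Made-up sample standing in for the contents of done-new.txt.
sample = [
    "https://example.org/a\n",
    "https://example.org/b\n",
    "https://example.org/a\n",
]
print(duplicate_lines(sample))
```

With a real dump, pass an open file handle instead of the sample list. An empty result corresponds to `uniq -d` printing nothing.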
22:19:13<bigfren>If there is anything else you would like me to check, I am all in attention ;-)
22:21:24michaelblob764 quits [Quit: yoop]
22:24:23michaelblob764 joins
22:32:10Webuser510872 joins
22:32:44Webuser510872 quits [Client Quit]