| 00:00:36 | | Wohlstand quits [Client Quit] |
| 00:01:12 | | gosc joins |
| 00:04:01 | | Shard0 (Shard) joins |
| 00:04:48 | | Shard quits [Ping timeout: 268 seconds] |
| 00:04:48 | | Shard0 is now known as Shard |
| 00:07:58 | | itachi1706 quits [Ping timeout: 268 seconds] |
| 00:15:09 | <klea> | nicolas17: I've noticed that https://opensource.samsung.com/uploadSearch?searchValue=M146BXXU8CXK6 has more than one item, I downloaded both for good measure (the big one, the one you want is still downloading), what should I do with the other zip? |
| 00:15:43 | <nicolas17> | klea: yes, usually in those cases I already downloaded the smaller file |
| 00:15:51 | <klea> | Ah. |
| 00:15:56 | <klea> | so rm it? |
| 00:16:12 | <klea> | 9864f97e375e578b2654e6019ba29c16 SM-M146B_14_Opensource_M146BXXU8CYA1_M146BXXS8CYB2.zip |
| 00:16:17 | <klea> | (md5) |
| 00:16:36 | <nicolas17> | yep I have it |
| 00:16:37 | <nicolas17> | 9864f97e375e578b2654e6019ba29c16 13768/SM-M146B_14_Opensource_M146BXXU8CYA1_M146BXXS8CYB2.zip |
| 00:17:02 | <klea> | Ack, removed locally. |
| 00:17:48 | | itachi1706 (itachi1706) joins |
| 00:33:35 | | etnguyen03 (etnguyen03) joins |
| 00:43:30 | <klea> | Also, https://opensource.samsung.com/uploadSearch?searchValue=S731BXXU3AYJ9 seems to have more than one result, or at least more than one item you might care about. |
| 00:44:29 | <klea> | oh nvm, almost every other file except a few are in kilobyte ranges. |
| 01:03:01 | | Arcorann_ (Arcorann) joins |
| 01:04:29 | | SootBector quits [Ping timeout: 260 seconds] |
| 01:07:14 | | SootBector (SootBector) joins |
| 01:21:33 | | benjins3_ quits [Remote host closed the connection] |
| 01:21:50 | | benjins3_ joins |
| 01:26:51 | | dabs joins |
| 01:27:47 | | dabs quits [Remote host closed the connection] |
| 01:28:00 | | dabs joins |
| 01:28:56 | | nukke (nukke) joins |
| 01:37:29 | | polypeptide (polypeptide) joins |
| 01:41:49 | | polypept1 quits [Ping timeout: 260 seconds] |
| 01:56:34 | | SootBector quits [Remote host closed the connection] |
| 01:57:42 | | SootBector (SootBector) joins |
| 01:58:58 | | pabs is now authenticated as * |
| 01:58:58 | | pabs is now known as RJHacker92344 |
| 01:59:16 | | pabs (pabs) joins |
| 01:59:41 | <klea> | kline: If you want, you can help manually download Samsung files, via https://data.nicolas17.xyz/samsung-grab/ (IIRC you were looking for ways to contribute) |
| 01:59:51 | | RJHacker92344 quits [Read error: Connection reset by peer] |
| 02:04:12 | | pabs quits [Excess Flood] |
| 02:04:58 | | pabs (pabs) joins |
| 02:07:07 | | pabs quits [Read error: Connection reset by peer] |
| 02:11:31 | | dabs quits [Read error: Connection reset by peer] |
| 02:27:52 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 02:29:11 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 02:31:58 | | TheEnbyperor joins |
| 02:41:02 | | pabs (pabs) joins |
| 02:42:19 | | TheEnbyperor_ (TheEnbyperor) joins |
| 03:16:06 | | polypeptide quits [Remote host closed the connection] |
| 03:16:21 | | polypeptide (polypeptide) joins |
| 03:30:26 | | polypeptide quits [Remote host closed the connection] |
| 03:30:39 | | polypeptide (polypeptide) joins |
| 03:31:08 | | etnguyen03 quits [Client Quit] |
| 03:33:15 | | etnguyen03 (etnguyen03) joins |
| 03:36:28 | | DogsRNice quits [Read error: Connection reset by peer] |
| 03:38:55 | | TheEnbyperor quits [Read error: Connection reset by peer] |
| 03:39:01 | | TheEnbyperor_ quits [Read error: Connection reset by peer] |
| 03:45:35 | <h2ibot> | TripleCamera edited Template:Wikis (+21, +WikiKeeper): https://wiki.archiveteam.org/?diff=61007&oldid=59984 |
| 03:46:24 | | etnguyen03 quits [Remote host closed the connection] |
| 03:49:36 | | TheEnbyperor (TheEnbyperor) joins |
| 03:50:08 | | TheEnbyperor_ joins |
| 04:00:27 | | n9nes quits [Ping timeout: 268 seconds] |
| 04:11:43 | | n9nes joins |
| 04:15:47 | | pabs quits [Ping timeout: 268 seconds] |
| 04:15:54 | | polypeptide quits [Remote host closed the connection] |
| 04:16:14 | | polypeptide (polypeptide) joins |
| 04:30:09 | | pabs (pabs) joins |
| 05:24:44 | | nexussfan quits [Quit: Konversation terminated!] |
| 05:53:16 | | chipmunk joins |
| 05:53:27 | <chipmunk> | hi |
| 05:53:30 | <chipmunk> | so |
| 05:54:39 | <chipmunk> | as you may know a while ago scratch.mit.edu stopped allowing users to get unshared projects via api |
| 05:55:00 | <chipmunk> | (should be documented on the wiki) |
| 05:55:31 | <chipmunk> | however somewhat recently some new endpoints for getting projects appeared |
| 05:56:19 | <chipmunk> | i believe, https://scratch-projects.scratch.org/123, https://scratch-projects-v2.scratch.org/123, https://scratch-projects-v3.scratch.org/1 |
| 05:57:08 | <chipmunk> | i mentioned this in #fleshwound and github issues a while ago but i suppose those are not checked |
| 05:58:09 | <chipmunk> | but recently the first and last ones seemed to go down and i realized i should have probably mentioned this in a channel where it's more likely to be seen |
| 05:58:15 | <chipmunk> | sorry i neglected to do so sooner |
| 06:00:14 | <chipmunk> | but the v2 one is still up in case it is of intrest |
| 06:01:00 | <chipmunk> | i'll sleep now but will check the logs later if i don't forget to |
| 06:01:03 | | chipmunk quits [Client Quit] |
| 06:13:58 | | Nekroschizofrenetyk joins |
| 06:29:21 | | Island quits [Read error: Connection reset by peer] |
| 06:33:35 | | SootBector quits [Remote host closed the connection] |
| 06:34:49 | | SootBector (SootBector) joins |
| 07:11:01 | | TastyWiener95 quits [Quit: So long, farewell, auf wiedersehen, good night] |
| 07:17:20 | | TastyWiener95 (TastyWiener95) joins |
| 08:08:18 | <Nekroschizofrenetyk> | Hmmm, interesting. https://my.mail.ru/mail/ - Moy Mir, it's a social network belonging to VK (along with VKontakte and Odnoklassniki). From what I've seemed to have experienced, VKontakte itself uses some rather aggressive bot-protection mechanism. Is it true for MM, though? I SPNd one profile and it seemed to archive well. No job to any extent |
| 08:08:18 | <Nekroschizofrenetyk> | done in AB, I see. Obviously, large. In July 2019 there were 5m users active in a month. 57.6m people had mail.ru email addresses back in February 2021, though definitely only a fraction would be active on MM. |
| 08:21:49 | | michaelblob76 quits [Quit: yoop] |
| 08:22:30 | | michaelblob764 joins |
| 08:25:08 | | michaelblob764 quits [Client Quit] |
| 08:28:01 | | michaelblob764 joins |
| 08:57:52 | | Webuser830606 joins |
| 08:57:59 | | Webuser830606 quits [Client Quit] |
| 09:03:46 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 09:03:51 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 09:08:02 | | TheEnbyperor joins |
| 09:17:57 | | sg72 quits [Ping timeout: 268 seconds] |
| 09:18:29 | | TheEnbyperor_ (TheEnbyperor) joins |
| 09:19:06 | | sg72 joins |
| 10:02:44 | | Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.] |
| 10:02:56 | | equinoxe joins |
| 10:03:22 | | Nekroschizofrenetyk joins |
| 10:22:27 | | FiTheArchiver joins |
| 10:23:27 | | FiTheArchiver quits [Read error: Connection reset by peer] |
| 10:23:59 | | SootBector quits [Remote host closed the connection] |
| 10:24:24 | | FiTheArchiver joins |
| 10:25:08 | | SootBector (SootBector) joins |
| 10:26:23 | | FiTheArchiver quits [Client Quit] |
| 10:34:29 | | croissant_ joins |
| 10:37:30 | | croissant quits [Ping timeout: 268 seconds] |
| 10:37:56 | | lun4 quits [Quit: Ping timeout (120 seconds)] |
| 10:38:12 | | lun4 joins |
| 10:46:34 | <h2ibot> | Hans5958 edited Main Page/Current Projects (+127, Create wrapper to prepare of list display for…): https://wiki.archiveteam.org/?diff=61008&oldid=60884 |
| 10:46:35 | <h2ibot> | Hans5958 edited Template:CurrentWarrior (-191, Make as list): https://wiki.archiveteam.org/?diff=61009&oldid=60709 |
| 10:46:40 | | myself quits [Read error: Connection reset by peer] |
| 10:46:50 | | myself joins |
| 10:48:35 | <h2ibot> | Hans5958 edited Main Page/Current Projects (+11, Adjust margin a bit): https://wiki.archiveteam.org/?diff=61010&oldid=61008 |
| 10:51:39 | | SootBector quits [Remote host closed the connection] |
| 10:52:47 | | SootBector (SootBector) joins |
| 10:53:35 | <h2ibot> | Hans5958 edited MoinMoin (+11, Add {{wikis}}): https://wiki.archiveteam.org/?diff=61011&oldid=58717 |
| 11:00:01 | | Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat] |
| 11:00:37 | | polypeptide quits [Remote host closed the connection] |
| 11:01:21 | | polypeptide (polypeptide) joins |
| 11:02:45 | | Bleo1826007227196234552220110 joins |
| 11:06:37 | <h2ibot> | Hans5958 edited Running Archive Team Projects with Docker (-5): https://wiki.archiveteam.org/?diff=61012&oldid=60849 |
| 11:07:37 | <h2ibot> | Hans5958 edited Running Archive Team Projects with Docker (+6): https://wiki.archiveteam.org/?diff=61013&oldid=61012 |
| 11:17:30 | <equinoxe> | Hi All! In page https://wiki.archiveteam.org/index.php/Deaths_in_2025 is not record about Vasili Golovachyov daed in 2025.09.07 https://en.wikipedia.org/wiki/Vasili_Golovachyov His official site http://www.golovachev.ru/ Add this please. |
| 11:17:56 | <equinoxe> | *died |
| 11:19:22 | <myself> | The top of the page explains where the data comes from. |
| 11:21:35 | <equinoxe> | Yes, i know. But today is 2026 and record still not there. |
| 11:22:14 | <klea> | The bot account HadeanEon is not always active. |
| 11:23:23 | <equinoxe> | Ok, thank you for information. |
| 11:34:19 | | equinoxe quits [Client Quit] |
| 11:38:01 | | Wohlstand (Wohlstand) joins |
| 11:41:46 | | etnguyen03 (etnguyen03) joins |
| 11:46:06 | | qmastery joins |
| 11:47:28 | <qmastery> | Good day, dear friends! |
| 11:48:10 | | croissant joins |
| 11:49:11 | <qmastery> | klea : thank you so much for your kind guidance and kind help, I successfully captured a test website using "grab-site" and then "opened it" for local browsing with webrecorder/pywb |
| 11:49:29 | <Yakov> | klea: Pretty sure VoynichCr maintains the bot, they're not really active though |
| 11:49:45 | <klea> | Yeah, having the source code be available would make it possible for others to run it. |
| 11:51:35 | | croissant_ quits [Ping timeout: 268 seconds] |
| 11:53:46 | <qmastery> | Now I am capturing of huge website (~ 50GBs) with "--warc-max-size=4294967296" (4 GB) for it to be writable to a set of DVDs. But one thing bothers me: I got like 100 ERRORs because of internet connection problems. I know that URLs will be tried at the end. But it means that i.e. a certain page is inside the first WARC archive, and the material |
| 11:53:46 | <qmastery> | embedded into this webpage that it wanted to download and failed - inside the last WARC archive |
| 11:54:43 | <qmastery> | Is there any way to resolve this fragmentation? I.e. by somehow "uniting" the WARCs after the whole capture ends / reindexing / splitting again ? |
| 11:55:10 | | Nekroschizofrenetyk quits [Client Quit] |
| 11:56:54 | | Wohlstand quits [Remote host closed the connection] |
| 11:57:14 | | Nekroschizofrenetyk joins |
| 11:59:09 | <qmastery> | So far I found https://github.com/maturban/WARCMerge but it seems to be really old |
| 12:01:51 | | Shard quits [Quit: Im doing something rq. Il brb] |
| 12:03:52 | | etnguyen03 quits [Client Quit] |
| 12:04:06 | | Shard (Shard) joins |
| 12:04:23 | <qmastery> | many people merge their WARCs just by " cat *.warc.gz > ./combined/all.warc.gz ", however it does not resolve this fragmentation issue (some page in 1st warc but its needed file is in last warc because of capture-time ERROR and delayed retry). Please tell, do you have any ideas how this could be fixed? |
| 12:07:25 | | Nekroschizofrenetyk quits [Excess Flood] |
| 12:07:35 | | Shard quits [Client Quit] |
| 12:07:46 | | Nekroschizofrenetyk joins |
| 12:08:35 | | Shard (Shard) joins |
| 12:15:38 | | pabs quits [Ping timeout: 268 seconds] |
| 12:17:07 | | pabs (pabs) joins |
| 12:36:44 | <klea> | https://waf.moe/fediverse/post/9b8172f4-cebb-49e2-ac92-5ea5e9b427f7 |
| 12:36:58 | <klea> | > Hey wafflers! Me and the rest of the team has decided that due to the stress, internal issues, and other issues, the WAF.moe project is officially shutting down on June 2, 2026 at 12:00 AM (GMT+8). |
| 13:00:08 | | qmastery quits [Client Quit] |
| 13:08:09 | | qmastery9 joins |
| 13:08:32 | | qmastery1 joins |
| 13:08:38 | | Webuser557222 joins |
| 13:09:03 | | Webuser557222 quits [Client Quit] |
| 13:11:08 | | scotrod2 quits [Quit: The Lounge - https://thelounge.chat] |
| 13:13:37 | | klea quits [Remote host closed the connection] |
| 13:13:37 | | alexlehm quits [Remote host closed the connection] |
| 13:17:50 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 13:17:55 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 13:19:14 | | alexlehm (alexlehm) joins |
| 13:19:17 | | klea (jmjl) joins |
| 13:20:18 | | alexlehm quits [Remote host closed the connection] |
| 13:21:20 | | klea quits [Remote host closed the connection] |
| 13:21:23 | | alexlehm (alexlehm) joins |
| 13:21:26 | | klea (jmjl) joins |
| 13:27:59 | | myself quits [Read error: Connection reset by peer] |
| 13:28:11 | | myself joins |
| 13:31:00 | | TheEnbyperor joins |
| 13:31:21 | | TheEnbyperor_ (TheEnbyperor) joins |
| 13:53:24 | | nexussfan (nexussfan) joins |
| 13:59:58 | | qmastery9 quits [Client Quit] |
| 13:59:58 | | qmastery1 quits [Client Quit] |
| 14:07:22 | | myself quits [Read error: Connection reset by peer] |
| 14:07:34 | | myself joins |
| 14:24:56 | | scotrod2 joins |
| 14:31:50 | | Arcorann_ quits [Ping timeout: 268 seconds] |
| 14:35:45 | <cruller> | Why is #fedisperse invite-only? For historical reasons? |
| 14:38:21 | <justauser> | Perhaps something to do with the berries.space problems? |
| 14:39:34 | <justauser> | Looks like it was like that since forever: https://irclogs.archivete.am/efnet_archiveteam-ot/2019-06-24#l4e4f1ae1 |
| 14:39:48 | | multisn8 quits [Quit: WeeChat 4.8.1] |
| 14:39:54 | | multisn8 (multisn8) joins |
| 14:42:01 | <cruller> | I suspect that too. |
| 14:50:12 | <h2ibot> | Klea edited List of websites excluded from the Wayback Machine (+292, Add berries.space): https://wiki.archiveteam.org/?diff=61014&oldid=60933 |
| 14:51:12 | <h2ibot> | KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites): https://wiki.archiveteam.org/?diff=61015&oldid=61014 |
| 14:54:36 | | ATinySpaceMarine quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
| 14:55:46 | | etnguyen03 (etnguyen03) joins |
| 14:57:24 | | ATinySpaceMarine joins |
| 15:04:10 | | ATinySpaceMarine quits [Client Quit] |
| 15:04:39 | | ATinySpaceMarine joins |
| 15:36:40 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 15:37:12 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 15:37:49 | | TheEnbyperor joins |
| 15:39:29 | | TheEnbyperor_ (TheEnbyperor) joins |
| 15:49:55 | | DogsRNice joins |
| 15:59:20 | <h2ibot> | Calmevening edited List of websites excluded from the Wayback Machine (+32, add ads.wonder-wonder.com): https://wiki.archiveteam.org/?diff=61016&oldid=61015 |
| 16:03:53 | | myself quits [Read error: Connection reset by peer] |
| 16:04:06 | | myself joins |
| 16:22:23 | <h2ibot> | KleaBot edited List of websites excluded from the Wayback Machine (+0, Reordered websites and/or updated count.): https://wiki.archiveteam.org/?diff=61017&oldid=61016 |
| 16:37:25 | <h2ibot> | Nekroschizofrenetyk edited URLTeam (+254, /* Alive */ shorturl.fm): https://wiki.archiveteam.org/?diff=61018&oldid=60983 |
| 16:44:49 | | SootBector quits [Ping timeout: 260 seconds] |
| 16:45:58 | | qmastery2 joins |
| 16:47:00 | | Webuser641496 joins |
| 16:47:42 | | SootBector (SootBector) joins |
| 16:47:44 | | Webuser641496 quits [Client Quit] |
| 16:53:37 | | that_lurker quits [Remote host closed the connection] |
| 16:53:41 | | that_lurker (that_lurker) joins |
| 16:56:50 | | nukke quits [Ping timeout: 268 seconds] |
| 17:05:52 | | retrograde quits [Remote host closed the connection] |
| 17:06:15 | | retrograde (retrograde) joins |
| 17:11:28 | <h2ibot> | Brad edited Deathwatch (+509, Added Outlook Lite): https://wiki.archiveteam.org/?diff=61019&oldid=60990 |
| 17:30:30 | | nukke (nukke) joins |
| 17:31:53 | <gosc> | forgot to ask about this (or maybe I did already?) this page needs a japanese IP to work, how do you even go about saving it? https://edith.co.jp/lp/nagoya.kotsu-madoka/oshitabi/ |
| 17:33:34 | | etnguyen03 quits [Client Quit] |
| 17:39:41 | | Webuser051960 quits [Quit: Ooops, wrong browser tab.] |
| 17:42:08 | | cyanbox_ joins |
| 17:42:09 | | cyanbox quits [Read error: Connection reset by peer] |
| 17:54:43 | | rohvani quits [Ping timeout: 268 seconds] |
| 17:57:09 | | etnguyen03 (etnguyen03) joins |
| 18:00:51 | | Island joins |
| 18:03:58 | | ducky quits [Ping timeout: 268 seconds] |
| 18:04:35 | | SootBector quits [Remote host closed the connection] |
| 18:05:42 | | SootBector (SootBector) joins |
| 18:08:35 | <h2ibot> | Nekroschizofrenetyk uploaded File:Moy mir.png (My World@Mail.Ru logo): https://wiki.archiveteam.org/?title=File%3AMoy%20mir.png |
| 18:09:35 | <h2ibot> | Nekroschizofrenetyk uploaded File:Moymir 12 04 2026.png (My World@Mail.Ru screenshot 12.04.2026): https://wiki.archiveteam.org/?title=File%3AMoymir%2012%2004%202026.png |
| 18:10:13 | | nukke quits [Ping timeout: 268 seconds] |
| 18:10:42 | | etnguyen03 quits [Client Quit] |
| 18:18:58 | | ducky (ducky) joins |
| 18:36:18 | | nukke (nukke) joins |
| 18:39:13 | | gosc quits [Quit: Leaving] |
| 18:47:39 | <h2ibot> | Nekroschizofrenetyk created My World@Mail.Ru (+1285, Created page with "{{Infobox project | logo =…): https://wiki.archiveteam.org/?oldid=61022 |
| 19:05:01 | | nukke quits [Ping timeout: 268 seconds] |
| 19:10:49 | | nukke (nukke) joins |
| 19:11:32 | | Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.] |
| 19:18:39 | | rohvani joins |
| 19:19:43 | | Nekroschizofrenetyk joins |
| 19:20:31 | | nukke quits [Ping timeout: 268 seconds] |
| 19:21:55 | | Nekroschizofrenetyk quits [Client Quit] |
| 19:22:36 | | DogsRNice_ joins |
| 19:27:13 | | DogsRNice quits [Ping timeout: 268 seconds] |
| 19:27:46 | | qmastery2 quits [Client Quit] |
| 19:38:19 | | nexussfan quits [Ping timeout: 268 seconds] |
| 19:42:32 | | dabs joins |
| 19:43:28 | | dabs quits [Remote host closed the connection] |
| 19:43:41 | | dabs joins |
| 19:48:50 | | Webuser406909 joins |
| 19:50:02 | | Webuser406909 quits [Client Quit] |
| 19:50:44 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 19:51:16 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 19:53:10 | | TheEnbyperor joins |
| 19:54:50 | | nukke (nukke) joins |
| 19:57:53 | | TheEnbyperor_ (TheEnbyperor) joins |
| 20:00:31 | | nukke quits [Ping timeout: 268 seconds] |
| 20:01:37 | | nexussfan (nexussfan) joins |
| 20:08:00 | | bigfren joins |
| 20:08:05 | <bigfren> | hi there ^_^ |
| 20:10:15 | <bigfren> | i was running grab-site and had a power outage. Luckily, all the files are there (ls -lh wpull.db wpull.log *.warc.gz) and the wpull.db database is fine (verified by sqlite3 wpull.db "PRAGMA integrity_check;" ). Things seem good |
| 20:10:26 | | dabs quits [Read error: Connection reset by peer] |
| 20:11:04 | <bigfren> | Please tell, how do I resume the grab-site download ? when I re-run the grab-site command, it tries to create a new dir and start from scratch |
| 20:12:07 | <bigfren> | and if I try adding "grab-site --id "ID"" (where ID is a trailing ID of my existing directory), it simply tells "this dir already exists" :P |
| 20:12:51 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 20:12:56 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 20:13:13 | <klea> | https://github.com/ArchiveTeam/grab-site/issues/58 |
| 20:14:43 | <bigfren> | klea: you are genius, thank you so much ^_^ |
| 20:16:12 | | TheEnbyperor joins |
| 20:16:58 | <bigfren> | this pull request seems small enough, wonder why its not merged yet https://github.com/ArchiveTeam/grab-site/pull/247/ |
| 20:20:57 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 20:21:56 | <klea> | (a) It begins with "WIP:" and is marked as draft, (b) The pull request's initial message says "Needs more testing and I think it may have issues with sites that require cookies", (c) I believe the test harness wasn't fully finished and no system to run CI on changes. |
| 20:24:12 | | bigfren quits [Client Quit] |
| 20:24:47 | | TheEnbyperor joins |
| 20:25:53 | | nukke (nukke) joins |
| 20:26:32 | | TheEnbyperor_ (TheEnbyperor) joins |
| 20:28:15 | <pokechu22> | Right, wpull currently stores cookies in memory, so those will be lost. I know someone was working on storing cookies in the DB instead (more for performance reasons, but it would help with resuming), but I don't knwo what the status is |
| 20:33:10 | <klea> | https://odysee.com/copilot-is-for-entertainment-purposes:1c7b0382db599a22fb16d50f92f1cbbe3e4dc370 claims Anthropic's TOS is different in some countries, (US, Australia one, UK/EU another) |
| 20:33:28 | <klea> | https://www.anthropic.com/legal/consumer-terms |
| 20:35:28 | <klea> | https://www.difchecker.com/BtqVrR9p/ |
| 20:46:46 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 20:46:51 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 20:48:44 | | kansei quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 20:49:57 | | TheEnbyperor joins |
| 20:50:02 | | TheEnbyperor_ (TheEnbyperor) joins |
| 20:52:30 | | etnguyen03 (etnguyen03) joins |
| 20:55:14 | | andrewnyr quits [Quit: The Lounge - https://thelounge.chat] |
| 20:56:29 | | bigfren joins |
| 20:57:29 | <bigfren> | Thank you. I rebased this "resume" patch series on top of master and it builds OK, will backup my dump and test really soon |
| 20:57:54 | | andrewnyr joins |
| 20:58:17 | <bigfren> | the author if this patch seems to have a lot of Github activity, I hope this patch is solid and he just didn't have time to get it merged |
| 20:58:51 | <bigfren> | I haven't used any custom cookies during my dump so won't be affected by their loss |
| 20:59:45 | | klea points out she shared a few possible reasons for it to not have been merged. |
| 21:00:05 | | Shard quits [Quit: Im doing something rq. Il brb] |
| 21:02:21 | | Shard (Shard) joins |
| 21:08:24 | | n9nes quits [Remote host closed the connection] |
| 21:09:52 | | n9nes joins |
| 21:32:04 | | retrograde quits [Remote host closed the connection] |
| 21:32:45 | | retrograde (retrograde) joins |
| 21:50:15 | | hamouda joins |
| 22:10:37 | <bigfren> | klea: thank you for your precautions. I have looked through it and also ran VS qwen and deepseek ai's. So far the current problems of "resume" patch (aside of the mentioned "cookies" problem: 1) it does not remove the files at ./site-directory/temp/ before starting ; 2) it does not honor the custom --warc-file names ; 3) in a multi-parted situation |
| 22:10:37 | <bigfren> | like --warc-max-size=4294967296 ( 4 GB , I split it this way in order for my WARC parts to be writable to DVDs) - instead of appending to the last WARC (1.5GB in my case), it created a new part, which is inconvenient . 4) not sure what happens if the user changes the other-than "--resume" / "--dir" command line parameters (i.e. what if we |
| 22:10:37 | <bigfren> | erroneously supply a different URL ? ) 5) no cleanup of stop file after graceful stop |
| 22:12:14 | <bigfren> | However, based on what I see by wpull.db contents (old "todo" vs new "todo", etc) , it seems to successfully resume my download, which is a real savior in my situation (like 9 GB was dumped already and I did not want to start from scratch) |
| 22:15:36 | <bigfren> | Also luckily I don't see any duplicates at wpull.db --> done table , so far... (verified by commands like "gs-dump-urls ./path_to/wpull.db done > ~/done-new.txt", "sort ./done-new.txt > ./done-new-srt.txt", "uniq -d") |
| 22:19:13 | <bigfren> | If there is anything else you would like me to check, I am all in attention ;-) |
| 22:21:24 | | michaelblob764 quits [Quit: yoop] |
| 22:24:23 | | michaelblob764 joins |
| 22:32:10 | | Webuser510872 joins |
| 22:32:44 | | Webuser510872 quits [Client Quit] |