| 00:34:34 | | pabs quits [Ping timeout: 268 seconds] |
| 00:37:52 | | pabs (pabs) joins |
| 00:40:20 | <h2ibot> | PaulWise edited Archive.today (+578, update searches section): https://wiki.archiveteam.org/?diff=60991&oldid=60981 |
| 00:41:20 | <h2ibot> | PaulWise edited Archive.today (+1, typo): https://wiki.archiveteam.org/?diff=60992&oldid=60991 |
| 00:50:22 | | etnguyen03 (etnguyen03) joins |
| 01:02:23 | | etnguyen03 quits [Client Quit] |
| 01:37:22 | | polypept1 (polypeptide) joins |
| 01:41:34 | | polypeptide quits [Ping timeout: 260 seconds] |
| 01:45:41 | | etnguyen03 (etnguyen03) joins |
| 01:59:58 | | ericgallager quits [Remote host closed the connection] |
| 02:27:31 | | ericgallager joins |
| 02:47:04 | | iseaup quits [Ping timeout: 268 seconds] |
| 03:26:51 | | etnguyen03 quits [Remote host closed the connection] |
| 03:32:37 | | iseaup (iseaup) joins |
| 04:04:46 | | n9nes quits [Ping timeout: 268 seconds] |
| 04:06:12 | | n9nes joins |
| 04:20:22 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:26:29 | | Nekroschizofrenetyk joins |
| 04:32:31 | | Webuser982607 joins |
| 04:32:37 | | Webuser982607 quits [Client Quit] |
| 04:36:22 | | Island quits [Read error: Connection reset by peer] |
| 04:56:25 | | nexussfan quits [Quit: Konversation terminated!] |
| 05:17:35 | | Nekroschizofrenetyk quits [Client Quit] |
| 05:39:01 | | sg72 joins |
| 05:40:26 | | sg-72 quits [Ping timeout: 268 seconds] |
| 05:50:12 | | Nekroschizofrenetyk joins |
| 05:52:35 | | Webuser987124 joins |
| 05:59:33 | | iseaup quits [Ping timeout: 268 seconds] |
| 06:15:00 | | iseaup (iseaup) joins |
| 06:25:43 | | TastyWiener950 (TastyWiener95) joins |
| 06:26:08 | | TastyWiener95 quits [Read error: Connection reset by peer] |
| 06:26:09 | | TastyWiener950 is now known as TastyWiener95 |
| 06:26:56 | | TastyWiener959 (TastyWiener95) joins |
| 06:31:00 | | TastyWiener95 quits [Ping timeout: 268 seconds] |
| 06:31:01 | | TastyWiener959 is now known as TastyWiener95 |
| 06:32:41 | | TastyWiener954 (TastyWiener95) joins |
| 06:33:16 | | TastyWiener95 quits [Read error: Connection reset by peer] |
| 06:33:53 | | TastyWiener95 (TastyWiener95) joins |
| 06:36:34 | | TastyWiener954 quits [Read error: Connection reset by peer] |
| 06:37:12 | | TastyWiener955 (TastyWiener95) joins |
| 06:37:40 | | TastyWiener95 quits [Read error: Connection reset by peer] |
| 06:38:24 | | TastyWiener95 (TastyWiener95) joins |
| 06:42:06 | | TastyWiener955 quits [Ping timeout: 268 seconds] |
| 06:45:10 | | TastyWiener959 (TastyWiener95) joins |
| 06:47:17 | | TastyWiener95 quits [Read error: Connection reset by peer] |
| 06:47:57 | | TastyWiener95 (TastyWiener95) joins |
| 06:51:53 | | TastyWiener959 quits [Ping timeout: 268 seconds] |
| 06:52:29 | | TastyWiener95 quits [Read error: Connection reset by peer] |
| 06:52:45 | | TastyWiener95 (TastyWiener95) joins |
| 07:22:30 | <h2ibot> | PaulWise edited Obstacles (+30, Sucuri): https://wiki.archiveteam.org/?diff=60993&oldid=60978 |
| 07:25:46 | <pabs> | TIL archive.today uses TLS fingerprinting, Firefox copy as curl here for https://archive.today/www.nytimes.com only results in a conn hang |
| 07:33:34 | | sepro2 (sepro) joins |
| 07:36:17 | | sepro quits [Ping timeout: 268 seconds] |
| 07:36:17 | | sepro2 is now known as sepro |
| 07:45:37 | | Arcorann_ quits [Ping timeout: 268 seconds] |
| 07:50:23 | | iseaup quits [Client Quit] |
| 07:52:55 | | APOLLO03 joins |
| 07:55:29 | | APOLLO03a quits [Ping timeout: 268 seconds] |
| 08:00:09 | | Arcorann_ (Arcorann) joins |
| 08:10:41 | | iseaup (iseaup) joins |
| 08:12:36 | <h2ibot> | PaulWise edited Archive.today (+119, add screenshot of archive.st capture of the…): https://wiki.archiveteam.org/?diff=60994&oldid=60992 |
| 08:17:49 | <pabs> | TIL ArchiveBot can capture archive.st screenshots :) |
| 08:20:11 | | Nekroschizofrenetyk quits [Client Quit] |
| 08:28:31 | | Nekroschizofrenetyk joins |
| 08:32:26 | <pabs> | might be time to brute-force archive.st short IDs to find all their long URLs and screenshots. actual captures are broken though |
| 08:51:31 | | Dango360 quits [Ping timeout: 268 seconds] |
| 09:00:27 | | dendory quits [Quit: The Lounge - https://thelounge.chat] |
| 09:01:02 | | dendory (dendory) joins |
| 09:03:51 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 09:03:56 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 09:04:03 | | TheEnbyperor (TheEnbyperor) joins |
| 09:06:03 | | TheEnbyperor_ joins |
| 09:20:30 | | APOLLO03 quits [Ping timeout: 268 seconds] |
| 09:28:09 | | APOLLO03 joins |
| 09:30:22 | | TunaLobster44 quits [Ping timeout: 268 seconds] |
| 09:45:47 | <h2ibot> | Manu edited Mailman/2 (+4, Queued lists.si6networks.com): https://wiki.archiveteam.org/?diff=60995&oldid=60968 |
| 09:48:09 | | michaelblob764 joins |
| 09:48:15 | | michaelblob76 quits [Ping timeout: 268 seconds] |
| 09:48:15 | | @hook54321 quits [Ping timeout: 633 seconds] |
| 09:48:15 | | michaelblob764 is now known as michaelblob76 |
| 09:50:02 | | fuzzy80211 quits [Killed (NickServ (GHOST command used by fuzzy8021!~fuzzy8021@173-224-25-67.ptcnet.net))] |
| 09:50:09 | | fuzzy80211 joins |
| 09:50:39 | | hook54321 (hook54321) joins |
| 09:50:39 | | @ChanServ sets mode: +o hook54321 |
| 09:56:23 | <Nekroschizofrenetyk> | I want to quickly get all urls from a page. What would be the best way to go about it? curl and grep it somehow into a txt file? |
| 10:03:03 | <pabs> | chrefsu () { curl -s "$@" | pup 'a attr{href}' | sort -u } and pup = https://github.com/ericchiang/pup |
| 10:05:14 | <Nekroschizofrenetyk> | oh yeah, it has a windows version |
| 10:06:17 | <pabs> | ah. the function is Linux shell, but probably you can do part of it on Windows at least |
| 10:06:48 | <pabs> | I think there are other tools to query HTML but pup is what I use |
| 10:07:32 | | nathang21843 joins |
| 10:07:47 | <Nekroschizofrenetyk> | I'm trying to figure it out, the executable gives me a blank black window, non-reactive to typing |
| 10:07:59 | | nathang2184 quits [Ping timeout: 268 seconds] |
| 10:07:59 | | nathang21843 is now known as nathang2184 |
| 10:08:52 | <pabs> | its command-line not GUI |
| 10:10:32 | | BearFortress_ quits [] |
| 10:11:18 | <Nekroschizofrenetyk> | yeah.... can't fire it though, even after having added to PATH, neither git bash, nor pwsh |
| 10:11:32 | <Nekroschizofrenetyk> | guess I'd need to start migrating to Linux |
| 10:11:36 | <Nekroschizofrenetyk> | (*restart) |
| 10:12:38 | <pabs> | hmm |
| 10:12:44 | <pabs> | WSL maybe? |
| 10:13:27 | <ericgallager> | Cygwin? |
| 10:14:42 | <ericgallager> | MinGW? |
| 10:14:43 | <Nekroschizofrenetyk> | Oh, I have WSL (to run warrior projects in docker) and I have even installed Cygwin today (to install Weechat, so that I can finally register my IRC name, though I failed at that Weechat thing) |
| 10:14:52 | <Nekroschizofrenetyk> | if I install it via Cygwin, I would be able to use it? |
| 10:15:13 | <Nekroschizofrenetyk> | let me see... |
| 10:17:46 | | klea wonders if JAA could add chrefsu into little-things, and maybe make a little pup version with just bash, but supposes that'd take lots of time. |
| 10:21:25 | <Nekroschizofrenetyk> | nevermind, I just added the directory path to somewhere else, not directly to the PATH dir :D |
| 10:21:35 | <Nekroschizofrenetyk> | fixed that and now it seems to work |
| 10:25:52 | <h2ibot> | Manu edited Mailman/2 (-28, Queued lists.suse.com): https://wiki.archiveteam.org/?diff=60996&oldid=60995 |
| 10:31:59 | | BearFortress joins |
| 10:34:58 | | Nekroschizofrenetyk quits [Client Quit] |
| 10:40:17 | | bilboed084 joins |
| 10:42:36 | | bilboed08 quits [Ping timeout: 268 seconds] |
| 10:42:37 | | bilboed084 is now known as bilboed08 |
| 10:42:54 | <h2ibot> | Manu edited Mailman/2 (-35, Queued mailman.powerdns.com): https://wiki.archiveteam.org/?diff=60997&oldid=60996 |
| 10:43:27 | | Nekroschizofrenetyk joins |
| 11:00:03 | | Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat] |
| 11:02:48 | | Bleo1826007227196234552220110 joins |
| 11:05:36 | | Webuser138753 joins |
| 11:05:58 | | Webuser138753 quits [Client Quit] |
| 11:20:07 | <Nekroschizofrenetyk> | pup is working, great ! |
| 11:21:28 | <@arkiver> | do we know if any videos on amara.org are public? they show me a login screen |
| 11:21:42 | | Webuser420057 joins |
| 11:22:29 | <@arkiver> | c3manu: are we doing well on historicplaces.ca ? i see you looked into it before on archivebot |
| 11:23:15 | <klea> | https://irclogs.archivete.am/archiveteam-bs/2026-04-05#l6814c45e |
| 11:23:51 | | Webuser420057 quits [Client Quit] |
| 11:28:10 | <c3manu> | arkiver: i ran the job and it looked fine to me. but i haven't explicitly checked all the places or whether there are more than it found |
| 11:29:49 | <@arkiver> | c3manu: alright then i will not look too closely into it |
| 11:46:03 | <@arkiver> | looking into dlive, xeenon, trovo |
| 11:47:41 | <Nekroschizofrenetyk> | can AB-job sites be added to archiveteam wiki? |
| 11:47:45 | <Nekroschizofrenetyk> | like kresy24.pl |
| 11:48:10 | <Nekroschizofrenetyk> | (I mean - can I add) |
| 11:49:44 | <@arkiver> | Nekroschizofrenetyk: sure, if you want to write an article about it |
| 11:50:04 | <pabs> | parsing HTML with bash seems like a bad idea :) |
| 11:50:09 | <Nekroschizofrenetyk> | thanks! |
| 11:51:04 | <klea> | `grep -E 'href="[^"]*"'` maybe? |
| 11:51:48 | <Nekroschizofrenetyk> | do you have any tools which make editing wiki easier? |
| 11:52:14 | <Nekroschizofrenetyk> | or is it just what you have to practice to use comfortably? |
| 11:52:49 | <klea> | https://www.mediawiki.org/ has lots of pages for different things you can do with MediaWiki. |
| 11:54:40 | <klea> | https://megalodon.jp/ - Japanese Web Archive. |
| 11:58:24 | <Nekroschizofrenetyk> | klea Hmmm... What I mean exactly, is, when you edit/create a page on Wiki (AT Wiki specifically), it's quite difficult to read. With programming-language tools, the syntax is highlighted, with different colours etc, which makes it much easier to follow |
| 11:58:51 | <klea> | You can click preview to get whatever you typed in rendered. |
| 11:59:06 | <Nekroschizofrenetyk> | (I guess, I should somehow turn off grammar-correction red-wavy line underlining) |
| 11:59:08 | <Nekroschizofrenetyk> | yeah |
| 11:59:08 | <klea> | On newer MW installs, there's also a Visual Editor. |
| 11:59:35 | <Nekroschizofrenetyk> | yup, I've seen this one on Wikipedia |
| 12:02:04 | | pabs quits [Ping timeout: 268 seconds] |
| 12:03:23 | <Nekroschizofrenetyk> | well, no pain, no gain ;) |
| 12:03:41 | | Nekroschizofrenetyk quits [Client Quit] |
| 12:06:48 | | Nekroschizofrenetyk joins |
| 12:07:44 | | fuzzy80211 quits [Remote host closed the connection] |
| 12:08:05 | | fuzzy80211 joins |
| 12:08:39 | | pabs (pabs) joins |
| 12:24:07 | <h2ibot> | Nekroschizofrenetyk uploaded File:Logo kresy24 2020 k2.webp (Kresy24.pl logo): https://wiki.archiveteam.org/?title=File%3ALogo%20kresy24%202020%20k2.webp |
| 12:24:08 | <h2ibot> | Nekroschizofrenetyk uploaded File:Kresy24pl 11 04 2026.png (Kresy24.pl main page screenshot 11.04.2026): https://wiki.archiveteam.org/?title=File%3AKresy24pl%2011%2004%202026.png |
| 12:24:16 | <cruller> | If my understanding is correct, Arquivo.pt's ArchivePageNow captures content through the user's own browsing. |
| 12:24:23 | <cruller> | This is somewhat questionable behavior for a public archiving service, but it can be useful for complex pages. |
| 12:26:08 | <h2ibot> | Nekroschizofrenetyk created Kresy24.pl (+1397, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?oldid=61000 |
| 12:27:02 | <Nekroschizofrenetyk> | Published. Hope, it's okay |
| 12:31:40 | | etnguyen03 (etnguyen03) joins |
| 12:37:12 | <cruller> | Apart from that, ArchivePageNow likely inherits the issues with pywb's warc generation. |
| 12:47:16 | | DigitalDragons quits [Quit: Ping timeout (120 seconds)] |
| 12:47:29 | | DigitalDragons (DigitalDragons) joins |
| 12:47:48 | <Nekroschizofrenetyk> | Max.ru - the new, Kremlin-approved communication app. Has anybody taken a look at it? I'm wondering, if that would be difficult/much different from #telegrab to run. There was a job in #archivebot, don't know how succesful, though |
| 12:48:19 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 12:48:24 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 12:48:29 | <Nekroschizofrenetyk> | needs some qr scanning to view in a browser |
| 12:48:41 | <Nekroschizofrenetyk> | so, possibly not doable |
| 12:50:34 | | icedice (icedice) joins |
| 12:50:36 | | icedice quits [Remote host closed the connection] |
| 12:55:33 | | TheEnbyperor joins |
| 12:57:01 | | TheEnbyperor_ (TheEnbyperor) joins |
| 12:57:59 | <cruller> | https://max.ru/prav_prazdnik It's terrible that clicking "Открыть в браузере" only displays a QR code :( |
| 13:03:44 | <Nekroschizofrenetyk> | yeah... |
| 13:04:34 | | etnguyen03 quits [Client Quit] |
| 13:06:09 | <Nekroschizofrenetyk> | search engine links redirect to mainpage https://max.ru/?ysclid=mdpppvy6vo262175045 |
| 13:06:40 | <Nekroschizofrenetyk> | maaaybe it could work with a ru pipeline? but doubt it |
| 13:06:56 | | fuzzy80211 quits [Remote host closed the connection] |
| 13:13:04 | | iseaup quits [Ping timeout: 268 seconds] |
| 13:14:09 | | Dango360 (Dango360) joins |
| 13:14:11 | | fuzzy80211 joins |
| 13:14:12 | | fuzzy80211 quits [Remote host closed the connection] |
| 13:15:55 | | fuzzy80211 joins |
| 13:17:42 | | Wohlstand (Wohlstand) joins |
| 13:23:24 | | fuzzy80211 quits [Remote host closed the connection] |
| 13:40:10 | | Dango360 quits [Client Quit] |
| 13:41:54 | | Nekroschizofrenetyk quits [Client Quit] |
| 13:48:35 | | Nekroschizofrenetyk joins |
| 13:48:51 | | Dango360 (Dango360) joins |
| 13:53:59 | | Nekroschizofrenetyk quits [Client Quit] |
| 13:54:10 | | fuzzy80211 (fuzzy80211) joins |
| 13:55:28 | | Nekroschizofrenetyk joins |
| 14:09:24 | | polypept1 quits [Ping timeout: 260 seconds] |
| 14:09:50 | | polypeptide (polypeptide) joins |
| 14:11:18 | | Arcorann_ quits [Remote host closed the connection] |
| 14:11:54 | | Arcorann_ (Arcorann) joins |
| 14:15:27 | | simon816 quits [Remote host closed the connection] |
| 14:20:42 | | simon816 (simon816) joins |
| 14:24:03 | | nexussfan (nexussfan) joins |
| 14:44:57 | | polypeptide quits [Remote host closed the connection] |
| 14:44:57 | | Arcorann_ quits [Ping timeout: 268 seconds] |
| 14:45:12 | | polypeptide (polypeptide) joins |
| 15:02:20 | <c3manu> | anyone doing anything about https://dlive.tv/ ? |
| 15:02:21 | <c3manu> | https://community.dlive.tv/important-announcement-dlive-platform-closure/ |
| 15:02:45 | <c3manu> | it's got a lot of livestreams (and possibly vods?) and i've no idea what's the best way to deal with that |
| 15:09:10 | | polypept1 (polypeptide) joins |
| 15:09:28 | | polypeptide quits [Remote host closed the connection] |
| 15:19:34 | | etnguyen03 (etnguyen03) joins |
| 15:32:04 | <Cupping1285> | c3manu, did a quick search and downloading vods looks quite easy. https://dlive.tv/p/dlive-dfmqedypii+aAbWQBtDR turns into https://playback.prd.dlivecdn.com/live/dlive-dfmqedypii/1775500109/src/playback.m3u8, I just don't know how they are converting the aAbWQBtDR into the number 1775500109. After that it looks quite easy to download vods, just |
| 15:32:04 | <Cupping1285> | split by the + and convert the string number into the actual number. |
| 15:37:26 | <c3manu> | that wasn't much of a technical question. it was more about what can reasonably be saved, what's valuable, how much space does take, does it require a dpos project etc |
| 15:38:19 | <c3manu> | that's also my way of saying i'm not gonna do the work if anyone wants that video saved. curerntly running jobs for some subdomains (with blog posts, help, etc.), but only ran an !ao on the main site |
| 15:42:31 | | etnguyen03 quits [Client Quit] |
| 15:44:43 | <h2ibot> | Manu edited Discourse/active (+47, Add www.elektronauts.com): https://wiki.archiveteam.org/?diff=61001&oldid=60965 |
| 15:51:15 | | etnguyen03 (etnguyen03) joins |
| 15:59:29 | | fmeppo quits [Ping timeout: 268 seconds] |
| 16:01:45 | | fmeppo (fmeppo) joins |
| 16:01:51 | | Nekroschizofrenetyk quits [Client Quit] |
| 16:03:44 | | Nekroschizofrenetyk joins |
| 16:08:40 | | Cuphead2527480 (Cuphead2527480) joins |
| 16:15:29 | | ericgallager quits [Quit: This computer has gone to sleep] |
| 16:22:03 | | ericgallager joins |
| 16:30:38 | <justauser> | arkiver: amara.org videos were previously public, but they are dead now. |
| 16:34:43 | | emphie quits [Ping timeout: 268 seconds] |
| 16:43:16 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 16:43:21 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 16:46:49 | | Island joins |
| 16:48:57 | | TheEnbyperor joins |
| 16:52:54 | | TheEnbyperor_ (TheEnbyperor) joins |
| 16:54:08 | | ericgallager quits [Client Quit] |
| 16:57:01 | | ericgallager joins |
| 16:57:23 | | ericgallager quits [Client Quit] |
| 17:06:23 | | retrograde quits [Remote host closed the connection] |
| 17:06:48 | | retrograde (retrograde) joins |
| 17:17:23 | <cruller> | RE: dlive, my preferred approach is to rank all items and save as many as possible from the top down. However, to do this, you need to retrieve the metadata for all items in advance. |
| 17:18:00 | <cruller> | Generally speaking, a more depth-first and sequential approach is likely to be preferred. |
| 17:20:39 | <justauser> | Cupping1285: 1775500109 is almost certainly a timestamp, either of a video or of URL generation. |
| 17:22:29 | | etnguyen03 quits [Client Quit] |
| 17:49:52 | | etnguyen03 (etnguyen03) joins |
| 18:05:55 | | etnguyen03 quits [Client Quit] |
| 18:09:36 | | TheEnbyperor_ quits [Ping timeout: 268 seconds] |
| 18:10:18 | | TheEnbyperor quits [Ping timeout: 268 seconds] |
| 18:18:25 | | Cuphead2527480 quits [Client Quit] |
| 18:20:54 | | TheEnbyperor joins |
| 18:21:55 | | etnguyen03 (etnguyen03) joins |
| 18:25:27 | | ericgallager joins |
| 18:29:10 | | AK (AK) joins |
| 18:31:39 | | TheEnbyperor_ (TheEnbyperor) joins |
| 19:12:34 | | etnguyen03 quits [Client Quit] |
| 19:14:16 | | Webuser737349 joins |
| 19:14:32 | | Webuser737349 quits [Client Quit] |
| 19:39:38 | | Stargazers quits [Remote host closed the connection] |
| 19:42:08 | | mikael quits [Quit: ZNC - http://znc.in] |
| 19:57:27 | | etnguyen03 (etnguyen03) joins |
| 20:01:45 | | DogsRNice joins |
| 20:05:50 | | SootBector quits [Remote host closed the connection] |
| 20:06:58 | | SootBector (SootBector) joins |
| 20:08:33 | | mikael joins |
| 20:11:23 | | Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.] |
| 20:12:04 | | Nekroschizofrenetyk joins |