00:34:34pabs quits [Ping timeout: 268 seconds]
00:37:52pabs (pabs) joins
00:40:20<h2ibot>PaulWise edited Archive.today (+578, update searches section): https://wiki.archiveteam.org/?diff=60991&oldid=60981
00:41:20<h2ibot>PaulWise edited Archive.today (+1, typo): https://wiki.archiveteam.org/?diff=60992&oldid=60991
00:50:22etnguyen03 (etnguyen03) joins
01:02:23etnguyen03 quits [Client Quit]
01:37:22polypept1 (polypeptide) joins
01:41:34polypeptide quits [Ping timeout: 260 seconds]
01:45:41etnguyen03 (etnguyen03) joins
01:59:58ericgallager quits [Remote host closed the connection]
02:27:31ericgallager joins
02:47:04iseaup quits [Ping timeout: 268 seconds]
03:26:51etnguyen03 quits [Remote host closed the connection]
03:32:37iseaup (iseaup) joins
04:04:46n9nes quits [Ping timeout: 268 seconds]
04:06:12n9nes joins
04:20:22DogsRNice quits [Read error: Connection reset by peer]
04:26:29Nekroschizofrenetyk joins
04:32:31Webuser982607 joins
04:32:37Webuser982607 quits [Client Quit]
04:36:22Island quits [Read error: Connection reset by peer]
04:56:25nexussfan quits [Quit: Konversation terminated!]
05:17:35Nekroschizofrenetyk quits [Client Quit]
05:39:01sg72 joins
05:40:26sg-72 quits [Ping timeout: 268 seconds]
05:50:12Nekroschizofrenetyk joins
05:52:35Webuser987124 joins
05:59:33iseaup quits [Ping timeout: 268 seconds]
06:15:00iseaup (iseaup) joins
06:25:43TastyWiener950 (TastyWiener95) joins
06:26:08TastyWiener95 quits [Read error: Connection reset by peer]
06:26:09TastyWiener950 is now known as TastyWiener95
06:26:56TastyWiener959 (TastyWiener95) joins
06:31:00TastyWiener95 quits [Ping timeout: 268 seconds]
06:31:01TastyWiener959 is now known as TastyWiener95
06:32:41TastyWiener954 (TastyWiener95) joins
06:33:16TastyWiener95 quits [Read error: Connection reset by peer]
06:33:53TastyWiener95 (TastyWiener95) joins
06:36:34TastyWiener954 quits [Read error: Connection reset by peer]
06:37:12TastyWiener955 (TastyWiener95) joins
06:37:40TastyWiener95 quits [Read error: Connection reset by peer]
06:38:24TastyWiener95 (TastyWiener95) joins
06:42:06TastyWiener955 quits [Ping timeout: 268 seconds]
06:45:10TastyWiener959 (TastyWiener95) joins
06:47:17TastyWiener95 quits [Read error: Connection reset by peer]
06:47:57TastyWiener95 (TastyWiener95) joins
06:51:53TastyWiener959 quits [Ping timeout: 268 seconds]
06:52:29TastyWiener95 quits [Read error: Connection reset by peer]
06:52:45TastyWiener95 (TastyWiener95) joins
07:22:30<h2ibot>PaulWise edited Obstacles (+30, Sucuri): https://wiki.archiveteam.org/?diff=60993&oldid=60978
07:25:46<pabs>TIL archive.today uses TLS fingerprinting, Firefox copy as curl here for https://archive.today/www.nytimes.com only results in a conn hang
07:33:34sepro2 (sepro) joins
07:36:17sepro quits [Ping timeout: 268 seconds]
07:36:17sepro2 is now known as sepro
07:45:37Arcorann_ quits [Ping timeout: 268 seconds]
07:50:23iseaup quits [Client Quit]
07:52:55APOLLO03 joins
07:55:29APOLLO03a quits [Ping timeout: 268 seconds]
08:00:09Arcorann_ (Arcorann) joins
08:10:41iseaup (iseaup) joins
08:12:36<h2ibot>PaulWise edited Archive.today (+119, add screenshot of archive.st capture of the…): https://wiki.archiveteam.org/?diff=60994&oldid=60992
08:17:49<pabs>TIL ArchiveBot can capture archive.st screenshots :)
08:20:11Nekroschizofrenetyk quits [Client Quit]
08:28:31Nekroschizofrenetyk joins
08:32:26<pabs>might be time to brute-force archive.st short IDs to find all their long URLs and screenshots. actual captures are broken though
08:51:31Dango360 quits [Ping timeout: 268 seconds]
09:00:27dendory quits [Quit: The Lounge - https://thelounge.chat]
09:01:02dendory (dendory) joins
09:03:51TheEnbyperor quits [Ping timeout: 268 seconds]
09:03:56TheEnbyperor_ quits [Ping timeout: 268 seconds]
09:04:03TheEnbyperor (TheEnbyperor) joins
09:06:03TheEnbyperor_ joins
09:20:30APOLLO03 quits [Ping timeout: 268 seconds]
09:28:09APOLLO03 joins
09:30:22TunaLobster44 quits [Ping timeout: 268 seconds]
09:45:47<h2ibot>Manu edited Mailman/2 (+4, Queued lists.si6networks.com): https://wiki.archiveteam.org/?diff=60995&oldid=60968
09:48:09michaelblob764 joins
09:48:15michaelblob76 quits [Ping timeout: 268 seconds]
09:48:15@hook54321 quits [Ping timeout: 633 seconds]
09:48:15michaelblob764 is now known as michaelblob76
09:50:02fuzzy80211 quits [Killed (NickServ (GHOST command used by fuzzy8021!~fuzzy8021@173-224-25-67.ptcnet.net))]
09:50:09fuzzy80211 joins
09:50:39hook54321 (hook54321) joins
09:50:39@ChanServ sets mode: +o hook54321
09:56:23<Nekroschizofrenetyk>I want to quickly get all urls from a page. What would be the best way to go about it? curl and grep it somehow into a txt file?
10:03:03<pabs>chrefsu () { curl -s "$@" | pup 'a attr{href}' | sort -u } and pup = https://github.com/ericchiang/pup
10:05:14<Nekroschizofrenetyk>oh yeah, it has a windows version
10:06:17<pabs>ah. the function is Linux shell, but probably you can do part of it on Windows at least
10:06:48<pabs>I think there are other tools to query HTML but pup is what I use
10:07:32nathang21843 joins
10:07:47<Nekroschizofrenetyk>I'm trying to figure it out, the executable gives me a blank black window, non-reactive to typing
10:07:59nathang2184 quits [Ping timeout: 268 seconds]
10:07:59nathang21843 is now known as nathang2184
10:08:52<pabs>its command-line not GUI
10:10:32BearFortress_ quits []
10:11:18<Nekroschizofrenetyk>yeah.... can't fire it though, even after having added to PATH, neither git bash, nor pwsh
10:11:32<Nekroschizofrenetyk>guess I'd need to start migrating to Linux
10:11:36<Nekroschizofrenetyk>(*restart)
10:12:38<pabs>hmm
10:12:44<pabs>WSL maybe?
10:13:27<ericgallager>Cygwin?
10:14:42<ericgallager>MinGW?
10:14:43<Nekroschizofrenetyk>Oh, I have WSL (to run warrior projects in docker) and I have even installed Cygwin today (to install Weechat, so that I can finally register my IRC name, though I failed at that Weechat thing)
10:14:52<Nekroschizofrenetyk>if I install it via Cygwin, I would be able to use it?
10:15:13<Nekroschizofrenetyk>let me see...
10:17:46klea wonders if JAA could add chrefsu into little-things, and maybe make a little pup version with just bash, but supposes that'd take lots of time.
10:21:25<Nekroschizofrenetyk>nevermind, I just added the directory path to somewhere else, not directly to the PATH dir :D
10:21:35<Nekroschizofrenetyk>fixed that and now it seems to work
10:25:52<h2ibot>Manu edited Mailman/2 (-28, Queued lists.suse.com): https://wiki.archiveteam.org/?diff=60996&oldid=60995
10:31:59BearFortress joins
10:34:58Nekroschizofrenetyk quits [Client Quit]
10:40:17bilboed084 joins
10:42:36bilboed08 quits [Ping timeout: 268 seconds]
10:42:37bilboed084 is now known as bilboed08
10:42:54<h2ibot>Manu edited Mailman/2 (-35, Queued mailman.powerdns.com): https://wiki.archiveteam.org/?diff=60997&oldid=60996
10:43:27Nekroschizofrenetyk joins
11:00:03Bleo1826007227196234552220110 quits [Quit: The Lounge - https://thelounge.chat]
11:02:48Bleo1826007227196234552220110 joins
11:05:36Webuser138753 joins
11:05:58Webuser138753 quits [Client Quit]
11:20:07<Nekroschizofrenetyk>pup is working, great !
11:21:28<@arkiver>do we know if any videos on amara.org are public? they show me a login screen
11:21:42Webuser420057 joins
11:22:29<@arkiver>c3manu: are we doing well on historicplaces.ca ? i see you looked into it before on archivebot
11:23:15<klea>https://irclogs.archivete.am/archiveteam-bs/2026-04-05#l6814c45e
11:23:51Webuser420057 quits [Client Quit]
11:28:10<c3manu>arkiver: i ran the job and it looked fine to me. but i haven't explicitly checked all the places or whether there are more than it found
11:29:49<@arkiver>c3manu: alright then i will not look too closely into it
11:46:03<@arkiver>looking into dlive, xeenon, trovo
11:47:41<Nekroschizofrenetyk>can AB-job sites be added to archiveteam wiki?
11:47:45<Nekroschizofrenetyk>like kresy24.pl
11:48:10<Nekroschizofrenetyk>(I mean - can I add)
11:49:44<@arkiver>Nekroschizofrenetyk: sure, if you want to write an article about it
11:50:04<pabs>parsing HTML with bash seems like a bad idea :)
11:50:09<Nekroschizofrenetyk>thanks!
11:51:04<klea>`grep -E 'href="[^"]*"'` maybe?
11:51:48<Nekroschizofrenetyk>do you have any tools which make editing wiki easier?
11:52:14<Nekroschizofrenetyk>or is it just what you have to practice to use comfortably?
11:52:49<klea>https://www.mediawiki.org/ has lots of pages for different things you can do with MediaWiki.
11:54:40<klea>https://megalodon.jp/ - Japanese Web Archive.
11:58:24<Nekroschizofrenetyk>klea Hmmm... What I mean exactly, is, when you edit/create a page on Wiki (AT Wiki specifically), it's quite difficult to read. With programming-language tools, the syntax is highlighted, with different colours etc, which makes it much easier to follow
11:58:51<klea>You can click preview to get whatever you typed in rendered.
11:59:06<Nekroschizofrenetyk>(I guess, I should somehow turn off grammar-correction red-wavy line underlining)
11:59:08<Nekroschizofrenetyk>yeah
11:59:08<klea>On newer MW installs, there's also a Visual Editor.
11:59:35<Nekroschizofrenetyk>yup, I've seen this one on Wikipedia
12:02:04pabs quits [Ping timeout: 268 seconds]
12:03:23<Nekroschizofrenetyk>well, no pain, no gain ;)
12:03:41Nekroschizofrenetyk quits [Client Quit]
12:06:48Nekroschizofrenetyk joins
12:07:44fuzzy80211 quits [Remote host closed the connection]
12:08:05fuzzy80211 joins
12:08:39pabs (pabs) joins
12:24:07<h2ibot>Nekroschizofrenetyk uploaded File:Logo kresy24 2020 k2.webp (Kresy24.pl logo): https://wiki.archiveteam.org/?title=File%3ALogo%20kresy24%202020%20k2.webp
12:24:08<h2ibot>Nekroschizofrenetyk uploaded File:Kresy24pl 11 04 2026.png (Kresy24.pl main page screenshot 11.04.2026): https://wiki.archiveteam.org/?title=File%3AKresy24pl%2011%2004%202026.png
12:24:16<cruller>If my understanding is correct, Arquivo.pt's ArchivePageNow captures content through the user's own browsing.
12:24:23<cruller>This is somewhat questionable behavior for a public archiving service, but it can be useful for complex pages.
12:26:08<h2ibot>Nekroschizofrenetyk created Kresy24.pl (+1397, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?oldid=61000
12:27:02<Nekroschizofrenetyk>Published. Hope, it's okay
12:31:40etnguyen03 (etnguyen03) joins
12:37:12<cruller>Apart from that, ArchivePageNow likely inherits the issues with pywb's warc generation.
12:47:16DigitalDragons quits [Quit: Ping timeout (120 seconds)]
12:47:29DigitalDragons (DigitalDragons) joins
12:47:48<Nekroschizofrenetyk>Max.ru - the new, Kremlin-approved communication app. Has anybody taken a look at it? I'm wondering, if that would be difficult/much different from #telegrab to run. There was a job in #archivebot, don't know how succesful, though
12:48:19TheEnbyperor quits [Ping timeout: 268 seconds]
12:48:24TheEnbyperor_ quits [Ping timeout: 268 seconds]
12:48:29<Nekroschizofrenetyk>needs some qr scanning to view in a browser
12:48:41<Nekroschizofrenetyk>so, possibly not doable
12:50:34icedice (icedice) joins
12:50:36icedice quits [Remote host closed the connection]
12:55:33TheEnbyperor joins
12:57:01TheEnbyperor_ (TheEnbyperor) joins
12:57:59<cruller>https://max.ru/prav_prazdnik It's terrible that clicking "Открыть в браузере" only displays a QR code :(
13:03:44<Nekroschizofrenetyk>yeah...
13:04:34etnguyen03 quits [Client Quit]
13:06:09<Nekroschizofrenetyk>search engine links redirect to mainpage https://max.ru/?ysclid=mdpppvy6vo262175045
13:06:40<Nekroschizofrenetyk>maaaybe it could work with a ru pipeline? but doubt it
13:06:56fuzzy80211 quits [Remote host closed the connection]
13:13:04iseaup quits [Ping timeout: 268 seconds]
13:14:09Dango360 (Dango360) joins
13:14:11fuzzy80211 joins
13:14:12fuzzy80211 quits [Remote host closed the connection]
13:15:55fuzzy80211 joins
13:17:42Wohlstand (Wohlstand) joins
13:23:24fuzzy80211 quits [Remote host closed the connection]
13:40:10Dango360 quits [Client Quit]
13:41:54Nekroschizofrenetyk quits [Client Quit]
13:48:35Nekroschizofrenetyk joins
13:48:51Dango360 (Dango360) joins
13:53:59Nekroschizofrenetyk quits [Client Quit]
13:54:10fuzzy80211 (fuzzy80211) joins
13:55:28Nekroschizofrenetyk joins
14:09:24polypept1 quits [Ping timeout: 260 seconds]
14:09:50polypeptide (polypeptide) joins
14:11:18Arcorann_ quits [Remote host closed the connection]
14:11:54Arcorann_ (Arcorann) joins
14:15:27simon816 quits [Remote host closed the connection]
14:20:42simon816 (simon816) joins
14:24:03nexussfan (nexussfan) joins
14:44:57polypeptide quits [Remote host closed the connection]
14:44:57Arcorann_ quits [Ping timeout: 268 seconds]
14:45:12polypeptide (polypeptide) joins
15:02:20<c3manu>anyone doing anything about https://dlive.tv/ ?
15:02:21<c3manu>https://community.dlive.tv/important-announcement-dlive-platform-closure/
15:02:45<c3manu>it's got a lot of livestreams (and possibly vods?) and i've no idea what's the best way to deal with that
15:09:10polypept1 (polypeptide) joins
15:09:28polypeptide quits [Remote host closed the connection]
15:19:34etnguyen03 (etnguyen03) joins
15:32:04<Cupping1285>c3manu, did a quick search and downloading vods looks quite easy. https://dlive.tv/p/dlive-dfmqedypii+aAbWQBtDR turns into https://playback.prd.dlivecdn.com/live/dlive-dfmqedypii/1775500109/src/playback.m3u8, I just don't know how they are converting the aAbWQBtDR into the number 1775500109. After that it looks quite easy to download vods, just
15:32:04<Cupping1285>split by the + and convert the string number into the actual number.
15:37:26<c3manu>that wasn't much of a technical question. it was more about what can reasonably be saved, what's valuable, how much space does take, does it require a dpos project etc
15:38:19<c3manu>that's also my way of saying i'm not gonna do the work if anyone wants that video saved. curerntly running jobs for some subdomains (with blog posts, help, etc.), but only ran an !ao on the main site
15:42:31etnguyen03 quits [Client Quit]
15:44:43<h2ibot>Manu edited Discourse/active (+47, Add www.elektronauts.com): https://wiki.archiveteam.org/?diff=61001&oldid=60965
15:51:15etnguyen03 (etnguyen03) joins
15:59:29fmeppo quits [Ping timeout: 268 seconds]
16:01:45fmeppo (fmeppo) joins
16:01:51Nekroschizofrenetyk quits [Client Quit]
16:03:44Nekroschizofrenetyk joins
16:08:40Cuphead2527480 (Cuphead2527480) joins
16:15:29ericgallager quits [Quit: This computer has gone to sleep]
16:22:03ericgallager joins
16:30:38<justauser>arkiver: amara.org videos were previously public, but they are dead now.
16:34:43emphie quits [Ping timeout: 268 seconds]
16:43:16TheEnbyperor_ quits [Ping timeout: 268 seconds]
16:43:21TheEnbyperor quits [Ping timeout: 268 seconds]
16:46:49Island joins
16:48:57TheEnbyperor joins
16:52:54TheEnbyperor_ (TheEnbyperor) joins
16:54:08ericgallager quits [Client Quit]
16:57:01ericgallager joins
16:57:23ericgallager quits [Client Quit]
17:06:23retrograde quits [Remote host closed the connection]
17:06:48retrograde (retrograde) joins
17:17:23<cruller>RE: dlive, my preferred approach is to rank all items and save as many as possible from the top down. However, to do this, you need to retrieve the metadata for all items in advance.
17:18:00<cruller>Generally speaking, a more depth-first and sequential approach is likely to be preferred.
17:20:39<justauser>Cupping1285: 1775500109 is almost certainly a timestamp, either of a video or of URL generation.
17:22:29etnguyen03 quits [Client Quit]
17:49:52etnguyen03 (etnguyen03) joins
18:05:55etnguyen03 quits [Client Quit]
18:09:36TheEnbyperor_ quits [Ping timeout: 268 seconds]
18:10:18TheEnbyperor quits [Ping timeout: 268 seconds]
18:18:25Cuphead2527480 quits [Client Quit]
18:20:54TheEnbyperor joins
18:21:55etnguyen03 (etnguyen03) joins
18:25:27ericgallager joins
18:29:10AK (AK) joins
18:31:39TheEnbyperor_ (TheEnbyperor) joins
19:12:34etnguyen03 quits [Client Quit]
19:14:16Webuser737349 joins
19:14:32Webuser737349 quits [Client Quit]
19:39:38Stargazers quits [Remote host closed the connection]
19:42:08mikael quits [Quit: ZNC - http://znc.in]
19:57:27etnguyen03 (etnguyen03) joins
20:01:45DogsRNice joins
20:05:50SootBector quits [Remote host closed the connection]
20:06:58SootBector (SootBector) joins
20:08:33mikael joins
20:11:23Nekroschizofrenetyk quits [Quit: Ooops, wrong browser tab.]
20:12:04Nekroschizofrenetyk joins