00:06:47<alexlehm>nicolas17: yes, but js is simple to get values (i would say)
00:10:22APOLLO03 joins
00:11:47nyakase quits [Remote host closed the connection]
00:14:27nyakase (nyakase) joins
00:15:31nulldata (nulldata) joins
00:18:16FiTheArchiver quits [Read error: Connection reset by peer]
00:22:09APOLLO03 quits [Client Quit]
00:24:59nulldata2 (nulldata) joins
00:27:18nulldata quits [Ping timeout: 260 seconds]
00:29:19nulldata2 is now known as nulldata
00:32:13sec^nd quits [Remote host closed the connection]
00:32:20APOLLO03 joins
00:32:39sec^nd (second) joins
00:34:52BennyOtt quits [Ping timeout: 250 seconds]
00:35:22BennyOtt (BennyOtt) joins
00:38:53<pabs>c3manu: the monitoring wiki page has things https://wiki.archiveteam.org/index.php/ArchiveBot/Monitoring
00:39:27notarobot16 joins
00:40:30notarobot1 quits [Ping timeout: 250 seconds]
00:40:30notarobot16 is now known as notarobot1
00:40:34<pabs>c3manu: here is my bash+jq thing https://paste.debian.net/hidden/b1629cf3/
00:40:50<pabs>run it like this: JOB=82onm6c77s0l8af301qmzti42 archivebot-monitor | sort -u
00:41:51<pabs>katia/JAA have their own one-liners for this too
00:43:01<@JAA>Yeah, I think I wrote one in -dev when ab2f was new.
00:45:13<pabs>figuring out when the response code changed is a bit more complex, you'll need to write some jq to select the right stuff, and then print
00:45:31<pabs>might be easier to parse the file in Python and do it that way
00:47:39<pabs>an example jq statement I use for the monitoring stuff: (.url) and (.url | test(env.ignore; "x") | not) and (.url | test(env.match; "x"))
00:47:52<pabs>and then you pass regexes to that via environment variables
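[editor's note] A rough Python sketch of the same filter may make the jq expression above easier to follow. It assumes the two regexes arrive in environment variables named `ignore` and `match`, as in pabs's statement, and it approximates jq's "x" (extended) flag with `re.VERBOSE`, which is similar but not identical. The function name `make_filter` and the sample records are hypothetical.

```python
import os
import re


def make_filter(ignore: str, match: str):
    """Build a predicate: keep records whose URL matches `match` but not `ignore`."""
    # re.VERBOSE roughly mirrors jq's "x" flag (whitespace and comments ignored).
    ignore_re = re.compile(ignore, re.VERBOSE)
    match_re = re.compile(match, re.VERBOSE)

    def keep(record: dict) -> bool:
        url = record.get("url")
        if not isinstance(url, str):
            # Corresponds to the leading (.url) test: skip records without a URL.
            return False
        # Like jq's test(), re.search matches anywhere in the string.
        return not ignore_re.search(url) and bool(match_re.search(url))

    return keep


# Defaults are assumptions: "(?!)" never matches, "" always matches.
keep = make_filter(os.environ.get("ignore", "(?!)"), os.environ.get("match", ""))

# Hypothetical records for illustration only.
for record in [{"url": "https://example.org/page"}, {"job_data": {}}]:
    if keep(record):
        print(record["url"])
```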
00:50:17<@JAA>I'd probably use jq + AWK.
00:50:54<@JAA>And pre-filter with grep so jq doesn't need to parse useless things.
00:52:19<nicolas17>what projects are running atm?
00:52:34<pabs>nicolas17: ab2f saves the AB websocket to per-job files https://ab2f.archivingyoursh.it/ (run by katia) (files are deleted after 30days inactivity)
00:53:24BlueMaxima quits [Read error: Connection reset by peer]
00:53:33<@JAA>Oh yeah, pre-filtering doesn't work for onsite URLs because the initial URL appears in the job info.
00:54:33<pabs>you could grep for "url":... ?
00:55:19<pabs>ah that won't work because of "job_data":{"url":
00:56:07<pabs>,"url":... would though
00:56:34<pabs>or {"ts":\d+\.\d+,"url":
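[editor's note] The pre-filtering idea can be sketched in Python as well: run a cheap regex over each raw line and only hand matching lines to the JSON parser. The pattern is pabs's suggestion verbatim and assumes URL records begin with a "ts" key followed directly by "url"; the sample lines and the name `url_records` are hypothetical and not taken from a real job log.

```python
import json
import re

# pabs's suggested pre-filter; assumes URL records start with "ts", then "url".
URL_RECORD = re.compile(r'\{"ts":\d+\.\d+,"url":')


def url_records(lines):
    """Yield parsed records only for lines that pass the regex pre-filter."""
    for line in lines:
        if URL_RECORD.search(line):
            yield json.loads(line)


# Hypothetical sample lines: a job-info line and a URL record.
sample = [
    '{"job_data":{"url":"https://mathb.in/"},"ts":1741824000.1}',
    '{"ts":1741824001.25,"url":"https://mathb.in/1","response_code":200}',
]

for record in url_records(sample):
    print(record["url"])
```

The job-info line is skipped without being parsed, because its "url" key sits inside "job_data" and not directly after "ts".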
00:56:53<@JAA>Without pre-filtering: `curl -s https://ab2f.archivingyoursh.it/5nw1doj250k8ux5q49tesggln.jsonl | jq -r 'select(.url) | "\(.ts | strftime("%Y-%m-%dT%H:%M:%S")) \(.response_code) \(.url)"' | grep -F //mathb.in/ | awk '$2 != l { print; } { l = $2; }'`
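[editor's note] For readers who do not use jq or AWK, here is a hedged Python sketch of what JAA's pipeline does: keep records that have a URL containing a fixed substring, and print a line only when the response code differs from the previously kept record. The field names `ts`, `response_code` and `url` come from the command above; the function name `response_code_changes` and the sample lines are hypothetical.

```python
import json
from datetime import datetime, timezone

_UNSET = object()  # sentinel so the first kept record is always reported


def response_code_changes(lines, needle="//mathb.in/"):
    """Yield 'timestamp code url' whenever the response code changes."""
    last = _UNSET
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        url = record.get("url")
        # select(.url) plus grep -F: skip job info and off-site URLs.
        if not url or needle not in url:
            continue
        code = record.get("response_code")
        if code != last:
            # jq's strftime formats in UTC, so do the same here.
            ts = datetime.fromtimestamp(record["ts"], tz=timezone.utc)
            yield f"{ts.strftime('%Y-%m-%dT%H:%M:%S')} {code} {url}"
        last = code


# Hypothetical sample lines for illustration only.
sample = [
    '{"ts": 0, "url": "https://mathb.in/1", "response_code": 200}',
    '{"ts": 60, "url": "https://mathb.in/2", "response_code": 200}',
    '{"ts": 120, "url": "https://example.org/", "response_code": 500}',
    '{"ts": 180, "url": "https://mathb.in/3", "response_code": 503}',
    '{"ts": 240, "job_data": {"url": "https://mathb.in/"}}',
]

for change in response_code_changes(sample):
    print(change)
```

As in the shell version, the substring filter is applied before the change detection, so a different code on an unrelated host does not count as a change.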
01:04:28<steering>awk?!
01:05:41<steering>I had no idea it even had user defined variables like that
01:07:53<@JAA>AWK is neat for things like this.
01:08:02<nicolas17>I should learn awk
01:08:08<steering>yeah, I should too
01:08:18<steering>instead I just turn to Perl whenever I need to do something like that
01:08:19<nicolas17>btw how overloaded is IA? am I going to make things worse by uploading ~40GB? :P
01:08:25<steering>(... which means awk would be easy to learn but oh well :P)
01:08:26<anarcat>steering: that's fine too
01:09:06<@JAA>Just don't go too close to the cliff: https://github.com/crossbowerbt/awk-webserver https://github.com/TheMozg/awk-raycaster
01:09:26<steering>mostly I should turn to awk more for stuff like "print lines where <some field> matches <some value>"
01:09:28<nicolas17>JAA: https://github.com/64/cmake-raytracer
01:09:46<@JAA>nicolas17: Yes, but we were talking about AWK. :-)
01:10:05anarcat watches archiveteam rewrite the world in bash
01:10:08<steering>(bit shorter in awk for the common case of space separated fields I think)
01:10:10<nicolas17>I love programs written in "inappropriate" languages
01:10:17<nicolas17>anarcat: https://github.com/p8952/bocker
01:10:26<anarcat>my eyes!
01:11:21<steering>nicolas17: seems very likely to be superior to docker
01:11:22<steering>xP
01:11:24<@JAA>I've heard that katia uses this: https://github.com/learnk8s/xlskubectl
01:11:52<@imer>nicolas17: (re: how overloaded is IA) you should probably just upload it - looking fine at the moment with AT things slowed down
01:12:53<anarcat>JAA: jfc
01:16:59<Vokun>What percent of IA's problems are we?
01:31:29devkev quits [Quit: The Lounge - https://thelounge.chat]
01:32:14devkev (devkev) joins
02:14:09BennyOtt_ joins
02:15:17etnguyen03 quits [Client Quit]
02:16:23BennyOtt quits [Ping timeout: 260 seconds]
02:16:24BennyOtt_ is now known as BennyOtt
02:19:15etnguyen03 (etnguyen03) joins
02:20:42tmg1|michelson joins
02:32:41lennier2_ joins
02:35:38lennier2 quits [Ping timeout: 260 seconds]
02:48:12etnguyen03 quits [Remote host closed the connection]
03:01:18riteo quits [Ping timeout: 260 seconds]
03:11:33riteo (riteo) joins
03:21:08dontwashyourhands (dontwashyourhands) joins
03:21:16dontwashyourhands quits [Client Quit]
03:51:08Bleo18260072271962345 quits [Quit: Ping timeout (120 seconds)]
03:51:38Bleo18260072271962345 joins
04:05:04sparky14921 (sparky1492) joins
04:08:30sparky1492 quits [Ping timeout: 250 seconds]
04:08:31sparky14921 is now known as sparky1492
04:15:58datechnoman quits [Ping timeout: 260 seconds]
04:17:11datechnoman (datechnoman) joins
04:34:16Island quits [Read error: Connection reset by peer]
04:54:07<h2ibot>Vitzli uploaded File:Libgen.is-homepage-20250313.png (Libgen.is homepage screenshot as of 2025-03-13): https://wiki.archiveteam.org/?title=File%3ALibgen.is-homepage-20250313.png
04:54:08<h2ibot>Vitzli uploaded File:Libgen.li-homepage-20250313.png (Libgen.li homepage screenshot as of 2025-03-13): https://wiki.archiveteam.org/?title=File%3ALibgen.li-homepage-20250313.png
04:57:07<h2ibot>Vitzli edited Library Genesis (+16, /* February-March 2025 hosting outage */…): https://wiki.archiveteam.org/?diff=54867&oldid=54834
05:00:08<h2ibot>Vitzli edited Library Genesis (+42, Add a homepage screenshot for libgen.rs (classic)): https://wiki.archiveteam.org/?diff=54868&oldid=54867
05:04:08<h2ibot>Vitzli edited Library Genesis (+26, Set language category - 'Multilingual'): https://wiki.archiveteam.org/?diff=54869&oldid=54868
05:23:12<h2ibot>Luc7 edited Retrospring (+204, update to reflect new banner with new shutdown…): https://wiki.archiveteam.org/?diff=54870&oldid=54502
05:23:13<h2ibot>Mvolz edited US Government (+59, /* Other content that may be at risk based on…): https://wiki.archiveteam.org/?diff=54871&oldid=54491
05:23:14<h2ibot>Tapedrive edited List of websites excluded from the Wayback Machine/Partial exclusions (+127, addalexstjohn.com to list of excluded websites): https://wiki.archiveteam.org/?diff=54872&oldid=54731
05:23:15<h2ibot>Vitzli moved File:Libgen.li-homepage-20250313.png to File:Libgen.li-homepage.png (Remove timestamp): https://wiki.archiveteam.org/?title=File%3ALibgen.li-homepage.png
05:23:16<h2ibot>Vitzli moved File:Libgen.is-homepage-20250313.png to File:Libgen.is-homepage.png (Remove timestamp): https://wiki.archiveteam.org/?title=File%3ALibgen.is-homepage.png
05:25:12<h2ibot>Vitzli edited Library Genesis (-9, Adjust a homepage image location): https://wiki.archiveteam.org/?diff=54877&oldid=54869
05:26:12<h2ibot>JustAnotherArchivist edited List of websites excluded from the Wayback Machine/Partial exclusions (+9, Fix missing path): https://wiki.archiveteam.org/?diff=54878&oldid=54872
05:43:28nepeat quits [Ping timeout: 260 seconds]
06:38:23Megame quits [Quit: Leaving]
06:46:39gripebomb9 quits [Quit: The Lounge - https://thelounge.chat]
07:06:14<steering>huh, i'm surprised pornhub (and phcdn) isn't excluded from WBM. the first couple segments of the random video I picked off the front page load, even!
07:06:27<steering>bunch of other porn sites on the list
07:32:02Wohlstand (Wohlstand) joins
07:41:21notSokar joins
07:41:21Sokar quits [Read error: Connection reset by peer]
07:47:01<that_lurker>steering: https://media.tenor.com/W-42HlChzwAAAAAM/rainn-wilson.gif
07:47:34ducky quits [Ping timeout: 260 seconds]
08:05:05ducky (ducky) joins
08:28:02<datechnoman>Archiving important history
08:30:03LddPotato quits [Remote host closed the connection]
08:31:52LddPotato (LddPotato) joins
09:32:20Hackerpcs quits [Quit: Hackerpcs]
09:42:50Hackerpcs (Hackerpcs) joins
09:47:00<@arkiver>does anyone know if we have "2025-03-31: NTT Plala will remove public access to home pages." covered with AB?
09:48:05<@arkiver>looking into arzon.jp (NSFW)
09:48:17<@arkiver>and the three Asian ones
09:48:35<@arkiver>i love how we seem to have lots of insight into which sites are shutting down in Japan
10:08:34Ketchup901 quits [Remote host closed the connection]
10:08:47Ketchup901 (Ketchup901) joins
10:08:56SootBector quits [Remote host closed the connection]
10:09:20SootBector (SootBector) joins
10:38:52arch quits [Remote host closed the connection]
10:39:09arch joins
11:00:04Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
11:02:51Bleo18260072271962345 joins
11:35:23SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
11:35:52SkilledAlpaca418962 joins
11:38:08Wohlstand quits [Ping timeout: 260 seconds]
11:38:10FiTheArchiver joins
11:59:57BornOn420 quits [Remote host closed the connection]
12:00:38BornOn420 (BornOn420) joins
12:26:58loug83181422 joins
13:23:10@Sanqui quits [Ping timeout: 250 seconds]
13:31:09Bjorn1978_2 joins
13:31:59Bjorn1978_2 quits [Client Quit]
13:32:35Bjorn1978_2 joins
13:37:59Sanqui joins
13:39:06<Bjorn1978_2>US Government backup issue with version:
13:39:06<Bjorn1978_2>I got an error earlier today on my two computers. One running Docker, and one running Oracle VirtualBox.
13:39:06<Bjorn1978_2>Staring at my desktop I tried to restart Docker, but no help at all. I also tried to force an update, wait an hour and everything else I was able to think of.
13:39:06<Bjorn1978_2>I then uninstalled all containers to remove both WatchTower and the ArchiveTeam containers by using the garbage bin/delete function on the front page of docker.
13:39:06<Bjorn1978_2>When trying to install it again, I am stuck at a message on "project code is out of date".
13:39:06Bjorn1978_2 quits [Excess Flood]
13:39:12Bjorn1978_2 joins
13:39:23Sanqui quits [Changing host]
13:39:23Sanqui (Sanqui) joins
13:39:23@ChanServ sets mode: +o Sanqui
13:39:39Bjorn1978_2 quits [Excess Flood]
13:39:47Bjorn1978_2 joins
13:40:41<pabs>Bjorn1978_2: USG project is paused due to archive.org intake issues IIRC
13:43:23<Bjorn1978_2>Ahhhh... Shit...
13:43:23<Bjorn1978_2>Do we have any timeline on fixing this?
13:43:23<Bjorn1978_2>I mailed the Norwegian national TV broadcasting company about this project as a response to a news article about unrest within the scientific communities due to data sources being removed. And they want to run a piece on this. But if it is paused... That sort of takes the edge off the article :D
13:47:38APOLLO03 quits [Ping timeout: 260 seconds]
13:57:37<pabs>ah, I'm wrong. not paused but slowed down. the channel is #UncleSamsArchive btw
13:58:02<Bjorn1978_2>ok
14:20:48katocala quits [Ping timeout: 250 seconds]
14:35:24Bjorn1978_2 quits [Client Quit]
14:52:19APOLLO03 joins
15:17:28tek_dmn quits [Ping timeout: 260 seconds]
15:17:51Wohlstand (Wohlstand) joins
15:20:10BennyOtt_ joins
15:22:43Wohlstand quits [Ping timeout: 260 seconds]
15:23:38BennyOtt quits [Ping timeout: 250 seconds]
15:23:38BennyOtt_ is now known as BennyOtt
15:32:59Island joins
15:34:12HugsNotDrugs joins
15:36:41Megame (Megame) joins
15:36:57Naruyoko joins
15:40:06Naruyoko5 quits [Ping timeout: 250 seconds]
16:18:00<c3manu>pabs: thanks <3
16:24:48HP_Archivist (HP_Archivist) joins
16:26:24<@arkiver>is someone able to come up with a channel name for ss-blog.jp ?
16:34:14<that_lurker>hmm. that is a hard site to make a channel name for
16:36:04<@arkiver>also the site has some human verification stuff here and there...
16:36:22tek_dmn (tek_dmn) joins
16:36:32<that_lurker>something like #ssscribe might work
16:38:13<@arkiver>maybe yeah
16:38:32<@arkiver>perhaps we have someone here who can read the language and make a funny version or pun on it
16:38:36nine quits [Quit: See ya!]
16:47:21<@arkiver>... and maho.jp will not work well in the Wayback Machine due to graphql POST requests
17:18:42<Vokun>the SS stands for seesaa, which doesn't really mean anything, but #seensaw?
17:18:52<Vokun>#seensaa
17:24:42<that_lurker>yeah. Might not be a good idea to have SS in the name so those work well.
17:25:18<FiTheArchiver>yeah i like the seesaw idea
18:01:41<pokechu22>Yeah, that seems like a better idea to me too
18:02:24sparky14920 (sparky1492) joins
18:06:08sparky1492 quits [Ping timeout: 250 seconds]
18:06:09sparky14920 is now known as sparky1492
18:23:24<h2ibot>VoynichCr created OpenAlex (+288, Created page with "'''OpenAlex''' is a…): https://wiki.archiveteam.org/?title=OpenAlex
18:23:25<h2ibot>VoynichCr edited OpenAlex (+167): https://wiki.archiveteam.org/?diff=54880&oldid=54879
18:24:24<h2ibot>VoynichCr created Bibliographic catalogue (+45, Redirected page to [[List of bibliographic…): https://wiki.archiveteam.org/?title=Bibliographic%20catalogue
18:26:24<h2ibot>VoynichCr edited OpenAlex (+501): https://wiki.archiveteam.org/?diff=54882&oldid=54880
18:26:25<h2ibot>VoynichCr edited OpenAlex (+394): https://wiki.archiveteam.org/?diff=54883&oldid=54882
18:27:24<h2ibot>VoynichCr edited Microsoft Academic (+35, /* External links */): https://wiki.archiveteam.org/?diff=54884&oldid=54862
18:27:25<h2ibot>VoynichCr edited OpenAlex (+25, /* External links */): https://wiki.archiveteam.org/?diff=54885&oldid=54883
18:32:16ell7 quits [Quit: Ping timeout (120 seconds)]
18:32:19ell70 (ell) joins
18:37:26<h2ibot>VoynichCr edited Template:Category archives by year (+4, link to year): https://wiki.archiveteam.org/?diff=54886&oldid=54560
18:38:26<h2ibot>VoynichCr created 2022 (+207, Created page with "{{Years}} == Events == *…): https://wiki.archiveteam.org/?title=2022
18:40:26<h2ibot>VoynichCr uploaded File:OpenAlex.png: https://wiki.archiveteam.org/?title=File%3AOpenAlex.png
18:41:27<h2ibot>VoynichCr edited OpenAlex (+12): https://wiki.archiveteam.org/?diff=54889&oldid=54885
18:41:28<h2ibot>VoynichCr edited OpenAlex (+4): https://wiki.archiveteam.org/?diff=54890&oldid=54889
18:42:27<h2ibot>VoynichCr edited Wikidata (+26, | language = Multilingual): https://wiki.archiveteam.org/?diff=54891&oldid=50334
18:42:28<h2ibot>VoynichCr edited Wikimedia Commons (+26, | language = Multilingual): https://wiki.archiveteam.org/?diff=54892&oldid=53487
18:43:23<c3manu>arkiver, that_lurker #sos-blog ?
18:44:43<FiTheArchiver>wait yeah that ones good
18:46:27<h2ibot>VoynichCr created Talk:Wikimedia Commons (+206, Created page with "I think we could run the…): https://wiki.archiveteam.org/?title=Talk%3AWikimedia%20Commons
18:50:35VoynichCR (VoynichCR) joins
18:51:26Wohlstand (Wohlstand) joins
18:57:58VoynichCR quits [Client Quit]
19:04:57BornOn420 quits [Remote host closed the connection]
19:05:35BornOn420 (BornOn420) joins
19:06:31<h2ibot>VoynichCr edited List of bibliographic databases (+37): https://wiki.archiveteam.org/?diff=54894&oldid=54861
19:08:45Webuser555700 joins
19:09:27Webuser555700 quits [Client Quit]
20:01:22sparky14923 (sparky1492) joins
20:05:03sparky1492 quits [Ping timeout: 260 seconds]
20:05:04sparky14923 is now known as sparky1492
20:06:10pokechu22 quits [Ping timeout: 250 seconds]
20:08:47pokechu22 (pokechu22) joins
20:12:55<chrismrtn>Looks like the blog URLs end with blog.so-net.ne.jp or blog.ss-blog.jp, and I get more CAPTCHAs when using *.blog.so-net.ne.jp. (Either works: *.blog.so-net.ne.jp seems to redirect to *.blog.ss-blog.jp)
20:24:53Wohlstand quits [Client Quit]
20:32:00SootBector quits [Remote host closed the connection]
20:32:23SootBector (SootBector) joins
20:57:36IDK (IDK) joins
21:02:32notarobot11 joins
21:03:58notarobot1 quits [Ping timeout: 260 seconds]
21:03:59notarobot11 is now known as notarobot1
21:09:40etnguyen03 (etnguyen03) joins
21:11:56<h2ibot>Exorcism edited Bugzilla (-31, /* Status */): https://wiki.archiveteam.org/?diff=54895&oldid=54825
22:32:46<nicolas17>how are the targets doing now?
22:32:55<nicolas17>I'm having no problems uploading to youtube anymore
22:56:33Megame quits [Ping timeout: 260 seconds]
22:56:49Megame1 (Megame) joins
23:27:04loug83181422 quits [Quit: The Lounge - https://thelounge.chat]
23:30:10FiTheArchiver quits [Read error: Connection reset by peer]
23:34:51Megame1 quits [Remote host closed the connection]
23:35:11Megame1 (Megame) joins
23:38:34simon816 quits [Quit: ZNC 1.9.1 - https://znc.in]
23:39:12etnguyen03 quits [Client Quit]
23:40:51simon816 (simon816) joins
23:47:06gust joins
23:54:54nine joins
23:54:54nine quits [Changing host]
23:54:54nine (nine) joins