00:06:47 | <alexlehm> | nicolas17: yes, but js is simple to get values (i would say) |
00:10:22 | | APOLLO03 joins |
00:11:47 | | nyakase quits [Remote host closed the connection] |
00:14:27 | | nyakase (nyakase) joins |
00:15:31 | | nulldata (nulldata) joins |
00:18:16 | | FiTheArchiver quits [Read error: Connection reset by peer] |
00:22:09 | | APOLLO03 quits [Client Quit] |
00:24:59 | | nulldata2 (nulldata) joins |
00:27:18 | | nulldata quits [Ping timeout: 260 seconds] |
00:29:19 | | nulldata2 is now known as nulldata |
00:32:13 | | sec^nd quits [Remote host closed the connection] |
00:32:20 | | APOLLO03 joins |
00:32:39 | | sec^nd (second) joins |
00:34:52 | | BennyOtt quits [Ping timeout: 250 seconds] |
00:35:22 | | BennyOtt (BennyOtt) joins |
00:38:53 | <pabs> | c3manu: the monitoring wiki page has things https://wiki.archiveteam.org/index.php/ArchiveBot/Monitoring |
00:39:27 | | notarobot16 joins |
00:40:30 | | notarobot1 quits [Ping timeout: 250 seconds] |
00:40:30 | | notarobot16 is now known as notarobot1 |
00:40:34 | <pabs> | c3manu: here is my bash+jq thing https://paste.debian.net/hidden/b1629cf3/ |
00:40:50 | <pabs> | run it like this: JOB=82onm6c77s0l8af301qmzti42 archivebot-monitor | sort -u |
00:41:51 | <pabs> | katia/JAA have their own one-liners for this too |
00:43:01 | <@JAA> | Yeah, I think I wrote one in -dev when ab2f was new. |
00:45:13 | <pabs> | figuring out when the response code changed is a bit more complex, you'll need to write some jq to select the right stuff, and then print |
00:45:31 | <pabs> | might be easier to parse the file in Python and do it that way |
00:47:39 | <pabs> | an example jq statement I use for the monitoring stuff: (.url) and (.url | test(env.ignore; "x") | not) and (.url | test(env.match; "x")) |
00:47:52 | <pabs> | and then you pass regexes to that via environment variables |
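Editor's sketch of the filter pabs describes, written in Python as pabs suggests for the more complex cases. It keeps a record only if it has a `url`, the URL does not match the `ignore` regex, and it does match the `match` regex, with both regexes read from environment variables of those names. The function names `make_url_filter` and `filter_lines` are hypothetical, and using `re.VERBOSE` as a stand-in for jq's `"x"` flag is an assumption (the two regex dialects are not identical).

```python
import json
import os
import re


def make_url_filter(environ=os.environ):
    """Build a predicate roughly equivalent to the jq expression above."""
    # jq's "x" flag enables extended regexes (whitespace and comments ignored);
    # re.VERBOSE is the closest Python analogue, not an exact match.
    ignore = re.compile(environ["ignore"], re.VERBOSE)
    match = re.compile(environ["match"], re.VERBOSE)

    def keep(record):
        url = record.get("url")
        # Records without a top-level string "url" are dropped.
        if not isinstance(url, str):
            return False
        return ignore.search(url) is None and match.search(url) is not None

    return keep


def filter_lines(lines, keep):
    """Parse JSON Lines input and yield only the records that pass `keep`."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        if keep(record):
            yield record
```

Passing a plain dict instead of `os.environ` makes the filter easy to try out without exporting variables first.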
00:50:17 | <@JAA> | I'd probably use jq + AWK. |
00:50:54 | <@JAA> | And pre-filter with grep so jq doesn't need to parse useless things. |
00:52:19 | <nicolas17> | what projects are running atm? |
00:52:34 | <pabs> | nicolas17: ab2f saves the AB websocket to per-job files https://ab2f.archivingyoursh.it/ (run by katia) (files are deleted after 30days inactivity) |
00:53:24 | | BlueMaxima quits [Read error: Connection reset by peer] |
00:53:33 | <@JAA> | Oh yeah, pre-filtering doesn't work for onsite URLs because the initial URL appears in the job info. |
00:54:33 | <pabs> | you could grep for "url":... ? |
00:55:19 | <pabs> | ah that won't work because of "job_data":{"url": |
00:56:07 | <pabs> | ,"url":... would though |
00:56:34 | <pabs> | or {"ts":\d+\.\d+,"url": |
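Editor's sketch of the pre-filter idea, in Python: test each raw line against pabs's last pattern before paying for JSON parsing. It assumes, as that pattern does, that URL records are serialized compactly with `"ts"` as a decimal number immediately followed by `"url"`; records laid out differently (for example an integer `ts`) would be skipped. The names `URL_RECORD` and `prefilter` are hypothetical.

```python
import re

# pabs's pattern: a float timestamp directly followed by the top-level "url" key.
# Job-info lines carry the URL under "job_data" instead, so they do not match.
URL_RECORD = re.compile(r'\{"ts":\d+\.\d+,"url":')


def prefilter(lines):
    """Yield only the raw lines that look like per-URL records."""
    for line in lines:
        if URL_RECORD.search(line):
            yield line
```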
00:56:53 | <@JAA> | Without pre-filtering: `curl -s https://ab2f.archivingyoursh.it/5nw1doj250k8ux5q49tesggln.jsonl | jq -r 'select(.url) | "\(.ts | strftime("%Y-%m-%dT%H:%M:%S")) \(.response_code) \(.url)"' | grep -F //mathb.in/ | awk '$2 != l { print; } { l = $2; }'` |
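Editor's sketch of the same pipeline in Python, for readers who do not know AWK. It reads the JSON Lines records, keeps those with a `url` containing a given substring, and emits a line only when `response_code` differs from the previous kept record, which is what the `$2 != l` AWK rule does. The field names `ts`, `url` and `response_code` come from JAA's command; the function name `response_code_changes` is hypothetical, and the substring is checked against the URL only rather than the whole formatted line.

```python
import json
from datetime import datetime, timezone


def response_code_changes(lines, needle):
    """Yield 'timestamp code url' lines whenever the response code changes."""
    previous = object()  # sentinel that never equals a real response code
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        url = record.get("url")
        # Skip job-info records (no top-level url) and off-site URLs.
        if not url or needle not in url:
            continue
        code = record.get("response_code")
        if code != previous:
            # Format the timestamp in UTC, like jq's strftime.
            stamp = datetime.fromtimestamp(record["ts"], tz=timezone.utc)
            yield f"{stamp.strftime('%Y-%m-%dT%H:%M:%S')} {code} {url}"
        previous = code
```

Fed the lines of the downloaded `.jsonl` file and the substring `//mathb.in/`, it should produce output shaped like that of the one-liner.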
01:04:28 | <steering> | awk?! |
01:05:41 | <steering> | I had no idea it even had user defined variables like that |
01:07:53 | <@JAA> | AWK is neat for things like this. |
01:08:02 | <nicolas17> | I should learn awk |
01:08:08 | <steering> | yeah, I should too |
01:08:18 | <steering> | instead I just turn to Perl whenever I need to do something like that |
01:08:19 | <nicolas17> | btw how overloaded is IA? am I going to make things worse by uploading ~40GB? :P |
01:08:25 | <steering> | (... which means awk would be easy to learn but oh well :P) |
01:08:26 | <anarcat> | steering: that's fine too |
01:09:06 | <@JAA> | Just don't go too close to the cliff: https://github.com/crossbowerbt/awk-webserver https://github.com/TheMozg/awk-raycaster |
01:09:26 | <steering> | mostly I should turn to awk more for stuff like "print lines where <some field> matches <some value>" |
01:09:28 | <nicolas17> | JAA: https://github.com/64/cmake-raytracer |
01:09:46 | <@JAA> | nicolas17: Yes, but we were talking about AWK. :-) |
01:10:05 | | anarcat watches archiveteam rewrite the world in bash |
01:10:08 | <steering> | (bit shorter in awk for the common case of space separated fields I think) |
01:10:10 | <nicolas17> | I love programs written in "inappropriate" languages |
01:10:17 | <nicolas17> | anarcat: https://github.com/p8952/bocker |
01:10:26 | <anarcat> | my eyes! |
01:11:21 | <steering> | nicolas17: seems very likely to be superior to docker |
01:11:22 | <steering> | xP |
01:11:24 | <@JAA> | I've heard that katia uses this: https://github.com/learnk8s/xlskubectl |
01:11:52 | <@imer> | nicolas17: (re: how overloaded is IA) you should probably just upload it - looking fine at the moment with AT things slowed down |
01:12:53 | <anarcat> | JAA: jfc |
01:16:59 | <Vokun> | What percent of IA's problems are we? |
01:31:29 | | devkev quits [Quit: The Lounge - https://thelounge.chat] |
01:32:14 | | devkev (devkev) joins |
02:14:09 | | BennyOtt_ joins |
02:15:17 | | etnguyen03 quits [Client Quit] |
02:16:23 | | BennyOtt quits [Ping timeout: 260 seconds] |
02:16:24 | | BennyOtt_ is now known as BennyOtt |
02:16:24 | | BennyOtt is now authenticated as BennyOtt |
02:19:15 | | etnguyen03 (etnguyen03) joins |
02:20:42 | | tmg1|michelson joins |
02:32:41 | | lennier2_ joins |
02:35:38 | | lennier2 quits [Ping timeout: 260 seconds] |
02:48:12 | | etnguyen03 quits [Remote host closed the connection] |
03:01:18 | | riteo quits [Ping timeout: 260 seconds] |
03:11:33 | | riteo (riteo) joins |
03:21:08 | | dontwashyourhands (dontwashyourhands) joins |
03:21:16 | | dontwashyourhands quits [Client Quit] |
03:51:08 | | Bleo18260072271962345 quits [Quit: Ping timeout (120 seconds)] |
03:51:38 | | Bleo18260072271962345 joins |
04:05:04 | | sparky14921 (sparky1492) joins |
04:08:30 | | sparky1492 quits [Ping timeout: 250 seconds] |
04:08:31 | | sparky14921 is now known as sparky1492 |
04:15:58 | | datechnoman quits [Ping timeout: 260 seconds] |
04:17:11 | | datechnoman (datechnoman) joins |
04:34:16 | | Island quits [Read error: Connection reset by peer] |
04:54:07 | <h2ibot> | Vitzli uploaded File:Libgen.is-homepage-20250313.png (Libgen.is homepage screenshot as of 2025-03-13): https://wiki.archiveteam.org/?title=File%3ALibgen.is-homepage-20250313.png |
04:54:08 | <h2ibot> | Vitzli uploaded File:Libgen.li-homepage-20250313.png (Libgen.li homepage screenshot as of 2025-03-13): https://wiki.archiveteam.org/?title=File%3ALibgen.li-homepage-20250313.png |
04:57:07 | <h2ibot> | Vitzli edited Library Genesis (+16, /* February-March 2025 hosting outage */…): https://wiki.archiveteam.org/?diff=54867&oldid=54834 |
05:00:08 | <h2ibot> | Vitzli edited Library Genesis (+42, Add a homepage screenshot for libgen.rs (classic)): https://wiki.archiveteam.org/?diff=54868&oldid=54867 |
05:04:08 | <h2ibot> | Vitzli edited Library Genesis (+26, Set language category - 'Multilingual'): https://wiki.archiveteam.org/?diff=54869&oldid=54868 |
05:23:12 | <h2ibot> | Luc7 edited Retrospring (+204, update to reflect new banner with new shutdown…): https://wiki.archiveteam.org/?diff=54870&oldid=54502 |
05:23:13 | <h2ibot> | Mvolz edited US Government (+59, /* Other content that may be at risk based on…): https://wiki.archiveteam.org/?diff=54871&oldid=54491 |
05:23:14 | <h2ibot> | Tapedrive edited List of websites excluded from the Wayback Machine/Partial exclusions (+127, addalexstjohn.com to list of excluded websites): https://wiki.archiveteam.org/?diff=54872&oldid=54731 |
05:23:15 | <h2ibot> | Vitzli moved File:Libgen.li-homepage-20250313.png to File:Libgen.li-homepage.png (Remove timestamp): https://wiki.archiveteam.org/?title=File%3ALibgen.li-homepage.png |
05:23:16 | <h2ibot> | Vitzli moved File:Libgen.is-homepage-20250313.png to File:Libgen.is-homepage.png (Remove timestamp): https://wiki.archiveteam.org/?title=File%3ALibgen.is-homepage.png |
05:25:12 | <h2ibot> | Vitzli edited Library Genesis (-9, Adjust a homepage image location): https://wiki.archiveteam.org/?diff=54877&oldid=54869 |
05:26:12 | <h2ibot> | JustAnotherArchivist edited List of websites excluded from the Wayback Machine/Partial exclusions (+9, Fix missing path): https://wiki.archiveteam.org/?diff=54878&oldid=54872 |
05:43:28 | | nepeat quits [Ping timeout: 260 seconds] |
06:38:23 | | Megame quits [Quit: Leaving] |
06:46:39 | | gripebomb9 quits [Quit: The Lounge - https://thelounge.chat] |
07:06:14 | <steering> | huh, i'm surprised pornhub (and phcdn) isn't excluded from WBM. the first couple segments of the random video I picked off the front page load, even! |
07:06:27 | <steering> | bunch of other porn sites on the list |
07:32:02 | | Wohlstand (Wohlstand) joins |
07:41:21 | | notSokar joins |
07:41:21 | | Sokar quits [Read error: Connection reset by peer] |
07:47:01 | <that_lurker> | steering: https://media.tenor.com/W-42HlChzwAAAAAM/rainn-wilson.gif |
07:47:34 | | ducky quits [Ping timeout: 260 seconds] |
08:05:05 | | ducky (ducky) joins |
08:28:02 | <datechnoman> | Archiving important history |
08:30:03 | | LddPotato quits [Remote host closed the connection] |
08:31:52 | | LddPotato (LddPotato) joins |
09:32:20 | | Hackerpcs quits [Quit: Hackerpcs] |
09:42:50 | | Hackerpcs (Hackerpcs) joins |
09:47:00 | <@arkiver> | does anyone know if we have "2025-03-31: NTT Plala will remove public access to home pages." covered with AB? |
09:48:05 | <@arkiver> | looking into arzon.jp (NSFW) |
09:48:17 | <@arkiver> | and the three Asian ones |
09:48:35 | <@arkiver> | i love how we seem to have lots of insight into which sites are shutting down in Japan |
10:08:34 | | Ketchup901 quits [Remote host closed the connection] |
10:08:47 | | Ketchup901 (Ketchup901) joins |
10:08:56 | | SootBector quits [Remote host closed the connection] |
10:09:20 | | SootBector (SootBector) joins |
10:38:52 | | arch quits [Remote host closed the connection] |
10:39:09 | | arch joins |
11:00:04 | | Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:51 | | Bleo18260072271962345 joins |
11:35:23 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
11:35:52 | | SkilledAlpaca418962 joins |
11:38:08 | | Wohlstand quits [Ping timeout: 260 seconds] |
11:38:10 | | FiTheArchiver joins |
11:59:57 | | BornOn420 quits [Remote host closed the connection] |
12:00:38 | | BornOn420 (BornOn420) joins |
12:26:58 | | loug83181422 joins |
13:23:10 | | @Sanqui quits [Ping timeout: 250 seconds] |
13:31:09 | | Bjorn1978_2 joins |
13:31:59 | | Bjorn1978_2 quits [Client Quit] |
13:32:35 | | Bjorn1978_2 joins |
13:37:59 | | Sanqui joins |
13:39:06 | <Bjorn1978_2> | US Government backup issue with version: |
13:39:06 | <Bjorn1978_2> | I got an error earlier today on my two computers. One running Docker, and one running Oracle VirtualBox. |
13:39:06 | <Bjorn1978_2> | Staring at my desktop I tried to restart Docker, but no help at all. I also tried to force an update, wait an hour and everything else I was able to think of. |
13:39:06 | <Bjorn1978_2> | I then uninstalled all containers to remove both WatchTower and the ArchiveTeam containers by using the garbage bin/delete function on the front page of docker. |
13:39:06 | <Bjorn1978_2> | When trying to install it again, I am stuck at a message on "project code is out of date". |
13:39:06 | | Bjorn1978_2 quits [Excess Flood] |
13:39:12 | | Bjorn1978_2 joins |
13:39:23 | | Sanqui is now authenticated as Sanqui |
13:39:23 | | Sanqui quits [Changing host] |
13:39:23 | | Sanqui (Sanqui) joins |
13:39:23 | | @ChanServ sets mode: +o Sanqui |
13:39:39 | | Bjorn1978_2 quits [Excess Flood] |
13:39:47 | | Bjorn1978_2 joins |
13:40:41 | <pabs> | Bjorn1978_2: USG project is paused due to archive.org intake issues IIRC |
13:43:23 | <Bjorn1978_2> | Ahhhh... Shit... |
13:43:23 | <Bjorn1978_2> | Do we have any timeline on fixing this? |
13:43:23 | <Bjorn1978_2> | I mailed the national Norwegian TV broadcasting company about this project as a response to a news article about unrest within the scientific communities due to data sources being removed. And they want to run a piece on this. But if it is paused... That sort of takes the edge off the article :D |
13:47:38 | | APOLLO03 quits [Ping timeout: 260 seconds] |
13:57:37 | <pabs> | ah, I'm wrong. not paused but slowed down. the channel is #UncleSamsArchive btw |
13:58:02 | <Bjorn1978_2> | ok |
14:20:48 | | katocala quits [Ping timeout: 250 seconds] |
14:35:24 | | Bjorn1978_2 quits [Client Quit] |
14:52:19 | | APOLLO03 joins |
15:17:28 | | tek_dmn quits [Ping timeout: 260 seconds] |
15:17:51 | | Wohlstand (Wohlstand) joins |
15:20:10 | | BennyOtt_ joins |
15:22:43 | | Wohlstand quits [Ping timeout: 260 seconds] |
15:23:38 | | BennyOtt quits [Ping timeout: 250 seconds] |
15:23:38 | | BennyOtt_ is now known as BennyOtt |
15:23:39 | | BennyOtt is now authenticated as BennyOtt |
15:32:59 | | Island joins |
15:34:12 | | HugsNotDrugs joins |
15:36:41 | | Megame (Megame) joins |
15:36:57 | | Naruyoko joins |
15:40:06 | | Naruyoko5 quits [Ping timeout: 250 seconds] |
16:18:00 | <c3manu> | pabs: thanks <3 |
16:24:48 | | HP_Archivist (HP_Archivist) joins |
16:26:24 | <@arkiver> | is someone able to come up with a channel name for ss-blog.jp ? |
16:34:14 | <that_lurker> | hmm. that is a hard site to make a channel name for |
16:36:04 | <@arkiver> | also the site has some human verification stuff here and there... |
16:36:22 | | tek_dmn (tek_dmn) joins |
16:36:32 | <that_lurker> | something like #ssscribe might work |
16:38:13 | <@arkiver> | maybe yeah |
16:38:32 | <@arkiver> | perhaps we have someone here who can read the language and make a funny version or pun on it |
16:38:36 | | nine quits [Quit: See ya!] |
16:47:21 | <@arkiver> | ... and maho.jp will not work well in the Wayback Machine due to graphql POST requests |
17:18:42 | <Vokun> | the SS stands for seesaa, which doesn't really mean anything, but #seensaw? |
17:18:52 | <Vokun> | #seensaa |
17:24:42 | <that_lurker> | yeah. Might not be a good idea to have SS in the name so those work well. |
17:25:18 | <FiTheArchiver> | yeah i like the seesaw idea |
18:01:41 | <pokechu22> | Yeah, that seems like a better idea to me too |
18:02:24 | | sparky14920 (sparky1492) joins |
18:06:08 | | sparky1492 quits [Ping timeout: 250 seconds] |
18:06:09 | | sparky14920 is now known as sparky1492 |
18:23:24 | <h2ibot> | VoynichCr created OpenAlex (+288, Created page with "'''OpenAlex''' is a…): https://wiki.archiveteam.org/?title=OpenAlex |
18:23:25 | <h2ibot> | VoynichCr edited OpenAlex (+167): https://wiki.archiveteam.org/?diff=54880&oldid=54879 |
18:24:24 | <h2ibot> | VoynichCr created Bibliographic catalogue (+45, Redirected page to [[List of bibliographic…): https://wiki.archiveteam.org/?title=Bibliographic%20catalogue |
18:26:24 | <h2ibot> | VoynichCr edited OpenAlex (+501): https://wiki.archiveteam.org/?diff=54882&oldid=54880 |
18:26:25 | <h2ibot> | VoynichCr edited OpenAlex (+394): https://wiki.archiveteam.org/?diff=54883&oldid=54882 |
18:27:24 | <h2ibot> | VoynichCr edited Microsoft Academic (+35, /* External links */): https://wiki.archiveteam.org/?diff=54884&oldid=54862 |
18:27:25 | <h2ibot> | VoynichCr edited OpenAlex (+25, /* External links */): https://wiki.archiveteam.org/?diff=54885&oldid=54883 |
18:32:16 | | ell7 quits [Quit: Ping timeout (120 seconds)] |
18:32:19 | | ell70 (ell) joins |
18:37:26 | <h2ibot> | VoynichCr edited Template:Category archives by year (+4, link to year): https://wiki.archiveteam.org/?diff=54886&oldid=54560 |
18:38:26 | <h2ibot> | VoynichCr created 2022 (+207, Created page with "{{Years}} == Events == *…): https://wiki.archiveteam.org/?title=2022 |
18:40:26 | <h2ibot> | VoynichCr uploaded File:OpenAlex.png: https://wiki.archiveteam.org/?title=File%3AOpenAlex.png |
18:41:27 | <h2ibot> | VoynichCr edited OpenAlex (+12): https://wiki.archiveteam.org/?diff=54889&oldid=54885 |
18:41:28 | <h2ibot> | VoynichCr edited OpenAlex (+4): https://wiki.archiveteam.org/?diff=54890&oldid=54889 |
18:42:27 | <h2ibot> | VoynichCr edited Wikidata (+26, | language = Multilingual): https://wiki.archiveteam.org/?diff=54891&oldid=50334 |
18:42:28 | <h2ibot> | VoynichCr edited Wikimedia Commons (+26, | language = Multilingual): https://wiki.archiveteam.org/?diff=54892&oldid=53487 |
18:43:23 | <c3manu> | arkiver, that_lurker #sos-blog ? |
18:44:43 | <FiTheArchiver> | wait yeah that ones good |
18:46:27 | <h2ibot> | VoynichCr created Talk:Wikimedia Commons (+206, Created page with "I think we could run the…): https://wiki.archiveteam.org/?title=Talk%3AWikimedia%20Commons |
18:50:35 | | VoynichCR (VoynichCR) joins |
18:51:26 | | Wohlstand (Wohlstand) joins |
18:57:58 | | VoynichCR quits [Client Quit] |
19:04:57 | | BornOn420 quits [Remote host closed the connection] |
19:05:35 | | BornOn420 (BornOn420) joins |
19:06:31 | <h2ibot> | VoynichCr edited List of bibliographic databases (+37): https://wiki.archiveteam.org/?diff=54894&oldid=54861 |
19:08:45 | | Webuser555700 joins |
19:09:27 | | Webuser555700 quits [Client Quit] |
20:01:22 | | sparky14923 (sparky1492) joins |
20:05:03 | | sparky1492 quits [Ping timeout: 260 seconds] |
20:05:04 | | sparky14923 is now known as sparky1492 |
20:06:10 | | pokechu22 quits [Ping timeout: 250 seconds] |
20:08:47 | | pokechu22 (pokechu22) joins |
20:12:55 | <chrismrtn> | Looks like the blog URLs end with blog.so-net.ne.jp or blog.ss-blog.jp, and I get more CAPTCHAs when using *.blog.so-net.ne.jp. (Either works: *.blog.so-net.ne.jp seems to redirect to *.blog.ss-blog.jp) |
20:24:53 | | Wohlstand quits [Client Quit] |
20:32:00 | | SootBector quits [Remote host closed the connection] |
20:32:23 | | SootBector (SootBector) joins |
20:57:36 | | IDK (IDK) joins |
21:02:32 | | notarobot11 joins |
21:03:58 | | notarobot1 quits [Ping timeout: 260 seconds] |
21:03:59 | | notarobot11 is now known as notarobot1 |
21:09:40 | | etnguyen03 (etnguyen03) joins |
21:11:56 | <h2ibot> | Exorcism edited Bugzilla (-31, /* Status */): https://wiki.archiveteam.org/?diff=54895&oldid=54825 |
22:32:46 | <nicolas17> | how are the targets doing now? |
22:32:55 | <nicolas17> | I'm having no problems uploading to youtube anymore |
22:56:33 | | Megame quits [Ping timeout: 260 seconds] |
22:56:49 | | Megame1 (Megame) joins |
23:27:04 | | loug83181422 quits [Quit: The Lounge - https://thelounge.chat] |
23:30:10 | | FiTheArchiver quits [Read error: Connection reset by peer] |
23:34:51 | | Megame1 quits [Remote host closed the connection] |
23:35:11 | | Megame1 (Megame) joins |
23:38:34 | | simon816 quits [Quit: ZNC 1.9.1 - https://znc.in] |
23:39:12 | | etnguyen03 quits [Client Quit] |
23:40:51 | | simon816 (simon816) joins |
23:47:06 | | gust joins |
23:54:54 | | nine joins |
23:54:54 | | nine is now authenticated as nine |
23:54:54 | | nine quits [Changing host] |
23:54:54 | | nine (nine) joins |