00:02:46Unholy236131 (Unholy2361) joins
00:05:59Unholy23613 quits [Ping timeout: 252 seconds]
00:05:59Unholy236131 is now known as Unholy23613
00:11:49BlueMaxima joins
00:30:10decky_e_ quits [Read error: Connection reset by peer]
00:32:15lk quits [Ping timeout: 265 seconds]
00:34:33lk (lk) joins
00:36:30ats (ats) joins
00:42:17lk quits [Ping timeout: 252 seconds]
00:42:36lk (lk) joins
00:47:29Mateon2 joins
00:48:53Mateon1 quits [Ping timeout: 252 seconds]
00:48:53Mateon2 is now known as Mateon1
00:56:05csm10495 joins
01:07:02Hackerpcs quits [Quit: Hackerpcs]
01:08:59Hackerpcs (Hackerpcs) joins
01:18:13za3k joins
01:32:20lk quits [Ping timeout: 252 seconds]
01:32:34lk (lk) joins
01:37:51decky_e joins
01:38:49tzt quits [Ping timeout: 258 seconds]
01:40:13dumbgoy quits [Read error: Connection reset by peer]
01:43:00tzt (tzt) joins
01:47:44lk quits [Ping timeout: 252 seconds]
01:49:24lk (lk) joins
02:16:52icedice quits [Client Quit]
02:35:33lk quits [Ping timeout: 258 seconds]
02:35:52lk (lk) joins
02:36:26za3k quits [Client Quit]
02:39:42dumbgoy joins
02:48:14dumbgoy quits [Ping timeout: 252 seconds]
02:51:17csm10495 quits [Remote host closed the connection]
02:52:02lk quits [Ping timeout: 258 seconds]
02:53:41lk (lk) joins
03:19:45nulldata quits [Client Quit]
03:20:54nulldata (nulldata) joins
03:28:04lk quits [Ping timeout: 258 seconds]
03:28:23lk (lk) joins
03:50:02benjins quits [Read error: Connection reset by peer]
03:53:48Icyelut (Icyelut) joins
03:56:59lk quits [Ping timeout: 252 seconds]
03:57:44lk (lk) joins
03:57:53Icyelut|2 (Icyelut) joins
03:58:02benjins joins
04:02:01Icyelut quits [Ping timeout: 265 seconds]
04:02:30lk quits [Ping timeout: 265 seconds]
04:02:52lk (lk) joins
04:10:48katocala quits [Remote host closed the connection]
04:11:23lk quits [Ping timeout: 258 seconds]
04:11:46lk (lk) joins
04:14:16nicolas17 quits [Client Quit]
04:15:33nulldata quits [Client Quit]
04:15:57nulldata (nulldata) joins
04:18:21cobertos quits [Remote host closed the connection]
04:19:04cobertos joins
04:33:21cobertos quits [Remote host closed the connection]
04:33:55lk quits [Ping timeout: 265 seconds]
04:33:59cobertos joins
04:34:42lk (lk) joins
04:44:04benjins2 quits [Read error: Connection reset by peer]
04:44:21cobertos quits [Remote host closed the connection]
04:44:39cobertos joins
04:52:49Wohlstand (Wohlstand) joins
04:55:00katocala joins
04:55:39sec^nd quits [Remote host closed the connection]
04:56:12sec^nd (second) joins
05:00:02HP_Archivist quits [Read error: Connection reset by peer]
05:17:53HackMii quits [Remote host closed the connection]
05:18:20HackMii (hacktheplanet) joins
05:33:12BlueMaxima quits [Read error: Connection reset by peer]
05:44:27Wohlstand quits [Client Quit]
05:59:11cobertos_ joins
05:59:57cobertos quits [Ping timeout: 265 seconds]
06:10:17hitgrr8 joins
06:11:30Carnildo_again is now known as Carnildo
06:27:44Justin[home] quits [Remote host closed the connection]
06:47:00DopefishJustin joins
07:31:07Arcorann (Arcorann) joins
07:33:56Wohlstand (Wohlstand) joins
07:43:33BigBrain_ (bigbrain) joins
07:47:06BigBrain quits [Ping timeout: 245 seconds]
08:43:32Naruyoko quits [Ping timeout: 252 seconds]
08:56:24W7RFa6AbNFz_ quits [Read error: Connection reset by peer]
08:56:37W7RFa6AbNFz_ joins
09:03:28mls (mls) joins
09:09:14hitgrr8 quits [Client Quit]
09:46:30agrecascino_ joins
09:46:45agrecascino quits [Client Quit]
10:04:06mls quits [Client Quit]
10:21:21mexat2 joins
10:22:55<mexat2>does archivebot have space for 7891 mini-blog entries? they're hosted on a site that have no/very little activity for the past 3 years and may shutdown anytime
10:25:04<mexat2>I have my own vps that's running AT's Warrior if you want to use Grab (Distributed ArchiveBot) so I can be of extra help to you!
10:25:12mexat2 quits [Remote host closed the connection]
10:25:46<Exorcism|m>-> #archiveteam-bs
10:26:20<Exorcism|m>this channel is for announcements
10:26:32mexat2 joins
10:36:15hitgrr8 joins
10:53:08mexat2 quits [Remote host closed the connection]
11:26:28VickoSaviour joins
11:27:02yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/]
11:27:21yano (yano) joins
11:33:29pabs quits [Ping timeout: 252 seconds]
11:46:40pabs (pabs) joins
12:10:24hexagonwin joins
12:15:31<hexagonwin>Hi all. Firstly I'm not sure if I'm in the right place, if not please tell. There was a Wordpress.com/Blogger-style blogging website named Egloos in South Korea, which terminated service in June 16. Me and a friend of mine made some efforts to backup their entire posts and CDN since a few months ago, luckily the site is pretty old and not many
12:15:32<hexagonwin>anti-crawling measures in place so we did get a lot of posts. They have a nice XML-based API that shows list of posts/each post/comments so we wrote scripts and such to download blogs, and we found names for blog via many methods (Blogs are in address like username.egloos.com). However we just found out that we're missing quite a lot of blogs,
12:15:32<hexagonwin>Google seems to have a very good index of *.egloos.com, can anyone please help fetching each domain name (*.egloos.com) from Google? Google seems to do rate-limiting and stuff, and I'm actually really new to archiving so that would help... Though egloos terminated service their API still works via HTTPS (probably misconfigured router)..
12:32:04<@OrIdow6>Hello hexagonwin, we knew about egloos as well and attempted our own grab project (and we do encourage people to tell us about all sites that shut down); scraping Google is generally (from what I understand) fairly difficult, but we have other methods to list domains (in addition to our own list); so perhaps we at least have some information to share with each other
12:32:27<@OrIdow6>We knew RSS feeds were still up but not the API
12:32:43<@OrIdow6>Anyhow the project-specific channel is #eggos
12:34:13<hexagonwin>Hello OrIdow6, thanks for the reply. There appears to be a router setting (?) trouble in the Egloos side (lol), when you request via HTTPS and try connection multiple times it returns proper output instead of redirection, so we used a bash script to loop until curl's status code is 200
12:34:29Darken (Darken) joins
12:35:05Darken quits [Remote host closed the connection]
12:53:47AmAnd0A quits [Ping timeout: 252 seconds]
12:54:23AmAnd0A joins
12:56:58benjins2 joins
13:00:15AmAnd0A quits [Read error: Connection reset by peer]
13:00:31AmAnd0A joins
13:10:50Unholy23613 quits [Ping timeout: 252 seconds]
13:13:59HackMii quits [Remote host closed the connection]
13:14:32HackMii (hacktheplanet) joins
13:24:06<@OrIdow6>If there is anyone who knows how to SERP scrape Google in 2023 BTW, I'm sure they (and #egloos) would appreciate it
13:24:14<@OrIdow6>*#eggos
13:28:58VickoSaviour leaves
13:53:10driib quits [Quit: The Lounge - https://thelounge.chat]
13:54:29driib (driib) joins
13:59:08katocala quits [Remote host closed the connection]
14:06:56Arcorann quits [Ping timeout: 252 seconds]
14:17:47Wohlstand quits [Ping timeout: 265 seconds]
14:18:22katocala joins
14:51:36ave9 (ave) joins
14:51:47yano1 (yano) joins
14:51:55monkeykong1 joins
14:51:56TastyWiener95 quits [Client Quit]
14:51:56ave quits [Quit: Ping timeout (120 seconds)]
14:51:56yano quits [Remote host closed the connection]
14:51:56chrismeller3 quits [Client Quit]
14:51:56datechnoman quits [Quit: Ping timeout (120 seconds)]
14:51:56monkeykong quits [Client Quit]
14:51:56Mateon1 quits [Remote host closed the connection]
14:51:56IDK_ quits [Quit: Ping timeout (120 seconds)]
14:51:56ave9 is now known as ave
14:51:56monkeykong1 is now known as monkeykong
14:51:57chrismeller35 (chrismeller) joins
14:52:00TastyWiener951 (TastyWiener95) joins
14:52:02Mateon1 joins
14:52:08datechnoman (datechnoman) joins
14:52:08IDK_ joins
15:05:02Wohlstand (Wohlstand) joins
15:06:25Wohlstand1 (Wohlstand) joins
15:09:34Wohlstand quits [Ping timeout: 258 seconds]
15:09:34Wohlstand1 is now known as Wohlstand
15:21:37driib quits [Client Quit]
15:24:45Unholy23613 (Unholy2361) joins
15:36:16nostalgebraist joins
15:37:42driib (driib) joins
16:24:01icedice (icedice) joins
17:08:02nostalgebraist quits [Client Quit]
17:17:54hexagonwin quits [Remote host closed the connection]
17:48:27myself quits [Quit: The Lounge - https://thelounge.chat]
17:48:51nicolas17 joins
17:48:52myself (myself) joins
18:03:30icedice quits [Ping timeout: 265 seconds]
18:22:06sec^nd quits [Ping timeout: 245 seconds]
18:29:14dumbgoy joins
18:36:50Wohlstand quits [Client Quit]
18:51:25spirit joins
18:52:06T31M quits [Remote host closed the connection]
18:52:26T31M joins
18:53:54sec^nd (second) joins
19:58:32HP_Archivist (HP_Archivist) joins
21:00:07lennier2 joins
21:03:00lennier1 quits [Ping timeout: 258 seconds]
21:03:09lennier2 is now known as lennier1
21:03:25hitgrr8 quits [Client Quit]
21:08:57Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
21:17:41Terbium joins
21:30:24icedice (icedice) joins
21:34:26nicolas17 quits [Ping timeout: 258 seconds]
21:38:25nicolas17 joins
21:41:28sec^nd quits [Remote host closed the connection]
21:42:07sec^nd (second) joins
23:10:35will1|m is now known as will|m
23:26:02IDK (IDK) joins
23:30:40qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
23:52:57Megaweapon quits [Ping timeout: 265 seconds]
23:52:59Arcorann (Arcorann) joins