00:26:22etnguyen03 quits [Client Quit]
00:28:47etnguyen03 (etnguyen03) joins
00:29:20JRF joins
00:47:39etnguyen03 quits [Client Quit]
01:29:18etnguyen03 (etnguyen03) joins
01:51:40hackbug quits [Remote host closed the connection]
02:12:10etnguyen03 quits [Client Quit]
02:14:00hackbug (hackbug) joins
02:29:18Lunarian1 is now known as LunarianBunny1147
03:40:07etnguyen03 (etnguyen03) joins
04:04:53etnguyen03 quits [Remote host closed the connection]
04:51:47katocala quits [Ping timeout: 276 seconds]
04:51:59katocala joins
04:55:02<nicolas17>https://data.nicolas17.xyz/samsung-grab/ 10 files pending :)
05:12:38zzzpear_ joins
05:14:08<zzzpear_>https://web.archive.org/web/20200419171525/https://www.reddit.com/r/libgen/comments/6f8a57/can_i_download_all_libgen_books/
05:14:24<zzzpear_>https://web.archive.org/web/20200419171525/ftp://ftp.libgen.io/code/
05:14:43<zzzpear_>where is that -------ftp://ftp.libgen.io/code/
05:16:19<pabs>arkiver: no rate limiting that I can see, but some domains timeout, and others redirect to a 404 URL
05:16:43<pabs>arkiver: not all the domains need googlebot, only the diary one IIRC
05:17:59<pabs>Medaka and cruller did the URL lists, which we auto-converted to sitemaps
05:18:48<pabs>Deathwatch has the list of domains possibly others too, I didn't look into it
05:47:03katocala quits [Remote host closed the connection]
05:48:45croissant_ is now known as croissant
06:13:20<cruller>k[1-2]?.fc2.com need googlebot too. (It is properly configured in the active job 7fo1nr7vwcopnz36ujncvhhcw.)
06:20:41hexagonwin (hexagonwin) joins
06:22:46<cruller>I hate to say it, but I noticed that some users of k[1-2]?.fc2.com also have http://kbbs1.fc2.com/cgi-bin/b.cgi/{username}/ and http://kdiary1.fc2.com/cgi-bin/d.cgi/{username}/.
06:26:16<cruller>These need googlebot too.
07:12:15Island quits [Read error: Connection reset by peer]
07:26:23Webuser593237 joins
07:26:35Webuser593237 quits [Client Quit]
07:33:12cruller uploaded an image: (88KiB) < https://matrix.hackint.org/_irc/v1/media/download/ATZJsT9OadZLHw_vOxQ51jnuRXDNQwpnu7sELV2bMVI2KSn6QwwYqfu0eiGHEty4IyegTrAHSaotvcVp0K1LzmpCfgki4KNwAGltYWdpc3BoZS5yZS9ZaUdqeEZIR29yYnhlYm1lZlBPQ1RDRUk >
07:33:34<cruller>To summarize about FC2, it looks roughly like this.
07:35:16<cruller>CSV of the table: https://transfer.archivete.am/VToVj/Summary%20about%20FC2.csv
07:35:16<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/VToVj/Summary%20about%20FC2.csv
08:13:21midou quits [Remote host closed the connection]
08:14:10midou joins
08:56:19<@arkiver>cruller: did you extract users from the icon.fc2.com site?
09:13:30Dada joins
09:13:41<cruller>No, I didn't. The service itself doesn't have any user pages, but external links to users' FC2 blogs.
09:47:23<@arkiver>right and those blogs are staying for now
09:48:03<@arkiver>cruller: do you know a pun on "fc2"? we can make a channel
09:48:50tzt quits [Ping timeout: 276 seconds]
10:00:39tzt (tzt) joins
10:02:05<@arkiver>FC2 project coming up shortly
10:20:42Webuser233103 joins
10:20:50Webuser233103 quits [Client Quit]
10:26:22<@arkiver>cruller: when you say saved, does that mean with ArchiveBot?
10:33:10<cruller>This is not to say that it is always so.
10:35:08<h2ibot>Cvolton edited EStránky.cz (+86, +details on site retention): https://wiki.archiveteam.org/?diff=55834&oldid=30524
10:35:09<h2ibot>Nintendofan885 edited Voice of America (+9, fix collection): https://wiki.archiveteam.org/?diff=55835&oldid=55755
10:35:10<h2ibot>JRF edited Glitch (+1029, Add some more info): https://wiki.archiveteam.org/?diff=55836&oldid=55783
10:35:46<cruller>If you're talking about "So, I first saved http://diary.fc2.com/cgi-sys/ed.cgi/{user_ID}/, checked which months existed from there, and saved only http://diary.fc2.com/cgi-sys/ed.cgi/{user_ID}/?Y={year}&M ={month}&all=1 and its requisites." it's a personal archive (with grab-site on my PC).
10:36:08<h2ibot>Arkiver uploaded File:Fc2-icon.png: https://wiki.archiveteam.org/?title=File%3AFc2-icon.png
10:39:33<cruller><arkiver> "koichi: do you know a pun on "fc..." <- This one is a bit difficult. How about (if you haven't already created a channel) “ForbiddenCravingCenter”?
10:41:37<cruller>FC2 has long been famous for its NSFW content.
10:43:09<h2ibot>Arkiver uploaded File:Apps.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AApps.fc2.com%20screenshot.png
10:43:10<h2ibot>Arkiver uploaded File:Diary.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3ADiary.fc2.com%20screenshot.png
10:43:11<h2ibot>Arkiver uploaded File:Icon.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AIcon.fc2.com%20screenshot.png
10:43:12<h2ibot>Arkiver uploaded File:K.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AK.fc2.com%20screenshot.png
10:43:13<h2ibot>Arkiver uploaded File:Piyo.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3APiyo.fc2.com%20screenshot.png
10:43:14<h2ibot>Arkiver uploaded File:Pr.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3APr.fc2.com%20screenshot.png
10:44:10<h2ibot>Arkiver uploaded File:Vote.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AVote.fc2.com%20screenshot.png
10:50:00<@arkiver>cruller: do you have examples of the NSFW content? is it all behind a login always?
10:50:22<@arkiver>(we totally archive NSFW stuff, that's not the problem, but i wonder if some of it at least is publicly accessible)
10:50:53<@arkiver>cruller: i'm going to go back in the logs a bit, but can you also remind me of all the URL lists around fc2.com? any URLs related to it
10:55:59<@arkiver>pabs: JAA: how can i view the ignore patterns set for jobs in AB?
11:00:03Bleo182600722719623455 quits [Quit: The Lounge - https://thelounge.chat]
11:01:45arch quits [Remote host closed the connection]
11:01:53arch joins
11:02:03arch quits [Remote host closed the connection]
11:02:11arch joins
11:02:45Bleo182600722719623455 joins
11:04:56<cruller>In this case, the following clearly have a NSFW zone.... (full message at <https://matrix.hackint.org/_irc/v1/media/download/ATUxY1EzxiVYKVxaBa7aH6aNGw_eMAoSJWlp5N4cTdAGwmD_LKU2sdxtOfgQ_Jee8iHQrBajeUC9dUCF0adZVBpCfgku_f_QAGhhY2tpbnQub3JnL0VMaGdwU0x6ZHBHeGlUdlZHTlB5RmZXbQ>)
11:05:05<cruller>oops
11:13:37<cruller>FC2SNS, FC2アプリ, and FC2旧無料ホームページスペース clearly have a NSFW zone. Restrictions are as shown in the previous table.
11:15:49<cruller>For FC2アプリ, answering yes on https://apps.fc2.com/adult_cushion and using cookie is a solution
11:20:48<cruller>FC2投票 has a NSFW zone too, but there is no restrictions. So I think it's already archived by Archivebot.
11:30:15<cruller>FC2ケータイホームページ clearly has an NSFW area (http://k.fc2.com/a/*). They also required the cookie as well as FC2アプリ. I saved them to WM with Save Page Now.
11:33:18<@arkiver>cruller: have you found significant slowdowns or other signs the server cannot handle large numbers of requests?
11:36:17<cruller>No, I haven't.
11:38:39<cruller><cruller> "In this case, the following..." <- FC2プロフ doesn't have NSFW. Sorry for the mistake.
11:41:54<@arkiver>cruller: can you give me an example of NSFW on FC2投票? you can post here or DM me if you're uncomfortable with that
11:44:51<cruller>https://vote1.fc2.com/result/38690823/23/
11:48:44<@arkiver>cruller: on your screenshot of the table i see a reference "kbbs" and "kdiary" in considerations for k.fc2.com, do you ave examples of those two?
11:50:01<@arkiver>actually, i can find them though google
11:51:38<@arkiver>but, do you have an example of a k.fc2.com URL that links to kbbs or kdiary?
11:53:21<cruller>http://kdiary1.fc2.com/cgi-bin/d.cgi/haightashbury/
11:53:21<cruller>http://kbbs1.fc2.com/cgi-bin/b.cgi/haightashbury/
11:53:21<cruller>linked from
11:53:21<cruller>http://k2.fc2.com/cgi-bin/hp.cgi/haightashbury/
11:54:44<@arkiver>perfect
11:54:48<@arkiver>i think we're nearly ready
11:54:57<@arkiver>final tests now
11:55:38<@arkiver>cruller: have you noticed any rate limiting or error status codes before on fc2?
12:04:31<cruller>Redirection to https://error.fc2.com/* is likely a sign of a problem. Except that, I haven't.
12:04:44<@arkiver>alright
12:07:10<@arkiver>we'll do #forbiddencravingcenter for FC2
12:18:33<cruller>I see. Thank you for all your efforts and hard work, ArchiveTeam!
12:18:45etnguyen03 (etnguyen03) joins
12:24:59Webuser752127 joins
12:25:09Webuser752127 quits [Client Quit]
12:27:05<@arkiver>cruller: thanks :)
12:27:16<@arkiver>you could join #forbiddencravingcenter perhaps
12:28:27nepeat quits [Quit: ZNC - https://znc.in]
12:32:40nepeat (nepeat) joins
12:53:57etnguyen03 quits [Client Quit]
12:55:47driib9 quits [Quit: The Lounge - https://thelounge.chat]
12:56:20driib9 (driib) joins
13:01:58<pabs>arkiver: http://archivebot.com/ignores/7qwz68jnobcw90l4utbhabotd?compact=true
13:02:32driib9 quits [Client Quit]
13:02:36Naruyoko joins
13:03:00driib9 (driib) joins
13:03:34<pabs>(the first one is ignoring offsite)
13:03:52<pabs>and the rest are ignoring offsite broken stuff
13:03:53driib9 quits [Client Quit]
13:04:26Naruyoko5 quits [Ping timeout: 260 seconds]
13:04:37<pabs>one thing AB can't ignore is redirects to http://error.fc2.com/web/404.html
13:04:44Naruyoko5 joins
13:04:45<pabs>there are a lot of those
13:05:03driib9 (driib) joins
13:07:44Naruyoko quits [Ping timeout: 276 seconds]
13:21:39driib9 quits [Client Quit]
13:22:46driib9 (driib) joins
13:24:54katocala joins
13:32:47corentin quits [Quit: The Lounge - https://thelounge.chat]
13:47:18fangfufu quits [Quit: ZNC 1.8.2+deb3.1+deb12u1 - https://znc.in]
13:52:09fangfufu joins
14:31:26Kenshin quits [Quit: ZNC - http://znc.in]
14:31:35Kenshin joins
14:35:48grill (grill) joins
14:56:03corentin joins
14:59:25arch quits [Remote host closed the connection]
14:59:36arch joins
15:00:33arch quits [Remote host closed the connection]
15:01:03arch joins
15:19:11corentin quits [Ping timeout: 260 seconds]
15:36:35grill quits [Ping timeout: 276 seconds]
15:37:42grill (grill) joins
15:39:15Sluggs quits [Excess Flood]
15:41:33Sluggs joins
15:45:03arch quits [Remote host closed the connection]
15:45:11arch joins
15:49:15<katia>kiska, kiska52 fyi fc2 project started
15:55:11<katia>also should it be set as default for warrior?
15:58:30ducky quits [Remote host closed the connection]
15:59:00ducky (ducky) joins
16:00:11Sluggs quits [Changing host]
16:00:11Sluggs (Sluggs) joins
16:28:33<nstrom|m>We're already hitting tracker rate limiting so we probably don't need to unless something changes
16:30:01nine quits [Quit: See ya!]
16:30:15nine joins
16:30:15nine quits [Changing host]
16:30:15nine (nine) joins
16:48:14<h2ibot>Hans5958 edited Twitch.tv (+160, Add new section for 2025): https://wiki.archiveteam.org/?diff=55845&oldid=55720
16:48:15<h2ibot>Hans5958 edited Twitch.tv (+7, Add year): https://wiki.archiveteam.org/?diff=55846&oldid=55845
16:49:41<kiska>katia: You should check grafana list before telling me, I had started the wss listener 24h ago
16:51:21magmaus3 quits [Ping timeout: 260 seconds]
16:53:37<Hans5958>Can I put US Government, VOA, RFE/RL, Twitch, and Livestream to "Medium-term projects"? Is Goo.gl completed?
16:54:59<Hans5958>I'm regerring to the Current projects on the Main Page
16:55:02<Hans5958>*referring
16:55:15<h2ibot>Hans5958 edited Current Projects (-111, Pull Retrospring to finished, remove older…): https://wiki.archiveteam.org/?diff=55847&oldid=55754
16:55:32<Hans5958>Now I'm just moving Retrospring to finished
17:05:17<h2ibot>Hans5958 edited Current Projects (+0, Move goo-gl as it has 0 TODO): https://wiki.archiveteam.org/?diff=55848&oldid=55847
17:05:18<Hans5958>I moved Goo.gl to finished also (I saw both Retrospring and Goo.gl had 0 TODO)
17:05:54magmaus3 (magmaus3) joins
17:08:52<Hans5958>Nvm putting goo.gl on medium since there's still some discussion
17:09:17<h2ibot>Hans5958 edited Current Projects (-18, Goo.gl still had some discussion): https://wiki.archiveteam.org/?diff=55849&oldid=55848
17:27:05grill quits [Ping timeout: 276 seconds]
17:28:24grill (grill) joins
17:31:04<katia>kiska, just saw nobody said anything about it to you so wanted to make sure :).
17:32:07<kiska>You should check your logs then:
17:32:07<kiska>[2025-05-30T17:29:41.698Z] <kiska> fc2 wss listening
17:34:04@dxrt quits [Remote host closed the connection]
17:34:10Ryz quits [Read error: Connection reset by peer]
17:34:14grill quits [Ping timeout: 276 seconds]
17:34:30dxrt joins
17:34:32dxrt quits [Changing host]
17:34:32dxrt (dxrt) joins
17:34:32@ChanServ sets mode: +o dxrt
17:35:02Ryz (Ryz) joins
17:35:23grill (grill) joins
17:44:26eggdrop quits [Ping timeout: 260 seconds]
17:58:31Wohlstand (Wohlstand) joins
18:00:54<katia>kiska, ...i'll just leave you to it in the future...
18:32:51hackbug quits [Ping timeout: 260 seconds]
18:35:51jspiros quits []
18:42:03jspiros (jspiros) joins
18:50:09midou quits [Remote host closed the connection]
18:50:17midou joins
19:00:48riteo (riteo) joins
19:16:31BornOn420 quits [Remote host closed the connection]
19:17:04BornOn420 (BornOn420) joins
19:20:19eggdrop (eggdrop) joins
19:25:59JRF quits [Quit: Ooops, wrong browser tab.]
19:29:26grill quits [Ping timeout: 260 seconds]
19:31:14nepeat quits [Ping timeout: 276 seconds]
19:32:25nepeat (nepeat) joins
19:33:11jspiros quits [Ping timeout: 276 seconds]
19:34:17jspiros (jspiros) joins
19:48:06tzt quits [Ping timeout: 260 seconds]
19:50:49Radzig2 joins
19:52:46Radzig quits [Ping timeout: 260 seconds]
19:52:46Radzig2 is now known as Radzig
19:56:58APOLLO03 joins
19:58:36APOLLO_03 quits [Ping timeout: 260 seconds]
20:08:56Naruyoko5 quits [Ping timeout: 276 seconds]
20:09:16Naruyoko joins
20:16:37JRF joins
20:18:12Naruyoko quits [Remote host closed the connection]
20:18:29Naruyoko joins
20:20:20tzt (tzt) joins
20:38:33etnguyen03 (etnguyen03) joins
20:49:53midou quits [Ping timeout: 276 seconds]
20:52:34APOLLO03 quits [Client Quit]
21:01:44chunkynutz68 quits [Read error: Connection reset by peer]
21:01:57chunkynutz6 joins
21:51:32Wohlstand quits [Quit: Wohlstand]
21:54:53etnguyen03 quits [Client Quit]
21:56:21legoktm quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
21:57:00legoktm joins
22:07:39useretail joins
22:32:41jspiros quits [Client Quit]
22:39:10jspiros (jspiros) joins
23:00:45threedeeitguy6 quits [Quit: The Lounge - https://thelounge.chat]
23:01:22threedeeitguy6 (threedeeitguy) joins
23:10:56jspiros quits [Ping timeout: 276 seconds]
23:12:34DogsRNice joins
23:13:44jspiros (jspiros) joins
23:23:33Dada quits [Remote host closed the connection]
23:37:26tek_dmn quits [Quit: ZNC - https://znc.in]
23:39:47dabs joins
23:41:50tek_dmn (tek_dmn) joins
23:44:45midou joins
23:51:40okay joins
23:58:56nine quits [Ping timeout: 260 seconds]