00:26:22 | | etnguyen03 quits [Client Quit] |
00:28:47 | | etnguyen03 (etnguyen03) joins |
00:29:20 | | JRF joins |
00:47:39 | | etnguyen03 quits [Client Quit] |
01:29:18 | | etnguyen03 (etnguyen03) joins |
01:51:40 | | hackbug quits [Remote host closed the connection] |
02:12:10 | | etnguyen03 quits [Client Quit] |
02:14:00 | | hackbug (hackbug) joins |
02:29:18 | | Lunarian1 is now known as LunarianBunny1147 |
03:40:07 | | etnguyen03 (etnguyen03) joins |
04:04:53 | | etnguyen03 quits [Remote host closed the connection] |
04:51:47 | | katocala quits [Ping timeout: 276 seconds] |
04:51:59 | | katocala joins |
04:55:02 | <nicolas17> | https://data.nicolas17.xyz/samsung-grab/ 10 files pending :) |
05:12:38 | | zzzpear_ joins |
05:14:08 | <zzzpear_> | https://web.archive.org/web/20200419171525/https://www.reddit.com/r/libgen/comments/6f8a57/can_i_download_all_libgen_books/ |
05:14:24 | <zzzpear_> | https://web.archive.org/web/20200419171525/ftp://ftp.libgen.io/code/ |
05:14:43 | <zzzpear_> | where is that -------ftp://ftp.libgen.io/code/ |
05:16:19 | <pabs> | arkiver: no rate limiting that I can see, but some domains timeout, and others redirect to a 404 URL |
05:16:43 | <pabs> | arkiver: not all the domains need googlebot, only the diary one IIRC |
05:17:59 | <pabs> | Medaka and cruller did the URL lists, which we auto-converted to sitemaps |
05:18:48 | <pabs> | Deathwatch has the list of domains, possibly others too; I didn't look into it |
05:47:03 | | katocala quits [Remote host closed the connection] |
05:48:45 | | croissant_ is now known as croissant |
06:13:20 | <cruller> | k[1-2]?.fc2.com need googlebot too. (It is properly configured in the active job 7fo1nr7vwcopnz36ujncvhhcw.) |
06:20:41 | | hexagonwin (hexagonwin) joins |
06:22:46 | <cruller> | I hate to say it, but I noticed that some users of k[1-2]?.fc2.com also have http://kbbs1.fc2.com/cgi-bin/b.cgi/{username}/ and http://kdiary1.fc2.com/cgi-bin/d.cgi/{username}/. |
06:26:16 | <cruller> | These need googlebot too. |
07:12:15 | | Island quits [Read error: Connection reset by peer] |
07:26:23 | | Webuser593237 joins |
07:26:35 | | Webuser593237 quits [Client Quit] |
07:33:12 | | cruller uploaded an image: (88KiB) < https://matrix.hackint.org/_irc/v1/media/download/ATZJsT9OadZLHw_vOxQ51jnuRXDNQwpnu7sELV2bMVI2KSn6QwwYqfu0eiGHEty4IyegTrAHSaotvcVp0K1LzmpCfgki4KNwAGltYWdpc3BoZS5yZS9ZaUdqeEZIR29yYnhlYm1lZlBPQ1RDRUk > |
07:33:34 | <cruller> | To summarize about FC2, it looks roughly like this. |
07:35:16 | <cruller> | CSV of the table: https://transfer.archivete.am/VToVj/Summary%20about%20FC2.csv |
07:35:16 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/VToVj/Summary%20about%20FC2.csv |
08:13:21 | | midou quits [Remote host closed the connection] |
08:14:10 | | midou joins |
08:56:19 | <@arkiver> | cruller: did you extract users from the icon.fc2.com site? |
09:13:30 | | Dada joins |
09:13:41 | <cruller> | No, I didn't. The service itself doesn't have any user pages, only external links to users' FC2 blogs. |
09:47:23 | <@arkiver> | right and those blogs are staying for now |
09:48:03 | <@arkiver> | cruller: do you know a pun on "fc2"? we can make a channel |
09:48:50 | | tzt quits [Ping timeout: 276 seconds] |
10:00:39 | | tzt (tzt) joins |
10:02:05 | <@arkiver> | FC2 project coming up shortly |
10:20:42 | | Webuser233103 joins |
10:20:50 | | Webuser233103 quits [Client Quit] |
10:26:22 | <@arkiver> | cruller: when you say saved, does that mean with ArchiveBot? |
10:33:10 | <cruller> | This is not to say that it is always so. |
10:35:08 | <h2ibot> | Cvolton edited EStránky.cz (+86, +details on site retention): https://wiki.archiveteam.org/?diff=55834&oldid=30524 |
10:35:09 | <h2ibot> | Nintendofan885 edited Voice of America (+9, fix collection): https://wiki.archiveteam.org/?diff=55835&oldid=55755 |
10:35:10 | <h2ibot> | JRF edited Glitch (+1029, Add some more info): https://wiki.archiveteam.org/?diff=55836&oldid=55783 |
10:35:46 | <cruller> | If you're talking about "So, I first saved http://diary.fc2.com/cgi-sys/ed.cgi/{user_ID}/, checked which months existed from there, and saved only http://diary.fc2.com/cgi-sys/ed.cgi/{user_ID}/?Y={year}&M={month}&all=1 and its requisites.", it's a personal archive (with grab-site on my PC). |
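A minimal sketch of the diary URL scheme described in the message above, assuming the quoted template is accurate. The function names and the example user ID are hypothetical, and the log does not say whether the Y and M values are zero-padded, so plain integers are an assumption here.

```python
# Sketch only: builds URLs from the template quoted in the log.
# Function names and the sample user ID are illustrative assumptions.
DIARY_BASE = "http://diary.fc2.com/cgi-sys/ed.cgi"


def diary_index_url(user_id: str) -> str:
    """URL of a user's diary front page, used to discover which months exist."""
    return f"{DIARY_BASE}/{user_id}/"


def diary_month_url(user_id: str, year: int, month: int) -> str:
    """URL listing all entries of one month (the all=1 variant from the log)."""
    if not 1 <= month <= 12:
        raise ValueError(f"month out of range: {month}")
    # Assumption: unpadded integers for Y and M.
    return f"{DIARY_BASE}/{user_id}/?Y={year}&M={month}&all=1"


if __name__ == "__main__":
    print(diary_index_url("example_user"))
    print(diary_month_url("example_user", 2020, 4))
```

Which months exist still has to be read from the index page itself; this sketch only covers URL construction.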
10:36:08 | <h2ibot> | Arkiver uploaded File:Fc2-icon.png: https://wiki.archiveteam.org/?title=File%3AFc2-icon.png |
10:39:33 | <cruller> | <arkiver> "koichi: do you know a pun on "fc..." <- This one is a bit difficult. How about (if you haven't already created a channel) “ForbiddenCravingCenter”? |
10:41:37 | <cruller> | FC2 has long been famous for its NSFW content. |
10:43:09 | <h2ibot> | Arkiver uploaded File:Apps.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AApps.fc2.com%20screenshot.png |
10:43:10 | <h2ibot> | Arkiver uploaded File:Diary.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3ADiary.fc2.com%20screenshot.png |
10:43:11 | <h2ibot> | Arkiver uploaded File:Icon.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AIcon.fc2.com%20screenshot.png |
10:43:12 | <h2ibot> | Arkiver uploaded File:K.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AK.fc2.com%20screenshot.png |
10:43:13 | <h2ibot> | Arkiver uploaded File:Piyo.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3APiyo.fc2.com%20screenshot.png |
10:43:14 | <h2ibot> | Arkiver uploaded File:Pr.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3APr.fc2.com%20screenshot.png |
10:44:10 | <h2ibot> | Arkiver uploaded File:Vote.fc2.com screenshot.png: https://wiki.archiveteam.org/?title=File%3AVote.fc2.com%20screenshot.png |
10:50:00 | <@arkiver> | cruller: do you have examples of the NSFW content? is it all behind a login always? |
10:50:22 | <@arkiver> | (we totally archive NSFW stuff, that's not the problem, but i wonder if some of it at least is publicly accessible) |
10:50:53 | <@arkiver> | cruller: i'm going to go back in the logs a bit, but can you also remind me of all the URL lists around fc2.com? any URLs related to it |
10:55:59 | <@arkiver> | pabs: JAA: how can i view the ignore patterns set for jobs in AB? |
11:00:03 | | Bleo182600722719623455 quits [Quit: The Lounge - https://thelounge.chat] |
11:01:45 | | arch quits [Remote host closed the connection] |
11:01:53 | | arch joins |
11:02:03 | | arch quits [Remote host closed the connection] |
11:02:11 | | arch joins |
11:02:45 | | Bleo182600722719623455 joins |
11:04:56 | <cruller> | In this case, the following clearly have a NSFW zone.... (full message at <https://matrix.hackint.org/_irc/v1/media/download/ATUxY1EzxiVYKVxaBa7aH6aNGw_eMAoSJWlp5N4cTdAGwmD_LKU2sdxtOfgQ_Jee8iHQrBajeUC9dUCF0adZVBpCfgku_f_QAGhhY2tpbnQub3JnL0VMaGdwU0x6ZHBHeGlUdlZHTlB5RmZXbQ>) |
11:05:05 | <cruller> | oops |
11:13:37 | <cruller> | FC2SNS, FC2アプリ, and FC2旧無料ホームページスペース clearly have an NSFW zone. Restrictions are as shown in the previous table. |
11:15:49 | <cruller> | For FC2アプリ, answering yes on https://apps.fc2.com/adult_cushion and using the resulting cookie is a solution |
11:20:48 | <cruller> | FC2投票 has an NSFW zone too, but there are no restrictions, so I think it's already archived by ArchiveBot. |
11:30:15 | <cruller> | FC2ケータイホームページ clearly has an NSFW area (http://k.fc2.com/a/*). Like FC2アプリ, it also required the cookie. I saved those pages to the Wayback Machine with Save Page Now. |
11:33:18 | <@arkiver> | cruller: have you found significant slowdowns or other signs the server cannot handle large numbers of requests? |
11:36:17 | <cruller> | No, I haven't. |
11:38:39 | <cruller> | <cruller> "In this case, the following..." <- FC2プロフ doesn't have NSFW. Sorry for the mistake. |
11:41:54 | <@arkiver> | cruller: can you give me an example of NSFW on FC2投票? you can post here or DM me if you're uncomfortable with that |
11:44:51 | <cruller> | https://vote1.fc2.com/result/38690823/23/ |
11:48:44 | <@arkiver> | cruller: on your screenshot of the table i see a reference to "kbbs" and "kdiary" in considerations for k.fc2.com, do you have examples of those two? |
11:50:01 | <@arkiver> | actually, i can find them through google |
11:51:38 | <@arkiver> | but, do you have an example of a k.fc2.com URL that links to kbbs or kdiary? |
11:53:21 | <cruller> | http://kdiary1.fc2.com/cgi-bin/d.cgi/haightashbury/ |
11:53:21 | <cruller> | http://kbbs1.fc2.com/cgi-bin/b.cgi/haightashbury/ |
11:53:21 | <cruller> | linked from |
11:53:21 | <cruller> | http://k2.fc2.com/cgi-bin/hp.cgi/haightashbury/ |
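As a rough illustration of the relationship in the example above, the sketch below derives the candidate kbbs and kdiary URLs for a k.fc2.com username, using the two URL templates cruller gave earlier in the log. The function name is hypothetical, and whether a given user actually has either page still has to be checked by fetching it, since the log says only some users do.

```python
# Sketch only: candidate companion URLs for a k[1-2]?.fc2.com username.
# The templates come from the log; the function name is an assumption.
def companion_urls(username: str) -> dict[str, str]:
    """Return the kbbs and kdiary URLs that may exist for this username."""
    return {
        "kbbs": f"http://kbbs1.fc2.com/cgi-bin/b.cgi/{username}/",
        "kdiary": f"http://kdiary1.fc2.com/cgi-bin/d.cgi/{username}/",
    }


if __name__ == "__main__":
    for name, url in companion_urls("haightashbury").items():
        print(name, url)
```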
11:54:44 | <@arkiver> | perfect |
11:54:48 | <@arkiver> | i think we're nearly ready |
11:54:57 | <@arkiver> | final tests now |
11:55:38 | <@arkiver> | cruller: have you noticed any rate limiting or error status codes before on fc2? |
12:04:31 | <cruller> | Redirection to https://error.fc2.com/* is likely a sign of a problem. Other than that, I haven't. |
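A minimal sketch of how a crawler script might flag the redirects mentioned above, assuming the only signal is the host of the redirect target. The function name and the sample path are hypothetical; this is not taken from any actual project code.

```python
# Sketch only: treat any redirect whose target host is error.fc2.com
# as a likely failure rather than real content.
from urllib.parse import urlsplit


def is_fc2_error_redirect(location: str) -> bool:
    """True if a redirect Location header points at error.fc2.com."""
    parts = urlsplit(location)
    # urlsplit lowercases the hostname, so the comparison is case-insensitive.
    return parts.scheme in ("http", "https") and parts.hostname == "error.fc2.com"


if __name__ == "__main__":
    # The path below is only an example value.
    print(is_fc2_error_redirect("https://error.fc2.com/web/404.html"))
    print(is_fc2_error_redirect("http://k2.fc2.com/cgi-bin/hp.cgi/haightashbury/"))
```

Matching on the host alone keeps the check independent of the specific error page path.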
12:04:44 | <@arkiver> | alright |
12:07:10 | <@arkiver> | we'll do #forbiddencravingcenter for FC2 |
12:18:33 | <cruller> | I see. Thank you for all your efforts and hard work, ArchiveTeam! |
12:18:45 | | etnguyen03 (etnguyen03) joins |
12:24:59 | | Webuser752127 joins |
12:25:09 | | Webuser752127 quits [Client Quit] |
12:27:05 | <@arkiver> | cruller: thanks :) |
12:27:16 | <@arkiver> | you could join #forbiddencravingcenter perhaps |
12:28:27 | | nepeat quits [Quit: ZNC - https://znc.in] |
12:32:40 | | nepeat (nepeat) joins |
12:53:57 | | etnguyen03 quits [Client Quit] |
12:55:47 | | driib9 quits [Quit: The Lounge - https://thelounge.chat] |
12:56:20 | | driib9 (driib) joins |
13:01:58 | <pabs> | arkiver: http://archivebot.com/ignores/7qwz68jnobcw90l4utbhabotd?compact=true |
13:02:32 | | driib9 quits [Client Quit] |
13:02:36 | | Naruyoko joins |
13:03:00 | | driib9 (driib) joins |
13:03:34 | <pabs> | (the first one is ignoring offsite) |
13:03:52 | <pabs> | and the rest are ignoring offsite broken stuff |
13:03:53 | | driib9 quits [Client Quit] |
13:04:26 | | Naruyoko5 quits [Ping timeout: 260 seconds] |
13:04:37 | <pabs> | one thing AB can't ignore is redirects to http://error.fc2.com/web/404.html |
13:04:44 | | Naruyoko5 joins |
13:04:45 | <pabs> | there are a lot of those |
13:05:03 | | driib9 (driib) joins |
13:07:44 | | Naruyoko quits [Ping timeout: 276 seconds] |
13:21:39 | | driib9 quits [Client Quit] |
13:22:46 | | driib9 (driib) joins |
13:24:54 | | katocala joins |
13:25:19 | | katocala is now authenticated as katocala |
13:32:47 | | corentin quits [Quit: The Lounge - https://thelounge.chat] |
13:47:18 | | fangfufu quits [Quit: ZNC 1.8.2+deb3.1+deb12u1 - https://znc.in] |
13:52:09 | | fangfufu joins |
13:52:25 | | fangfufu is now authenticated as fangfufu |
14:31:26 | | Kenshin quits [Quit: ZNC - http://znc.in] |
14:31:35 | | Kenshin joins |
14:31:36 | | Kenshin is now authenticated as Kenshin |
14:35:48 | | grill (grill) joins |
14:56:03 | | corentin joins |
14:59:25 | | arch quits [Remote host closed the connection] |
14:59:36 | | arch joins |
15:00:33 | | arch quits [Remote host closed the connection] |
15:01:03 | | arch joins |
15:19:11 | | corentin quits [Ping timeout: 260 seconds] |
15:36:35 | | grill quits [Ping timeout: 276 seconds] |
15:37:42 | | grill (grill) joins |
15:39:15 | | Sluggs quits [Excess Flood] |
15:41:33 | | Sluggs joins |
15:45:03 | | arch quits [Remote host closed the connection] |
15:45:11 | | arch joins |
15:49:15 | <katia> | kiska, kiska52 fyi fc2 project started |
15:55:11 | <katia> | also should it be set as default for warrior? |
15:58:30 | | ducky quits [Remote host closed the connection] |
15:59:00 | | ducky (ducky) joins |
16:00:11 | | Sluggs is now authenticated as Sluggs |
16:00:11 | | Sluggs quits [Changing host] |
16:00:11 | | Sluggs (Sluggs) joins |
16:28:33 | <nstrom|m> | We're already hitting tracker rate limiting so we probably don't need to unless something changes |
16:30:01 | | nine quits [Quit: See ya!] |
16:30:15 | | nine joins |
16:30:15 | | nine is now authenticated as nine |
16:30:15 | | nine quits [Changing host] |
16:30:15 | | nine (nine) joins |
16:48:14 | <h2ibot> | Hans5958 edited Twitch.tv (+160, Add new section for 2025): https://wiki.archiveteam.org/?diff=55845&oldid=55720 |
16:48:15 | <h2ibot> | Hans5958 edited Twitch.tv (+7, Add year): https://wiki.archiveteam.org/?diff=55846&oldid=55845 |
16:49:41 | <kiska> | katia: You should check grafana list before telling me, I had started the wss listener 24h ago |
16:51:21 | | magmaus3 quits [Ping timeout: 260 seconds] |
16:53:37 | <Hans5958> | Can I put US Government, VOA, RFE/RL, Twitch, and Livestream to "Medium-term projects"? Is Goo.gl completed? |
16:54:59 | <Hans5958> | I'm regerring to the Current projects on the Main Page |
16:55:02 | <Hans5958> | *referring |
16:55:15 | <h2ibot> | Hans5958 edited Current Projects (-111, Pull Retrospring to finished, remove older…): https://wiki.archiveteam.org/?diff=55847&oldid=55754 |
16:55:32 | <Hans5958> | Now I'm just moving Retrospring to finished |
17:05:17 | <h2ibot> | Hans5958 edited Current Projects (+0, Move goo-gl as it has 0 TODO): https://wiki.archiveteam.org/?diff=55848&oldid=55847 |
17:05:18 | <Hans5958> | I moved Goo.gl to finished also (I saw both Retrospring and Goo.gl had 0 TODO) |
17:05:54 | | magmaus3 (magmaus3) joins |
17:08:52 | <Hans5958> | Nvm putting goo.gl on medium since there's still some discussion |
17:09:17 | <h2ibot> | Hans5958 edited Current Projects (-18, Goo.gl still had some discussion): https://wiki.archiveteam.org/?diff=55849&oldid=55848 |
17:27:05 | | grill quits [Ping timeout: 276 seconds] |
17:28:24 | | grill (grill) joins |
17:31:04 | <katia> | kiska, just saw nobody said anything about it to you so wanted to make sure :). |
17:32:07 | <kiska> | You should check your logs then: |
17:32:07 | <kiska> | [2025-05-30T17:29:41.698Z] <kiska> fc2 wss listening |
17:34:04 | | @dxrt quits [Remote host closed the connection] |
17:34:10 | | Ryz quits [Read error: Connection reset by peer] |
17:34:14 | | grill quits [Ping timeout: 276 seconds] |
17:34:30 | | dxrt joins |
17:34:32 | | dxrt is now authenticated as dxrt |
17:34:32 | | dxrt quits [Changing host] |
17:34:32 | | dxrt (dxrt) joins |
17:34:32 | | @ChanServ sets mode: +o dxrt |
17:35:02 | | Ryz (Ryz) joins |
17:35:23 | | grill (grill) joins |
17:44:26 | | eggdrop quits [Ping timeout: 260 seconds] |
17:58:31 | | Wohlstand (Wohlstand) joins |
18:00:54 | <katia> | kiska, ...i'll just leave you to it in the future... |
18:32:51 | | hackbug quits [Ping timeout: 260 seconds] |
18:35:51 | | jspiros quits [] |
18:42:03 | | jspiros (jspiros) joins |
18:50:09 | | midou quits [Remote host closed the connection] |
18:50:17 | | midou joins |
19:00:48 | | riteo (riteo) joins |
19:16:31 | | BornOn420 quits [Remote host closed the connection] |
19:17:04 | | BornOn420 (BornOn420) joins |
19:20:19 | | eggdrop (eggdrop) joins |
19:25:59 | | JRF quits [Quit: Ooops, wrong browser tab.] |
19:29:26 | | grill quits [Ping timeout: 260 seconds] |
19:31:14 | | nepeat quits [Ping timeout: 276 seconds] |
19:32:25 | | nepeat (nepeat) joins |
19:33:11 | | jspiros quits [Ping timeout: 276 seconds] |
19:34:17 | | jspiros (jspiros) joins |
19:48:06 | | tzt quits [Ping timeout: 260 seconds] |
19:50:49 | | Radzig2 joins |
19:52:46 | | Radzig quits [Ping timeout: 260 seconds] |
19:52:46 | | Radzig2 is now known as Radzig |
19:56:58 | | APOLLO03 joins |
19:58:36 | | APOLLO_03 quits [Ping timeout: 260 seconds] |
20:08:56 | | Naruyoko5 quits [Ping timeout: 276 seconds] |
20:09:16 | | Naruyoko joins |
20:16:37 | | JRF joins |
20:18:12 | | Naruyoko quits [Remote host closed the connection] |
20:18:29 | | Naruyoko joins |
20:20:20 | | tzt (tzt) joins |
20:38:33 | | etnguyen03 (etnguyen03) joins |
20:49:53 | | midou quits [Ping timeout: 276 seconds] |
20:52:34 | | APOLLO03 quits [Client Quit] |
21:01:44 | | chunkynutz68 quits [Read error: Connection reset by peer] |
21:01:57 | | chunkynutz6 joins |
21:51:32 | | Wohlstand quits [Quit: Wohlstand] |
21:54:53 | | etnguyen03 quits [Client Quit] |
21:56:21 | | legoktm quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.] |
21:57:00 | | legoktm joins |
22:07:39 | | useretail joins |
22:32:41 | | jspiros quits [Client Quit] |
22:39:10 | | jspiros (jspiros) joins |
23:00:45 | | threedeeitguy6 quits [Quit: The Lounge - https://thelounge.chat] |
23:01:22 | | threedeeitguy6 (threedeeitguy) joins |
23:10:56 | | jspiros quits [Ping timeout: 276 seconds] |
23:12:34 | | DogsRNice joins |
23:13:44 | | jspiros (jspiros) joins |
23:23:33 | | Dada quits [Remote host closed the connection] |
23:37:26 | | tek_dmn quits [Quit: ZNC - https://znc.in] |
23:39:47 | | dabs joins |
23:41:50 | | tek_dmn (tek_dmn) joins |
23:44:45 | | midou joins |
23:51:40 | | okay joins |
23:58:56 | | nine quits [Ping timeout: 260 seconds] |