00:08:19 | | Wohlstand quits [Quit: Wohlstand] |
00:16:31 | | Wohlstand (Wohlstand) joins |
00:32:52 | | IDK quits [Quit: Connection closed for inactivity] |
00:46:32 | | ATinySpaceMarine joins |
01:31:18 | | DogsRNice joins |
01:40:11 | <pie_> | is there a way to get archive.org save page now to not automatically rewrite a url to https? i dont think its from a redirect its just archive.org doing it itself... |
01:41:19 | <@JAA> | #internetarchive |
01:42:26 | <pie_> | thanks |
01:57:26 | | pabs quits [Read error: Connection reset by peer] |
01:58:07 | | pabs (pabs) joins |
02:03:21 | | Dango360 quits [Ping timeout: 260 seconds] |
02:24:45 | | ahm2587609 joins |
02:26:06 | | ahm258760 quits [Ping timeout: 260 seconds] |
02:26:06 | | ahm2587609 is now known as ahm258760 |
02:26:15 | | chrismeller quits [Quit: chrismeller] |
02:28:20 | | chrismeller (chrismeller) joins |
02:38:35 | | etnguyen03 quits [Quit: Konversation terminated!] |
02:42:40 | | JayEmbee (JayEmbee) joins |
02:56:23 | <h2ibot> | PaulWise edited Blogger (+1254, add JAA tip for ignoring excessive pagination): https://wiki.archiveteam.org/?diff=55798&oldid=55466 |
02:56:25 | <pabs> | JAA++ |
02:56:27 | <eggdrop> | [karma] 'JAA' now has 272 karma! |
03:35:50 | | Naruyoko5 joins |
03:39:36 | | Naruyoko quits [Ping timeout: 260 seconds] |
03:51:59 | | DogsRNice quits [Read error: Connection reset by peer] |
04:09:37 | | Naruyoko5 quits [Read error: Connection reset by peer] |
04:10:17 | | Naruyoko5 joins |
04:12:07 | | Juesto (Juest) joins |
04:14:36 | | Juest quits [Ping timeout: 260 seconds] |
04:18:44 | | Juesto quits [Ping timeout: 276 seconds] |
04:21:28 | | Juest (Juest) joins |
04:33:48 | | Webuser313652 joins |
04:34:26 | | lennier2 quits [Ping timeout: 260 seconds] |
04:35:08 | | lennier2 joins |
04:35:28 | | Webuser313652 quits [Client Quit] |
05:19:53 | | Island quits [Read error: Connection reset by peer] |
05:39:47 | <h2ibot> | Nintendofan885 edited WarriorBot (+69, status (is it technically 'on hiatus' if it was…): https://wiki.archiveteam.org/?diff=55799&oldid=55782 |
05:39:48 | <h2ibot> | Nintendofan885 edited Google+ press release (+27, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55800&oldid=35959 |
05:39:49 | <h2ibot> | Nintendofan885 edited ArchiveBot/Ignore/RegexRodeo (-9, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55801&oldid=53826 |
05:39:50 | <h2ibot> | Nintendofan885 edited Geocities FAQ (+0, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55802&oldid=1309 |
05:39:51 | <h2ibot> | Nintendofan885 edited Vkontakte (-7, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55803&oldid=46600 |
05:39:52 | <h2ibot> | Nintendofan885 edited Archiveteam:Press releases (+1, link to subpage): https://wiki.archiveteam.org/?diff=55806&oldid=55786 |
05:40:38 | | Naruyoko joins |
05:40:41 | | flotwig quits [Quit: ZNC - http://znc.in] |
05:40:47 | <h2ibot> | JustAnotherArchivist edited WarriorBot (-3, Hiatus implies it could be resumed later, but…): https://wiki.archiveteam.org/?diff=55807&oldid=55799 |
05:44:26 | | Naruyoko5 quits [Ping timeout: 260 seconds] |
05:47:44 | | flotwig joins |
06:33:56 | | lennier2 quits [Ping timeout: 276 seconds] |
06:34:35 | | lennier2 joins |
06:41:01 | | anarcat quits [Ping timeout: 260 seconds] |
07:07:53 | | anarcat (anarcat) joins |
07:18:23 | | Webuser838968 joins |
07:19:12 | | Webuser838968 quits [Client Quit] |
07:40:36 | | ThreeHM quits [Quit: WeeChat 4.4.3] |
07:40:45 | | ThreeHM (ThreeHeadedMonkey) joins |
07:59:59 | <szczot3k> | Hello - server talk, anyone has an idea where could I get a cheap idrac card for PowerEdge R320? |
08:05:45 | <katia> | szczot3k: #archiveteam-ot |
08:27:07 | <szczot3k> | my bad |
08:34:44 | | Dada joins |
08:55:46 | | tzt quits [Ping timeout: 260 seconds] |
08:57:01 | | tzt (tzt) joins |
10:46:05 | | mgrytbak8 joins |
10:48:21 | | mgrytbak quits [Ping timeout: 260 seconds] |
10:48:21 | | mgrytbak8 is now known as mgrytbak |
10:58:21 | | flotwig quits [Quit: ZNC - http://znc.in] |
11:00:01 | | Bleo182600722719623455 quits [Quit: The Lounge - https://thelounge.chat] |
11:00:03 | | flotwig joins |
11:02:44 | | Bleo182600722719623455 joins |
11:08:46 | | Wohlstand quits [Ping timeout: 260 seconds] |
11:11:41 | | @imer quits [Ping timeout: 260 seconds] |
11:19:38 | | imer (imer) joins |
11:19:38 | | @ChanServ sets mode: +o imer |
11:30:04 | | Wohlstand (Wohlstand) joins |
11:59:35 | | Wohlstand quits [Ping timeout: 276 seconds] |
12:21:55 | | katocala joins |
12:22:10 | | katocala is now authenticated as katocala |
13:11:59 | | Dango360 (Dango360) joins |
13:38:42 | <triplecamera|m> | <triplecamera|m> "Hi. I'm looking for a file which..." <- Good news, [Memento Time Travel](https://timetravel.mementoweb.org/) can do this job (ChatGPT told me this) |
13:44:36 | <triplecamera|m> | This is an interesting project... I have only heard of the Wayback Machine before |
13:45:20 | | IDK (IDK) joins |
14:11:41 | | klaffty quits [Quit: klaffty] |
14:34:41 | | DLoader quits [Ping timeout: 260 seconds] |
15:15:21 | <@arkiver> | pokechu22: have we been able to save fc2? |
15:43:54 | | grill (grill) joins |
15:47:36 | | kansei quits [Read error: Connection reset by peer] |
15:47:53 | | kansei- (kansei) joins |
16:01:06 | | Island joins |
16:09:46 | | janos777 joins |
16:18:47 | <pokechu22> | arkiver: it's running in archivebot, but I don't think it's made as much progress as would be desired |
16:35:29 | | janos778 joins |
16:39:05 | | janos777 quits [Ping timeout: 276 seconds] |
16:40:11 | | janos777 joins |
16:41:17 | | janos777 quits [Read error: Connection reset by peer] |
16:42:02 | | janos777 joins |
16:42:20 | | janos778 quits [Ping timeout: 276 seconds] |
16:45:35 | | BornOn420 quits [Remote host closed the connection] |
16:45:37 | | kuroger quits [Quit: Ping timeout (120 seconds)] |
16:45:54 | | kuroger (kuroger) joins |
16:47:09 | | BornOn420 (BornOn420) joins |
16:50:46 | | janos777 quits [Read error: Connection reset by peer] |
17:07:41 | | grill quits [Ping timeout: 276 seconds] |
17:14:40 | <h2ibot> | HadeanEon edited Deaths in 2025 (+3075, BOT - Updating page: {{saved}} (130),…): https://wiki.archiveteam.org/?diff=55808&oldid=55796 |
17:15:40 | <h2ibot> | HadeanEon edited Deaths in 2025/list (+283, BOT - Updating list): https://wiki.archiveteam.org/?diff=55809&oldid=55797 |
17:17:22 | | ericgallager quits [Read error: Connection reset by peer] |
17:19:32 | | ericgallager joins |
17:22:15 | | DLoader (DLoader) joins |
17:36:42 | | grill (grill) joins |
17:51:20 | | Webuser156103 joins |
17:51:38 | | Webuser156103 quits [Client Quit] |
18:16:21 | | grill quits [Ping timeout: 260 seconds] |
18:19:49 | | ducky quits [Remote host closed the connection] |
18:20:13 | | ducky (ducky) joins |
18:21:48 | | ducky quits [Remote host closed the connection] |
18:27:44 | | fmixolydian joins |
18:30:23 | <fmixolydian> | hello |
18:30:36 | <fmixolydian> | should i continue dumping glitch.com manually (from my computer)? |
18:47:07 | | fmixolydian quits [Client Quit] |
19:14:58 | <h2ibot> | Ufarwisan edited Discord (-60, bookkeeping): https://wiki.archiveteam.org/?diff=55810&oldid=55709 |
19:39:40 | | PredatorIWD25 quits [Read error: Connection reset by peer] |
19:42:31 | | PredatorIWD25 joins |
20:34:10 | | Webuser081823 joins |
20:35:06 | | Webuser081823 quits [Client Quit] |
20:39:51 | | ^ quits [Remote host closed the connection] |
20:40:02 | | ^ (^) joins |
20:49:15 | | adryd0 quits [Quit: The Lounge - https://thelounge.chat] |
20:50:30 | | adryd0 (adryd) joins |
20:55:56 | | cow_2001 quits [Quit: ⛧] |
20:57:21 | | cow_2001 joins |
21:31:41 | | fangfufu quits [Quit: ZNC 1.8.2+deb3.1+deb12u1 - https://znc.in] |
21:36:30 | | fangfufu joins |
21:36:32 | | fangfufu is now authenticated as fangfufu |
22:00:47 | | Dada quits [Remote host closed the connection] |
22:03:17 | <pabs> | arkiver: not sure if the remaining AB job will finish, its rather large. my fault for not looking at speed/size/offsite stuff properly beforehand |
22:03:27 | <pabs> | there is also a grab-site job IIRC |
22:04:16 | <pabs> | JAA: did you see Medaka's https://transfer.archivete.am/inline/plTPG/diary_fc2_urls.txt ? |
22:05:35 | <@JAA> | pabs: No, only k.fc2.com, but that isn't running either because the machine has had technical issues again. |
22:07:02 | <pabs> | ok, diary_fc2_urls.txt was their latest brute-force, IIRC we concluded it had to be grab-site |
22:07:39 | <@JAA> | Ah, they both need Googlebot due to georestrictions, right? |
22:07:46 | <pabs> | (due to the Googlebot UA requirement) |
22:07:51 | <pabs> | ya |
22:08:21 | <@JAA> | I can get the Googlebot UA into AB though. |
22:10:44 | <pabs> | ah, then we can do the sitemap trick and !a |
22:10:49 | <pabs> | er !a < |
22:11:30 | <pabs> | only 20k URLs in the file, much smaller than the big one |
22:11:36 | <@JAA> | Don't even need the sitemap hack for k.fc2.com, I think, though we would need to modify the list. |
22:11:49 | <@JAA> | I suspect the same applies to diary.fc2.com. |
22:12:14 | <pabs> | the domains for the latter are diary{,1,2,3}.fc2.com |
22:12:38 | <pabs> | and all the URLs are multiple levels deep |
22:12:53 | <@JAA> | Yeah, but is there anything outside of /cgi-bin/ed.cgi/ ? |
22:13:28 | <@JAA> | If not, we can just queue a list like http://diary.fc2.com/cgi-sys/ed.cgi/fjwr with !a <. |
22:13:50 | <pabs> | but then you don't have the trailing-slash URLs? |
22:14:11 | <@JAA> | I guess, unless there are links to it (like there are in this case). |
22:14:35 | <@JAA> | Though it seems to serve the same content, so could just do a separate !ao < job for the slashed URLs for completeness's sake. |
22:15:12 | <pabs> | sitemap thing is easy anyway, I have it automated https://wiki.archiveteam.org/index.php/ArchiveBot#Usage_tips |
22:17:40 | <pabs> | just get a URL list file, run urls-to-sitemap, transfer both, include those URLs in an !a < of all the domains |
22:18:52 | <pabs> | hmm, might look at search engines to check the ed.cgi question |
22:21:40 | <pabs> | Google finds only http://diary.fc2.com/login.html http://diary.fc2.com/i/QA.html http://diary2.fc2.com/cgi-sys/ed_user.cgi |
22:22:43 | <@JAA> | DDG finds basically nothing in general (and all ed.cgi). |
22:24:38 | | Webuser348865 joins |
22:30:09 | <pabs> | a few on google/bing, nothing on yandex https://transfer.archivete.am/SM9kV/diary.fc2.com-google-bing-yandex-scrape.txt |
22:30:09 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/SM9kV/diary.fc2.com-google-bing-yandex-scrape.txt |
22:30:40 | <@JAA> | Also all ed.cgi except for /i/QA.html |
22:31:26 | <@JAA> | Does the original list include WBM CDX? |
22:31:44 | <pabs> | I think no, sounded like it was brute-forced |
22:32:00 | <pabs> | <Medaka> The brute-force process for 4-character combinations on diary.fc2.com is finally complete. |
22:32:29 | <pabs> | the ones in my search scrape are longer than that |
22:32:36 | <pabs> | so there are likely lots of users missing |
22:33:47 | | JayEmbee quits [Quit: WeeChat 4.1.1] |
22:35:26 | | JayEmbee (JayEmbee) joins |
22:39:37 | <@JAA> | I'll get the CDX stuff for k.fc2.com. |
22:40:04 | | JayEmbee quits [Client Quit] |
22:43:31 | <pabs> | can you do diary{,1,2,3}.fc2.com too? |
22:43:36 | <@JAA> | Sure |
23:02:56 | <@JAA> | Ok, there are definitely things outside of /cgi-bin/ed.cgi/, e.g. images like http://diary.fc2.com/user/zztop666/img/bg.png |
23:03:34 | <@JAA> | If they're requisites, they'd still get retrieved, but... |
23:22:45 | | JayEmbee (JayEmbee) joins |
23:26:34 | | JayEmbee quits [Client Quit] |
23:29:21 | | JayEmbee (JayEmbee) joins |
23:30:07 | | Wohlstand (Wohlstand) joins |
23:33:41 | | Webuser348865 quits [Client Quit] |
23:40:09 | | Wohlstand quits [Client Quit] |
23:40:24 | | Wohlstand (Wohlstand) joins |