00:08:19Wohlstand quits [Quit: Wohlstand]
00:16:31Wohlstand (Wohlstand) joins
00:32:52IDK quits [Quit: Connection closed for inactivity]
00:46:32ATinySpaceMarine joins
01:31:18DogsRNice joins
01:40:11<pie_>is there a way to get archive.org save page now to not automatically rewrite a url to https? i dont think its from a redirect its just archive.org doing it itself...
01:41:19<@JAA>#internetarchive
01:42:26<pie_>thanks
01:57:26pabs quits [Read error: Connection reset by peer]
01:58:07pabs (pabs) joins
02:03:21Dango360 quits [Ping timeout: 260 seconds]
02:24:45ahm2587609 joins
02:26:06ahm258760 quits [Ping timeout: 260 seconds]
02:26:06ahm2587609 is now known as ahm258760
02:26:15chrismeller quits [Quit: chrismeller]
02:28:20chrismeller (chrismeller) joins
02:38:35etnguyen03 quits [Quit: Konversation terminated!]
02:42:40JayEmbee (JayEmbee) joins
02:56:23<h2ibot>PaulWise edited Blogger (+1254, add JAA tip for ignoring excessive pagination): https://wiki.archiveteam.org/?diff=55798&oldid=55466
02:56:25<pabs>JAA++
02:56:27<eggdrop>[karma] 'JAA' now has 272 karma!
03:35:50Naruyoko5 joins
03:39:36Naruyoko quits [Ping timeout: 260 seconds]
03:51:59DogsRNice quits [Read error: Connection reset by peer]
04:09:37Naruyoko5 quits [Read error: Connection reset by peer]
04:10:17Naruyoko5 joins
04:12:07Juesto (Juest) joins
04:14:36Juest quits [Ping timeout: 260 seconds]
04:18:44Juesto quits [Ping timeout: 276 seconds]
04:21:28Juest (Juest) joins
04:33:48Webuser313652 joins
04:34:26lennier2 quits [Ping timeout: 260 seconds]
04:35:08lennier2 joins
04:35:28Webuser313652 quits [Client Quit]
05:19:53Island quits [Read error: Connection reset by peer]
05:39:47<h2ibot>Nintendofan885 edited WarriorBot (+69, status (is it technically 'on hiatus' if it was…): https://wiki.archiveteam.org/?diff=55799&oldid=55782
05:39:48<h2ibot>Nintendofan885 edited Google+ press release (+27, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55800&oldid=35959
05:39:49<h2ibot>Nintendofan885 edited ArchiveBot/Ignore/RegexRodeo (-9, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55801&oldid=53826
05:39:50<h2ibot>Nintendofan885 edited Geocities FAQ (+0, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55802&oldid=1309
05:39:51<h2ibot>Nintendofan885 edited Vkontakte (-7, fixing [[Special:DoubleRedirects|double redirect]]): https://wiki.archiveteam.org/?diff=55803&oldid=46600
05:39:52<h2ibot>Nintendofan885 edited Archiveteam:Press releases (+1, link to subpage): https://wiki.archiveteam.org/?diff=55806&oldid=55786
05:40:38Naruyoko joins
05:40:41flotwig quits [Quit: ZNC - http://znc.in]
05:40:47<h2ibot>JustAnotherArchivist edited WarriorBot (-3, Hiatus implies it could be resumed later, but…): https://wiki.archiveteam.org/?diff=55807&oldid=55799
05:44:26Naruyoko5 quits [Ping timeout: 260 seconds]
05:47:44flotwig joins
06:33:56lennier2 quits [Ping timeout: 276 seconds]
06:34:35lennier2 joins
06:41:01anarcat quits [Ping timeout: 260 seconds]
07:07:53anarcat (anarcat) joins
07:18:23Webuser838968 joins
07:19:12Webuser838968 quits [Client Quit]
07:40:36ThreeHM quits [Quit: WeeChat 4.4.3]
07:40:45ThreeHM (ThreeHeadedMonkey) joins
07:59:59<szczot3k>Hello - server talk, anyone has an idea where could I get a cheap idrac card for PowerEdge R320?
08:05:45<katia>szczot3k: #archiveteam-ot
08:27:07<szczot3k>my bad
08:34:44Dada joins
08:55:46tzt quits [Ping timeout: 260 seconds]
08:57:01tzt (tzt) joins
10:46:05mgrytbak8 joins
10:48:21mgrytbak quits [Ping timeout: 260 seconds]
10:48:21mgrytbak8 is now known as mgrytbak
10:58:21flotwig quits [Quit: ZNC - http://znc.in]
11:00:01Bleo182600722719623455 quits [Quit: The Lounge - https://thelounge.chat]
11:00:03flotwig joins
11:02:44Bleo182600722719623455 joins
11:08:46Wohlstand quits [Ping timeout: 260 seconds]
11:11:41@imer quits [Ping timeout: 260 seconds]
11:19:38imer (imer) joins
11:19:38@ChanServ sets mode: +o imer
11:30:04Wohlstand (Wohlstand) joins
11:59:35Wohlstand quits [Ping timeout: 276 seconds]
12:21:55katocala joins
13:11:59Dango360 (Dango360) joins
13:38:42<triplecamera|m><triplecamera|m> "Hi. I'm looking for a file which..." <- Good news, [Memento Time Travel](https://timetravel.mementoweb.org/) can do this job (ChatGPT told me this)
13:44:36<triplecamera|m>This is an interesting project... I have only heard of the Wayback Machine before
13:45:20IDK (IDK) joins
14:11:41klaffty quits [Quit: klaffty]
14:34:41DLoader quits [Ping timeout: 260 seconds]
15:15:21<@arkiver>pokechu22: have we been able to save fc2?
15:43:54grill (grill) joins
15:47:36kansei quits [Read error: Connection reset by peer]
15:47:53kansei- (kansei) joins
16:01:06Island joins
16:09:46janos777 joins
16:18:47<pokechu22>arkiver: it's running in archivebot, but I don't think it's made as much progress as would be desired
16:35:29janos778 joins
16:39:05janos777 quits [Ping timeout: 276 seconds]
16:40:11janos777 joins
16:41:17janos777 quits [Read error: Connection reset by peer]
16:42:02janos777 joins
16:42:20janos778 quits [Ping timeout: 276 seconds]
16:45:35BornOn420 quits [Remote host closed the connection]
16:45:37kuroger quits [Quit: Ping timeout (120 seconds)]
16:45:54kuroger (kuroger) joins
16:47:09BornOn420 (BornOn420) joins
16:50:46janos777 quits [Read error: Connection reset by peer]
17:07:41grill quits [Ping timeout: 276 seconds]
17:14:40<h2ibot>HadeanEon edited Deaths in 2025 (+3075, BOT - Updating page: {{saved}} (130),…): https://wiki.archiveteam.org/?diff=55808&oldid=55796
17:15:40<h2ibot>HadeanEon edited Deaths in 2025/list (+283, BOT - Updating list): https://wiki.archiveteam.org/?diff=55809&oldid=55797
17:17:22ericgallager quits [Read error: Connection reset by peer]
17:19:32ericgallager joins
17:22:15DLoader (DLoader) joins
17:36:42grill (grill) joins
17:51:20Webuser156103 joins
17:51:38Webuser156103 quits [Client Quit]
18:16:21grill quits [Ping timeout: 260 seconds]
18:19:49ducky quits [Remote host closed the connection]
18:20:13ducky (ducky) joins
18:21:48ducky quits [Remote host closed the connection]
18:27:44fmixolydian joins
18:30:23<fmixolydian>hello
18:30:36<fmixolydian>should i continue dumping glitch.com manually (from my computer)?
18:47:07fmixolydian quits [Client Quit]
19:14:58<h2ibot>Ufarwisan edited Discord (-60, bookkeeping): https://wiki.archiveteam.org/?diff=55810&oldid=55709
19:39:40PredatorIWD25 quits [Read error: Connection reset by peer]
19:42:31PredatorIWD25 joins
20:34:10Webuser081823 joins
20:35:06Webuser081823 quits [Client Quit]
20:39:51^ quits [Remote host closed the connection]
20:40:02^ (^) joins
20:49:15adryd0 quits [Quit: The Lounge - https://thelounge.chat]
20:50:30adryd0 (adryd) joins
20:55:56cow_2001 quits [Quit: ⛧]
20:57:21cow_2001 joins
21:31:41fangfufu quits [Quit: ZNC 1.8.2+deb3.1+deb12u1 - https://znc.in]
21:36:30fangfufu joins
22:00:47Dada quits [Remote host closed the connection]
22:03:17<pabs>arkiver: not sure if the remaining AB job will finish, its rather large. my fault for not looking at speed/size/offsite stuff properly beforehand
22:03:27<pabs>there is also a grab-site job IIRC
22:04:16<pabs>JAA: did you see Medaka's https://transfer.archivete.am/inline/plTPG/diary_fc2_urls.txt ?
22:05:35<@JAA>pabs: No, only k.fc2.com, but that isn't running either because the machine has had technical issues again.
22:07:02<pabs>ok, diary_fc2_urls.txt was their latest brute-force, IIRC we concluded it had to be grab-site
22:07:39<@JAA>Ah, they both need Googlebot due to georestrictions, right?
22:07:46<pabs>(due to the Googlebot UA requirement)
22:07:51<pabs>ya
22:08:21<@JAA>I can get the Googlebot UA into AB though.
22:10:44<pabs>ah, then we can do the sitemap trick and !a
22:10:49<pabs>er !a <
22:11:30<pabs>only 20k URLs in the file, much smaller than the big one
22:11:36<@JAA>Don't even need the sitemap hack for k.fc2.com, I think, though we would need to modify the list.
22:11:49<@JAA>I suspect the same applies to diary.fc2.com.
22:12:14<pabs>the domains for the latter are diary{,1,2,3}.fc2.com
22:12:38<pabs>and all the URLs are multiple levels deep
22:12:53<@JAA>Yeah, but is there anything outside of /cgi-bin/ed.cgi/ ?
22:13:28<@JAA>If not, we can just queue a list like http://diary.fc2.com/cgi-sys/ed.cgi/fjwr with !a <.
22:13:50<pabs>but then you don't have the trailing-slash URLs?
22:14:11<@JAA>I guess, unless there are links to it (like there are in this case).
22:14:35<@JAA>Though it seems to serve the same content, so could just do a separate !ao < job for the slashed URLs for completeness's sake.
22:15:12<pabs>sitemap thing is easy anyway, I have it automated https://wiki.archiveteam.org/index.php/ArchiveBot#Usage_tips
22:17:40<pabs>just get a URL list file, run urls-to-sitemap, transfer both, include those URLs in an !a < of all the domains
22:18:52<pabs>hmm, might look at search engines to check the ed.cgi question
22:21:40<pabs>Google finds only http://diary.fc2.com/login.html http://diary.fc2.com/i/QA.html http://diary2.fc2.com/cgi-sys/ed_user.cgi
22:22:43<@JAA>DDG finds basically nothing in general (and all ed.cgi).
22:24:38Webuser348865 joins
22:30:09<pabs>a few on google/bing, nothing on yandex https://transfer.archivete.am/SM9kV/diary.fc2.com-google-bing-yandex-scrape.txt
22:30:09<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/SM9kV/diary.fc2.com-google-bing-yandex-scrape.txt
22:30:40<@JAA>Also all ed.cgi except for /i/QA.html
22:31:26<@JAA>Does the original list include WBM CDX?
22:31:44<pabs>I think no, sounded like it was brute-forced
22:32:00<pabs><Medaka> The brute-force process for 4-character combinations on diary.fc2.com is finally complete.
22:32:29<pabs>the ones in my search scrape are longer than that
22:32:36<pabs>so there are likely lots of users missing
22:33:47JayEmbee quits [Quit: WeeChat 4.1.1]
22:35:26JayEmbee (JayEmbee) joins
22:39:37<@JAA>I'll get the CDX stuff for k.fc2.com.
22:40:04JayEmbee quits [Client Quit]
22:43:31<pabs>can you do diary{,1,2,3}.fc2.com too?
22:43:36<@JAA>Sure
23:02:56<@JAA>Ok, there are definitely things outside of /cgi-bin/ed.cgi/, e.g. images like http://diary.fc2.com/user/zztop666/img/bg.png
23:03:34<@JAA>If they're requisites, they'd still get retrieved, but...
23:22:45JayEmbee (JayEmbee) joins
23:26:34JayEmbee quits [Client Quit]
23:29:21JayEmbee (JayEmbee) joins
23:30:07Wohlstand (Wohlstand) joins
23:33:41Webuser348865 quits [Client Quit]
23:40:09Wohlstand quits [Client Quit]
23:40:24Wohlstand (Wohlstand) joins