歡迎您光臨本站 註冊首頁

在RHCS5中搭建RHCS HA 集群CMAN伺服器啟動不了的問題!

←手機掃碼閱讀     火星人 @ 2014-03-04 , reply:0

在RHCS5中搭建RHCS HA 集群CMAN伺服器啟動不了的問題!

各位師兄,我是首次搭建集群,在網上找的資料.搭建完成後,CMAN伺服器啟動不了.請各位師兄幫我看看.非常感謝!
這是我 "cluster.conf"裡面的配置:
<?xml version="1.0" ?>
<cluster alias="sztc_cluster" config_version="19" name="sztc_cluster">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="dfs1.sztc" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device modulename="" name="drac1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="dfs3.sztc" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device modulename="" name="drac3"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_drac" ipaddr="192.168.2.194" login="root" name="drac1" passwd="654321"/>
                <fencedevice agent="fence_drac" ipaddr="192.168.2.195" login="root" name="drac3" passwd="654321"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="nfs_fd" ordered="0" restricted="1">
                                <failoverdomainnode name="dfs1.sztc" priority="1"/>
                                <failoverdomainnode name="dfs3.sztc" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="192.168.2.190" monitor_link="1"/>
                        <fs device="/dev/mapper/project_vg-project_lv" force_fsck="0" force_unmount="1" fsid="62521" fstype="ext3" mountpoint="/storage/sztc" name="project_ext3" options="acl" self_fence="1"/>
                        <fs device="dev/mapper/home_vg-home_lv" force_fsck="0" force_unmount="1" fsid="36334" fstype="ext3" mountpoint="/storage/home" name="home_ext3" options="acl,usrquota,grpquota" self_fence="1"/>
                        <nfsexport name="p_export"/>
                        <nfsexport name="h_export"/>
                        <nfsclient name="p_client" options="rw,sync" path="/storage/sztc" target="*"/>
                        <nfsclient name="h_client" options="rw,sync" path="/storage/home" target="*"/>
                        <nfsclient name="dell3_h_c" options="no_root_squash,rw,sync" path="/storage/home" target="192.168.2.149"/>
                </resources>
                <service domain="nfs_fd" name="nfs" recovery="restart">
                        <ip ref="192.168.2.190"/>
                        <fs ref="project_ext3"/>
                        <nfsexport ref="p_export"/>
                        <fs ref="home_ext3"/>
                        <nfsexport ref="h_export"/>
                        <nfsclient ref="dell3_h_c"/>
                        <nfsclient ref="h_client"/>
                </service>
        </rm>
</cluster>
然後我啟動 用於RHEL5 的2個服務,其中一個 rgmanager 服務能夠啟動
啟動CMAN服務報下面的錯誤....
root@dfs3 ~]# /etc/init.d/cman restart
Stopping cluster:
   Stopping fencing... done
   Stopping cman... done
   Stopping ccsd... done
   Unmounting configfs... done
                                                           [  OK  ]
Starting cluster:
   Loading modules... done
   Mounting configfs... done
   Starting ccsd... done
   Starting cman... failed
cman not started: Can't determine address family of nodename /usr/sbin/cman_tool: aisexec daemon didn't start
                                                           
#
請大家給我點意見....
《解決方案》

回復 1# liujixuan


    把 /var/log/message 貼上來
《解決方案》

回復 2# nagaregawa

這是我截取的后30條/var/log/message  

Feb 24 08:32:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77250 seconds.
Feb 24 08:33:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77280 seconds.
Feb 24 08:33:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77310 seconds.
Feb 24 08:34:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77340 seconds.
Feb 24 08:34:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77370 seconds.
Feb 24 08:35:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77400 seconds.
Feb 24 08:35:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77430 seconds.
Feb 24 08:36:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77460 seconds.
Feb 24 08:36:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77490 seconds.
Feb 24 08:37:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77520 seconds.
Feb 24 08:37:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77550 seconds.
Feb 24 08:38:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77580 seconds.
Feb 24 08:38:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77610 seconds.
Feb 24 08:39:14 dfs3 ccsd: Unable to connect to cluster infrastructure after 77640 seconds.
Feb 24 08:39:44 dfs3 ccsd: Unable to connect to cluster infrastructure after 77670 seconds.
Feb 24 08:40:11 dfs3 ccsd: Stopping ccsd, SIGTERM received.
Feb 24 08:40:12 dfs3 ccsd: Starting ccsd 2.0.84:
Feb 24 08:40:12 dfs3 ccsd:  Built: Apr 15 2008 16:19:15
Feb 24 08:40:12 dfs3 ccsd:  Copyright (C) Red Hat, Inc.  2004  All rights reserved.
Feb 24 08:40:12 dfs3 ccsd: cluster.conf (cluster name = sztc_cluster, version = 17) found.
Feb 24 08:40:15 dfs3 openais: AIS Executive Service RELEASE 'subrev 1358 version 0.80.3'
Feb 24 08:40:15 dfs3 openais: Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Feb 24 08:40:15 dfs3 openais: Copyright (C) 2006 Red Hat, Inc.
Feb 24 08:40:15 dfs3 openais: AIS Executive Service: started and ready to provide service.
《解決方案》

回復 2# nagaregawa


    謝謝你的幫忙,我終於找到 為什麼 CMAN 服務啟動不了的原因了..

我把 /etc/hosts 文件裡面的 : 192.168.2.191 dfs3.sztc  dfs3  寫成了

另外一個節點的名字了 : 192.168.2.190     dfs1.sztc dfs1  難怪啟動CMAN服務的時候報這個錯. 現在可以正常啟動了
《解決方案》

錯誤:
root@dfs3 ~]# /etc/init.d/cman restart
Stopping cluster:
   Stopping fencing... done
   Stopping cman... done
   Stopping ccsd... done
   Unmounting configfs... done
                                                           [  OK  ]
Starting cluster:
   Loading modules... done
   Mounting configfs... done
   Starting ccsd... done
   Starting cman... failed
cman not started: Can't determine address family of nodename /usr/sbin/cman_tool: aisexec daemon didn't start
                                                           
#

[火星人 ] 在RHCS5中搭建RHCS HA 集群CMAN伺服器啟動不了的問題!已經有969次圍觀

http://coctec.com/docs/service/show-post-5820.html