CentOS 5 :: 2 Nodes Cluster Hang At Startup?
Feb 17, 2010
I have an annoying problem with a 2 nodes cluster that I don't understand.
One of my server is totally shutdown.
When I start cman after a reboot on the other server, it hangs on "Starting fencing...". The process which is actually hanging is : /sbin/fence_tool -w -t 300 -m 45 join
When I start this program manually, I get this output :
# /sbin/fence_tool -w -t 300 -m 45 join
Waiting for all 2 nodes to be members
Waiting for all 2 nodes to be members
Waiting for all 2 nodes to be members
Waiting for all 2 nodes to be members
(and so on...)
What I don't understand is why the fence_tool program wait for the 2nd node to join the cluster and don't fence the other node.
Here is my cluster.conf :
<?xml version="1.0"?>
<cluster name="ctpsrv-bdd" config_version="7">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="20"/>
<cman expected_votes="1" two_node="1"/>
[Code]....
Would it be a good idea to edit the cman startup script and remove the "-w" option of command fence_tool join, so the command do not hang up ?
View 4 Replies
ADVERTISEMENT
Aug 12, 2011
well sorry I'm new in this but I just made my cluster with about 4 good machines each with 2 or more gbs of memory and dual-core processor but the question is i can install an operating system in nodes as having the frontend or not different if possible, and how i can do it because i don't now how and i don't find material and tutorials that can help me..
View 3 Replies
View Related
Mar 10, 2010
I have two Web server (RHEL 5.3) as cluster. i want to get a certificate for website. do i need to get two certificates for a website?
View 2 Replies
View Related
May 28, 2010
I am trying to setup Rocks Cluster on a Cluster containing 1 Master node and 23 slave nodes connected via a Switch. The master node's eth1 is connected to public IP and eth0 is connected to the slave nodes via the switch.
I was able to install the Rocks cluster on the machine, I configured the IP for master through DHCP. When I run the command 'insert-ethers' on the master node, it doesnot get me any slave nodes.
setting up the Rocks cluster on all the nodes?
View 1 Replies
View Related
Oct 9, 2010
I've just installed Debian AMD64 on my main box (Intel Q6600, Ati HD4770, Asus P5K) using the DVD install. It boots, but hangs when it tries starting the graphical interface. No reports, no commandline, nothing at all - a nice black screen and completely unresponsive system.
Debian x86 works with no problems.
View 4 Replies
View Related
Oct 10, 2010
I've just installed Debian AMD64 on my main box (Intel Q6600, Ati HD4770, Asus P5K) using the DVD install. It boots, but hangs when it tries starting the graphical interface. No reports, no commandline, nothing at all - a nice black screen and completely unresponsive system.
View 2 Replies
View Related
Dec 3, 2010
As I was updating machines today, I was wondering... Is there some way to cache the RPM files locally that yum is grabbing? Like many, I have a number of VMs I run, and it most updates are the same between the nodes. I'm guessing I'd need some sort of http cache? Just looking for pointers, not sure what I'm looking for, however.
View 4 Replies
View Related
May 11, 2011
I just updated to kernel 2.6.35.13-91.fc14.x86_64, and I'm now having a weird issue where my computer will hang during startup and shutdown unless I move the mouse around.
While moving the mouse, it will boot up or shut down normally. Once i stop moving the mouse around it will hang (until I begin to move the mouse again).Once fully booted it acts fine..
View 13 Replies
View Related
Nov 24, 2010
Everything worked fine in Kubuntu 10.04 (64-bit). On the upgrade to 10.10 (64-bit) it takes about a minute until I can move the mouse after startup and the integrated memory card reader doesn't work (but the green LED which used to only come on when there was a card in there is now always on). This has been discussed twice before: here (closed and solved by unplugging cardreader which I do not want to do) and here (unsolved but closed as maverick no longer in beta) dmesg (note the errer -110) follows. Let me know if I can/should include any other logs/info!
Code:
[ 13.575002] EXT4-fs (sda2): re-mounted. Opts: errors=remount-ro
[ 13.790470] EXT4-fs (sda1): mounted filesystem with ordered data mode. Opts: (null)
[ 13.903728] EXT4-fs (sda4): mounted filesystem with ordered data mode. Opts: (null)
[ 16.280020] usb 1-5: device descriptor read/64, error -110
[ 17.375755] sky2 0000:02:00.0: eth0: enabling interface
[ 17.376174] ADDRCONF(NETDEV_UP): eth0: link is not ready
[ 17.377028] type=1400 audit(1290608578.996:5): apparmor="STATUS" operation="profile_load"
[Code]......
View 2 Replies
View Related
Nov 6, 2010
I am trying to build GFS2 cluster with 2 or 3 Fedora 14 nodes, but I've encountered some problems from the start. First luci does not work at all in Fedora 14. There is no luci_admin and even if I manage to start luci service, I get a blank white screen when I try to open it from the browser. I've googled a bit and found that I'd might be able to setup GFS if I manage to build cluster.conf manually and start the cluster suite, but I cannot find documentation on how to create cluster.conf anywhere. If anyone knows how to setup GFS2 without a cluster suite or how to configure cluster.conf.
View 9 Replies
View Related
Mar 15, 2011
I'm having a very strange problem with red hat cluster. After testing it in the lab, I tried to install a new cluster and received the following error: cman not started: Cannot start, cluster name is too long or other CCS error /usr/sbin/cman_tool: aisexec daemon didn't start I've checked the internet for that error, but nothing. I decided to to take the example from cluster.conf man, which looks like that :
[Code]...
And still I get the same error. I can find my servers both from DNS (FQDN and short name) and they also appear in /etc/hosts.
View 3 Replies
View Related
Apr 3, 2009
i'm a familiar with Linux environement ( fedora 10 user ) and i got a project in a training where i have to create a cluster with two nodes where i have to set up a number of VMs that will run applications such as ( Samba, Ldap, Zimbra, ...) but i don't know how to virtualize on top of a cluster ! i would like to know how that can be done, and how is it possible to let the VMs get ressources ( RAM & CPU ) from the two nodes ??
View 1 Replies
View Related
Sep 1, 2011
I am working in a project that needs to set up an Apache Web Cluster. The cluster needs to be High-availability (HA) cluster and Load-balancing cluster. Someone mentioned the use of Red Hat Cluster Suite, but, honestly, I can't figure out how it works, I haven't been able to configure it correctly. The project currently have two nodes, but we need to support at least three nodes in the cluster.
View 5 Replies
View Related
Feb 19, 2010
I recently tried to upgrade from 2.6.23 to 2.6.32 but kept on getting VFS errors. I had overwritten the old 2.6.23 kernel so there was no going back. Luckily,I had the 2.4.31 kernel still in the /boot directory.
Anyway, I finally got some time to figure out the VFS issues and I managed to solve the VFS kernel panics. It was probably a forgotten configuration setting during the make menuconfig. Now it boots up only to hang right after the "Using /etc/random-seed to initialize /dev/urandom".
Looking at the rc.S script, the next section does the dd if=/dev/urandom of=/etc/random-seed count=1 ...
I know for a fact that it isn't that line that hangs the system. I commented out that line and it still hangs.
Strange thing is that I can boot up 2.4.31 with no problem; but with 2.6.x, I get a hang-up. I did a google and came up with something about rc.udev.new, which I don't have.
Can someone point out to me what is executed right after/etc/rc.d/rc.S? Or maybe point out where it might
be hanging? (To be honest, it's either a hang up or a process that's taking an extraordinary long time to
run.)
View 5 Replies
View Related
Mar 15, 2010
What I did not realize was, that DLM uses the external Ethernet Interface even when talking to the local machine/node. So iptables was blocking my DLM daemon. With iptables down or the TCP port for DLM opened, cman starts, mount works.What I have here is a fibrechannel SAN which will be directly attached to several servers in the near future. Thise servers should be enabled access to a single filesystem on the SAN (shared).I heard that the right filesystem choice for this kind of setup would be GFS, because it has a Distributed Lock Manager and one FS journal for each node.
But I am having trouble setting up GFS. I have managed to create a GFS on a small testvolume (local HDD so far), but am unable to mount it. It seems that GFS/DLM needs a lot of cluster services to run, which I do not all understand / know how to correctly setup. Also: Will the lock_dlm stuff need Ethernet communications to handle file locks? And if so, will it fetch the node list from /etc/cluster/cluster.conf to determine who to talk to?
View 1 Replies
View Related
Nov 26, 2010
I created a cluster with two nodes and a machine for managers with luci, if a machine reboot the cluster function by transferring the resource (IP address), if forced to stop the machine (pull the plug) the cluster does not work.
View 1 Replies
View Related
Mar 26, 2009
I have Cent OS 5.1
I also have http://phpsysinfo.sourceforge.net/
I have asterisk running .
Now when I look at System Information ,
I see that "Physical Memory" keep increasing and at one point it reaches 96%. Then my sever get hang and then I have to restart it.
I have 4 GB RAM.
Processors 2
Model Intel(R) Core(TM)2 Duo CPU E7200 @ 2.53GHz
CPU Speed 2.53 GHz
Cache Size 3.00 MB
System Bogomips 10135.71
Kernel Version 2.6.18-53.el5 (SMP)
Distro Name CentOS release 5 (Final)
Load Averages 0.27 0.35 0.31
What could be the reason ?
View 8 Replies
View Related
Feb 19, 2010
I have lack of understanding of CentOS in general. I have looked for a remedy on other forums and google, but haven't been able to find the answer. I have a 3 node cluster that was functioning great until I decided to go offline for awhile. My config is as follows:
node 2: vh1
node 3: vh2
node 4: vh6
All nodes connect to a common shared area on an iscsi device (vguests_root)
Currently vh2 and vh6 connect great, however since putting the machines back online I can no longer connect with vh1. A dmesg command on vh1 reveals the following:
GFS2: fsid=: Trying to join cluster "lock_dlm", "Cluster1:vguest_roots"
GFS2: fsid=Cluster1:vguest_roots.2: Joined cluster. Now mounting FS...
GFS2: fsid=Cluster1:vguest_roots.2: can't mount journal #2
GFS2: fsid=Cluster1:vguest_roots.2: there are only 2 journals (0 - 1) .....
View 1 Replies
View Related
Apr 15, 2009
I'm having some trouble configuring clustering in a 2-node cluster, with no shared FS. Application is video streaming, so outbound traffic only...The cluster is generally ok - if I kill -9 one of the resource-applications, the failover works as expected. But it does not failover when I disconnect the power from the service owning node (simulating a hardware failure). clustat on the remaining node shows that the powered-down node has status "Offline", so it knows the node is not responding, but the remaining node does not become the owner, nor start up the cluster services/resource-applications. eth0 on each node is connected via a crossover cable for heartbeat, etc. Each eth1 connects to a switch.
[root@lmshw01 ~]# cat /etc/cluster/cluster.conf
<?xml version="1.0"?>
<cluster alias="lmshw-clust" config_version="35" name="lmshw-clust">
[code].....
View 3 Replies
View Related
Sep 30, 2010
I have a two node cluster, and a third system which has luci installed.
node1 is nfs0
node2 is nfs1
both nodes have identically the same configuration. They have a fresh installation of Centos 5.5 + yum update. I am unable to join nfs1 to the cluster, as it is giving me the following issue:
Sep 29 23:28:00 nfs0 ccsd[6009]: Starting ccsd 2.0.115:
Sep 29 23:28:00 nfs0 ccsd[6009]: Built: Aug 11 2010 08:25:53
Sep 29 23:28:00 nfs0 ccsd[6009]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
[code].....
View 1 Replies
View Related
Jul 17, 2011
I have just installed a two server cluster with ricci luci and conga on centos 5.6 32bit , both servers are vmware guests and have a shared storage disk connected to them both
with a GFS2 file system on them + fencing agents configured to work with VMware Vcenter.
(this is supported by vmware and works great on 4 other centos clusters i have been runing for 4 monthes with no CLVMD).
In this setup i used for the first time CLVMD as recommnded by RedHat so i could have the flexablitly of LVM under the GFS2 file system but , i have been getting some Strange problem with it , some times after a developer has done some IO heavy task like unziping a file or a simple TAR the load goes to 10 - 15 and no task can be killed , trying to reboot the server hangs.
After hard shutting the server every thing works ok until the next time some one does the same IO work as before.
View 5 Replies
View Related
Dec 17, 2009
I'm trying to install CentOS 5.4-x86 on a "new" motherboard, it's an ABIT Fatal1ty AN9 32X. I have a AMD 6000+ Dual Core running stable at 3.01 ghz. 4GB of ram(4x 1GB). nVidia GeForce 7600. 2x SATA II 160GB WD RAID 0 and 2x SATA II 1TB Seagate RAID 0. This setup has been running fine as a FTP server for months and has been flawless for games for about 3 years.Right off the bat during initial install it hangs before loading any GUI.Last lines:
NFORCE-MCP55: IDE controller PCI slot 0000:00:0c.0
NFORCE-MCP55: chipset revision 161
NFORCE-MCP55: not 100% native mode: will probe irqs later
[code]....
View 2 Replies
View Related
Sep 3, 2010
configure a separate NIC for the heartbeat. what i need to add to cluster.conf to achieve this?
View 1 Replies
View Related
May 27, 2009
I'm running into what is quite possibly one of the strangest problems I've ever encountered. We recently had a power loss, and some of our vmware instances didn't shut down correctly. Once of those is our Zabbix testing monitor. From that point on, whenever I run the "ll" command, my terminal freezes. This happens over both ssh and the local console. I've forced a disk check via the "shutdown -rF now" command, and it returned no errors. Since ll is simply an alias for ls, I copied ls (and the entire bin directory) from another Centos 5.3 instance, with no change in behavior.
View 4 Replies
View Related
Oct 30, 2009
I have just completed an upgrade from 5.3 to 5.4 (64bit) Now when the system starts it gets to Staring udev and then hangs for about 2mins before printing "Timeout" and then continuing.
At various stages during the boot process it just stops with no warning and takes a while to restart.
My boot time in 5.3 was about 50seconds, now with 5.4 it is about 4mins.
View 2 Replies
View Related
Aug 31, 2010
My issue is: I am "locked" on kernel 2.6.18-164, I just can't get my server up and running with any later release. It simply hangs at "Starting udev".
The hardware is:
MB: Asus M2N32-Deluxe
CPU: AMD Phenom(tm) II X4 920 Processor
Boot FS: Ext2
RootFS: xfs
RAM: 4Gb
I have 2.6.18-194.el5 running on other machines.
PS: I'm about to install/update a fresh 5.5 disc for testing purpose anyway. I'll update here if it works.
View 3 Replies
View Related
Mar 24, 2011
Keyboard Hanging, in CentOS. Some time when I login to Linux or when we come back to use Keyboard after a long time work with mouse, my keyboard will hang for few seconds. I don't know its my USB keyboard problem or another issue. I doubt it started after I install opera browser in CentOS or by VNC setup.
View 3 Replies
View Related
May 5, 2011
I just wanna simulate HPC and and other kinds of clusters in VMware workstation 7.0 , in my HP 520 laptop which is dualcore and 3Gb RAM. So can u please help me out regarding this.. I am interested to work in clusters. I am new to this hpc and other clusters..please can any one gve me document on cluster installation and configuration. I would be grateful to them.. I am using Centos OS.
View 4 Replies
View Related
Apr 21, 2009
Using google with search option: cman not started: Can't find local node name in cluster.conf /usr/sbin/ cman_tool: aisexec daemon didn't start.I found this URL...I have found the config_version in cluster.conf. Unfortunately, as everyone may have noticed, english is not my native tongue so I am having trouble understanding the part "Make sure you bumped the cluster config version number". Can anyone enlightened me on what should I be doing so that I could "bump" the cluster config version?
View 3 Replies
View Related
May 6, 2009
i am using the redhat cluster suite (luci and ricci) on my centos 5.3. i have 2 nodes in a cluster. when i poweroff the first node on wich a vm service is running, the service switchtes to node2. so far, so good :) but when i restart node1 the service is not failback to node1! i have created a failover domain with both nodes and priorized whre node1 has prio1 and node2 has prio2.
View 4 Replies
View Related