Indeni

Check Point Alert of the Week: Cluster member down due to NIC error

It’s so easy sometimes to disconnect the wrong cable.

This is a real life sample alert from indeni.

Description:

The cluster member is in a critical state due to one or more monitored interfaces being down or disconnected from the network.

NICs Down: eth7

Remediation Steps:

For eth7 (Bandwidth: 1000M/full, MAC Address: 00:1C:7F:30:9E:14): While the NIC itself appears to be active and working well, ClusterXL considers a NIC to be down if it is connected to a network where there are no other hosts, including no other cluster members. If the configuration is correct, lack of support for multicast at one of the switches being used could also stop ClusterXL from operating. In order NOT to monitor this interface, add a line containing only “eth7” to the list in the $FWDIR/conf/discntd.if file (create the file if it doesn’t exist) or use the ifdown command to remove the interface entirely from the system.

Read SK30060 for more information.

How does this alert work?

indeni continuously monitors “cphaprob -a if” and compares it to other networking data (such as “ifconfig -a” and “arp -an”) to determine the cause for the NIC being considered down by ClusterXL. Note that indeni does not rely on the log messages for this alert (known as “cluster_info: (ClusterXL) interface is down”).

Interested in learning more? Download our solution for Checkpoint here.

Leave a Reply