Avamar Gen4T: Power Supply Units (PSU) (or other hardware components) are flapping on a Gen4T node

           

   Article Number:     512611                                   Article Version: 5     Article Type:    Break Fix 
   

 


Product:

 

Avamar,Avamar Data Store Gen4T

 

Issue:

 

 

An Avamar node repeatedly reports Power Supply (PSU) related error messages which flap between "External Fault (asserted)" to "External Fault (deasserted)" and "Power Good (deasserted)" to "Enabled (deasserted)".       
       
        The following messages can be seen in the sel log        
       
        ipmitool sel list 
   
        

      0169 09/05/17 05:23:42 INF BMC CMD Status #10 Enabled (deasserted) 6f [01 ff ff]         
          016a 09/05/17 05:32:07 MAJ BMC CMD Status #10 Power Good (deasserted) 6f [00 ff ff]         
          016b 09/05/17 05:32:07 INF BMC CMD Status #10 Enabled (deasserted) 6f [01 ff ff]         
          016c 09/05/17 05:34:05 MAJ BMC CMD Status #10 Power Good (deasserted) 6f [00 ff ff]         
          016d 09/05/17 05:34:05 INF BMC CMD Status #10 Enabled (deasserted) 6f [01 ff ff]         
          016e 09/05/17 08:05:40 MAJ BMC CMD Status #11 Power Good (deasserted) 6f [00 ff ff]         
          016f 09/05/17 08:05:40 INF BMC CMD Status #11 Enabled (deasserted) 6f [01 ff ff]         
          0170 09/05/17 13:03:13 MAJ BMC CMD Status #11 Power Good (deasserted) 6f [00 ff ff]         
          0171 09/05/17 13:03:13 INF BMC CMD Status #11 Enabled (deasserted) 6f [01 ff ff]         
          0172 09/05/17 14:30:34 MAJ BMC CMD Status #10 Power Good (deasserted) 6f [00 ff ff]         
          0173 09/05/17 14:30:34 INF BMC CMD Status #10 Enabled (deasserted) 6f [01 ff ff]
   
   
    The same messages can be seen in /var/log/messages:        
       
        grep -i "BMC CMD Status" /var/log/messages
   
        
      Aug 31 13:47:55 test-gen4t-single01 ipmiutil: igetevent-gen4t: 010e 08/31/17 13:47:52 MAJ BMC CMD Status #11 Power Good (deasserted) 6f [00 ff ff]         
          Aug 31 13:47:55 test-gen4t-single01 ipmiutil: igetevent-gen4t: 010f 08/31/17 13:47:52 INF BMC CMD Status #11 Enabled (deasserted) 6f [01 ff ff]         
          Aug 31 16:29:11 test-gen4t-single01 ipmiutil: igetevent-gen4t: 0110 08/31/17 16:29:07 MAJ BMC CMD Status #10 Power Good (deasserted) 6f [00 ff ff]         
          Aug 31 16:29:11 test-gen4t-single01 ipmiutil: igetevent-gen4t: 0111 08/31/17 16:29:07 INF BMC CMD Status #10 Enabled (deasserted) 6f [01 ff ff]         
          Aug 31 20:47:58 test-gen4t-single01 ipmiutil: igetevent-gen4t: 0112 08/31/17 20:47:54 MAJ BMC CMD Status #11 Power Good (deasserted) 6f [00 ff ff]         
          Aug 31 20:47:58 test-gen4t-single01 ipmiutil: igetevent-gen4t: 0113 08/31/17 20:47:54 INF BMC CMD Status #11 Enabled (deasserted) 6f [01 ff ff]         
          Aug 31 20:48:31 test-gen4t-single01 ipmiutil: igetevent-gen4t: 0114 08/31/17 20:48:26 MAJ BMC CMD Status #10 Power Good (deasserted) 6f [00 ff ff]
     
          
   
      The PSUs on the affected node do not report any errors:         
         
          avsysreport power-supply
     
          
   
      === Power supply redundancy       
        Status           : Ok         
          Redundancy State : Fully Redundant
       
       
        === Power supplies       
        Power Supply ID   : 0       
        Status            : Ok       
        Location          : PSA       
        Operational State : Inserted, Power Good       
        Firmware Revision : 4.27.0.1       
        Serial Number     : FPSAG164101986       
       
        Power Supply ID   : 1       
        Status            : Ok       
        Location          : PSB       
        Operational State : Inserted, Power Good       
        Firmware Revision : 4.27.0.1       
        Serial Number     : FPSAG164102070
   
   
          
                                                             

 

 

Cause:

 

 

The messages indicate instability in the 12V bus internal to the node. They do not indicate a power supply problem. Do not replace a power supply for this issue unless a hard failure is indicated. Avamar Hardware Engineering Team is still investigating this issue.                                                            

 

 

Resolution:

 

 

Please open a Service Request with DELL|EMC Avamar Support to investigate this issue.