Dell EMC VxRack iDRAC logs the following event: MEM0702     Correctable memory error rate exceeded for DIMM (Bank/Slot)

           

   Article Number:     530456                                   Article Version: 4     Article Type:    Break Fix 
   

 


Product:

 

PowerEdge R630 Rack Server,PowerEdge R640 Rack Server,PowerEdge R730 Rack Server,PowerEdge R730xd Rack Server,PowerEdge R740 Rack Server,PowerEdge R740XD Rack Server,VxRack Flex Series,VxRack Node,VxFlex OS,Ready Node Series

 

Issue:

 

 

VxRack iDRAC logs the following event: MEM0702    Correctable memory error rate exceeded for DIMM (Bank/Slot)                                                               

 

 

Cause:

 

 

The memory may not be operational.(See Resolution Scenarios) This is an early indicator of a possible future uncorrectable error.   
   
    Memory errors can show in a number of ways on your system, and might vary depending on the age of your system or (system generation). There might also be slight variations based on your system firmware levels. The error messages can appear in one or more of BIOS message on post, iDRAC logs, OpenManage System Administrator (OMSA) logs, System LCD display or in the Operating system.   
   
    Many of these errors can also be prevented by ensuring your firmware levels are up to date.   
    Note: If the system is new, or have been recently moved, some components, including the memory could have become incorrectly seated due to the vibrations, and all memory modules and other components should be re-seated (taken out, and put back in) before continuing troubleshooting.   
     
                                                           

 

 

Resolution:

 

 

Common Scenarios   
   
    13G Systems BIOS is lower than 2.7.1   
    Clear events and upgrade bios to 2.7.1. Troubleshooting has proven that nodes on BIOS 2.5.5 or lower report false ECC correctable alerts.  Nodes upgraded to BIOS 2.7.1 stopped reporting this error.   
   
    13G Systems BIOS 2.7.1 or higher   
    The memory may not be operational. This is an early indicator of a possible future uncorrectable error.    Perform Dimm swap test: Swap the module with another identical module in the system, Eg. Swap A2 with B2 if A2 is the DIMM reporting the fault. Then see if the error follows the module or not. If the issue persists, perform a VxRack Flex Dell 13G node replacement with Satadom Migration procedure. See Solve procedure for more information.   
   
    14G Systems BIOS versions older than 2.1.8   
    Perform DIMM swap test: Swap the module with another identical module in the system, Eg. Swap A2 with B2 if A2 is the DIMM reporting the fault. Then see if the error follows the module or not. If the issue persists, Contact Support as a memory replacement might be needed