Start a Conversation

Unsolved

Closed

DH

1 Message

295

June 14th, 2023 11:00

Process to replace failed Data Mover Mgt Module and DIMMs from a recently retired array

Quick highlight of events:

 

Had our Standby DM fail (Amber on Management modules so assuming that is the culprit).  Before we could do anything to address that (Out of warranty) had a production Data Mover fail (Log shows S0,R1&R0 CorrErrOvflowCnt over threshold (1) on 3 DIMMS). As you can imagine that was problematic.

 

In any case, we have recently retired another array and have shipped its Data Mover enclosure (With two Data Movers) to our site to use its parts.

I found a process, via Solve, for replacing the DM mgmt module though I assume that is based off of the assumption you are using new module from factory not one from another array.   Do you think this matters?

For resolving the second DM issue (Appears to be failed DIMMS), I could not find a process in Solve.

Any thoughts?

1. Prep system for maintenance as per previous process (Shut down secondary CS, stop NAS services on CS0 etc)

2. Pull both DM power supplies

3. Pull DM out of enclosure,  swap failed DIMMS (Or should I just swap them all out), and put DM back into enclosure

4. Insert Power Supplies

5. Wait for a while for DM boot

6. Complete Restore NAS services and CSA per previous process

7. Perform system health check

 

Also, my understanding is that shutting down NAS services on the CS won't impact connectivity to PROD NAS services running on the other Data Movers; only impacting NAS service management on the CS.  Am I understanding that correctly?

Any feedback would be appreciated folks.

 

 

 

Moderator

 • 

6.9K Posts

June 19th, 2023 07:00

Hello Denny Hopkins,

I would try to swap all DImms first to see if the issue is resolved.  If the issue is not resolved, then I would replace DM.

As long as your NAS services are configured correctly then you should not impact your production system.

No Events found!

Top