Texas A&M Supercomputing Facility Texas A&M University Texas A&M Supercomputing Facility

Hydra is Off-Line for Maintenance (Update #1)

This is a quick update on the status of Hydra maintenance. Please see this previous announcement for details on the nature and objectives of the current maintenance.

On Fri. 02/03/2012, the DDN 9550 disk storage of Hydra was powered down in the process of physically moving the connectivity from dead node f1n1 to a new replacement one. When the DDN9550 was powered back up to resume operation, one of its controllers (Singlet #2) did not power up to a stable state and it became itself inaccessible. Several attempts to wake up this controller were proven fruitless. Finally, DDN the vendor of this disk storage suggested they ship a new controller to replace controller 2.

We are waiting to receive a new controller and replace the one which died. In the meantime the process of trying to replace the failed f1n1 as a storage controller has been suspended as the storage array is unstable and not fully functional.

After we replace the failed controller and ensure that the disk storage is fully operational, we will resume the healing of the GPFS I/O subsystem of Hydra. Please keep monitoring our web site for updates.

Posted on: 6:33 PM, February 7, 2012