Re: [patch] Remove limit on MCA recoveries

From: Russ Anderson <rja_at_sgi.com>
Date: 2005-01-18 08:28:28
Keith Owens wrote:
> Russ Anderson <rja@sgi.com> wrote:
>
> >My only real complaint about the array is that the current size
> >is too small.  The Altix error injection test (which can modify
> >the ECC to create true memory uncorrectables) can recover from 
> >several hundred memory uncorrectable errors, when the array 
> >limitation is removed.  Making the array dynamic (linked list or 
> >something), so the size can be expanded would be a sufficient 
> >solution, too.
> 
> Do not even think about making the array a dynamic size.  MCA events
> can occur even when interrupts are disabled, which means that in an MCA
> you have no idea if any of the kernel structures are in a safe state or
> not.  That in turn means that you cannot extend the array from MCA
> context.  Page flags that indicate bad memory are the best way to go.

Good point Keith.

Thanks,
-- 
Russ Anderson, OS RAS/Partitioning Project Lead  
SGI - Silicon Graphics Inc          rja@sgi.com
-
To unsubscribe from this list: send the line "unsubscribe linux-ia64" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Received on Mon Jan 17 16:29:51 2005

This archive was generated by hypermail 2.1.8 : 2005-08-02 09:20:34 EST