Re: [PATCH 2.6.13 8/7] MCA/INIT: tweaks

From: Keith Owens <kaos_at_sgi.com>
Date: 2005-09-08 17:01:21
On Thu, 08 Sep 2005 16:53:18 +1000, 
Keith Owens <kaos@sgi.com> wrote:
>Some tweaks to the previous MCA/INIT patch sets.
>
>* Remove the requirement that kernel stacks be aligned on KERNEL_STACK_SIZE.
>* Remove the serialization of MCA/INIT handlers returning to SAL.  The
>  problem looked like a race but was really caused by a broken prom
>  doing cacheable accesses to the minstate area.
>* Print the cpu number and monarch status in the INIT handler.
>* Workaround for broken proms that access the minstate area using
>  cacheable addresses.

With the above tweaks, the new MCA/INIT handlers pass all my stress
tests.  On SGI systems I can send INIT tens of times without any
problem, the system dumps the tasks and keeps going.  I was also
running ia64regcheck at the same time, it passed the test, no registers
were corrupted by INIT.

All the problems that have been reported against the new MCA/INIT
handlers have been caused by SAL not conforming to the SAL
specification.  Some versions of SAL only call the monarch cpu and not
the slaves for INIT.  Some versions of SAL call all cpus as monarchs.
Some versions of SAL do not resume correctly after INIT.  Even on these
broken versions of SAL, the new OS handlers give better results that
the existing OS handlers.  On working versions of SAL, INIT is now
fully recoverable.  On working versions of SAL, a recoverable MCA which
needs to send INIT in order to rendezvous can now be successfully
resumed.

This code is definitely ready for inclusion in 2.6.13-rc1.

-
To unsubscribe from this list: send the line "unsubscribe linux-ia64" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Received on Thu Sep 08 17:01:59 2005

This archive was generated by hypermail 2.1.8 : 2005-09-08 17:02:05 EST