[tremor] [PATCH] 12% global performance gain on a StrongARM

Nicolas Pitre nico at cam.org
Thu Sep 19 11:18:24 PDT 2002



On Thu, 19 Sep 2002, Chris Gilbert wrote:

> > In some cases, the compiler would have inverted the two assignments.
> > However *x really needs to be written to memory first since the value of
> > y1  is still being processed in the pipeline by the smlal instruction.
> > The  memory barrier ensure that ordering.
> 
> Wouldn't it be better to push the result storing into the asm, to be
> certain that the ordering is always enforced correctly, and the correct
> number of cycles is waited?  Although I'm surprised that the processor
> doesn't actually do a stall to wait for the result.

It does, but the idea is to fill the pipeline with useful work instead of 
just stalling and wasting cycles.

<p>Nicolas

--- >8 ----
List archives:  http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to 'tremor-request at xiph.org'
containing only the word 'unsubscribe' in the body.  No subject is needed.
Unsubscribe messages sent to the list will be ignored/filtered.



More information about the Tremor mailing list