[tremor] [PATCH] 12% global performance gain on a StrongARM
Nicolas Pitre
nico at cam.org
Thu Sep 19 11:18:24 PDT 2002
On Thu, 19 Sep 2002, Chris Gilbert wrote:
> > In some cases, the compiler would have inverted the two assignments.
> > However *x really needs to be written to memory first since the value of
> > y1 is still being processed in the pipeline by the smlal instruction.
> > The memory barrier ensure that ordering.
>
> Wouldn't it be better to push the result storing into the asm, to be
> certain that the ordering is always enforced correctly, and the correct
> number of cycles is waited? Although I'm surprised that the processor
> doesn't actually do a stall to wait for the result.
It does, but the idea is to fill the pipeline with useful work instead of
just stalling and wasting cycles.
<p>Nicolas
--- >8 ----
List archives: http://www.xiph.org/archives/
Ogg project homepage: http://www.xiph.org/ogg/
To unsubscribe from this list, send a message to 'tremor-request at xiph.org'
containing only the word 'unsubscribe' in the body. No subject is needed.
Unsubscribe messages sent to the list will be ignored/filtered.
More information about the Tremor
mailing list