<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>You do not talk about the SSE 4.1 version in your bench.</p>
<p>Have you tried this use case ?<br>
</p>
<p>Thanks !<br>
</p>
<div class="moz-cite-prefix">Le 04/07/2022 à 19:23, Martijn van
Beurden a écrit :<br>
</div>
<blockquote type="cite"
cite="mid:CADQbU6_MN3nOtyEx5s5d+nUh1STOX6Vq8yYkhSz=kJ90EeUVag@mail.gmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div dir="ltr">
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">Op ma 4 jul. 2022 om 15:06
schreef olivier tristan <<a
href="mailto:o.tristan@uvi.net" moz-do-not-send="true"
class="moz-txt-link-freetext">o.tristan@uvi.net</a>>:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
While I can understand the rationale for manual assembly as
32 bits x86 <br>
is dead, it seems a greater deal to remove all optimization
including <br>
intrinsic ones.<br>
</blockquote>
<div><br>
</div>
<div>Yes, it does seem a great deal to remove all
optimization, but it really isn't. See the pull request
associated with that change for more information: <a
href="https://github.com/xiph/flac/pull/347"
moz-do-not-send="true" class="moz-txt-link-freetext">https://github.com/xiph/flac/pull/347</a>
I did quite a bit of testing before merging this change, on
two different CPUs, each with 3 different compilers, each
with 4 variants of the non-intrinsics-accelerated functions.
It turns out that there is no performance loss at all, and
in many cases this change makes flac actually faster, not
slower as one would expect.<br>
</div>
<div> </div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
Maybe there should be a an opt in if you don't want to be
included by <br>
default but some people including me don't want to see those
<br>
optimization been removed ?<br>
</blockquote>
<div><br>
</div>
<div>There would be no advantage of that over keeping the
original code: it still needs to be maintained and tested,
even if it is hidden behind some configuration option. The
only case where this patch could be problematic in terms of
speed is when one compiles flac to be used on CPUs that do
not support SSE2.<br>
</div>
</div>
</div>
</blockquote>
<pre class="moz-signature" cols="72">--
Olivier Tristan
Research & Development
<a class="moz-txt-link-abbreviated" href="http://www.uvi.net">www.uvi.net</a></pre>
</body>
</html>