aboutsummaryrefslogtreecommitdiffstats
path: root/scripts/qapi/source.py
diff options
context:
space:
mode:
authorAlexander Monakov <amonakov@ispras.ru>2024-02-06 23:48:08 +0300
committerRichard Henderson <richard.henderson@linaro.org>2024-05-03 08:03:05 -0700
commitf28e0bbefa41fe643cce2f107e868abff312ced9 (patch)
tree933db7fedccb1c2590441909271db03ff8cba52f /scripts/qapi/source.py
parentutil/bufferiszero: Remove useless prefetches (diff)
downloadqemu-f28e0bbefa41fe643cce2f107e868abff312ced9.tar.xz
qemu-f28e0bbefa41fe643cce2f107e868abff312ced9.zip
util/bufferiszero: Optimize SSE2 and AVX2 variants
Increase unroll factor in SIMD loops from 4x to 8x in order to move their bottlenecks from ALU port contention to load issue rate (two loads per cycle on popular x86 implementations). Avoid using out-of-bounds pointers in loop boundary conditions. Follow SSE2 implementation strategy in the AVX2 variant. Avoid use of PTEST, which is not profitable there (like in the removed SSE4 variant). Signed-off-by: Alexander Monakov <amonakov@ispras.ru> Signed-off-by: Mikhail Romanov <mmromanov@ispras.ru> Reviewed-by: Richard Henderson <richard.henderson@linaro.org> Message-Id: <20240206204809.9859-6-amonakov@ispras.ru>
Diffstat (limited to 'scripts/qapi/source.py')
0 files changed, 0 insertions, 0 deletions