Changeset 39ced529 in ffmpeg
- Timestamp:
-
Jul 28, 2024, 6:24:58 PM
(4 months ago)
- Author:
- Rémi Denis-Courmont <remi@remlab.net>
- Branches:
- master
- Children:
- 2aa63784
- Parents:
- b0b3bea1
- git-author:
- Rémi Denis-Courmont <remi@remlab.net> (07/25/24 19:17:48)
- git-committer:
- Rémi Denis-Courmont <remi@remlab.net> (07/28/24 18:24:58)
- Message:
-
lavu/riscv: implement floating point clips
Unlike x86, fmin/fmax are single instructions, not function calls. They
are much much faster than doing a comparison, then branching based on its
results. With this, audiodsp.vector_clipf gets almost twice as fast, and
a properly unrollled version of it gets 4-5x faster, on SiFive-U74.
This is only the low-hanging fruit: FFMIN and FFMAX are presumably
affected as well.
This likely applies to other instruction sets with native IEEE floats,
especially those lacking a conditional select instruction.
-
(No files)
-