Latency in floating point
    7 views (last 30 days)
  
       Show older comments
    
hello,
We have implemented a simple multiply/add design in simulink were single precision datatype was used , after generation of VHDL code and post imolementation in vivado we obtained the RTL circuit shown below
my question is Why the Floating Point Multiplication is faster than Floating Point Addition

3 Comments
  Walter Roberson
      
      
 on 7 Feb 2020
				I would need to do a bunch of research on what that architecture is, but I notice that they say:
"Enhanced DSP slices incorporating 27x18-bit multipliers and dual adders that enable a massive jump in fixed- and IEEE Std 754 floating-point arithmetic performance and efficiency"
That suggests to me that they put more resources into multiplication than they do into addition.
IEEE 754 has a bunch of decoding overhead that makes it less efficient than could be done, but that overhead would be much the same for addition and multiplication.
It is not uncommon for vendors to analyze the instructions that their devices tend to be used for, and heavily optimize the most commonly used instructions and dedicate less to instructions that the vendor finds make up less of what the customers want to do.
Answers (1)
  Nabil Mederbel
 on 7 Feb 2020
        1 Comment
  Walter Roberson
      
      
 on 7 Feb 2020
				
      Edited: Walter Roberson
      
      
 on 19 Jan 2021
  
			No, HDL Coder permits you to choose floating-point latency.
If you look at various CPUs you will see that the latency of floating-point operations varies quite a bit.
For example https://www.researchgate.net/publication/3798292_Reduced_latency_IEEE_floating-point_standard_adder_architectures  describes an adder with 3 or 2 cycle latency.
ARM has a 4 cycle latency for FADD and a 5 cycle latency for FMUL
The values used by default in HDL Coder probably represent some particular implementation that happens to have put more design work into multiplication than addition. But as well it would be important check the interval specifications for the instructions: you can plausibly start another addition every clock cycle but multiplication only every few clock cycles.
See Also
Categories
				Find more on Speed Optimization in Help Center and File Exchange
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!
