How to optimize multiplications with HDL Coder

Question

0 votes

Hello community

I'm using Simulink to generate VHDL code. The system runs at a 2.5 MHz sample rate and is supposed to run on a target architecture clocked at 100 MHz.

The model uses quite a lot of constant multiplications, too many to fit on an FPGA, i.e. the available DSP blocks are not sufficient.

Since the FPGA runs on a much faster clock than the Simulink model, I want to use the 40 clock cycles available per sample (100 MHz / 2.5 MHz) to optimize the system, either by implementing the multiplications with the shift-and-add algorithm or by multiplexing between the hardware multipliers.
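As a rough sketch of the cycle budget described above: the ratio of the FPGA clock to the sample rate gives the number of fast clock cycles per sample, which bounds how many products a single time-multiplexed multiplier can compute per sample. The function names and the figure of 120 multiplications are hypothetical, chosen only for illustration; they are not taken from the model in question.

```python
import math


def cycles_per_sample(clock_hz: int, sample_hz: int) -> int:
    """Number of whole fast-clock cycles available within one sample period."""
    return clock_hz // sample_hz


def multipliers_needed(num_products: int, cycles: int) -> int:
    """Minimum number of shared multipliers if each one computes
    at most one product per fast-clock cycle."""
    return math.ceil(num_products / cycles)


budget = cycles_per_sample(100_000_000, 2_500_000)
print(budget)                           # 40
print(multipliers_needed(120, budget))  # 3 (120 is an assumed count)
```

This ignores the multiplexers, registers and control logic that sharing adds, so it is an upper bound on the saving rather than a resource estimate.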

1) Is it possible to automatically implement the shift-and-add approach with HDL Coder? If yes, how?

2) Is it possible to automatically implement the multiplexing approach with HDL Coder? If yes, how?

3) Is it possible to automatically approximate a constant with the nearest fixed-point representation and then implement the multiplication as a shift operation only? If yes, how?

1 Comment

Kiran Kintali on 21 Jul 2020

Can you share your design? What are the sizes of multipliers you have in mind?

ConstMultiplierOptimization (CSD/FCSD) would be a good choice if you have Gain blocks (multiplication by constants) and you do not want to use hard multipliers on the FPGA.

Answer 1

Kiran Kintali on 21 Jul 2020

1 vote

ConstMultiplierOptimization

The ConstMultiplierOptimization implementation parameter lets you specify use of canonical signed digit (CSD) or factored CSD optimizations for processing coefficient multiplier operations in the generated code.

The following table shows the ConstMultiplierOptimization parameter values.

ConstMultiplierOptimization Setting: Description

'none' (Default): By default, HDL Coder does not perform CSD or FCSD optimizations. Code generated for the Gain block retains multiplier operations.

'CSD': When you specify this option, the generated code decreases the area used by the model while maintaining or increasing clock speed, using canonical signed digit (CSD) techniques. CSD replaces multiplier operations with add and subtract operations. CSD minimizes the number of addition operations required for constant multiplication by representing binary numbers with a minimum count of nonzero digits.

'FCSD': This option uses factored CSD (FCSD) techniques, which replace multiplier operations with shift and add/subtract operations on certain factors of the operands. These factors are generally prime but can also be a number close to a power of 2, which favors area reduction. This option lets you achieve a greater area reduction than CSD, at the cost of decreasing clock speed.

'auto': When you specify this option, HDL Coder chooses between the CSD or FCSD optimizations. The coder chooses the optimization that yields the most area-efficient implementation, based on the number of adders required. When you specify 'auto', the coder does not use multipliers, unless conditions are such that CSD or FCSD optimizations are not possible (for example, if the design uses floating-point arithmetic).
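To make the CSD and FCSD descriptions above more concrete, here is a minimal Python sketch of the underlying arithmetic. It is an illustration of the general technique only, not HDL Coder's implementation. The helper names (csd_digits, csd_multiply, adder_count) and the constant 45 with its factoring 5 * 9 are assumptions chosen for the example.

```python
def csd_digits(n: int) -> list[int]:
    """Canonical signed digit form of a non-negative integer.

    Returns digits in {-1, 0, +1}, least significant first, with no two
    adjacent nonzero digits.
    """
    digits = []
    while n != 0:
        if n & 1:
            d = 2 - (n & 3)  # +1 if n % 4 == 1, -1 if n % 4 == 3
            n -= d
        else:
            d = 0
        digits.append(d)
        n >>= 1
    return digits


def csd_multiply(x: int, constant: int) -> int:
    """Compute x * constant using only shifts, adds and subtracts."""
    acc = 0
    for shift, d in enumerate(csd_digits(constant)):
        if d == 1:
            acc += x << shift
        elif d == -1:
            acc -= x << shift
    return acc


def adder_count(constant: int) -> int:
    """Adders/subtractors needed: one fewer than the nonzero digit count."""
    nonzero = sum(1 for d in csd_digits(constant) if d != 0)
    return max(nonzero - 1, 0)


print(csd_digits(7))         # [-1, 0, 0, 1], i.e. 7 = 8 - 1
print(csd_multiply(13, 45))  # 585
print(adder_count(45))       # 3 adders for 45 = 64 - 16 - 4 + 1

# Factored variant (assumed factoring 45 = 5 * 9): two chained stages.
print(csd_multiply(csd_multiply(13, 5), 9))  # 585
print(adder_count(5) + adder_count(9))       # 2 adders in total
```

The factored version needs fewer adders but chains them one after another, which is consistent with the table's note that FCSD trades clock speed for area.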

The ConstMultiplierOptimization parameter is available for the following blocks:

Gain
Stateflow® chart
Truth Table
MATLAB Function
MATLAB System

9 Comments

Dominik Hiltbrunner on 22 Jul 2020

Hello Kiran Kintali

Once again thank you for your answers.

I tested the settings on a small subsystem and so far the results are looking good. The CSD optimization gives the best result for me. I have some additional questions regarding optimization options.

1) My target architecture will most likely have 18x18 multipliers. Therefore, I set the MultiplierPartitioningThreshold to 18. This works fine on my subsystem, i.e. the resource report shows me that 18x18 multipliers have been synthesized. However, if I use the same option on my whole project, the report shows me that 64x64 multipliers were used. Are there situations in which the slicing fails?

2) What about resource sharing? Many of my constant multiplications are equal. Thus, it should be possible to use the same hardware by multiplexing. The options shown here do not exist in my Simulink settings / Workflow Advisor (I'm using the R2020a release), i.e. there is no "Resource sharing factor" option. There is a "Resource sharing" option in the global settings, but there I can only specify a "minimum bitwidth", and I cannot find in the documentation what exactly it does.

3) My model has over 30 Gain blocks. Is it possible to set the CSD optimization for all blocks at once rather than right-clicking each block and selecting the desired setting?

Dominik Hiltbrunner on 24 Jul 2020

Pressing Ctrl+D doesn't change this behaviour; the code generator still ignores the optimization. Any further help would be appreciated.

Dominik Hiltbrunner on 24 Jul 2020

I found the problem: it is the word length. If I reduce the word length to 64 bits, then Simulink implements CSD and FCSD. If I reduce it further to 32 bits, then it also implements the multiplier slicing.
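The multiplier slicing discussed in this thread can be pictured with the following Python sketch, which splits a wide unsigned product into partial products whose operands are at most 18 bits wide. This is only a model of the general idea, not of what HDL Coder generates. The function name and test values are hypothetical, and signed operands (which real DSP slices usually handle) are left out for simplicity.

```python
def partitioned_multiply(a: int, b: int, chunk_bits: int = 18) -> tuple[int, int]:
    """Multiply two unsigned integers using only narrow multiplications.

    Returns (product, number_of_narrow_multiplications_used).
    """
    mask = (1 << chunk_bits) - 1

    def limbs(v: int) -> list[int]:
        # Split v into chunk_bits-wide pieces, least significant first.
        out = []
        while True:
            out.append(v & mask)
            v >>= chunk_bits
            if v == 0:
                break
        return out

    total = 0
    count = 0
    for i, ai in enumerate(limbs(a)):
        for j, bj in enumerate(limbs(b)):
            # Each ai * bj fits a chunk_bits x chunk_bits multiplier.
            total += (ai * bj) << (chunk_bits * (i + j))
            count += 1
    return total, count


product, used = partitioned_multiply(0xDEADBEEF, 0x12345678)
print(product == 0xDEADBEEF * 0x12345678)  # True
print(used)                                # 4 narrow multiplies for 32-bit operands

_, used64 = partitioned_multiply((1 << 64) - 1, (1 << 64) - 1)
print(used64)                              # 16 narrow multiplies for 64-bit operands
```

The count grows with the square of the number of slices per operand, which gives a feel for why wide word lengths are expensive to map onto 18x18 multipliers.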
