Why are the values of learnables in a quantized dlnetwork still stored as float32 (single precision)?

Even though dlquantizer quantizes the weights of a fully connected layer to int8 and its bias to int32, why do I see that the values in the quantized dlnetwork are still stored as float32 (single precision)?
Also, how can I find out whether dlquantizer can quantize a particular layer?

Accepted Answer

MathWorks Fixed Point Team
Edited: MathWorks Fixed Point Team on 18 Jul 2025
Yes, the learnables in the quantized dlnetwork are still stored in single precision.
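You can confirm this yourself; here is a minimal sketch, assuming quantizedNet is the dlnetwork returned by quantize() on a calibrated dlquantizer object (the variable name is illustrative):

% Inspect how the learnables of the quantized network are stored.
learnables = quantizedNet.Learnables;      % table with Layer / Parameter / Value columns
firstValue = learnables.Value{1};          % a dlarray holding one learnable parameter
disp(class(extractdata(firstValue)))       % reports 'single', even after quantization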
To see how much parameter memory the quantized network would use once deployed, consider estimating it with estimateNetworkMetrics: https://www.mathworks.com/help/deeplearning/ref/estimatenetworkmetrics.html.
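For example, a sketch of that comparison, assuming net is the original dlnetwork and quantizedNet is its quantized counterpart from an earlier dlquantizer workflow:

% Compare estimated metrics for the original and quantized networks.
metricsOriginal  = estimateNetworkMetrics(net);
metricsQuantized = estimateNetworkMetrics(quantizedNet);
disp(metricsOriginal)      % includes an estimate of learnable parameter memory
disp(metricsQuantized)     % estimate reflects int8/int32 storage after deployment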
The layers that dlquantizer decides to quantize are listed here: https://www.mathworks.com/help/deeplearning/ug/supported-layers-for-quantization.html. The list changes across releases and varies with the intended target.
The 'Analyze for Compression' feature (available in R2025a) in the Deep Network Designer app shows you which layers in your network are supported for quantization, which can be friendlier than manually comparing against the supported-layers documentation page. It currently analyzes only for the MATLAB execution environment.
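For reference, a minimal sketch of the overall quantization workflow, assuming net is a trained dlnetwork and calDS is a datastore of representative calibration inputs (both names are assumptions, and the Deep Learning Toolbox Model Quantization Library is required):

% Create the quantizer, calibrate on representative data, then quantize.
quantObj = dlquantizer(net, ExecutionEnvironment="MATLAB");
calResults = calibrate(quantObj, calDS);   % collect dynamic ranges over the calibration data
quantizedNet = quantize(quantObj);         % returns a dlnetwork with quantized behavior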

More Answers (0)
