95. Quantization

The process of reducing the number of bits that represent numbers in a model, aiming to decrease model size and increase inference speed.

Last updated