95. Quantization
The process of reducing the number of bits that represent numbers in a model, aiming to decrease model size and increase inference speed.
Last updated
The process of reducing the number of bits that represent numbers in a model, aiming to decrease model size and increase inference speed.
Last updated