Blog posts

2025

quantization levels

2 minute read

Published:

How do you determine the appropriate quantization precision levels for your Large language models?

Fake Model Quantization

2 minute read

Published:

Fake Model Quantization Doesn’t Make Any Difference in Accelerating Model Inference Time