19  Quantization and Kernels

Post-training quantization families, low-precision formats, and the fused kernels that make them fast.

NoteStatus

Outline. Source: new. See INTEGRATION.md.

19.1 Problem

19.2 Design

19.3 Evolution

19.4 Trade-offs

19.5 Implementation

19.6 Further reading

NoteTODO

Establish the seminal, frontier, and primary-source anchors for this chapter.