Topics

LLMs are normally represented in 16-bit or 32-bit precision, whereas 1-bit LLMs use a single bit to store each weight. Since the total memory for the weights is (size of one weight) × (number of weights), this yields a significant reduction in memory footprint.

Example

LLaMA 7B in its normal 32-bit representation takes up 4 bytes × 7B ≈ 26.08 GiB, whereas in the 1-bit case it takes up only 0.125 bytes × 7B ≈ 0.815 GiB (note that 1 byte = 8 bits).
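
A minimal sketch of this arithmetic, assuming a hypothetical helper `model_size_gib` (not from the original) that applies bytes per weight × number of weights and converts to GiB:

```python
GIB = 1024 ** 3  # bytes in one GiB


def model_size_gib(num_weights: float, bits_per_weight: float) -> float:
    """Estimated weight-storage size in GiB for a given precision."""
    bytes_per_weight = bits_per_weight / 8
    return num_weights * bytes_per_weight / GIB


num_weights = 7e9  # LLaMA 7B parameter count

print(f"32-bit: {model_size_gib(num_weights, 32):.2f} GiB")  # ~26.08 GiB
print(f"16-bit: {model_size_gib(num_weights, 16):.2f} GiB")  # ~13.04 GiB
print(f" 1-bit: {model_size_gib(num_weights, 1):.3f} GiB")   # ~0.815 GiB
```

Running this reproduces the numbers above and also shows the 16-bit case for comparison.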