Skip to main content
Version: dev

BitsandbytesQuantization Configuration

Bits and bytes quantization parameters.

Parameters

NameTypeRequiredDescription
load_in_4bitsboolean
Whether to load the model in 4 bits, default is False.
Defaults:False
load_in_8bitsboolean
Whether to load the model in 8 bits(LLM.int8() algorithm), default is False.
Defaults:False