跳到主要内容
版本:dev

BitsandbytesQuantization Configuration

Bits and bytes quantization parameters.

Parameters

NameTypeRequiredDescription
load_in_4bitsboolean
Whether to load the model in 4 bits, default is False.
Defaults:False
load_in_8bitsboolean
Whether to load the model in 8 bits(LLM.int8() algorithm), default is False.
Defaults:False