BitsandbytesQuantization Configuration
Bits and bytes quantization parameters.
Parameters
| Name | Type | Required | Description |
|---|---|---|---|
load_in_8bits | boolean | ❌ | Whether to load the model in 8 bits(LLM.int8() algorithm), default is False. Defaults: False |
load_in_4bits | boolean | ❌ | Whether to load the model in 4 bits, default is False. Defaults: False |