llmcompressor.pipelines
Compression pipelines for orchestrating different compression strategies.
Provides the basic, sequential, independent, layer-sequential, and data-free compression pipelines. Each pipeline coordinates compression techniques and workflows to suit specific requirements and constraints.
Modules:
- basic
- cache
- data_free
- independent
- layer_sequential
- registry
- sequential
Classes:
- BasicPipeline
- CalibrationPipeline
- DataFreePipeline
- IndependentPipeline
- LayerSequentialPipeline
- SequentialPipeline
BasicPipeline
Bases: CalibrationPipeline
CalibrationPipeline
Bases: ABC, RegistryMixin
Methods:
- from_modifiers – Infer which calibration pipeline to use based on the available modifiers and any user specifications
from_modifiers classmethod
Infer which calibration pipeline to use based on the available modifiers and any user specifications
Parameters:
- modifiers (List[Modifier]) – modifiers to apply to model
- user (Optional[str], default: None) – pipeline name passed by user
Returns:
- CalibrationPipeline – CalibrationPipeline instance to be called with data (if not datafree)
Source code in llmcompressor/pipelines/registry.py
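The selection behaviour described for from_modifiers can be illustrated with a small, self-contained sketch. It does not import llmcompressor and is not the library's implementation: ToyModifier, ToyPipeline, the needs_data flag, the register decorator, and the inference rule (use "sequential" if any modifier needs calibration data, otherwise "datafree") are all hypothetical names and assumptions chosen for illustration. Only the overall shape, a classmethod that infers a registered pipeline from the modifiers and lets a user-supplied name take precedence, follows the description above.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, List, Optional, Type


class ToyModifier:
    """Hypothetical stand-in for a Modifier (not the llmcompressor class)."""

    def __init__(self, name: str, needs_data: bool) -> None:
        self.name = name
        self.needs_data = needs_data  # assumed flag, for illustration only


class ToyPipeline(ABC):
    """Hypothetical stand-in for CalibrationPipeline with a tiny name registry."""

    _registry: Dict[str, Type["ToyPipeline"]] = {}

    @classmethod
    def register(cls, name: str) -> Callable[[Type["ToyPipeline"]], Type["ToyPipeline"]]:
        # Decorator that records a subclass under a lookup name.
        def decorator(subclass: Type["ToyPipeline"]) -> Type["ToyPipeline"]:
            cls._registry[name] = subclass
            return subclass

        return decorator

    @classmethod
    def from_modifiers(
        cls, modifiers: List[ToyModifier], user: Optional[str] = None
    ) -> "ToyPipeline":
        # Assumed inference rule: any data-dependent modifier selects the
        # "sequential" pipeline; otherwise the "datafree" pipeline is enough.
        inferred = "sequential" if any(m.needs_data for m in modifiers) else "datafree"
        # A pipeline name passed by the user takes precedence over the inference.
        name = user or inferred
        return cls._registry[name]()

    @abstractmethod
    def __call__(self, data: Optional[list] = None) -> str:
        ...


@ToyPipeline.register("datafree")
class ToyDataFree(ToyPipeline):
    def __call__(self, data: Optional[list] = None) -> str:
        return "datafree: no calibration data used"


@ToyPipeline.register("sequential")
class ToySequential(ToyPipeline):
    def __call__(self, data: Optional[list] = None) -> str:
        return f"sequential: calibrated on {len(data or [])} samples"


# Usage: the returned instance is called with data unless it is data-free.
pipeline = ToyPipeline.from_modifiers([ToyModifier("toy_quant", needs_data=True)])
result = pipeline(["sample_a", "sample_b"])
```

Keeping the lookup in a name registry means a new pipeline only needs to register itself; the selection logic in from_modifiers stays unchanged.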
DataFreePipeline
Bases: CalibrationPipeline
IndependentPipeline
Bases: CalibrationPipeline
LayerSequentialPipeline
Bases: CalibrationPipeline
SequentialPipeline
Bases: CalibrationPipeline