llmcompressor.pipelines
Compression pipelines for orchestrating different compression strategies.
Provides basic, sequential, independent, layer-sequential, and data-free compression pipelines. Each pipeline coordinates compression techniques and workflows to suit specific requirements and constraints.
Modules:
- basic
- cache
- data_free
- independent
- layer_sequential
- registry
- sequential
Classes:
- BasicPipeline
- CalibrationPipeline
- DataFreePipeline
- IndependentPipeline
- LayerSequentialPipeline
- SequentialPipeline
BasicPipeline
Bases: CalibrationPipeline
CalibrationPipeline
Bases: ABC, RegistryMixin
Methods:
- from_modifiers – Infer which calibration pipeline to use based on the available modifiers and any user specifications
from_modifiers classmethod
Infer which calibration pipeline to use based on the available modifiers and any user specifications
Parameters:
- modifiers (List[Modifier]) – modifiers to apply to model
- user (Optional[str], default: None) – pipeline name passed by user
Returns:
- CalibrationPipeline – CalibrationPipeline instance to be called with data (if not datafree)
Source code in llmcompressor/pipelines/registry.py
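The selection behaviour described above can be illustrated with a small standalone sketch. It does not import llmcompressor and is not the library's implementation: the `Modifier` stand-in, its `needs_data` flag, the `register` decorator, the registry names "datafree" and "sequential", and the inference rule (data-free only when no modifier needs calibration data, with an explicit `user` name taking precedence) are all assumptions made for illustration.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, List, Optional, Type


class Modifier:
    """Hypothetical stand-in for a compression modifier."""

    def __init__(self, name: str, needs_data: bool) -> None:
        self.name = name
        self.needs_data = needs_data  # assumed flag, not a library attribute


class CalibrationPipeline(ABC):
    """Stand-in base class with a simple name-to-class registry."""

    _registry: Dict[str, Type["CalibrationPipeline"]] = {}

    @classmethod
    def register(cls, name: str) -> Callable:
        # Decorator that records a pipeline class under a lookup name.
        def decorator(pipeline_cls):
            cls._registry[name] = pipeline_cls
            return pipeline_cls

        return decorator

    @classmethod
    def from_modifiers(
        cls, modifiers: List[Modifier], user: Optional[str] = None
    ) -> "CalibrationPipeline":
        # Assumed rule: calibration data is needed if any modifier asks for it.
        needs_data = any(m.needs_data for m in modifiers)
        inferred = "sequential" if needs_data else "datafree"
        # An explicit user choice overrides the inferred pipeline.
        name = user or inferred
        if name not in cls._registry:
            raise KeyError(f"unknown pipeline: {name!r}")
        return cls._registry[name]()

    @abstractmethod
    def __call__(self, model, dataloader=None):
        ...


@CalibrationPipeline.register("datafree")
class DataFreePipeline(CalibrationPipeline):
    def __call__(self, model, dataloader=None):
        # No calibration data is consumed in this sketch.
        return f"data-free pass over {model}"


@CalibrationPipeline.register("sequential")
class SequentialPipeline(CalibrationPipeline):
    def __call__(self, model, dataloader=None):
        if dataloader is None:
            raise ValueError("sequential pipeline needs calibration data")
        return f"sequential pass over {model} with {len(dataloader)} batches"


# Usage: infer a pipeline, then call it (with data unless it is data-free).
pipeline = CalibrationPipeline.from_modifiers([Modifier("gptq", needs_data=True)])
result = pipeline("toy-model", dataloader=[[1, 2], [3, 4]])
```

In this sketch the returned object is called directly, mirroring the note that the pipeline instance is "to be called with data (if not datafree)". Passing `user="sequential"` forces that pipeline even when the inferred choice would be data-free.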
DataFreePipeline
Bases: CalibrationPipeline
IndependentPipeline
Bases: CalibrationPipeline
LayerSequentialPipeline
Bases: CalibrationPipeline
SequentialPipeline
Bases: CalibrationPipeline