Skip to content

llmcompressor.modeling.deepseek_v3

Classes:

DeepseekV3MoECalibrate

DeepseekV3MoECalibrate(
    config: DeepseekV3Config, original: DeepseekV3MoE
)

Bases: Module

Patched DeepseekV3MoE which sends all tokens to all experts for calibration

Source code in llmcompressor/modeling/deepseek_v3.py
def __init__(self, config: DeepseekV3Config, original: OriginalDeepseekV3MoE):
    super().__init__()
    self.config = config
    self.experts = original.experts
    self.gate = original.gate
    self.shared_experts = original.shared_experts