llmcompressor.transformers.finetune.data.c4
Classes:
-
C4Dataset–Child text generation class for the C4 dataset
C4Dataset
Bases: TextGenerationDataset
Child text generation class for the C4 dataset
Parameters:
-
(dataset_argsDatasetArguments) –configuration settings for dataset loading
-
(splitstr) –split from dataset to load, for instance
testortrain[:5%] -
(processorProcessor) –processor or tokenizer to use on dataset