Class CategorizationTrainingDataset
- Namespace
- LMKit.TextAnalysis.Training
- Assembly
- LM-Kit.NET.dll
Training dataset builder specialized for the Categorization
engine.
Turns labeled categorization examples into ChatTrainingSample items
suitable for supervised fine-tuning.
public sealed class CategorizationTrainingDataset : TrainingDataset
- Inheritance
-
CategorizationTrainingDataset
- Inherited Members
Remarks
The dataset uses the current Categorization
configuration (model, prompts,
preferred modality, and normalization settings) to synthesize ShareGPT-style
conversations where the assistant returns the selected category.
Constructors
- CategorizationTrainingDataset(Categorization)
Initializes a categorization-focused training dataset bound to a specific
Categorization
engine instance.
Properties
- EnableModalityAugmentation
Gets or sets whether to add modality-augmented samples when the created training sample uses Multimodal.
Methods
- AddSample(Attachment, IList<string>, IList<string>, int)
Adds a training sample from an Attachment using the engine’s preferred modality.
- AddSample(InferenceModality, Attachment, IList<string>, IList<string>, int)
Adds a training sample with an explicit InferenceModality.
- AddSample(string, IList<string>, IList<string>, int)
Adds a training sample from raw text using the engine’s preferred modality.