Table of Contents

Class CategorizationTrainingDataset

Namespace
LMKit.TextAnalysis.Training
Assembly
LM-Kit.NET.dll

Training dataset builder specialized for the Categorization engine. Turns labeled categorization examples into ChatTrainingSample items suitable for supervised fine-tuning.

public sealed class CategorizationTrainingDataset : TrainingDataset
Inheritance
CategorizationTrainingDataset
Inherited Members

Remarks

The dataset uses the current Categorization configuration (model, prompts, preferred modality, and normalization settings) to synthesize ShareGPT-style conversations where the assistant returns the selected category.

Constructors

CategorizationTrainingDataset(Categorization)

Initializes a categorization-focused training dataset bound to a specific Categorization engine instance.

Properties

EnableModalityAugmentation

Gets or sets whether to add modality-augmented samples when the created training sample uses Multimodal.

Methods

AddSample(Attachment, IList<string>, IList<string>, int)

Adds a training sample from an Attachment using the engine’s preferred modality.

AddSample(InferenceModality, Attachment, IList<string>, IList<string>, int)

Adds a training sample with an explicit InferenceModality.

AddSample(string, IList<string>, IList<string>, int)

Adds a training sample from raw text using the engine’s preferred modality.

See Also