Table of Contents

Class LanguageDetectionTrainingDataset

Namespace
LMKit.Translation.Training
Assembly
LM-Kit.NET.dll

Training dataset builder specialized for language detection with the TextTranslation engine. Converts labeled language examples into ChatTrainingSample items suitable for supervised fine-tuning.

public sealed class LanguageDetectionTrainingDataset : TrainingDataset
Inheritance
LanguageDetectionTrainingDataset
Inherited Members

Remarks

The dataset uses the current TextTranslation configuration (model, prompts, and preferred modality) to synthesize ShareGPT-style conversations where the assistant returns the detected Language.

Constructors

LanguageDetectionTrainingDataset(TextTranslation)

Initializes a language-detection-focused training dataset bound to a specific TextTranslation engine instance.

Properties

EnableModalityAugmentation

Gets or sets whether to add modality-augmented samples when appropriate.

Methods

AddSample(Attachment, Language, InferenceModality)

Adds a training sample with an explicit InferenceModality.

AddSample(string, Language)

Adds a training sample from raw text, using BestModality.

See Also