Class TrainingDataset

Namespace: LMKit.Finetuning
Assembly: LM-Kit.NET.dll

Represents a supervised fine-tuning dataset composed of ChatTrainingSample instances. Supports adding samples to the collection and exporting the dataset to the ShareGPT JSON format.

public class TrainingDataset
Inheritance
TrainingDataset

Remarks

Use AddSample(ChatTrainingSample) to populate the dataset with conversational training examples. Once built, the dataset can be exported using ExportAsSharegpt(string, bool, string, CancellationToken).

Constructors

TrainingDataset()

Initializes a new, empty dataset.

Properties

Samples

Gets the collection of training samples that make up this dataset.

Methods

AddSample(ChatTrainingSample)

Adds a ChatTrainingSample to the dataset.

ExportAsSharegpt(string, bool, string, CancellationToken)

Exports the dataset as a ShareGPT-compatible JSON file.
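
For orientation, the ShareGPT layout as it is commonly used in the fine-tuning community is a JSON array of records, each holding a "conversations" list of turns with "from" and "value" fields. The sketch below produces that layout with System.Text.Json. It does not use LM-Kit: Turn, Conversation, and ShareGptSketch are hypothetical stand-in types, and the role names "human" and "gpt" follow the usual convention rather than anything confirmed on this page, so the exact output of ExportAsSharegpt may differ.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;
using System.Text.Json.Serialization;

// Hypothetical stand-in for one conversational turn; not an LM-Kit type.
public sealed class Turn
{
    [JsonPropertyName("from")]
    public string From { get; set; } = "";

    [JsonPropertyName("value")]
    public string Value { get; set; } = "";
}

// Hypothetical stand-in for one training sample; not an LM-Kit type.
public sealed class Conversation
{
    [JsonPropertyName("conversations")]
    public List<Turn> Conversations { get; set; } = new();
}

public static class ShareGptSketch
{
    // Serializes the records as a single JSON array, the shape most
    // ShareGPT-style loaders expect.
    public static string Serialize(IEnumerable<Conversation> records, bool indented = true)
    {
        var options = new JsonSerializerOptions { WriteIndented = indented };
        return JsonSerializer.Serialize(records, options);
    }

    public static void Main()
    {
        var records = new List<Conversation>
        {
            new Conversation
            {
                Conversations = new List<Turn>
                {
                    // Role names assumed from the common ShareGPT convention.
                    new Turn { From = "human", Value = "What is the capital of France?" },
                    new Turn { From = "gpt", Value = "The capital of France is Paris." },
                },
            },
        };

        Console.WriteLine(Serialize(records));
    }
}
```

Comparing a file written by ExportAsSharegpt against this shape is a quick way to check that a downstream training tool will accept it.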