Method AddSample

Adds a training sample from raw text content using the engine’s preferred modality.

public void AddSample(string content, string jsonGroundTruth)

content string: The textual content to extract from.
jsonGroundTruth string: The labeled ground-truth JSON matching the extraction schema (elements) configured on the underlying TextExtraction instance.

dataset.AddSample(
    "Invoice ACME-2024-09: amount 120.00 USD.",
    "{\"invoice_id\":\"ACME-2024-09\",\"amount\":120.00,\"currency\":\"USD\"}");
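Hand-escaping the ground-truth JSON inside a C# string literal is easy to get wrong. A minimal sketch of one alternative, assuming the standard System.Text.Json serializer is acceptable for producing the label string; the GroundTruthJson helper and its Build method are hypothetical names introduced only for this illustration and are not part of the library:

```csharp
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical helper (not part of the library): builds the ground-truth
// JSON string from key/value pairs instead of a hand-escaped literal.
public static class GroundTruthJson
{
    // Serializes the supplied fields into a compact JSON object string.
    // Keys should match the element names configured on the extraction schema.
    public static string Build(IDictionary<string, object> fields)
    {
        return JsonSerializer.Serialize(fields);
    }
}
```

The returned string can then be passed as the jsonGroundTruth argument, for example a dictionary holding invoice_id, amount, and currency for the invoice shown above.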

Adds a training sample from an Attachment using the engine’s preferred modality.

public void AddSample(Attachment content, string groundTruth)

content Attachment: The input attachment (e.g., text, image, or multimodal source) to extract from.
groundTruth string: The labeled ground-truth JSON matching the configured extraction elements.

var attachment = new Attachment("invoice_scan.pdf");
dataset.AddSample(attachment, "{\"InvoiceId\":\"INV-001\",\"Total\":250.00}");

Adds a training sample with an explicit InferenceModality.

public void AddSample(InferenceModality modality, Attachment content, string jsonGroundTruth)

modality InferenceModality: The inference modality to use for generating prompts and responses.
content Attachment: The content attachment to extract from.
jsonGroundTruth string: The labeled ground-truth JSON. It must be compatible with the configured Elements.

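string invoiceText = "Invoice INV-001: total 250.00 USD."; // illustrative input text
string jsonLabel = "{\"InvoiceId\":\"INV-001\",\"Total\":250.00}"; // illustrative label
var attachment = Attachment.CreateFromText(invoiceText, "text");
dataset.AddSample(InferenceModality.Text, attachment, jsonLabel);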

This method:

1. Validates that extraction elements are defined on the bound TextExtraction.
2. Builds an LMKit.Extraction.ExtractionInput and runs engine.ExtractElements with disableInference=true to materialize the final system and user prompts without invoking the model.
3. Converts jsonGroundTruth into the canonical JSON completion using AsJson(bool, bool) with formatted element names and empty value normalization.
4. Assembles a ChatHistory: includes the last system prompt (if any), the single chat prompt from the engine, and the assistant response composed as engine.ResponsePrefix + completion + engine.ResponseSuffix.
5. Creates and appends a ChatTrainingSample tagged with engine.LastInferenceModality.
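The assembly step described above can be pictured with a small stand-alone sketch. It is not the library's implementation: Message and SampleAssemblySketch are hypothetical stand-ins for the real chat types, and the role names are assumptions. It only illustrates the ordering (optional system prompt, one user prompt, then prefix + completion + suffix as the assistant turn) and the single-prompt restriction.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for a chat message; NOT an LM-Kit type.
public record Message(string Role, string Text);

public static class SampleAssemblySketch
{
    // Mirrors the documented ordering: system prompt (if any), the single
    // user prompt, then the assistant response wrapped in prefix and suffix.
    public static List<Message> Assemble(
        string systemPrompt,
        IReadOnlyList<string> chatPrompts,
        string completion,
        string responsePrefix,
        string responseSuffix)
    {
        // Assumption: anything other than exactly one prompt is rejected.
        if (chatPrompts.Count != 1)
            throw new NotImplementedException("Exactly one chat prompt is supported.");

        var history = new List<Message>();
        if (!string.IsNullOrEmpty(systemPrompt))
            history.Add(new Message("system", systemPrompt));

        history.Add(new Message("user", chatPrompts[0]));
        history.Add(new Message("assistant", responsePrefix + completion + responseSuffix));
        return history;
    }
}
```

Keeping the prefix and suffix in the assistant turn means the training target has the same framing the engine expects at inference time.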

If EnableModalityAugmentation is true and the last inference modality is Multimodal, two additional samples are added automatically: one for Text and one for Vision.
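The augmentation rule above can be sketched as a small decision function. Modality and AugmentationSketch are hypothetical names defined here for illustration only; they stand in for the library's InferenceModality values and do not reflect its actual code.

```csharp
using System.Collections.Generic;

// Hypothetical local enum standing in for the library's modality values.
public enum Modality { Text, Vision, Multimodal }

public static class AugmentationSketch
{
    // Returns the modalities for which a sample would be emitted:
    // always the last inference modality, plus Text and Vision when
    // augmentation is enabled and the last modality was Multimodal.
    public static List<Modality> ModalitiesToEmit(Modality last, bool enableAugmentation)
    {
        var result = new List<Modality> { last };
        if (enableAugmentation && last == Modality.Multimodal)
        {
            result.Add(Modality.Text);
            result.Add(Modality.Vision);
        }
        return result;
    }
}
```

Under this reading, a multimodal input yields three samples in total when augmentation is on, and any other case yields one.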

NotImplementedException: Thrown when the underlying engine does not provide exactly one chat prompt (engine.ChatPrompts.Count != 1); other prompt counts are not supported by this builder.