Method RunAsync

Namespace: LMKit.Extraction.Ocr

Assembly: LM-Kit.NET.dll

RunAsync(OcrParameters, CancellationToken)

Runs OCR on the specified page using the configured multimodal language model.

public override Task<OcrResult> RunAsync(OcrParameters ocrParameters, CancellationToken cancellationToken = default)

Parameters

ocrParameters OcrParameters: The OcrParameters containing the image data and attachment metadata for the page to be transcribed.
cancellationToken CancellationToken: A CancellationToken that can be used to cancel the operation.

Returns

Task<OcrResult>: A Task<TResult> that produces an OcrResult containing the page-level transcription returned by the model.

Examples

var ocr = new VlmOcr(model);
var parameters = new OcrParameters(imageBuffer);
OcrResult result = await ocr.RunAsync(parameters);
Console.WriteLine(result.PageText);

Remarks

This method implements the OcrEngine contract and is intended for consumers that only need the final OcrResult in a document-oriented workflow. For advanced scenarios that require access to model-level details, use Run(Attachment, int, CancellationToken) or RunAsync(Attachment, int, CancellationToken).

Exceptions

ArgumentNullException: Thrown if ocrParameters is null.
OperationCanceledException: Thrown if the operation is canceled via cancellationToken.

RunAsync(Attachment, int, CancellationToken)

Runs OCR on the specified attachment and returns a detailed result.

public Task<VlmOcr.VlmOcrResult> RunAsync(Attachment attachment, int pageIndex = 0, CancellationToken cancellationToken = default)

Parameters

attachment Attachment: The Attachment representing the image to be transcribed. The attachment may carry additional metadata (for example, page index or layout information) that can be used elsewhere in the extraction pipeline.
pageIndex int
cancellationToken CancellationToken: A CancellationToken that can be used to cancel the operation.

Returns

Task<VlmOcr.VlmOcrResult>: A Task<TResult> that produces a VlmOcr.VlmOcrResult containing both the raw text-generation result and the constructed PageElement.

Examples

var ocr = new VlmOcr(model);
var attachment = new Attachment("invoice.pdf");
VlmOcr.VlmOcrResult result = await ocr.RunAsync(attachment, pageIndex: 2);
Console.WriteLine(result.PageElement.Text);

Exceptions

ArgumentNullException: Thrown if attachment is null.
OperationCanceledException: Thrown if the operation is canceled via cancellationToken.

RunAsync(ImageBuffer, CancellationToken)

Runs OCR on the specified image and returns a detailed result.

public Task<VlmOcr.VlmOcrResult> RunAsync(ImageBuffer image, CancellationToken cancellationToken = default)

Parameters

image ImageBuffer: The ImageBuffer containing the image to be transcribed.
cancellationToken CancellationToken: A CancellationToken that can be used to cancel the operation.

Returns

Task<VlmOcr.VlmOcrResult>: A Task<TResult> that produces a VlmOcr.VlmOcrResult containing both the raw text-generation result and the constructed PageElement.

Examples

var model = LM.LoadFromModelID("lmkit-tasks:4b-preview");
var ocr = new VlmOcr(model);

using var image = ImageBuffer.LoadAsRGB("invoice.png");
var result = await ocr.RunAsync(image, cancellationToken);

string text = result.PageElement.GetText();

Exceptions

ArgumentNullException: Thrown if image is null.
OperationCanceledException: Thrown if the operation is canceled via cancellationToken.

Table of Contents

Method RunAsync

RunAsync(OcrParameters, CancellationToken)

Parameters

Returns

Examples

Remarks

Exceptions

RunAsync(Attachment, int, CancellationToken)

Parameters

Returns

Examples

Exceptions

RunAsync(ImageBuffer, CancellationToken)

Parameters

Returns

Examples

Exceptions