Table of Contents

Namespace LMKit.Extraction.Ocr

Classes

LMKitOcr

Provides high-throughput OCR functionality optimized for business documents, with advanced page layout analysis, automatic language and orientation detection, and automatic model download support. Implements IDisposable to release native OCR resources.

OcrCompletedEventArgs

EventArgs for when OCR has completed (either successfully or faulted). If an exception occurred, Result will be null and Exception will be set.

OcrEngine

Represents an Optical Character Recognition (OCR) engine capable of processing image data and extracting text. Concrete implementations must override RunAsync(OcrParameters, CancellationToken) to perform actual OCR. Exposes events that fire just before and just after OCR executes, and allows cancellation.

OcrParameters

Encapsulates all parameters required to perform OCR on an image, including the raw image data, a display name, and an optional Attachment for contextual metadata.

OcrProgressEventArgs

Provides data for page-level progress events during OCR-based document conversion operations such as LMKit.Document.Pdf.PdfSearchableMaker and LMKit.Document.Conversion.ImageToSearchablePdf.

OcrResult

Represents the result of an OCR operation, encapsulating the recognized text and its optional positional layout within the source image.

OcrStartingEventArgs

EventArgs for when OCR is about to start. Carries the OCR parameters and allows cancellation.

VlmOcr

Provides an OCR engine implementation backed by a multimodal LM model.

VlmOcr.VlmOcrResult

Represents the detailed result of a vision-language OCR operation.

Enums

VlmOcrIntent

Specifies the desired outcome of a VlmOcr operation.

Share