Table of Contents

Property OcrEngine

Namespace
LMKit.Retrieval
Assembly
LM-Kit.NET.dll

OcrEngine

Gets or sets an optional OCR engine for extracting text from image-based pages.

public OcrEngine OcrEngine { get; set; }

Property Value

OcrEngine

An OcrEngine instance, or null (default).

Remarks

When set, enables optical character recognition on pages that contain scanned content or embedded images with text that cannot be extracted directly from the PDF structure.

Used when PageProcessingMode is set to TextExtraction, or when set to Auto and the page requires text recognition but vision-based analysis is not available or not selected.

If neither OcrEngine nor DocumentVisionParser is set, image-based pages yield no extractable text.

See Also