Table of Contents

Class LayoutSearchOptions

Namespace
LMKit.Document.Search
Assembly
LM-Kit.NET.dll

Global options for LayoutSearchEngine controlling normalization and coordinate handling.

public sealed class LayoutSearchOptions
Inheritance
LayoutSearchOptions
Inherited Members

Properties

IgnoreCharactersRegex

Optional compiled regular expression applied per-character to remove additional characters after diacritics/whitespace and the boolean filters. The pattern should be character-class-like (e.g., [\p{P}\p{S}] or [^\p{L}\p{Nd}]) for predictable results. Note: Per-character regex checks are slower than the boolean flags; use only when needed.

IgnoreDiacritics

When true, removes diacritics (accents) prior to searching, improving robustness to encoding/OCR variance.

IgnorePunctuation

When true, removes all Unicode punctuation characters prior to searching (e.g., (), - . , ; : ! ? quotes, etc.). Covers Dash/Open/Close/Other/Initial/Final/Connector punctuation categories.

IgnoreSymbols

When true, removes all Unicode symbol characters prior to searching (Math/Currency/Modifier/Other symbols; useful to drop '+', currency marks, etc.).

KeepOnlyLettersAndDigits

When true, keeps only letters and digits (everything else is stripped). This implies that whitespace, punctuation, and symbols are removed.

NormalizeWhitespace

When true, collapses any sequence of whitespace into a single space prior to searching.