Method FindFuzzy
FindFuzzy(PageElement, string, FuzzySearchOptions)
Performs token-aware fuzzy search using Damerau–Levenshtein distance over sliding windows of the page text. Useful when the source contains OCR noise or minor typos. Normalization (whitespace/diacritics/optional char-stripping) is applied to both the page text and the query.
public List<TextMatch> FindFuzzy(PageElement page, string query, FuzzySearchOptions fuzzyOptions = null)
Parameters
page
PageElementThe page to search. Must not be
null
.query
stringThe text to locate approximately. Must not be
null
.fuzzyOptions
FuzzySearchOptionsFuzzy matching parameters (edit distance, min score, max results, context). May be
null
for defaults.
Returns
Exceptions
- ArgumentNullException
Thrown when
page
orquery
isnull
.
FindFuzzy(IEnumerable<PageElement>, string, FuzzySearchOptions)
Performs fuzzy search across multiple pages
.
public List<TextMatch> FindFuzzy(IEnumerable<PageElement> pages, string query, FuzzySearchOptions fuzzyOptions = null)
Parameters
pages
IEnumerable<PageElement>Pages to search. Must not be
null
.query
stringApproximate text to locate. Must not be
null
.fuzzyOptions
FuzzySearchOptionsFuzzy matching parameters. May be
null
for defaults.
Returns
Exceptions
- ArgumentNullException
Thrown when
pages
orquery
isnull
.