LM-Kit.NET Change History
2026
Version 2026.3.3 (09 March 2026)
New Features
- Added
Embedder.GetQueryEmbeddingsandEmbedder.GetQueryEmbeddingsAsync(LMKit.Embeddings): compute embeddings with model-specific query instruction prefixes applied. Modern embedding models (Qwen3 Embedding, Nomic Embed, BGE v1.5) are trained with asymmetric prefixes for queries versus passages; these methods apply the correct prefix automatically based on the loaded model's architecture. - Automatic query instruction prefix in RAG retrieval:
RagEngine.FindMatchingPartitionsandFindMatchingPartitionsAsyncnow apply the model's query instruction prefix when embedding search queries, improving retrieval recall without any code changes.
Version 2026.3.2 (08 March 2026)
New Features
- Added model cards for
qwen3-coder:30b-a3bandlightonocr-2-bbox:1b - Added
SearchHighlightEngine(LMKit.Document.Search): searches text in paginated documents (PDF or image) and produces a highlighted copy. Supports exact, regex, and fuzzy (Damerau-Levenshtein) matching. Accepts optional pre-computedPageElementinstances for raster PDFs or images. - Added
SearchHighlightOptions,SearchHighlightResult,HighlightAppearance, andSearchModeenum (LMKit.Document.Search) - Added
pdf_search_highlightbuilt-in tool: searches text in a PDF and saves a highlighted copy with matches visually marked - Added
LM.TensorOverride(LMKit.Model): enables fine-grained control over tensor device placement via regex pattern matching, particularly useful for offloading MoE (Mixture of Experts) expert weights to CPU while keeping attention layers on GPU. Includes factory methodsTensorOverride.Cpu(pattern)andTensorOverride.Gpu(pattern, gpuIndex). - Added
LM.DeviceConfiguration.TensorOverridesproperty to configure per-tensor device placement at model load time - Automatic memory extraction in
MultiTurnConversation(LMKit.TextGeneration): whenAgentMemory.ExtractionModeis set toLlmBased, conversations now automatically extract and store facts after each turn. Previously, automatic extraction only worked through theAgentpipeline. This enables memory extraction forRagChat,PdfChat, and directMultiTurnConversationusage.
Improvements
- Inference speed improvements
Version 2026.3.1 (03 March 2026)
New Features
- Added model cards for
qwen3.5:0.8b,qwen3.5:2b,qwen3.5:4bandqwen3.5:9b
Version 2026.2.11 (28 February 2026)
New Features
- Added model card for
glm-4.6v-flash - Added model card for
glm-ocr: Z.ai GLM-OCR 0.9B vision-language model specialized in document parsing, OCR, and structured information extraction
Improvements
- Improved GLM model support: added ChatGLM3 and ChatGLM4 template formats with proper token handling and tool calling support
Version 2026.2.10 (25 February 2026)
New Features
- Added model cards for Qwen 3.5 family:
qwen3.5:27b(27B dense hybrid GDN model) andqwen3.5:35b-a3b(35B MoE, 3B active). Both support chat, vision, tool calling, code completion, math, and OCR with a native 262K context window. Theqwen3-vl:30bmodel card is now deprecated in favor ofqwen3.5:35b-a3b. - Introduced
RagChat(LMKit.Retrieval): a turnkey multi-turn conversational RAG class that wraps a user-managedRagEnginewith an internalMultiTurnConversation. ImplementsIMultiTurnConversationand orchestrates query contextualization, retrieval dispatch, prompt construction, and grounded response generation in a single call. Supports all fourQueryGenerationModestrategies, tools, skills, and agent memory. - Introduced
RagQueryResult(LMKit.Retrieval): return type forRagChat.Submit/SubmitAsync, exposing the generatedTextGenerationResultalongside theIReadOnlyList<PartitionSimilarity>used as context. - Introduced
RetrievalCompletedEventArgs(LMKit.Retrieval.Events): event arguments for theRagChat.RetrievalCompletedevent, providing the retrieval query, matched partitions, requested count, and elapsed time. - Introduced
RetrievalDefaults(LMKit.Retrieval): a static class centralizing default constants for the retrieval subsystem (e.g.MinRelevanceScore), used consistently byRagChat,PdfChat,RagEngine, andVectorSearch. - Added advanced query generation strategies for RAG (
QueryGenerationModeonPdfChatandRagChat):Contextual: follow-up questions are automatically reformulated into self-contained queries before retrieval, configurable viaQueryContextualizationOptionsMultiQuery: generates multiple query variants and merges results using Reciprocal Rank Fusion for improved recall, configurable viaMultiQueryOptionsHypotheticalAnswer: generates a hypothetical answer and uses it as the retrieval query, bridging the gap between question and document phrasing, configurable viaHydeOptions
- Added Maximal Marginal Relevance (MMR) diversity filtering (
MmrLambdaonRagEngineandPdfChat): reduces near-duplicate passages in retrieval results by balancing relevance against diversity - Added context window expansion for retrieval results (
ContextWindowonRagEngineandPdfChat): automatically includes neighboring partitions around each match, providing the LLM with surrounding context for more accurate answers - Added hybrid search with BM25 + vector fusion (
IRetrievalStrategyonRagEngineandPdfChat):VectorRetrievalStrategy: semantic similarity via cosine distance on embeddings (default, unchanged behavior)Bm25RetrievalStrategy: BM25+ lexical ranking with configurable term saturation (K1), length normalization (B), long-document floor (Delta), proximity boosting (ProximityWeight), language-aware stopword filtering (Language), and custom stopword support (CustomStopWords)HybridRetrievalStrategy: combines vector and BM25 strategies with weighted Reciprocal Rank Fusion, configurable viaVectorWeight,KeywordWeight, andRrfK
Version 2026.2.9 (22 February 2026)
Bug Fixes
- Fixed JSON serialization and deserialization failures
- Fixed a critical initialization problem
New Features
- Added
MarkdownToPdfconverter (LMKit.Document.Conversion) with full formatting support (headings, bold/italic, code blocks, lists, blockquotes, tables, links, horizontal rules) - Added
EmlToPdfconverter (LMKit.Document.Conversion) with embedded attachment support - Added
markdown_to_pdfandeml_to_pdfbuilt-in document tools - Added support for PaddleOCR VL models
- Added model card for
paddleocr-vl:0.9b - Added
VlmOcrIntentenum andVlmOcr(LM, VlmOcrIntent)constructor for explicit OCR intent selection (plain text, table, formula, chart, coordinates, seal, Markdown) - Added
VlmOcr.GetSupportedIntents(LM)to query the intents a model is known to support with dedicated behavior - Introduced the
Canvasdrawing API (LMKit.Graphics.Drawing): a fluent, in-place 2D drawing surface backed byImageBufferwith coverage-based antialiasing,Pen/Brushstyling, and support for lines, rectangles, quadrilaterals, ellipses, polygons, and rounded rectangles
Improvements
- VLM OCR now provides text location data when the underlying model supports coordinate output (e.g. PaddleOCR-VL with
VlmOcrIntent.OcrWithCoordinates). Each recognized text region is returned as aTextElementwith a bounding box mapped back to the original source image coordinates. - Improved VLM OCR post-processing to adapt output cleanup to the selected intent and model family
- Added form field rendering support for PDF rasterization, improving extraction accuracy on fillable PDFs
Version 2026.2.6 (19 February 2026)
Filters/middleware pipeline, prompt templates, advanced agent memory, EML/MBOX support.
New Features
- Added Filters / Middleware Pipeline (
LMKit.TextGeneration.Filters) 🔥FilterPipelinewithIPromptFilter,ICompletionFilter, andIToolInvocationFilterfollowing the ASP.NET Core middleware (onion) pattern- Lambda-friendly API and integration with both
MultiTurnConversationandAgent
- Added Prompt Templates with Logic (
LMKit.TextGeneration.Prompts)PromptTemplateclass with Mustache syntax, conditionals, loops, filter chaining, custom helpers, and strict mode
- Added memory capacity limits, eviction policies, and time-decay scoring to
AgentMemory - Added
UserScopedMemoryfor multi-user memory isolation - Added built-in automatic memory extraction to
AgentMemory - Added memory consolidation via
ConsolidateAsynconAgentMemory - Added conversation summarization via
SummarizeConversationAsynconAgentMemory - Added
MemoryEstimationstatic class inLMKit.Hardwarefor accurate VRAM/context fitting - Added EML (email) and MBOX (Unix mailbox) document format support
- Added embedded attachment extraction for PDF, EML, and MBOX documents
Version 2026.2.5 (16 February 2026)
Tool metadata, permission policies, extraction confidence scores, entity validation, HTML chunking.
New Features
- Added Markdown conversion public APIs (
LMKit.Document.Conversion): Markdown to/from HTML and DOCX - Added per-element confidence scores and human verification flag to extraction results
- Added extraction progress events (
ExtractionPhaseenum) - Added entity auto-detection and validation for extraction results (102 entity kinds with format validators)
- Added HTML-aware chunking for RAG workflows (
HtmlChunkingclass) - Added async PDF splitting and extraction APIs
- Added pattern-constrained extraction formatting
- Added standardized tool metadata via
IToolMetadatainterface - Added
ToolPermissionPolicyfor centralized tool access control with approval workflow support - Added
pdf_searchbuilt-in document tool and reusablePdfSearchAPI - Added
filesystem_searchandhttp_downloadbuilt-in tools
Improvements
- Improved tool calling reliability across all models
- Fixed and improved GLM (glm4.7-flash) tool calling support
Breaking Changes
- Split multi-operation built-in tools into atomic single-operation tools (1 tool = 1 feature)
Version 2026.2.4 (12 February 2026)
PDF split/merge, document splitting, database and spreadsheet tools, Document tool category.
Improvements
- Improved
SupervisorOrchestratorsingle-pass execution and real-time streaming of worker output - Improved
DelegateToolto return plain text for verbatim relay
New Features
- Introduced the
PdfSplitterclass - Introduced the
DocumentSplittingclass for detecting logical document boundaries - Introduced the
PdfMergerclass - Added
DatabaseTool,SpreadsheetTool,RssFeedTool,ClipboardTool, andFtpToolto built-in tools - Added JPEG output support to
PdfToImageandImageBuffer - Introduced the
ImageToPdfclass - Introduced the
PdfUnlockerclass - Added
Documentcategory to built-in tools (PDF operations, image preprocessing, content extraction, OCR)
Version 2026.2.3 (09 February 2026)
Improvements
- Improved skill activations via tools
New Features
- Added model cards for
minicpm-o-45,lightonocr-2:1bandglm4.7-flash - Added
StripStyleAttributesproperty toVlmOcr
Version 2026.2.2 (06 February 2026)
Microsoft.Extensions.AI integration, orchestrator streaming, complete MCP protocol support.
Improvements
- Enhanced telemetry with comprehensive OpenTelemetry GenAI semantic conventions support
New Features
- Introduced the
LM-Kit.NET.Integrations.ExtensionsAINuGet package for Microsoft.Extensions.AI integration 🔥- Implements
IChatClientandIEmbeddingGeneratorfor seamless integration with the Microsoft.Extensions.AI ecosystem
- Implements
- Added real-time streaming support for orchestrators
- Added the
ConversationIdproperty to theChatHistoryclass - Extended MCP client with complete protocol support (sampling, roots, elicitation, progress tracking, cancellation, logging, completions, resource templates, resource subscriptions)
Version 2026.2.1 (02 February 2026)
Improvements
- Improved ReAct pattern of agents with tools
New Features
- Added stdio transport support to the MCP client for local MCP servers (Node.js, Python, native executables)
Bug Fixes
- Boolean type was not correctly handled in TextExtraction
Version 2026.1.5 (31 January 2026)
Comprehensive agent framework: orchestrators, built-in tools, planning, delegation, skills.
Improvements
- Improved SpeechToText speed and accuracy
New Features
- Extended the
LMKit.Agentsnamespace with comprehensive agent framework 🔥 (Agent,AgentBuilder,AgentExecutor,AgentRegistry) - Introduced the
LMKit.Agents.Orchestrationnamespace 🔥 (PipelineOrchestrator,ParallelOrchestrator,RouterOrchestrator,SupervisorOrchestrator) - Introduced the
LMKit.Agents.Tools.BuiltInnamespace with built-in tools 🔥 (eight categories: Data, Text, Numeric, Security, Utility, IO, Net, Document) - Introduced the
LMKit.Agents.Planningnamespace (ReActHandler,ChainOfThoughtHandler,TreeOfThoughtHandler,PlanAndExecuteHandler,ReflectionHandler) - Introduced the
LMKit.Agents.Delegationnamespace - Introduced the
LMKit.Agents.Streamingnamespace - Introduced the
LMKit.Agents.Resiliencenamespace - Introduced the
LMKit.Agents.Observabilitynamespace - Introduced the
LMKit.Agents.Templatesnamespace (18 pre-built agent templates) - Introduced the
LMKit.Agents.Skillsnamespace with Agent Skills Protocol support 🔥 - Added skills support to the
MultiTurnConversationclass
Bug Fixes
- Resolved compilation issue with MAUI for macOS (this time for real)
Version 2026.1.4 (27 January 2026)
Improvements
- Improved SpeechToText accuracy
New Features
- Added model cards for
whisper-large2anddevstral-small2 - Introduced the
LMKit.Speech.Dictationnamespace - Added the
SuppressHallucinationsandSuppressNonSpeechTokensproperties to theSpeechToTextclass
Bug Fixes
- Resolved compilation issue with MAUI for macOS
Version 2026.1.3 (17 January 2026)
Improvements
- Improved confidence metrics computation
- Improved the target language support of the summarizer engine
New Features
- Introduced the
NamedEntityRecognitionTrainingDatasetclass
Bug Fixes
- Text attachments were not correctly processed in some tasks
Version 2026.1.2 (11 January 2026)
Improvements
- General performance enhancements
- Improved DataSource create and update performance on cloud vector databases (for example Qdrant)
- Improved error reporting when loading partially corrupted DataSource files
- Improved the accuracy of the keyword extraction engine
New Features
- Added the
DeleteDocumentandDeleteDocumentAsyncmethods to theDocumentRagclass - Added
AddDataSourceandAddDataSourceAsyncoverloads to theRagEngineclass that load an existing DataSource from a vector store - Added
LoadWarningsandHasLoadWarningsproperties to theDataSourceclass for inspecting sections that could not be read during partial loading - Introduced the
LoadWarningclass - Added model cards for
nemotron3-nanoandfalcon-h1r:7b - Added support for the LMKIT_MODELS_DIR environment variable to configure the default model storage directory
Version 2026.1.1 (04 January 2026)
DocumentRag, PdfChat, chunking, telemetry, PDF format support, Attachment async APIs.
Improvements
- General performance enhancements
- Improved speed and accuracy of the
VlmOcrengine - Improved speed of the
SpeechToTextengine - Improved observability with OpenTelemetry GenAI instrumentation
New Features
- Added the
ModelIDproperty to theLMclass - Added
PromptTokenCountandPromptProcessingRateproperties to theTextGenerationResultclass - Added the
GeneratedTokenCountproperty to theTextGenerationResultclass - Introduced the
AttachmentReferenceclass - Added
GetTextandGetTextAsyncoverloads to theAttachmentclass that take a page range as a parameter - Added the
StructuredandAutomembers to theTextOutputModeenumeration - Added
SetTextoverloads to theAttachmentclass that accept a page index - Added URI constructor and
CreateFromUriAsyncfactory method to theAttachmentclass for loading attachments from remote URIs - Added
CreateFromFileAsyncandCreateFromStreamAsyncfactory methods to theAttachmentclass for asynchronous loading - Added
HasTextOnPageandHasTextOnPageAsyncmethods to theAttachmentclass - Added support for the PDF format
- Introduced the
IMultiTurnConversationenumeration - Introduced the
FileSystemVectorStoreclass - Introduced the
IChunkinginterface - Introduced the
MarkdownChunkingclass - Added
QueryPartitionsandQueryPartitionsAsyncoverloads toRagEnginethat accept a custom prompt template - Introduced the
DocumentRagclass - Introduced the
PdfChatclass - Introduced the
DocumentReferenceclass - Introduced the
DocumentIndexingResultclass - Introduced the
DocumentImportProgressEventArgsclass - Introduced the
CacheAccessedEventArgsclass - Introduced the
PassageRetrievalCompletedEventArgsclass - Introduced the
ResponseGenerationStartedEventArgsclass - Introduced the
DocumentImportPhaseenumeration - Introduced the
DocumentQueryResultclass - Introduced the
PageProcessingModeclass - Added the
MaximumCompletionTokensandStripImageMarkupproperties to theVlmOcrclass - Added
SetContentoverloads to theTextExtractionclass that accept a page index or a page range - Introduced the
LMKitTelemetryclass
Breaking changes
- Removed the
Promptclass, useChatHistory.Messageinstead
2025
Version 2025.12.4 (13 December 2025)
Improvements
- General performance enhancements
- Improved task accuracy with Ministral models
- Improved accuracy of the
VlmOcrengine
Bug Fixes
- Fixed unsupported Unicode characters in TextExtractionElement.Name
Version 2025.12.2 (08 December 2025)
Improvements
- General performance enhancements
- Voice activity detection accuracy improvements
- Embedding performance enhancements
New Features
- Introduced the
VlmOcrclass - Added support for LightOnOCR model
- Added model cards for
qwen3-vl:30bandlightonocr1025:1b
Version 2025.12.1 (03 December 2025)
Improvements
- General performance enhancements
New Features
- Added
Messageconstructor overloads taking attachment as parameter - Added
Expandmethod overload to theRectangleclass, allowing asymmetric expansion on each side - Added support for Ministral V3 models
- Added model cards for
ministral3:3b,ministral3:8b, andministral3:14b
Version 2025.12.1 (22 November 2025)
Improvements
- General performance enhancements
- Faster Vulkan backend
New Features
- Added support for Qwen3 embedding models
- Added model cards for
qwen3-embedding:0.6b,qwen3-embedding:4b, andqwen3-embedding:8b
Version 2025.11.2 (12 November 2025)
Improvements
- General performance enhancements
New Features
- Added a native assembly targeting .NET 10
Version 2025.11.1 (05 November 2025)
Improvements
- General performance enhancements
- Improved the summarizer engine
New Features
- Added model cards for
qwen3-vl:2b,qwen3-vl:4b, andqwen3-vl:8b
Version 2025.10.5 (24 October 2025)
Improvements
- General performance enhancements
New Features
- Introduced the
McpResourceclass - Introduced the
McpResourceContentclass - Introduced the
McpPromptclass - Introduced the
McpPromptArgumentclass - Introduced the
McpPromptResultclass - Introduced the
McpMessageRoleenumeration - Introduced the
McpPromptMessageclass - Introduced the
McpPromptContentclass - Introduced the
McpEmbeddedResourceclass - Introduced the
McpClientCapabilitiesandMcpServerCapabilitiesenumerations - Introduced the
McpCatalogKindenumeration - Added the
ServerCapabilities,Prompts,Resources, andMcpProtocolVersionproperties to theMcpClientclass - Added the
HasCapabilitymethod to theMcpClientclass - Added the
ToolsChanged,ResourcesChanged, andPromptsChangedevents to theMcpClientclass - Added the
GetResources,GetResourcesAsync,ReadResource, andReadResourceAsyncmethods to theMcpClientclass - Added the
GetPrompts,GetPromptsAsync,RefreshPrompts, andRefreshPromptsAsyncmethods to theMcpClientclass - Introduced the
McpResponseReceivedEventArgsclass - Introduced the
McpRequestSendingEventArgsclass - Introduced the
McpAuthFailedEventArgsclass - Introduced the
McpCatalogChangedEventArgsclass - Added the
ToolsChanged,ResourcesChanged, andPromptsChangedevents to theMcpClientclass
Version 2025.10.4 (16 October 2025)
Improvements
- General performance enhancements
- Improved tool calling support
- Improved Mistral and GptOss models support
New Features
- Introduced the
LMFunctionToolBinderclass - Introduced the
Agent.Tools.LMFunctionAttributeclass - Added model card for
magistral-small1.2 - Introduced the
ReasoningLevelenumeration - Added the
ReasoningLevelproperty to theMultiTurnConversationclass - Introduced the
ChatHistory.Message.MessageSegmentclass - Added the
Segmentsproperty to theChatHistory.Messageclass - Introduced the
EntityAnnotationclass - Introduced the
ExtractionTrainingDatasetclass - Introduced the
CategorizationTrainingDatasetclass - Introduced the
EmotionDetectionTrainingDatasetclass - Introduced the
SarcasmDetectionTrainingDatasetclass - Introduced the
SentimentAnalysisTrainingDatasetclass - Introduced the
LanguageDetectionTrainingDatasetclass - Introduced the
PiiExtractionTrainingDatasetclass
Version 2025.10.3 (09 October 2025)
Improvements
- Improved tool calling support
- Improved MCP client support
New Features
- Introduced the
TextSegmentTypeenumeration - Added the
SegmentTypeproperty to theAfterTextCompletionEventArgsclass
Version 2025.10.2 (06 October 2025)
Improvements
- General performance enhancements
- Improved tool calling support
- Optimized dynamic sampling mechanisms
New Features
- Added model cards for
granite4-h:3bandgranite4-h:7b
Version 2025.9.3 (29 September 2025)
Tool calling, MCP client, CUDA 13 backend, fine-tuning export, layout engine.
Improvements
- General performance enhancements
- Improved internal paragraph-detection engine, yielding better RAG chunking and higher extraction accuracy
- Improved PII extraction and NER engines accuracy
New Features
- Introduced first version of a CUDA 13 backend
- Introduced the
TextRegionclass - Introduced the
TextOutputModeenumeration - Added the
GetTextmethod to thePageElementclass - Introduced the
IToolclass - Introduced the
ToolRegistryclass - Introduced the
ToolCallPolicyclass - Introduced the
ToolChoiceenumeration - Introduced the
ToolCallResultTypeenumeration - Introduced the
ToolCallclass - Introduced the
ToolCallResultclass - Introduced the
MCPToolclass - Introduced the
MCPClientclass - Introduced the
ShareGptExporterclass - Introduced the
LMKit.Finetuning.Export.RoleMappingPolicyenumeration - Introduced the
LMKit.Finetuning.Export.ExportResultclass - Introduced the
LMKit.Finetuning.Export.ExportProgressclass - Added the
HasTextGenerationproperty to theLMclass - Added the
HasToolCallsproperty to theLMclass - Added the
Toolsproperty to theMultiTurnConversationclass - Added the
ToolPolicyproperty to theMultiTurnConversationclass - Introduced the
AfterToolInvocationEventArgsclass - Introduced the
BeforeToolInvocationEventArgsclass - Added the
BeforeToolInvocationevent to theMultiTurnConversationclass - Added the
AfterToolInvocationevent to theMultiTurnConversationclass - Added the
Occurrencesproperty to thePiiExtraction.PiiExtractedEntityclass - Added the
Occurrencesproperty to theNamedEntityRecognition.ExtractedEntityclass - Added the
PageElementsandPageCountproperties to theAttachmentclass - Added the
GetPageWidthandGetPageHeightmethods to theAttachmentclass
Version 2025.9.2 (10 September 2025)
Improvements
- Improved overall extraction accuracy
New Features
- Introduced the
LayoutSearchEngineclass - Added support for Gemma embedding models
- Added model card for
embeddinggemma-300m
Version 2025.9.1 (09 September 2025)
Improvements
- Improved overall extraction accuracy
New Features
- Added the
Boundsproperty to theTextExtractionResultElementclass - Added the
GetValue,TryGetValue,Get,TryGet, andGetAllmethods to theTextExtractionResultclass - Introduced the
Rectangleclass - Introduced the
Quadrilateralclass - Introduced the
IBounds,IPoint,ISize, andIVector2Dinterfaces - Introduced the
ILayoutElementinterface - Introduced the
ParagraphElementclass - Introduced the
LineElementclass - Added
DetectLinesandDetectParagraphsmethods to thePageElementclass - Added the
Boundsproperty to thePageElementclass - Added
TryDetectBorderBackgroundColor,Deskew,CropAuto,IsBlank,ConvertGRAY8,ConvertRGB24, andConvertRGBA32methods to theImageBufferclass - Added
DOCX,XLSX, andPPTXformats support to theAttachmentclass - Added the
Promptproperty to theSpeechToTextclass - Added model card for
minicpm-v-45
Version 2025.8.4 (22 August 2025)
Improvements
- General performance enhancements
Version 2025.8.3 (19 August 2025)
Improvements
- Improved Categorization engine speed and accuracy
- General performance enhancements
New Features
- Introduced the
ChatTrainingSampleclass - Introduced the
TrainingDatasetclass - Added model card for
gemma3:270m - Added the
SchemaDiscoveryandSchemaDiscoveryAsyncmethods to theTextExtractionclass - Added the
HasImageSegmentationproperty to theLMclass - Introduced the
ImageSegmentationmember to theModelCapabilitiesenumeration - Introduced the
BackgroundDetectionclass - Added model card for
u2net
Version 2025.8.2 (08 August 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Enhanced runtime loading strategy
- General performance enhancements
- Improved the summarizer and keyword extraction engines introducing optional target language detection
New Features
- Introduced the
Configuration.FavorDistributedInferenceproperty - Added
TargetLanguageproperty to theSummarizerclass - Added
TargetLanguageproperty to theKeywordExtractionclass - Added
RemoveLoraAdaptermethod to theLMclass - Added support for OpenAI gpt-oss models
- Added model card for
gptoss:20b
Version 2025.8.2 (02 August 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Enhanced runtime loading strategy
- General performance enhancements
Version 2025.7.5 (25 July 2025)
Improvements
- Optimized dynamic sampling mechanisms
New Features
- Introduced the
TesseractOcrEngineclass
Version 2025.7.4 (17 July 2025)
Improvements
- Better multi-threading support for multimodal inference
New Features
- Dynamic LoRA adapter loading support
- Introduced the
LoraAdapterclass - Introduced the
Adaptersproperty to theLMclass
Version 2025.7.3 (10 July 2025)
New Features
- Added support for SmolLM 3 models family
- Added model card for
smollm3:3b - Introduced the
ExtractionProfilerclass
Version 2025.7.2 (7 July 2025)
New Features
- Added support for linux-arm64 (CPU, Vulkan, and CUDA)
Improvements
- Optimized dynamic sampling mechanisms
Bug Fixes
- Fixed a crash that occurred when aborting speech-to-text tasks
Version 2025.7.1 (1 July 2025)
New Features
- Added
LoadAsRGBmethod overload to theImageBufferclass taking file path as parameter - Added
DetectLanguageandDetectLanguageAsyncmethod overload to theTextTranslationclass taking anImageBufferobject as parameter - Introduced the
VoiceActivityDetectionmember to theModelCapabilitiesenumeration - Introduced the
SpeechToText.SpeechToTextModeenumeration - Introduced the
VadSettingsclass - Added
Modeproperty to theSpeechToTextclass - Added
EnableVoiceActivityDetectionproperty to theSpeechToTextclass - Introduced the
VadSettingsproperty to theSpeechToTextclass - Introduced the
SpeechToText.LanguageDetectionResultclass - Introduced the
IsValidstatic method to theWaveFileclass
Version 2025.6.5 (27 June 2025)
Speech-to-text engine, ImageBuffer, WaveFile, audio processing classes.
New Features
- Added model cards for
mistral-small3.2,magistral-small,whisper-tiny,whisper-base,magistral-small,magistral-medium,whisper-large3, andwhisper-large-turbo3 - Added the
HasSpeechToTextproperty to theLMclass - Introduced the
SpeechToTextmember to theModelCapabilitiesenumeration - Introduced the
WaveFileclass - Introduced the
SpeechToTextclass - Introduced the
SpeechToText.OnNewSegmentEventArgsclass - Introduced the
SpeechToText.OnProgressEventArgsclass - Introduced the
SpeechToText.TranscriptionResultclass - Introduced the
AudioSegmentclass - Introduced the
CorruptedAudioExceptionclass - Introduced the
NotSupportedAudioExceptionclass - Introduced the
ImageBufferclass - Introduced the
ImagePixelFormatenumeration - Added new constructor to the
Attachmentclass to support ImageBuffer input
Improvements
- Improved overall extraction accuracy
Version 2025.6.4 (19 June 2025)
New Features
- Added the
PageElementproperty to theOcrResultclass - Added the
Guidanceproperty to thePiiExtractionclass - Added the
Guidanceproperty to theNamedEntityRecognitionclass
Improvements
- Minor extraction improvements
Version 2025.6.3 (17 June 2025)
New Features
- Added the
InnerElementsproperty to theTextExtractionResultElementclass - Introduced the
PageElementclass - Introduced the
TextElementclass - Added
SetTextmethod overload to theAttachmentclass taking PageElement as parameter
Improvements
- Improved overall extraction accuracy
Bug Fixes
- Fixed txt file loading issue from the Attachment class
Version 2025.6.2 (10 June 2025)
Named entity recognition, PII extraction, OCR events.
New Features
- Introduced the
NamedEntityRecognitionclass - Introduced the
NamedEntityRecognition.EntityDefinitionclass - Introduced the
NamedEntityRecognition.ExtractedEntityclass - Introduced the
NamedEntityRecognition.NamedEntityTypeenumeration - Introduced the
PiiExtractionclass - Introduced the
PiiExtraction.PiiExtractionclass - Introduced the
PiiExtraction.PiiExtractedEntityclass - Introduced the
PiiExtraction.PiiEntityTypeenumeration - Added the
Timeoutproperty to theTextractOcrclass - Introduced the
OcrStartingEventArgsevent - Introduced the
OcrCompletedEventArgsevent - Added the
Idproperty to theAttachmentclass - Added the
ClearContentmethod to theTextExtractionclass - Added the
InferencePoliciesproperty to theSingleTurnConversationclass
Improvements
- Optimized dynamic sampling mechanisms
Bug Fixes
- Fixed the Linux runtime, which was not correctly updated in the previous version
Version 2025.6.1 (2 June 2025)
New Features
- Added support for multimodal classification and extraction
- Added the
TitleandDescriptionproperties to theTextExtractionclass - Added the
OcrEngineproperty to theTextExtractionclass - Added the
MaxInputTokensproperty to theCategorizationclass - Added the
HasTextproperty to theAttachmentclass - Added the
CreateFromTextandDetectMimeTypestatic methods to theAttachmentclass - Added the
SetTextmethods to theAttachmentclass - Added the
Pathproperty to theAttachmentclass - Introduced the
InferenceModalityenumeration - Added the
PreferredInferenceModalityproperty to theCategorizationclass - Added the
PreferredInferenceModalityproperty to theTextExtractionclass - Introduced the
OcrEngineabstract class - Introduced the
OcrResultclass - Introduced the
OcrParametersclass - Introduced the
TextractOcrclass
Improvements
- Improved overall extraction accuracy
- Improved Categorization engine speed and accuracy
- Optimized dynamic sampling mechanisms
Version 2025.5.5 (25 May 2025)
New Features
- Added model card for
qwen2.5-vl:32b - Added the
TrimStartproperty to theTextExtractionElementFormatclass - Added support for more CUDA devices
Improvements
- Reduced memory usage when handling large context
- Improved overall extraction accuracy
Version 2025.5.4 (20 May 2025)
Improvements
- Various speed and memory usage improvements
Bug Fixes
- Fixed a model-loading issue that occurred under rare conditions
Version 2025.5.3 (20 May 2025)
Improvements
- Improved overall extraction accuracy
- Introduced the
TextExtractionElementFormat.PredefinedStringFormatenumeration - Added the
FormatHintproperty to theTextExtractionElementFormatclass - Added the
IsRequiredproperty to theTextExtractionElementFormatclass - Added the
EnableBackendLoggingproperty to theRuntimeclass
Version 2025.5.2 (13 May 2025)
Improvements
- Improved inference on various model families
- Reduced memory footprint of the built-in vector database engine
New Features
- Added model cards for
bge-m3andbge-m3-reranker - Introduced the
Rerankerclass - Introduced the
RagEngine.RagRerankerclass - Added the
Rerankerproperty to theRagEngineclass - Added new
RagEngineconstructor for multimodal RAG - Added
FindMatchingPartitionsandFindMatchingPartitionsAsyncmethod overloads to theRagEngineclass taking attachment as parameter - Added the
DefaultImagePayloadPixproperty to theRagEngineclass - Added
RerankedScoreandRawSimilarityproperties to thePartitionSimilarityclass - Added the
PayloadEncodingproperty to thePartitionSimilarityclass - Added the
NullOnDoubtproperty to theTextExtractionclass - Introduced the
Partitionclass - Introduced the
ImagePartitionclass - Added the
Versionproperty to theDataSourceclass
Version 2025.5.1 (5 May 2025)
Improvements
- Improved inference of vision models
- Improved inference of LLama 4 models
New Features
- Added support for Qwen 2.5 VL and Pixtral model families
- Added model cards for
pixtral,qwen2.5-vl:3b, andqwen2.5-vl:7b
Version 2025.4.13 (30 April 2025)
Improvements
- Improved translation engine precision with small models
New Features
- Added model cards for
qwen3:0.6b,qwen3:1.7b,qwen3:4b,qwen3:8b, andqwen3:14b - Added the
DisableReasoningproperty to theSingleTurnConversationclass - Added the
DisableReasoningproperty to theMultiTurnConversationclass - Added the
HasReasoningSupportmember to theTemplateFormatFlagsenumeration - Added the
HasReasoningproperty to theLMclass
Version 2025.4.12 (27 April 2025)
Bug Fixes
- Fixed a multi-byte decoding issue in produced model tokens
New Features
- Added
UpsertandUpsertAsyncmethod overloads to theDataSourceclass taking raw vector as parameter
Version 2025.4.11 (23 April 2025)
Bug Fixes
- Fixed ONNX runtime loading issue on Linux
New Features
- Added the
DataSource.VectorEntryclass - Added the
UpsertandUpsertAsyncmethod to theDataSourceclass - Introduced the
VectorSearchstatic class
Breaking changes
- The
TextPartitionSimilarityclass has been renamed toPartitionSimilarity
Version 2025.4.10 (22 April 2025)
Improvements
- Optimized dynamic sampling mechanisms
New Features
- Added support for image embeddings
- Added support for running inference on ONNX models
- Added the
GetEmbeddingsandGetEmbeddingsAsyncoverloads to theEmbedderclass to enable embeddings computation from image - Added the
HasImageEmbeddingsproperty to theLMclass - Introduced the
Summarizer.SummarizationIntentenumeration - Added the
Summarizer.Intentproperty to theSummarizerclass - Added model cards for
nomic-embed-vision,granite3.3:2b, andgranite3.3:8b
Version 2025.4.9 (15 April 2025)
Improvements
- Improved translation engine
- Improved structured text extraction accuracy
- Improved compatibility with JSON schemas for structured text extraction
- Improved internal scheduler engine
- Global speed improvement, up to 20%
New Features
- Added
GetTopCategories,GetTopCategoriesAsync,GetBestCategory, andGetBestCategoryAsyncoverloads to the Categorization class to specify category descriptions
Version 2025.4.7 (9 April 2025)
New Features
- Added the
HasSectionmethod to theDataSourceclass
Version 2025.4.6 (9 April 2025)
Built-in vector database engine, file-based DataSource storage.
New Features
- Introduced a new built-in vector database engine that permits handling small- to medium-sized vector collections on the file system without any dependency
- Introduced the
StorageModeenumeration - Added the
DataSource.StorageModeproperty - Added the
LoadFromFileandLoadFromFileAsyncmethods to theDataSourceclass - Added the
OptimizeDataSourceandOptimizeDataSourceAsyncmethods to theDataSourceclass - Added the
CreateFileDataSource,CreateInMemoryDataSource,CreateVectorStoreDataSource, andCreateVectorStoreDataSourceAsyncmethods to theDataSourceclass - Added the
Importmethod to theRagEngineclass - Added the
ImportAsyncmethod to theRagEngineclass - Added the
DefaultTextChunkingproperty to theRagEngineclass
Version 2025.4.1 (2 April 2025)
Improvements
- Enhanced RagEngine accuracy and speed
- The RAG engine now supports multiple DataSources simultaneously, each backed by its own storage mode (e.g., one DataSource using in-memory storage and another using a vector database)
New Features
- Introduced different DataSource storage modes, enabling storage of DataSource content directly in vector databases
- Introduced a new collection of open-source assemblies to bridge external vector databases with LM-Kit.NET. Currently, Qdrant is supported. Git repository: LM-Kit.NET Data Connectors
- Introduced the
IVectorStoreinterface - Introduced the
PointEntryclass - Added the
CollectionChangedevent to theMetadataCollectionclass - Added the
PropertyChangedevent to theMetadataCollectionclass - Added the
AddOrReplacemethod to theMetadataCollectionclass - Added the
ImportText,ImportTextAsync,ImportTextFromFile, andImportTextFromFileAsyncoverloads to theRagEngineclass to specify metadata collection - Added the
GetSectionByIdentifierAsyncmethod to theDataSourceclass - Added the
RemoveSectionAsyncmethod to theDataSourceclass - Added the
LoadFromStoreandLoadFromStoreAsyncmethods to theDataSourceclass - Added the
Metadataproperty to theRagEngine.TextPartitionSimilarityclass
Version 2025.3.6 (24 March 2025)
Improvements
- Strong performance improvements for the Vulkan backend
- Enhanced extraction accuracy in the structured text extraction engine
- Reduced VRAM/RAM usage
New Features
- Added support for Mistral Small 3.1 model and added mistral-small-3.1 model card
- Added the
GetTextmethod to theAttachmentclass - Added HTML format support to the
Attachmentclass
Version 2025.3.5 (17 March 2025)
New Features
- Added the
HandleUnknownCategoryproperty to theCategorizationclass - Added support for Gemma 3 vision
Improvements
- Improved inference with Gemma 3 models family
- Improved internal context caching strategy
- Inference speed improvements
Version 2025.3.4 (12 March 2025)
New Features
- Added support for Gemma 3 models family
- Added model cards for Qwen QwQ 32B, Gemma 3 1B, Gemma 3 4B, Gemma 3 12B, and Gemma 3 27B
- Added the
Guidanceproperty to theCategorizationclass
Version 2025.3.3 (11 March 2025)
New Features
- Added the
SetElementsFromJsonSchememethod to theTextExtractionclass - Added the
IsVisionRequiredproperty to theAttachmentclass - Added the
GetTopCategoriesandGetTopCategoriesAsyncproperty to theCategorizationclass - Added the
GetBestCategoryAsyncandGetTopCategoriesAsyncoverloads to theCategorizationclass to enable classification from image attachments
Version 2025.3.2 (3 March 2025)
Improvements
- Improved inference with phi-3 and phi-4 models
- Reduced CUDA backend size for Windows by a factor of 3
New Features
- Introduced the
MemoryTypeenumeration - Added the
SaveInformationoverloads to theAgentMemoryclass to specify the type of memory to record - Added the
SaveInformationAsyncoverloads to theAgentMemoryclass to specify the type of memory to record - Added the
GetMemoryTypestatic method to theAgentMemoryclass - Added the
GetDataSourcemethod to theRagEngineclass - Added the
TryGetDataSourcemethod to theRagEngineclass - Added the
Getmethod to theMetadataCollectionclass - Added the
TryGetmethod to theMetadataCollectionclass - Added the
MemoryTypeproperty to theMemoryRecallEventArgsclass
Version 2025.3.1 (1 March 2025)
Improvements
- Improvements to generic classification accuracy and speed
New Features
- Added the
RemoveDataSourceoverloads to theRagEngineclass to remove a DataSource from an identifier - Added the
RemoveSectionmethod to theDataSourceclass - Added support for Phi-4 Mini models
- Added model cards for Phi-4 Mini and Deepseek R1
Version 2025.2.4 (26 February 2025)
Improvements
- Minor performance improvements of the inference system
- Improved inference with various model families such as Granite, Mistral, Monarch, Orion, MiniCPM, and GigaChat
- Improved language detection accuracy
New Features
- Added new constructor to the
Attachmentclass to support stream input - Added the
ExtractKeywordsandExtractKeywordsAsyncoverloads to theKeywordExtractionclass to extract keywords from image attachments - Added the
SummarizeandSummarizeAsyncoverloads to theSummarizerclass to produce summary from image attachments - Added the
DetectLanguageandDetectLanguageAsyncoverloads to theTextTranslationclass to detect language from image attachments - Introduced the
DataFilterclass - Added the
RemoveDataSourceoverloads to theAgentMemoryclass to remove a DataSource from an identifier - Added the
AddDataSourcemethod to theAgentMemoryclass - Added the
AddDataSourcesmethod to theAgentMemoryclass - Added the
Filterproperty to theRagEngineclass - Added the
Filterproperty to theAgentMemoryclass - Added support for DeepSeek-R1 models
Version 2025.2.3 (19 February 2025)
Improvements
- Improved RAG engine accuracy and speed
- Optimized dynamic sampling mechanisms
New Features
- Introduced the
AgentMemoryclass - Introduced the
MemoryRecallEventArgsclass - Introduced the
TopNSigmaSamplingclass - Added the
Memoryproperty to theMultiTurnConversationclass - Added the
MaximumRecallTokensproperty to theMultiTurnConversationclass - Added the
MemoryRecallevent to theMultiTurnConversationclass - Added the
DownloadAsyncmethod to theModelCardclass - Added the
LogCallbackevent to theRuntimeclass
Version 2025.2.2 (12 February 2025)
Improvements
- Improved internal context caching strategy
New Features
- Added the
ModelIDproperty to theModelCardclass - Added the
GetPredefinedModelCardByModelIDmethod to theModelCardclass - Added the
LoadFromModelIDstatic method to theLMclass - Added the
ModelUriproperty to theLMclass - Added the
ImportTextAsyncmethod to theRagEngineclass - Added the
ImportTextFromFileAsyncmethod to theRagEngineclass
Version 2025.2.1 (4 February 2025)
Improvements
- Improved inference of MiniCPM engine
New Features
- Added model card for Mistral Small Instruct 2501
- Created a new open-source assembly bridging Microsoft SemanticKernel and LM-Kit.NET. Git repository: LM-Kit.NET Semantic Kernel
Version 2025.1.10 (30 January 2025)
Improvements
- Added WebP and TIFF image support to Attachment
- Minor performance improvements of the inference system
Version 2025.1.9 (27 January 2025)
Improvements
- Enhanced compatibility with Framework 4.6.2 apps
- Enhanced backend management mechanism
Version 2025.1.8 (26 January 2025)
Improvements
- Enhanced compatibility with Framework 4.6.2 apps
- Minor performance improvements for the Vulkan backend
Version 2025.1.7 (24 January 2025)
Improvements
- Improved speed and accuracy of vision models inference
Bug Fixes
- Fixed a memory leak during image eval using MiniCPM models
Version 2025.1.6 (22 January 2025)
Improvements
- Improved speed and accuracy of vision models inference for content retrieval
New Features
- Added support for the MiniCPM-O model
- Added the
ReplacementModelproperty to theModelCardclass
Bug Fixes
- Resolved a crash when regenerating responses using MiniCPM models
- Fixed a memory leak with VLM disposal
Version 2025.1.4 (19 January 2025)
Bug Fixes
- Resolved an inference issue in AOT mode
- Resolved a rare race condition in the Vulkan backend during inference
- Resolved a segmentation fault in the TextExtraction class
Version 2025.1.3 (18 January 2025)
Improvements
- Various performance improvements
- Improved Runtime initialization
New Features
- Added the
SetContentmethod overload that accepts anAttachmentto theTextExtractionclass, allowing extraction of unstructured content from images
Bug Fixes
- Resolved a crash during Qwen2-VL inference on macOS
Version 2025.1.2 (11 January 2025)
First multimodal/VLM support, Attachment class, LMK model format.
Improvements
- Enhanced GPU device memory management
- Optimized dynamic sampling mechanisms
- Various performance improvements
New Features
- Initial support for multimodal capabilities with vision handling (VLM)
- Added support for the LMK model format
- Introduced the
Promptclass - Introduced the
Attachmentclass - Added overloads to the
SubmitandSubmitAsyncmethods in theMultiTurnConversationclass to support prompts with image attachments - Added overloads to the
SubmitandSubmitAsyncmethods in theSingleTurnConversationclass to support prompts with image attachments
Version 2025.1.1 (1 January 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Improved error handling
New Features
- Added the
HasVisionproperty to theLMclass - Added the
EnableModelChecksumValidationproperty to theConfigurationclass
2024 (Preview)
Expand 2024 releases (Preview era: July 2024 to December 2024)
Version 2024.12.13 (29 December 2024)
Improvements
- Optimized dynamic sampling mechanisms
- Improved performance with IBM Granite models
New Features
- Added overloads to the
GetPerformanceScoremethods in theDeviceConfigurationclass to obtain the model's performance score from a specified path
Bug Fixes
- An unhandled exception was raised when a model could not be loaded
Version 2024.12.12 (26 December 2024)
New Features
- Added the
ShortModelNameproperty to theModelCardclass - Added the
IsPredefinedproperty to theModelCardclass
Improvements
- The
LLMclass has been marked as obsolete. UseLMclass as its replacement
Bug Fixes
GpuDeviceInfo.DeviceTypewas returning an invalid value
Version 2024.12.11 (23 December 2024)
New Features
- Added the
Architectureproperty to theModelCardclass - Added the
GetPerformanceScoremethod to theDeviceConfigurationclass - Added support for Llama-3_1-Nemotron-51B and InfiniAI Megrez 3b models
Version 2024.12.10 (22 December 2024)
New Features
- Added the
LocalPathproperty to theModelCardclass
Improvements
- Improved performance with Phi 4 model
Version 2024.12.9 (20 December 2024)
New Features
- Added the
ValidateFileChecksummethod to theModelCardclass - The ModelCard class can now retrieve model metadata directly from Hugging Face servers without downloading the models
Version 2024.12.8 (19 December 2024)
New Features
- Added the
GetDeviceFromNumbermethod to theGpuDeviceInfoclass - Added the
MaximumContextLengthproperty to theTextExtractionclass - Added overloads to
GetOptimalContextSizemethods in theDeviceConfigurationclass to obtain the optimal context size from the available resources and model - Introduced the
ModelCardclass - Introduced the
ModelCapabilitiesenumeration - Added the
ModelStorageDirectoryproperty to theConfigurationclass
Improvements
- Improved summarization engine
Version 2024.12.7 (15 December 2024)
New Features
- Introduced the
Summarizer.OverflowResolutionStrategyenumeration - Introduced the
TextShrinkingStrategyenumeration - Added the
OverflowStrategyproperty to theSummarizerclass - Added the
TextShrinkingStrategyproperty to theKeywordExtractionclass - Introduced the
DeviceConfigurationclass
Improvements
- Improved summarization of very large text
Bug Fixes
- Prevent the
MultiTurnConversationclass from requesting a context size exceeding the model's capacity
Version 2024.12.6 (13 December 2024)
New Features
- Introduced the
Summarizerclass - Introduced the
Summarizer.SummarizerResultclass
Improvements
- Enhanced MAUI compatibility for macOS development
Version 2024.12.5 (11 December 2024)
New Features
- Added the
ContextRemainingSpaceproperty to theMultiTurnConversationclass - Added the
ContextRemainingSpaceproperty to theAfterTokenSamplingEventArgsclass - Added the
ContextSizeproperty to theAfterTokenSamplingEventArgsclass
Version 2024.12.4 (10 December 2024)
Improvements
- Changed type of ChatHistory.Message.PreviousContent to ChatHistory.Message
- Improved CPU backend speed on Windows
Version 2024.12.3 (07 December 2024)
Bug Fixes
- The items in ChatHistory.Message.PreviousContent were not set in the correct order
- The KeywordExtraction.KeywordCount setter was ignored
Improvements
- Improved the sampling strategy in scenarios with a low context length
Version 2024.12.2 (07 December 2024)
New Features
- The Windows Vulkan runtime is now included in the NuGet package
Improvements
- Optimized dynamic sampling mechanisms
Version 2024.12.1 (06 December 2024)
New Features
- Introduced the
LLM.TemplateFormatFlagsenumeration - Introduced the
KeywordExtractionclass - Introduced the
KeywordExtraction.KeywordItemclass - Added the
ChatTemplateFormatFlagsproperty to theLLMclass - Added the
IsArrayproperty to theTextExtractionElementclass
Improvements
- Optimized dynamic sampling mechanisms
- Improved support for large text bodies in the embedding engine
Version 2024.11.10 (29 November 2024)
New Features
- Introduced the
DeviceTypeenumeration - Added the
DeviceTypeproperty to theGpuDeviceInfoclass
Improvements
- Inference speed improvements with Vulkan backend
Version 2024.11.9 (27 November 2024)
New Features
- Added the
UseAsyncModelAttributesLoadingproperty to theConfigurationclass - Added the
DeviceDescriptionproperty to theGpuDeviceInfoclass - Added the
TotalMemorySizeproperty to theGpuDeviceInfoclass - Added the
FreeMemorySizeproperty to theGpuDeviceInfoclass
Improvements
- Improved NuGet package compatibility with MAUI
- Various inference speed improvements
- Improved handling of model loading failures
Version 2024.11.8 (25 November 2024)
New Features
- Added the
SetLogTextWritermethod to theRuntimeclass
Version 2024.11.7 (25 November 2024)
New Features
- Added maccatalyst RID for MAUI
Version 2024.11.6 (25 November 2024)
New Features
- Vulkan backend is now enabled by default
- Introduced the
Runtime.BackendTypeenumeration - Added the
Backendstatic property to theRuntimeclass - Introduced the
GpuDeviceInfoclass - Added the
PreviousContentproperty to theChatHistory.Messageclass
Version 2024.11.5 (23 November 2024)
Improvements
- Enhanced inference speed
- Improved compatibility with MAUI
Version 2024.11.4 (18 November 2024)
New Features
- Added Vulkan backend for Linux
- Added support for .NET 9
Improvements
- Improved compatibility with macOS ARM and macOS x64
- Enhanced CPU inference speed
Version 2024.11.4 (12 November 2024)
New Features
- Added the
CreateGrammarFromStringListmethod to theGrammarclass - Added the
WhitelistedValuesproperty to theTextExtractionElementFormatclass
Version 2024.11.2 (05 November 2024)
New Features
- Added overloads to
GetEmbeddingsandGetEmbeddingsAsyncmethods in theEmbedderclass to support batch processing of multiple inputs
Version 2024.11.1 (04 November 2024)
New Features
- Added support for Granite models
- Added the
Perplexityproperty to theBeforeTokenSamplingEventArgsclass - Added the
Perplexityproperty to theAfterTokenSamplingEventArgsclass - Added the
ContinueLastAssistantResponseandContinueLastAssistantResponseAsyncmethods to theMultiTurnConversationclass
Improvements
- Optimized dynamic sampling mechanisms
Version 2024.10.5 (24 October 2024)
New Features
- Added the
GeneratedTokensproperty to theChatHistory.Messageclass - Added the
TerminationReasonproperty to theChatHistory.Messageclass - Introduced the
TextExtractionElementFormatclass - Introduced the
TextExtractionElementFormat.TextCaseModeenumeration - Added the
TextExtractionElementFormatproperty to theTextExtractionElementclass
Improvements
- Enhanced structured text extraction
- Added new formatting options to text extraction elements
- Improved function calling capabilities
- Refined JSON data sampling
- Optimized dynamic sampling mechanisms
Version 2024.10.4 (17 October 2024)
Improvements
- Structured text extraction improvements
- Function calling improvements
- Refined JSON data sampling
- Optimized dynamic sampling mechanisms
Version 2024.10.3 (16 October 2024)
New Features
- Added the
TextExtractionclass - Added the
TextExtractionResultclass - Added the
TextExtractionResultElementclass - Added the
ElementTypeenumeration - Added the
DateTimeparameter type support to function calling
Improvements
- Optimized dynamic sampling mechanisms
Version 2024.10.2 (09 October 2024)
New Features
- Added the
TextExtractionElementclass - Added the
CreateJsonGrammarFromExtractionElements(IEnumerable<TextExtractionElement> extractionElements)method overload to theGrammarclass
Improvements
- Optimized dynamic sampling mechanisms
Version 2024.10.1 (01 October 2024)
New Features
- Added the
CreateTrainingObjectmethod overload to theEmotionDetectionclass - Added the
GetTrainingDatamethod to theEmotionDetectionclass - Introduced the
EmotionDetection.TrainingDatasetenumeration - Introduced the
JsonNumArray,JsonIntArray, andJsonStringArraymembers to theLMKit.Data.Json.JsonBasicTypeenumeration - Introduced the
TextTranslation.TrainingDatasetenumeration - Added the
GetTrainingDatamethod to theTextTranslationclass - Added the
EnableDynamicSamplingproperty to theConfigurationclass
Improvements
- Improvements to generic classification accuracy and speed
- Improved global performance with Llama 3.2 models
- Improved global performance with Qwen 2.5 models
- Fine-tuning: extended compatibility to all Llama architectures
- Improved sampling of JSON array structures
Version 2024.9.4 (25 September 2024)
New Features
- Added support for array parameters in function calls
- Added the
ForceFunctionSelectionproperty to theSingleFunctionCallclass - Introduced the
LMKit2024_09_INTmember to theSentimentAnalysis.TrainingDatasetenumeration
Improvements
- Improved function calling accuracy and speed
- Improved emotion detection engine accuracy and speed
- Strong improvements to generic classification accuracy and speed
- Improved accuracy of the language detection engine
Bug Fixes
- Fixed a memory leak in the fine-tuning engine
Version 2024.9.3 (18 September 2024)
Improvements
- Improved function calling accuracy
- Enhanced language detection accuracy
- Boosted generic classification accuracy and speed
- Increased inference speed on CPU
Version 2024.9.2 (11 September 2024)
New Features
- Added the
Seedproperty toRandomSampling,MirostatSampling, andMirostat2Samplingclasses - Introduced the
TrimAutomember to theInputLengthOverflowPolicyenumeration - Enabled deserialization of
ChatHistoryobjects without specifying aModelparameter
Improvements
- Further increased inference speed on CPU
- Enhanced internal API for better error handling
Version 2024.9.1 (6 September 2024)
Improvements
- Optimized inference speed on CPU
- Implemented checks to prevent access to disposed objects
- Improved exception handling in quantization features
Version 2024.9.0 (3 September 2024)
New Features
- Added the
GetTokenmethod to theVocabularyclass - Introduced the
Confidenceproperty to theTextTranslationclass - Added the
MaximumInputTokensproperty to theSingleTurnConversationclass
Improvements
- Enhanced JSON content sampling strategy
- Extended CUDA backend compatibility across more Windows configurations
- Improved accuracy of the language detection engine
- Enhanced compatibility with MAUI
Version 2024.8.4 - Preview (26 August 2024)
New Features
- Introduced the
GlobalConfigurationstatic class
Improvements
- Refined JSON content sampling strategy
- Improved function calling accuracy and speed
- Increased inference speed with Gemma2 models
- Enhanced classification accuracy with tiny-small models
Bug Fixes
- Fixed an issue when computing embeddings from a token array matching the context size
Version 2024.8.3 - Preview (21 August 2024)
Improvements
- Improved support for Phi 3.5 models
Version 2024.8.2 - Preview (20 August 2024)
New Features
- Added the
CreateJsonGrammarFromJsonSchememethod to theGrammarclass
Improvements
- Improved JSON content sampling strategy
- Enhanced function calling accuracy and speed
Version 2024.8.1 - Preview (15 August 2024)
New Features
- Added Vulkan backend 🔥
- Introduced the
FunctionCallResultproperty to theFunctionCallEventArgsclass - Added the
EnableVulkanproperty to theRuntimeclass - Included
KeepLastproperty in theAfterTokenSamplingEventArgsclass
Improvements
- Refined JSON content sampling strategy
- Enhanced function calling accuracy and speed
Version 2024.8.0 - Preview (8 August 2024)
New Features
- Added the
CreateJsonGrammarFromFieldsmethod to theGrammarclass - Enabled function calling support with any model class 🔥
- Introduced the
SingleFunctionCallandFunctionCallResultclasses - Added the
LMFunctionAttributeattribute
Improvements
- Enhanced grammar sampling strategy, especially for JSON schemes
Version 2024.7.10 - Preview (6 August 2024)
New Features
- Added support for Google Gemma 2 2B models 🔥
- Introduced the
CreateJsonGrammarFromTextFieldsmethod to theGrammarclass
Improvements
- Improved grammar sampling strategy, particularly with JSON schemes
Version 2024.7.9 - Preview (31 July 2024)
New Features
- Added the
Versionproperty to theRuntimeclass
Bug Fixes
- Fixed an issue enabling neutral emotion support in the
EmotionDetectionclass - Resolved issues when running fine-tuning with gradient checkpointing
Version 2024.7.8 - Preview (30 July 2024)
New Features
- Added the
MaximumContextLengthproperty to theSingleTurnConversationclass - Added the
GetTextAsyncmethod to theAttachmentclass
Improvements
- Enhanced inference engine handling of context overflow scenarios
Version 2024.7.7 - Preview (29 July 2024)
Improvements
- Enhanced compatibility with .NET Framework 4.6.2
Bug Fixes
- Resolved a crash related to grammar sampling
Version 2024.7.6 - Preview (27 July 2024)
New Features
- Added support for LLaMA 3.1 models 🔥
Improvements
- Enhanced logging mechanism
Version 2024.7.1 - Preview (23 July 2024)
🎉 Initial release!