📝 LM-Kit.NET Change History

📅 Version 2025.7.4 (17 July 2025)

Improvements

Better multi-threading support for multimodal inference

New Features

Dynamic LoRA Adapter Loading support
Introduced the LoraAdapter class
Introduced the Adapters property to the LM class

📅 Version 2025.7.3 (10 July 2025)

New Features

Added support for SmolLM 3 models family
Added model card for smollm3:3b
Introduced the ExtractionProfiler class

📅 Version 2025.7.2 (7 July 2025)

New Features

Added support for linux-arm64 (cpu, Vulkan and Cuda)

Improvements

Optimized dynamic sampling mechanisms

Bug Fixes

Fixed a crash that occurred when aborting speech-to-text tasks

📅 Version 2025.7.1 (1 July 2025)

New Features

Added LoadAsRGB method overload to the ImageBuffer class taking file path as parameter
Added DetectLanguage and DetectLanguageAsync method overload to the TextTranslation class taking an ImageBuffer object as parameter
Introduced the VoiceActivityDetection member to the ModelCapabilities enumeration
Introduced the SpeechToText.SpeechToTextMode enumeration
Introduced the VadSettings class
Added Mode property to the SpeechToText class
Added EnableVoiceActivityDetection property to the SpeechToText class
Introduced the VadSettings property to the SpeechToText class
Introduced the SpeechToText.LanguageDetectionResult class
Introduced the IsValid static method to the WaveFile class

📅 Version 2025.6.5 (27 June 2025)

New Features

Added model cards for mistral-small3.2, magistral-small, whisper-tiny, whisper-base, magistral-small, magistral-medium, whisper-large3 and whisper-large-turbo3
Added the HasSpeechToText property to the LM class
Introduced the SpeechToText member to the ModelCapabilities enumeration
Introduced the WaveFile class
Introduced the SpeechToText class
Introduced the SpeechToText.OnNewSegmentEventArgs class
Introduced the SpeechToText.OnProgressEventArgs class
Introduced the SpeechToText.TranscriptionResult class
Introduced the AudioSegment class
Introduced the CorruptedAudioException class
Introduced the NotSupportedAudioException class
Introduced the ImageBuffer class
Introduced the ImagePixelFormat enumeration
Added new constructor to the Attachment class to support ImageBuffer input

Improvements

Improved overall extraction accuracy

📅 Version 2025.6.4 (19 June 2025)

New Features

Added the PageElement property to the OcrResult class
Added the Guidance property to the PiiExtraction class
Added the Guidance property to the NamedEntityRecognition class

Improvements

Minor extraction improvements

📅 Version 2025.6.3 (17 June 2025)

New Features

Added the InnerElements property to the TextExtractionResultElement class
Introduced the PageElement class
Introduced the TextElement class
Added SetText method overload to the Attachment class taking PageElement as parameter

Improvements

Improved overall extraction accuracy

Bug Fixes

Fixed txt file loading issue from the Attachment class

📅 Version 2025.6.2 (10 June 2025)

New Features

Introduced the NamedEntityRecognition class
Introduced the NamedEntityRecognition.EntityDefinition class
Introduced the NamedEntityRecognition.ExtractedEntity class
Introduced the NamedEntityRecognition.NamedEntityType enumeration
Introduced the PiiExtraction class
Introduced the PiiExtraction.PiiExtraction class
Introduced the PiiExtraction.PiiExtractedEntity class
Introduced the PiiExtraction.PiiEntityType enumeration
Added the Timeout property to the TextractOcr class
Introduced the OcrStartingEventArgs event
Introduced the OcrCompletedEventArgs event
Added the Id property to the Attachment class
Added the ClearContent method to the TextExtraction class
Added the InferencePolicies property to the SingleTurnConversation class

Improvements

Optimized dynamic sampling mechanisms

Bug Fixes

Fixed the Linux runtime, which was not correctly updated in the previous version

📅 Version 2025.6.1 (2 June 2025)

New Features

Added support for multimodal classification and extraction
Added the Title and Description properties to the TextExtraction class
Added the OcrEngine property to the TextExtraction class
Added the MaxInputTokens property to the Categorization class
Added the HasText property to the Attachment class
Added the CreateFromText and DetectMimeType static methods to the Attachment class
Added the SetText methods to the Attachment class
Added the Path property to the Attachment class
Introduced the InferenceModality enumeration
Added the PreferredInferenceModality property to the Categorization class
Added the PreferredInferenceModality property to the TextExtraction class
Introduced the OcrEngine abstract class
Introduced the OcrResult class
Introduced the OcrParameters class
Introduced the TextractOcr class

Improvements

Improved overall extraction accuracy
Improved Categorization engine speed & accuracy
Optimized dynamic sampling mechanisms

📅 Version 2025.5.5 (25 May 2025)

New Features

Added model card for qwen2.5-vl:32b
Added the TrimStart property to the TextExtractionElementFormat class
Added support for more CUDA devices

Improvements

Reduced memory usage when handling large context
Improved overall extraction accuracy

📅 Version 2025.5.4 (20 May 2025)

Improvements

various speed and memory usage improvements

Bug Fixes

Fixed a model-loading issue that occurred under rare conditions

📅 Version 2025.5.3 (20 May 2025)

Improvements

Improved overall extraction accuracy
Introduced the TextExtractionElementFormat.PredefinedStringFormat enumeration
Added the FormatHint property to the TextExtractionElementFormat class
Added the IsRequired property to the TextExtractionElementFormat class
Added the EnableBackendLogging property to the Runtime class

📅 Version 2025.5.2 (13 May 2025)

Improvements

Improved inference on various model families
Reduced memory footprint of the built-in vector database engine

New Features

Added model cards for bge-m3 and bge-m3-reranker
Introduced the Reranker class
Introduced the RagEngine.RagReranker class
Added the Reranker property to the RagEngine class
Added new RagEngine constructor for multimodal RAG.
Added FindMatchingPartitions and FindMatchingPartitionsAsync method overloads to the RagEngine class taking attachment as parameter
Added the DefaultImagePayloadPix property to the RagEngine class
Added RerankedScore and RawSimilarity properties to the PartitionSimilarity class
Added the PayloadEncoding property to the PartitionSimilarity class
Added the NullOnDoubt property to the TextExtraction class
Introduced the Partition class
Introduced the ImagePartition class
Added the Version property to the DataSource class

📅 Version 2025.5.1 (5 May 2025)

Improvements

Improved inference of vision models
Improved inference of LLama 4 models

New Features

Added support for Qwen 2.5 VL and Pixtral model families
Added model cards for pixtral, qwen2.5-vl:3b and qwen2.5-vl:7b

📅 Version 2025.4.13 (30 April 2025)

Improvements

Improved translation engine precision with small models

New Features

Added model cards for qwen3:0.6b, qwen3:1.7b, qwen3:4b, qwen3:8b and qwen3:14b
Added the DisableReasoning property to the SingleTurnConversation class
Added the DisableReasoning property to the MultiTurnConversation class
Added the HasReasoningSupport member to the TemplateFormatFlags enumeration
Added the HasReasoning property to the LM class

📅 Version 2025.4.12 (27 April 2025)

Bug Fixes

Fixed a multi-byte decoding issue in produced model tokens

New Features

Added Upsert and UpsertAsync method overloads to the DataSource class taking raw vector as parameter

📅 Version 2025.4.11 (23 April 2025)

Bug Fixes

Fixed ONNX runtime loading issue on Linux

New Features

Added the DataSource.VectorEntry class
Added the Upsert and UpsertAsync method to the DataSource class
Introduced the VectorSearch static class

Breaking changes

The TextPartitionSimilarity class has been renamed to PartitionSimilarity

📅 Version 2025.4.10 (22 April 2025)

Improvements

Optimized dynamic sampling mechanisms

New Features

Added support for image embeddings
Added support for running inference on ONNX models
Added the GetEmbeddings and GetEmbeddingsAsync overloads to the Embedder class to enable embeddings computation from image
Added the HasImageEmbeddings property to the LM class
Introduced the Summarizer.SummarizationIntent enumeration
Added the Summarizer.Intent property to the Summarizer class
Added model cards for nomic-embed-vision, granite3.3:2b and granite3.3:8b

📅 Version 2025.4.9 (15 April 2025)

Improvements

Improved translation engine
Improved structured text extraction accuracy
Improved compatibility with JSON schemas for structured text extraction
Improved internal scheduler engine
Global speed improvement, up to 20%

New Features

Added GetTopCategories, GetTopCategoriesAsync, GetBestCategory and GetBestCategoryAsync overloads to the Categorization class to specify category descriptions

📅 Version 2025.4.7 (9 April 2025)

New Features

Added the HasSection method to the DataSource class

📅 Version 2025.4.6 (9 April 2025)

New Features

Introduced a new built-in vector database engine that permits handling small- to medium-sized vector collections on the file system without any dependency
Introduced the StorageMode enumeration
Added the DataSource.StorageMode property
Added the LoadFromFile and LoadFromFileAsync methods to the DataSource class
Added the OptimizeDataSource and OptimizeDataSourceAsync methods to the DataSource class
Added the CreateFileDataSource, CreateInMemoryDataSource, CreateVectorStoreDataSource and CreateVectorStoreDataSourceAsync methods to the DataSource class
Added the Import method to the RagEngine class
Added the ImportAsync method to the RagEngine class
Added the DefaultTextChunking property to the RagEngine class

📅 Version 2025.4.1 (2 April 2025)

Improvements

Enhanced RagEngine accuracy and speed
The RAG engine now supports multiple DataSources simultaneously, each backed by its own storage mode (e.g., one DataSource using in-memory storage and another using a vector database)

New Features

Introduced different DataSource storage modes, enabling storage of DataSource content directly in vector databases
Introduced a new collection of open-source assemblies to bridge external vector databases with LM-Kit.NET. Currently, Qdrant is supported. Git repository: LM-Kit.NET Data Connectors
Introduced the IVectorStore interface
Introduced the PointEntry class
Added the CollectionChanged event to the MetadataCollection class
Added the PropertyChanged event to the MetadataCollection class
Added the AddOrReplace method to the MetadataCollection class
Added the ImportText, ImportTextAsync, ImportTextFromFile, ImportTextFromFileAsync overloads to the RagEngine class to specify metadata collection
Added the GetSectionByIdentifierAsync method to the DataSource class
Added the RemoveSectionAsync method to the DataSource class
Added the LoadFromStore and LoadFromStoreAsync methods to the DataSource class
Added the Metadata property to the RagEngine.TextPartitionSimilarity class

📅 Version 2025.3.6 (24 March 2025)

Improvements

Strong performance improvements for the Vulkan backend
Enhanced extraction accuracy in the structured text extraction engine
Reduce VRAM/RAM usage

New Features

Added support for Mistral Small 3.1 model and added mistral-small-3.1 model card
Added the GetText method to the Attachment class
Added HTML format support to the Attachment class

📅 Version 2025.3.5 (17 March 2025)

New Features

Added the HandleUnknownCategory property to the Categorization class
Added support for Gemma 3 vision

Improvements

Improved inference with Gemma 3 models family
Improved internal context caching strategy
Inference speed improvements

📅 Version 2025.3.4 (12 March 2025)

New Features

Added support for Gemma 3 models family
Added model cards for Qwen QwQ 32B, Gemma 3 1B, Gemma 3 4B, Gemma 3 12B and Gemma 3 27B
Added the Guidance property to the Categorization class

📅 Version 2025.3.3 (11 March 2025)

New Features

Added the SetElementsFromJsonScheme method to the TextExtraction class
Added the IsVisionRequired property to the Attachment class
Added the GetTopCategories and GetTopCategoriesAsync property to the Categorization class
Added the GetBestCategoryAsync and GetTopCategoriesAsync overloads to the Categorization class to enable classification from image attachments

📅 Version 2025.3.2 (3 March 2025)

Improvements

Improved inference with phi-3 and phi-4 models
Reduced CUDA backend size for Windows by a factor of 3

New Features

Introduced the MemoryType enumeration
Added the SaveInformation overloads to the AgentMemory class to specify the type of memory to record
Added the SaveInformationAsync overloads to the AgentMemory class to specify the type of memory to record
Added the GetMemoryType static method to the AgentMemory class
Added the GetDataSource method to the RagEngine class
Added the TryGetDataSource method to the RagEngine class
Added the Get method to the MetadataCollection class
Added the TryGet method to the MetadataCollection class
Added the MemoryType property to the MemoryRecallEventArgs class

📅 Version 2025.3.1 (1 March 2025)

Improvements

Improvements to generic classification accuracy and speed

New Features

Added the RemoveDataSource overloads to the RagEngine class to remove a DataSource from an identifier
Added the RemoveSection method to the DataSource class
Added support for Phi-4 Mini models
Added model cards for Phi-4 Mini and Deepseek R1

📅 Version 2025.2.4 (26 February 2025)

Improvements

Minor performance improvements of the inference system
Improved inference with various model families such as Granite, Mistral, Monarch, Orion, MiniCPM, and GigaChat
Improved language detection accuracy

New Features

Added new constructor to the Attachment class to support stream input
Added the ExtractKeywords and ExtractKeywordsAsync overloads to the KeywordExtraction class to extract keywords from image attachments
Added the Summarize and SummarizeAsync overloads to the Summarizer class to produce summary from image attachments
Added the DetectLanguage and DetectLanguageAsync overloads to the TextTranslation class to detect language from image attachments
Introduced the DataFilter class
Added the RemoveDataSource overloads to the AgentMemory class to remove a DataSource from an identifier
Added the AddDataSource method to the AgentMemory class
Added the AddDataSources method to the AgentMemory class
Added the Filter property to the RagEngine class
Added the Filter property to the AgentMemory class
Added support for DeepSeek-R1 models

📅 Version 2025.2.3 (19 February 2025)

Improvements

Improved RAG engine accuracy and speed
Optimized dynamic sampling mechanisms

New Features

Introduced the AgentMemory class
Introduced the MemoryRecallEventArgs class
Introduced the TopNSigmaSampling class
Added the Memory property to the MultiTurnConversation class
Added the MaximumRecallTokens property to the MultiTurnConversation class
Added the MemoryRecall event to the MultiTurnConversation class
Added the DownloadAsync method to the ModelCard class
Added the LogCallback event to the Runtime class

📅 Version 2025.2.2 (12 February 2025)

Improvements

Improved internal context caching strategy

New Features

Added the ModelID property to the ModelCard class
Added the GetPredefinedModelCardByModelID method to the ModelCard class
Added the LoadFromModelID static method to the LM class
Added the ModelUri property to the LM class
Added the ImportTextAsync method to the RagEngine class
Added the ImportTextFromFileAsync method to the RagEngine class

📅 Version 2025.2.1 (4 February 2025)

Improvements

Improved inference of MiniCPM engine

New Features

Added model card for Mistral Small Instruct 2501
Created a new open-source assembly bridging Microsoft SemanticKernel and LM-Kit.NET. Git repository: LM-Kit.NET Semantic Kernel

📅 Version 2025.1.10 (30 January 2025)

Improvements

Added WebP and TIFF image support to Attachment
Minor performance improvements of the inference system

📅 Version 2025.1.9 (27 January 2025)

Improvements

Enhanced compatibility with Framework 4.6.2 apps
Enhanced backend management mechanism

📅 Version 2025.1.8 (26 January 2025)

Improvements

Enhanced compatibility with Framework 4.6.2 apps
Minor performance improvements for the Vulkan backend

📅 Version 2025.1.7 (24 January 2025)

Improvements

Improved speed and accuracy of vision models inference

Bug Fixes

Fixed a memory leak during image eval using MiniCPM models

📅 Version 2025.1.6 (22 January 2025)

Improvements

Improved speed and accuracy of vision models inference for content retrieval

New Features

Added support for the MiniCPM-O model
Added the ReplacementModel property to the ModelCard class

Bug Fixes

Resolved a crash when regenerating responses using MiniCPM models
Fixed a memory leak with VLM disposal

📅 Version 2025.1.4 (19 January 2025)

Bug Fixes

Resolved an inference issue in AOT mode
Resolved a rare race condition in the Vulkan backend during inference
Resolved a segmentation fault in the TextExtraction class

📅 Version 2025.1.3 (18 January 2025)

Improvements

Various performance improvements
Improved Runtime initialization

New Features

Added the SetContent method overload that accepts an Attachment to the TextExtraction class, allowing extraction of unstructured content from images

Bug Fixes

Resolved a crash during Qwen2-VL inference on macOS

📅 Version 2025.1.2 (11 January 2025)

Improvements

Enhanced GPU device memory management
Optimized dynamic sampling mechanisms
Various performance improvements

New Features

Initial support for multimodal capabilities with vision handling (VLM)
Added support for the LMK model format
Introduced the Prompt class
Introduced the Attachment class
Added overloads to the Submit and SubmitAsync methods in the MultiTurnConversation class to support prompts with image attachments
Added overloads to the Submit and SubmitAsync methods in the SingleTurnConversation class to support prompts with image attachments

📅 Version 2025.1.1 (1 January 2025)

Improvements

Optimized dynamic sampling mechanisms
Improved error handling

New Features

Added the HasVision property to the LM class
Added the EnableModelChecksumValidation property to the Configuration class

📅 Version 2024.12.13 (29 December 2024)

Improvements

Optimized dynamic sampling mechanisms
Improved performance with IBM Granite models

New Features

Added overloads to the GetPerformanceScore methods in the DeviceConfiguration class to obtain the model's performance score from a specified path

Bug Fixes

An unhandled exception was raised when a model could not be loaded

📅 Version 2024.12.12 (26 December 2024)

New Features

Added the ShortModelName property to the ModelCard class
Added the IsPredefined property to the ModelCard class

Improvements

The LLM class has been marked as obsolete. Use LM class as it replacement

Bug Fixes

GpuDeviceInfo.DeviceType was returning an invalid value

📅 Version 2024.12.11 (23 December 2024)

New Features

Added the Architecture property to the ModelCard class
Added the GetPerformanceScore method to the DeviceConfiguration class
Added support for Llama-3_1-Nemotron-51B and InfiniAI Megrez 3b models

📅 Version 2024.12.10 (22 December 2024)

New Features

Added the LocalPath property to the ModelCard class

Improvements

Improved performance with Phi 4 model

📅 Version 2024.12.9 (20 December 2024)

New Features

Added the ValidateFileChecksum method to the ModelCard class
The ModelCard class can now retrieve model metadata directly from Hugging Face servers without downloading the models

📅 Version 2024.12.8 (19 December 2024)

New Features

Added the GetDeviceFromNumber method to the GpuDeviceInfo class
Added the MaximumContextLength property to the TextExtraction class
Added overloads to GetOptimalContextSize methods in the DeviceConfiguration class to obtain the optimal context size from the available resources and model
Introduced the ModelCard class
Introduced the ModelCapabilities enumeration
Added the ModelStorageDirectory property to the Configuration class

Improvements

Improved summarization engine

📅 Version 2024.12.7 (15 December 2024)

New Features

Introduced the Summarizer.OverflowResolutionStrategy enumeration
Introduced the TextShrinkingStrategy enumeration
Added the OverflowStrategy property to the Summarizer class
Added the TextShrinkingStrategy property to the KeywordExtraction class
Introduced the DeviceConfiguration class

Improvements

Improved summarization of very large text

Bug Fixes

Prevent the MultiTurnConversation class from requesting a context size exceeding the model's capacity

📅 Version 2024.12.6 (13 December 2024)

New Features

Introduced the Summarizer class
Introduced the Summarizer.SummarizerResult class

Improvements

Enhanced MAUI compatibility for macOS development

📅 Version 2024.12.5 (11 December 2024)

New Features

Added the ContextRemainingSpace property to the MultiTurnConversation class
Added the ContextRemainingSpace property to the AfterTokenSamplingEventArgs class
Added the ContextSize property to the AfterTokenSamplingEventArgs class

📅 Version 2024.12.4 (10 December 2024)

Improvements

Changed type of ChatHistory.Message.PreviousContent to ChatHistory.Message
Improved CPU backend speed on Windows

📅 Version 2024.12.3 (07 December 2024)

Bug Fixes

The items in ChatHistory.Message.PreviousContent were not set in the correct order
The KeywordExtraction.KeywordCount setter was ignored

Improvements

Improved the sampling strategy in scenarios with a low context length

📅 Version 2024.12.2 (07 December 2024)

New Features

The Windows Vulkan runtime is now included in the NuGet package

Improvements

Optimized dynamic sampling mechanisms

📅 Version 2024.12.1 (06 December 2024)

New Features

Introduced the LLM.TemplateFormatFlags enumeration
Introduced the KeywordExtraction class
Introduced the KeywordExtraction.KeywordItem class
Added the ChatTemplateFormatFlags property to the LLM class
Added the IsArray property to the TextExtractionElement class

Improvements

Optimized dynamic sampling mechanisms
Improved support for large text bodies in the embedding engine

📅 Version 2024.11.10 (29 November 2024)

New Features

Introduced the DeviceType enumeration
Added the DeviceType property to the GpuDeviceInfo class

Improvements

Inference speed improvements with Vulkan backend

📅 Version 2024.11.9 (27 November 2024)

New Features

Added the UseAsyncModelAttributesLoading property to the Configuration class
Added the DeviceDescription property to the GpuDeviceInfo class
Added the TotalMemorySize property to the GpuDeviceInfo class
Added the FreeMemorySize property to the GpuDeviceInfo class

Improvements

Improved NuGet package compatibility with MAUI
Various inference speed improvements
Improved handling of model loading failures

📅 Version 2024.11.8 (25 November 2024)

New Features

Added the SetLogTextWriter method to the Runtime class

📅 Version 2024.11.7 (25 November 2024)

New Features

Added maccatalyst RID for MAUI

📅 Version 2024.11.6 (25 November 2024)

New Features

Vulkan backend in now enabled by default
Introduced the Runtime.BackendType enumeration
Added the Backend static property to the Runtime class
Introduced the GpuDeviceInfo class
Added the PreviousContent property to the ChatHistory.Message class

📅 Version 2024.11.5 (23 November 2024)

Improvements

Enhanced inference speed
Improved compatibility with MAUI

📅 Version 2024.11.4 (18 November 2024)

New Features

Added Vulkan backend for Linux
Added support for .NET 9

Improvements

Improved compatibility with macOS ARM and macOS x64
Enhanced CPU inference speed

📅 Version 2024.11.4 (12 November 2024)

New Features

Added the CreateGrammarFromStringList method to the Grammar class
Added the WhitelistedValues property to the TextExtractionElementFormat class

📅 Version 2024.11.2 (05 November 2024)

New Features

Added overloads to GetEmbeddings and GetEmbeddingsAsync methods in the Embedder class to support batch processing of multiple inputs

📅 Version 2024.11.1 (04 November 2024)

New Features

Added support for Granit models
Added the Perplexity property to the BeforeTokenSamplingEventArgs class
Added the Perplexity property to the AfterTokenSamplingEventArgs class
Added the ContinueLastAssistantResponse and ContinueLastAssistantResponseAsync methods to the MultiTurnConversation class

Improvements

Optimized dynamic sampling mechanisms

📅 Version 2024.10.5 (24 October 2024)

New Features

Added the GeneratedTokens property to the ChatHistory.Message class
Added the TerminationReason property to the ChatHistory.Message class
Introduced the TextExtractionElementFormat class
Introduced the TextExtractionElementFormat.TextCaseMode enumeration
Added the TextExtractionElementFormat property to the TextExtractionElement class

Improvements

Enhanced structured text extraction
Added new formatting options to text extraction elements
Improved function calling capabilities
Refined JSON data sampling
Optimized dynamic sampling mechanisms

📅 Version 2024.10.4 (17 October 2024)

Improvements

Structured text extraction improvements
Function calling improvements
Refined JSON data sampling
Optimized dynamic sampling mechanisms

📅 Version 2024.10.3 (16 October 2024)

New Features

Added the TextExtraction class
Added the TextExtractionResult class
Added the TextExtractionResultElement class
Added the ElementType enumeration
Added the DateTime parameter type support to function calling

Improvements

Optimized dynamic sampling mechanisms

📅 Version 2024.10.2 (09 October 2024)

New Features

Added the TextExtractionElement class
Added the CreateJsonGrammarFromExtractionElements(IEnumerable<TextExtractionElement> extractionElements) method overload to the Grammar class

Improvements

Optimized dynamic sampling mechanisms

📅 Version 2024.10.1 (01 October 2024)

New Features

Added the CreateTrainingObject method overload to the EmotionDetection class
Added the GetTrainingData method to the EmotionDetection class
Introduced the EmotionDetection.TrainingDataset enumeration
Introduced the JsonNumArray, JsonIntArray and JsonStringArray members to the LMKit.Data.Json.JsonBasicType enumeration
Introduced the TextTranslation.TrainingDataset enumeration
Added the GetTrainingData method to the TextTranslation class
Added the EnableDynamicSampling property to the Configuration class

Improvements

Improvements to generic classification accuracy and speed
Improved global performance with Llama 3.2 models
Improved global performance with Qwen 2.5 models
Fine-tuning: extended compatibility to all Llama architectures
Improved sampling of JSON array structures

📅 Version 2024.9.4 (25 September 2024)

New Features

Added support for array parameters in function calls
Added the ForceFunctionSelection property to the SingleFunctionCall class
Introduced the LMKit2024_09_INT member to the SentimentAnalysis.TrainingDataset enumeration

Improvements

Improved function calling accuracy and speed
Improved emotion detection engine accuracy and speed
Strong improvements to generic classification accuracy and speed
Improved accuracy of the language detection engine

Bug Fixes

Fixed an memory leak in the fine-tuning engine

📅 Version 2024.9.3 (18 September 2024)

Improvements

Improved function calling accuracy
Enhanced language detection accuracy
Boosted generic classification accuracy and speed
Increased inference speed on CPU

📅 Version 2024.9.2 (11 September 2024)

New Features

Added the Seed property to RandomSampling, MirostatSampling, and Mirostat2Sampling classes
Introduced the TrimAuto member to the InputLengthOverflowPolicy enumeration
Enabled deserialization of ChatHistory objects without specifying a Model parameter

Improvements

Further increased inference speed on CPU
Enhanced internal API for better error handling

📅 Version 2024.9.1 (6 September 2024)

Improvements

Optimized inference speed on CPU
Implemented checks to prevent access to disposed objects
Improved exception handling in quantization features

📅 Version 2024.9.0 (3 September 2024)

New Features

Added the GetToken method to the Vocabulary class
Introduced the Confidence property to the TextTranslation class
Added the MaximumInputTokens property to the SingleTurnConversation class

Improvements

Enhanced JSON content sampling strategy
Extended CUDA backend compatibility across more Windows configurations
Improved accuracy of the language detection engine
Enhanced compatibility with MAUI

📅 Version 2024.8.4 - Preview (26 August 2024)

New Features

Introduced the GlobalConfiguration static class

Improvements

Refined JSON content sampling strategy
Improved function calling accuracy and speed
Increased inference speed with Gemma2 models
Enhanced classification accuracy with tiny-small models

Bug Fixes

Fixed an issue when computing embeddings from a token array matching the context size

📅 Version 2024.8.3 - Preview (21 August 2024)

Improvements

Improved support for Phi 3.5 models

📅 Version 2024.8.2 - Preview (20 August 2024)

New Features

Added the CreateJsonGrammarFromJsonScheme method to the Grammar class

Improvements

Improved JSON content sampling strategy
Enhanced function calling accuracy and speed

📅 Version 2024.8.1 - Preview (15 August 2024)

New Features

Added Vulkan backend 🔥
Introduced the FunctionCallResult property to the FunctionCallEventArgs class
Added the EnableVulkan property to the Runtime class
Included KeepLast property in the AfterTokenSamplingEventArgs class

Improvements

Refined JSON content sampling strategy
Enhanced function calling accuracy and speed

📅 Version 2024.8.0 - Preview (8 August 2024)

New Features

Added the CreateJsonGrammarFromFields method to the Grammar class
Enabled function calling support with any model class 🔥
Introduced the SingleFunctionCall and FunctionCallResult classes
Added the LMFunctionAttribute attribute

Improvements

Enhanced grammar sampling strategy, especially for JSON schemes

📅 Version 2024.7.10 - Preview (6 August 2024)

New Features

Added support for Google Gemma 2 2B models 🔥
Introduced the CreateJsonGrammarFromTextFields method to the Grammar class

Improvements

Improved grammar sampling strategy, particularly with JSON schemes

📅 Version 2024.7.9 - Preview (31 July 2024)

New Features

Added the Version property to the Runtime class

Bug Fixes

Fixed an issue enabling neutral emotion support in the EmotionDetection class
Resolved issues when running fine-tuning with gradient checkpointing

📅 Version 2024.7.8 - Preview (30 July 2024)

New Features

Added the MaximumContextLength property to the SingleTurnConversation class
Added the GetTextAsync method to the Attachment class

Improvements

Enhanced inference engine handling of context overflow scenarios

📅 Version 2024.7.7 - Preview (29 July 2024)

Improvements

Enhanced compatibility with .NET Framework 4.6.2

Bug Fixes

Resolved a crash related to grammar sampling

📅 Version 2024.7.6 - Preview (27 July 2024)

New Features

Added support for LLaMA 3.1 models 🔥

Improvements

Enhanced logging mechanism

🎉 Version 2024.7.1 - Preview (23 July 2024)

🎉🥳 Initial release! 🥳🎉
😺 🐱 🐾

Table of Contents

📝 LM-Kit.NET Change History

📅 Version 2025.7.4 (17 July 2025)

Improvements

New Features

📅 Version 2025.7.3 (10 July 2025)

New Features

📅 Version 2025.7.2 (7 July 2025)

New Features

Improvements

Bug Fixes

📅 Version 2025.7.1 (1 July 2025)

New Features

📅 Version 2025.6.5 (27 June 2025)

New Features

Improvements

📅 Version 2025.6.4 (19 June 2025)

New Features

Improvements

📅 Version 2025.6.3 (17 June 2025)

New Features

Improvements

Bug Fixes

📅 Version 2025.6.2 (10 June 2025)

New Features

Improvements

Bug Fixes

📅 Version 2025.6.1 (2 June 2025)

New Features

Improvements

📅 Version 2025.5.5 (25 May 2025)

New Features

Improvements

📅 Version 2025.5.4 (20 May 2025)

Improvements

Bug Fixes

📅 Version 2025.5.3 (20 May 2025)

Improvements

📅 Version 2025.5.2 (13 May 2025)

Improvements

New Features

📅 Version 2025.5.1 (5 May 2025)

Improvements

New Features

📅 Version 2025.4.13 (30 April 2025)

Improvements

New Features

📅 Version 2025.4.12 (27 April 2025)

Bug Fixes

New Features

📅 Version 2025.4.11 (23 April 2025)

Bug Fixes

New Features

Breaking changes

📅 Version 2025.4.10 (22 April 2025)

Improvements

New Features

📅 Version 2025.4.9 (15 April 2025)

Improvements

New Features

📅 Version 2025.4.7 (9 April 2025)

New Features

📅 Version 2025.4.6 (9 April 2025)

New Features

📅 Version 2025.4.1 (2 April 2025)

Improvements

New Features

📅 Version 2025.3.6 (24 March 2025)

Improvements

New Features

📅 Version 2025.3.5 (17 March 2025)

New Features

Improvements

📅 Version 2025.3.4 (12 March 2025)

New Features

📅 Version 2025.3.3 (11 March 2025)

New Features

📅 Version 2025.3.2 (3 March 2025)

Improvements

New Features