📝 LM-Kit.NET Change History
📅 Version 2025.11.1 (05 November 2025)
Improvements
- General performance enhancements
- Improved the summarizer engine
New Features
- Added model cards for
qwen3-vl:2b,qwen3-vl:4bandqwen3-vl:8b
📅 Version 2025.10.5 (24 October 2025)
Improvements
- General performance enhancements
New Features
- Introduced the
McpResourceclass - Introduced the
McpResourceContentclass - Introduced the
McpPromptclass - Introduced the
McpPromptArgumentclass - Introduced the
McpPromptResultclass - Introduced the
McpMessageRoleenumeration - Introduced the
McpPromptMessageclass - Introduced the
McpPromptContentclass - Introduced the
McpEmbeddedResourceclass - Introduced the
McpClientCapabilitiesandMcpServerCapabilitiesenumerations - Introduced the
McpCatalogKindenumeration - Added the
ServerCapabilities,Prompts,Resources&McpProtocolVersionproperties to theMcpClientclass - Added the
HasCapabilitymethod to theMcpClientclass - Added the
ToolsChanged,ResourcesChanged&PromptsChangedevents to theMcpClientclass - Added the
GetResources,GetResourcesAsync,ReadResource&ReadResourceAsyncmethods to theMcpClientclass - Added the
GetPrompts,GetPromptsAsync,RefreshPrompts&RefreshPromptsAsyncmethods to theMcpClientclass - Introduced the
McpResponseReceivedEventArgsclass - Introduced the
McpRequestSendingEventArgsclass - Introduced the
McpAuthFailedEventArgsclass - Introduced the
McpCatalogChangedEventArgsclass - Added the
ToolsChanged,ResourcesChanged&PromptsChangedevents to theMcpClientclass
📅 Version 2025.10.4 (16 October 2025)
Improvements
- General performance enhancements
- Improved Tool calling support
- Improved Mistral and GptOss models support
New Features
- Introduced the
LMFunctionToolBinderclass - Introduced the
Agent.Tools.LMFunctionAttributeclass - Added model card for
magistral-small1.2 - Introduced the
ReasoningLevelenumeration - Added the
ReasoningLevelproperty to theMultiTurnConversationclass - Introduced the
ChatHistory.Message.MessageSegmentclass - Added the
Segmentsproperty to theChatHistory.Messageclass - Introduced the
EntityAnnotationclass - Introduced the
ExtractionTrainingDatasetclass - Introduced the
CategorizationTrainingDatasetclass - Introduced the
EmotionDetectionTrainingDatasetclass - Introduced the
SarcasmDetectionTrainingDatasetclass - Introduced the
SentimentAnalysisTrainingDatasetclass - Introduced the
LanguageDetectionTrainingDatasetclass - Introduced the
PiiExtractionTrainingDatasetclass
📅 Version 2025.10.3 (09 October 2025)
Improvements
- Improved Tool calling support
- Improved MCP client support
New Features
- Introduced the
TextSegmentTypeenumeration - Added the
SegmentTypeproperty to theAfterTextCompletionEventArgsclass
📅 Version 2025.10.2 (06 October 2025)
Improvements
- General performance enhancements
- Improved Tool calling support
- Optimized dynamic sampling mechanisms
New Features
- Added model cards for
granite4-h:3bandgranite4-h:7b
📅 Version 2025.9.3 (29 September 2025)
Improvements
- General performance enhancements
- Improved internal paragraph-detection engine, yielding better RAG chunking and higher extraction accuracy
- Improved PII extraction and NER engines accuracy
New Features
- Introduced first version of a CUDA 13 Backend
- Introduced the
TextRegionclass - Introduced the
TextOutputModeenumeration - Added the
GetTextmethod to thePageElementclass - Introduced the
IToolclass - Introduced the
ToolRegistryclass - Introduced the
ToolCallPolicyclass - Introduced the
ToolChoiceenumeration - Introduced the
ToolCallResultTypeenumeration - Introduced the
ToolCallclass - Introduced the
ToolCallResultclass - Introduced the
MCPToolclass - Introduced the
MCPClientclass - Introduced the
ShareGptExporterclass - Introduced the
LMKit.Finetuning.Export.RoleMappingPolicyenumeration - Introduced the
LMKit.Finetuning.Export.ExportResultclass - Introduced the
LMKit.Finetuning.Export.ExportProgressclass - Added the
HasTextGenerationproperty to theLMclass - Added the
HasToolCallsproperty to theLMclass - Added the
Toolsproperty to theMultiTurnConversationclass - Added the
ToolPolicyproperty to theMultiTurnConversationclass - Introduced the
AfterToolInvocationEventArgsclass - Introduced the
BeforeToolInvocationEventArgsclass - Added the
BeforeToolInvocationevent to theMultiTurnConversationclass - Added the
AfterToolInvocationevent to theMultiTurnConversationclass - Added the
Occurrencesproperty to thePiiExtraction.PiiExtractedEntityclass - Added the
Occurrencesproperty to theNamedEntityRecognition.ExtractedEntityclass - Added the
PageElements&PageCountproperties to theAttachmentclass - Added the
GetPageWidth&GetPageHeightmethods to theAttachmentclass
📅 Version 2025.9.2 (10 September 2025)
Improvements
- Improved overall extraction accuracy
New Features
- Introduced the
LayoutSearchEngineclass - Added support for Gemma embedding models
- Added model card for
embeddinggemma-300m
📅 Version 2025.9.1 (09 September 2025)
Improvements
- Improved overall extraction accuracy
New Features
- Added the
Boundsproperty to theTextExtractionResultElementclass - Added the
GetValue,TryGetValue,Get,TryGetandGetAllmethods to theTextExtractionResultclass - Introduced the
Rectangleclass - Introduced the
Quadrilateralclass - Introduced the
IBounds,IPoint,ISizeandIVector2Dinterfaces - Introduced the
ILayoutElementinterface - Introduced the
ParagraphElementclass - Introduced the
LineElementclass - Added
DetectLinesandDetectParagraphsmethods to thePageElementclass - Added the
Boundsproperty to thePageElementclass - Added
TryDetectBorderBackgroundColor,Deskew,CropAuto,IsBlank,ConvertGRAY8,ConvertRGB24andConvertRGBA32methods to theImageBufferclass - Added
DOCX,XLSXandPPTXformats support to theattachmentclass - Added the
Promptproperty to theSpeechToTextclass - Added model card for
minicpm-v-45
📅 Version 2025.8.4 (22 August 2025)
Improvements
- General performance enhancements
📅 Version 2025.8.3 (19 August 2025)
Improvements
- Improved Categorization engine speed & accuracy
- General performance enhancements
New Features
- Introduced the
ChatTrainingSampleclass - Introduced the
TrainingDatasetclass - Added model card for
gemma3:270m - Added the
SchemaDiscoveryandSchemaDiscoveryAsyncmethods to theTextExtractionclass - Added the
HasImageSegmentationproperty to theLMclass - Introduced the
ImageSegmentationmember to theModelCapabilitiesenumeration - Introduced the
BackgroundDetectionclass - Added model card for
u2net
📅 Version 2025.8.2 (08 August 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Enhanced runtime loading strategy
- General performance enhancements
- Improved the summarizer and keyword extraction engines introducing optional target language detection
New Features
- Introduced the
Configuration.FavorDistributedInferenceproperty - Added
TargetLanguageproperty to theSummarizerclass - Added
TargetLanguageproperty to theKeywordExtractionclass - Added
RemoveLoraAdaptermethod to theLMclass - Added support for OpenAI gpt-oss models
- Added model card for
gptoss:20b
📅 Version 2025.8.2 (02 August 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Enhanced runtime loading strategy
- General performance enhancements
📅 Version 2025.7.5 (25 July 2025)
Improvements
- Optimized dynamic sampling mechanisms
New Features
- Introduced the
TesseractOcrEngineclass
📅 Version 2025.7.4 (17 July 2025)
Improvements
- Better multi-threading support for multimodal inference
New Features
- Dynamic LoRA Adapter Loading support
- Introduced the
LoraAdapterclass - Introduced the
Adaptersproperty to theLMclass
📅 Version 2025.7.3 (10 July 2025)
New Features
- Added support for SmolLM 3 models family
- Added model card for
smollm3:3b - Introduced the
ExtractionProfilerclass
📅 Version 2025.7.2 (7 July 2025)
New Features
- Added support for linux-arm64 (cpu, Vulkan and Cuda)
Improvements
- Optimized dynamic sampling mechanisms
Bug Fixes
- Fixed a crash that occurred when aborting speech-to-text tasks
📅 Version 2025.7.1 (1 July 2025)
New Features
- Added
LoadAsRGBmethod overload to theImageBufferclass taking file path as parameter - Added
DetectLanguageandDetectLanguageAsyncmethod overload to theTextTranslationclass taking anImageBufferobject as parameter - Introduced the
VoiceActivityDetectionmember to theModelCapabilitiesenumeration - Introduced the
SpeechToText.SpeechToTextModeenumeration - Introduced the
VadSettingsclass - Added
Modeproperty to theSpeechToTextclass - Added
EnableVoiceActivityDetectionproperty to theSpeechToTextclass - Introduced the
VadSettingsproperty to theSpeechToTextclass - Introduced the
SpeechToText.LanguageDetectionResultclass - Introduced the
IsValidstatic method to theWaveFileclass
📅 Version 2025.6.5 (27 June 2025)
New Features
- Added model cards for
mistral-small3.2,magistral-small,whisper-tiny,whisper-base,magistral-small,magistral-medium,whisper-large3andwhisper-large-turbo3 - Added the
HasSpeechToTextproperty to theLMclass - Introduced the
SpeechToTextmember to theModelCapabilitiesenumeration - Introduced the
WaveFileclass - Introduced the
SpeechToTextclass - Introduced the
SpeechToText.OnNewSegmentEventArgsclass - Introduced the
SpeechToText.OnProgressEventArgsclass - Introduced the
SpeechToText.TranscriptionResultclass - Introduced the
AudioSegmentclass - Introduced the
CorruptedAudioExceptionclass - Introduced the
NotSupportedAudioExceptionclass - Introduced the
ImageBufferclass - Introduced the
ImagePixelFormatenumeration - Added new constructor to the
Attachmentclass to support ImageBuffer input
Improvements
- Improved overall extraction accuracy
📅 Version 2025.6.4 (19 June 2025)
New Features
- Added the
PageElementproperty to theOcrResultclass - Added the
Guidanceproperty to thePiiExtractionclass - Added the
Guidanceproperty to theNamedEntityRecognitionclass
Improvements
- Minor extraction improvements
📅 Version 2025.6.3 (17 June 2025)
New Features
- Added the
InnerElementsproperty to theTextExtractionResultElementclass - Introduced the
PageElementclass - Introduced the
TextElementclass - Added
SetTextmethod overload to theAttachmentclass taking PageElement as parameter
Improvements
- Improved overall extraction accuracy
Bug Fixes
- Fixed txt file loading issue from the Attachment class
📅 Version 2025.6.2 (10 June 2025)
New Features
- Introduced the
NamedEntityRecognitionclass - Introduced the
NamedEntityRecognition.EntityDefinitionclass - Introduced the
NamedEntityRecognition.ExtractedEntityclass - Introduced the
NamedEntityRecognition.NamedEntityTypeenumeration - Introduced the
PiiExtractionclass - Introduced the
PiiExtraction.PiiExtractionclass - Introduced the
PiiExtraction.PiiExtractedEntityclass - Introduced the
PiiExtraction.PiiEntityTypeenumeration - Added the
Timeoutproperty to theTextractOcrclass - Introduced the
OcrStartingEventArgsevent - Introduced the
OcrCompletedEventArgsevent - Added the
Idproperty to theAttachmentclass - Added the
ClearContentmethod to theTextExtractionclass - Added the
InferencePoliciesproperty to theSingleTurnConversationclass
Improvements
- Optimized dynamic sampling mechanisms
Bug Fixes
- Fixed the Linux runtime, which was not correctly updated in the previous version
📅 Version 2025.6.1 (2 June 2025)
New Features
- Added support for multimodal classification and extraction
- Added the
TitleandDescriptionproperties to theTextExtractionclass - Added the
OcrEngineproperty to theTextExtractionclass - Added the
MaxInputTokensproperty to theCategorizationclass - Added the
HasTextproperty to theAttachmentclass - Added the
CreateFromTextandDetectMimeTypestatic methods to theAttachmentclass - Added the
SetTextmethods to theAttachmentclass - Added the
Pathproperty to theAttachmentclass - Introduced the
InferenceModalityenumeration - Added the
PreferredInferenceModalityproperty to theCategorizationclass - Added the
PreferredInferenceModalityproperty to theTextExtractionclass - Introduced the
OcrEngineabstract class - Introduced the
OcrResultclass - Introduced the
OcrParametersclass - Introduced the
TextractOcrclass
Improvements
- Improved overall extraction accuracy
- Improved Categorization engine speed & accuracy
- Optimized dynamic sampling mechanisms
📅 Version 2025.5.5 (25 May 2025)
New Features
- Added model card for
qwen2.5-vl:32b - Added the
TrimStartproperty to theTextExtractionElementFormatclass - Added support for more CUDA devices
Improvements
- Reduced memory usage when handling large context
- Improved overall extraction accuracy
📅 Version 2025.5.4 (20 May 2025)
Improvements
- various speed and memory usage improvements
Bug Fixes
- Fixed a model-loading issue that occurred under rare conditions
📅 Version 2025.5.3 (20 May 2025)
Improvements
- Improved overall extraction accuracy
- Introduced the
TextExtractionElementFormat.PredefinedStringFormatenumeration - Added the
FormatHintproperty to theTextExtractionElementFormatclass - Added the
IsRequiredproperty to theTextExtractionElementFormatclass - Added the
EnableBackendLoggingproperty to theRuntimeclass
📅 Version 2025.5.2 (13 May 2025)
Improvements
- Improved inference on various model families
- Reduced memory footprint of the built-in vector database engine
New Features
- Added model cards for
bge-m3andbge-m3-reranker - Introduced the
Rerankerclass - Introduced the
RagEngine.RagRerankerclass - Added the
Rerankerproperty to theRagEngineclass - Added new
RagEngineconstructor for multimodal RAG. - Added
FindMatchingPartitionsandFindMatchingPartitionsAsyncmethod overloads to theRagEngineclass taking attachment as parameter - Added the
DefaultImagePayloadPixproperty to theRagEngineclass - Added
RerankedScoreandRawSimilarityproperties to thePartitionSimilarityclass - Added the
PayloadEncodingproperty to thePartitionSimilarityclass - Added the
NullOnDoubtproperty to theTextExtractionclass - Introduced the
Partitionclass - Introduced the
ImagePartitionclass - Added the
Versionproperty to theDataSourceclass
📅 Version 2025.5.1 (5 May 2025)
Improvements
- Improved inference of vision models
- Improved inference of LLama 4 models
New Features
- Added support for Qwen 2.5 VL and Pixtral model families
- Added model cards for
pixtral,qwen2.5-vl:3bandqwen2.5-vl:7b
📅 Version 2025.4.13 (30 April 2025)
Improvements
- Improved translation engine precision with small models
New Features
- Added model cards for
qwen3:0.6b,qwen3:1.7b,qwen3:4b,qwen3:8bandqwen3:14b - Added the
DisableReasoningproperty to theSingleTurnConversationclass - Added the
DisableReasoningproperty to theMultiTurnConversationclass - Added the
HasReasoningSupportmember to theTemplateFormatFlagsenumeration - Added the
HasReasoningproperty to theLMclass
📅 Version 2025.4.12 (27 April 2025)
Bug Fixes
- Fixed a multi-byte decoding issue in produced model tokens
New Features
- Added
UpsertandUpsertAsyncmethod overloads to theDataSourceclass taking raw vector as parameter
📅 Version 2025.4.11 (23 April 2025)
Bug Fixes
- Fixed ONNX runtime loading issue on Linux
New Features
- Added the
DataSource.VectorEntryclass - Added the
UpsertandUpsertAsyncmethod to theDataSourceclass - Introduced the
VectorSearchstatic class
Breaking changes
- The
TextPartitionSimilarityclass has been renamed toPartitionSimilarity
📅 Version 2025.4.10 (22 April 2025)
Improvements
- Optimized dynamic sampling mechanisms
New Features
- Added support for image embeddings
- Added support for running inference on ONNX models
- Added the
GetEmbeddingsandGetEmbeddingsAsyncoverloads to theEmbedderclass to enable embeddings computation from image - Added the
HasImageEmbeddingsproperty to theLMclass - Introduced the
Summarizer.SummarizationIntentenumeration - Added the
Summarizer.Intentproperty to theSummarizerclass - Added model cards for
nomic-embed-vision,granite3.3:2bandgranite3.3:8b
📅 Version 2025.4.9 (15 April 2025)
Improvements
- Improved translation engine
- Improved structured text extraction accuracy
- Improved compatibility with JSON schemas for structured text extraction
- Improved internal scheduler engine
- Global speed improvement, up to 20%
New Features
- Added
GetTopCategories,GetTopCategoriesAsync,GetBestCategoryandGetBestCategoryAsyncoverloads to the Categorization class to specify category descriptions
📅 Version 2025.4.7 (9 April 2025)
New Features
- Added the
HasSectionmethod to theDataSourceclass
📅 Version 2025.4.6 (9 April 2025)
New Features
- Introduced a new built-in vector database engine that permits handling small- to medium-sized vector collections on the file system without any dependency
- Introduced the
StorageModeenumeration - Added the
DataSource.StorageModeproperty - Added the
LoadFromFileandLoadFromFileAsyncmethods to theDataSourceclass - Added the
OptimizeDataSourceandOptimizeDataSourceAsyncmethods to theDataSourceclass - Added the
CreateFileDataSource,CreateInMemoryDataSource,CreateVectorStoreDataSourceandCreateVectorStoreDataSourceAsyncmethods to theDataSourceclass - Added the
Importmethod to theRagEngineclass - Added the
ImportAsyncmethod to theRagEngineclass - Added the
DefaultTextChunkingproperty to theRagEngineclass
📅 Version 2025.4.1 (2 April 2025)
Improvements
- Enhanced RagEngine accuracy and speed
- The RAG engine now supports multiple DataSources simultaneously, each backed by its own storage mode (e.g., one DataSource using in-memory storage and another using a vector database)
New Features
- Introduced different DataSource storage modes, enabling storage of DataSource content directly in vector databases
- Introduced a new collection of open-source assemblies to bridge external vector databases with LM-Kit.NET. Currently, Qdrant is supported. Git repository: LM-Kit.NET Data Connectors
- Introduced the
IVectorStoreinterface - Introduced the
PointEntryclass - Added the
CollectionChangedevent to theMetadataCollectionclass - Added the
PropertyChangedevent to theMetadataCollectionclass - Added the
AddOrReplacemethod to theMetadataCollectionclass - Added the
ImportText,ImportTextAsync,ImportTextFromFile,ImportTextFromFileAsyncoverloads to theRagEngineclass to specify metadata collection - Added the
GetSectionByIdentifierAsyncmethod to theDataSourceclass - Added the
RemoveSectionAsyncmethod to theDataSourceclass - Added the
LoadFromStoreandLoadFromStoreAsyncmethods to theDataSourceclass - Added the
Metadataproperty to theRagEngine.TextPartitionSimilarityclass
📅 Version 2025.3.6 (24 March 2025)
Improvements
- Strong performance improvements for the Vulkan backend
- Enhanced extraction accuracy in the structured text extraction engine
- Reduce VRAM/RAM usage
New Features
- Added support for Mistral Small 3.1 model and added mistral-small-3.1 model card
- Added the
GetTextmethod to theAttachmentclass - Added HTML format support to the
Attachmentclass
📅 Version 2025.3.5 (17 March 2025)
New Features
- Added the
HandleUnknownCategoryproperty to theCategorizationclass - Added support for Gemma 3 vision
Improvements
- Improved inference with Gemma 3 models family
- Improved internal context caching strategy
- Inference speed improvements
📅 Version 2025.3.4 (12 March 2025)
New Features
- Added support for Gemma 3 models family
- Added model cards for Qwen QwQ 32B, Gemma 3 1B, Gemma 3 4B, Gemma 3 12B and Gemma 3 27B
- Added the
Guidanceproperty to theCategorizationclass
📅 Version 2025.3.3 (11 March 2025)
New Features
- Added the
SetElementsFromJsonSchememethod to theTextExtractionclass - Added the
IsVisionRequiredproperty to theAttachmentclass - Added the
GetTopCategoriesandGetTopCategoriesAsyncproperty to theCategorizationclass - Added the
GetBestCategoryAsyncandGetTopCategoriesAsyncoverloads to theCategorizationclass to enable classification from image attachments
📅 Version 2025.3.2 (3 March 2025)
Improvements
- Improved inference with phi-3 and phi-4 models
- Reduced CUDA backend size for Windows by a factor of 3
New Features
- Introduced the
MemoryTypeenumeration - Added the
SaveInformationoverloads to theAgentMemoryclass to specify the type of memory to record - Added the
SaveInformationAsyncoverloads to theAgentMemoryclass to specify the type of memory to record - Added the
GetMemoryTypestatic method to theAgentMemoryclass - Added the
GetDataSourcemethod to theRagEngineclass - Added the
TryGetDataSourcemethod to theRagEngineclass - Added the
Getmethod to theMetadataCollectionclass - Added the
TryGetmethod to theMetadataCollectionclass - Added the
MemoryTypeproperty to theMemoryRecallEventArgsclass
📅 Version 2025.3.1 (1 March 2025)
Improvements
- Improvements to generic classification accuracy and speed
New Features
- Added the
RemoveDataSourceoverloads to theRagEngineclass to remove a DataSource from an identifier - Added the
RemoveSectionmethod to theDataSourceclass - Added support for Phi-4 Mini models
- Added model cards for Phi-4 Mini and Deepseek R1
📅 Version 2025.2.4 (26 February 2025)
Improvements
- Minor performance improvements of the inference system
- Improved inference with various model families such as Granite, Mistral, Monarch, Orion, MiniCPM, and GigaChat
- Improved language detection accuracy
New Features
- Added new constructor to the
Attachmentclass to support stream input - Added the
ExtractKeywordsandExtractKeywordsAsyncoverloads to theKeywordExtractionclass to extract keywords from image attachments - Added the
SummarizeandSummarizeAsyncoverloads to theSummarizerclass to produce summary from image attachments - Added the
DetectLanguageandDetectLanguageAsyncoverloads to theTextTranslationclass to detect language from image attachments - Introduced the
DataFilterclass - Added the
RemoveDataSourceoverloads to theAgentMemoryclass to remove a DataSource from an identifier - Added the
AddDataSourcemethod to theAgentMemoryclass - Added the
AddDataSourcesmethod to theAgentMemoryclass - Added the
Filterproperty to theRagEngineclass - Added the
Filterproperty to theAgentMemoryclass - Added support for DeepSeek-R1 models
📅 Version 2025.2.3 (19 February 2025)
Improvements
- Improved RAG engine accuracy and speed
- Optimized dynamic sampling mechanisms
New Features
- Introduced the
AgentMemoryclass - Introduced the
MemoryRecallEventArgsclass - Introduced the
TopNSigmaSamplingclass - Added the
Memoryproperty to theMultiTurnConversationclass - Added the
MaximumRecallTokensproperty to theMultiTurnConversationclass - Added the
MemoryRecallevent to theMultiTurnConversationclass - Added the
DownloadAsyncmethod to theModelCardclass - Added the
LogCallbackevent to theRuntimeclass
📅 Version 2025.2.2 (12 February 2025)
Improvements
- Improved internal context caching strategy
New Features
- Added the
ModelIDproperty to theModelCardclass - Added the
GetPredefinedModelCardByModelIDmethod to theModelCardclass - Added the
LoadFromModelIDstatic method to theLMclass - Added the
ModelUriproperty to theLMclass - Added the
ImportTextAsyncmethod to theRagEngineclass - Added the
ImportTextFromFileAsyncmethod to theRagEngineclass
📅 Version 2025.2.1 (4 February 2025)
Improvements
- Improved inference of MiniCPM engine
New Features
- Added model card for Mistral Small Instruct 2501
- Created a new open-source assembly bridging Microsoft SemanticKernel and LM-Kit.NET. Git repository: LM-Kit.NET Semantic Kernel
📅 Version 2025.1.10 (30 January 2025)
Improvements
- Added WebP and TIFF image support to Attachment
- Minor performance improvements of the inference system
📅 Version 2025.1.9 (27 January 2025)
Improvements
- Enhanced compatibility with Framework 4.6.2 apps
- Enhanced backend management mechanism
📅 Version 2025.1.8 (26 January 2025)
Improvements
- Enhanced compatibility with Framework 4.6.2 apps
- Minor performance improvements for the Vulkan backend
📅 Version 2025.1.7 (24 January 2025)
Improvements
- Improved speed and accuracy of vision models inference
Bug Fixes
- Fixed a memory leak during image eval using MiniCPM models
📅 Version 2025.1.6 (22 January 2025)
Improvements
- Improved speed and accuracy of vision models inference for content retrieval
New Features
- Added support for the MiniCPM-O model
- Added the
ReplacementModelproperty to theModelCardclass
Bug Fixes
- Resolved a crash when regenerating responses using MiniCPM models
- Fixed a memory leak with VLM disposal
📅 Version 2025.1.4 (19 January 2025)
Bug Fixes
- Resolved an inference issue in AOT mode
- Resolved a rare race condition in the Vulkan backend during inference
- Resolved a segmentation fault in the TextExtraction class
📅 Version 2025.1.3 (18 January 2025)
Improvements
- Various performance improvements
- Improved Runtime initialization
New Features
- Added the
SetContentmethod overload that accepts anAttachmentto theTextExtractionclass, allowing extraction of unstructured content from images
Bug Fixes
- Resolved a crash during Qwen2-VL inference on macOS
📅 Version 2025.1.2 (11 January 2025)
Improvements
- Enhanced GPU device memory management
- Optimized dynamic sampling mechanisms
- Various performance improvements
New Features
- Initial support for multimodal capabilities with vision handling (VLM)
- Added support for the LMK model format
- Introduced the
Promptclass - Introduced the
Attachmentclass - Added overloads to the
SubmitandSubmitAsyncmethods in theMultiTurnConversationclass to support prompts with image attachments - Added overloads to the
SubmitandSubmitAsyncmethods in theSingleTurnConversationclass to support prompts with image attachments
📅 Version 2025.1.1 (1 January 2025)
Improvements
- Optimized dynamic sampling mechanisms
- Improved error handling
New Features
- Added the
HasVisionproperty to theLMclass - Added the
EnableModelChecksumValidationproperty to theConfigurationclass
📅 Version 2024.12.13 (29 December 2024)
Improvements
- Optimized dynamic sampling mechanisms
- Improved performance with IBM Granite models
New Features
- Added overloads to the
GetPerformanceScoremethods in theDeviceConfigurationclass to obtain the model's performance score from a specified path
Bug Fixes
- An unhandled exception was raised when a model could not be loaded
📅 Version 2024.12.12 (26 December 2024)
New Features
- Added the
ShortModelNameproperty to theModelCardclass - Added the
IsPredefinedproperty to theModelCardclass
Improvements
- The
LLMclass has been marked as obsolete. UseLMclass as it replacement
Bug Fixes
GpuDeviceInfo.DeviceTypewas returning an invalid value
📅 Version 2024.12.11 (23 December 2024)
New Features
- Added the
Architectureproperty to theModelCardclass - Added the
GetPerformanceScoremethod to theDeviceConfigurationclass - Added support for Llama-3_1-Nemotron-51B and InfiniAI Megrez 3b models
📅 Version 2024.12.10 (22 December 2024)
New Features
- Added the
LocalPathproperty to theModelCardclass
Improvements
- Improved performance with Phi 4 model
📅 Version 2024.12.9 (20 December 2024)
New Features
- Added the
ValidateFileChecksummethod to theModelCardclass - The ModelCard class can now retrieve model metadata directly from Hugging Face servers without downloading the models
📅 Version 2024.12.8 (19 December 2024)
New Features
- Added the
GetDeviceFromNumbermethod to theGpuDeviceInfoclass - Added the
MaximumContextLengthproperty to theTextExtractionclass - Added overloads to
GetOptimalContextSizemethods in theDeviceConfigurationclass to obtain the optimal context size from the available resources and model - Introduced the
ModelCardclass - Introduced the
ModelCapabilitiesenumeration - Added the
ModelStorageDirectoryproperty to theConfigurationclass
Improvements
- Improved summarization engine
📅 Version 2024.12.7 (15 December 2024)
New Features
- Introduced the
Summarizer.OverflowResolutionStrategyenumeration - Introduced the
TextShrinkingStrategyenumeration - Added the
OverflowStrategyproperty to theSummarizerclass - Added the
TextShrinkingStrategyproperty to theKeywordExtractionclass - Introduced the
DeviceConfigurationclass
Improvements
- Improved summarization of very large text
Bug Fixes
- Prevent the
MultiTurnConversationclass from requesting a context size exceeding the model's capacity
📅 Version 2024.12.6 (13 December 2024)
New Features
- Introduced the
Summarizerclass - Introduced the
Summarizer.SummarizerResultclass
Improvements
- Enhanced MAUI compatibility for macOS development
📅 Version 2024.12.5 (11 December 2024)
New Features
- Added the
ContextRemainingSpaceproperty to theMultiTurnConversationclass - Added the
ContextRemainingSpaceproperty to theAfterTokenSamplingEventArgsclass - Added the
ContextSizeproperty to theAfterTokenSamplingEventArgsclass
📅 Version 2024.12.4 (10 December 2024)
Improvements
- Changed type of ChatHistory.Message.PreviousContent to ChatHistory.Message
- Improved CPU backend speed on Windows
📅 Version 2024.12.3 (07 December 2024)
Bug Fixes
- The items in ChatHistory.Message.PreviousContent were not set in the correct order
- The KeywordExtraction.KeywordCount setter was ignored
Improvements
- Improved the sampling strategy in scenarios with a low context length
📅 Version 2024.12.2 (07 December 2024)
New Features
- The Windows Vulkan runtime is now included in the NuGet package
Improvements
- Optimized dynamic sampling mechanisms
📅 Version 2024.12.1 (06 December 2024)
New Features
- Introduced the
LLM.TemplateFormatFlagsenumeration - Introduced the
KeywordExtractionclass - Introduced the
KeywordExtraction.KeywordItemclass - Added the
ChatTemplateFormatFlagsproperty to theLLMclass - Added the
IsArrayproperty to theTextExtractionElementclass
Improvements
- Optimized dynamic sampling mechanisms
- Improved support for large text bodies in the embedding engine
📅 Version 2024.11.10 (29 November 2024)
New Features
- Introduced the
DeviceTypeenumeration - Added the
DeviceTypeproperty to theGpuDeviceInfoclass
Improvements
- Inference speed improvements with Vulkan backend
📅 Version 2024.11.9 (27 November 2024)
New Features
- Added the
UseAsyncModelAttributesLoadingproperty to theConfigurationclass - Added the
DeviceDescriptionproperty to theGpuDeviceInfoclass - Added the
TotalMemorySizeproperty to theGpuDeviceInfoclass - Added the
FreeMemorySizeproperty to theGpuDeviceInfoclass
Improvements
- Improved NuGet package compatibility with MAUI
- Various inference speed improvements
- Improved handling of model loading failures
📅 Version 2024.11.8 (25 November 2024)
New Features
- Added the
SetLogTextWritermethod to theRuntimeclass
📅 Version 2024.11.7 (25 November 2024)
New Features
- Added maccatalyst RID for MAUI
📅 Version 2024.11.6 (25 November 2024)
New Features
- Vulkan backend in now enabled by default
- Introduced the
Runtime.BackendTypeenumeration - Added the
Backendstatic property to theRuntimeclass - Introduced the
GpuDeviceInfoclass - Added the
PreviousContentproperty to theChatHistory.Messageclass
📅 Version 2024.11.5 (23 November 2024)
Improvements
- Enhanced inference speed
- Improved compatibility with MAUI
📅 Version 2024.11.4 (18 November 2024)
New Features
- Added Vulkan backend for Linux
- Added support for .NET 9
Improvements
- Improved compatibility with macOS ARM and macOS x64
- Enhanced CPU inference speed
📅 Version 2024.11.4 (12 November 2024)
New Features
- Added the
CreateGrammarFromStringListmethod to theGrammarclass - Added the
WhitelistedValuesproperty to theTextExtractionElementFormatclass
📅 Version 2024.11.2 (05 November 2024)
New Features
- Added overloads to
GetEmbeddingsandGetEmbeddingsAsyncmethods in theEmbedderclass to support batch processing of multiple inputs
📅 Version 2024.11.1 (04 November 2024)
New Features
- Added support for Granit models
- Added the
Perplexityproperty to theBeforeTokenSamplingEventArgsclass - Added the
Perplexityproperty to theAfterTokenSamplingEventArgsclass - Added the
ContinueLastAssistantResponseandContinueLastAssistantResponseAsyncmethods to theMultiTurnConversationclass
Improvements
- Optimized dynamic sampling mechanisms
📅 Version 2024.10.5 (24 October 2024)
New Features
- Added the
GeneratedTokensproperty to theChatHistory.Messageclass - Added the
TerminationReasonproperty to theChatHistory.Messageclass - Introduced the
TextExtractionElementFormatclass - Introduced the
TextExtractionElementFormat.TextCaseModeenumeration - Added the
TextExtractionElementFormatproperty to theTextExtractionElementclass
Improvements
- Enhanced structured text extraction
- Added new formatting options to text extraction elements
- Improved function calling capabilities
- Refined JSON data sampling
- Optimized dynamic sampling mechanisms
📅 Version 2024.10.4 (17 October 2024)
Improvements
- Structured text extraction improvements
- Function calling improvements
- Refined JSON data sampling
- Optimized dynamic sampling mechanisms
📅 Version 2024.10.3 (16 October 2024)
New Features
- Added the
TextExtractionclass - Added the
TextExtractionResultclass - Added the
TextExtractionResultElementclass - Added the
ElementTypeenumeration - Added the
DateTimeparameter type support to function calling
Improvements
- Optimized dynamic sampling mechanisms
📅 Version 2024.10.2 (09 October 2024)
New Features
- Added the
TextExtractionElementclass - Added the
CreateJsonGrammarFromExtractionElements(IEnumerable<TextExtractionElement> extractionElements)method overload to theGrammarclass
Improvements
- Optimized dynamic sampling mechanisms
📅 Version 2024.10.1 (01 October 2024)
New Features
- Added the
CreateTrainingObjectmethod overload to theEmotionDetectionclass - Added the
GetTrainingDatamethod to theEmotionDetectionclass - Introduced the
EmotionDetection.TrainingDatasetenumeration - Introduced the
JsonNumArray,JsonIntArrayandJsonStringArraymembers to theLMKit.Data.Json.JsonBasicTypeenumeration - Introduced the
TextTranslation.TrainingDatasetenumeration - Added the
GetTrainingDatamethod to theTextTranslationclass - Added the
EnableDynamicSamplingproperty to theConfigurationclass
Improvements
- Improvements to generic classification accuracy and speed
- Improved global performance with Llama 3.2 models
- Improved global performance with Qwen 2.5 models
- Fine-tuning: extended compatibility to all Llama architectures
- Improved sampling of JSON array structures
📅 Version 2024.9.4 (25 September 2024)
New Features
- Added support for array parameters in function calls
- Added the
ForceFunctionSelectionproperty to theSingleFunctionCallclass - Introduced the
LMKit2024_09_INTmember to theSentimentAnalysis.TrainingDatasetenumeration
Improvements
- Improved function calling accuracy and speed
- Improved emotion detection engine accuracy and speed
- Strong improvements to generic classification accuracy and speed
- Improved accuracy of the language detection engine
Bug Fixes
- Fixed an memory leak in the fine-tuning engine
📅 Version 2024.9.3 (18 September 2024)
Improvements
- Improved function calling accuracy
- Enhanced language detection accuracy
- Boosted generic classification accuracy and speed
- Increased inference speed on CPU
📅 Version 2024.9.2 (11 September 2024)
New Features
- Added the
Seedproperty toRandomSampling,MirostatSampling, andMirostat2Samplingclasses - Introduced the
TrimAutomember to theInputLengthOverflowPolicyenumeration - Enabled deserialization of
ChatHistoryobjects without specifying aModelparameter
Improvements
- Further increased inference speed on CPU
- Enhanced internal API for better error handling
📅 Version 2024.9.1 (6 September 2024)
Improvements
- Optimized inference speed on CPU
- Implemented checks to prevent access to disposed objects
- Improved exception handling in quantization features
📅 Version 2024.9.0 (3 September 2024)
New Features
- Added the
GetTokenmethod to theVocabularyclass - Introduced the
Confidenceproperty to theTextTranslationclass - Added the
MaximumInputTokensproperty to theSingleTurnConversationclass
Improvements
- Enhanced JSON content sampling strategy
- Extended CUDA backend compatibility across more Windows configurations
- Improved accuracy of the language detection engine
- Enhanced compatibility with MAUI
📅 Version 2024.8.4 - Preview (26 August 2024)
New Features
- Introduced the
GlobalConfigurationstatic class
Improvements
- Refined JSON content sampling strategy
- Improved function calling accuracy and speed
- Increased inference speed with Gemma2 models
- Enhanced classification accuracy with tiny-small models
Bug Fixes
- Fixed an issue when computing embeddings from a token array matching the context size
📅 Version 2024.8.3 - Preview (21 August 2024)
Improvements
- Improved support for Phi 3.5 models
📅 Version 2024.8.2 - Preview (20 August 2024)
New Features
- Added the
CreateJsonGrammarFromJsonSchememethod to theGrammarclass
Improvements
- Improved JSON content sampling strategy
- Enhanced function calling accuracy and speed
📅 Version 2024.8.1 - Preview (15 August 2024)
New Features
- Added Vulkan backend 🔥
- Introduced the
FunctionCallResultproperty to theFunctionCallEventArgsclass - Added the
EnableVulkanproperty to theRuntimeclass - Included
KeepLastproperty in theAfterTokenSamplingEventArgsclass
Improvements
- Refined JSON content sampling strategy
- Enhanced function calling accuracy and speed
📅 Version 2024.8.0 - Preview (8 August 2024)
New Features
- Added the
CreateJsonGrammarFromFieldsmethod to theGrammarclass - Enabled function calling support with any model class 🔥
- Introduced the
SingleFunctionCallandFunctionCallResultclasses - Added the
LMFunctionAttributeattribute
Improvements
- Enhanced grammar sampling strategy, especially for JSON schemes
📅 Version 2024.7.10 - Preview (6 August 2024)
New Features
- Added support for Google Gemma 2 2B models 🔥
- Introduced the
CreateJsonGrammarFromTextFieldsmethod to theGrammarclass
Improvements
- Improved grammar sampling strategy, particularly with JSON schemes
📅 Version 2024.7.9 - Preview (31 July 2024)
New Features
- Added the
Versionproperty to theRuntimeclass
Bug Fixes
- Fixed an issue enabling neutral emotion support in the
EmotionDetectionclass - Resolved issues when running fine-tuning with gradient checkpointing
📅 Version 2024.7.8 - Preview (30 July 2024)
New Features
- Added the
MaximumContextLengthproperty to theSingleTurnConversationclass - Added the
GetTextAsyncmethod to theAttachmentclass
Improvements
- Enhanced inference engine handling of context overflow scenarios
📅 Version 2024.7.7 - Preview (29 July 2024)
Improvements
- Enhanced compatibility with .NET Framework 4.6.2
Bug Fixes
- Resolved a crash related to grammar sampling
📅 Version 2024.7.6 - Preview (27 July 2024)
New Features
- Added support for LLaMA 3.1 models 🔥
Improvements
- Enhanced logging mechanism
🎉 Version 2024.7.1 - Preview (23 July 2024)
🎉🥳 Initial release! 🥳🎉
😺 🐱 🐾