Class LM

Namespace: LMKit.Model

Assembly: LM-Kit.NET.dll

Provides a unified manager for AI model instances, including small and large language models, embedding models (image and text), vision‐language models, and speech-to-text models, across GGUF, GGML, ONNX, and LMK container formats. This class handles downloading, loading, configuring, and interacting with these models, ensuring a consistent API for text generation, embeddings, multimodal inference, and speech recognition.

public sealed class LM : IDisposable

Inheritance: object

LM

Implements: IDisposable

Inherited Members: object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.ReferenceEquals(object, object)

object.ToString()

Constructors

LM(ModelCard, string, DeviceConfiguration, LoadingOptions, ModelDownloadingProgressCallback, ModelLoadingProgressCallback): Creates an instance of the Model class from a ModelCard object.

LM(string, DeviceConfiguration, LoadingOptions, ModelLoadingProgressCallback): Creates an instance of the Model class from a file.

LM(Uri, string, DeviceConfiguration, LoadingOptions, ModelDownloadingProgressCallback, ModelLoadingProgressCallback): Creates an instance of the Model class from a System.Uri object.

Properties

Adapters: Gets the collection of LoRA adapters currently applied to the model.

Architecture: Retrieves the architecture type of the model. Ie: 'llama', 'bert', 'phi'...

ChatTemplateFormat: Retrieves or sets the format of the model chat template as detected by LMKit.

ChatTemplateFormatFlags: Gets or sets the flags that specify how the chat template should be formatted.

ContextLength: Gets the context size the model was trained on.

Description: Specifies the model description.

EmbeddingSize: Gets the dimension of embedding vectors produced by the model.

GpuLayerCount: Gets the count of layers that have been previously loaded into the VRAM (Video Random Access Memory) or GPU (Graphics Processing Unit) memory.

HasImageEmbeddings: Indicates whether the model supports image embeddings.

HasReasoning: Gets a value indicating whether the model supports intermediate reasoning steps, such as structured thinking, multi-step problem solving, or chain-of-thought prompting.

HasSpeechToText: Gets a value indicating whether speech-to-text support is available in this model.

HasVision: Indicates whether the model includes vision capabilities, enabling it to process visual content in addition to text.

IsEmbeddingModel: Indicates whether the model primarily functions as a text embedding model.

LayerCount: Specifies the number of input layers in the model.

MainGpu: Gets the GPU used for scratch and small tensors.

ModelMetadata: Specifies metadata keys in the model.

ModelPath: Gets the full local file system path where the model is stored on the local machine. This property represents the destination location after the model has been downloaded.

ModelType: Gets the precision of the model input tensors.

ModelUri: Gets the original URI from which the model is downloaded or can be accessed remotely. This property represents the source location of the model.

Name: Specifies the model name.

ParameterCount: Gets the number of parameters of the model.

RopeAlgorithm: Gets the type of rope algorithm used for positional encoding in the model.

RopeFreqScaleTrain: Gets the RoPE frequency scaling factor used for training the rope positional encoding in the model.

Size: Gets the model size, in bytes.

Vocabulary: Gets the model's vocabulary handler, offering tokenization features.

Methods

ApplyLoraAdapter(LoraAdapterSource): Preloads a Low-Rank Adaptation (LoRA) adapter into the model using parameters from a LoraAdapterSource instance. Upon preload, the adapter is registered in the model’s Adapters collection and will only be applied at inference time when its Scale is greater than zero.

ApplyLoraAdapter(string, float): Preloads a Low-Rank Adaptation (LoRA) adapter from the specified file into the model, preparing it for dynamic application at inference time. Upon preload, a new entry appears in the model’s Adapters collection. The adapter will only take effect when its scale is greater than zero.

ClearCache(): Removes all cached resources linked to this model instance from memory.

Dispose(): Ensures the release of this instance and the complete removal of all associated unmanaged resources.

LoadFromModelID(string, string, DeviceConfiguration, LoadingOptions, ModelDownloadingProgressCallback, ModelLoadingProgressCallback)

Loads an LM instance based on a given model identifier.

Where to find modelID:
1. Use GetPredefinedModelCards(bool) to see a list of available predefined models. Each predefined model exposes its unique identifier in the ModelID property.
2. For custom models, define your own ModelCard instance and specify the ModelID.

ValidateFormat(string, bool): Validates the specified model file to ensure it adheres to a supported format, including verifying its header and overall structure.

Table of Contents

Class LM

Constructors

Properties

Methods