Namespace LMKit.Speech

Classes

AudioSegment: Represents a segment of audio with its recognized text, timing, confidence score, and language.

SpeechToText: Provides transcription and language-detection capabilities using an LM model with speech-to-text support.

SpeechToText.LanguageDetectionResult: Represents the result of a language-detection operation, including the detected language code and its confidence score.

SpeechToText.OnNewSegmentEventArgs: Provides data for the event raised when a new AudioSegment is produced during transcription.

SpeechToText.OnProgressEventArgs: Provides data for the OnProgress event, containing the current completion percentage of the transcription.

SpeechToText.TranscriptionResult: Holds the AudioSegment instances produced by a transcription and provides their full combined text.

VadSettings: Configuration settings for voice activity detection (VAD).

Enums

SpeechToText.SpeechToTextMode: Specifies the operating mode for the SpeechToText engine: whether to transcribe speech in the original language or to translate it into English.