Enum InferenceModality

Specifies which modality (or combination of modalities) should be used when performing inference on the input data.

public enum InferenceModality

Fields

Text = 0: Perform inference using only text-based processing. Any image content will be ignored or treated as non-text.
Vision = 1: Perform inference using only image-based (vision) processing. Any text content will be ignored.
Multimodal = 2: Perform inference using both text and image modalities. The model will combine signals from text and vision.
BestModality = 3: Automatically select the single best modality (Text or Image) based on the input characteristics.