Enum InferenceModality
Specifies which modality (or combination of modalities) should be used when performing inference on the input data.
public enum InferenceModalityFields
- Text = 0
- Perform inference using only text-based processing. Any image content will be ignored or treated as non-text. 
- Vision = 1
- Perform inference using only image-based (vision) processing. Any text content will be ignored. 
- Multimodal = 2
- Perform inference using both text and image modalities. The model will combine signals from text and vision. 
- BestModality = 3
- Automatically select the single best modality (Text or Image) based on the input characteristics.