Method GetLoadedContexts
GetLoadedContexts()
Returns a snapshot of the inference contexts (KV-caches) currently held in memory for this model: those actively in use and those idle in the recycle pool. Each entry reports the context's token capacity, memory footprint, residency state, and device, so callers can see exactly what is keeping the model resident and where its memory is going. The list is empty when no context is held.
public IReadOnlyList<ContextInfo> GetLoadedContexts()