InferX — Serverless GPU Inference Platform for Production Workloads

Tenant Namespace Pod Name State Node Name Req. GPU Count Req. GPU vRam (MB) Type Standby (MB) Allocated GPU vRam (MB) Allocated GPU Slots
GPU Pageable Pinned GPU Slot Count
public ActionAnalytics public/ActionAnalytics/CR-70B/79/137 Ready g8398d4 4 71000 Restore Mem : 271616 File : 5460 File : 4 71168 0 278
1 278
2 278
4 278
public ActionAnalytics public/ActionAnalytics/CR-70B/79/142 Standby g8398d4 4 71000 Restore Mem : 271616 File : 5460 File : 4 0 N/A