3
Total Pods
3
Running
0
Stopped
$3.78
Cost / Hour
Active Pods
Create Podqwen 8b vl embed
exho3rolcl3wks...
A40
11h 36m
0.400/hr
Checking endpoints...
GPT OSS 120B
g7a6uemnl14j88...
A100 SXM
11h 6m
2.980/hr
Checking endpoints...
paddle+rerank+whisper
yx8y9c07i00iw1...
A40
10h 26m
0.400/hr
Checking endpoints...
Available Models
| Model | GPU Configuration | Ports | Auto Deploy |
|---|---|---|---|
|
GPT-OSS 120B
Large language model for chat completions
|
2x A100 80GB | 8082 | Enabled |
|
Qwen VL Embedding
Text/image embedding model
|
1x A40 48GB | 8082 | Enabled |
|
Whisper + PaddleOCR + Reranker
Speech-to-text, OCR, and reranking models
|
1x A40 48GB | 8082, 8083, 8084 | Enabled |