Total Pods: 3
Running: 3
Stopped: 0
Cost / Hour: $3.78

Active Pods

Pod | ID | Status | GPU | Uptime | Cost / Hour | Endpoints
qwen 8b vl embed | exho3rolcl3wks... | Running | A40 | 11h 36m | $0.400 | Checking endpoints...
GPT OSS 120B | g7a6uemnl14j88... | Running | A100 SXM | 11h 6m | $2.980 | Checking endpoints...
paddle+rerank+whisper | yx8y9c07i00iw1... | Running | A40 | 10h 26m | $0.400 | Checking endpoints...
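
The Cost / Hour figure in the summary appears to be the sum of the per-pod hourly rates listed above. A minimal sketch of that arithmetic follows. The `Pod` class and the helper names are hypothetical, and the accrued-cost estimate assumes each rate stayed constant for the whole uptime and that billing is proportional to minutes, neither of which the dashboard states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pod:
    """Hypothetical record holding the fields shown in the Active Pods list."""
    name: str
    uptime_minutes: int
    rate_per_hour: float  # USD per hour, as displayed on the dashboard


# Values copied from the Active Pods list.
PODS = [
    Pod("qwen 8b vl embed", 11 * 60 + 36, 0.400),
    Pod("GPT OSS 120B", 11 * 60 + 6, 2.980),
    Pod("paddle+rerank+whisper", 10 * 60 + 26, 0.400),
]


def hourly_total(pods):
    """Sum the hourly rates of all pods, rounded to cents."""
    return round(sum(p.rate_per_hour for p in pods), 2)


def accrued_cost(pod):
    """Estimate spend so far (assumption: constant rate, per-minute proration)."""
    return pod.uptime_minutes / 60 * pod.rate_per_hour


if __name__ == "__main__":
    print(f"Cost / Hour: ${hourly_total(PODS):.2f}")
    for pod in PODS:
        print(f"{pod.name}: about ${accrued_cost(pod):.2f} accrued")
```

The hourly total reproduces the $3.78 shown in the summary. The accrued figures are only estimates under the stated assumptions.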

Available Models

Model | Description | GPU Configuration | Ports | Auto Deploy
GPT-OSS 120B | Large language model for chat completions | 2x A100 80GB | 8082 | Enabled
Qwen VL Embedding | Text/image embedding model | 1x A40 48GB | 8082 | Enabled
Whisper + PaddleOCR + Reranker | Speech-to-text, OCR, and reranking models | 1x A40 48GB | 8082, 8083, 8084 | Enabled
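
The model catalogue above could be held as plain data so that GPU counts and ports can be read programmatically. The sketch below is one possible representation, not the dashboard's actual schema. `ModelConfig`, `parse_gpu_config`, and `auto_deploy_ports` are hypothetical names, and the parser assumes the "Nx GPU-name" format seen in the GPU Configuration column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical record for one row of the Available Models table."""
    name: str
    gpu_config: str      # e.g. "2x A100 80GB", copied verbatim from the table
    ports: tuple
    auto_deploy: bool


# Values copied from the Available Models table.
MODELS = [
    ModelConfig("GPT-OSS 120B", "2x A100 80GB", (8082,), True),
    ModelConfig("Qwen VL Embedding", "1x A40 48GB", (8082,), True),
    ModelConfig("Whisper + PaddleOCR + Reranker", "1x A40 48GB",
                (8082, 8083, 8084), True),
]


def parse_gpu_config(text):
    """Split a string such as '2x A100 80GB' into (2, 'A100 80GB')."""
    count, _, gpu = text.partition("x ")
    return int(count), gpu


def auto_deploy_ports(models):
    """Map each auto-deploy model name to the list of ports it exposes."""
    return {m.name: list(m.ports) for m in models if m.auto_deploy}


if __name__ == "__main__":
    total_gpus = sum(parse_gpu_config(m.gpu_config)[0] for m in MODELS)
    print(f"GPUs required if every model is deployed: {total_gpus}")
    for name, ports in auto_deploy_ports(MODELS).items():
        print(f"{name}: ports {ports}")
```

Keeping the GPU string verbatim and parsing it on demand avoids restating the table in a second format that could drift from it.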