Select a model configuration and deploy to RunPod
Large language model for chat completions
Text/image embedding model
Speech-to-text, OCR, and reranking models
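
The three model categories above could be represented as a small lookup that a selection step validates against. This is only a sketch: the keys `llm`, `embedding`, and `other` and the helper `select_category` are hypothetical names chosen for illustration, only the description strings come from the text above, and the actual deployment call to RunPod is deliberately left out.

```python
# Hypothetical category keys; only the description strings come from the text.
MODEL_CATEGORIES = {
    "llm": "Large language model for chat completions",
    "embedding": "Text/image embedding model",
    "other": "Speech-to-text, OCR, and reranking models",
}


def select_category(choice: str) -> tuple[str, str]:
    """Normalize a user's choice and return (key, description).

    Raises ValueError for a choice that is not a known category.
    """
    key = choice.strip().lower()
    if key not in MODEL_CATEGORIES:
        valid = ", ".join(MODEL_CATEGORIES)
        raise ValueError(f"unknown category {choice!r}; expected one of: {valid}")
    return key, MODEL_CATEGORIES[key]


if __name__ == "__main__":
    # List the available categories the way a selection prompt might.
    for key, description in MODEL_CATEGORIES.items():
        print(f"{key:<10} {description}")
```

Validating the choice before any deployment step means a mistyped category fails early with a message listing the valid options.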