AI models available through the gateway
Available Models
Vindex Ai provides access to high-performance AI models through Groq.
Model Overview
The following models are available. The model is selected when creating or editing a site.
| Model | Context | Max Output | Speed | Best For |
|---|---|---|---|---|
| Llama 3.3 70B | 128K | 8K | ~180 TPS | General purpose, reasoning |
| Llama 3.1 70B | 128K | 8K | ~150 TPS | Complex tasks, code generation |
| Llama 3.2 90B | 128K | 8K | ~100 TPS | Large context tasks |
| Mixtral 8x7B | 32K | 32K | ~200 TPS | Fast responses, coding |
| Gemma 2 9B | 8K | 4K | ~250 TPS | Lightweight, fast tasks |
Selecting a Model
For General Use
Recommended: Llama 3.3 70B
Best for: Customer support, general conversation, Q&A
For Code Generation
Recommended: Llama 3.1 70B or Mixtral 8x7B
Best for: Programming help, code review, technical questions
For Fast Responses
Recommended: Gemma 2 9B
Best for: Simple queries, high-volume scenarios
For Large Documents
Recommended: Llama 3.2 90B
Best for: Long document processing, analysis
Model Selection
To change your site's model:
- Go to Sites in the dashboard
- Open your site and click Edit
- Select a model from the AI Model dropdown
- Save changes
The new model will be used for all new conversations. Existing sessions continue with their original model.