AI models available through the gateway

Available Models

Vindex Ai provides access to high-performance AI models through Groq.

Model Overview

The following models are available. The model is selected when creating or editing a site.
ModelContextMax OutputSpeedBest For
Llama 3.3 70B128K8K~180 TPSGeneral purpose, reasoning
Llama 3.1 70B128K8K~150 TPSComplex tasks, code generation
Llama 3.2 90B128K8K~100 TPSLarge context tasks
Mixtral 8x7B32K32K~200 TPSFast responses, coding
Gemma 2 9B8K4K~250 TPSLightweight, fast tasks

Selecting a Model

For General Use

Recommended: Llama 3.3 70B
Best for: Customer support, general conversation, Q&A

For Code Generation

Recommended: Llama 3.1 70B or Mixtral 8x7B
Best for: Programming help, code review, technical questions

For Fast Responses

Recommended: Gemma 2 9B
Best for: Simple queries, high-volume scenarios

For Large Documents

Recommended: Llama 3.2 90B
Best for: Long document processing, analysis

Model Selection

To change your site's model:
  1. Go to Sites in the dashboard
  2. Open your site and click Edit
  3. Select a model from the AI Model dropdown
  4. Save changes
The new model will be used for all new conversations. Existing sessions continue with their original model.