Usage Limits & Balance¶

To ensure fair access for all university members, uniGPT uses a token-based balance system.

For a comparison of available internal and external models, see Available Models.

How It Works¶

Click on your username in the chat interface to see your current balance.

Use internal models: They consume fewer resources and cost the university less (e.g., Mistral Small, Llama-3.3-70B, gemma-3-27b-it, medgemma-1.5-4b-it, Qwen3.5-35B-A3B).
Avoid expensive models for simple tasks: Use Mistral Small or Llama instead of Claude Opus for basic questions.
Prefer Gemini if you need an external model — it is more cost-effective than Claude Opus or GPT-5.
Lower thinking level: For supported models, select a lower "thinking" setting to reduce token consumption.

See Available Models for model descriptions, privacy guidance, and relative input/output credit costs.

If your balance appears stuck at zero after 4 hours, try logging out and logging back in. This usually resolves the issue.

Limits are evolving

The exact limits are currently being fine-tuned. If you experience issues, please report them in the support channel.