
Available Models

uniGPT offers a variety of models — both internal (hosted on-premises at the university) and external (provided via cloud partners).

For balance reset details and practical advice on conserving credits, see Usage Limits & Balance.

Internal Models (On-Premises)

These models run entirely on university infrastructure, so your data never leaves the university. This makes them fully GDPR-compliant and suitable for sensitive data.

Model                Description                                                             Input / output cost (credits)
Mistral Small        Fast, efficient general-purpose model                                   0.15 / 0.2
Llama-3.3-70B        Older open-weight model by Meta                                         0.65 / 0.5
gpt-oss-120b         Large internal model for complex tasks                                  0.15 / 0.6
gemma-3-27b-it       Efficient multimodal model with image support                           0.09 / 0.16
medgemma-1.5-4b-it   Specialized multimodal model for medical text and image understanding   0.02 / 0.04
Qwen3.5-35B-A3B      Advanced Chinese reasoning and coding model with image support          0.035 / 0.138

Best for sensitive data

Use internal models when working with personal data, unpublished research, or any data subject to GDPR restrictions.

External Models (Cloud Providers)

These models are accessed via cloud providers through the GÉANT OCRE framework. While they offer state-of-the-art performance, data is processed externally.

Model               Provider                 Notes                                      Input / output cost (credits)
GPT-5.4             OpenAI (via Azure)       Strong flagship reasoning model            2.5 / 15
GPT-5.2             OpenAI (via Azure)       High-capability reasoning model            1.75 / 14
GPT-5 Mini          OpenAI (via Azure)       Lower-cost GPT-5 variant                   0.25 / 2
Gemini 3.1 Pro      Google Cloud             Advanced multimodal model                  2 / 12
Gemini 3 Flash      Google Cloud             Fast multimodal model                      0.25 / 1.5
Claude Opus 4.6     Anthropic (via Google)   Strongest Claude model for deep analysis   5 / 25
Claude Sonnet 4.5   Anthropic (via Google)   Balanced writing and analysis model        3 / 15
Claude Haiku 4.5    Anthropic (via Google)   Fastest Claude option                      1 / 5

Cost awareness

External models cost significantly more per token than internal models, and the university receives only a minimal discount (under 5%) from providers. Please use external models sparingly and prefer internal models or cost-effective options such as Gemini 3 Flash whenever possible; otherwise, per-user limits may need to be reduced.
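To make the cost differences concrete, the sketch below compares the credit cost of one request across a few models from the tables above. It assumes the listed rates apply per 1,000 tokens; the actual billing unit is defined by the platform, so treat the absolute numbers as illustrative only.

```python
# Rough cost comparison using per-model rates from the tables above.
# Assumption (hypothetical): rates are credits per 1,000 tokens.

RATES = {  # model: (input rate, output rate), in credits
    "Mistral Small": (0.15, 0.2),      # internal
    "gpt-oss-120b": (0.15, 0.6),       # internal
    "Gemini 3 Flash": (0.25, 1.5),     # external, cost-effective
    "Claude Opus 4.6": (5, 25),        # external, most expensive
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Credits charged for one request under the assumed per-1K-token unit."""
    inp, out = RATES[model]
    return (input_tokens / 1000) * inp + (output_tokens / 1000) * out

# Example: a 2,000-token prompt with a 500-token answer.
for model in RATES:
    print(f"{model}: {request_cost(model, 2000, 500):.3f} credits")
```

Even in this small example, the strongest external model is more than an order of magnitude more expensive per request than the internal options, which is why the guidance above recommends internal models as the default.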

Choosing the Right Model

  • Quick tasks & drafting: Mistral Small, Llama-3.3-70B
  • Complex reasoning: gpt-oss-120b, GPT-5.2/GPT-5.4, Claude Opus 4.6
  • Medical text & image tasks: medgemma-1.5-4b-it
  • Cost-effective external option: Gemini 3 Flash
  • Sensitive/confidential data: always use internal models
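The guidance above can be sketched as a small selection helper. The task categories and the fallback behavior are hypothetical illustrations, not part of the uniGPT platform; the model names follow the tables on this page.

```python
# Hypothetical helper mapping the guidance above to a model choice.
# Task categories and the "sensitive data -> internal only" rule
# mirror the bullet list; the helper itself is illustrative.

GUIDE = {
    "drafting": ["Mistral Small", "Llama-3.3-70B"],
    "complex_reasoning": ["gpt-oss-120b", "GPT-5.4", "Claude Opus 4.6"],
    "medical": ["medgemma-1.5-4b-it"],
    "cheap_external": ["Gemini 3 Flash"],
}

INTERNAL = {"Mistral Small", "Llama-3.3-70B", "gpt-oss-120b",
            "gemma-3-27b-it", "medgemma-1.5-4b-it", "Qwen3.5-35B-A3B"}

def pick_model(task: str, sensitive_data: bool = False) -> str:
    """Return the first suitable model; sensitive data forces internal models."""
    candidates = GUIDE.get(task, ["Mistral Small"])
    if sensitive_data:
        # Drop external candidates; fall back to a safe internal default.
        candidates = [m for m in candidates if m in INTERNAL] or ["Mistral Small"]
    return candidates[0]

print(pick_model("complex_reasoning"))                    # internal default
print(pick_model("cheap_external", sensitive_data=True))  # forced internal
```

Note how the sensitive-data flag overrides everything else, matching the rule that confidential material must stay on internal models regardless of task.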

For current balance behavior and reset timing, see Usage Limits & Balance.