Question 1

Is Omnivra AI free?

Accepted Answer

Yes. It routes through free tiers of OpenRouter, Groq, Hugging Face, and Tavily. You bring your own API keys (all have generous free tiers, no credit card required).

Question 2

Which AI models does it use?

Accepted Answer

Llama 3.3 70B and OpenAI GPT-OSS 120B for chat and reasoning, DeepSeek V4 Flash for code, Google Gemma 4 for vision, FLUX.1-schnell for images, and any other model you wire into the router.

Question 3

How is this different from ChatGPT?

Accepted Answer

ChatGPT uses one model for everything. Omnivra auto-picks the right specialist model per task and falls back across providers when free quotas run out — so you get better answers without managing accounts at five different AI companies.

Question 4

Where is my data stored?

Accepted Answer

Your conversations live in your own Supabase project (free tier includes 500MB Postgres). Row-level security guarantees nobody else can read them. We never store them on our servers.

Question 5

Can I self-host it?

Accepted Answer

Yes — the code is fully open. Backend is FastAPI, frontend is Next.js. Deploy frontend to Vercel and backend to Render in under 10 minutes.

Your prompt looks like	Routed to
"how do I fix this stack trace?"	DeepSeek V4 Flash
"prove the binomial theorem"	GPT-OSS 120B
"generate an image of a fox"	FLUX.1-schnell
photo + "what's in this?"	Gemma 4 31B
"latest AI news today"	Tavily → Llama 3.3
"hi, how are you?"	Llama 3.3 70B

One chat, every AI model.

Built for every kind of question

Smart auto-routing

Live web search

Vision Q&A

Image generation

Provider fallback

Sign in & sync

How does auto-routing work?

Frequently asked questions

Stop paying for five AI tools.