Local AI · Fast · Private
Ask questions, summarize pages, and get instant AI help — directly inside your browser.
Powerful features designed with privacy and speed in mind
Run AI models directly on your machine. With a local provider, your data never leaves your device, giving you full privacy and control.
With local models through Ollama, QuickAI works without an internet connection. Perfect for sensitive work or limited connectivity.
Access AI instantly with a single click. The minimal popup interface gets out of your way while delivering powerful results.
Choose between Ollama, Google Gemini, or OpenAI. Switch providers based on your needs — local privacy or cloud power.
Your conversations are stored locally in your browser. Access up to 50 previous conversations anytime, securely saved on your device.
Choose the theme that suits your workflow. QuickAI adapts to your preference with beautiful dark and light modes.
API keys are stored in your browser's local storage. They are sent only to the provider you select and are never transmitted anywhere else.
Smart input validation (1–5000 characters) and configurable timeouts ensure reliable, consistent performance.
Understand exactly how QuickAI processes your requests
Click the QuickAI icon in your browser toolbar. A minimal, focused popup opens instantly — no page navigation required.
Type your question (1–5000 characters). Use Shift + Enter for multiline input. The interface validates your input in real time.
Choose your AI provider: Ollama for local processing, or Gemini/OpenAI for cloud-powered responses.
The popup sends your request to the extension's background service worker, where input is validated, sanitized, and routed to your chosen provider.
Ollama: Processed entirely on your machine over a localhost connection. No request leaves your device.
Gemini/OpenAI: Secure HTTPS API call to the provider.
The AI response appears in the popup with smooth streaming. Conversations are saved locally (up to 50 items) for future reference.
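The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not QuickAI's actual source: the function names (validateInput, buildRequest, appendHistory) are hypothetical, trimming whitespace before counting characters is an assumption, and streaming is turned off to keep the example short. The endpoints are the public Ollama, Gemini, and OpenAI HTTP APIs, with Ollama assumed to be on its default port.

```typescript
// Illustrative sketch only: every name here is hypothetical, not QuickAI's code.
type Provider = "ollama" | "gemini" | "openai";

interface OutgoingRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

const MAX_INPUT_CHARS = 5000; // input limit stated on this page
const MAX_HISTORY = 50;       // conversation cap stated on this page

// Step 2: accept 1 to 5000 characters (assumption: counted after trimming).
function validateInput(raw: string): string {
  const text = raw.trim();
  if (text.length < 1 || text.length > MAX_INPUT_CHARS) {
    throw new RangeError(`Input must be 1-${MAX_INPUT_CHARS} characters`);
  }
  return text;
}

// Step 4: describe the HTTP request for the chosen provider.
// Ollama is reached on localhost; Gemini and OpenAI are reached over HTTPS.
function buildRequest(
  provider: Provider,
  prompt: string,
  model: string,
  apiKey = "",
): OutgoingRequest {
  const json = { "Content-Type": "application/json" };
  switch (provider) {
    case "ollama":
      return {
        url: "http://localhost:11434/api/generate",
        headers: json,
        body: JSON.stringify({ model, prompt, stream: false }),
      };
    case "gemini":
      return {
        url: `https://generativelanguage.googleapis.com/v1beta/models/${model}:generateContent`,
        headers: { ...json, "x-goog-api-key": apiKey },
        body: JSON.stringify({ contents: [{ parts: [{ text: prompt }] }] }),
      };
    case "openai":
      return {
        url: "https://api.openai.com/v1/chat/completions",
        headers: { ...json, Authorization: `Bearer ${apiKey}` },
        body: JSON.stringify({ model, messages: [{ role: "user", content: prompt }] }),
      };
  }
}

// Step 5: keep only the most recent 50 conversations.
function appendHistory<T>(history: readonly T[], entry: T): T[] {
  return [...history, entry].slice(-MAX_HISTORY);
}
```

Keeping buildRequest free of side effects means the routing logic can be checked without making any network call. A caller would pass the returned url, headers, and body to fetch with the POST method.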
QuickAI is built from the ground up with privacy as a core principle. We believe AI tools shouldn't require sacrificing your data.
Choose your preferred AI provider and start using QuickAI
Run AI models completely offline on your own machine. No data leaves your device.
Download and install Ollama from ollama.ai
Open your terminal and run:
ollama pull llama3
Enable cross-origin requests for the browser extension:
OLLAMA_ORIGINS="*" ollama serve
Open QuickAI and select "Ollama" as your provider. You're ready to go!
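Browser extensions need CORS enabled to communicate with local servers. OLLAMA_ORIGINS is an environment variable, and setting it to "*" allows any origin, including the extension, to send requests to your local Ollama instance. A narrower value such as chrome-extension://* limits access to browser extensions.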
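To confirm the setup worked, one option is to ask the local server which models it has. The sketch below is a minimal example, assuming Ollama is listening on its default port 11434. It uses Ollama's /api/tags endpoint, which returns the locally installed models. The helper names (parseModelNames, listOllamaModels) and the FetchLike type are hypothetical and exist only for this illustration.

```typescript
// Hypothetical helpers for illustration; only the /api/tags endpoint and its
// { models: [{ name: ... }] } response shape come from Ollama's HTTP API.
interface HttpResponse {
  ok: boolean;
  status: number;
  json(): Promise<unknown>;
}
type FetchLike = (url: string) => Promise<HttpResponse>;

// Pull model names out of the response body, ignoring anything malformed.
function parseModelNames(body: unknown): string[] {
  if (typeof body !== "object" || body === null) return [];
  const models = (body as { models?: unknown }).models;
  if (!Array.isArray(models)) return [];
  return models
    .map((m) =>
      typeof m === "object" && m !== null ? (m as { name?: unknown }).name : undefined,
    )
    .filter((name): name is string => typeof name === "string");
}

// Ask the local Ollama server for its installed models.
// A non-2xx status here may mean the server rejected the request's origin.
async function listOllamaModels(
  fetchFn: FetchLike,
  baseUrl = "http://localhost:11434",
): Promise<string[]> {
  const res = await fetchFn(`${baseUrl}/api/tags`);
  if (!res.ok) throw new Error(`Ollama responded with HTTP ${res.status}`);
  return parseModelNames(await res.json());
}
```

Passing the platform's fetch as fetchFn runs the check against the real server. If the returned list includes a name beginning with llama3, the model pulled in the earlier step is available.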
Use Google's powerful Gemini AI models through their cloud API.
Visit Google AI Studio and create a new API key.
Click the gear icon in the QuickAI popup to open settings.
Enter your Gemini API key in the designated field. It's stored locally and never sent anywhere else.
Choose "Gemini" from the provider dropdown and start chatting!
Google offers a generous free tier for Gemini API usage, perfect for personal use.
Access GPT-4 and other OpenAI models through their API.
Sign up at platform.openai.com if you don't have an account.
Navigate to the API Keys section and create a new secret key.
OpenAI requires a payment method. Add billing information to enable API access.
Open QuickAI settings, paste your API key, and select OpenAI as your provider.
The OpenAI API is a paid service. Monitor your usage to avoid unexpected charges. Consider setting usage limits in your OpenAI dashboard.
Get instant AI assistance right in your browser. Private, fast, and free to use with local models.