What is Avelyn?
Everything you need to know about the local offline AI assistant: privacy, capabilities, and integrations.
Q.What is Avelyn?
Avelyn is a privacy-first, local offline AI writing assistant designed for macOS. It integrates system-wide: highlight text in any application (such as Chrome, Slack, Word, or VS Code), press a customizable global keyboard shortcut, and refine or rewrite your prose instantly using local AI models.
Q.Is the application safe to use?
Yes. Unlike traditional cloud writing assistants that stream your highlighted text to remote servers, the tool operates entirely within a local sandbox on your machine. It captures your highlighted text only when the hotkey is pressed, processes it strictly in local memory, and writes the result to your clipboard with no external network requests, logging, or telemetry.
Q.Is it completely offline?
Yes. When configured with local offline servers like Ollama, the assistant operates 100% offline. No active internet connection is required to rewrite, format, translate, or proofread your text, making it ideal for air-gapped workstations or sensitive corporate environments.
Q.Does the macOS assistant use Ollama?
Yes, by default. The tool uses Ollama for local model execution, which lets you host and run open-weights models such as Llama 3, Mistral, and Gemma 3 on your Apple Silicon CPU or GPU. Alternatively, if you prefer not to host models locally, you can configure cloud API connections in the Settings panel.
Q.How does the offline AI assistant function system-wide?
The utility uses standard macOS accessibility APIs to copy highlighted text when the global shortcut (Ctrl+Shift+E by default) is pressed. It then displays a minimalist command palette next to your cursor, takes the rewrite instruction you select (Smart Assist, Fix Grammar, or a custom prompt), processes the text via local Ollama inference, and pastes the result back in place.
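The inference step in that flow can be sketched in Python. This is a minimal illustration under stated assumptions, not Avelyn's actual implementation: the function names and the prompt template are hypothetical, while the endpoint and JSON fields follow Ollama's `/api/generate` API on its default port (11434).

```python
import json
import urllib.request

# Ollama's default local endpoint; assumes Ollama is running on this machine.
OLLAMA_GENERATE_URL = "http://localhost:11434/api/generate"


def build_rewrite_payload(selected_text, instruction, model="llama3"):
    """Combine the chosen instruction and the highlighted text into one request body.

    The prompt layout is a hypothetical template chosen for illustration.
    """
    prompt = f"{instruction}\n\nText:\n{selected_text}"
    # stream=False asks Ollama to return a single JSON object instead of chunks.
    return {"model": model, "prompt": prompt, "stream": False}


def rewrite(selected_text, instruction, model="llama3"):
    """Send the payload to the local Ollama server and return the rewritten text."""
    body = json.dumps(build_rewrite_payload(selected_text, instruction, model))
    request = urllib.request.Request(
        OLLAMA_GENERATE_URL,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `rewrite("their going home", "Fix Grammar")` would return the model's corrected text, provided the named model has already been pulled into the local Ollama library.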
Q.Why is the system different from cloud alternatives?
It is built from the ground up for zero cloud dependency and complete data sovereignty. Unlike cloud assistants that transmit your text to third-party servers, this utility runs open LLMs natively on your own Mac, executes edits in place inside a local sandbox, requires no subscriptions, and functions completely offline.
Q.What macOS versions are supported?
The software supports macOS 13 (Ventura) and newer, including macOS 14 (Sonoma) and macOS 15 (Sequoia). It runs on both Intel-based Macs and Apple Silicon machines (M1, M2, M3, M4 series), though Apple Silicon is strongly recommended for faster token generation and lower latency during local LLM execution.
Q.Can I run fine-tuned or custom model files?
Yes. Because the utility integrates directly with Ollama's local API server, you can use any model built from a custom Modelfile in your local library. Once the model is registered in Ollama, it will automatically appear in the assistant's model dropdown list.
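As a rough sketch of how such a dropdown could be populated, the Python below reads the model list from Ollama's `/api/tags` endpoint, which returns the models in the local library. The function names are hypothetical and this is not Avelyn's own code. A model built with `ollama create my-editor -f Modelfile` (where `my-editor` is a placeholder name) would show up in this list alongside the stock models.

```python
import json
import urllib.request


def model_names(tags_response):
    """Extract a sorted list of model names from a parsed /api/tags response."""
    return sorted(entry["name"] for entry in tags_response.get("models", []))


def fetch_model_names(base_url="http://localhost:11434"):
    """Query the local Ollama server (default port assumed) for installed models."""
    with urllib.request.urlopen(f"{base_url}/api/tags") as response:
        return model_names(json.loads(response.read()))
```

Keeping the parsing separate from the network call makes it easy to check the logic against a saved response without a running server.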
Q.Is there any background network tracking?
Absolutely not. The application is built with zero telemetry dependencies. It contains no analytics tracking scripts, no error reporting that connects to external endpoints, and no remote config fetching. Your text, keyboard actions, and logs remain strictly on your local SSD.
Detailed Security Architecture Overview
The application runs under local macOS sandboxing rules that restrict network access. Unlike typical macOS writing assistants that keep HTTP connections open to send text to remote cloud services, this utility keeps all processing isolated on the local machine.
When a user opens the command palette with the global shortcut, the copied text selection is held temporarily in a local memory buffer. The text is analyzed locally, processed using CPU or GPU acceleration, and pasted back. The clipboard is then cleared immediately, preventing data leakage across applications.
Additionally, our offline AI assistant for macOS relies on OS-level security permissions. The utility requires Accessibility permissions solely to communicate with open editors via standard UI scripting interfaces. It does not monitor keyboard input in the background when inactive, meaning it never functions as a keylogger. This is a critical distinction for anyone comparing privacy-focused tools to cloud alternatives.
The architecture is also fully compatible with corporate VPNs, firewall rules, and air-gapped systems. When the assistant is configured for local inference, outbound network requests are blocked by the system-level sandbox, so security operations center (SOC) analysts can verify that no outbound traffic is generated during editing. This allows immediate onboarding in financial institutions, medical software editing desks, and aerospace development firms.
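One simple check an administrator might script alongside packet-level monitoring is confirming that the configured inference endpoint points at the local machine. The Python sketch below is illustrative only: `is_loopback_endpoint` is a hypothetical helper, and it assumes the endpoint URL is available as a plain string.

```python
import ipaddress
from urllib.parse import urlparse


def is_loopback_endpoint(url):
    """Return True only if the URL's host is localhost or a loopback IP address."""
    host = urlparse(url).hostname
    if host is None:
        return False  # No host could be parsed, so treat the URL as unsafe.
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # The host is a DNS name other than localhost, so it may leave the machine.
        return False
```

A check like this complements traffic inspection rather than replacing it, since it only validates configuration and does not observe what the process actually sends.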
By decoupling local rewriting from the cloud, users can confidently edit proprietary source code, internal spreadsheets, and confidential emails without violating company compliance frameworks or security protocols. To see how this compares to cloud alternatives, view our detailed comparison guides:
- Explore the Avelyn vs ChatGPT: Local ChatGPT Alternative Guide.
- Explore the Avelyn vs Grammarly: Privacy-First Grammarly Alternative Guide.
- Learn about native optimizations in the About Avelyn Philosophy Page or review model integrations in the Avelyn AI Integration Page.
Ready to enhance your writing locally?
Get started with this privacy-first assistant for macOS and run Llama 3 or Gemma 3 offline today.
Apply for Beta Access