App Store

Private LLM - Local AI Chat

Originator Numen Technologies Limited

Primary Metric 659

Discover the Ultimate Privacy-Focused AI Assistant on iOS: Private LLM Unlock a new realm of productivity and creativity on your iPhone and iPad with Private LLM, the premier AI assistant designed with your privacy in mind. Available for a one-time purchase, it offers a range of AI capabilities without needing a subscription. Experience advanced on-device AI that keeps your interactions confidential and offline. Why Private LLM is Your Go-To AI Companion: - Exclusive AI Model Selection: Choose from a diverse set of open-source LLM models optimized for performance and perplexity on iOS with state of the art OmniQuant quantization: including models from Llama 2, Llama 3.2, Llama 3.1, Google Gemma 2, Gemma 3, Microsoft Phi-3, Mistral 7B, Qwen 2.5, Qwen 3, StableLM 3B and many more. Whether you need help with creative brainstorming, coding, or daily questions, customize your AI experience to meet your unique needs. - Integrated with Siri & Shortcuts: Enhance your AI interactions with Siri commands and customizable Shortcuts. Private LLM seamlessly fits within your Apple ecosystem, making your digital assistant more accessible. - Customizable Interactions: Tailor your AI's responses and interactions with customizable system prompts to match your preferences and needs. - Uncompromised Privacy and Security: With Private LLM, your conversations stay confidential and on your device. Our advanced on-device AI performs robust computing without risking data compromise or needing an internet connection. - Family Sharing & Offline Capabilities: Benefit from a one-time purchase that includes Family Sharing. Download models as needed and enjoy the full functionality of your AI assistant, even without internet access. Supported LLM Model families: - DeepSeek R1 Distill based models - Phi 4 based models - Qwen 3 based models (Qwen3-4B-Instruct-2507) - Qwen 2.5 based models (0.5B, 1.5B, 3B and 7B) - Qwen 2.5 Coder based Models (0.5B, 1.5B, 3B, 7B and 14B) - Llama 3.1 8B based models - Llama 3.2 1B and 3B based models - Google Gemma 2 2B and 9B based models - Google Gemma 3 1B based models - Mistral 7B based models - Yi 6B based models For a full list of supported models, including detailed specifications, please visit privatellm.app/models. Private LLM is not just a chatbot; it's a comprehensive AI companion designed to respect your privacy while providing versatile, on-demand assistance. Whether you're enhancing your creative writing, tackling complex programming challenges, or just seeking answers, Private LLM adapts to meet your needs while keeping your data secure. Start your journey with Private LLM today and elevate your productivity and creative projects with the most private AI assistant for iOS devices. Private LLM is a better alternative to generic llama.cpp and MLX wrappers apps like Enchanted, Ollama, LLM Farm, LM Studio, Locally AI, RecurseChat, etc on three fronts: 1. Private LLM uses a faster and highly-optimized mlc-llm based inference engine. 2. Models in Private LLM are quantized using the state of the art quantization algorithms like OmniQuant, while competing apps use naive round-to-nearest quantization. 3. Private LLM is a fully native app built using C++, Metal and Swift with deep integrations with iOS and iPadOS, while many of the competing apps are bloated and non-native Electron or Flutter based apps. Please note that Private LLM only supports inference with text based LLMs. Model support varies by device capabilities.

View Raw Thread

Developer & User Discourse

RobK69420 • May 5, 2026 ★ 5

I use daily on the train

Gevdhxbeb • May 4, 2026 ★ 1

I really wanted to like it but its just not worth it man. Its answers are worse than just guessing yourself or asking a friend. It just totally ignores my prompt and gives a vague answer for 5% of what i typed. Its a cool idea and i hope it gets better.

RealLilGary • May 3, 2026 ★ 2

The app looks really good on the store page, bought it and it is very disappointing. It is a very barebones app, no conversation memory (you have to delete your conversation to have another one), and the downloading models stopped working. They would download to 38% and then hang up and the app would freeze if I clicked resume download or anything. Overall there are many better free apps that do the same thing as this app but better In every way. The best thing this app has going for it is that it has obliterated models of local llms, but again that’s assuming you can even download them.

heyheydee • May 1, 2026 ★ 1

Terrible frustrating experience. Very limited information, super basic, did not understand my questions, filled responses with cya legal language over and over instead of being efficient. I gave it feedback on how to respond but kept getting the same canned responses. I’m confused how a general AI assistant could work so poorly. Waste of $4.95!!!

Amir703 • May 1, 2026 ★ 1

All the models are incredibly dumb, when you download Gemini everything gets buggy and the app crashes, it's just a terrible app.

Acrobyte • Apr 27, 2026 ★ 1

Waste of money. The models don’t work correctly and most of the time cannot keep up a coherent conversation for more the 2 prompts. My guy just devolved into slop. Not even English. Waste of my money in super angry

Eno7e • Apr 18, 2026 ★ 4

Add transcription of audio messages from system files.

seali33 • Apr 10, 2026 ★ 1

The only thing that I thought might I might like is their shortcuts. Today I found that it didn’t work as expected.

tyaoqb • Apr 4, 2026 ★ 2

里面的模型都跟傻子一样，回答驴唇不对马嘴

AbominationCub • Mar 31, 2026 ★ 1

What is the point of a local model on an iPad if I can’t add local files to the session. Uploading to Private LLM servers goes against the whole point of a local llm

zenphysician • Mar 24, 2026 ★ 5

Excellent app! The only thing I’d like to see is to be able to paste in links to other ollama models, you can include the same disclaimer about memory limits and just add a “+ Custom Model” option under manage models, it would prevent you from having to update the app monthly to incorporate new models. It would really expand the usability of the app.

Great work!

wavetracer • Mar 14, 2026 ★ 3

Running this on my MacBook Pro, the font is tiny and there’s no way to increase its size. App seems to function OK but it’s not terribly useful like this.

ChuckSpark • Mar 2, 2026 ★ 3

They take a long time to add
new models

Eaglemax6 • Feb 25, 2026 ★ 1

Good app but has a major flaw. Chat history gets lost if app crashes which happens almost daily. Also the models token context are super small. Around 8k tokens max. But even way under the limit and it still crashes and I lose all chat history. AND YES IT GETS HOT AND AFTER ABOUT 15 seconds it bogs down in speed drastically. (The heat part is not the apps fault just due to hardware limitations. If they fix this the app will be worth the money

Fo”nem • Feb 15, 2026 ★ 1

Wast of money.

oseek jecuba • Feb 12, 2026 ★ 1

The ai used in this app is only up to date september2021

PuckHawgs • Jan 28, 2026 ★ 2

I saw this application in one of my AI newsletters I receive in my email and thought I’d give it a try. Not impressed. Waste of money. If you have used ANY of the mainstream, up to date, Chat AI platforms, you’ll be disappointed in this application. You get constant repetitive responses, very little memory of conversations and data.

amperland • Jan 24, 2026 ★ 3

Needs to support: offline datasets, RAG via web or shortcuts app for multiple step generation. Select text feature implementation is very bad

xxx的三次方 • Jan 22, 2026 ★ 1

所有模型都下载不了

Ms.SmurfyLu • Jan 21, 2026 ★ 1

I asked about the assurances in the privacy policy. The response to whether personal data is collected, “Yes, personal data may be collected in certain circumstances . . . where it is necessary to protect the interests of another individual.”

It does collect your data (but may not retain it — but who knows). If it wasn’t collecting your personal identifying data, it couldn’t report you to the “authorities”.