Show HN: Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B

Name: Show HN: Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B
Rating: 4.5 (16 reviews)

Demonstrating real-time, on-device AI capabilities with specific hardware and model, implying efficiency and performance.

200

Traction Score

Discussions

Apr 6, 2026

Launch Date

View Origin Link

Product Positioning & Context

AI Executive Synthesis

Demonstrating real-time, on-device AI capabilities with specific hardware and model, implying efficiency and performance.

This submission highlights the increasing viability of high-performance, on-device AI inference. The ability to run real-time audio/video processing with voice output on an M3 Pro using Gemma E2B signifies a critical shift towards edge computing for AI workloads. This reduces reliance on cloud infrastructure, addressing data privacy concerns and latency issues inherent in cloud-based solutions. For B2B SaaS, this trend enables new product categories requiring immediate, local AI processing, such as enhanced security systems, specialized industrial automation, or highly responsive customer interaction tools. It also lowers operational costs for businesses by minimizing API calls and data transfer fees. The M3 Pro's capability underscores Apple Silicon's growing relevance in professional AI development, potentially driving adoption of macOS for specialized AI applications. This development directly impacts SaaS providers by enabling more robust, private, and efficient client-side AI features, expanding the scope of what can be delivered without constant internet connectivity or external API dependencies.

Related: https://news.ycombinator.com/item?id=47653752

Related Ecosystem & Alternatives

Discover adjacent products, open-source repositories, and developer tools sharing similar technical architecture.

Deep-Dive FAQs

What is Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B?

Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B is analyzed by our AI as: Demonstrating real-time, on-device AI capabilities with specific hardware and model, implying efficiency and performance.. It focuses on This submission highlights the increasing viability of high-performance, on-device AI inference. The ability to run real-time audio/video processin...

Where did Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B originate?

Data for Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B was aggregated directly from the Hacker News community ecosystem, representing raw developer and early-adopter sentiment.

When was Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B publicly launched?

The initial public indexing or launch date for Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B within our tracked developer communities was recorded on April 6, 2026.

How popular is Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B?

Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B has achieved measurable traction, logging over 200 traction score and facilitating 16 recorded discussions or engagements.

Which technical categories define Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B?

Based on metadata extraction, Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B is categorized under topics such as: Real-time AI, audio/video in, voice out, M3 Pro, Gemma E2B.

What are some commercial alternatives to Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B?

Our semantic intelligence engine identifies potential commercial alternatives in the SaaS space, such as Trump Accounts, which offers overlapping value propositions.

How does the creator describe Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B?

The original author or development team describes the product as follows: "Related: https://news.ycombinator.com/item?id=47653752"

Community Voice & Feedback

crsAbtEvrthng • Apr 6, 2026

If I run this without internet connection it says "loading..." at the bottom of the localhost site and won't work.If I run this with internet connected it works flawlessly. Even if I disconnect my internet afterwards it still goes on working fine.Why there has to be an internet connection established at the time I open the localhost site when all of this should be working purely on device?Despite of this, I am really impressed that this actually works so fast with video input on my M4 Pro 48 GB.

myultidevhq • Apr 6, 2026

This is really impressive for running locally on an M3 Pro. The latency looks surprisingly good for real-time audio and video input.Curious about one thing though, how does it handle switching between languages? I work with both Greek and English daily and local models usually struggle with that.Great work, bookmarking this.

an0n-elem • Apr 6, 2026

Cool work buddy:)

magzter • Apr 6, 2026

This is so cool, I'm always speaking to people about how the advancement in the SOTA hosted AI's is also happening in the local model space, i.e. the SOTA hosted AI models 6-12 months ago are what we're seeing now being able to run locally on average hardware - this is such an amazing way to actually demo it.

est • Apr 6, 2026

I am making something similar. Also been using Kokoro for TTS. Very cool project!Gemma 4 is kinda too heavyweight even with E2B. I am sticking with qwen 0.8B at the moment.

divan • Apr 6, 2026

Can someone quickly vibe code MacOS native app for that so it doesn't require running terminal commands and searching for that browser tab? (: (also for iOS, pls)

jwr • Apr 6, 2026

That is very, very interesting. I've been hoping to have an assistant in the workshop (hands-free!) that I could talk to and have it help me with simple tasks: timers, calculating, digging up notes, etc. — basically, what the phone assistants were supposed to be, but aren't."You will have to unlock your iphone first" is kind of a deal-breaker when you are in the middle of mixing polyurethane resin and have gloves and a mask on.More and more I find that we have the technology, but the supposedly "tech" companies are the gatekeepers, preventing us from using the technological advances and holding us back years behind the state of the art.I'll be trying this out on my Macbook, looks very promising!

zerop • Apr 6, 2026

I have been looking forward to build something like this using open models. A voice assisstant I can talk while I am driving, as I do have long commute. I do use chatGPT voice mode and it works great for querying any information or discussions. But I want to do tasks like browsing web, act like a social media manager for my business etc.

dvt • Apr 6, 2026

Solid work and great showcase, I've done a bunch of stuff with Kokoro and the latency is incredible. So crazy how badly Apple dropped the ball... feels like your demo should be a Siri demo (I mean that in the most complimentary way possible).

Discovery Source

Hacker News

Aggregated via automated community intelligence tracking.

Tech Stack Dependencies

No direct open-source NPM package mentions detected in the product documentation.

Media Tractions & Mentions

No mainstream media stories specifically mentioning this product name have been intercepted yet.

Deep Research & Science

No direct peer-reviewed scientific literature matched with this product's architecture.