Bare Metal Voice · talk to your machine

A voice agent.
100% local.

Speak to an AI that hears you, thinks, and answers out loud — entirely on your own GPU. It can act on your machine too: read your files, run local tools, take actions. No call leaves the box. No per-minute meter.

Live loop “What does the note on my desktop say I need to do?”
01
Hear
Whisper transcribes your speech on-device — no audio is uploaded.
02
Think
Your local model decides the reply — and which tool to call.
03
Act
Tools run on your actual machine — read a file, run a command — in a fraction of a millisecond.
04
Speak
On-device text-to-speech voices the answer. The whole loop never touches the network.
Whisper STTYour local LLMOn-device TTS0 bytes to the cloud
Private by construction
$0
Per minute — your GPU, your model, no meter. Cloud voice agents bill $0.13–0.31/min.
0
Bytes leave your machine — audio, transcript and tool data stay on-device
0.2 ms
To call a tool on your own machine — local execution, not a cloud webhook
100%
Offline-capable — pull the network cable and it still answers
Hear · think · act · speak — one stack

The tools run on
your machine.

Cloud voice platforms can call functions — but only as webhooks back to a server on the public internet. Their brain lives in the cloud, so they can’t read your files, open your apps, or reach a service on localhost. Bare Metal Voice runs the whole loop — speech, model, and tool execution — on the same box. So a “tool call” is a local function on your machine: instant, private, and able to touch things no cloud agent can reach.

Hear

On-device speech

Whisper runs locally and transcribes as you talk. Your voice is never streamed to a server.

Think

Your local model

Any model in the catalog, with tool-calling wired below the model so the agent loop actually holds.

Act

Tools on your box

Read a file, run a command, hit a localhost service — local execution a cloud webhook can’t reach.

Speak

On-device voice

Natural text-to-speech renders the reply on your own GPU. The whole round-trip stays offline.

Talk to it.
Privately.

Your voice, your model, your machine — the call never leaves the box.

Get the runtime Try Presence