Vendor-agnostic from day one.
Reachy Mini is the wedge, not the commitment. Skills, brains and the policy engine live above an abstraction so that adding a new robot is a new bridge, not a rewrite.
Kapllan Robotics is the small, exploratory wing of the lab where we ask what changes when our agents stop being chat windows and start being objects in a room. This is not a product roadmap. It is a playground, a notebook, and a record of what we learn as we learn it.
The wedge is small on purpose. Reachy Mini is a head with two antennas, a camera, a microphone, and a speaker — and the quiet expressivity that comes from those four things together. We are starting here because the failure modes are legible and the cost of being wrong is low.
Pollen Robotics built the body. We bring the brains — voice, reasoning, memory — and the policy layer that decides what a small robot in a home is permitted to do.
We have brains — reasoning, voice, vision. We don't yet know what it feels like when one of them can also nod, look at you, or turn toward the door. The first thing we want to learn is what we don't already.
Reachy Mini is not a humanoid. It is a head, two antennas, a speaker, and a microphone. A modest body. We are curious whether that is enough — for company, for a small assistant, for a child's first hello.
A chat window is a polite servant — it speaks only when spoken to. A robot in a room cannot pretend to be invisible. We will need to decide what it is allowed to volunteer, and what it is required to keep to itself.
Hand-tuned baselines, LLM-emitted tags, sentiment fallback — three layers stacked. On screen, this kind of thing is design polish. In the same room, it is the difference between a friendly object and an unsettling one.
Robots are higher-stakes than file edits. We are bringing over the deny-by-default policy engine we built for our coding agent and rethinking it for motion, audio, and voice identity. Humility is a feature.
We are not promising features by date. We are publishing the order in which we plan to learn them, and a guess at how long each one will take.
What we hope to learn · What it is to talk to a thing in the room and have it listen back. Tool-calling — lights, rooms, audio — earns the right to be embodied.
What we hope to learn · Whether asking out loud — “what did I learn about X?” — feels different from typing the same question into NxNotes.
What we hope to learn · When an agent should initiate. A finished training run, a milestone hit, a pull request that needs eyes. The robot becomes an outbound channel, not just an inbound one.
What we hope to learn · What changes when the robot knows who is talking. Speaker diarization, an identity database, and the manners that follow from both.
What we hope to learn · The emotional north star. A child says “pappa” into one body and the message arrives in another. Wake-word, identity verification, secure peer routing — all built so that this last step is layered, not rewritten.
Reachy Mini is the wedge, not the commitment. Skills, brains and the policy engine live above an abstraction so that adding a new robot is a new bridge, not a rewrite.
The brains already exist. Voice, reasoning, second-brain RAG, boardroom deliberation. Robotics is the layer that lets those brains nod, listen and gesture. We are not training new models for it.
Deny-by-default motion envelopes, blast-radius classification, an audit log. Borrowed wholesale from our coding agent and adapted for a body. Robots are higher stakes than file edits.
Robotics work surfaces in the research catalog as field notes. The voice agent essay was the first; expect short, dated entries when something teaches us something — including the failures, especially the failures.
Read the field notes →