00 Overview

Every way people interact,
now available to agents.
Nothing new for you to learn.
Everything new to do.

"Spacebar is the substrate the next generation of AI products will be built on. Multi-user, multi-agent, fully live, and already at production scale."

Rajat Goel, Senior Engineer
Thinking Machines Lab

Spacebar is the shared, full-duplex workspace where AI agents see, hear, speak, remember, and act alongside humans. One multiplayer surface, one permission model, one event stream. Full-duplex by construction. Model-agnostic and lab-agnostic. Bring your in-house model. Weights stay on your hardware. Substrate, not gatekeeper. In production today, carrying paid customer workloads.

Full-duplex, by construction

50 ms event propagation, p99

99.99%+ availability, measured

[Figure: a Spacebar canvas showing a market analysis workspace with revenue data, growth tasks, projections, a video call, live document editing, and live participant avatars (Anna, Travis, and an agent).]

Fig. 00 · The Spacebar canvas: humans, embedded applications, and AI participants working on a single synchronized surface, under one permission model.

01 The thesis

The models are ready. They can reason, see, hear, and act. What they lack is a shared, stateful room to work alongside people.

A frontier model with no real surface to act on is an F1 driver in a rental car. The talent is not the constraint. The car is. A great car turns an average driver into a fast one, and an exceptional driver into a world champion.

Collaboration tools are everywhere; most now include AI. What they lack is a persistent, programmable surface where a human and an AI share context, see the same state, and act under the same rules. Notion holds the document; its AI is a sidebar. Zoom carries the conversation; its AI is a notetaker. The AI is always adjacent to the work, never intrinsic to it.

Spacebar is one synchronized space where every signal, every action, and every participant — human or model — flows into a single event stream. It is the F1 car.

Shared Event Stream

Time	Event	Actor
08:23:43:792	Audio chunk	participant_a
08:23:46:127	Video frame	participant_b
08:23:46:495	Cursor move	participant_a
08:23:46:591	Canvas object update	participant_a
08:23:47:004	Document change	participant_b
08:23:47:870	Tool call	agent_1

Fig. 01 · One shared event stream is the single source of truth for the space. Every signal observable; every action addressable; humans and agents share context.

02 Built for full-duplex models

The model is full-duplex.
The surface has to be too.

AI products were turn-based because the models were. Push-to-talk, wait, response. Every UI was built around that constraint. The constraint is gone. Models now stream perception and action simultaneously: hearing you while they speak, watching you while they listen, acting while they think.

A chat box with voice bolted on doesn't change the underlying loop. The user waits, the model waits, the conversation happens in turns.

Spacebar is built for full-duplex models: not a chatbot UI retrofitted with audio, but a substrate where every signal sits on the same stream, and every participant reads and writes simultaneously.

Three things every participant needs

01 Perception

Full-duplex sensing

Continuous parallel streams (audio, vision, action), not discrete turns. Models can do this now. Most workspaces can't.

02 Representation

Persistent internal state

Somewhere to refer back to. A canvas, an event log, a memory that survives the session. Without it, the model lives one prompt at a time. With it, the model has a place to come back to.

03 Loop

Self-iterative action

Observe, act, observe again, in a loop that closes inside the same surface — not across tool calls stitched together by an orchestrator. The model's actions land on the same stream as every human action.

Models can perceive.
Spacebar gives them a place to remember and a loop to act in.
In production today.

03 No environment switch

The agent meets the user where they already are.

The fastest way to lose a user is to ask them to leave their workflow. Open a new tab. Install a desktop client. Sign into a new tool. Move the work into the room where the AI lives.

Spacebar's Chrome and Firefox extensions invert that. The user installs the extension; nothing else changes. Same Chrome profile, same cookies, same logged-in sessions, same 2FA. The agent comes to them: a sidebar on whatever page they're on, with structured access to the same DOM.

No remote desktop. No VM. No "share your screen." The user stays in their browser.

Approach	User effort	Agent visibility
Spacebar, browser extension	Install once	Structured DOM + interaction stream
VM-based agent (status quo)	New environment	Screenshots, no user signal

Spacebar, browser extension: Same Chrome / Firefox. Same profile, cookies, auth. Agent reads live DOM, watches interaction.

VM-based agent (status quo): Remote browser on a machine the user has never used. Cookies don't carry. 2FA breaks. The workflow has to be rebuilt inside the VM.

Onboarding cost: zero.
Environment switch: none.
Time to first value: the next page load.

04 Canvas & capabilities

What the canvas is made of.

Spatial state

An infinite canvas that remembers exactly where everything is. Lay out documents, applications, browsers, whiteboard sketches, and video tiles where they belong, and they stay there. Close a space, come back a week later, and every object is exactly where it was left.

[Figure: documents and notes on the canvas (Annual Budget, Revenue Projection, Market Analysis) tagged Mon, Wed, and Returned Fri.]

Fig. 02 · Persistent spatial state: objects persist between sessions.

Live presence

Cursors, voice, video, and screen share sit on the same surface as the work itself. Video tiles are objects in the space, not a frame layered over it. Nothing here is a meeting tool bolted on.

Fig. 03 · Presence lives on the work surface, not in chrome around it.

Embedded software

Open web apps, desktop applications, or virtual machines directly inside a space. Server-provisioned VMs streamed over WebRTC, no install. Any number of users sharing control of one live instance.

[Figure: a shared spreadsheet titled Revenue Projection, with the Asia-Pacific row filled in: population 4.7 billion, market share 15%, 705.0 million potential customers, $87.50 average revenue per customer, $61.7 billion projected annual revenue.]

Fig. 04 · A server-hosted application, shared by default: one live instance, many participants.

Programmable runtime

Every object, event, and stream on the canvas is observable and addressable through one SDK. space.objects.add(…) to write, space.events.subscribe(…) to read. Any service or agent joins the same way a person does.

room.objects.add(…)
// every event observable
room.speak(stream)
// every action addressable

Fig. 05 · An SDK call, an addressable event, a canvas object. Actor: agent, permission: delegated.

Shared browser

A server-provisioned browser tab that the human and the agent operate simultaneously. Not a remote desktop, not a Live View onto an agent-controlled session. Both parties click, scroll, and type on the same live instance. The agent receives a continuous video feed of the human's interaction, not discrete screenshots. That signal (mouse trajectories, scroll pauses, abandoned inputs) is the telemetry that screenshot-based agents never see.

	Spacebar	Browserbase / Browser Use
Who drives	Human and agent simultaneously	Agent drives; human intervenes
What the agent sees	Continuous video + live DOM	Discrete screenshots or a11y tree
What the human feels	Native browsing	Remote control

The architecture difference sits above the headless-browser primitive, not within it. Spacebar uses infrastructure like that under the hood; the distinction is in the observation and control layer built on top. Full comparison with Browserbase →

Everything a real-time agent needs.

What a real-time multimodal AI needs	Where Spacebar already has it

To work multimodally (conversation, sight, action) on a single surface

Voice, video, cursor, gesture, screen share, every embedded browser, every canvas object — all on one surface, through one coherent interface. Not a stack of SDKs glued together.

Perceive 02

To see what humans are looking at, in real time

Live, structured access to every canvas object, cursor position, embedded browser, and video feed — all observable through one consistent interface.

To hear humans, including overlapping speech, while filtering out noise

Per-participant, server-side audio streams. Full-duplex. No “release the mic” turn-taking. Noise suppression and acoustic filtering applied per stream. STT provider configurable per session, not locked to one vendor.

To read embodied and social signals

Client-side perception streams: 478 face landmarks, 21 keypoints per hand, 20+ recognized gestures, attention score, engagement signals — all derived locally, in the browser. Only the results leave the device and are made available to the agent; the video stream follows standard WebRTC routing, as in any video call.

To read drawings and sketches as data, not just as images

Every stroke on the canvas is stored as a vector object — coordinates, shape, path — directly addressable through the same SDK as any other canvas element. Drawings are structured, addressable data from the moment of creation, not pixels that need interpretation.

Act 06

To act on the same surface as humans, through the same interface

A symmetric API: every canvas action a human takes is also an API call, through the same surface and the same permission model. Same interface, same rules, for any actor.

To act with bounded authority

An agent takes on the full persona of the user it represents. The same role assignments, space access, and device controls that bind a person bind it.

Persist 08

To remember — minutes, weeks, or months

Persistent space state, full journaled history, lossless replay: every event captured in sequence, used by the substrate to reconstruct context for the agent's memory. Short-term and long-term memory layers, active in the voice + vision agent.

Infrastructure 09

To run on any model, from any vendor

Pluggable STT, LLM, and TTS providers, selected per session. No single vendor is a hard dependency; when one degrades, new sessions route to another provider.

To respond fast enough that it feels like collaboration

50 ms event propagation, p99. The latency budget belongs to the model; the canvas adds effectively none.
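
The per-session provider selection described in the table above can be sketched as follows. This is a minimal illustration, not Spacebar's provider contract: the `Provider` shape, the `healthy` flag, and all provider names are hypothetical.

```typescript
// Hypothetical types: the real provider contract is not shown in this document.
type ProviderKind = "stt" | "llm" | "tts";

interface Provider {
  name: string;
  kind: ProviderKind;
  healthy: boolean; // assumed to be maintained by an external health check
}

// Pick the first healthy provider of each kind, in preference order.
// The choice is made once per session, so a provider that degrades later
// only changes where NEW sessions are routed.
function selectProviders(candidates: Provider[]): Record<ProviderKind, string> {
  const pick = (kind: ProviderKind): string => {
    const found = candidates.find((p) => p.kind === kind && p.healthy);
    if (!found) throw new Error(`no healthy ${kind} provider available`);
    return found.name;
  };
  return { stt: pick("stt"), llm: pick("llm"), tts: pick("tts") };
}

const session = selectProviders([
  { name: "stt-primary", kind: "stt", healthy: false }, // degraded, skipped
  { name: "stt-backup", kind: "stt", healthy: true },
  { name: "in-house-llm", kind: "llm", healthy: true },
  { name: "tts-primary", kind: "tts", healthy: true },
]);
// session.stt is "stt-backup": the degraded primary is bypassed.
```

Selecting at session start rather than per request keeps a live conversation on one voice and one model, at the cost of not failing over mid-session.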

05 What you can build

Two modes. One substrate.

Spacebar supports two distinct agent modes, not as features, but as first-class deployment patterns. Real-time agents join live sessions and act in the moment. Always-on agents run between sessions, on triggers, and return to a Space rather than sending a notification. Both run on the same substrate. Both are in production today.

01 / Voice + vision agent

A multimodal assistant that joins live rooms.

The agent joins a Spacebar space over WebRTC as a first-class participant, on the same surface as everyone in the space. It runs a streaming voice-activity detector against the live per-participant audio streams, transcribes each participant’s speech in real time through a pluggable provider, and watches the canvas as a live visual stream. Board images are sampled at the cadence of speaking activity, so the multimodal frame budget follows the conversation rather than being exhausted during silence. It speaks back through a pluggable streaming text-to-speech provider into the shared audio mix, and retains session memory in two tiers.

Real-time audio: Per-participant full-duplex media server, managed

Vision stream: Board images + canvas snapshots, sampled per frame

Reasoning: Pluggable LLM provider

Memory: Short-term + long-term per user
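
Sampling board images at the cadence of speaking activity can be sketched as a rate switch. The two intervals below are assumptions chosen for illustration; the source only states that sampling follows the conversation.

```typescript
// Assumed intervals: not taken from the source.
const SPEAKING_INTERVAL_MS = 500; // sample often while someone is speaking
const IDLE_INTERVAL_MS = 5000;    // sample rarely during silence

interface SamplerState {
  lastSampleAt: number; // timestamp (ms) of the last captured board image
}

// Decide whether to capture a board image at time `now`.
function shouldSample(state: SamplerState, now: number, anyoneSpeaking: boolean): boolean {
  const interval = anyoneSpeaking ? SPEAKING_INTERVAL_MS : IDLE_INTERVAL_MS;
  if (now - state.lastSampleAt >= interval) {
    state.lastSampleAt = now;
    return true;
  }
  return false;
}

// Simulate a stretch of constant activity, checking every 100 ms.
function countFrames(durationMs: number, speaking: boolean): number {
  const state: SamplerState = { lastSampleAt: 0 };
  let frames = 0;
  for (let t = 100; t <= durationMs; t += 100) {
    if (shouldSample(state, t, speaking)) frames++;
  }
  return frames;
}

const framesWhileSpeaking = countFrames(10000, true); // 20 frames in 10 s
const framesWhileIdle = countFrames(10000, false);    // 2 frames in 10 s
```

With these assumed intervals, ten seconds of silence costs a tenth of the frames that ten seconds of speech does, which is the budget behavior the paragraph above describes.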

[Figure: what the agent sees. (1) Per-participant audio, full duplex. (2) Canvas visual stream, sampled at a high rate while speaking and a low rate while listening. (3) Reasoning context. (4) Memory. Agent states: joined room, listening and watching, ready to act, permission scoped.]

Fig. 06 · What the agent sees: per-participant audio, sampled board images, LLM context, and memory tiers.

Vignette / Spreadsheet voiceover

The agent narrates the model as it changes.

A user opens a financial model on the canvas. The voice agent watches the cells. When a partner edits an assumption, the agent says, out loud: "You just dropped APAC market share from 18% to 15%. Revenue projection is down $12.3B. The change cascades into the terminal value on row 47." No prompt. No "explain this change." Vision sees the diff, voice narrates it, the conversation continues. Same model anyone else can run. Different substrate underneath.

02 / Canvas-native assistant

24 Structured tools

Runs as a sidebar inside the canvas itself. The board image is piped into every iteration, so the model always has full visual context. Every tool routes through the same authorization layer a human action would pass through; the assistant cannot do what the user it represents is not permitted to do. It runs multi-step loops (observe the board, call a tool, receive the result) until the requested change is complete or the loop depth is exceeded.
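
The loop described above can be sketched as a bounded iteration with an authorization check before each tool call. Everything except the tool name `add_generated_content` is hypothetical here: the `LoopDeps` interface, the `decide` stand-in for the model, and the status values are assumptions for illustration.

```typescript
// Hypothetical types; only the loop shape comes from the text above.
type ToolCall = { tool: string; args: Record<string, unknown> };
type Step = { done: true } | { done: false; call: ToolCall };
type Outcome = { steps: number; status: "complete" | "depth_exceeded" | "denied" };

interface LoopDeps {
  observeBoard: () => string; // stand-in for the board image
  decide: (board: string, lastResult: string | null) => Step; // stand-in for the model
  isAllowed: (userId: string, tool: string) => boolean; // shared authorization layer
  runTool: (call: ToolCall) => string;
}

function runAssistant(userId: string, deps: LoopDeps, maxDepth: number): Outcome {
  let lastResult: string | null = null;
  for (let depth = 0; depth < maxDepth; depth++) {
    const board = deps.observeBoard(); // observe on every iteration
    const step = deps.decide(board, lastResult);
    if (step.done) return { steps: depth, status: "complete" };
    // The agent is checked against the permissions of the user it represents.
    if (!deps.isAllowed(userId, step.call.tool)) return { steps: depth, status: "denied" };
    lastResult = deps.runTool(step.call);
  }
  return { steps: maxDepth, status: "depth_exceeded" };
}

// Toy run: keep adding notes until the board holds two of them.
const notes: string[] = [];
const outcome = runAssistant(
  "user_1",
  {
    observeBoard: () => notes.join("|"),
    decide: (board) =>
      board.split("|").filter(Boolean).length >= 2
        ? { done: true }
        : { done: false, call: { tool: "add_generated_content", args: { text: "note" } } },
    isAllowed: (_user, tool) => tool === "add_generated_content",
    runTool: (call) => {
      notes.push(String(call.args.text));
      return "ok";
    },
  },
  5,
);
// outcome is { steps: 2, status: "complete" }
```

Putting the permission check inside the loop, rather than once at the start, means every tool call is authorized individually, which matches the claim that every tool routes through the authorization layer.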

Boards: Create, duplicate, and organize boards. Set backgrounds, review and annotate content, and add generated material directly onto the canvas.

Discovery: Browse all boards and folders, build folder structures, and navigate to the current active board.

Rooms: List available rooms, create new ones, and assign participants to the right space.

Users: List all users in a space and see exactly who is currently active.

Device control: Force microphone and camera state for any participant or the entire space. Mute everyone but one participant. Or unmute an entire space. One call.

Embedded apps: Insert timers, stopwatches, and interactive widgets directly onto the canvas.

Generation & vision: Generate structured content onto the board and export board images to feed back into the model’s vision context.

Mobile / iOS + Android

Your model, on every surface
your users already have.

A user is on a phone, mid-task. They want the agent to help with what's on the screen — a form, a checkout, a doc. The standard answer: agent reads screenshots and guesses. Spacebar's answer: agent reads structured DOM, through the same SDK, same event stream, same permission model. Just on a six-inch surface.

iOS: Native screen capture over WebRTC

Android: Browser-delivered, automatable

Surface: Structured DOM, not screenshots

[Figure: the action loop in four steps. (1) Observe: the agent reads a revenue-by-region table on the board. (2) Call: add_generated_content(), with actor: agent, permission: inherited, target: canvas. (3) Result: a "Q2 Asia Competitive Summary" note is written to the canvas. (4) Repeat: the agent observes the updated board.]

Fig. 07 · The action loop: observe the board, call a structured tool through the shared authorization layer, write the result, observe again.

Both are production systems, not demos: evidence that a real-time multimodal agent can participate reliably in a live, multi-participant space. We built the substrate, scaled it under real customer load, and proved the cost structure holds. The numbers are measured, not modeled.

Mode 02 / Always-on agents

Most proactive agent products today are headless monitoring loops with notification surfaces. When the agent finishes, it sends you a Slack message. That model works for tasks with a single output. It breaks down the moment the task is ongoing, ambiguous, or needs a human to pick up and continue it.

Spacebar is different: when your agent finishes, it has been working in a Space. You walk back into the room it was working in, with the canvas already laid out, the sources already open, the draft already there. You don't read a summary of what happened. You see it.

Time trigger

Every morning at 7am, scan our RFP inbox and draft responses.

The agent opens the inbox, reads each RFP, checks the canvas for relevant past proposals, drafts a response, and leaves it pinned to the board — ready for your first review when you arrive.

Event trigger

Customer NPS drops below threshold — start the retention playbook.

The agent opens a Space, pulls the account history from the CRM, drafts a personalized outreach sequence, and flags the three most at-risk accounts with recommended next actions. No prompt required.

Layered

Monitored 12 supplier pages overnight. Flagged 3 changes. Drafted outreach for the one that matters.

You asked it to watch. It watched. When it found something real — a 7% price increase on a critical component — it logged it on the canvas, cross-referenced your contract terms, and drafted a response for your approval.

The handoff. When the agent comes back to you, it is not a notification. It is a Space with the work already laid out — sources open, canvas annotated, next steps visible. Every other proactive agent platform sends you a summary. Spacebar puts you back in the room it was working in.

06 Latency

Where the time goes.

Fig. 08 · Where the time goes

Turn-based AI in a browser tab, prompt to response, p50: ~2,500 ms

Model inference and network, typical, p50: ~600 ms

Spacebar event propagation, p99: 50 ms

The first two are p50 estimates, for context. The 50 ms is Spacebar’s measured p99: even the worst case clears the others’ typical case by more than an order of magnitude. Whatever a user waits on, almost none of it is the substrate.
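
The comparison above is simple arithmetic on the three figures quoted. The last line assumes, for illustration only, that propagation time simply adds to a typical inference round trip; the source does not state how the two combine.

```typescript
const turnBasedP50Ms = 2500;  // turn-based AI in a browser tab, p50 estimate
const inferenceP50Ms = 600;   // model inference and network, p50 estimate
const propagationP99Ms = 50;  // Spacebar event propagation, measured p99

// How many times slower the typical cases are than Spacebar's worst case.
const vsTurnBased = turnBasedP50Ms / propagationP99Ms; // 50
const vsInference = inferenceP50Ms / propagationP99Ms; // 12

// Assumption: total wait = inference + propagation (a simple additive model).
const substrateShare = propagationP99Ms / (inferenceP50Ms + propagationP99Ms); // about 0.077
```

Both ratios exceed 10, which is what "more than an order of magnitude" refers to. Under the additive assumption, the substrate accounts for under 8% of the wait.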

07 Substrate

Built for an agent to live in.

Building a system that holds live video, an embedded browser, a shared document, and a collaborative whiteboard in one synchronized space — all of it surviving a dropped connection — took five years. It is the infrastructure a real-time agent needs to see, hear, act, and remember alongside people, without adding latency, context loss, or broken permissions.

Native

CRDT engine

A custom CRDT engine on the hot path, computing minimum binary deltas from a client-supplied state vector. Runs as a compiled native service with multi-threaded execution, process-isolated from socket I/O to keep the hot path fast under load.
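
The idea of computing a minimal delta from a client-supplied state vector can be sketched as follows. This is an illustration under stated assumptions, not the engine's implementation: the real service emits binary deltas, while this sketch uses plain objects, and the `Op` and `StateVector` shapes are hypothetical.

```typescript
// Hypothetical shapes for illustration.
interface Op {
  replica: string; // which participant produced the operation
  clock: number;   // per-replica sequence number, starting at 0
  payload: string;
}

// replica id -> number of that replica's ops the client has already seen
type StateVector = Record<string, number>;

// Send only the operations the client has not seen yet.
function computeDelta(log: Op[], clientVector: StateVector): Op[] {
  return log.filter((op) => op.clock >= (clientVector[op.replica] ?? 0));
}

// Summarize a log as a state vector.
function stateVectorOf(log: Op[]): StateVector {
  const sv: StateVector = {};
  for (const op of log) {
    sv[op.replica] = Math.max(sv[op.replica] ?? 0, op.clock + 1);
  }
  return sv;
}

const serverLog: Op[] = [
  { replica: "a", clock: 0, payload: "add note" },
  { replica: "a", clock: 1, payload: "move note" },
  { replica: "b", clock: 0, payload: "draw stroke" },
];

// The client has seen one op from replica "a" and nothing from "b".
const delta = computeDelta(serverLog, { a: 1 });
// delta holds "move note" and "draw stroke"; "add note" is not resent.
```

A client that is already up to date sends a state vector equal to `stateVectorOf(serverLog)` and receives an empty delta, which is what keeps reconnects cheap.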

Dedicated

Worker pool

CRDT apply and encode operations run on a dedicated worker pool, sized to leave headroom on the main loop for socket I/O.

Layered

Cache

Hot in-memory state, backed by a shared cache, backed by versioned durable storage. Versioning invalidates the hot tier whenever a snapshot lands: warm-start by design.
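
One way to read the three tiers is as a read-through cache in which a landed snapshot bumps the version and drops the hot copy. The sketch below follows that reading; the class, its method names, and the use of in-memory maps as stand-ins for the shared cache and durable storage are all assumptions.

```typescript
interface Versioned<T> {
  version: number;
  value: T;
}

type Tier = "hot" | "shared" | "durable";

class LayeredCache<T> {
  private hot = new Map<string, Versioned<T>>();
  private shared: Map<string, Versioned<T>>;  // stand-in for the shared cache
  private durable: Map<string, Versioned<T>>; // stand-in for versioned durable storage

  constructor(shared: Map<string, Versioned<T>>, durable: Map<string, Versioned<T>>) {
    this.shared = shared;
    this.durable = durable;
  }

  // Read through hot -> shared -> durable, promoting the entry on the way back.
  get(key: string): { value: T; tier: Tier } | undefined {
    const h = this.hot.get(key);
    if (h) return { value: h.value, tier: "hot" };
    const s = this.shared.get(key);
    if (s) {
      this.hot.set(key, s);
      return { value: s.value, tier: "shared" };
    }
    const d = this.durable.get(key);
    if (d) {
      this.shared.set(key, d);
      this.hot.set(key, d);
      return { value: d.value, tier: "durable" };
    }
    return undefined;
  }

  // A new snapshot bumps the version and invalidates the hot copy,
  // so the next read warm-starts from the shared tier.
  landSnapshot(key: string, value: T): void {
    const prev = this.durable.get(key)?.version ?? 0;
    const next: Versioned<T> = { version: prev + 1, value };
    this.durable.set(key, next);
    this.shared.set(key, next);
    this.hot.delete(key);
  }
}

const cache = new LayeredCache<string>(new Map(), new Map());
cache.landSnapshot("space_1", "snapshot v1");
const firstRead = cache.get("space_1");  // served from the shared tier
const secondRead = cache.get("space_1"); // served from the hot tier
```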

Async

Compaction

Snapshot compaction runs out-of-band with cooldown to prevent cascading recompactions. The hot path never blocks on snapshot work. That is how the 50 ms p99 holds under load.
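
A cooldown gate of the kind described can be sketched in a few lines. The class name, the 30-second cooldown, and the per-space bookkeeping are assumptions; the source states only that compaction is out-of-band and rate-limited by a cooldown.

```typescript
class CompactionScheduler {
  private lastRun = new Map<string, number>(); // spaceId -> last start time (ms)
  private cooldownMs: number;

  constructor(cooldownMs: number) {
    this.cooldownMs = cooldownMs;
  }

  // Returns true if compaction for this space may start at `now`.
  // The hot path only calls this cheap check; the compaction itself
  // would run elsewhere (for example on a background worker).
  tryStart(spaceId: string, now: number): boolean {
    const last = this.lastRun.get(spaceId);
    if (last !== undefined && now - last < this.cooldownMs) return false;
    this.lastRun.set(spaceId, now);
    return true;
  }
}

const scheduler = new CompactionScheduler(30000); // assumed 30 s cooldown
const decisions = [0, 10000, 30000].map((t) => scheduler.tryStart("space_1", t));
// decisions is [true, false, true]: the request at 10 s is suppressed.
```

Because the gate is keyed per space, a burst of snapshots in one space cannot trigger repeated recompaction there, while other spaces remain free to compact.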

Distributed

Ownership lock

Per-space ownership locks ensure one server compacts a given space at a time; ownership tracking enables failover detection. The mechanism behind reliable sharded sessions.
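
A per-space ownership lock is often built as a lease with an expiry, which is the assumption made in this sketch; the source does not say how ownership is tracked. A production version would live in a shared store with an atomic compare-and-set rather than in a local map.

```typescript
interface Lease {
  owner: string;     // server currently allowed to compact the space
  expiresAt: number; // ms timestamp after which the lease is considered dead
}

class OwnershipTable {
  private leases = new Map<string, Lease>();

  // Acquire or renew ownership of a space. Succeeds if the space is unowned,
  // already owned by this server, or the previous owner's lease has expired.
  acquire(spaceId: string, serverId: string, now: number, ttlMs: number): boolean {
    const current = this.leases.get(spaceId);
    const free =
      current === undefined || current.expiresAt <= now || current.owner === serverId;
    if (free) this.leases.set(spaceId, { owner: serverId, expiresAt: now + ttlMs });
    return free;
  }

  ownerOf(spaceId: string, now: number): string | undefined {
    const lease = this.leases.get(spaceId);
    return lease && lease.expiresAt > now ? lease.owner : undefined;
  }
}

const table = new OwnershipTable();
const aGetsIt = table.acquire("space_1", "server-a", 0, 1000);     // true
const bBlocked = table.acquire("space_1", "server-b", 500, 1000);  // false, lease still live
const bTakesOver = table.acquire("space_1", "server-b", 1000, 1000); // true, lease expired
```

In this model, failover detection falls out of the expiry: a server that stops renewing loses the space once its lease runs out.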

Self-healing

Desync detection

Client and server each maintain expectations about the next updates; deviation is detected within milliseconds and the system recovers without losing a single state update. An immutable mutation log captures every create, update, and delete.
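
Maintaining an expectation about the next update can be as simple as a sequence counter, as in the sketch below. The sequence-number scheme and the resend request are assumptions for illustration; the source does not specify the mechanism.

```typescript
interface Update {
  seq: number; // assumed monotonically increasing per space
  payload: string;
}

class UpdateReceiver {
  private expected = 0;
  readonly applied: string[] = [];

  // Apply an update if it is the one expected next.
  // Returns the inclusive range to re-request when a gap is detected, else null.
  receive(update: Update): { from: number; to: number } | null {
    if (update.seq < this.expected) return null; // duplicate: already applied
    if (update.seq > this.expected) {
      // Gap: nothing is applied out of order. The missing range, plus this
      // update, can be replayed from an append-only mutation log.
      return { from: this.expected, to: update.seq };
    }
    this.applied.push(update.payload);
    this.expected++;
    return null;
  }
}

const receiver = new UpdateReceiver();
receiver.receive({ seq: 0, payload: "create note" });
const gap = receiver.receive({ seq: 2, payload: "delete note" });
// gap is { from: 1, to: 2 }: update 1 was missed, so 1 through 2 are re-requested.
```

Detection costs one integer comparison per message, which is consistent with catching a desync within milliseconds of the first out-of-order update.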

08 Why screenshot-based agents fail

Pixels lose too much.

The first wave of computer-use agents took the obvious approach: render a remote browser, ship screenshots to the model, parse pixels, predict the next click. It works in a demo. It falls apart in production.

Three failure modes show up at scale. Latency — screenshots are heavyweight and discrete; the agent is always reasoning about a frame that is already stale. Loss of state — pixels are not structure; a dropdown that opened mid-frame, a hover state that mattered, a scroll position that contextualized the next action, all flatten into one ambiguous image. No user signal — when the human is in the loop, the agent sees the outcome of an action, not the path to it. How a user navigates, hesitates, corrects course — all the signal that makes intent legible — collapses between frames.

Spacebar's shared browser uses the same headless-browser primitives the screenshot approach does. The difference is the observation and control layer built on top: continuous video, live DOM, and a structured event stream the agent can subscribe to. The model gets a feed, not a slideshow.

09 Build on it, or build it

The model layer is converging.
The substrate isn't.

Model capability is converging: real-time APIs from Google and OpenAI, computer-control surfaces from Anthropic, tool-using agents everywhere. The substrate beneath them is not.

A real-time agent needs a persistent, permissioned, multiplayer surface to see, hear, act, and remember alongside people. Building one is a multi-year systems project. The opportunity cost is real: every month spent building this layer is a month not spent on your actual product. Spacebar is that surface, in production. The question is whether you build it or build on it.

For builders weighing the alternative
Save 18–24 months of infrastructure work.
For a 20-person team, this layer is a two-year build at minimum. Even with parallel workstreams and an unusually senior bench, you don't ship it in under eighteen months. Every month on substrate is a month not on your product.

What you need	What Spacebar provides	What you'd need to build
Multiplayer persistent canvas: CRDT state, snapshots, replay	Available now	12-24 mo
Per-participant full-duplex audio + video over WebRTC	Available now	6-12 mo
Full-duplex media server: concurrent send/receive per participant, sub-frame audio mixing, no push-to-talk fallback	Available now	9-18 mo
Server-hosted shared browser, doc, and app surface inside the canvas	Available now	12+ mo
Symmetric agent action API: shared surface, shared permission model, per-actor scoping	Available now	6+ mo
Short-term + long-term memory tiers, per participant	Available now	3-6 mo
Scoped agency: agents inherit human permissions, no privilege escalation	Available now	3+ mo
Bring-your-own-model provider contract: LLM, STT, TTS endpoints all pluggable; weights never leave your infrastructure	Available now	Custom build
Enterprise compliance controls: SOC 2, HIPAA, GDPR	Available now	12+ mo
Real-user operating data: per-session cost, failure modes, latency, and verified throughput (10,000+/sec across spaces)	Available now	Unavailable

Estimates assume production-grade implementation: hardened, redundant, and operationally stable under real load.

You will need most of this. Even with a large team working in parallel, building it is at least two years of work. Build on Spacebar, and free that time for your product.

10 Where Spacebar is essential, and where it isn't

When you need this.
When you don't.

Not every AI product needs this layer. A voice assistant that answers a question and hangs up doesn't need a persistent canvas. A chatbot in a marketing page doesn't need a multiplayer event stream. Single-turn, single-user, no persistent state. You don't need a substrate. You need an inference endpoint.

Spacebar is built for the cases that don't fit that shape. Work that's ongoing, multi-party, browser-grounded, long-horizon, or visually structured. Conversations that have to survive a week. Two humans and an agent looking at the same thing at the same time. An agent coming back to a partial task and picking up where it left off.

Essential

Where the substrate earns its keep

— Multi-party live sessions, humans + agents on one surface
— Long-horizon work that survives a session
— Browser-grounded agents on real, live web surfaces
— Enterprise + regulated, with auditable event streams
— Visually-structured tasks where the canvas is the artifact

Probably not needed

Where you should just hit an API

— Single-turn voice assistants (ask, answer, hang up)
— Embedded chatbots with no persistent state
— Single-user productivity tools, no cross-session context
— Headless batch agents whose output is one notification
— Anything self-contained in one prompt

11 Make it yours

Your model, your brand,
our substrate.

What you see at spacebar.ai is one shape this can take. There are others. Some frontier labs already have an opinionated app and don't want another UI on top. They want the runtime, not the front end: the same substrate that powers Pencil Spaces, running invisibly as the engine under their own product. Available to their engineers, never seen by their users.

Three partnership shapes

Most conversations with a frontier lab end up at one of these three. None of them require the others.

01 — Data

Data partnership

Trajectory and interaction data, sourced under contract, licensed for RL and model training. Real users, real workflows, real consent. Aggregated and anonymized to spec.

02 — Integration

Integration partnership

Your model becomes a first-class provider inside Spacebar. Pluggable LLM / TTS / STT contract, weights on your infrastructure, traffic routes to you. Your users meet your model through our canvas.

03 — Product surface

Product-surface partnership

Spacebar as the runtime under your app. Your brand, your domain, your design — our substrate, invisible. The lab's own product surface, with full multimodal collaboration built in.

If one of these is the conversation you want to have, tell us which one. Each starts the same way: a call, an architecture review, and a short paid pilot.

12 Platform

A space any application, service, or agent can join.

Every object, event, and stream on the Spacebar canvas is observable, addressable, and actionable through one coherent SDK. Build an app, an integration, a service, or an autonomous agent — each joins the canvas the same way a person would. The protocol is symmetric: human and agent reach the same surface through the same API. Any permission a human holds, an agent can hold. The system draws no distinction between what is available to a person and what is available to a program.

// Make any web component multiplayer in a Spacebar space.
// @pncl/mario — Spacebar's real-time SDK
import { MarioClient } from "@pncl/mario";

const space = await MarioClient.join("space_8KXq...");
space.bind(myComponent);

// every state change now syncs to everyone in the space.
// presence, CRDT conflict resolution, cursors, undo/redo,
// version history, and snapshot recovery: free.

// Observe everything happening in a space, in real time.
// (Reuses the `space` handle joined above.)

space.events.subscribe(event => {
  // event.kind: "cursor" | "draw" | "speak" | "type"
  //             | "object.add" | "object.move" | ...
  // event.actor, event.timestamp, event.payload
});

// Drive the canvas the same way a human would.
await space.objects.add({
  kind: "stickyNote",
  x: 320, y: 480,
  text: "Try this approach instead."
});

await space.objects.move("obj_a91...", { x: 600, y: 480 });
await space.speak({ stream: ttsStream });
await space.browser.type("doc_b14...", "Hello.");

An agent built against this SDK is not integrated into the canvas. It is a participant in it, with the same reach and the same limits as the person sitting next to it.

13 Connectors & reach

Connect anything. In minutes, not months.

Most real-time AI platforms are rigid: the surface they ship is the surface you get. Spacebar is designed to be extended. Everything in a space — presence, state, events, audio, browser, memory — is observable and writable through the same SDK your agent uses. Adding a connector is adding a participant that speaks a specific protocol. That is all it is.

MCP server

native

Any agent that speaks MCP can speak Spacebar out of the box. Exposes the full operational context (users, spaces, sessions, recordings, transcripts, presence, audit logs, billing, and scheduling availability) through standard MCP. No custom integration code required.

SDK + REST API

extensible

Every connector on this list was built using the same public SDK and REST API available to you. If a connector does not exist yet, building one takes hours, not weeks. The event model is uniform: subscribe to anything, write to anything, from any runtime.

Webhooks

HMAC-signed

HMAC-signed outbound events on session, recording, and presence state. Drop a URL and start receiving structured payloads immediately — no polling, no SDK required on the receiving end.

Meeting adapters

bots

Adapter bots join the meeting tool your team already uses — Zoom, Meet, Teams — bringing the canvas and the agent surface with them. Your AI joins the call as a participant, not a sidebar.

Browser extensions

Chrome · Firefox

Surface a Spacebar sidebar on any webpage. Useful for building context-aware agents that work alongside users in their existing browser workflows, without redirecting them to a new URL.

Calendar

Google · Outlook · Office 365

Bi-directional session sync. Sessions appear on the user's calendar; calendar events can trigger space creation. Two lines of configuration, not two weeks of integration.

Mobile

iOS · Android

iPhone and iPad with native screen capture over WebRTC. Full canvas on Android through Chrome or any modern browser. Browser-based delivery is intentional: browsers are automatable, which matters for agent integrations running on mobile surfaces.
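
For the HMAC-signed webhooks listed above, a receiver would typically verify the signature before trusting the payload. The sketch below uses Node's built-in `node:crypto` module. The algorithm (SHA-256), the hex encoding, the secret, and the `session.started` event name are assumptions; the source does not document the signing scheme or header names.

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

// Assumption: the signature is a hex-encoded HMAC-SHA256 of the raw request body.
function signPayload(secret: string, rawBody: string): string {
  return createHmac("sha256", secret).update(rawBody, "utf8").digest("hex");
}

function verifySignature(secret: string, rawBody: string, signatureHex: string): boolean {
  const expected = Buffer.from(signPayload(secret, rawBody), "hex");
  const received = Buffer.from(signatureHex, "hex");
  // timingSafeEqual throws on length mismatch, so compare lengths first.
  return expected.length === received.length && timingSafeEqual(expected, received);
}

// Hypothetical payload and secret, for illustration only.
const body = '{"event":"session.started"}';
const signature = signPayload("example-secret", body);
const accepted = verifySignature("example-secret", body, signature);
const rejected = verifySignature("example-secret", body + " ", signature);
```

Verification should run against the raw bytes of the request body, before any JSON parsing, since re-serializing can change whitespace and invalidate the signature.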

14 Operating data & compliance

What it costs, measured in production.

Pencil Spaces, our own platform built entirely on Spacebar, has run paid customer workloads for over four years. The numbers are measured, not estimated: per-session cost, failure modes, cost structure, and uptime. Across that four-year window, availability has held above 99.99%. Status page →

Technical evaluators: reach out to partnerships@spacebar.ai for the full data package.

Spacebar monetizes as infrastructure: usage-based pricing for builders, enterprise licensing for deployers, and trajectory data licensing for frontier labs. Working with frontier labs on training data? See spacebar.ai/labs →

99.99%+ measured availability

50 ms p99 event propagation

200,000,000+ minutes served, to date
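For a sense of scale, an availability figure converts directly into a downtime budget. The arithmetic below is generic and not taken from Spacebar's data package. It treats 99.99% as a floor and assumes an average year of 365.25 days.

```python
MINUTES_PER_YEAR = 365.25 * 24 * 60  # 525,960 minutes in an average year


def downtime_budget_minutes(availability: float, years: float = 1.0) -> float:
    """Maximum downtime, in minutes, consistent with an availability fraction."""
    return (1.0 - availability) * MINUTES_PER_YEAR * years


print(round(downtime_budget_minutes(0.9999), 1))           # 52.6 minutes per year
print(round(downtime_budget_minutes(0.9999, years=4), 1))  # 210.4 minutes over four years
```

So holding above 99.99% for four years means less than about three and a half hours of total downtime across the whole window.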

Compliance

SOC 2 · HIPAA · GDPR

Scoped agency. An agent takes on the full persona of the user it represents and nothing more. The same role assignments, space access, and device controls that bind a person bind it. The authorization model prevents privilege escalation: an agent cannot exceed its delegated permissions.
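One simple way to picture this guarantee is as a set intersection: whatever an agent asks for, it receives only the subset its user already holds. The sketch below is a minimal model of that idea, not Spacebar's authorization code. The `Principal` type and the permission strings are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Principal:
    name: str
    permissions: frozenset


def delegate(user: Principal, agent_name: str, requested: set) -> Principal:
    """Create an agent whose permissions are capped by the delegating user."""
    # The intersection guarantees the agent never holds a permission the user lacks.
    granted = frozenset(requested) & user.permissions
    return Principal(agent_name, granted)


def is_allowed(principal: Principal, action: str) -> bool:
    return action in principal.permissions


anna = Principal("anna", frozenset({"canvas:read", "canvas:write", "audio:speak"}))
agent = delegate(anna, "anna-agent", {"canvas:read", "canvas:write", "billing:admin"})

print(is_allowed(agent, "canvas:write"))   # True
print(is_allowed(agent, "billing:admin"))  # False: the user never had it
```

Because the cap is applied when the agent is created, no later request from the agent can widen it.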

Tenant isolation

Terraform-managed regional infrastructure: GKE, object storage, and cache. JWT-scoped real-time channels, validated at every message, not just at connection time. Perception models run in-browser: face landmarks and gesture signals are derived locally and only the results leave the device. The video stream follows standard WebRTC routing.
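"Validated at every message, not just at connection time" can be sketched as a check that runs on each inbound frame: verify the token signature, check expiry, then check that the target channel is in the token's scope. The example below hand-rolls HS256 with the standard library to stay self-contained. The `channels` claim, the channel naming, and the choice of HS256 are assumptions. A real deployment would normally use a vetted JWT library.

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def sign_token(claims: dict, key: bytes) -> str:
    """Issue a compact HS256 JWT (helper for this sketch only)."""
    header = _b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    payload = _b64url(json.dumps(claims).encode())
    sig = hmac.new(key, f"{header}.{payload}".encode(), hashlib.sha256).digest()
    return f"{header}.{payload}.{_b64url(sig)}"


def authorize_message(token: str, key: bytes, channel: str, now: float) -> bool:
    """Check signature, expiry, and channel scope for a single message."""
    try:
        header_b64, payload_b64, sig_b64 = token.split(".")
        header = json.loads(_b64url_decode(header_b64))
        claims = json.loads(_b64url_decode(payload_b64))
        sig = _b64url_decode(sig_b64)
    except ValueError:
        return False  # malformed token
    if not isinstance(header, dict) or not isinstance(claims, dict):
        return False
    if header.get("alg") != "HS256":
        return False  # refuse unexpected algorithms, including "none"
    signed = f"{header_b64}.{payload_b64}".encode()
    expected = hmac.new(key, signed, hashlib.sha256).digest()
    if not hmac.compare_digest(expected, sig):
        return False
    if claims.get("exp", 0) <= now:
        return False
    # "channels" is a hypothetical claim listing the channels this token may use.
    return channel in claims.get("channels", [])


key = b"demo-key"
now = time.time()
token = sign_token(
    {"sub": "anna-agent", "channels": ["space:42:events"], "exp": now + 60}, key
)

print(authorize_message(token, key, "space:42:events", now))        # True
print(authorize_message(token, key, "space:99:events", now))        # False: out of scope
print(authorize_message(token, key, "space:42:events", now + 120))  # False: expired
```

The third call is the case that connection-time validation misses: a socket opened with a valid token stays open, but messages stop being accepted once the token expires.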

15 Custom solutions

Built to your specification.

Not every use case fits the standard platform. When yours does not, we build to specification.

Research & frontier systems

A substrate for human-AI collaboration at the frontier.

A bespoke environment for teams building where the field is still open: custom memory architecture, full session instrumentation, configurable tool sets, and a shared surface where your model works alongside your researchers in real time — observable, steerable, and debuggable at every step.

Custom memory · Full instrumentation · Research-grade · Multimodal

For example — demos

A purpose-built space for demonstrating your AI.

A controlled environment for showing your system to investors, customers, or conference audiences. Custom canvas layout, branded, silent by default — only what you want visible, nothing you do not. Live in front of anyone.

Custom layout · Branded · Audience-ready

For example — consults

A structured session environment for high-stakes conversations.

An AI agent participates in a real consultation alongside a professional — seeing the same documents, hearing the same conversation, assisting in real time. Role-scoped, fully auditable, HIPAA-ready on request. Built for the constraints of regulated industries.

Role-scoped · Auditable · HIPAA-ready

Every custom build starts with a conversation. Tell us what you need →

16 FAQs

Frequently asked questions

17 Contact

Get in touch.

Tell us what you’re building and we’ll route you to the right person. Or reach out directly. Whichever is easiest for you. If your use case calls for something beyond the standard platform, we build to specification.

Call or text: (650) 550-9341

Book a time: Find a slot →

Organizations

sales@spacebar.ai (deployment, procurement, support)

Developers

engineering@spacebar.ai (runtime, SDK, MCP server)

Labs & partners

partnerships@spacebar.ai (research and infrastructure partnerships)

18 Team

The people who built it

Spacebar was built by the same team behind Pencil Spaces — four years of production at scale, carrying real customer workloads. The substrate was not designed in theory.

Co-founder & CEO

Ayush Agarwal

Co-founder & CTO

Amogh Asgekar

Head of Product

Imran Ahmed

COO

Swati Khandelwal
