Project Houston Weekly · June 29–July 3 (Week 2)
Last week we set up Houston to access Cambrian’s files and talk to our team. This week we let it reason over the raw files themselves. Let us call this task as shipping bytes to the metal: sending the actual bytes of videos, documents, spreadsheets down to a powerful model.
Given that Cambrian is collecting a lot of data off each jobsite, we cannot send all the data to the metal. So we came up with a schema that is agent readable and encourages a bijective mapping between files to provide a source of truth. The result was that Houston has an improved ability to “see”, i.e., read the photos, renderings, and plans you send it. This is a small starting point in bringing Spatial AI into Houston.
Shipping bytes to the metal
As mentioned above, construction generates far too much data to shove wholesale into a context window, so the trick is routing only the relevant bytes to the model. Every file now gets a small Git pointer (e.g. an index card) which includes a thin spine (where it lives, its hash, its type) plus a payload of some meta data. These Git pointers bridge Drive and GitHub so a signed PDF and a code file share one source of truth.
As shown in the figure below, when you tag Houston with a photo, the following sequence occurs: your message fires a Google Workspace event onto a Pub/Sub topic that wakes Houston instantly instead of polling on a clock as it was setup earlier. The photo’s pixels travel separately to the metal, as attachment bytes, alongside your question.
Before this week; After this week
Before this week, Houston could file a photo but not open it. Now it can read the image, answer, and refer you to the source it read:
Before this week
kelton_finished_exterior_dusk.png in Drive — I can’t open it to tell you what’s inside.
After this week
A potentially useful upgrade that is still in progress is reading multiple images and comparing them. Here’s a test of Houston where Sasha posted two renderings and asked whether they matched his intent:

• Left — the hedge is tall and dense, right against the driveway.
• Right — shorter and thinner, leaving a bigger gap to that neighbor.
So for ficus pushed closer to the right neighbor as a taller screen, the left rendering is closer to your intent. One other change: the left has an open lawn, the right a raised planter wall with drought planting. (compared 2 renderings · unverified vision read)
Quality of Life Improvements: Speed and Rendering
Until this week Houston checked for messages on a clock cycle. This week we replaced the clock cycle with a listener: a posted message wakes Houston the moment it lands to minimize lag.
Before this week
After this week
A small quality of life improvement was also to fix Houston’s Chat formatting this week, as Google does not use markdown. This enables its tables and text to render more cleanly.
Before this week
**Address** | **Sold** | **Days**
| 2725 Midvale | $1.42M | 12 |
| 1847 Greenfield | $1.30M | 21 |
After this week
• 2725 Midvale — sold $1.42M, 12 days on market
• 1847 Greenfield — sold $1.30M, 21 days on market
Goal for Next Week
The goal for next week is to improve the ability to reason over imagery. We noticed that the field crew is taking pictures of long receipts (e.g. 30 line items) across different photos. Image handling is now not just important for the Spatial AI and plans, but comprises a large amount of our numerical data as well. This will be one priority for Houston next week.
A quality of life improvement we will make will be speed. Houston is actually slower now, because we’ve shipped a bunch of data to the metal, that now slows down the model. How to handle caching and the context window will be on our mind for the coming week.
Houston backlog
| Backlog Task | What it is |
|---|---|
| Update Houston to Create, Read, Update, Delete | Round out full CRUD over the company’s records and documents from chat, so any entity can be created, read, changed, or removed through one reasoned, gated path. |
| Improve Houston’s image handling | Build on this week’s multi-image vision: sharper grounding, image generation and variation, and eventually video. |
| Interface Houston with Sentinel and Horizon | Connect Houston to the Sentinel and Horizon systems so they share data and act as one. |
| Move Houston to the right server hardware | Get the always-on loop off the laptop onto proper hosting, with secrets in a store and its own service identity. |
| Continue to evolve Cambrian’s data schema | Keep refining the source-of-truth schema for files and the registry so the data stays clean as the company grows. |