Question 1

What is FDM-1 and who built it?

Accepted Answer

FDM-1 is a foundation model for general computer use built by Standard Intelligence (standard intelligence pbc), announced February 23, 2026. Unlike prior computer-use agents that fine-tune a vision-language model on screenshots, FDM-1 trains and infers directly on video at 30 FPS. It was trained on a portion of an 11-million-hour screen recording dataset labeled by a custom inverse dynamics model. The team positions it as the first fully general computer action model.

Question 2

How is FDM-1 different from screenshot-based agents like Claude Computer Use or OpenAI's Operator?

Accepted Answer

Traditional computer-use agents fine-tune a VLM on contractor-annotated screenshots, which limits them to a few seconds of context, low framerates, and short-horizon tasks. FDM-1 instead trains directly on 30 FPS video and uses a video encoder that compresses ~2 hours into 1M tokens, giving it minute-scale context. It also avoids per-task reinforcement learning environments, learning unsupervised from the open internet's video corpus. Based on our analysis of 870+ AI tools, this is the only Automation entry that trains a custom video foundation model end-to-end for computer use.

Question 3

What can FDM-1 actually do today?

Accepted Answer

Standard Intelligence demonstrated FDM-1 performing multi-action CAD sequences in Blender (including extruding faces on an n-gon to make a gear), exploring and fuzzing complex websites, and driving a car in the real world — all at 30 FPS. The CAD demo uses OS checkpoints created at successful operations (extrude, select, etc.) to enable test-time compute via a forking VM. The blog post emphasizes that capabilities consistently improve with scale, and the team frames the current model as the first step toward CAD, finance, engineering, and ML-research coworker agents.

Question 4

How much does FDM-1 cost and how do I access it?

Accepted Answer

FDM-1 has no published pricing or self-serve access as of the February 23, 2026 announcement. Standard Intelligence describes it as a research milestone in a blog post at si.inc/posts/fdm1/, and access appears to be limited to enterprise or research partnerships. Compared to other Automation tools in our directory that publish $20–$200/month tiers, FDM-1 sits firmly in the enterprise / contact-sales segment with no free or developer tier announced.

Question 5

What are the technical components of FDM-1's training recipe?

Accepted Answer

The training recipe has three core components, all described in the launch post. First, a video encoder that compresses approximately 2 hours of 30 FPS video into 1 million tokens, enabling long-context training. Second, an inverse dynamics model that labels raw screen recordings with the actions that produced them, removing the need for contractor annotation. Third, a forward dynamics model that predicts future frames conditioned on actions, which is the component used to drive the agent at inference time.

FDM-1 Doesn't Have a Free Plan — Here's What It Costs

⚡ Quick Verdict

Who Should Pay for This

Best For

What Users Say About FDM-1

👍 What Users Love

👎 Common Concerns

Frequently Asked Questions

What is FDM-1 and who built it?

How is FDM-1 different from screenshot-based agents like Claude Computer Use or OpenAI's Operator?

What can FDM-1 actually do today?

How much does FDM-1 cost and how do I access it?

What are the technical components of FDM-1's training recipe?

Ready to Get Started?

More about FDM-1

FDM-1 Doesn't Have a Free Plan — Here's What It Costs

⚡ Quick Verdict

Who Should Pay for This

Best For

What Users Say About FDM-1

👍 What Users Love

👎 Common Concerns

Frequently Asked Questions

What is FDM-1 and who built it?

How is FDM-1 different from screenshot-based agents like Claude Computer Use or OpenAI's Operator?

What can FDM-1 actually do today?

How much does FDM-1 cost and how do I access it?

What are the technical components of FDM-1's training recipe?

Ready to Get Started?

More about FDM-1