Posted by: Jordan Kretchmer
Posted on 05/23/2025
We’re thrilled to announce that Outlander has led the $3.5M Seed round in DiffuseDrive, a company that’s redefining how AI models see—and learn from—the physical world.
If you’ve ever worked in computer vision, you know the data problem. Real-world data is messy, expensive, and worst of all—limited. Training a model to drive a car, inspect a pipeline, or pilot a drone doesn’t just require millions of images. It requires collecting images of every edge case, every lighting condition, and every unexpected variable. Getting that data the traditional way—by deploying hardware into the world or building complex simulations—takes months or years and still only captures a small percentage of the data needed to build reliable models. DiffuseDrive does it in hours.
Bálint (CEO) and Roland (CTO) met while leading multimillion-dollar efforts to build autonomous driving Ground Truth systems at Bosch, where they spent years wrestling with the scarcity of high-quality training data. They’ve seen firsthand how slow, traditional data pipelines can hold back even the most capable models. What started as an internal innovation effort turned into a conviction: the only way to truly scale physical AI was to rethink data generation from the ground up. So Bálint and Roland left Bosch and started DiffuseDrive to solve the problem their way.
What sets DiffuseDrive apart isn’t just their speed or visual fidelity. It’s the way the platform closes the loop. The solution analyzes gaps in a customer’s existing training images, then generates photorealistic, perfectly annotated synthetic images to fill those gaps—on demand, tuned to specific sensors, and validated against real-world benchmarks. It’s data infrastructure for the next generation of autonomous systems.
As Bálint puts it, “The future of AI is in how it interacts with the real world. And real-world AI needs real-world-grade data, fast. We’re not just making synthetic images. We’re building infrastructure to make physical AI systems smarter every day.” He added, “Anyone can generate pretty images. But data that actually improves model performance, that’s the hard part. That’s what we’ve solved.”
From the outset, they’ve been customer-first. Even before incorporation, they were delivering pilot datasets to enterprise design partners. Bálint still leads every sale himself, not because they lack a team but because he wants to hear customer feedback firsthand. Roland, meanwhile, has engineered a backend that integrates seamlessly with enterprise ML pipelines and can generate asset-specific datasets in a matter of hours. The result is a company that’s unusually sharp in both vision and execution.
Their vision is ambitious: to be the foundational dataset powering autonomy across vehicles, satellites, robotics, and beyond. But they’ve already proven they can go from zero to traction in months, with real revenue, real customers (Denso, Continental, AISIN, and one of the largest defense primes), and real urgency. We believe that synthetic data is one of the most powerful unlocks in AI today, that DiffuseDrive is the team to lead it, and that they will become the data infrastructure layer for the real world.
Welcome, DiffuseDrive. We’re proud to be your partner in this next chapter.