Announcing our $20m Series A

AI Development needs a standard & we’re building it at Vellum

Written by

Akash Sharma

Reviewed by

CONTENTS

Inline evaluation / Guardrails: Ensure good system performance at run-time

This is some text inside of a div block.

A year ago, everyone was racing to prototype with AI.

Now everyone is learning that it's one thing to build a demo and another thing entirely to get it production-ready. At Vellum, we've lived that gap and it's why we exist. Today I'm excited to share that we've raised $20M in Series A funding, a milestone that allows us to double down on helping teams bridge that gap from proof of concept to production ready AI systems.

The round was led by Leaders Fund, with participation from Y Combinator, Socii Capital, Rebel Fund, Pioneer Fund and Eastlink Capital.

It’s a big milestone for us, but more than anything, it validates what we’ve been seeing on the ground: the need for a standard approach to bring AI products into production, especially inside large, complex organizations.

The promise of AI is clear. We all dream of a future where AI agents automate the mundane work and we focus on the high leverage creative tasks. AI also comes with real business value, AI native companies like Cursor & Lovable have shown rocketship growth & industry leaders like Salesforce are making AI a core part of their strategy.

But what about the others?

Building AI applications is still a massive challenge. For those that make it to market, it often takes many quarters of painstaking work, and many more fall by the wayside because they can’t meet quality standards.

With our latest funding, we’re committed to accelerating AI adoption globally.

Why build an AI Development Standard?

Since March 2020, Sidd, Noa, and I have been building AI applications with LLMs and we’ve bumped into the same roadblocks every time:

What works in a demo often breaks in production because models behave unpredictably.
The pace of change makes it nearly impossible to stay current, let alone build with confidence (agents weren’t mainstream until just 6 months ago!)
Everything falls on engineers, making them the main bottleneck

Developing AI feels like writing software in quicksand, the ground keeps shifting and teams struggle just to stay afloat.

We’ve worked with over 150 companies across industries, ranging from bleeding edge startups to household names like Swisscom, Redfin, Drata and Headspace and the same pattern emerges: successful AI requires structured, cross-functional teamwork and rigorous development practices.

A Platform to Bring Rigor to AI Development

Vellum is an enterprise development platform for building, evaluating, and deploying mission-critical AI products. It is the most comprehensive on the market, helping cross functional teams work together through the entire AI development lifecycle.

AI workflow definition: A UI builder and SDK let teams visualize, test, and refine AI logic. Engineers and non-technical experts can collaborate side by side.
End-to-end evaluation: A robust testing suite catches failures and edge cases before they reach production.
Safe deployments: Push updates and publish new versions without risky redeploys. Vellum enables precise version control, even in highly complex environments.
Live monitoring and continuous improvement: Real-time observability shows how systems behave in the real world, with live feedback loops that feed directly into testing.

What is the Test-Driven Development Standard?

We believe true test driven development is the standard AI teams need to build systems they can trust and control as they grow. Every part of our platform is designed around this principle, turning best practices into everyday workflows and cutting time to production from quarters to weeks.This brings control, rigor, and reliability:

One place for the full workflow: Build, test, deploy, and monitor AI systems in a single platform. Every change is tracked and versioned for clear history and explainability.
Standards you can trust: Ship only when your models meet real quality, cost, and latency goals. Keep iterating with confidence as new models or orchestration techniques emerge.
Learning built in: Test driven development turns every update into a lesson. Teams gain a deeper understanding of how AI works in practice and adapt fast as needs change.

This only works when everyone has the right tools and context.

Vellum gives engineers an SDK to manage and control their environments with confidence, while product managers and domain experts use a visual builder to shape AI behavior without writing code. Everyone works in the same space, sharing context and staying in sync as they build, evaluate, and iterate on AI systems together.

Our customers’ wins

Every team wants the same thing from AI: to move fast without sacrificing quality. What they build, though, is unique to their world. With Vellum they turn these ideas into reliable AI products and keep standards high while enabling their teams to move fast.

Swisscom has made Vellum a core part of their AI platform, giving Swiss banks and governments a secure and reliable way to build AI applications

Drata builds and secures 7,000+ isolated knowledge bases to drive compliant GRC automation across tenants. PMs and engineers collaborate in Vellum for rapid validation and deployment

Redfin rolled out “Ask Redfin” to millions of users across 14 markets by having their domain experts evaluate the conversational agent using thousands of test cases

DeepScribe cuts clinician note iteration time by 20–40%, using feedback loops and regression testing to ensure accuracy and trust

Rely Health went from multi engineer, multi month builds to deploying healthcare workflows in days, automating voice agents, smart triage, and charting because of Vellum tracing and decoupled deployments

Rentgrata launched “Ari”, a renter chatbot, with Vellum powering testing, deployment, and post-launch monitoring for airtight accuracy

GravityStack slashed credit agreement review time by 200% using agentic workflows, powered end-to-end on Vellum

Educating the Market

In an emerging market like this, sharing what works and what doesn’t is just as important as the platform itself. That’s a big part of what we do. We make sure teams have both so they can build with confidence and learn as they go.

#1 ranking LLM Leaderboard worldwide sharing which models perform best across different use cases
Best practices guides, webinars, and blogs covering recent AI developments, orchestration strategy, observability, and evaluation frameworks
Live training through office hours and workshops to help product, ML, and engineering teams build production ready AI systems
Technology partnership program with the best service providers to equip our customers with high value talent and instant support from day 1

What’s next?

I remember someone asking me in Month 2 of our company, “What’s your vision of Vellum?”

My answer back then holds true today: “We’ve lived the AI development pain first-hand so our customers don’t have to. We will be the best-in-class platform engineering teams around the world rely on to power core AI applications.”

Today we are excited to partner with Leaders.vc to scale what we have shown works time and time again. With this capital we will:

Increase the number of AI use cases deployed through Vellum
Lower the time to production of each AI use case deployed through Vellum
Expand our presence in new verticals and geographies
Cement Vellum as the foundational layer in the AI stack

We’re building not just software, but the standard of how the world builds AI products. For everyone venturing beyond prototypes where quality, and reliability matter, Vellum is your foundation.

We’re also hiring across the board! If you’re interested in any open role, apply here.

Let’s build the future together.

Akash Sharma

Founder & CEO, Vellum

But what about the others?

With our latest funding, we’re committed to accelerating AI adoption globally.

Why build an AI Development Standard?

Since March 2020, Sidd, Noa, and I have been building AI applications with LLMs and we’ve bumped into the same roadblocks every time:

What works in a demo often breaks in production because models behave unpredictably.
The pace of change makes it nearly impossible to stay current, let alone build with confidence (agents weren’t mainstream until just 6 months ago!)
Everything falls on engineers, making them the main bottleneck

Developing AI feels like writing software in quicksand, the ground keeps shifting and teams struggle just to stay afloat.

A Platform to Bring Rigor to AI Development

AI workflow definition: A UI builder and SDK let teams visualize, test, and refine AI logic. Engineers and non-technical experts can collaborate side by side.
End-to-end evaluation: A robust testing suite catches failures and edge cases before they reach production.
Safe deployments: Push updates and publish new versions without risky redeploys. Vellum enables precise version control, even in highly complex environments.
Live monitoring and continuous improvement: Real-time observability shows how systems behave in the real world, with live feedback loops that feed directly into testing.

What is the Test-Driven Development Standard?

One place for the full workflow: Build, test, deploy, and monitor AI systems in a single platform. Every change is tracked and versioned for clear history and explainability.
Standards you can trust: Ship only when your models meet real quality, cost, and latency goals. Keep iterating with confidence as new models or orchestration techniques emerge.
Learning built in: Test driven development turns every update into a lesson. Teams gain a deeper understanding of how AI works in practice and adapt fast as needs change.

This only works when everyone has the right tools and context.

Our customers’ wins

Swisscom has made Vellum a core part of their AI platform, giving Swiss banks and governments a secure and reliable way to build AI applications

Drata builds and secures 7,000+ isolated knowledge bases to drive compliant GRC automation across tenants. PMs and engineers collaborate in Vellum for rapid validation and deployment

Redfin rolled out “Ask Redfin” to millions of users across 14 markets by having their domain experts evaluate the conversational agent using thousands of test cases

DeepScribe cuts clinician note iteration time by 20–40%, using feedback loops and regression testing to ensure accuracy and trust

Rentgrata launched “Ari”, a renter chatbot, with Vellum powering testing, deployment, and post-launch monitoring for airtight accuracy

GravityStack slashed credit agreement review time by 200% using agentic workflows, powered end-to-end on Vellum

Educating the Market

#1 ranking LLM Leaderboard worldwide sharing which models perform best across different use cases
Best practices guides, webinars, and blogs covering recent AI developments, orchestration strategy, observability, and evaluation frameworks
Live training through office hours and workshops to help product, ML, and engineering teams build production ready AI systems
Technology partnership program with the best service providers to equip our customers with high value talent and instant support from day 1

What’s next?

I remember someone asking me in Month 2 of our company, “What’s your vision of Vellum?”

Today we are excited to partner with Leaders.vc to scale what we have shown works time and time again. With this capital we will:

Increase the number of AI use cases deployed through Vellum
Lower the time to production of each AI use case deployed through Vellum
Expand our presence in new verticals and geographies
Cement Vellum as the foundational layer in the AI stack

We’re building not just software, but the standard of how the world builds AI products. For everyone venturing beyond prototypes where quality, and reliability matter, Vellum is your foundation.

We’re also hiring across the board! If you’re interested in any open role, apply here.

Let’s build the future together.

Akash Sharma

Founder & CEO, Vellum

ABOUT THE AUTHOR

Akash Sharma

Co-founder & CEO

Akash Sharma, CEO and co-founder at Vellum (YC W23) is enabling developers to easily start, develop and evaluate LLM powered apps. By talking to over 1,500 people at varying maturities of using LLMs in production, he has acquired a very unique understanding of the landscape, and is actively distilling his learnings with the broader LLM community. Before starting Vellum, Akash completed his undergrad at the University of California, Berkeley, then spent 5 years at McKinsey's Silicon Valley Office.

ABOUT THE reviewer

No items found.

lAST UPDATED

Jul 10, 2025

Expert verified

Model Comparisons

February 6, 2026

•

10 min

Claude Opus 4.6 Benchmarks

LLM basics

February 5, 2026

•

12 min

15 Best Make Alternatives: Reviewed & Compared

Product Updates

February 3, 2026

•

5 min

Vellum Product Update | January

LLM basics

January 30, 2026

•

20 min

15 Best Zapier Alternatives: Reviewed & Compared

LLM basics

January 28, 2026

•

20 min

2026 Marketer's Guide to AI Agents for Marketing Operations

LLM basics

January 26, 2026

•

18 min

Top 20 AI Agent Builder Platforms (Complete 2026 Guide)

The Best AI Tips — Direct To Your Inbox

Latest AI news, tips, and techniques

Specific tips for Your AI use cases

No spam

Oops! Something went wrong while submitting the form.

Each issue is packed with valuable resources, tools, and insights that help us stay ahead in AI development. We've discovered strategies and frameworks that boosted our efficiency by 30%, making it a must-read for anyone in the field.

Marina Trajkovska

Head of Engineering

This is just a great newsletter. The content is so helpful, even when I’m busy I read them.

Jeremy Hicks

Solutions Architect

Book a DemoLearn more

Automate the work
that slows you down

AI agents for your boring ops tasks.

Product

Resources

Company

Careers

Affiliate program rules

Announcing our $20m Series A

Why build an AI Development Standard?

A Platform to Bring Rigor to AI Development

What is the Test-Driven Development Standard?

Our customers’ wins

Educating the Market

What’s next?

Why build an AI Development Standard?

A Platform to Bring Rigor to AI Development

What is the Test-Driven Development Standard?

Our customers’ wins

Educating the Market

What’s next?

Automate the workthat slows you down

General CTA component, Use {{general-cta}}

General CTA component [For enterprise], Use {{general-cta-enterprise}}

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.Use {{ebook-cta}} and add a Ebook reference in the article

LLM leaderboard CTA component. Use {{llm-cta}}

Case study CTA component (ROI) = {{roi-cta}}

Case study CTA component (cutting eng overhead) = {{coursemojo-cta}}

Case study CTA component (Time to value) = {{time-cta}}

[Dynamic] Guide CTA component using Blog Post CMS, filtering on Guides’ names

Dynamic template box for healthcare, Use {{healthcare}}

Start with some of these healthcare examples

Dynamic template box for insurance, Use {{insurance}}

Start with some of these insurance examples

Dynamic template box for eCommerce, Use {{ecommerce}}

Start with some of these eCommerce examples

Dynamic template box for Marketing, Use {{marketing}}

Start with some of these marketing examples

Dynamic template box for Sales, Use {{sales}}

Start with some of these sales examples

Dynamic template box for Legal, Use {{legal}}

Start with some of these legal examples

Dynamic template box for Supply Chain/Logistics, Use {{supply}}

Start with some of these supply chain examples

Dynamic template box for Edtech, Use {{edtech}}

Start with some of these edtech examples

Dynamic template box for Compliance, Use {{compliance}}

Start with some of these compliance examples

Dynamic template box for Customer Support, Use {{customer}}

Start with some of these customer support examples

Template box, 2 random templates, Use {{templates}}

Start with some of these agents

Template box, 6 random templates, Use {{templates-plus}}

Build AI agents in minutes

Build AI agents in minutes for

{{industry_name}}

Case study results overview (usually added at top of case study)

1-click

28,000+

100+

Automate the work
that slows you down

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.
Use {{ebook-cta}} and add a Ebook reference in the article