Vellum is coming to the AI Engineering World's Fair in SF. Come visit our booth and get a live demo!

Rentgrata's Test Driven Journey to a Production-Ready Chatbot

Learn how Rentgrata used Vellum to evaluate their chatbot, and cut development time in half.

Written by
Reviewed by
No items found.

Property management companies have a new ally—Rentgrata's revolutionary AI chatbot, Ari.

This chatbot helps them gain precise insights into what residents think about their living conditions. Using this valuable feedback, managers can make data-driven decisions to boost resident satisfaction and retention.

Rentgrata built this AI chatbot with a focus on security and validity, conducting numerous feasibility tests and evaluations before its release. But they also had an ally — they used Vellum throughout the entire development lifecycle, and were able to cut their projected 9-month development timeline nearly in half.

For those interested to learn how to build a production-ready AI chatbot, keep reading.

Who is Rentgrata?

Rentgrata is reshaping the renting experience by connecting prospective renters with current tenants to discuss living experiences. This platform rewards current residents for their insights, and new perspective residents get personalized and unique feedback.

Building on this success, they introduced Ari, a chatbot that uses advanced LLM technology to anonymously gather and analyze feedback from residents. This tool provides management companies with critical insights into tenant satisfaction, enabling them to make informed decisions and enhance the living experience for residents.

We sat down with Max Bryan, Rentgrata’s VP of Technology and Design, to learn more — here’s their test-driven journey from a simple concept to a production-ready AI chatbot that will revolutionize the proptech industry.

What Brought Them to Vellum?

Rentgrata had a vision to transform the way management companies understand and respond to resident feedback.

Early on, they tested their initial ideas by patching together models and code in Jupyter notebooks. However, they were missing a crucial element: "Evaluation.”

To build a truly reliable and actionable chatbot, they needed to evaluate their LLM features, and solve their early challenges:

  1. Overcoming the limitations of language models, such as context window size and math capabilities;
  2. Making the vast amounts of conversation data actionable and insightful for management companies.

That’s when they found Vellum.

How Does Rentgrata Use Vellum Today?

Today, Vellum is an integral part of Rentgrata's AI development lifecycle. As Max puts it, "We start with Vellum and end with Vellum.".

Their team started using Vellum for Evaluations, but soon enough started using it for every stage of their AI development:

Feasibility Testing: Before building a new feature, they use Vellum to confirm its feasibility, saving valuable time and resources.

Evaluation: Vellum's evaluation tools, both LLM-based and ground truth data, ensure that Ari's outputs are accurate and reliable.

Deployment: Rentgrata leverages Vellum's deployment capabilities to seamlessly connect Ari's backend to their user interface.

Production Monitoring: After deploying their system, they employ Vellum’s monitoring tools to gather feedback from end users and analyze the performance of their setup.

Building Ari: Actionable Renter Insights

The team has officially launched the outcome of their extensive development process: Ari, short for "Actionable Renter Insights”. This is a game-changing chatbot that enables management companies to understand what residents say about their community and what prospective residents want to know.

By analyzing conversation transcripts and presenting the data in an actionable format, Ari helps companies make data-driven decisions about marketing, management, and investments.

Preview of the Ari chatbot.

What impact has this partnership had on Rentgrata?

Bulletproof Accuracy

With Vellum, Rentgrata has achieved unparalleled confidence in Ari's performance. The numbers and insights provided by the chatbot are rigorously tested and evaluated, ensuring they are 100% accurate and trustworthy for their customers, the property management businesses.

Accelerated Development

Their small team initially estimated a 9-month timeline just for prompt engineering and evaluation.

By leveraging Vellum, they not only completed those tasks but also built complex AI workflows and managed deployment within an impressive 5 months, cutting their projected 9-month timeline nearly in half.

Actionable Insights

Ari empowers management companies to make informed decisions about where to invest their resources to improve resident happiness. In one case, a company was considering a $300,000 window renovation based on noise complaints. However, Rentgrata’s insights revealed that windows were not the primary concern, allowing the company to allocate funds more effectively.

Vellum has been instrumental in making Rentgrata's data actionable and reliable. With Ari, management companies can confidently make significant decisions based on quantifiable data, rather than relying on anecdotal reviews.

We're thrilled to collaborate with leaders like Max and his team, helping them bring their vision to life.

Want to Try Out Vellum?

Vellum has enabled more than 100 companies to build complex AI chatbot logic, evaluate their infra and ship production-grade apps.

If you’re looking to develop a reliable AI assistant, we’re here to help you. Request a demo for our app here or reach out to us at support@vellum.ai if you have any questions.

We’re excited to see what you and your team builds with Vellum next!

ABOUT THE AUTHOR
Anita Kirkovska
Founding Growth Lead

An AI expert with a strong ML background, specializing in GenAI and LLM education. A former Fulbright scholar, she leads Growth and Education at Vellum, helping companies build and scale AI products. She conducts LLM evaluations and writes extensively on AI best practices, empowering business leaders to drive effective AI adoption.

ABOUT THE reviewer

No items found.
lAST UPDATED
May 2, 2024
share post
Expert verified
Related Posts
January 10, 2026
8 min
Vellum Product Update | December
All
December 12, 2025
7 min
How we use coding agents to 2x engineering output
LLM basics
December 12, 2025
8 min
GPT-5.2 Benchmarks
LLM basics
December 4, 2025
8 min
Top 12 AI Workflow Platforms
Product Updates
December 3, 2025
12 min
Vellum Product Update | November
Model Comparisons
November 27, 2025
18 min
Flagship Model Report: Gpt-5.1 vs Gemini 3 Pro vs Claude Opus 4.5
The Best AI Tips — Direct To Your Inbox

Latest AI news, tips, and techniques

Specific tips for Your AI use cases

No spam

Oops! Something went wrong while submitting the form.

Each issue is packed with valuable resources, tools, and insights that help us stay ahead in AI development. We've discovered strategies and frameworks that boosted our efficiency by 30%, making it a must-read for anyone in the field.

Marina Trajkovska
Head of Engineering

This is just a great newsletter. The content is so helpful, even when I’m busy I read them.

Jeremy Hicks
Solutions Architect

Experiment, Evaluate, Deploy, Repeat.

AI development doesn’t end once you've defined your system. Learn how Vellum helps you manage the entire AI development lifecycle.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Build AI agents in minutes with Vellum
Build agents that take on the busywork and free up hundreds of hours. No coding needed, just start creating.

General CTA component, Use {{general-cta}}

Build AI agents in minutes with Vellum
Build agents that take on the busywork and free up hundreds of hours. No coding needed, just start creating.

General CTA component  [For enterprise], Use {{general-cta-enterprise}}

The best AI agent platform for enterprises
Production-grade rigor in one platform: prompt builder, agent sandbox, and built-in evals and monitoring so your whole org can go AI native.

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.
Use {{ebook-cta}} and add a Ebook reference in the article

Thank you!
Your submission has been received!
Oops! Something went wrong while submitting the form.
Button Text

LLM leaderboard CTA component. Use {{llm-cta}}

Check our LLM leaderboard
Compare all open-source and proprietary model across different tasks like coding, math, reasoning and others.

Case study CTA component (ROI) = {{roi-cta}}

40% cost reduction on AI investment
Learn how Drata’s team uses Vellum and moves fast with AI initiatives, without sacrificing accuracy and security.

Case study CTA component (cutting eng overhead) = {{coursemojo-cta}}

6+ months on engineering time saved
Learn how CourseMojo uses Vellum to enable their domain experts to collaborate on AI initiatives, reaching 10x of business growth without expanding the engineering team.

Case study CTA component (Time to value) = {{time-cta}}

100x faster time to deployment for AI agents
See how RelyHealth uses Vellum to deliver hundreds of custom healthcare agents with the speed customers expect and the reliability healthcare demands.

[Dynamic] Guide CTA component using Blog Post CMS, filtering on Guides’ names

100x faster time to deployment for AI agents
See how RelyHealth uses Vellum to deliver hundreds of custom healthcare agents with the speed customers expect and the reliability healthcare demands.
New CTA
Sorts the trigger and email categories

Dynamic template box for healthcare, Use {{healthcare}}

Start with some of these healthcare examples

Healthcare explanations of a patient-doctor match
Summarize why a patient was matched with a specific provider.
Personalized care plan agent
Creates individualized care plans from EHR data by parsing medical data

Dynamic template box for insurance, Use {{insurance}}

Start with some of these insurance examples

Agent that summarizes lengthy reports (PDF -> Summary)
Summarize all kinds of PDFs into easily digestible summaries.
Insurance claims automation agent
Collect and analyze claim information, assess risk and verify policy details.
AI agent for claims review
Review healthcare claims, detect anomalies and benchmark pricing.

Dynamic template box for eCommerce, Use {{ecommerce}}

Start with some of these eCommerce examples

E-commerce shopping agent
Check order status, manage shopping carts and process returns.

Dynamic template box for Marketing, Use {{marketing}}

Start with some of these marketing examples

Reddit monitoring agent
Monitor Reddit for new posts and send summaries to a specified Slack channel.
Creative content generator agent
Give it a URL and a format, and it turns the source into finished creative content.

Dynamic template box for Sales, Use {{sales}}

Start with some of these sales examples

Objection capture agent for sales calls
Take call transcripts, extract objections, and update the associated Hubspot contact record.
Research agent for sales demos
Company research based on Linkedin and public data as a prep for sales demo.

Dynamic template box for Legal, Use {{legal}}

Start with some of these legal examples

Compliance review agent
Checks DPAs and privacy policies against your compliance checklist then scores coverage and make a plan.
PDF Data Extraction to CSV
Extract unstructured data (PDF) into a structured format (CSV).

Dynamic template box for Supply Chain/Logistics, Use {{supply}}

Start with some of these supply chain examples

Risk assessment agent for supply chain operations
Comprehensive risk assessment for suppliers based on various data inputs.

Dynamic template box for Edtech, Use {{edtech}}

Start with some of these edtech examples

No items found.

Dynamic template box for Compliance, Use {{compliance}}

Start with some of these compliance examples

No items found.

Dynamic template box for Customer Support, Use {{customer}}

Start with some of these customer support examples

Trust center RAG Chatbot
RAG chatbot for internal policy documents with reranking model and Google search.
Customer support agent
Support chatbot that classifies user messages and escalates to a human when needed.

Template box, 2 random templates, Use {{templates}}

Start with some of these agents

Turn LinkedIn Posts into Articles and Push to Notion
Convert your best Linkedin posts into long form content.
Research agent for sales demos
Company research based on Linkedin and public data as a prep for sales demo.

Template box, 6 random templates, Use {{templates-plus}}

Build AI agents in minutes

Contract review agent
Reviews contract text against a checklist, flags deviations, scores risk, and produces a lawyer friendly summary.
Research agent for sales demos
Company research based on Linkedin and public data as a prep for sales demo.
Content Repurposing Agent
This agent transforms a webinar transcript into publish-ready content.
Creative content generator agent
Give it a URL and a format, and it turns the source into finished creative content.
Review Comment Generator for GitHub PRs
Use predefined guidelines to write a code review comment for a GitHub PR.
Synthetic Dataset Generator
Generate a synthetic dataset for testing your AI engineered logic.

Build AI agents in minutes for

{{industry_name}}

Roadmap planner
Agent that reviews your roadmap and suggests changes based on team capacity.
Account monitoring agent
Combines product usage data with CRM data from HubSpot or Salesforce to flag accounts with declining usage, especially ahead of renewals.
Cross team status updates
Scans Linear for stale, blocked, or repeatedly reopened issues, flags patterns, and uses Devin to propose cleanup or refactor suggestions.
SEO article generator
Generates SEO optimized articles by researching top results, extracting themes, and writing content ready to publish.
Stripe transaction review agent
Analyzes recent Stripe transactions for suspicious patterns, flags potential fraud, posts a summary in Slack.
KYC compliance agent
Automates KYC checks by reviewing customer documents stored in HubSpot

Case study results overview (usually added at top of case study)

What we did:

1-click

This is some text inside of a div block.

28,000+

Separate vector databases managed per tenant.

100+

Real-world eval tests run before every release.