Automating PR Reviews for Dummies

Time to see if I’ve automated myself out of a job.

Written by

Pei Li

Reviewed by

CONTENTS

Inline evaluation / Guardrails: Ensure good system performance at run-time

This is some text inside of a div block.

PR reviews are an important part of an engineering organization. It’s a way to ensure quality, standardize coding practices, and share knowledge.

On average, Vellum’s 15 engineers open 50+ PRs per day. If each code review took 5 minutes, I would be spending 4 hours each day on only code reviews.

It’s not realistic, and definitely not fun.

You know what would be fun, though? Creating a bot that will review all my PRs for me - using just Github and Vellum.

Breaking It Down

High level, these are the steps I need to implement for a fully automated review bot:

Trigger actions whenever a pull request is opened
Retrieve the PR, associated diffs, and coding guidelines
Use an LLM to generate a review based on the diff and relevant coding guidelines
Post a comment with the review on the PR

Step 1 can be done using Github Actions, steps 2 and 4 can be done using the Github API, and step 3 can be done using a good prompt. These steps can be orchestrated using Vellum Workflows.

The Prompt

The biggest unknown is #3 - whether or not I can create a good review based on the diff and coding guidelines. I don’t want to distract or mislead our engineers with bad suggestions, and so quality is the highest priority here.

I went with GPT-4.5 for its quality, even though it’s more expensive than cheaper options such as GPT-4o Mini or GPT-4 Turbo. We can always reduce costs later by using a cheaper model, but first we have to prove that automated reviews can work. Here is the prompt I used on the first attempt:

You are a code reviewer. You will be given guidelines for reviewing code in markdown format, and a code diff in git diff format.
You should output clear and concise feedback that summarize high-level guideline violations at the start.
Then output detailed feedback per guideline violation, citing the original code from the diff. You MUST correctly cite the code from the diff if you call out a violation.
Don't be a nit. If it's a very minor violation, a brief callout is better than a long explanation, but you must still cite the code that caused the violation.

The output was surprisingly already great, with no further iterations and no examples. I won’t spoil the fun just yet, though - keep reading to see the final results.

The Workflow

To implement steps 2-4, I’ll use Vellum Workflows. Using a combination of Template Nodes, API Nodes, and Prompt Nodes, I can make a standalone agent that reviews any PR if it’s given the PR number as input.

Here are the high-level steps:

1. Get the PR diff using

GET /repos/{owner}/{repo}/pulls/{pull_number} with Accept: application/vnd.github.diff

2. Execute the prompt using a Prompt Node and pass in the diff and our coding guidelines.

3. Apply formatting on the code review output using a Template Node.

4. Make a comment on the PR using

POST /repos/{owner}/{repo}/issues/{issue_number}/comments

‍

Click to Interact

At this point, I can immediately test the Workflow by passing in an example PR number. If the Workflow passes testing, I can deploy the Workflow on Vellum, which allows it to be executed via API.

The Integration

The only thing left to do is to hook up the Workflow Deployment to Github. I can do this using a Github Action:

name: Github Reviewer
on:
  pull_request:
    types:
      - opened
      - ready_for_review
jobs:
  deployment:
    runs-on: ubuntu-latest
    if: ${{ !github.event.pull_request.draft }}
    steps:
    - name: Execute Vellum Workflow
      uses: fjogeleit/http-request-action@v1
      with:
        url: 'https://predict.vellum.ai/v1/execute-workflow'
        method: 'POST'
        timeout: 300000
        customHeaders: '{"Content-Type": "application/json", "X_API_KEY": "${{ secrets.VELLUM_GITHUB_REVIEWER_API_KEY }}"}'
        data: '{"workflow_deployment_name": "github-reviewer-workflow-deployment", "release_tag": "LATEST", "inputs": [{"type": "STRING", "name": "org", "value": "vellum-ai"}, {"type": "STRING", "name": "repo", "value": "vellum"}, {"type": "STRING", "name": "pull_number", "value": "${{github.event.number}}"}]}'

Time to see if I’ve automated myself out of a job.

Pei the Code Cop

These suggestions are right on the money! 🚢

The Aftermath

In less than 4 hours, I’ve created a bot that produces high-quality reviews on 50+ PRs a day. While there is still room for improvement, this is already providing immediate value to our engineers by catching issues minutes after their PR is opened.

Some potential improvements we could make using Vellum:

Use good and bad examples in our prompt to help it make better reviews
Use Vellum’s Actuals API to collect feedback on quality
Break down the Workflow into simpler steps to allow cheaper models to be used

All of these improvements can be done using only Vellum. If you want to use this bot for your own engineering organization, book a demo with us here.

PR reviews are an important part of an engineering organization. It’s a way to ensure quality, standardize coding practices, and share knowledge.

On average, Vellum’s 15 engineers open 50+ PRs per day. If each code review took 5 minutes, I would be spending 4 hours each day on only code reviews.

It’s not realistic, and definitely not fun.

You know what would be fun, though? Creating a bot that will review all my PRs for me - using just Github and Vellum.

Breaking It Down

High level, these are the steps I need to implement for a fully automated review bot:

Trigger actions whenever a pull request is opened
Retrieve the PR, associated diffs, and coding guidelines
Use an LLM to generate a review based on the diff and relevant coding guidelines
Post a comment with the review on the PR

Step 1 can be done using Github Actions, steps 2 and 4 can be done using the Github API, and step 3 can be done using a good prompt. These steps can be orchestrated using Vellum Workflows.

The Prompt

You are a code reviewer. You will be given guidelines for reviewing code in markdown format, and a code diff in git diff format.
You should output clear and concise feedback that summarize high-level guideline violations at the start.
Then output detailed feedback per guideline violation, citing the original code from the diff. You MUST correctly cite the code from the diff if you call out a violation.
Don't be a nit. If it's a very minor violation, a brief callout is better than a long explanation, but you must still cite the code that caused the violation.

The output was surprisingly already great, with no further iterations and no examples. I won’t spoil the fun just yet, though - keep reading to see the final results.

The Workflow

Here are the high-level steps:

1. Get the PR diff using

GET /repos/{owner}/{repo}/pulls/{pull_number} with Accept: application/vnd.github.diff

2. Execute the prompt using a Prompt Node and pass in the diff and our coding guidelines.

3. Apply formatting on the code review output using a Template Node.

4. Make a comment on the PR using

POST /repos/{owner}/{repo}/issues/{issue_number}/comments

‍

Click to Interact

At this point, I can immediately test the Workflow by passing in an example PR number. If the Workflow passes testing, I can deploy the Workflow on Vellum, which allows it to be executed via API.

The Integration

The only thing left to do is to hook up the Workflow Deployment to Github. I can do this using a Github Action:

name: Github Reviewer
on:
  pull_request:
    types:
      - opened
      - ready_for_review
jobs:
  deployment:
    runs-on: ubuntu-latest
    if: ${{ !github.event.pull_request.draft }}
    steps:
    - name: Execute Vellum Workflow
      uses: fjogeleit/http-request-action@v1
      with:
        url: 'https://predict.vellum.ai/v1/execute-workflow'
        method: 'POST'
        timeout: 300000
        customHeaders: '{"Content-Type": "application/json", "X_API_KEY": "${{ secrets.VELLUM_GITHUB_REVIEWER_API_KEY }}"}'
        data: '{"workflow_deployment_name": "github-reviewer-workflow-deployment", "release_tag": "LATEST", "inputs": [{"type": "STRING", "name": "org", "value": "vellum-ai"}, {"type": "STRING", "name": "repo", "value": "vellum"}, {"type": "STRING", "name": "pull_number", "value": "${{github.event.number}}"}]}'

Time to see if I’ve automated myself out of a job.

Pei the Code Cop

These suggestions are right on the money! 🚢

The Aftermath

Some potential improvements we could make using Vellum:

Use good and bad examples in our prompt to help it make better reviews
Use Vellum’s Actuals API to collect feedback on quality
Break down the Workflow into simpler steps to allow cheaper models to be used

All of these improvements can be done using only Vellum. If you want to use this bot for your own engineering organization, book a demo with us here.

ABOUT THE AUTHOR

Pei Li

Founding Engineer

Pei is a serial entrepreneur and Founding Engineer at Vellum (YC W23). He was previously a founder at Venue.live (YC W22), CodeMode (consultancy), and Hack The 6ix (NPO). His side hustle is fishing at the poker tables.

ABOUT THE reviewer

No items found.

lAST UPDATED

Mar 19, 2025

Expert verified

Model Comparisons

February 6, 2026

•

10 min

Claude Opus 4.6 Benchmarks

LLM basics

February 5, 2026

•

12 min

15 Best Make Alternatives: Reviewed & Compared

Product Updates

February 3, 2026

•

5 min

Vellum Product Update | January

LLM basics

January 30, 2026

•

20 min

15 Best Zapier Alternatives: Reviewed & Compared

LLM basics

January 28, 2026

•

20 min

2026 Marketer's Guide to AI Agents for Marketing Operations

LLM basics

January 26, 2026

•

18 min

Top 20 AI Agent Builder Platforms (Complete 2026 Guide)

The Best AI Tips — Direct To Your Inbox

Latest AI news, tips, and techniques

Specific tips for Your AI use cases

No spam

Oops! Something went wrong while submitting the form.

Each issue is packed with valuable resources, tools, and insights that help us stay ahead in AI development. We've discovered strategies and frameworks that boosted our efficiency by 30%, making it a must-read for anyone in the field.

Marina Trajkovska

Head of Engineering

This is just a great newsletter. The content is so helpful, even when I’m busy I read them.

Jeremy Hicks

Solutions Architect

Book a DemoLearn more

Automate the work
that slows you down

AI agents for your boring ops tasks.

Product

Resources

Company

Careers

Affiliate program rules

Automating PR Reviews for Dummies

Breaking It Down

The Prompt

The Workflow

The Integration

Pei the Code Cop

The Aftermath

Breaking It Down

The Prompt

The Workflow

The Integration

Pei the Code Cop

The Aftermath

Automate the workthat slows you down

General CTA component, Use {{general-cta}}

General CTA component [For enterprise], Use {{general-cta-enterprise}}

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.Use {{ebook-cta}} and add a Ebook reference in the article

LLM leaderboard CTA component. Use {{llm-cta}}

Case study CTA component (ROI) = {{roi-cta}}

Case study CTA component (cutting eng overhead) = {{coursemojo-cta}}

Case study CTA component (Time to value) = {{time-cta}}

[Dynamic] Guide CTA component using Blog Post CMS, filtering on Guides’ names

Dynamic template box for healthcare, Use {{healthcare}}

Start with some of these healthcare examples

Dynamic template box for insurance, Use {{insurance}}

Start with some of these insurance examples

Dynamic template box for eCommerce, Use {{ecommerce}}

Start with some of these eCommerce examples

Dynamic template box for Marketing, Use {{marketing}}

Start with some of these marketing examples

Dynamic template box for Sales, Use {{sales}}

Start with some of these sales examples

Dynamic template box for Legal, Use {{legal}}

Start with some of these legal examples

Dynamic template box for Supply Chain/Logistics, Use {{supply}}

Start with some of these supply chain examples

Dynamic template box for Edtech, Use {{edtech}}

Start with some of these edtech examples

Dynamic template box for Compliance, Use {{compliance}}

Start with some of these compliance examples

Dynamic template box for Customer Support, Use {{customer}}

Start with some of these customer support examples

Template box, 2 random templates, Use {{templates}}

Start with some of these agents

Template box, 6 random templates, Use {{templates-plus}}

Build AI agents in minutes

Build AI agents in minutes for

{{industry_name}}

Case study results overview (usually added at top of case study)

1-click

28,000+

100+

Automate the work
that slows you down

[Dynamic] Ebook CTA component using the Ebook CMS filtered by name of ebook.
Use {{ebook-cta}} and add a Ebook reference in the article