Recce Raises $4M to Bring Data-Native Code Review to AI Dev

Its popular open-source project is growing quickly, with users including The Philadelphia Inquirer and government entities in Brazil and Australia

Recce, a provider of data-native code review tools for data transformation projects and AI systems, has launched a cloud platform for its popular open-source toolkit and raised $4 million in funding. Heavybit led the round, with participation from Vertex Ventures US, Hive Ventures, and angels Visionary, SVT Angels, Brighter Capital, Ventek Ventures, Scott Breitenother and Tim Chen of Essence VC.

Recce was created as an open-source project in 2023 by CL Kao, a pioneer in code versioning systems who built Git-precursor SVK, which was widely adopted by Apple, Ubisoft, and others. Recce brings the best-practice workflows that data professionals are used to (including data diffing, validation checklists, and query result comparison) natively into data transformation workflows alongside existing tools like dbt, so data engineers, data scientists, and other stakeholders can streamline data validation across the software lifecycle.

“A company’s proprietary data set is a key differentiator in the advent of AI commoditization, and extracting value from that data is a critical priority. Tools like dbt unlocked data analytics for software; now, we’re in an era where the data itself is managed programmatically, and you have to continuously validate not just the logic of the data, but also the data being generated,” said founder and CEO Kao. “We believe most code reviews in the future will become data reviews as data correctness becomes a defining element for success. Recce’s mission is to ensure the stability and accuracy of complex data systems as AI and specifically LLMs drive more data transformation.”

Recce’s open-source project now sees 3,600 downloads per week on GitHub, with users including The Philadelphia Inquirer, telecoms companies, healthcare tech startups, and government entities in Brazil and Australia.

“Recce has become essential to our analytics workflow at The Inquirer,” said Brian Waligorski, Lead Data Engineer at The Philadelphia Inquirer. “Recce automates data validation across 50+ downstream consumers of our data models, supports ad-hoc impact analysis, and integrates cleanly into our CI/CD pipeline. It’s helped us move faster and smarter without compromising data integrity.”

Recce OSS 1.0 Release and Launch of Recce Cloud

Today, Recce is releasing 1.0 of the open-source project, providing the foundation for the company’s new collaborative SaaS offering, Recce Cloud:

  • Column-Level Impact Lineage Analysis for precise and granular downstream change impact scoping and visualization
  • Change Exploration between production and development data with flexible row-by-row, profiling, value distribution, and arbitrary query result comparison
  • Evidence Collection for capturing exploration insights into shareable and reusable validation checklists
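
To make the idea of row-by-row comparison between production and development data concrete, here is a minimal sketch in plain Python. It is not Recce's API or implementation: the function name `diff_rows`, the sample rows, and the `id` key column are all hypothetical, chosen only to illustrate the general technique of diffing two versions of the same table by primary key.

```python
from typing import Any

Row = dict[str, Any]


def diff_rows(prod: list[Row], dev: list[Row], key: str) -> dict[str, list]:
    """Compare two row sets by primary key (hypothetical helper, not Recce's API)."""
    prod_by_key = {row[key]: row for row in prod}
    dev_by_key = {row[key]: row for row in dev}

    # Keys present only in development are additions; only in production, removals.
    added = sorted(k for k in dev_by_key if k not in prod_by_key)
    removed = sorted(k for k in prod_by_key if k not in dev_by_key)

    # For keys present on both sides, list the columns whose values differ.
    changed = []
    for k in sorted(prod_by_key.keys() & dev_by_key.keys()):
        before, after = prod_by_key[k], dev_by_key[k]
        columns = sorted(
            c for c in before.keys() | after.keys() if before.get(c) != after.get(c)
        )
        if columns:
            changed.append((k, columns))

    return {"added": added, "removed": removed, "changed": changed}


# Made-up sample data standing in for the same model built in two environments.
prod_rows = [
    {"id": 1, "plan": "basic", "revenue": 10},
    {"id": 2, "plan": "pro", "revenue": 30},
    {"id": 3, "plan": "pro", "revenue": 30},
]
dev_rows = [
    {"id": 1, "plan": "basic", "revenue": 10},
    {"id": 2, "plan": "pro", "revenue": 35},
    {"id": 4, "plan": "basic", "revenue": 10},
]

result = diff_rows(prod_rows, dev_rows, key="id")
print(result)
```

A summary like this (which keys were added or removed, and which columns changed for the rest) is the kind of evidence a reviewer might record in a validation checklist before approving a change.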

Recce Cloud launches today in private beta to enable team sharing and collaboration:

  • Full data-validation context sharing with teams including lineage diffs, custom query results, and structured checklists
  • Data workflow integration with GitHub, so that code is merged only when all checks are approved
  • Free tier in the pricing plan

Apply for early access today or learn more about the full product announcement.

Heavybit, the leading investor in developer-focused startups, led Recce’s $4 million pre-seed round and sees a transformation underway as software becomes more beholden to data’s impact on the organization.

“Data pipelines are the New Secret Sauce for every company building with AI, enabling teams to create and improve high-quality training data from their own IP,” said Heavybit General Partner and DevOps trailblazer Jesse Robbins, who is joining Recce’s board. “Recce provides the essential toolkit for unlocking the full value of their data with iteration, refinement, and monitoring, while mitigating the risk of errors and corruption. Heavybit is thrilled to support them as they grow the ecosystem for data pipeline validation in the age of AI as part of our ongoing mission of 10+ years: Bringing critical enterprise infrastructure to market.”

“AI models bring a large degree of randomness to software development, especially for data-intensive applications,” said Brian Behlendorf, an open-source pioneer and founding member of the Apache Software Foundation who has also joined Recce’s board. “This raises the premium on data-forward testing tools to get closer to predictability. Until now, that’s been done in a bespoke manner and largely by hand. CL, who has been a longtime collaborator of mine on open-source projects, is solving this problem beautifully with the Recce toolkit, and I’m glad to be supporting him.”

Book a session to learn more about how to join the Recce Cloud beta.

Recce founder and CEO CL Kao will speak on how AI is impacting data pipelines and deployment strategies at the Data Council conference in Oakland, Calif., on Thursday, April 24.

Business Wire
