Velou builds AI data infrastructure for the retail AI boom
Velou, which sells AI-powered self-optimizing product catalogs, raises $5M and adds Gavin Hewitt as COO

.png)
Every ecommerce team knows they need product titles, descriptions, and images. Most know they need color, size, and category classification. But there is a second tier of product attributes — less obvious, rarely completed, consistently overlooked — that have outsized commercial impact relative to the attention they receive. These are not niche data points. They are attributes that determine filter visibility, AI agent evaluation, return rates, and agentic discovery outcomes. Getting them right is not an advanced optimization. It is basic commercial hygiene that most catalogs are failing.
.png)
The traditional catalog manager role — the person who maintains product listings, writes descriptions, manages the feed, and runs the annual data clean-up project — is becoming obsolete. Not because the work is less important than it used to be (it is more important than it has ever been) but because the nature of the work is changing faster than most job descriptions have caught up with. The teams building durable commercial advantages from product data in 2025 are not better at the old version of this role. They are doing a fundamentally different job.
.png)
Benchmarks matter because "improve your product data" is not an actionable brief. "Your attribute completeness rate is 58% against a category benchmark of 82%" is. This report provides specific, category-level product data quality benchmarks across the metrics that most directly affect commercial outcomes — drawn from catalog analyses, channel platform data, and observed performance ranges. It is designed to be usable: take the benchmarks relevant to your business, measure your current state against them, and use the gap as your enrichment mandate.
.png)
Most ecommerce teams know their product data is not where it should be. What most teams lack is a clear framework for understanding where they currently are, what "good" looks like at their scale, and what the specific next step is from their current position. The Product Data Maturity Model provides exactly this framework — five levels, each with specific characteristics, specific commercial consequences, and a specific advancement path. It is designed not as a grade but as a map.
.png)
Every year, product data gets talked about more and improved less. The gap between what ecommerce teams know about data quality and what they actually do about it has become one of the defining operational dysfunctions of the industry. In 2025, with AI agents actively evaluating product catalogs, that gap has started to have consequences that are both measurable and irreversible. This report synthesizes findings from catalog analyses, channel performance data, and commercial outcome research to give you an accurate picture of where the industry actually stands — and what separates the retailers who are pulling ahead from those who are falling behind.
.png)
A significant share of most ecommerce catalogs is effectively invisible. Not suppressed with an error message. Not flagged in any dashboard. Simply absent from the search results where it should appear, quietly missing the sales it should be generating, while every analytics report shows nothing unusual because products that are not found generate no sessions, no clicks, and no revenue to report. This is the invisible product problem — and solving it is one of the highest-ROI data quality activities available to an ecommerce team because it costs nothing to fix and reveals revenue that was always there.
.png)
Value normalization is the enrichment discipline that nobody talks about — and that everyone is quietly suffering from. It is the process of taking a catalog where "Navy Blue," "navy," "dark navy," "midnight blue," "cobalt," and "ink blue" are all stored as separate color values for what should be one canonical color, and resolving them to a consistent, machine-queryable standard. It sounds tedious. It is. It is also the foundational prerequisite for faceted filtering to work correctly, for AI agent attribute matching to be reliable, and for your catalog to behave as a coherent data asset rather than an accumulated set of inconsistent records.
.png)
A product data audit is the essential first step in any enrichment project, any channel launch, and any commercial performance investigation where product data may be a contributing factor. Without one, you are enriching blind — unable to prioritize correctly, unable to measure improvement, and unable to distinguish data-caused performance problems from other causes. This guide provides a complete, repeatable audit template that any ecommerce team can execute in a week.
.png)
Your Google Shopping feed is not just a data file you upload to Merchant Center. It is the primary data input to the Shopping Graph — the knowledge base that determines your product's eligibility, ranking, and format availability across Google Shopping ads, AI Overviews, Gemini recommendations, and organic Shopping results. A feed with 85% approval rate is a feed with 15% of your ad spend producing zero impressions. Feed optimization is one of the highest-ROAS activities available to any Google Shopping advertiser.
.png)
Schema.org Product markup is one of the most powerful yet most inconsistently implemented tools in ecommerce. When done correctly, it enables star ratings in Google Search results, rich product panels in Shopping, AI agent readability on your product pages, and a trust signal that strengthens your entire Shopping Graph entity. When done incorrectly — static templates, incomplete fields, conflicts with your live data — it actively damages your performance. This guide gives you the precise implementation specification you need to get it right.
.png)
GTINs are among the most undervalued fields in an ecommerce product record — and among the most commercially impactful to get right. They are not just an identifier that confirms a product exists. They are the key that unlocks entity matching in Google's Knowledge Graph, cross-merchant comparison formats in Google Shopping, aggregated review data on multiple channels, and improved confidence scoring in AI agent evaluation systems. A missing or incorrect GTIN is a quietly compounding commercial disadvantage across every channel you sell on.
.png)
Not all product attributes have equal commercial impact. Enriching your catalog's attribute coverage without prioritizing by impact is a common reason enrichment projects produce less commercial improvement than expected — teams spend weeks filling attributes that have minimal effect on filter traffic or algorithm rank, while the attributes that drive the most query exclusion remain empty. This guide gives you a framework for identifying which attributes matter most in your specific catalog, why, and how to enrich them correctly.
.png)
Your product title is the single highest-weight field in every algorithm that determines your commercial discoverability — on Google Shopping, on Amazon, on your own website's search engine, and in AI agent evaluation systems. It is the first text your product's algorithm encounters, the first text a shopper reads, and the primary signal that determines whether your product appears for a given query. Getting it right is not a copywriting exercise. It is a structured data optimization exercise with measurable commercial impact.
.png)
Voice search and AI assistants were supposed to revolutionize ecommerce five years ago — and in the end, the revolution came late and came quietly. Today, with AI assistants deeply embedded in smartphones, smart speakers, and automotive interfaces, voice is a meaningful and growing discovery channel. More importantly, the way voice queries work has evolved: they are no longer simple "find me [product]" commands. They are multi-criteria, conversational, attribute-specific queries — and they evaluate product data the same way any agentic system does. Structured attributes, not keyword density, determine who appears.
.png)
Most discussions of product data and AI stay at the strategic level: "make your data more structured," "add more attributes," "be precise." This article goes further. It explains the technical specifications that determine how AI systems read, parse, and evaluate product data — at the level of data types, schema fields, and format requirements. If you are responsible for catalog data quality and you want to understand precisely what "AI-ready" means in implementable terms, this is the article for you.
.png)
Google's AI Overviews have arrived in product search — and they are fundamentally different from organic search results and Shopping ads. They are not ranked links. They are generated answers, with embedded product recommendations, that appear before any other search result. For ecommerce managers, the question is not whether AI Overviews will affect your traffic — they already do. The question is what determines whether your products appear in them, and what you need to change to be consistently included.
.png)
Amazon's Rufus shopping assistant is not an experimental feature. It is embedded in Amazon's main mobile app and website interface, accessible to every Amazon shopper, and actively reshaping how products are discovered on the world's largest ecommerce platform. If you are selling on Amazon, Rufus is already evaluating your listings. The question is whether your product data is structured to perform in that evaluation — or whether you are invisible to a growing share of Amazon's highest-intent discovery traffic.
.png)
The shift to agentic commerce is not arriving with an announcement or a deadline. It is accumulating — one agent-driven product search at a time, one Gemini product recommendation, one Rufus query, one browser AI shopping session. By the time most retailers treat it as urgent, the first-mover window will have closed. This self-assessment gives you the framework to measure where your catalog actually stands today — across the six dimensions that determine agent visibility — and a prioritized action plan based on where your biggest gaps are.
.png)
Most ecommerce managers understand, in the abstract, that AI shopping agents are becoming important. What most don't have is a precise model of what happens inside the evaluation — the specific steps an agent takes between receiving a shopper's intent and deciding which products to recommend. That precision is what this article provides. Because once you understand the mechanism, the data gaps that are costing you agent visibility become obvious — and the fixes become targeted rather than generic.
.png)
Something fundamental is changing in how products get discovered, compared, and bought online. It is not a new platform or a new ad format. It is a new participant in the buying process: an AI agent that acts autonomously on behalf of the shopper. Understanding what this means — specifically what it means for your product data — is one of the most commercially urgent things an ecommerce manager can do right now. Because the retailers who understand it first will build data infrastructure that compounds in advantage. Those who don't will find themselves invisible to a growing share of purchase-ready traffic.
.png)
Buying the wrong product data enrichment software is an expensive mistake — not just because of the software cost, but because of the implementation time, the team adjustment, the integration work, and the opportunity cost of 6–12 months without the results you expected. The market is full of vendors who use "AI," "enrichment," and "automation" as interchangeable terms for very different capabilities. This guide gives you the specific questions, evaluation criteria, and red flags that separate tools worth investing in from those that will disappoint.
.png)
The product data tooling landscape has fragmented significantly in the past three years. Every category of tool — PIM systems, feed managers, general AI writing platforms, and purpose-built commerce AI — now claims to offer "product data enrichment." Most don't. This guide gives you an honest, category-by-category breakdown of what each tool type actually does, where it genuinely adds value, and what its limits are — so you can make an investment decision based on your catalog's real needs rather than vendor claims.
.png)
Enriching a catalog of 10,000 SKUs is not a larger version of enriching 100. At that scale, the operational challenges change qualitatively: source data arrives in dozens of different formats from different suppliers, attribute taxonomies have thousands of edge cases, channel requirements differ across categories, and the ongoing maintenance load grows as fast as you can address the backlog. Getting to full-catalog enrichment at that scale requires not just AI tooling but a systematic operational approach — a defined pipeline, a quality framework, a batch strategy, and a monitoring system that maintains what you build.
.png)
The arrival of AI enrichment tools prompts an instinct to automate everything. Automate nothing and you have the manual throughput problem. Automate everything and you have an accuracy problem — specifically, the problem of confidently wrong data published at scale, with no human having reviewed it before it reached your customers, your channels, and your algorithms. The right answer is a deliberate division of labor: understand precisely which enrichment tasks benefit from automation, which require human judgment, and which benefit from the combination of both.
.png)
When ChatGPT became widely available, ecommerce teams quickly discovered it could write product descriptions. Many teams are still using it for this — or tools built on top of it — under the assumption that AI-generated content is AI-enriched data. It is not. The gap between what a general AI writing tool produces and what genuine product data enrichment requires is significant, and closing it with the wrong tool creates a category of problem that is worse than having sparse data: it creates confidently wrong data. This article explains the failure modes clearly, so you can make an informed decision about what your catalog actually needs.
.png)
AI has fundamentally changed what is possible in product data enrichment — not incrementally, but structurally. Tasks that previously required a specialist team working for weeks can now be completed in hours. Catalogs that were too large to fully enrich with human labor can be brought to complete coverage. But not all AI enrichment is equal, and the market is full of tools that use AI as a label while delivering results that fall well short of what purpose-built commerce AI achieves. This guide explains how AI enrichment actually works, what the key capability categories are, and how to tell the difference between a tool that will genuinely transform your catalog quality and one that will disappoint.
.png)
B2B buyers were the original AI agents. Long before AI shopping assistants existed, procurement managers were executing structured, criteria-based product evaluations against detailed specifications — comparing attribute values, checking compliance standards, verifying compatibility parameters, and matching technical tolerances against engineering requirements. They didn't browse emotionally. They evaluated systematically. Which means the product data challenge in B2B ecommerce has always been more demanding than in B2C — and the cost of data gaps is immediately and quantifiably higher.
.png)
Every marketplace has a different data model, a different ranking algorithm, and a different buyer intent profile. What makes a listing perform on Amazon frequently does not translate directly to Walmart Marketplace or eBay. Yet most multi-marketplace sellers apply the same product data strategy across all three — usually defaulting to whatever their Amazon setup looks like. The result: sub-optimized performance on every channel except the one they treat as primary.
.png)
Shopify is the world's most widely used ecommerce platform. It handles payments, inventory, storefront design, and order management with remarkable simplicity. What it does not handle — and what many Shopify merchants only discover after their catalog grows beyond a few hundred SKUs — is product data enrichment at the level that multi-channel commerce performance actually requires. Understanding exactly where Shopify's native capabilities end, and where you need to go beyond them, determines how much of your catalog's commercial potential you can actually realize.
.png)
Your DTC website is the one sales channel you fully control. You set the data model, the navigation structure, the search logic, and the on-page experience. That control is an enormous advantage — but it also means you bear full responsibility for every data quality failure. No algorithm will suppress your listings or alert you to invisible exclusion. The consequences are quieter: slightly lower organic traffic, slightly worse filtered search results, slightly higher bounce rates on product pages that don't answer shoppers' questions. Quietly compounding until someone notices the revenue gap.
.png)
Google Shopping is not a level playing field. Two retailers can bid the same CPC on the same query for the same product type — and receive radically different impression shares, click-through rates, and ROAS. The differentiator is almost never the bid. It is the quality, completeness, and structure of their product data. Retailers who understand Google Shopping at the data level — not just the bidding level — have a structural cost advantage over those who don't. This guide explains how to build that advantage.
.png)
More than 60% of U.S. product searches start on Amazon — not Google. For many categories, Amazon is where purchase decisions are made, comparisons are settled, and brand preferences are formed. The challenge: you have no brand experience layer to fall back on. No custom navigation, no editorial storytelling, no loyalty mechanics. Your product data is your entire competitive toolkit. Get it right, and you rank. Get it wrong, and you are invisible — regardless of price, quality, or how good your Sponsored Products budget is.
.png)
Ecommerce returns are treated as a logistics problem. The CFO looks at reverse logistics costs. The operations team optimizes the returns processing workflow. The customer experience team refines the returns portal. All of this is rational — but it is optimizing the symptom rather than addressing the cause. For a significant proportion of returns, the root problem is not the product, the logistics, or the customer. It is the data.
.png)
Most ecommerce teams think of Google Shopping as a bidding problem. ROAS underperforms? Adjust the bids. CPC too high? Tweak the audience targeting. Impression share dropping? Raise the budget. These instincts are understandable — but they are often treating the symptom rather than the cause. In a significant proportion of underperforming Google Shopping accounts, the root cause is product data quality — and the single most impactful data quality problem is sparse, thin, or imprecise product descriptions.
.png)
Your product is live on Amazon. It has competitive pricing, decent reviews, and an active Sponsored Products campaign. But organic sessions have dropped sharply. Sales are thin. Your ads are running but clicks are low. You check the listing and everything looks normal. What you have not checked — and should have — is your Amazon listing quality score. Your product may be suppressed from organic search entirely.
.png)
You ran a product data enrichment project 12 months ago. Titles were optimized, attributes were filled in, descriptions were rewritten. Performance improved. Then, slowly, it didn't. Traffic dipped a little. ROAS softened. Amazon rank on a few key products quietly slipped. Nothing dramatic — just a persistent, creeping underperformance that nobody could quite explain.
.png)
You've done the work. The products are live, the photography is good, the prices are competitive. But when shoppers search on your website — or in Google — certain products simply don't appear. You check your analytics. Traffic is low but not zero. The SEO team says the pages are indexed. Nothing obvious is broken. So what's going wrong?
.png)
Every ecommerce team knows, in theory, that product data quality affects performance. What almost no team has done is calculate how much poor product data is actually costing them — in specific, quantifiable terms across each of the channels where the cost shows up. This article does that calculation. Not in abstract percentages, but with the mechanisms explained clearly enough to build your own business case.
.png)
"Product data enrichment" and "product data quality" are used interchangeably in most ecommerce conversations. They are not the same thing. Conflating them leads to misaligned investment decisions — teams that think they have an enrichment program when they have a quality monitoring program, and teams that think quality audits are enrichment when they are not. The distinction is simple but consequential.
.png)
Most ecommerce teams have done some version of product data enrichment. They've written better product descriptions, fixed a batch of Merchant Center errors, or pushed the team to fill in missing attributes before a big launch. But sustained, systematic enrichment — the kind that compounds into a genuine performance advantage — is rare. The reason is not effort. It's a set of recurring mistakes that are deeply embedded in how teams think about and resource the work.
.png)
Your product is only as discoverable as the data describing it. That sentence sounds obvious. But most ecommerce teams treat it as a content problem when it is actually a data architecture problem — and the difference between those two framings determines whether enrichment gets treated as a creative task or a commercial infrastructure investment.

.png)
.png)







%20(1).png)


.png)
.png)

.png)



.png)
