ExtraltExtralt
products/02/ 04

Enrich

Normalize, classify, and generate fine-grained attributes and market signals.

What is Enrich?

Enrich transforms each capture into standardized, queryable Items. AI analyzes both the extracted text and product images to translate to English, classify products, and extract attributes. It also extracts market signals - price positioning, style, seasonality - that capture what a product means beyond its physical attributes. Each capture typically produces multiple Items when non-size options split the source page into distinct sellable records.

Why Enrich?

Raw scraped data is messy. Product titles are in different languages. Categories vary between sites. The same product has different names on different marketplaces.

Enrich solves this by applying a standardization pipeline to every capture. The result is clean, consistent data you can actually analyze and compare.

Consistent schema across any ecommerce site. Ready for analysis, comparison, and integration into your systems. The foundation for building product relationships in Extend.

How it works

When you enable Enrich, every capture goes through an automatic enrichment pipeline. AI processes two inputs: the extracted text (title, description, brand...) and the product images from the page.

Language normalization

All product titles, descriptions, and attributes are translated to English for consistent querying and analysis.

Taxonomy classification

AI analyzes both text and images to classify products using an industry-standard taxonomy, containing over 10,000 categories and subcategories.

Attribute extraction

Category-specific attributes are then extracted from text and inferred from images. This allows generating much more fine-grained data than was available in the webpage at extraction time.

Signal extraction

Market signals capture context beyond physical attributes — price tier, style, use context, seasonality — to understand how a product is positioned in the market.

Option mapping & item assembly

Raw store options (e.g., "Couleur: Blanc") are mapped to taxonomy attributes (color: White). Items are assembled by grouping option combinations, multiplying each capture into structured, queryable Enrich outputs with transformed offers.

Features

  • AI analysis of both text and product images
  • English normalization for all text fields
  • Industry-standard taxonomy classification
  • Category-specific attribute extraction
  • Market signal extraction (price tier, style, seasonality, and more)
  • Option mapping to taxonomy attributes
  • Same-store re-capture deduplication

Use cases

Catalog standardization

Unify product data from multiple sources into a single, consistent format for your product database.

Cross-marketplace analysis

Standardized data makes it possible to compare products across marketplaces using consistent categories and attributes.

See Enrich in action

Pricing

1 credit per capture

Each capture processed produces multiple listings — one per non-size option combination (e.g., a shoe in 8 colors = 8 listings from 1 capture, 1 credit). You get classified, English-translated product intelligence with taxonomy, attributes, and market signals.