Question 1

What is product data enrichment?

Accepted Answer

Product data enrichment turns source-specific ecommerce data into records you can filter and compare. The input can be a Capture from a public page or an imported catalog row. Enrich maps categories to an industry-standard taxonomy, structures category attributes and product signals, normalizes options and commerce data, and translates captured text to English while retaining the source evidence.

Question 2

How is enriched product data different from raw scraped data?

Accepted Answer

A raw Capture preserves what the page exposed in the language and structure the merchant used. Enrich maps that evidence into one ecommerce model: normalized product content, taxonomy, category attributes, signals, options, identifiers, embedded SKU and Offer data, review aggregates, lineage, and the source URL. Each source still contributes only the facts it exposes.

Question 3

What does one enriched product record contain?

Accepted Answer

Enrich produces one page-grain Item per Capture. The Item contains normalized product content, English text alongside the source text, taxonomy, category-specific attributes, product signals, option-matrix data, identifiers when present, embedded source SKU and Offer data, review aggregates, lineage, and source context. Extend later explodes those Items into beta Products, exact Variants, Listings, Offers, Reviews, and Stores for cross-store analysis.
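
As a rough illustration of the record described above, here is a minimal Python sketch of one page-grain Item. Every class and field name is hypothetical, chosen only to mirror the prose; this is not Extralt's actual schema, and the URL and values are placeholders.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class LocalizedText:
    """Source text kept alongside its English translation (hypothetical shape)."""
    source: str                    # text as captured, in the merchant's language
    source_language: str           # assumed language tag, e.g. "de"
    english: Optional[str] = None  # English text, when a translation exists


@dataclass
class Item:
    """One page-grain record per Capture. All field names are assumptions."""
    source_url: str
    title: LocalizedText
    taxonomy_path: list[str] = field(default_factory=list)
    attributes: dict[str, str] = field(default_factory=dict)    # category-specific
    signals: dict[str, bool] = field(default_factory=dict)      # product signals
    options: dict[str, list[str]] = field(default_factory=dict) # option matrix
    identifiers: dict[str, str] = field(default_factory=dict)   # empty if none exposed
    skus: list[dict] = field(default_factory=list)              # embedded SKU and Offer data
    review_count: Optional[int] = None                          # review aggregates
    review_rating: Optional[float] = None
    lineage: dict[str, str] = field(default_factory=dict)


# Placeholder values: a page that exposes options but no identifiers or reviews.
item = Item(
    source_url="https://shop.example/p/123",
    title=LocalizedText(
        source="Laufschuh Herren",
        source_language="de",
        english="Men's running shoe",
    ),
    taxonomy_path=["Apparel & Accessories", "Shoes"],
    options={"size": ["42", "43"]},
)
```

Fields the page did not expose stay empty or `None` rather than being guessed, which matches the idea that each source contributes only the facts it exposes.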

Question 4

Which ecommerce sources can Extralt enrich?

Accepted Answer

Enrichment runs on Captures produced by Extract, whether they come from validated public ecommerce sources or imported catalog files. Coverage depends on the specific pages and access constraints. Once a Capture exists, Enrich applies the same ecommerce model across retailer, marketplace, DTC, and catalog inputs.

Question 5

How does Extralt match the same product across multiple sellers?

Accepted Answer

Extend performs cross-store identity resolution after Enrich. It uses identifiers, normalized product evidence, and similarity signals to connect Products and exact Variants to their source Listings and Offers. Extralt does not claim that matching is exhaustive or infallible, so the source relationships remain available for inspection.
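
To make the idea concrete, here is a minimal Python sketch of identifier-first matching with a similarity fallback. It is an assumption about how such matching could work, not a description of how Extend is implemented. The function names, the `gtin`, `brand`, and `title` keys, and the 0.85 threshold are all hypothetical.

```python
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so superficial differences don't matter."""
    return " ".join(text.lower().split())


def same_product(a: dict, b: dict, threshold: float = 0.85) -> tuple[bool, str]:
    """Return (matched, basis) so the reason for a decision can be inspected.

    Hypothetical rules: a shared identifier decides the match outright;
    otherwise brands must agree and titles must be similar enough.
    """
    gtin_a, gtin_b = a.get("gtin"), b.get("gtin")
    if gtin_a and gtin_b:
        # Both records expose an identifier: trust it over text similarity.
        return (gtin_a == gtin_b, "identifier")

    if normalize(a.get("brand", "")) != normalize(b.get("brand", "")):
        return (False, "brand")

    score = SequenceMatcher(
        None, normalize(a["title"]), normalize(b["title"])
    ).ratio()
    return (score >= threshold, "similarity")


# Placeholder records from three imaginary stores.
store_a = {"gtin": "00012345678905", "brand": "Acme", "title": "Acme Trail Shoe 42"}
store_b = {"gtin": "00012345678905", "brand": "acme", "title": "ACME trail shoe, size 42"}
store_c = {"brand": "ACME", "title": "Acme  Trail Shoe 42"}  # no identifier exposed
store_d = {"brand": "Other", "title": "Other Boot"}

by_identifier = same_product(store_a, store_b)
by_similarity = same_product(store_a, store_c)
no_match = same_product(store_a, store_d)
```

Returning the basis of each decision alongside the result is one way to keep a match open to review, since text similarity can produce both false positives and missed matches.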

Question 6

What languages does enrichment cover?

Accepted Answer

Enrich can translate captured titles and descriptions to English while keeping the original-language source text. Actual coverage depends on the captured source content and should be validated for the languages in your dataset.

Product pages. One ecommerce model. Across your sources.

Start with your catalog, then compare it to the market.

Inspect products, categories, and brands through one model.

What's inside this product record?

What's the shape of this category?

How is this brand's catalog structured?

Add cross-store relationships when the workflow needs them

What you get

Built for catalogs that come from everywhere.

One model across sources

Customer-facing data, not a separate feed

Taxonomy and attributes included

For teams whose data comes from many sources and needs to look like one.

Frequently asked questions

Turn your next Capture into structured product data.