Product data extraction
that tells you when it's guessing.

Extract product data from any commerce URL. Every field comes with a confidence score so your agents know what to act on and what to verify.

Try it. 50 free calls/month · No signup. Your quota resets every month.
0.70

Confidence Scoring

Every extracted field includes a confidence score from 0.0 to 1.0 showing how it was derived. Set a threshold (e.g., 0.85) and the engine filters fields that fall below it. Your agents act on data they can verify, not data they have to trust.

Native UCP Output

Standardize your entire catalog pipeline. ShopGraph extracts complex variant matrices and instantly normalizes them into Universal Commerce Protocol JSON, strictly validated against the official ucp-schema. One schema for every merchant.

AgentReady Scoring

Deploy your agents with complete environmental awareness. ShopGraph evaluates target URLs across 6 dimensions of machine-readability and access readiness, providing a comprehensive 0-100 score on extraction viability.

Self-Healing Fallbacks

Three extraction methods cross-check each other. When two tiers agree on a price and one dissents, the confidence score reflects that signal. The pipeline: Schema.org → LLM extraction → Headless Playwright rendering. Self-healing means the confidence contract is calibrated, not just the extraction.

RFC 9421 Signature Compliance

ShopGraph signs outbound requests with RFC 9421 (Ed25519). Commerce sites in the Cloudflare Signed Agents directory can verify ShopGraph as a registered agent, enabling fast-lane access past bot detection. This does not bypass account authentication or commercial agreements.

Design Principles

Transparent uncertainty.

Every field tells you how confident the system is. You decide what to trust.

Access without infrastructure.

ShopGraph handles the extraction pipeline. Your agents get structured data with confidence scores. You don't manage proxies or identity stacks.

One schema, any merchant.

Every response follows the same structure regardless of the source site. Write one parser, use it everywhere.

Full provenance.

Every response shows which extraction method produced each field. Your downstream systems make informed decisions.

Your threshold, your rules.

All data comes back with quality signals attached. You set the bar for what's trusted, flagged, or rejected.