The extraction API that shows its work.

ShopGraph extracts product data from any URL and shows you exactly how each field was derived. Per-field provenance based on extraction method, not opinion. Your automation decides what's reliable enough to act on.

Try it. 50 free calls/month · No signup. The quota resets monthly.

Provenance and Freshness

Every extracted field carries its provenance and its freshness: which extraction method produced the value, and how recent it is. Your agents act on data with visible origins, so they know where each value came from and whether it is current enough to act on.
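
As a rough illustration of acting on freshness, the sketch below assumes a hypothetical per-field record with `value`, `method`, and `extracted_at` keys. These names are placeholders chosen for the example, not ShopGraph's documented response format.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical field record: the key names are assumptions made for this
# example, not ShopGraph's documented response format.
field = {
    "value": "129.00",
    "method": "schema_org",
    "extracted_at": "2025-01-01T12:00:00+00:00",
}

def is_fresh(field: dict, max_age: timedelta, now: datetime) -> bool:
    """Return True if the field was extracted within max_age of now."""
    extracted_at = datetime.fromisoformat(field["extracted_at"])
    return now - extracted_at <= max_age

# A fixed "now" keeps the example deterministic; the field is 6 hours old.
now = datetime(2025, 1, 1, 18, 0, tzinfo=timezone.utc)
print(is_fresh(field, timedelta(hours=24), now))  # True
print(is_fresh(field, timedelta(hours=1), now))   # False
```

An agent could run a check like this before acting on a price, and re-extract when the value is too old for the decision at hand.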

Native UCP Output

Standardize your entire catalog pipeline. ShopGraph extracts complex variant matrices and instantly normalizes them into Universal Commerce Protocol JSON, strictly validated against the official ucp-schema. One schema for every merchant.

AgentReady Scoring

Score how well a site supports automated extraction before committing pipeline resources. ShopGraph evaluates target URLs across five active dimensions of machine-readability and returns a 0-100 score.
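
One way to use the score is as a gate in front of pipeline work. The sketch below is illustrative only: the `should_onboard` helper, the cutoff of 70, and the sample scores are all assumptions, and real scores would come from the API.

```python
def should_onboard(score: int, minimum: int = 70) -> bool:
    """Gate pipeline work on a 0-100 score; the default cutoff is arbitrary."""
    if not 0 <= score <= 100:
        raise ValueError(f"score out of range: {score}")
    return score >= minimum

# Hypothetical scores keyed by site, hard-coded here for illustration.
scores = {"shop-a.example": 88, "shop-b.example": 41}

ready = [site for site, score in scores.items() if should_onboard(score)]
print(ready)  # ['shop-a.example']
```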

Automatic Tier Escalation

Three extraction methods cross-check each other, arranged as an escalating pipeline: Schema.org → LLM extraction → headless Playwright rendering. Each field records which method produced its value, alongside how fresh that value is.
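
The escalation idea can be sketched with stub extractors. This is a simplified model, not ShopGraph's implementation: it assumes escalation happens when a required field is still missing, it does not model cross-checking, and the function and tier names are hypothetical.

```python
from typing import Callable

# Each extractor takes a URL and returns the fields it found (possibly partial).
Extractor = Callable[[str], dict]

def extract_with_escalation(
    url: str, tiers: list[tuple[str, Extractor]], required: list[str]
) -> dict:
    """Run tiers in order, stopping once every required field has a value.

    Each field keeps the name of the first tier that produced it.
    """
    result: dict = {}
    for name, extractor in tiers:
        for key, value in extractor(url).items():
            if key not in result and value is not None:
                result[key] = {"value": value, "method": name}
        if all(key in result for key in required):
            break  # nothing left to find, so skip the costlier tiers
    return result

# Stub extractors standing in for the three methods (assumed behavior).
def schema_org(url: str) -> dict:
    return {"title": "Trail Shoe", "price": None}

def llm(url: str) -> dict:
    return {"title": "Trail Shoe (LLM)", "price": "129.00"}

def playwright(url: str) -> dict:
    raise AssertionError("not reached: earlier tiers filled every field")

tiers = [("schema_org", schema_org), ("llm", llm), ("playwright", playwright)]
out = extract_with_escalation("https://shop.example/p/1", tiers, ["title", "price"])
print(out["title"])  # {'value': 'Trail Shoe', 'method': 'schema_org'}
print(out["price"])  # {'value': '129.00', 'method': 'llm'}
```

Here the title comes from the first tier and the price from the second, so the third tier never runs, and each field's `method` records where its value originated.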

RFC 9421 Signature Compliance

ShopGraph signs outbound requests with HTTP Message Signatures (RFC 9421) using Ed25519 keys. Commerce sites in the Cloudflare Signed Agents directory can verify ShopGraph as a registered agent, enabling fast-lane access past bot detection. Signing does not bypass account authentication or commercial agreements.
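
For readers unfamiliar with RFC 9421, the sketch below shows how the string that gets signed (the signature base) is assembled from covered request components. Which components ShopGraph covers is not stated here, so the component list, key ID, and timestamp are placeholders. The Ed25519 signing step is left out because it needs a cryptography library outside the standard library.

```python
def signature_params(components: list[str], created: int, keyid: str) -> str:
    """Serialize covered components and parameters as an RFC 9421 inner list."""
    covered = " ".join(f'"{c}"' for c in components)
    return f'({covered});created={created};keyid="{keyid}"'

def signature_base(
    values: dict[str, str], components: list[str], created: int, keyid: str
) -> str:
    """Build the signature base: one line per covered component,
    then the @signature-params line, joined by newlines."""
    lines = [f'"{c}": {values[c]}' for c in components]
    params = signature_params(components, created, keyid)
    lines.append(f'"@signature-params": {params}')
    return "\n".join(lines)

# Placeholder request values, key ID, and timestamp (assumptions).
components = ["@method", "@authority", "@path"]
values = {"@method": "GET", "@authority": "shop.example", "@path": "/products/1"}

base = signature_base(values, components, created=1735732800, keyid="example-key")
print(base)
```

A signer would compute an Ed25519 signature over the UTF-8 bytes of `base` and send it in the `Signature` header, with the same parameter string in `Signature-Input`. A verifier rebuilds the base from the received request and checks the signature against the agent's public key.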

Design Principles

Extraction provenance.

Every field tells you how it was extracted and how fresh it is. You decide what to act on.

Access without infrastructure.

ShopGraph handles the extraction pipeline. Your agents get structured data with provenance and freshness on every field. You don't manage proxies or extraction infrastructure.

One schema, any merchant.

Every response follows the same structure regardless of the source site. Write one parser, use it everywhere.

Full provenance.

Every response shows which extraction method produced each field. Your downstream systems make informed decisions.

Your threshold, your rules.

All data comes back with quality signals attached. You set the bar for what's trusted, flagged, or rejected.
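
A consumer-side policy for trusted, flagged, and rejected values might look like the sketch below. The rule table, method names, and age limits are assumptions made for the example; ShopGraph supplies the signals, and the thresholds are whatever your pipeline chooses.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Rule:
    accepted_methods: frozenset  # extraction methods this field may come from
    max_age_hours: float         # how old a value may be before it is flagged

# Hypothetical per-field policy: strict for price, lenient for title.
RULES = {
    "price": Rule(frozenset({"schema_org"}), 1),
    "title": Rule(frozenset({"schema_org", "llm", "playwright"}), 168),
}

def classify(name: str, method: str, age_hours: float) -> str:
    """Return 'trusted', 'flagged', or 'rejected' for one extracted field."""
    rule = RULES.get(name)
    if rule is None or method not in rule.accepted_methods:
        return "rejected"  # no rule, or a method this field does not accept
    if age_hours > rule.max_age_hours:
        return "flagged"   # acceptable origin, but too old to act on unchecked
    return "trusted"

print(classify("price", "schema_org", 0.5))  # trusted
print(classify("price", "schema_org", 5))    # flagged
print(classify("price", "llm", 0.1))         # rejected
```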