Customer Data Platform (CDP)
Also known as: CDP, Customer 360 platform
A Customer Data Platform (CDP) is software that ingests customer data from every source a company has — web, app, CRM, support, billing, ad platforms — resolves which records belong to the same person, and serves the result as one persistent, governed profile that operational tools can act on in real time. The hard part isn't storage; it's the identity decision that says the anonymous web visitor, the email subscriber, and the support ticket are the same human. A warehouse can hold the data. A CDP is judged by how well it unifies and activates it.
Why it matters
Scattered customer data isn't a tidiness problem — it's a revenue and risk problem with a price tag. You can't personalize, suppress, attribute, or measure against a customer you can't assemble. Every fragmented profile is a wasted ad impression, a duplicate outreach, or a churn signal you missed because it lived in three systems that never compared notes. For the executive, the business case is the second-order effects, not the platform license. Take an illustrative example: if a meaningful slice of your 'new leads' are actually existing customers your systems failed to recognize, you're paying acquisition cost to re-acquire people you already have — and annoying them in the process. A CDP's job is to make that slice visible and shrink it. For the researcher, the honest framing: a CDP is identity resolution plus a serving layer. Its output quality is capped by your matching logic and source data. It does not create truth; it reconciles the versions you already have.
- Collapses duplicate and conflicting records so campaigns and agents act on real people, not fragments.
- Gives every downstream tool one shared definition of a customer, a segment, and — critically — a consent state.
- Becomes the prerequisite layer for AI: an agent is only as accurate as the unified profile it reads.
How it works
A CDP runs four stages. Ingestion pulls data via streaming events, batch loads, or direct connectors. Identity resolution stitches records together using deterministic keys (a shared email or user ID) and probabilistic matching (same device, IP, and behavior pattern). Profile unification merges the stitched records into one golden profile with full attribute and event history, applying survivorship rules to decide which conflicting value wins. Activation pushes segments and traits back out to ad platforms, messaging tools, and increasingly to AI agents that act on them. The part the marketing skips: identity resolution is where CDPs quietly succeed or fail, and it's a tuning problem with no free setting. Deterministic matching is accurate but sparse — it misses everyone anonymous. Probabilistic matching fills the gap but risks false merges, fusing two different people into one profile, which then corrupts every decision made downstream. Loosen the threshold and you over-merge; tighten it and you fragment the same customer across several profiles. A serious CDP evaluation is really an evaluation of its matching logic, its survivorship rules, and whether you can audit and reverse a bad merge after the fact.
- Deterministic match: reliable, needs a shared hard identifier, blind to anonymous traffic.
- Probabilistic match: broader reach, must be tuned and monitored or it silently corrupts profiles.
- Consent and governance travel with the profile, not bolted on after — a hard requirement under GDPR, CPRA, and their successors.
Where it fits — and where the line is moving
Classic CDPs (Segment, mParticle, Tealium) own a copy of the data — they ingest it, store it again, and unify the copy. The newer pattern is the composable or 'zero-copy' CDP that runs on top of your existing warehouse or lakehouse; Salesforce Data Cloud is the prominent example, designed to resolve and unify data in place across Snowflake, BigQuery, or Databricks rather than physically extracting it. The reason this shift happened is unglamorous but real: the old model's biggest hidden cost was a second copy of your customer data that drifted out of sync with the system of record and quietly became wrong. The distinction worth holding onto: a warehouse stores and analyzes; a CDP unifies and activates. They increasingly share infrastructure, which is exactly why the category is confusing right now. But the CDP's reason to exist is narrow and specific — making a clean, consented, real-time profile available to the systems that touch the customer, and now to the agents that act for them. If a tool can't activate a profile in near real time to an operational system, it's a warehouse with marketing slides, not a CDP.
The CDP as the floor under AI agents
Here is the non-obvious shift. A CDP used to be a marketing convenience. In the agent era it becomes infrastructure, because an agent that handles a service case, qualifies a lead, or recommends a next action is only as good as the profile it reads from. Feed it fragmented, stale, or duplicated data and it will act on the wrong picture — confidently, and at machine speed, so the mistake scales before anyone notices. This is why 'data before agents' is a sequencing rule, not a slogan. The unified profile is the substrate the agent reasons over; identity resolution and freshness are the things that have to be right first. There's a second-order reason too: when your fee or your KPI is tied to the agent's outcome, you can't skip the data layer, because a clean profile is what makes the result measurable and the agent accountable. Get the floor right and the agent inherits a clean world. Skip it and you've automated your data debt — and made it harder to see. If you're scoping an agent program and aren't sure your profile layer is ready, run that audit before you build.
Frequently asked
What's the difference between a CDP and a CRM?
A CRM is a system of record that humans update to manage known relationships — accounts, deals, cases. A CDP is an automated unification layer that ingests data from many systems (including the CRM) and builds a complete, real-time profile per person, including the anonymous and behavioral data the CRM never sees. The CRM is one source the CDP reads; the CDP is not a replacement for it.
Is a CDP the same as a data warehouse?
No, though the line is blurring. A warehouse is built to store and analyze data at rest; a CDP is built to resolve identity and activate profiles in real time to operational tools. Composable CDPs like Salesforce Data Cloud now run directly on top of a warehouse, so the same data can serve both purposes without a second copy — but the CDP's distinct job is the unified, consented, action-ready profile, not the storage.
Do we need a CDP before deploying AI agents?
You don't always need a packaged CDP product, but you do need the capability it provides: a unified, governed, fresh customer profile. An agent acts on whatever data it can read, so fragmented or stale profiles produce confident wrong answers at scale. Getting identity resolution and data freshness right is the prerequisite — and the cheapest place to fix it, since the agent inherits whatever you hand it.
Related terms
Salesforce Data Cloud
Salesforce Data Cloud is the layer that ingests, unifies, and resolves identity across your data sources into a single real-time customer profile that the rest of Salesforce — including Agentforce AI agents — can reason and act on.
Identity Resolution
Identity resolution is the process of deciding which records scattered across your systems refer to the same real-world person or account, then linking them into one profile. It is fundamentally a probability problem: each candidate pair gets a match score, and a threshold turns that score into a yes-or-no decision. Set the threshold wrong and you either fragment one customer into many profiles or collapse two customers into one.
Agentforce
Agentforce is Salesforce’s platform for building AI agents — software that reasons over your business data, makes decisions, and takes actions inside Salesforce, governed by your existing permissions and audit trail. Unlike a chatbot that only replies, an agent can complete a task end to end.
Customer 360
Customer 360 is a single, unified view of everything an organization knows about a customer — identity, transactions, support history, web and product behavior, and consent — resolved from otherwise siloed systems into one trustworthy record per real person or account. It is a data-integration outcome, not a product or a screen. The bar that matters is whether every team and system, including an AI agent, can read the same resolved profile and act on it.
Ready when you are
Worth a
conversation?
Tell us one number you'd like AI to move. We'll show you how we'd do it, what it's worth, and how we'd tie our fee to getting you there.