TL;DR
You can set up automated waterfall enrichment for your CRM in five steps: prepare input data, upload and configure, let the cascade query 15+ sources, review results, then connect your CRM for continuous enrichment. Most teams see 20-30% higher fill rates than single-source tools.
If you've ever spent hours manually enriching leads only to end up with incomplete data, you're not alone. Most sales teams lose 20-30% of their pipeline to bad data - bounced emails, wrong phone numbers, outdated job titles. Poor data quality is the root cause.
Waterfall enrichment solves this. Instead of relying on a single data provider (and accepting their gaps), waterfall enrichment queries multiple sources in sequence until it finds complete, verified information.
In this guide, you'll learn how to set up waterfall enrichment for your CRM. By the end, you'll have automated data enrichment that pulls from 15+ sources - no manual work required.
Before You Start
Make sure you have:
- Admin access to your CRM (HubSpot, Salesforce, or similar)
- A Cleanlist account (free tier works for testing)
- A test list of 50-100 contacts to validate results
What is Waterfall Enrichment?
Waterfall enrichment is a multi-provider data strategy that queries premium data sources in sequence until it finds the best match. Think of it like a cascade - your input record flows through Provider 1, then Provider 2, then Provider 3, collecting verified data at each stop.
The key difference from traditional enrichment:
- Single-source enrichment: Query one provider → accept whatever they return (gaps and all)
- Waterfall enrichment: Query multiple providers → merge the best data from each into one complete record
The result? A "golden record" with verified emails, direct dial phone numbers, and complete firmographics - all from one API call.
Cleanlist's waterfall enrichment connects to 15+ premium data sources and handles the cascade logic automatically. You get complete records; we handle the complexity.
Step 1: Prepare Your Input Data
Before running enrichment, clean up your input file. The waterfall algorithm matches records based on the data you provide, so better input means better output.
Required fields (at least one):
- Full name + company name
- LinkedIn profile URL
- Work email address
- Company domain
Optional fields that improve match rates:
- Job title
- Location
- Phone number (for verification)
Export your contacts from your CRM as a CSV. Remove any duplicates and fix obvious formatting issues (all caps names, special characters in email fields).
Watch Out
Avoid using personal email addresses (Gmail, Yahoo) as input - they won't match to business profiles. Use company domains or LinkedIn URLs for best results.
Expected result: A clean CSV with 50-100 test records ready for enrichment.
Step 2: Upload to Cleanlist and Configure Enrichment
Log into your Cleanlist dashboard and navigate to Waterfall Enrichment.
- Click New Enrichment Job
- Upload your CSV file
- Map your columns to Cleanlist fields (the system auto-detects most formats)
- Choose your enrichment level:
| Enrichment Type | Credits | What You Get |
|---|---|---|
| Partial | 1 credit/record | Verified work email, LinkedIn URL, company data |
| Full | 11 credits/record | Everything in Partial + direct dial phone, mobile number |
For most sales teams, start with partial enrichment to validate emails, then run full enrichment on your highest-priority accounts.
- Click Start Enrichment
The cascade begins. Each record flows through our provider network - typically completing in under 30 seconds per record for partial enrichment.
Expected result: Job status shows "Processing" with a progress bar.
Step 3: Understand the Cascade Logic
Here's what happens behind the scenes when you run waterfall enrichment:
Sequential Query: Your input record is sent to Provider 1 (our highest-confidence source). If they return a match with verified data, we capture it. If not, we cascade to Provider 2, then Provider 3, and so on through 15+ sources.
Response Validation: Every response is validated before acceptance:
- Email syntax check
- Deliverability verification (is the mailbox active?)
- Company data cross-reference
- Freshness scoring (when was this data last updated?)
Best-Match Merge: If multiple providers return data, our algorithm selects the highest-quality value for each field. Provider 1 might have the best email, Provider 3 the best phone number - you get the best of both.
Golden Record Output: The final record includes 25+ fields:
- Contact: verified email, phone, LinkedIn, job title, department
- Company: name, domain, industry, employee count, revenue, headquarters
- Metadata: confidence score, sources used, last verified date
This cascade approach typically achieves 98% data accuracy - far higher than any single provider can deliver alone.
Step 4: Review Results and Export
Once processing completes, review your enriched data in the Cleanlist dashboard.
Key metrics to check:
- Match rate: What percentage of records found matches? (Aim for 80%+)
- Fill rate per field: How complete is each data point?
- Confidence scores: Higher scores = higher accuracy
Filter your results by confidence score. Records scoring 90+ are ready for outreach. Records below 70 may need manual verification.
Export options:
- Download as CSV (for spreadsheet analysis)
- Push to HubSpot, Salesforce, or Outreach via native integrations
- Access via API for custom workflows
Pro Tip
Set up a webhook to automatically push enriched records to your CRM as they complete. No manual exports needed.
Expected result: Enriched CSV with complete contact records, ready for your sales team.
Step 5: Automate with CRM Integration
Manual uploads work for one-off projects, but the real power comes from automation. Connect Cleanlist to your CRM for continuous enrichment.
HubSpot Integration:
- Go to Settings → Integrations → Connect HubSpot
- Authorize Cleanlist to access your contacts
- Configure triggers: "Enrich new contacts automatically" or "Enrich when contact enters [list/workflow]"
- Set enrichment rules: partial for all leads, full for accounts matching your ICP
Salesforce Integration:
- Install the Cleanlist managed package from AppExchange
- Configure field mappings in Setup → Cleanlist Settings
- Create a Process Builder flow to trigger enrichment on lead creation
- Map enriched fields back to standard or custom Salesforce fields
API Integration: For custom workflows, use the Cleanlist API directly:
POST /api/v1/enrich
{
"records": [
{ "name": "John Smith", "company": "Acme Corp" }
],
"type": "full"
}Response includes all 25+ fields with confidence scores and source metadata.
Expected result: New CRM records automatically enrich within minutes of creation.
Waterfall vs Single-Source: Cost & Coverage Comparison
The difference between waterfall and single-source enrichment isn't just theoretical — it shows up directly in your match rates, data quality, and cost per valid record.
| Metric | Single-Source (e.g., Apollo, ZoomInfo) | Waterfall Enrichment (Cleanlist) |
|---|---|---|
| Email match rate | 50-70% | 85-95% |
| Email deliverability | 78-85% | 95-98% |
| Phone coverage | 30-50% | 60-75% |
| Cost per lookup | $0.10-0.50 | $0.15-0.40 |
| Cost per VALID record | $0.15-1.00 | $0.16-0.47 |
| Data sources queried | 1 | 15+ |
| Coverage by geography | US-heavy | Global |
The key insight: single-source tools appear cheaper per lookup, but the cost per valid record is often higher because 30-50% of lookups return no data or undeliverable emails. With waterfall enrichment, you pay slightly more per query but get valid data 85-95% of the time.
For a team enriching 5,000 leads per month, the math looks like this:
- Single-source at $0.25/lookup: $1,250/month, but only 3,000-3,500 valid records = $0.36-0.42 per valid record
- Waterfall at $0.30/lookup: $1,500/month, but 4,250-4,750 valid records = $0.32-0.35 per valid record
Waterfall costs 20% more per query but delivers 35-40% more valid records — making it cheaper per actual usable result. Teams that switch from single-source to waterfall enrichment typically see ROI positive within the first month.
“We tested five single-source providers against Cleanlist's waterfall on the same 10,000-record dataset. The best single-source provider hit 67% match rate. Cleanlist hit 91%. That's 2,400 additional enriched records from the same input list — records that would have been dead leads without waterfall.”
Common Waterfall Enrichment Challenges & Solutions
While waterfall enrichment delivers significantly better results than single-source approaches, there are challenges to be aware of — and solutions for each.
Challenge 1: Data Conflicts Between Providers
When multiple providers return different values for the same field (e.g., different job titles or phone numbers), which one do you trust?
Solution: Cleanlist uses a confidence-scoring algorithm that considers data recency, provider accuracy history, and cross-validation across sources. If three providers agree on a job title and one disagrees, the majority wins. For fields like email, deliverability verification is the tiebreaker — the email that passes SMTP verification is the one that gets returned.
Challenge 2: Cost Optimization Across Providers
Not all providers cost the same. Querying premium sources for every record wastes budget when cheaper sources could have provided the answer.
Solution: Smart cascade ordering. Cleanlist routes records to cost-effective providers first, only escalating to premium sources when cheaper options fail. This reduces enrichment costs by 30-50% compared to routing all records through expensive providers.
Challenge 3: Latency at Scale
Querying 15+ providers in sequence takes longer than querying one. For real-time use cases (form enrichment, live lead routing), latency matters.
Solution: Parallel pre-fetching for high-priority fields. Cleanlist queries multiple providers simultaneously for time-critical fields like email and company data, while cascading sequentially for supplementary fields. Partial enrichment completes in under 5 seconds; full enrichment in under 30 seconds per record.
Challenge 4: Handling Catch-All Domains
Catch-all domains accept emails to any address, making SMTP verification unreliable. A waterfall might return an email that "passes" verification but never reaches a real inbox.
Solution: Cleanlist flags catch-all domains with a risk score based on domain reputation, historical deliverability data, and pattern analysis. You can choose to include or exclude catch-all results based on your risk tolerance.
“The biggest mistake teams make with waterfall enrichment is treating it as 'set and forget.' Your provider cascade should be reviewed quarterly — data provider quality shifts, pricing changes, and new providers emerge. The waterfall that was optimal six months ago may not be optimal today.”
Waterfall Enrichment ROI: What to Expect
Understanding the return on investment helps justify the switch from single-source to waterfall enrichment.
Based on testing across 50,000+ records from diverse B2B datasets. Individual results vary by industry and geography.
Source: Cleanlist Internal Benchmarks, 2026Measured as valid, deliverable records returned per dollar of enrichment spend across 200+ customer accounts.
Source: Cleanlist Customer Data, 2026Smart routing queries cost-effective providers first, escalating to premium sources only when needed.
Source: Cleanlist Platform AnalyticsMost teams see measurable ROI within the first enrichment batch:
- Week 1: Run a test batch of 1,000 records. Compare match rates and deliverability against your current provider.
- Month 1: Full CRM enrichment. Expect 20-40% more complete records than your previous tool delivered.
- Quarter 1: Measure downstream impact — email open rates, phone connect rates, and pipeline generated from enriched leads.
The teams that get the most value from waterfall enrichment combine it with ICP scoring to prioritize enriched leads and email verification to ensure deliverability before sending.
Verify Your Setup
Run these checks to confirm everything works:
- Test record: Add a new contact to your CRM with just name + company. Verify it enriches automatically within 5 minutes.
- Spot check accuracy: Pick 10 random enriched records. Manually verify emails (send a test) and phone numbers (call them).
- Check coverage: Review your match rate. Below 70%? Your input data may need cleanup.
Troubleshooting common issues:
| Problem | Solution |
|---|---|
| Low match rate | Add more input fields (LinkedIn URL, domain) |
| Missing phone numbers | Upgrade to full enrichment (11 credits) |
| Slow processing | Contact support - may indicate provider issues |
| Duplicate records | Enable deduplication in enrichment settings |
Next Steps
You've set up waterfall enrichment. Here's how to get more value from it:
- Build an ICP scoring model: Use enriched firmographics to score leads automatically with ICP Scoring
- Clean your existing database: Run enrichment on your full CRM to fill data gaps
- Set up enrichment triggers: Automate enrichment for webinar signups, demo requests, and inbound leads
Frequently Asked Questions
How long does waterfall enrichment take?
Most records enrich within 30 seconds. Large batches (10,000+ records) complete in 5-15 minutes depending on complexity. Full enrichment (with phone lookup) takes slightly longer than partial.
What if a record doesn't enrich?
The waterfall queries all 15+ sources before giving up. If no data is found, you're not charged for that credit. Common reasons for no match: personal email used as input, very small companies, or executives with limited online presence.
Is waterfall enrichment better than single-source providers?
Yes - by definition. Single providers have data gaps. Waterfall enrichment fills those gaps by combining the best data from multiple sources. Most customers see 20-30% higher fill rates compared to single-source tools.
How fresh is the enriched data?
We prioritize sources with recently verified data. Each record includes a "last verified" timestamp so you know exactly how fresh it is. For email deliverability, we run real-time verification before returning results.
Can I use waterfall enrichment for GDPR compliance?
Cleanlist pulls from commercially-licensed data sources that comply with GDPR and CCPA. However, you're responsible for having a lawful basis to process the contact data in your jurisdiction. Consult your legal team for specific compliance questions.
Is waterfall enrichment worth it for small teams?
Yes. Small teams actually benefit more because they can't afford wasted outreach on bad data. A 5-person sales team sending 500 emails per month with 85% accuracy (single-source) wastes 75 emails. With 98% accuracy (waterfall), that drops to 10. Those 65 saved emails represent real conversations that would have bounced. At $29/month for Cleanlist's Starter plan, the ROI is immediate.
How does waterfall enrichment handle international data?
Waterfall enrichment excels at international data because different providers have different regional strengths. Provider A might have the best US coverage, Provider B the best European data, and Provider C the best APAC coverage. By querying all providers in sequence, waterfall enrichment achieves strong global coverage that no single provider can match. Cleanlist's cascade is optimized by region — records with European domains are routed through European-strong providers first.
What is the difference between waterfall enrichment and Clay?
Clay is a workflow platform that lets you build custom waterfall enrichment tables. Cleanlist provides pre-built waterfall enrichment that works out of the box. Clay offers more customization but requires significant setup time and technical expertise. Cleanlist handles the cascade logic, provider management, and data normalization automatically. Choose Clay if you have an ops engineer to dedicate; choose Cleanlist if you want results in 60 seconds. See our detailed Cleanlist vs Clay comparison for a full breakdown.
How many providers for waterfall enrichment?
Most effective waterfall sequences use 5-10 data providers for the primary cascade, with additional specialized providers for specific data types. Beyond 10 providers, the incremental match rate improvement is typically less than 2% while costs continue to increase. Cleanlist connects to 15+ providers but intelligently routes each record to only the most relevant 3-7 based on data type and geography.
Can I build waterfall enrichment in-house?
You can, but most teams find it's not worth the engineering investment. Building a production-grade waterfall requires: API integrations with each provider (weeks of engineering), rate limit management, response normalization across different data formats, cost tracking, error handling, and ongoing maintenance as provider APIs change. Most companies spend $50-100K+ in engineering time to build what Cleanlist provides for $29-99/month.
Does waterfall enrichment work with my CRM?
Cleanlist's waterfall enrichment integrates natively with HubSpot, Salesforce, and Pipedrive, plus 10+ additional CRMs via UnifiedTo. You can also use the API for custom integrations, CSV import/export for manual workflows, and Zapier for connecting to tools we don't directly support. The integration pushes enriched data directly to your CRM fields in real-time.
What data points does waterfall enrichment return?
A full waterfall enrichment returns 25+ data points per record: verified work email, direct dial phone number, mobile number, current job title, department, seniority level, LinkedIn URL, company name, company domain, industry, employee count, revenue range, headquarters location, technology stack, and confidence scores for each field. Partial enrichment (1 credit) returns email, LinkedIn, and company data. Full enrichment (11 credits) adds phone numbers and extended firmographics.
Waterfall enrichment turns incomplete lead lists into actionable sales data. One API call, 15+ sources, one golden record. Start your free trial and see the difference complete data makes.