# PHT Master List Data Enrichment Strategy

**Current Status:** 2,159 unique companies  
**Goal:** Enrich all companies with verified contacts, room counts, and qualification data  
**Created:** 2026-02-11

---

## 🎯 ENRICHMENT PRIORITIES

### Priority 1: Contact Information (90% coverage target)
- Decision-maker emails (Plant Manager, VP Operations, Facilities Director)
- Direct phone numbers
- Contact names and titles

### Priority 2: Cold Room Verification (100% target)
- Confirm CA storage capability (Yes/No/Unknown)
- Number of CA rooms (critical for qualification - need 10+ for outreach, 50+ priority)
- Storage capacity (tons/bins if available)

### Priority 3: Company Intelligence (70% coverage target)
- Employee count (helps prioritize by size)
- Annual revenue estimate
- Specific fruit types (apples, citrus, kiwis, etc.)
- Current ethylene monitoring system (if any)

---

## 🛠️ ENRICHMENT TOOLS & METHODS

### Method 1: Clay.com Waterfall Enrichment ⭐ RECOMMENDED

**What it is:** Multi-provider waterfall that checks 50+ data sources sequentially until it finds the data  
**Best for:** Email + phone enrichment at scale  
**Coverage:** 85-95% email coverage, 60-70% phone coverage  

**How it works:**
1. Upload CSV with company domains to Clay
2. Set up waterfall: Prospeo → DropContact → Hunter → Apollo → Lusha → Snov.io → etc.
3. Clay tries each provider in order, stops when data found
4. Only pay for successful finds

**Pricing:**
- Starter: $149/mo (5,000 credits)
- Pro: $349/mo (20,000 credits)  
- Credits vary by provider (1-5 credits per lookup)
- **Estimated cost for 2,159 companies:** ~$200-300 (one-time)

**Pros:**
- Highest coverage (uses multiple sources)
- Only charges when data found
- AI-powered personalization fields included
- Great for finding decision-makers by title

**Cons:**
- Learning curve (worth it)
- Monthly subscription even for one-time use

**Setup:**
1. Sign up at clay.com
2. Import MASTER-LIST-FINAL.csv
3. Add "Find Work Email" waterfall enrichment
4. Filter by job title: "VP Operations", "Plant Manager", "Facilities Director", "General Manager"
5. Add "Find Phone Number" waterfall
6. Export enriched data

---

### Method 2: Hunter.io (Email Only)

**What it is:** Email finder with 100M+ business profiles  
**Best for:** Domain-based email discovery  
**Coverage:** 60-70% email coverage  

**How it works:**
1. Upload company domains
2. Search by domain OR by domain + name
3. Bulk email verification included

**Pricing:**
- Free: 25 searches/mo
- Starter: $49/mo (500 searches)
- Growth: $149/mo (5,000 searches)
- **API Key:** fda8536970076bc3228c5b5fa6e19fdc407c43c9 (already have!)

**Estimated cost for 2,159 companies:** $149/mo (5,000 searches covers it)

**Pros:**
- We already have an API key
- High accuracy (verified emails)
- Simple API integration
- Built-in email verification

**Cons:**
- Lower coverage than Clay waterfall
- No phone numbers
- Doesn't find people by job title (need names first)

**Best use case:** Secondary enrichment after Clay, or batch verification of found emails

---

### Method 3: AnyMailFinder (Budget Option)

**What it is:** Email enrichment with nominative search  
**Best for:** Finding emails when you have name + company  
**Coverage:** 40-60% for domain-only, 70-80% with names  

**Pricing:**
- Pay-as-you-go: $0.10/email found
- Enterprise bulk: 1 credit = 40 emails (for finding ALL emails at a company)
- **Estimated cost:** $100-200 for 2,000 emails

**Pros:**
- Cheapest option
- Good for nominative enrichment (name + domain)
- Enterprise bulk option for large companies

**Cons:**
- Lower coverage than Clay/Hunter
- Needs names first (not great for cold lists)
- No phone numbers

**Best use case:** Budget-friendly follow-up after LinkedIn scraping

---

### Method 4: Apollo.io (All-in-One)

**What it is:** B2B database + enrichment + outreach platform  
**Best for:** Finding contacts by title at specific companies  
**Coverage:** 70-80% email + phone  

**How it works:**
1. Upload company list to Apollo
2. Search for contacts by title within those companies
3. Export with verified emails + direct dials

**Pricing:**
- Free: 50 exports/mo
- Basic: $49/user/mo (1,200 exports/yr)
- Professional: $79/user/mo (12,000 exports/yr)
- **Current status:** We have free account (blocked by corporate email requirement for upgrade)

**Pros:**
- Huge B2B database (275M+ contacts)
- Phone numbers included
- Can search by job title within companies
- Intent data and technographics

**Cons:**
- Need corporate email to unlock higher tiers
- Lower accuracy than Hunter for emails
- Expensive at scale

**Workaround:** Use free tier for top 50 priority companies first

---

### Method 5: RocketReach

**What it is:** Contact finder for executives  
**Best for:** Hard-to-find decision-makers  
**Coverage:** 60-70% email, 40-50% phone  

**Pricing:**
- Essentials: $53/mo (170 lookups)
- Pro: $105/mo (375 lookups)
- Ultimate: $249/mo (1,000 lookups)

**Pros:**
- Good for C-level contacts
- Chrome extension for LinkedIn

**Cons:**
- Expensive per lookup
- Lower coverage than Clay waterfall

**Best use case:** Finding elusive contacts at top 100 target companies after other methods fail

---

### Method 6: LinkedIn Sales Navigator + Manual Export

**What it is:** LinkedIn premium search + manual/semi-automated scraping  
**Best for:** Finding decision-makers by title  
**Coverage:** Nearly 100% for names/titles, then enrich emails separately  

**How it works:**
1. Search Sales Navigator for titles at each company
2. Export to CSV (manually or with tools like Vain, Phantombuster, Waalaxy)
3. Feed names + companies into Hunter/Apollo/Clay for email enrichment

**Pricing:**
- Sales Nav Core: $99/mo (2,500 searches)
- Scraper tools: $50-100/mo (Phantombuster, Waalaxy)

**Pros:**
- Most accurate for finding the RIGHT people
- Can filter by seniority, function, location
- LinkedIn profiles have bio/background context

**Cons:**
- Two-step process (find → enrich)
- LinkedIn scraping ToS gray area
- Time-intensive

**Best use case:** Building targeted contact lists for top 200 priority companies

---

### Method 7: Manual Website Research + ZoomInfo/Lusha Chrome Extension

**What it is:** Visit company websites, use browser extension to capture contacts  
**Best for:** Small batches of high-value targets  
**Coverage:** 40-60% (depends on website quality)  

**How it works:**
1. Visit company "About" or "Contact" pages
2. Use ZoomInfo/Lusha extension to reveal emails
3. Manually record in spreadsheet

**Pricing:**
- ZoomInfo: Custom (expensive, $15k+/year)
- Lusha: $29/mo (50 credits), $99/mo (200 credits)

**Pros:**
- Often finds the BEST contact (not just any contact)
- Can read company context while researching

**Cons:**
- Extremely time-intensive
- Not scalable to 2,000+ companies

**Best use case:** Final pass on top 50 dream prospects before outreach

---

### Method 8: Cold Room Verification via Website + AI

**What it is:** Visit company websites, use AI to extract CA room details  
**Best for:** Qualifying facilities before contact enrichment  
**Coverage:** 30-50% have CA details on website  

**How it works:**
1. Use web scraping or manual browsing
2. Look for "Services", "Facilities", "Storage" pages
3. Use Claude/ChatGPT to extract: # of CA rooms, capacity, fruit types
4. Flag "Qualified" (10+ rooms) vs "Not Qualified" vs "Unknown"

**Tools:**
- Apify Website Scraper: $5-10 to scrape 2,000 sites
- Claude API: $0.01-0.02 per website analysis
- **Total estimated cost:** $50-100

**Pros:**
- Qualifies BEFORE spending on contact enrichment
- Finds detailed facility info (capacity, fruit types, tech)
- Can discover competitor systems in use

**Cons:**
- Many sites don't list CA room details
- Requires some manual review/verification

**Recommended workflow:**
1. Scrape all 2,159 websites with Apify
2. Feed HTML to Claude to extract CA room data
3. Flag qualified companies (10+ rooms) for priority enrichment
4. Enrich qualified companies first (maybe 500-800 companies)

---

## 📊 RECOMMENDED ENRICHMENT WORKFLOW

### Phase 1: Qualify (Week 1) - $100 budget
1. **Scrape all 2,159 websites** with Apify ($10)
2. **AI extract CA room data** with Claude API ($50)
3. **Flag qualified** companies (10+ CA rooms, target fruits)
4. **Result:** ~500-800 qualified companies ready for contact enrichment

### Phase 2: Enrich Qualified Companies (Week 2) - $300 budget
1. **Upload qualified list to Clay** ($150-200 for Pro plan)
2. **Run waterfall enrichment:**
   - Find emails by title (Plant Manager, VP Ops, Facilities Director)
   - Find direct phone numbers
3. **Verify emails** with Hunter.io API (already have key)
4. **Result:** ~400-600 companies with verified decision-maker contacts

### Phase 3: Top 100 Deep Dive (Week 3) - $100 budget
1. **LinkedIn Sales Nav search** for top 100 dream accounts ($99/mo)
2. **Manual website research** for missing contacts
3. **RocketReach** for hard-to-find executives (50 lookups)
4. **Result:** Top 100 have multiple contacts, detailed intel, ready for personalized outreach

### Phase 4: Remaining Companies (Week 4) - $200 budget
1. **Run remaining ~1,600 companies** through Hunter.io bulk ($149)
2. **Accept lower coverage** (60-70%) for non-priority companies
3. **Flag "No Contact Found"** for future enrichment
4. **Result:** Full list enriched to max coverage within budget

---

## 💰 TOTAL ESTIMATED COSTS

| Method | Companies Covered | Cost | Coverage |
|--------|------------------|------|----------|
| **Phase 1: Qualification** | 2,159 | $100 | 100% sites scraped, 30-50% CA data found |
| **Phase 2: Clay Waterfall** | 500-800 qualified | $200 | 85-95% emails, 60-70% phones |
| **Phase 3: Top 100 Deep Dive** | 100 priority | $150 | 95%+ emails, 80%+ phones |
| **Phase 4: Bulk Hunter** | 1,200-1,600 remaining | $150 | 60-70% emails |
| **TOTAL** | 2,159 | **$600** | **~75% overall enrichment** |

**Alternative Budget Option:** $200 total
- Skip Clay, use only Hunter.io ($149) for all companies
- Manual qualification (skip AI website scraping)
- Coverage drops to 60-70%, but still gets 1,200-1,500 contacts

---

## 🚀 QUICK START: Get 100 Contacts This Week

**Goal:** Prove the system with 100 enriched contacts  
**Budget:** $50  
**Time:** 2-3 hours  

1. **Pick top 100 US companies** from MASTER-LIST-FINAL.csv (apples, citrus, 50+ rooms if known)
2. **Manual website check** (30 min) - verify they have CA storage, note # of rooms
3. **Upload to Hunter.io** (we have API key) - find emails
4. **Supplement with Apollo free tier** (50 lookups) - add phones for top 50
5. **Result:** 60-70 verified contacts ready for outreach testing

**Next decision:** If this works, scale to full enrichment (Phases 1-4 above)

---

## 📝 NOTES

- **Clay vs Hunter:** Clay = higher coverage but monthly cost, Hunter = lower coverage but we already have it
- **Qualification first:** Don't enrich companies without CA storage (waste of credits)
- **Waterfall approach:** Use multiple tools in sequence for best coverage
- **Verification matters:** Always verify emails before cold outreach (avoid bounces)

---

## ✅ ACTION ITEMS FOR JONNY

1. **Decision:** Full enrichment ($600, 4 weeks) or Quick Start ($50, 1 week)?
2. **If Quick Start:** Pick which 100 companies to start with (I can pull top US apples/citrus)
3. **If Full:** Approve budget and I'll set up Clay + start Phase 1 qualification
4. **Either way:** Confirm we can use Hunter API key (fda8536970076bc3228c5b5fa6e19fdc407c43c9)

**Your call!** 🚀
