# 🚀 Contact Enrichment - Quick Start Guide

## Current Status
✅ **Batch 1 Complete:** 10/200 facilities enriched (5%)  
📊 **Contacts Found:** 65+ decision-makers  
📁 **Output File:** `enriched-top-200-contacts.csv`

---

## Next Steps - 3 Options

### Option A: Manual Batch-by-Batch (Recommended for Quality)
Process batches incrementally with review between each:

```bash
cd /Users/max/.openclaw/workspace/postharvest

# Process batch 2 (facilities 11-20)
node auto-researcher.js
./batch-enricher.sh

# Review results, compile, then continue to batch 3...
```

**Time:** ~30 min per batch, so ~9.5 hours total for the 19 remaining batches (190 facilities)  
**Quality:** High - manual review catches edge cases  
**Best for:** Critical contacts, high-value targets

---

### Option B: Automated Bulk Processing (Fastest)
Run all remaining batches automatically:

```bash
cd /Users/max/.openclaw/workspace/postharvest

# Process batches 2-20 (facilities 11-200) automatically
./process-all-batches.sh 2 20

# Wait ~2-3 hours (includes API rate limits)
# Results saved to: batch-results/
```

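**Time:** 2-3 hours (mostly unattended); on the free tier the 190 lookups exceed the ~140 requests left today, so the run may have to resume after the quota resets  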
**Quality:** Good - automated extraction, requires manual cleanup  
**Best for:** Bulk data collection, then manual refinement
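
The batch numbers passed to `process-all-batches.sh` map onto facility ranges in a simple way. The sketch below only illustrates that mapping, assuming 10 facilities per batch (as in batch 1 covering 1-10 and batch 2 covering 11-20); `batch_range` is a hypothetical helper and not part of the existing scripts.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the existing scripts).
# Assumption: every batch holds 10 facilities, as batches 1 and 2 do.
BATCH_SIZE=10

# Print the first and last facility number covered by a batch.
batch_range() {
  local batch=$1
  local first=$(( (batch - 1) * BATCH_SIZE + 1 ))
  local last=$(( batch * BATCH_SIZE ))
  echo "${first}-${last}"
}

# Show what "2 20" covers: the first and last batch of the run.
echo "batch 2:  facilities $(batch_range 2)"
echo "batch 20: facilities $(batch_range 20)"
```

Running it should confirm that batches 2 through 20 span facilities 11 through 200.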

---

### Option C: Hybrid Approach (Balanced)
Automate data collection, then enrich manually:

```bash
# 1. Run automated Hunter.io searches for the remaining 190 facilities
./process-all-batches.sh 2 20

# 2. Review results and identify gaps
# 3. Manual LinkedIn searches for missing roles:
#    - Operations Managers
#    - Quality Managers  
#    - CEOs/GMs

# 4. Compile final CSV with all contacts
```

**Time:** 3-4 hours total  
**Quality:** Best - combines automation + human judgment  
**Best for:** Comprehensive coverage with validation

---

## Files Reference

| File | Purpose | Status |
|------|---------|--------|
| `enriched-top-200-contacts.csv` | **Main deliverable** | ✅ 10/200 |
| `batch-1-enriched.csv` | Batch 1 results | ✅ Complete |
| `ENRICHMENT-STATUS.md` | Detailed progress report | ✅ Current |
| `batch-enricher.sh` | Hunter.io automation | ✅ Tested |
| `process-all-batches.sh` | Master automation script | ✅ Ready |
| `top-200-prioritized.csv` | Source list (sorted) | ✅ Ready |

---

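## Hunter.io API Key
Keep the key out of this file and load it from the environment, for example from a `HUNTER_API_KEY` variable:
```
export HUNTER_API_KEY="<your-hunter-api-key>"
```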

**Limits:**
- 10 emails per domain search
- ~150 requests/day on free tier
- 2-second delay between requests (built into scripts)

**Usage so far:** 10 requests (10 facilities)  
**Remaining today:** ~140 requests (140 facilities possible)
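
As a rough planning aid, the quota figures above can be turned into a day count. This is a minimal sketch that assumes one Hunter.io request per facility (as in batch 1) and treats the ~150/day limit as exact; the function names are illustrative only.

```bash
#!/usr/bin/env bash
# Figures taken from this guide; the free-tier limit is approximate.
DAILY_LIMIT=150
USED_TODAY=10
DELAY_SECONDS=2

# Requests still available today.
remaining_today() {
  echo $(( DAILY_LIMIT - USED_TODAY ))
}

# Days needed for N lookups: today's leftover quota first, then full days.
days_needed() {
  local lookups=$1
  local today
  today=$(remaining_today)
  if [ "$lookups" -le "$today" ]; then
    echo 1
  else
    echo $(( 1 + (lookups - today + DAILY_LIMIT - 1) / DAILY_LIMIT ))
  fi
}

# Minimum time spent in the built-in delay between requests, in seconds.
delay_seconds() {
  echo $(( $1 * DELAY_SECONDS ))
}

echo "Remaining today: $(remaining_today)"
echo "Days needed for 190 lookups: $(days_needed 190)"
echo "Delay overhead for 190 lookups: $(delay_seconds 190)s"
```

Under these assumptions the 190 remaining facilities do not fit into today's quota, so a full run would spill into a second day.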

---

## Research Workflow (Per Facility)

1. **Hunter.io** → Find emails automatically  
2. **LinkedIn** → Validate names, find missing roles  
3. **Google** → Company-specific searches  
4. **Company Website** → Confirm phone numbers  
5. **Industry Directories** → Fallback for hard-to-find contacts

---

## Target Contacts (Priority Order)

1. ✅ **Operations Manager** (primary target)
2. ✅ **Quality Manager / Food Safety** (primary target)  
3. ✅ **General Manager**
4. ✅ **CEO** (especially family-owned facilities)
5. 📋 Sales/Business Development (bonus)
6. 📋 HR/CFO (bonus)

**Goal:** At least 3-5 contacts per facility

---

## Common Email Patterns Discovered

```
{first}{last_initial}@domain.com    (e.g., jimh@firstfruits.com)
{first}.{last}@domain.com            (e.g., erik.ware@stemilt.com)
{first_initial}{last}@domain.com     (e.g., jshort@superfreshgrowers.com)
{first}{last}@domain.com             (e.g., jeannetteedgerton@nashproduce.com)
```

Hunter.io reports the detected pattern for each domain in its domain search results.
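
When Hunter.io returns no pattern, the four formats above can be expanded by hand for a known name. The sketch below is one way to do that; `email_candidates` is a hypothetical helper, the name and domain are placeholders, and the third pattern is read as first initial plus last name (which is what the `jshort` example shows). The output is a list of guesses to verify, not confirmed addresses.

```bash
#!/usr/bin/env bash
# Hypothetical helper: prints one candidate address per pattern in this guide.
# Usage: email_candidates FIRST LAST DOMAIN
email_candidates() {
  local first last domain first_initial last_initial
  # Lowercase the name parts so the output matches typical address casing.
  first=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  last=$(printf '%s' "$2" | tr '[:upper:]' '[:lower:]')
  domain=$3
  first_initial=$(printf '%s' "$first" | cut -c1)
  last_initial=$(printf '%s' "$last" | cut -c1)

  printf '%s\n' \
    "${first}${last_initial}@${domain}" \
    "${first}.${last}@${domain}" \
    "${first_initial}${last}@${domain}" \
    "${first}${last}@${domain}"
}

# Placeholder name and domain, for illustration only.
email_candidates Jane Doe example.com
```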

---

## Troubleshooting

### No Hunter.io results?
→ Try RocketReach, ZoomInfo, or direct website scraping  
→ Call facility and ask for operations/quality manager

### Website down or outdated?
→ Use Wayback Machine: `web.archive.org`  
→ Check LinkedIn company page for employee list

### Can't find specific role?
→ Search Google for LinkedIn profiles: `site:linkedin.com "{Company}" {role}`  
→ Check industry directories (GCCA, IARW)  
→ Call and ask reception
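
To avoid retyping the search string for each role, the query can be generated. This is a small sketch; `role_query` is a hypothetical helper, the company name is a placeholder, and the role list is taken from the priority targets in this guide.

```bash
#!/usr/bin/env bash
# Hypothetical helper: fills the site:linkedin.com search pattern.
# Usage: role_query "Company Name" "Role"
role_query() {
  printf 'site:linkedin.com "%s" %s\n' "$1" "$2"
}

# Print one query per priority role for a placeholder company.
for role in "Operations Manager" "Quality Manager" "General Manager" "CEO"; do
  role_query "Example Cold Storage" "$role"
done
```

Each printed line can be pasted into a search engine as-is.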

### API rate limit hit?
→ Wait 24 hours for reset  
→ Use alternative email finders  
→ Continue with manual LinkedIn research

---

## Quick Commands

```bash
# Check progress
wc -l < enriched-top-200-contacts.csv  # Should be 11 (10 + header)

# Process next batch
./batch-enricher.sh

# Run full automation (batches 2-20)
./process-all-batches.sh 2 20

# View Hunter.io results for a domain (assumes HUNTER_API_KEY is exported in your shell)
curl -s "https://api.hunter.io/v2/domain-search?domain=DOMAIN.com&api_key=${HUNTER_API_KEY}&limit=10" | jq .

# Count total contacts found
jq '[.[] | .contacts | length] | add' enriched-data.json
```

---

## Success Metrics

📊 **Current:** 6.5 contacts/facility average  
🎯 **Target:** 3-5 contacts/facility  
🎉 **Performance:** 30% above the top of the target range (6.5 vs. 5)  

**Projected Final Count:** 1,300+ contacts from 200 facilities
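
The figures above follow from the batch 1 totals. The sketch below reproduces the arithmetic, assuming the batch 1 rate (65 contacts across 10 facilities) holds for all 200 facilities and measuring against the upper end of the target range (5); the function names are illustrative.

```bash
#!/usr/bin/env bash
# Inputs from this guide: 65 contacts, 10 facilities enriched, 200 total.
CONTACTS=65
ENRICHED=10
TOTAL=200
TARGET_MAX=5   # upper end of the 3-5 contacts/facility target

# Average contacts per enriched facility, one decimal place.
avg_per_facility() {
  awk -v c="$1" -v f="$2" 'BEGIN { printf "%.1f\n", c / f }'
}

# Projected contact count if the current average holds for every facility.
projected_total() {
  awk -v c="$1" -v f="$2" -v t="$3" 'BEGIN { printf "%.0f\n", c / f * t }'
}

# Percentage by which the average exceeds the upper target.
pct_over_target() {
  awk -v c="$1" -v f="$2" -v m="$3" 'BEGIN { printf "%.0f\n", (c / f / m - 1) * 100 }'
}

echo "Average:   $(avg_per_facility "$CONTACTS" "$ENRICHED") contacts/facility"
echo "Projected: $(projected_total "$CONTACTS" "$ENRICHED" "$TOTAL") contacts"
echo "Over target: $(pct_over_target "$CONTACTS" "$ENRICHED" "$TARGET_MAX")%"
```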

---

## Need Help?

- **Hunter.io Docs:** https://hunter.io/api-documentation  
- **LinkedIn Search Syntax:** In a search engine, use `site:linkedin.com "Company" role`  
- **Email Pattern Detection:** Hunter.io provides this automatically  
- **Batch Results:** Check `batch-results/` directory

---

**Last Updated:** 2026-02-11  
**Status:** ✅ Batch 1 complete, ready for scaling  
**Next Action:** Choose Option A, B, or C above and execute 🚀
