How We Handle 40,000+ Cold Emails a Month Without Losing Our Minds
Feb 18, 2026
Mahdin M Zahere
The middle school version: Imagine you send 40,000 letters to people you want to talk to. Some write back saying "yes, tell me more!" Others say "no thanks." And some just ask random questions. Now imagine a super-smart robot that reads every single reply, figures out who's actually interested, looks up info about them online, sends the hot leads to the right salesperson, and even writes follow-up messages — all in seconds. That's what Surface does for us.
The scale problem
At 40,000 outbound emails per month, we generate a lot of replies. Good replies, bad replies, out-of-office replies, angry replies, confused replies, and everything in between. At this volume, having a human read and categorize every response is impossible. But treating them all the same — or worse, letting them pile up in an inbox — means missing the 3–5% that are genuinely interested buyers.
Before Surface, our outbound replies went into a shared inbox. Someone would scan through them each morning, try to identify the interested ones, manually update the CRM, and forward hot leads to the right rep. It took 2–3 hours per day. Responses from overnight sat until the morning review. And the categorization was inconsistent — one person's "maybe interested" was another person's "not a fit."
How Surface handles it
Every outbound reply now flows through Surface automatically. Here's the pipeline:
Step 1: Auto-tagging. Surface reads every reply and classifies it into one of five categories: interested, not interested, out of office, wrong person, or question. The classification happens in seconds and is accurate enough that we only spot-check 10–15 per week for quality control.
Step 2: Enrichment. For every reply tagged "interested," Surface immediately enriches the lead — company size, industry, revenue, tech stack, funding status. This data was available before, but it used to happen hours later after someone manually triggered it. Now it happens the moment the reply is classified.
Step 3: Qualification. The enriched data gets evaluated against our ICP criteria. Does this company match our target segment? Is the contact a decision-maker? Is the company the right size? Leads that pass qualification get a score; leads that don't get tagged for nurture or deprioritized.
Step 4: Smart routing. Qualified leads are routed to the right rep based on territory, deal size, and product interest — the same routing logic we use for inbound. The rep doesn't get a generic notification. They get the full context: the original email, the reply, enrichment data, qualification score, and the suggested next action.
Step 5: Automated follow-up. Different reply types trigger different sequences. "Interested" replies get an immediate, personalized follow-up from the assigned rep. "Question" replies get an AI-drafted answer that the rep can review and send. "Not interested" replies get a graceful exit. "Out of office" replies get a scheduled follow-up for when the person returns. "Wrong person" replies trigger a lookup for the right contact at that company.
What changed
Metric | Before Surface | After Surface |
|---|---|---|
Time to process daily replies | 2–3 hours (manual) | ~0 (automated) |
Average response to interested reply | 4–6 hours | Under 10 minutes |
Reply classification accuracy | Inconsistent (human variation) | 94% automated accuracy |
Interested replies that got lost | ~15% (buried in volume) | Under 2% |
Response-to-meeting conversion rate | 8% | 19% |
The biggest impact isn't time savings — it's the replies we used to miss. At 40,000 emails per month, even a 3% interested rate is 1,200 potential conversations. Losing 15% of those to slow processing or human error meant 180 opportunities per month slipping through the cracks. That's pipeline we were generating and then immediately losing.
What we learned
Volume outbound only works if the response handling is as automated as the sending. Most teams invest heavily in the send side — deliverability, personalization, sequence design — and then process replies manually. That's like building a highway that ends in a dirt road.
The reply is the highest-intent signal in outbound. Someone who took the time to write back "tell me more" is warmer than almost any inbound lead from a content download. Treating that reply with the same speed and precision as a demo request is the unlock most outbound teams are missing.
See how Surface can turn your cold email chaos into a clean pipeline.


