Learn · 6 MIN READ

How to Split Test Cold Email Campaigns to Hit 5%+ Reply Rates

The exact split-testing process BuzzLead uses to 3-5x cold email results — from audience sourcing in Clay to the one-sentence script that outperformed everything.

Troy Aitken
Published JAN 6, 2025

Most people treat split testing as something you do after you write your emails. That's backwards. The testing process starts before a single word of copy is written, and if you get the sequence wrong, you're optimizing noise instead of signal. Here's the exact process we use at BuzzLead to hit 5% reply rates or better, consistently, across clients doing $1k/month and clients doing seven to eight figures annually.

Start by Testing Audiences, Not Copy

The first variable to test isn't your subject line or your CTA. It's who you're talking to.

When a new client comes to us, the first question we ask is: who are your best clients right now? The ones who retain, look for upsells, and refer people without being asked. Once we have a name, we run a lookalike campaign. We package up that client's result as a case study, then find every company that looks like them and sell directly into that audience.

We do this inside Clay, paired with an Ocean.io API (we run Ocean directly rather than through Clay's native integration because it's cheaper). You plug in your best-fit client, set your target countries, company size ranges, any relevant keywords, and a minimum lookalike score. We typically use 0.80 as the floor. If you want tighter, higher-intent matches, push it to 0.85.

For a campaign we ran for C3 Presents, an entertainment brand client, this process pulled 3,900 similar businesses. From there, we used Clay to find the actual people at those companies by piping in the relevant job titles and scraping LinkedIn. That yielded around 6,200 leads for that campaign alone.

Waterfall Enrichment and Verification (Don't Skip This)

Getting a lead list is one thing. Getting verified, deliverable emails is another. We run a waterfall enrichment process inside Clay that checks multiple data providers in sequence, starting with the cheapest.

Our typical order: LeadMagic first, then Drop Contact, then FindThatEmail (which also double-verifies), and a few others after that. The final check is Million Verifier, which actually sends a test email to confirm the address is live. If it bounces back clean, we're good to send.

We also check the email service provider for each contact using EmailGuard's host lookup API. On one campaign we were running, we identified a deliverability problem specific to Outlook addresses. Because we had that data, we filtered down to Google and custom mail service providers only. That kind of filtering protects your sender reputation and keeps you out of spam folders before you've even written a word.

Test Verticals Before You Commit to One

Outside of lookalike campaigns, clients often want to test multiple industries at once. We worked with Bob at Life360, who was running a B2C age deceleration program and needed to take it B2B through partner channels. He came in with three verticals in mind. We added four more. Within 42 days of testing all seven, we had narrowed it to three that were consistently booking meetings. Bob generated $120,000 in new recurring revenue through those partner channels.

The lead sourcing for vertical testing works the same way: Apollo, LinkedIn Sales Navigator, or Clay's native search. Sales Navigator is worth the roughly $96/month if you're pulling lists three or four times a month or more. Filter by revenue, headcount, industry, and keywords, then pipe those lists directly into Clay for enrichment.


📥 Cold Email Swipe File

Steal the cold email templates our clients used to generate $8M+ in revenue.

Get it here →


How We Actually Structure the Script Tests

Once the audience is locked, we start writing. For a campaign we ran for Haul HQ, a localized marketing agency serving junk removal companies, we contacted around 7,700 leads over about two and a half months. Roughly 4% replied, with 58 positive replies. Here's how we structured the test.

We start optimizing a campaign around 200 to 300 sends per variant. Past 400 sends on a losing variant, you're burning leads. We ran multiple script variants and watched the early reply signals closely.

Our baseline script structure across all variants: 1. Call out a specific pain point the audience is actually feeling 2. Package the offer as a question to capture attention 3. Drop a case study from a similar client 4. Close with a low-friction CTA

We tested pain-point intros against compliment intros. We tested the offer framed as a statement versus a question. We tested direct booking CTAs versus softer "let me know if you're interested" CTAs. One small but consistent finding: never use the word "help." Use "partner with" or "collaborate with" instead. There's something about "help" that triggers resistance. We've seen it enough times that it's now a firm rule in our copy.

The Script Format That Outperformed Everything by 120%

After testing all the variants for Haul HQ, one format won by a wide margin. We call it the "skip the pitch" format. It goes straight to the result, framed as a question:

"If we could double or triple the lead flow to your junk removal business and add $100,000 to $150k in profit back to your business, just like we did for [Case Study 1] and [Case Study 2], would that be interesting?"

That's it. One sentence. It outperformed the next closest variant by roughly 120%.

What we've found repeatedly, especially with blue-collar and service-based businesses, is that shorter wins. These are people moving fast. They don't want to read about your process. They want to know the output. One or two sentences that state the result you've delivered for someone like them, framed as a question, is the highest-converting format we've found.

A close variation that also works well: lead with the case study as a standalone statement, then elaborate in a PS. Something like: "We helped Texas Junkers increase their lead flow by 2.7x in three months. Pretty confident we can do the same for you. Mind if I send over a bit more detail?" The PS section is where you expand. The body stays lean.

Key Takeaways

  • Split testing starts with your audience, not your copy. Run lookalike campaigns first using your best existing clients as the seed.

  • Use a waterfall enrichment process in Clay across multiple data providers, and always run a final verification step (we use Million Verifier) before sending.

  • Check the ESP of your contacts and filter out problematic providers if you're seeing deliverability issues.

  • Test multiple verticals early. You can narrow from seven to three in under 45 days if you're running clean tests.

  • Start optimizing at 200 to 300 sends per variant. Don't let a losing variant run past 400.

  • The "skip the pitch" format, one sentence framing your proven result as a question, consistently outperforms longer, structured scripts.

  • Never use the word "help." Use "partner with" or "collaborate with."

  • Simpler scripts win. One to two sentences beats five paragraphs, especially in service-based industries.

Frequently Asked Questions

When should I start optimizing a cold email variant? We start making optimization decisions at 200 to 300 sends per variant. If a script hasn't shown positive signals by 400 sends, we turn it off. Running a losing variant past that point wastes leads you can't get back.

What lookalike score should I use in Clay with Ocean.io? We use 0.80 as the standard minimum. If you want a tighter, higher-intent match, push it to 0.85. Going lower than 0.80 tends to bring in companies that don't closely resemble your best-fit client.

Why does the "skip the pitch" one-sentence format work so well? Service-based and blue-collar businesses move fast. They're not reading long emails. A single sentence that states a specific, proven result and asks if they want the same thing cuts through in a way that structured, multi-paragraph scripts don't. We've seen it outperform traditional formats by over 100% in multiple campaigns.

What's the waterfall enrichment process and why does it matter? It's a sequential verification flow that checks multiple email data providers one after another, starting with the cheapest. If one database doesn't have a valid email, it moves to the next. We end with Million Verifier as a final confirmation step. The goal is the highest possible email validity rate before you send a single message, which protects your sender reputation and keeps you out of spam.

LIKED THIS?

Get one outbound playbook in your inbox each Tuesday.

Ready?

Your pipeline, rebuilt.

20-minute strategy call. We'll audit your ICP, show you which signals we'd track, and map out exactly what the first 120 days would look like. No commitment, no pressure, no pitch deck.

TRUSTED BY 50+ B2B TEAMS ACROSS SAAS, AGENCIES, AND PROFESSIONAL SERVICES
Certified partnersClayEmailBisonCloseSmartleadHubSpot
© 2026 BuzzLead Corp. All rights reserved.
Made with vibes and sunshine in St. Pete Beach since 2022
contact@buzzlead.io