“You should replace humans with algorithms whenever possible. Even when the algorithm does not do very well, humans do so poorly and are so noisy that, just by removing the noise, you can do better than people.”

Everyone says they can “spot” talent. Then the quarter ends and we’re left explaining miss-hires that should’ve been obvious. The truth is most hiring loops still run on unstructured chats and gut feel. A 21-year-old Daniel Kahneman found exactly this same pattern happening in the Israeli Army in 1955. 

His fix was simple and powerful: define a handful of traits, ask standardized questions, score each trait separately, then only at the end add a touch of human judgment. He replaced intuitive interviews with a structured, six-dimension evaluation system.

That disciplined approach outperformed instinct for 45 years.

Modern B2B sales teams face the same problem. Territories, long cycles, and big quotas mean a single wrong hire ripples across the entire number. So instead of “he seems sharp,” we need to score behavior we can actually see, and we need a way to keep doing that after we hire, in weekly coaching, not once a year at performance review time.

The $2.7 million problem

The cost of a bad sales hire extends beyond their salary. Research shows that by the time you add missed quota, strained customer relationships, idle territory, manager time, and the hit to team morale, the true cost can run to ~$2.7 million. 

And yet most teams still default to the same shaky process Kahneman called out in 1955: unstructured chats, gut impressions, and bias creeping in at every turn.

Why “gut feel” fails (and what Kahneman did instead)

The Israeli military ran into the same problem most hiring loops still have: interviews that felt smart but predicted almost nothing. The standard was a 15–20 minute unstructured conversation. Interviewers walked away with big, global impressions—based on whatever they found interesting—not on what actually predicted success.

The culprit was human bias sneaking in from every angle:

  • Halo effect: One strong trait colors everything else

  • Recency bias: The last five minutes outweigh the full picture

  • Confirmation bias: We hunt for evidence that fits our first take

  • Similarity bias: We favor people who feel like “us”

Kahneman’s fix was elegantly simple: structure the evaluation.

  1. Define a small set of independent traits that matter for the role

  2. Ask standard, factual questions for each trait

  3. Score traits separately (no halo carryover)

  4. Sum the scores for an objective baseline

  5. Only then add disciplined intuition on top

Prediction jumped, and the method stuck for decades. 

For sales leaders, this means we have to stop judging the whole candidate at once, and instead score the parts that matter (with evidence) and make our “read” the last step, not the first.

How AI makes it even better

Kahneman’s system beat gut feel, but it still relied on humans – and all the limits that come with us: unconscious bias, fatigue, shaky memory, and no real way to scale. 

AI closes those gaps. It scores with the same criteria every time, surfaces patterns across hundreds of calls, anchors judgments to receipts (quotes, timestamps), and gets better as more data flows through it. In practice, that means moving from “moderately useful” prediction to consistently evidence-based evaluation with far less bias and far more nuance.

Why curiosity and coachability matter more than ever

In the AI era, information is cheap—questions and adaptation are the edge. 

Curious reps will use AI to dig deeper, connect dots, and ask the 20th question that unlocks the deal. 

Coachable reps turn AI’s feedback (and their manager’s) into new behavior on the very next call. 

Think of AI as a GPS: It can show you the best route, warn about obstacles, and recalculate when conditions change. But just like a GPS, it only works if you're willing to follow its guidance. The curious rep asks "Why is it suggesting this route?" The coachable rep says "Let me try this new approach."

In essence, AI handles the "what" (data, analysis, patterns) so humans can focus on the uniquely human "how" (building trust, showing empathy, creative problem-solving). But this only works for those curious enough to leverage AI's insights and coachable enough to act on them.

The modern sales challenge: beyond "gut feel"

B2B selling is harder and noisier than ever: buying committees now include 6–8 stakeholders and cycles run 6–12 months while offerings look increasingly similar and buyers are over halfway through their research before they talk to you. 

In that world, the gap between an average rep and a top performer is huge: The top 20% still drive the majority of revenue, high-EQ sellers close 40–50% more, and real business acumen lifts win rates by ~30%.

Yet many teams still evaluate talent the old way: using chemistry over evidence, activity over outcomes, stories over patterns, and our biases (halo, recency, similarity) sneak in at every turn. 

The idea: Score the traits, not the vibes

We need to trade “I have a good feeling” for structured, evidence-based assessment.

Replace good first impressions with a simple rubric that forces you to collect observable evidence for each trait. Keep it light, repeatable, and rooted in reality. Use call snippets, emails, meeting notes, manager observations.

For sales reps, the COACH framework evaluates the five dimensions proven to predict sales success:

COACH (SDR/AE)

C - Curiosity

  • Top performers score extremely high on intellectual curiosity

  • Drives deeper discovery and better solutions

  • Observable through questioning quality and research depth

O - Ownership

  • Conscientiousness is the #1 predictor of performance across all roles

  • 85% of top sellers demonstrate high ownership

  • Measured through follow-through and accountability

A - Adaptability (Coachability)

  • Coachability is the gateway to all improvement—without it, development stops

  • Top performers actively seek feedback and implement it rapidly

  • Observable through response to coaching, note-taking, and behavioral change

  • Coachable reps improve 2-3x faster than resistant ones

  • Critical for AI adoption: Only coachable reps benefit from AI's continuous coaching insights

C - Critical Thinking

  • High cognitive ability creates 4x better performance prediction

  • Essential for consultative selling

  • Demonstrated through strategic account planning

H - Human Connection

  • High-EQ reps are 40-50% more likely to close deals

  • Not about being liked, but building credibility and trust

  • Shown through multi-threading and stakeholder navigation

For account managers and customer success, SERVE evaluates the five dimensions critical for retention and growth:

SERVE (AM / CS)

S - Seeking Knowledge

  • Active listening is the #1 AM skill

  • Drives proactive value identification

  • Prevents reactive firefighting mode

E - Execution Excellence

  • 5% retention improvement = 25-95% profit increase

  • Critical for managing 20-50+ account portfolios

  • Measured through CRM discipline and follow-through

R - Responsive Adaptability

  • Adaptable AMs are 2x more likely to retain at-risk accounts

  • Essential for changing customer needs

  • Shown through process adoption and feedback implementation

V - Value Communication

  • 94% higher renewal rates with value conversations

  • Shifts AM from cost center to revenue generator

  • Demonstrated through ROI articulation

E - Empathetic Partnership

  • Multi-threading reduces churn risk by 50%

  • Builds resilient account relationships

  • Evident in stakeholder mapping and difficult conversations

The point isn’t to turn managers into auditors; it’s to remove the biggest bias traps, like halo effects, recency, and “reminds me of me.” Score each trait independently, collect receipts (timestamps and quotes), and only then look across the whole picture.

The five-minute weekly check

This works best as a tiny, consistent habit. Pick one dimension or trait each week:

  1. Observe specific behaviors during recent calls and coaching sessions

  2. Check yes/no boxes for each dimension. Look for 3 specific behaviors as evidence of each trait. 

  3. Calculate simple scores (# of checks out of 3)

  4. Track patterns over time (4+ weeks for reliability)

  5. Make data-driven decisions based on trends. Log one improvement commitment for the next conversation. 

Here’s an example: 

Trait = Curiosity

  • Asked layered how/why? Yes

  • Brought pre-call research? No

  • Synthesized inputs into a hypothesis? Yes
    Score: 2/3 (link two call timestamps as evidence)

Don’t score everything at once. Work one trait at a time, finish it, then move on. For each trait, attach specific evidence (a timestamp, quote, or snippet). Hold off on any “overall take” until you’ve evaluated all traits, and stick to observable behaviors, not personality. That simple sequence blocks bias and keeps the rubric honest. If you’re using an assistant on top of call transcripts, it can enforce the order and collect the receipts automatically—so one strong moment doesn’t inflate every score.

Rotate traits over a month and patterns should appear: strengths that deserve amplification, gaps that need coaching, and, when necessary, objective evidence for tough decisions.

This approach reduces evaluation time by 70% while improving consistency by 85%.

Where AI helps (and where it doesn’t)

Think of AI as the junior analyst who never gets tired. It pulls evidence on request (“Find moments the rep quantified ROI”), spots changes over time (“Compare objection handling in June vs. July”), and produces a clean one-pager for you with the reps’ strengths, gaps, and links to the exact moments. Research from MIT shows that AI-assisted evaluation reduces human bias by up to 85%. 

What it won’t do is understand your strategy, your culture, or your trade-offs. That’s your job. Combined you get consistent, citation-backed inputs from AI and your (or your managers’) contextual, human judgment on top.

What “good” looks like in practice

Within a few weeks of running COACH/SERVE, coachable behavior is the first thing to move. You’ll hear reps adopt stronger discovery, cleaner value stories, and more deliberate multi-threading. Managers stop debating the “potential” of a rep and start discussing what changed in the field. The conversation becomes: here’s the clip, here’s the score, here’s what we try next.

Want the COACH/SERVE rubric and a prompt to score calls in five minutes? Get in touch and I’ll send the template.

About the Framework: The COACH and SERVE frameworks were developed through extensive research including The Challenger Sale study, Five-Factor personality modeling, and performance data from thousands of B2B sales professionals. They represent a practical application of Nobel Prize-winning behavioral science to the modern sales environment, enhanced by cutting-edge AI technology to achieve unprecedented accuracy and objectivity. Most importantly, they identify the human traits that matter most as AI transforms the profession.

Keep Reading

No posts found