← Back to all articles
Challenges

Story Points Are Not Time Estimates (And Why It Matters)

By Marc Molas·August 21, 2023·9 min read

Every few months, the same scene plays out. A product manager walks into a planning meeting, the team assigns story points to a set of tickets, and then someone pulls out a spreadsheet converting those points into hours so they can promise a delivery date to stakeholders.

And just like that, the entire purpose of story points collapses.

I've watched this happen at startups, at scale-ups, and at companies that should know better. The root cause is always the same: a fundamental misunderstanding of what story points are supposed to measure — and a management culture that can't resist translating everything into a timeline.

What Story Points Actually Measure

Story points are a unit of relative complexity. They compare the effort of one piece of work to another, factoring in complexity, uncertainty, and risk — not clock hours.

When a team says "this is a 5 and that's a 2," they're saying the first task is roughly 2.5 times more complex than the second. They're not saying the first task takes 5 hours or 5 days. The number is relational, not absolute.

This distinction matters because:

  • Two developers might complete the same 5-point story in very different amounts of time. One knows the codebase better. The other is faster at debugging. The complexity of the problem is the same for both.
  • A 5-point story this sprint might take longer than a 5-point story last sprint. Context switches, production incidents, team changes — all of these affect duration but not complexity.
  • Complexity is estimable; duration is not. Engineers are reasonably good at comparing the difficulty of two tasks. They're terrible at predicting how many hours something will take. Story points lean into the strength and avoid the weakness.

The moment you start saying "1 point = half a day," you've abandoned relative estimation and you're back to time estimates with extra steps.

Why Managers Convert Points to Time (And the Damage It Does)

I get it. If you're a product manager or a CEO, you need to answer "when will this be done?" That's a legitimate question. The problem is the shortcut: taking story points and dividing by a "velocity" to produce a date.

Here's what actually happens when you do this:

Teams start gaming the numbers. If management treats points as time, engineers learn to inflate estimates so they don't get pressured for "committing" to too many points. What was a 3 becomes a 5. What was a 5 becomes an 8. The numbers stop meaning anything.

Velocity becomes a performance metric. Instead of being a planning tool, velocity turns into a scorecard. Teams that "deliver" more points are rewarded. Teams start splitting stories artificially or padding estimates to look more productive. You're now measuring the wrong thing and incentivizing the wrong behavior.

Trust erodes. Engineers feel surveilled rather than trusted. They spend more energy managing expectations than solving problems. Sprint planning stops being a collaborative conversation about what's realistic and becomes a negotiation where both sides are posturing.

Estimation accuracy gets worse, not better. The irony is cruel: the harder you push for precise time estimates, the less reliable your estimates become. Pressure introduces bias. Bias destroys signal.

Alternatives That Actually Work

If story points are being misused on your team, you have options. Some teams do better by switching to a different approach entirely.

T-Shirt Sizing

Instead of numbers, use S/M/L/XL. It's deliberately imprecise, which is the point. It forces conversations about relative size without the illusion of mathematical precision. A product owner can look at a roadmap of mediums and larges and get a rough sense of scope without anyone pretending an L is exactly 3.5 days.

The No-Estimates Movement

This one is more radical: stop estimating individual stories altogether. Instead, focus on breaking work into small, similarly-sized pieces and measuring throughput. If your team completes an average of 12 items per sprint and the backlog has 36 items of similar size, you can forecast "about 3 sprints" without ever assigning a number to any individual story.

This works best when you have discipline around story decomposition. If some stories are tiny and others are enormous, throughput-based forecasting falls apart.

Cycle Time Forecasting

This is my preferred approach for teams that have at least a few months of historical data. Instead of estimating forward, you look backward at how long work actually takes.

Cycle time is the duration from when work starts to when it's done. If you track this per story or per ticket, you can calculate percentiles:

  • "85% of our stories complete within 7 days."
  • "50% complete within 3 days."

Now when someone asks "when will this be done?" you give a probabilistic answer: "Based on our history, there's an 85% chance it'll be done within 7 working days." That's more honest, more defensible, and more useful than any point-based estimate.

Tools like Jira, Linear, and Shortcut can generate cycle time data. You don't need anything fancy — a spreadsheet tracking start date and completion date per story will get you 80% of the value.

When Estimation Adds Value

I'm not anti-estimation. Estimation is useful when it serves a clear purpose:

  • Scoping large initiatives. When you need to decide between two projects that might take 3 months vs. 9 months, rough estimation helps allocate resources and set expectations. T-shirt sizing or story mapping works well here.
  • Identifying unknowns. If the team can't agree on whether something is a 3 or a 13, that disagreement is the value. It surfaces different assumptions, missing requirements, and hidden complexity. The conversation around the estimate is more valuable than the number.
  • Capacity planning. Knowing approximately how much work fits in a sprint helps teams avoid overcommitting. But this only works if you protect the estimates from being weaponized into deadlines.

When Estimation Is Pure Waste

Estimation adds zero value when:

  • Every story is roughly the same size. If your team has gotten good at breaking work into small chunks, the estimates are redundant. Just count stories.
  • The estimates don't feed any decision. If nobody uses the estimates to make a planning decision, you're spending 30 minutes per sprint on a ritual that produces nothing.
  • The estimates are used to blame, not to plan. "You said this was a 3 and it took a week." If that sentence has ever been spoken on your team, your estimation process is doing more harm than good.

The Real Goal: Predictability, Not Precision

Here's the thing most people miss: the goal of any estimation practice is predictability, not precision. You don't need to know that Feature X will take exactly 14 days. You need to know that your team can reliably deliver a set of features within a quarter. That's a different problem, and it's solved with flow metrics, not better guessing.

Track cycle time. Measure throughput. Monitor work-in-progress limits. These are lagging indicators that reflect reality, not leading indicators that reflect hope.

The teams I've worked with that have the highest delivery predictability are not the ones with the most sophisticated estimation rituals. They're the ones that keep work small, limit WIP, and use historical data to forecast. No planning poker required.

At Conectia, the senior engineers we embed in your teams understand this. They've worked on enough teams to know that estimation theater wastes everyone's time, and they bring practical habits — small PRs, clear scope, consistent throughput — that make your delivery predictable without turning sprint planning into a negotiation. That's what experienced engineers do: they make the process work, not just the code.


Tired of estimation theater slowing your team down? Talk to a CTO — our senior LATAM engineers bring the delivery habits that make predictability a byproduct, not a battle.

Ready to build your engineering team?

Talk to a technical partner and get CTO-vetted developers deployed in 72 hours.