Watch the full debrief

Two voices. One question. The insider reaction you don't usually see.

Also on YouTube 5–7 min 2026

Question decoded

"How would you measure the success of a feature that has no direct revenue signal?"

Competency tested

Are Right, A Lot

Who asks it

Bar Raiser · HM · Peer

What they're really asking

Can you reason causally before you measure?

Answers compared

The answer that fails — and why

Candidate answer Does not raise the bar — Are Right, A Lot

I would start by identifying proxy metrics that correlate with the outcome we care about. For a feature like a new onboarding flow, I might look at activation rate, time-to-first-action, and seven-day retention. I would track these in an A/B test and check for statistically significant lift. If the proxies move in the right direction, that is a strong signal the feature is working. I would also segment by cohort to make sure the gains are not driven by a single user group.

Bar Raiser evaluation

⚑ Jumps to metric selection without establishing causal chain first

⚑ No validation step — assumes proxy correlation equals real outcome

⚑ Statistical significance mentioned but no discussion of proxy risk or drift

⚑ No acknowledgment that the framework itself could be wrong

Prefer to hear it? Watch the video for the two-voice delivery with live reaction commentary.

Amazon debrief · DS loop · Bar Raiser evaluation Below Bar

Leadership Principle: Are Right, A Lot

Does not demonstrate Are Right, A Lot.

✗ Proposes proxy metrics with no causal chain from feature to outcome

✗ Treats proxy correlation as sufficient evidence; no invalidation step

✗ Discusses statistical significance but not proxy validity or measurement risk

✗ No indication candidate understands the framework itself could be wrong

interview101.com · Are Right, A Lot · Amazon DS · Bar Raiser debrief reference

→ Now here's what a strong answer actually sounds like

The answer that works — in full

Strong answer Raises the bar — Are Right, A Lot

Before I pick any metric, I want to walk the causal chain — what behavior does this feature change, and how does that behavior connect to a customer outcome we actually care about? For a new onboarding flow, the chain might be: feature reduces friction → users reach first meaningful action faster → they form a habit → thirty-day retention improves. I would propose time-to-first-meaningful-action and thirty-day retention as my leading and lagging proxies. But here is the part I think is often skipped: I would then actively try to break my own framework. What if retention improves but users are doing the action mechanically without real value? I would pair the quantitative signal with a small qualitative check — session recordings or a targeted survey — to validate the proxy is measuring engagement and not just motion. I would document that assumption explicitly before shipping the measurement plan, so the team knows what would cause us to revisit it.

Bar Raiser evaluation

✓ Opens with causal chain before naming a single metric

✓ Distinguishes leading and lagging proxies with explicit rationale

✓ Proactively stress-tests own framework — core Are Right, A Lot signal

✓ Documents assumptions so the team can invalidate them later

Amazon debrief · DS loop · Bar Raiser evaluation Raises Bar

Leadership Principle: Are Right, A Lot

Strong signal. Raises the bar.

✓ Establishes causal chain from feature behavior to customer outcome before measuring

✓ Distinguishes leading and lagging proxies; explains rationale for each

✓ Actively stress-tests own framework — identifies conditions that would invalidate it

✓ Documents assumptions explicitly so team can revisit if proxy drifts

interview101.com · Are Right, A Lot · Amazon DS · Bar Raiser debrief reference

Fix your answer before your loop

Run your story through these three questions

1

Can you name the causal step between the feature and the metric?

If not, you are measuring activity, not customer impact — the Bar Raiser will see it.

2

Have you described one way your proxy metric could be wrong?

A metric you cannot falsify is not rigorous — it is just a guess with a dashboard.

3

Did you document the assumption that would cause you to revisit?

Without a stated condition, your framework has no expiration date and no owner.

Get your personalized report

How do your real stories score?

Get a personalized report scored against the interview rubric Amazon uses for your role.

Get your Amazon Data Scientist report →

More Amazon Data Scientist debriefs