Concept Testing Questions: Examples for Product and Marketing Research

Concept testing questions that uncover emotional reactions, usage scenarios, and perceived value, not just polite feedback. Examples for product and marketing teams.

Dieter De Mesmaeker Headshot

Dieter De Mesmaeker

Co-Founder & CEO

Articles

A smiling woman holds a smartphone and a coffee cup outdoors, with five white star icons overlaid above her head and two white speech bubble icons to her lower left, one labeled "Reliable Feedback" and one showing an ellipsis. The image sits on a warm orange-to-pink gradient background.
A smiling woman holds a smartphone and a coffee cup outdoors, with five white star icons overlaid above her head and two white speech bubble icons to her lower left, one labeled "Reliable Feedback" and one showing an ellipsis. The image sits on a warm orange-to-pink gradient background.

In this article

In this article

Qualitative insights at the speed of your business

Conveo automates video interviews to speed up decision-making.

TL;DR

  • Most concept testing questions stop at surface-level scores and never uncover the reasoning behind participant reactions

  • Effective concept tests are built around five question categories: comprehension, emotional reaction, perceived value, usage scenario, and competitive context

  • Each category answers a different business question; conflating them produces noise, separating them produces decisions

  • Always establish baseline behavior before showing a concept: reactions only make sense against what potential customers currently use

  • A concept that scores well on appeal but feels familiar rather than distinctive will cannibalize existing products rather than grow the category

  • Adaptive probing (following up based on what a participant actually said) is where the real insight lives

  • Video-first, AI-moderated interviews capture tone, hesitation, and facial cues that surveys and transcripts miss

  • Multi-market concept tests break when culturally loaded questions are applied without adaptation

Most concept-testing questions elicit polite, surface-level feedback that does not explain why consumers react as they do. A participant rates purchase intent at 7 out of 10, says the concept "seems useful," and moves on. The researcher gets a score. The product team gets a slide. Nobody gets an answer.

The consequence is research debt: teams accumulate purchase-intent data without the clear direction needed to tell them what to change. A concept scores 62% trial intent. Is that because the price feels high, the benefit is unclear, or the format conflicts with how people already solve the problem? The survey cannot tell you.

Concept testing matters because it is the one stage of the product development process where a product idea can still be reshaped before it reaches the market. Good concept testing gives product managers a reliable read on how potential customers would use, value, and choose a new concept, before committing resources to a final product. This guide shows how to design concept testing questions that uncover the trade-offs, hesitations, and context that matter for a successful product launch.

Why most concept testing questions fail

A text card on a light cream background featuring a white rounded card titled "Why most concept testing questions fail," with a description stating that AI-moderated interviews surface reliable feedback from potential customers at the speed of a survey with the depth of a qualitative interview.

Survey-based concept tests stop at the surface. Closed-ended formats (Likert scales from strongly agree to strongly disagree, purchase intent ratings, single-select preference questions) aggregate responses from survey respondents; they cannot explore them. Asking participants to rate the following statements on a 1-to-5 scale produces concept feedback that is directionally useful but diagnostically shallow.

The agency alternative addresses the depth problem but reintroduces the timing problem. A six-week qual study yields rich, probed responses from real conversations, but only after the roadmap has already been updated or the campaign brief has already gone out.

The gap teams are trying to close is not "fast vs. deep." The assumption is that those two things cannot coexist. AI-moderated interviews with adaptive probing change that constraint directly: they surface reliable feedback from potential customers at the speed of a survey with the depth of a qualitative interview.

"We ran a concept test for a new product line, in one night we had 200 interviews analyzed."

— CMI Lead, Edgar & Cooper

The core concept testing questions every study needs

Good concept testing questions are not a list you hand participants and score. They are a framework organized around five key types of evidence, each answering a different business question.

Category

Business question it answers

What breaks without it

Comprehension

Does the concept communicate its purpose?

Negative reactions blamed on the idea when the real problem is unclear messaging

Emotional reaction

What do participants feel, and why?

Purchase intent scores with no explanation behind them

Perceived value

Would they pay for it? What would they give up?

Price decisions made without switching cost data

Usage scenario

When and why would they actually use it?

Concepts that feel appealing in the abstract but fail in real life

Competitive context

What would they choose instead?

Weak differentiation and missed switching barriers

Here are 2-3 example questions per category, with adaptive follow-ups where probing is most important.

Comprehension

Concept validation questions in this category confirm that respondents understand the concept before they provide feedback on appeal or value. Skip this step, and you risk measuring reactions to confusion rather than to the idea itself (a critical failure in any concept testing research study).

  • "In your own words, what does this product do?"

  • "Who do you think this is designed for?"

  • "What problem is this trying to solve?"

What this reveals: Whether the concept communicates its purpose, and whether the target audience sees themselves in it. Mismatches here are a positioning problem, not a concept problem.

Emotional reaction

  • "What was your first reaction when you saw this concept?"

  • "What word or phrase would you use to describe how this made you feel?"

Follow-up: "Can you say more about what specifically triggered that reaction?"

What this reveals: The emotional drivers behind interest or hesitation. Tone, hesitation, and facial cues in video interviews often surface here before words do.

Perceived value

  • "What would you expect to pay for something like this?"

  • "What would you have to give up or change to use this?"

  • "Is there anything here that justifies a premium over what you use today?"

What this reveals: Price sensitivity, switching cost perception, and whether the value proposition lands as genuinely differentiated or as a marginal improvement on the status quo.

Usage scenario

  • "Walk me through a situation where you would actually use this."

  • "When in your day or week would this fit in?"

  • "Can you think of a time in the past month when something like this would have helped?"

Follow-up: "What would have to be true for you to reach for this instead of what you currently use?"

What this reveals: Whether the use case is real and recurring, or whether interest is aspirational. Participants who struggle to place a new concept in a specific scenario are signaling a frequency or relevance problem.

Competitive context

  • "What do you currently use to solve this problem?"

  • "How does this compare to what you already have?"

  • "What would make you choose this over what you use today?"

What this reveals: The actual competitive set in participants' minds, the switching threshold, and the specific dimensions on which your concept needs to win.

One practical note: open-ended questions in every category require follow-up questions to produce useful data. The follow-up is not optional. It is where the insight lives.

How to structure questions for real usage scenarios

Generic "sounds good" feedback tells you nothing about when someone would reach for the product or whether they encounter the problem often enough to matter. Usage scenario questions close that gap: anchor responses in real behavior, not hypothetical preferences, and surface the pain points that actually drive decisions.

One sequencing principle matters above all else: establish baseline behavior before presenting the concept. A screening question that captures what participants currently use, when, and why establishes the baseline against which the new concept is compared. Without it, reactions float free of any real alternative, and the switching-cost signal disappears.

Four questions that consistently produce that level of detail:

"Walk me through the last time you ran into [the problem this concept solves]. What happened?" Forces participants into episodic memory rather than hypothetical mode. Reveals how frequently the problem occurs and what their default response looked like.

"What are you using right now to handle this? What works about it, and what doesn't?" Maps the existing solution landscape and identifies where current alternatives fall short. Those gaps are where the concept has to perform.

"If this product existed today, what would you stop using (or stop doing) to make room for it?" Adoption always involves displacement. This question makes that trade-off explicit and tests whether the value proposition is strong enough to justify a behavioral change.

"What would need to be different about this concept for you to choose it over what you use now?" The most actionable feedback of any concept test: it tells you exactly what the concept needs to deliver to win.

Adaptive probing: The missing layer in most concept tests

Adaptive probing is a crucial part of any rigorous concept testing process. It means following up based on what a participant said, not what the script anticipated. Static survey formats cannot do this: the question order is fixed at design time, and branching logic can only route participants based on structured responses. It cannot respond to tone, detect hesitation, or recognize that "it's interesting" is not the same as "I'd buy this."

Critically, static scripts tend to produce leading questions by default: fixed question sequences telegraph what the researcher expects to hear. Adaptive probing sidesteps this entirely, generating thoughtful answers rather than first-instinct responses shaped by the order of questions.

The difference in practice:

  • Participant says, "It's interesting." Follow-up: "What specifically makes it interesting to you?" Not the next survey question about purchase intent.

  • Participant says, "I'm not sure I'd pay that much." Follow-up: "What price would feel right? What would make the higher price worth it?" That exchange surfaces the actual value gap.

  • Participant pauses before answering. Follow-up: "You paused there. What were you thinking?" That pause disappears entirely in a survey.

  • Participant says a concept "sounds good," but their tone is flat. A skilled moderator (human or AI) notices the mismatch and probes further.

Negative feedback is where the most actionable signal lives. A participant who says "I wouldn't pay that much" or "I already have something that does this" is telling you exactly what the concept needs to address. A fixed question set suppresses that; adaptive probing draws it out.

Video interviews add a layer that text responses cannot. Facial expressions during feature explanations, tone shifts when the price is mentioned, and the difference between genuine enthusiasm and polite agreement: these signals are visible on camera but absent from survey data.

Conveo makes every finding traceable to the participant's video clip, explaining their reaction. The output is not "68% found the concept appealing"; it is a participant explaining exactly what appealed to them, followed by the probe that elicited the real answer.

Concept testing questions by use case

The specific examples below are organized by artifact type because generic concept test survey questions tend to follow the same template regardless of what is being evaluated. Different types of concept testing suit different stages of the development process; the most cost-effective test strategy is one that matches question design to the evidence the decision actually needs.

CPG packaging and product concepts

Evidence needed: shelf standout, category fit, perceived quality signals.

  • "If you saw this on the shelf next to [competitor brand], which would you reach for first? Walk me through what drew your eye."

  • "What does this packaging tell you about the product inside, before you read any of the text?"

  • "Does this feel like it's made for someone like you? What gives you that impression?"

  • "Is there anything about this design that would make you put it back on the shelf?"

SaaS and product feature concepts

Evidence needed: workflow integration, perceived switching friction.

  • "Walk me through where this feature would fit into how you currently handle [specific task]. What would change?"

  • "Of the following features, which would you actually use in your first week? Rank them in order."

  • "What would you need to see before you trusted this enough to use it for real work?"

  • "Is there anything in your current setup that this would replace? How do you feel about that trade-off?"

  • "If this existed today, what would stop you from using it within the first week?"

Ad copy and messaging concepts

Evidence needed: message recall, emotional resonance with the target market.

  • "After looking at this for a few seconds, what's the main thing it's telling you?"

  • "Who do you think this is speaking to? Is that you?"

  • "Is there anything here that feels off, unclear, or like it's trying too hard?"

  • "If a friend described this ad to you later, what would they say it was about?"

Service and experience concepts

Evidence needed: expectation alignment, confidence breakpoints.

  • "What would you expect to happen next, after this step? Does what you see match that?"

  • "At what point in this experience would you feel confident you'd made the right choice?"

  • "Is there a moment here where you'd want to speak to a real person rather than continue on your own?"

The "advise the brand" close

Regardless of use case, one closing question consistently produces the most actionable output in multi-concept studies:

"Thinking about everything you've seen, which concept would you tell [brand] to launch, and why? What would happen to the brand if it launched the weakest one instead?"

Framing it as advice rather than preference shifts participants out of evaluator mode. The follow-up on the weak concept forces them to articulate the specific risk the brand would take, not just their personal preference.

A note on sequential monadic testing: When testing more than one concept, show each participant a single concept at a time, collect all reactions, then move to the next. This comparative testing approach preserves fresh first impressions and avoids contamination of later ratings by earlier exposures. Presenting multiple options simultaneously risks anchoring effects that skew the data. In monadic testing, each participant evaluates a single concept in isolation, which makes the data comparable across your customer base. The "advise the brand" question sits at the end, after all concepts have been seen.

Multi-market concept testing: Which questions break across cultures

Running the same question guide across five markets and treating the outputs as comparable is one of the most common ways multi-market concept testing goes sideways. Three question types break most often:

  • Direct pricing questions: Price sensitivity varies by cultural norms around negotiation and what a price point communicates about quality, not just purchasing power.

  • Hypothetical preference questions: In high-context or relationship-oriented cultures, participants tend to answer affirmatively regardless of their reaction, inflating concept scores across your entire customer base.

  • Emotional reaction questions: Enthusiasm is normative in some markets; understated responses are the cultural default in others, making cross-market comparison unreliable without adaptation.

Original question

Breaks in

Why

Adapted version

"Would you buy this?"

High-context markets

Invites polite agreement, not genuine intent

"Walk me through how you'd decide whether to try this."

"What do you dislike?"

Face-saving cultures

Participants deflect or give non-answers

"What would make this feel more right for you?"

"What would you pay for this?"

Markets where price signals quality

Responses reflect cultural norms, not concept value

"When you imagine this on a shelf, what price range would feel appropriate?"

"How excited are you?"

Low-expressiveness cultures

Enthusiasm norms vary; scales are not comparable

"Tell me about a moment when you'd reach for something like this."

With AI moderation in 50+ languages and recruitment across 50+ markets, teams can run parallel fieldwork across regions and compare findings within days rather than the weeks required to coordinate separate agencies market by market.

How to interpret concept testing results (beyond purchase intent scores)

A purchase intent score tells you where a concept landed, not which part of the problem belongs to the idea and which belongs to how it was communicated. Choosing the right metrics (distinctiveness, relevance, purchase-intent questions, willingness to pay, and net promoter score, where applicable) provides a more complete picture for data analysis than any single number. The clarity-versus-appeal matrix is the most practical diagnostic frame once those scores are in:


High clarity

Low clarity

High appeal

Strong candidate: optimize for conversion and pricing

Messaging problem: exciting but confusing. Rework positioning.

Low appeal

Value proposition problem: understood but not compelling. Revisit the core benefit claim.

Fundamental rethink: both the idea and the communication need work.

Watch for the distinctiveness trap: A concept can score well on both clarity and appeal and still fail commercially. If it does not feel meaningfully different from what already exists, it tends to cannibalize existing products rather than grow the category. Always ask participants how new or different the concept feels compared to what they already use, and probe why. A concept that is liked but not distinctive is a substitution play, not an innovation; this is the failure mode that matters most at later stages of development, when significant investment has already been committed.

Separating novelty from real demand: "Do you like this?" is not the same as "Would you switch from what you use now?" Framing the probe around switching behavior surfaces the actual competitive barrier and gives product managers the insight needed to assess potential success before the final product goes to market.

Order effects: When participants rate concepts sequentially, the first anchors expectations. Use ranking questions or forced-choice ("Which concept would you choose?") rather than rating each in isolation; forced-choice mirrors real decision-making, while ranking questions encourage generosity.

Concept testing is an iterative process, not a one-off gate. Gathering feedback across multiple stages (from early product idea through to near-final concepts) reduces launch risk more than a single high-stakes test. Text-only feedback misses what participants communicate: tone shifts, hesitation, and a participant re-reading a statement because something did not land. These signals do not survive transcription. Platforms that deliver traceable, video-backed findings help stakeholders trust the interpretation, because every theme can be audited back to the real conversation.

How Conveo makes concept testing faster and deeper

A branded Conveo graphic on an orange-to-pink gradient background, showing the Conveo logo in a white card at the top connected to a four-item winding flowchart with orange gradient numbered icons: 1. Adaptive probing at scale, 2. Traceable evidence, 3. Multi-market speed, 4. A compounding knowledge library.

Concept testing has always forced a trade-off between depth and speed: surveys deliver scores without reasoning, and agency qual takes 6 to 8 weeks, long after the decision window has closed.

Conveo, the video-first AI research platform, removes that trade-off. Its AI moderator conducts adaptive video interviews, probing vague answers in real time; asynchronous sessions let teams run hundreds of conversations with prospective customers in parallel, making it one of the most effective platforms for high-volume concept testing.

Watch: How to build and launch a study in Conveo →

For concept testing specifically, that means:

  • Adaptive probing at scale: Every participant gets a follow-up tailored to what they said. A 7-out-of-10 rating becomes a conversation about what would push it to a 9.

  • Traceable evidence: Every finding links back to a video clip in which the participant explains their reaction. Stakeholders audit the insight, not just the summary.

  • Multi-market speed: AI moderation in 50+ languages and recruitment across 50+ markets via integrated panel partners enable parallel fieldwork across regions, with findings ready for comparison within days.

  • A compounding knowledge library: Insights from every concept test feed a searchable library, so the next study builds on what the last one found.

Qualitative insights at the speed of your business:

Qualitative insights at the speed of your business:

Frequently Asked Questions

What concept testing questions actually get people to explain their gut reaction instead of giving polite, surface-level feedback?

How do I structure concept testing questions so I learn the real usage scenario rather than a generic "sounds good"?

What's the difference between concept testing survey questions and qualitative concept testing questions?

How many concept-testing questions should I ask in a single interview?

When should I use qualitative concept testing instead of a survey?

Qualitative insights at the speed of your business

Conveo automates video interviews to speed up decision-making.

Related articles.

News

How AI-Powered Qual Helps You Hear the ‘Why’ Behind Customer Behavior

You’ve seen it happen. A number on the dashboard blips,engagement dips, CTR slides, NPS stalls, then Slack lights up: What changed? Maybe your concept test shows B beating A, but nobody can articulate why. The team starts guessing: “Was it the headline? The color? The whole premise?” This is the moment qualitative research earns its keep. Not the old, slow, twelve-weeks‑to-a-powerpoint version,AI‑powered qual that moves at the speed of the business and turns raw customer language into crisp, defensible decisions. In this post, we’ll show you exactly how to use it to get from what happened to why it happened,and what to do next.

Headshot of Florian Hendrickx

Florian Hendrickx

Head of Growth

Success stories

“The Quickest Wins”: How Pronails Finds Creative Sparks Faster with Conveo

If you work in consumer marketing, you can feel the ground shifting under your feet. New formats pop up, algorithms blink, trends peak and vanish. The brands that thrive aren’t necessarily the loudest,they’re the ones that move the fastest, test the most, and let real customer language guide every creative decision. That’s the spirit of Lize Olaerts, Marketing Manager at **PN Self‑Care**, the B2C sister of **ProNails**, a manufacturer and distributor of professional gel nail products. As Lize puts it, “In marketing, the one who is the quickest wins.” In this story,filmed as a short testimonial,you’ll hear how her team uses **Conveo** to move from *hunch* to *hook* faster: spotting fresh segments they missed, turning real customer phrases into scroll‑stopping ad angles, and ramping up the volume of creative tests without burning the team out. Whether you run paid social for a beauty brand or you’re building a new DTC play, Lize’s process is a blueprint for speed without sacrificing substance.

Headshot of Hendrick Van Hove

Hendrik Van Hove

Founder & CPO

Success stories

“From Hunches to Evidence”: Why Louis (Founder & CMO of Edgar & Cooper) says CMI is like special forces.

Hit play on the testimonial from Louis (CMO), Levi (Head of CMI), and Pieter (CMI Manager) from Edgar & Cooper, a General Mills company. In a few minutes, you’ll see how Conveo blends qual-depth with quant-confidence, running interviews in parallel, surfacing the “why,” and giving teams evidence they can literally watch. Pieter captures the surprise best: an AI interviewer that asks nuanced, accurate follow-ups and feels genuinely reliable.

Headshot of Alex de Hemptinne

Alex de Hemptinne

Head of Customer Success

Decisions powered by talking to real people.

Automate interviews, scale insights, and lead your organization into the next era of research.