Riddle me this: How many interviews (or focus groups) are enough?

Emily Namey

This blog post is the final in a series of three sampling-focused posts.

The first two posts in this series describe commonly used research sampling strategies and provide some guidance on how to choose from this range of sampling methods. Here we delve further into the sampling world and address sample sizes for qualitative research and evaluation projects. Specifically, we address the often-asked question: How many in-depth interviews/focus groups do I need to conduct for my study?

Within the qualitative literature (and community of practice), the concept of “saturation” – the point when incoming data produce little or no new information – is the well-accepted standard by which sample sizes for qualitative inquiry are determined (Guest et al., 2006; Guest and MacQueen, 2008). There’s just one small problem with this: saturation, by definition, can be determined only during or after data analysis. And most of us need to justify our sample sizes (to funders, ethics committees, etc.) before collecting data!

Until relatively recently, researchers and evaluators had to rely on rules of thumb or their personal experiences to estimate how many qualitative data collection events they needed for a study; empirical data to support these sample sizes were virtually non-existent. This began to change a little over a decade ago. Morgan and colleagues (2002) decided to plot (and publish!) the number of new concepts identified in successive interviews across four datasets. They found that nearly no new concepts were found after 20 interviews. Extrapolating from their data, we see that the first five to six in-depth interviews produced the majority of new data, and approximately 80% to 92% of concepts were identified within the first 10 interviews.

Building on this work, Guest et al. (2006) conducted a systematic inductive thematic analysis of 60 in-depth interviews among female sex workers in West Africa. Of the 114 themes identified in the entire dataset, 80 (70%) turned up in the first six interviews, and 100 themes (92%) were identified within the first 12 interviews (Figure 1). Additionally, those 100 themes comprised 97% of the most common (highest prevalence) themes, indicating that the “big ones” were evident early on.

Figure 1. Number of new codes identified in batches of six individual interviews (Guest et al., 2006)

Since Guest et al.’s publication in 2006, other researchers have confirmed that 6-12 interviews seem to be a sweet spot for the number of qualitative interviews needed to reach saturation. We provide the following table as a summary.

Study authors	Saturation definition	Findings
Individual interviews
Morgan and colleagues (2002)	Not defined	5-6 interviews for most concepts In all four sets of interviews, approximately 80-92% of concepts identified within 10 interviews (extrapolated from reported data)
Guest et al. (2006)	The proportion of identified themes at a given point in analysis divided by the total number of themes identified in that analysis	6 interviews to reach 70% saturation 12 interviews to reach 92% saturation
Francis et al. (2010) (gated)	The point, after conducting 10 interviews, when three additional interviews yield no new themes	Most themes in both studies identified within 5-6 interviews Saturation reached within 17 interviews in one study, and not reached in 14 interviews in a second study
Coenen et al. (2012) (gated)	The point at which linking concepts from two consecutive focus groups or individual interviews reveals no additional second-level categories	Inductive approach: 13 interviews to reach saturation Deductive approach: 8 interviews to reach saturation
Hagaman and Wutich (2016) (gated)	The number of interviews required to identify the most common themes in a total of three interviews	Less than 16 interviews at site level 20-40 interviews to identify cross-cultural meta-themes
Namey, et al. (2016)	The proportion of identified themes at a given point in analysis divided by the total number of themes identified in that analysis	At the median: 8 interviews to reach 80% saturation (range 5-11) 16 interviews to reach 90% saturation (range 11-26)

“But what about focus groups?” you ask. An empirically-based study by Coenen et al. (2012) (gated) found that five focus groups were enough to reach saturation for their inductive thematic analysis. In a recent methodological study (gated), we followed a similar approach used by Guest et al. (2006) and monitored thematic discovery and code creation after each of 40 focus groups conducted among African-American men in North Carolina on the topic of health-seeking behavior (more on this study and its methodological findings here). We found the majority of themes were identified within the first focus group, and nearly all of the important (read most frequently expressed) themes were discovered within the first three focus groups (Figure 2).

Figure 2. Average number of new codes identified per focus group (focus groups randomly ordered) (Guest et al., 2016)

These data from our study suggest that a sample size of two to three focus groups will likely capture about 80% of themes on a topic — including those most broadly shared — in a study with a relatively homogeneous population, and using a semi-structured guide. As few as three to six focus groups are likely enough to identify 90% of important themes.

Note that these sample sizes, for both interviews and focus groups, apply per sub-population of interest. Note too that thematic saturation will vary based on a number of factors (keep watch for a future blog post) and sample size should be adjusted accordingly.

Use this catchy poem to remember how many in-depth interviews or focus groups you need.

Sampling to reach saturation?
Here’s the magical equation:
For interviews, to do them well,
choose a sample from 6-12*;
If focus groups are in the mix,
aim to conduct 3-6*.
(Okay, equation it is not
But empirical guidance helps a lot!)

*per sub-population of interest

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Emily Namey

Related Posts

Exploring the parameters of “it depends” for estimating the rate of data saturation in qualitative inquiry

Learning about focus groups from an RCT

Turning lemons into lemonade, and then drinking it: Rigorous evaluation under challenging conditions

Never miss an email

Our use of cookies