Logo for Open Educational Resources

Chapter 17. Content Analysis

Introduction.

Content analysis is a term that is used to mean both a method of data collection and a method of data analysis. Archival and historical works can be the source of content analysis, but so too can the contemporary media coverage of a story, blogs, comment posts, films, cartoons, advertisements, brand packaging, and photographs posted on Instagram or Facebook. Really, almost anything can be the “content” to be analyzed. This is a qualitative research method because the focus is on the meanings and interpretations of that content rather than strictly numerical counts or variables-based causal modeling. [1] Qualitative content analysis (sometimes referred to as QCA) is particularly useful when attempting to define and understand prevalent stories or communication about a topic of interest—in other words, when we are less interested in what particular people (our defined sample) are doing or believing and more interested in what general narratives exist about a particular topic or issue. This chapter will explore different approaches to content analysis and provide helpful tips on how to collect data, how to turn that data into codes for analysis, and how to go about presenting what is found through analysis. It is also a nice segue between our data collection methods (e.g., interviewing, observation) chapters and chapters 18 and 19, whose focus is on coding, the primary means of data analysis for most qualitative data. In many ways, the methods of content analysis are quite similar to the method of coding.

qualitative content analysis in education research

Although the body of material (“content”) to be collected and analyzed can be nearly anything, most qualitative content analysis is applied to forms of human communication (e.g., media posts, news stories, campaign speeches, advertising jingles). The point of the analysis is to understand this communication, to systematically and rigorously explore its meanings, assumptions, themes, and patterns. Historical and archival sources may be the subject of content analysis, but there are other ways to analyze (“code”) this data when not overly concerned with the communicative aspect (see chapters 18 and 19). This is why we tend to consider content analysis its own method of data collection as well as a method of data analysis. Still, many of the techniques you learn in this chapter will be helpful to any “coding” scheme you develop for other kinds of qualitative data. Just remember that content analysis is a particular form with distinct aims and goals and traditions.

An Overview of the Content Analysis Process

The first step: selecting content.

Figure 17.2 is a display of possible content for content analysis. The first step in content analysis is making smart decisions about what content you will want to analyze and to clearly connect this content to your research question or general focus of research. Why are you interested in the messages conveyed in this particular content? What will the identification of patterns here help you understand? Content analysis can be fun to do, but in order to make it research, you need to fit it into a research plan.

Figure 17.1. A Non-exhaustive List of "Content" for Content Analysis

To take one example, let us imagine you are interested in gender presentations in society and how presentations of gender have changed over time. There are various forms of content out there that might help you document changes. You could, for example, begin by creating a list of magazines that are coded as being for “women” (e.g., Women’s Daily Journal ) and magazines that are coded as being for “men” (e.g., Men’s Health ). You could then select a date range that is relevant to your research question (e.g., 1950s–1970s) and collect magazines from that era. You might create a “sample” by deciding to look at three issues for each year in the date range and a systematic plan for what to look at in those issues (e.g., advertisements? Cartoons? Titles of articles? Whole articles?). You are not just going to look at some magazines willy-nilly. That would not be systematic enough to allow anyone to replicate or check your findings later on. Once you have a clear plan of what content is of interest to you and what you will be looking at, you can begin, creating a record of everything you are including as your content. This might mean a list of each advertisement you look at or each title of stories in those magazines along with its publication date. You may decide to have multiple “content” in your research plan. For each content, you want a clear plan for collecting, sampling, and documenting.

The Second Step: Collecting and Storing

Once you have a plan, you are ready to collect your data. This may entail downloading from the internet, creating a Word document or PDF of each article or picture, and storing these in a folder designated by the source and date (e.g., “ Men’s Health advertisements, 1950s”). Sølvberg ( 2021 ), for example, collected posted job advertisements for three kinds of elite jobs (economic, cultural, professional) in Sweden. But collecting might also mean going out and taking photographs yourself, as in the case of graffiti, street signs, or even what people are wearing. Chaise LaDousa, an anthropologist and linguist, took photos of “house signs,” which are signs, often creative and sometimes offensive, hung by college students living in communal off-campus houses. These signs were a focal point of college culture, sending messages about the values of the students living in them. Some of the names will give you an idea: “Boot ’n Rally,” “The Plantation,” “Crib of the Rib.” The students might find these signs funny and benign, but LaDousa ( 2011 ) argued convincingly that they also reproduced racial and gender inequalities. The data here already existed—they were big signs on houses—but the researcher had to collect the data by taking photographs.

In some cases, your content will be in physical form but not amenable to photographing, as in the case of films or unwieldy physical artifacts you find in the archives (e.g., undigitized meeting minutes or scrapbooks). In this case, you need to create some kind of detailed log (fieldnotes even) of the content that you can reference. In the case of films, this might mean watching the film and writing down details for key scenes that become your data. [2] For scrapbooks, it might mean taking notes on what you are seeing, quoting key passages, describing colors or presentation style. As you might imagine, this can take a lot of time. Be sure you budget this time into your research plan.

Researcher Note

A note on data scraping : Data scraping, sometimes known as screen scraping or frame grabbing, is a way of extracting data generated by another program, as when a scraping tool grabs information from a website. This may help you collect data that is on the internet, but you need to be ethical in how to employ the scraper. A student once helped me scrape thousands of stories from the Time magazine archives at once (although it took several hours for the scraping process to complete). These stories were freely available, so the scraping process simply sped up the laborious process of copying each article of interest and saving it to my research folder. Scraping tools can sometimes be used to circumvent paywalls. Be careful here!

The Third Step: Analysis

There is often an assumption among novice researchers that once you have collected your data, you are ready to write about what you have found. Actually, you haven’t yet found anything, and if you try to write up your results, you will probably be staring sadly at a blank page. Between the collection and the writing comes the difficult task of systematically and repeatedly reviewing the data in search of patterns and themes that will help you interpret the data, particularly its communicative aspect (e.g., What is it that is being communicated here, with these “house signs” or in the pages of Men’s Health ?).

The first time you go through the data, keep an open mind on what you are seeing (or hearing), and take notes about your observations that link up to your research question. In the beginning, it can be difficult to know what is relevant and what is extraneous. Sometimes, your research question changes based on what emerges from the data. Use the first round of review to consider this possibility, but then commit yourself to following a particular focus or path. If you are looking at how gender gets made or re-created, don’t follow the white rabbit down a hole about environmental injustice unless you decide that this really should be the focus of your study or that issues of environmental injustice are linked to gender presentation. In the second round of review, be very clear about emerging themes and patterns. Create codes (more on these in chapters 18 and 19) that will help you simplify what you are noticing. For example, “men as outdoorsy” might be a common trope you see in advertisements. Whenever you see this, mark the passage or picture. In your third (or fourth or fifth) round of review, begin to link up the tropes you’ve identified, looking for particular patterns and assumptions. You’ve drilled down to the details, and now you are building back up to figure out what they all mean. Start thinking about theory—either theories you have read about and are using as a frame of your study (e.g., gender as performance theory) or theories you are building yourself, as in the Grounded Theory tradition. Once you have a good idea of what is being communicated and how, go back to the data at least one more time to look for disconfirming evidence. Maybe you thought “men as outdoorsy” was of importance, but when you look hard, you note that women are presented as outdoorsy just as often. You just hadn’t paid attention. It is very important, as any kind of researcher but particularly as a qualitative researcher, to test yourself and your emerging interpretations in this way.

The Fourth and Final Step: The Write-Up

Only after you have fully completed analysis, with its many rounds of review and analysis, will you be able to write about what you found. The interpretation exists not in the data but in your analysis of the data. Before writing your results, you will want to very clearly describe how you chose the data here and all the possible limitations of this data (e.g., historical-trace problem or power problem; see chapter 16). Acknowledge any limitations of your sample. Describe the audience for the content, and discuss the implications of this. Once you have done all of this, you can put forth your interpretation of the communication of the content, linking to theory where doing so would help your readers understand your findings and what they mean more generally for our understanding of how the social world works. [3]

Analyzing Content: Helpful Hints and Pointers

Although every data set is unique and each researcher will have a different and unique research question to address with that data set, there are some common practices and conventions. When reviewing your data, what do you look at exactly? How will you know if you have seen a pattern? How do you note or mark your data?

Let’s start with the last question first. If your data is stored digitally, there are various ways you can highlight or mark up passages. You can, of course, do this with literal highlighters, pens, and pencils if you have print copies. But there are also qualitative software programs to help you store the data, retrieve the data, and mark the data. This can simplify the process, although it cannot do the work of analysis for you.

Qualitative software can be very expensive, so the first thing to do is to find out if your institution (or program) has a universal license its students can use. If they do not, most programs have special student licenses that are less expensive. The two most used programs at this moment are probably ATLAS.ti and NVivo. Both can cost more than $500 [4] but provide everything you could possibly need for storing data, content analysis, and coding. They also have a lot of customer support, and you can find many official and unofficial tutorials on how to use the programs’ features on the web. Dedoose, created by academic researchers at UCLA, is a decent program that lacks many of the bells and whistles of the two big programs. Instead of paying all at once, you pay monthly, as you use the program. The monthly fee is relatively affordable (less than $15), so this might be a good option for a small project. HyperRESEARCH is another basic program created by academic researchers, and it is free for small projects (those that have limited cases and material to import). You can pay a monthly fee if your project expands past the free limits. I have personally used all four of these programs, and they each have their pluses and minuses.

Regardless of which program you choose, you should know that none of them will actually do the hard work of analysis for you. They are incredibly useful for helping you store and organize your data, and they provide abundant tools for marking, comparing, and coding your data so you can make sense of it. But making sense of it will always be your job alone.

So let’s say you have some software, and you have uploaded all of your content into the program: video clips, photographs, transcripts of news stories, articles from magazines, even digital copies of college scrapbooks. Now what do you do? What are you looking for? How do you see a pattern? The answers to these questions will depend partially on the particular research question you have, or at least the motivation behind your research. Let’s go back to the idea of looking at gender presentations in magazines from the 1950s to the 1970s. Here are some things you can look at and code in the content: (1) actions and behaviors, (2) events or conditions, (3) activities, (4) strategies and tactics, (5) states or general conditions, (6) meanings or symbols, (7) relationships/interactions, (8) consequences, and (9) settings. Table 17.1 lists these with examples from our gender presentation study.

Table 17.1. Examples of What to Note During Content Analysis

One thing to note about the examples in table 17.1: sometimes we note (mark, record, code) a single example, while other times, as in “settings,” we are recording a recurrent pattern. To help you spot patterns, it is useful to mark every setting, including a notation on gender. Using software can help you do this efficiently. You can then call up “setting by gender” and note this emerging pattern. There’s an element of counting here, which we normally think of as quantitative data analysis, but we are using the count to identify a pattern that will be used to help us interpret the communication. Content analyses often include counting as part of the interpretive (qualitative) process.

In your own study, you may not need or want to look at all of the elements listed in table 17.1. Even in our imagined example, some are more useful than others. For example, “strategies and tactics” is a bit of a stretch here. In studies that are looking specifically at, say, policy implementation or social movements, this category will prove much more salient.

Another way to think about “what to look at” is to consider aspects of your content in terms of units of analysis. You can drill down to the specific words used (e.g., the adjectives commonly used to describe “men” and “women” in your magazine sample) or move up to the more abstract level of concepts used (e.g., the idea that men are more rational than women). Counting for the purpose of identifying patterns is particularly useful here. How many times is that idea of women’s irrationality communicated? How is it is communicated (in comic strips, fictional stories, editorials, etc.)? Does the incidence of the concept change over time? Perhaps the “irrational woman” was everywhere in the 1950s, but by the 1970s, it is no longer showing up in stories and comics. By tracing its usage and prevalence over time, you might come up with a theory or story about gender presentation during the period. Table 17.2 provides more examples of using different units of analysis for this work along with suggestions for effective use.

Table 17.2. Examples of Unit of Analysis in Content Analysis

Every qualitative content analysis is unique in its particular focus and particular data used, so there is no single correct way to approach analysis. You should have a better idea, however, of what kinds of things to look for and what to look for. The next two chapters will take you further into the coding process, the primary analytical tool for qualitative research in general.

Further Readings

Cidell, Julie. 2010. “Content Clouds as Exploratory Qualitative Data Analysis.” Area 42(4):514–523. A demonstration of using visual “content clouds” as a form of exploratory qualitative data analysis using transcripts of public meetings and content of newspaper articles.

Hsieh, Hsiu-Fang, and Sarah E. Shannon. 2005. “Three Approaches to Qualitative Content Analysis.” Qualitative Health Research 15(9):1277–1288. Distinguishes three distinct approaches to QCA: conventional, directed, and summative. Uses hypothetical examples from end-of-life care research.

Jackson, Romeo, Alex C. Lange, and Antonio Duran. 2021. “A Whitened Rainbow: The In/Visibility of Race and Racism in LGBTQ Higher Education Scholarship.” Journal Committed to Social Change on Race and Ethnicity (JCSCORE) 7(2):174–206.* Using a “critical summative content analysis” approach, examines research published on LGBTQ people between 2009 and 2019.

Krippendorff, Klaus. 2018. Content Analysis: An Introduction to Its Methodology . 4th ed. Thousand Oaks, CA: SAGE. A very comprehensive textbook on both quantitative and qualitative forms of content analysis.

Mayring, Philipp. 2022. Qualitative Content Analysis: A Step-by-Step Guide . Thousand Oaks, CA: SAGE. Formulates an eight-step approach to QCA.

Messinger, Adam M. 2012. “Teaching Content Analysis through ‘Harry Potter.’” Teaching Sociology 40(4):360–367. This is a fun example of a relatively brief foray into content analysis using the music found in Harry Potter films.

Neuendorft, Kimberly A. 2002. The Content Analysis Guidebook . Thousand Oaks, CA: SAGE. Although a helpful guide to content analysis in general, be warned that this textbook definitely favors quantitative over qualitative approaches to content analysis.

Schrier, Margrit. 2012. Qualitative Content Analysis in Practice . Thousand Okas, CA: SAGE. Arguably the most accessible guidebook for QCA, written by a professor based in Germany.

Weber, Matthew A., Shannon Caplan, Paul Ringold, and Karen Blocksom. 2017. “Rivers and Streams in the Media: A Content Analysis of Ecosystem Services.” Ecology and Society 22(3).* Examines the content of a blog hosted by National Geographic and articles published in The New York Times and the Wall Street Journal for stories on rivers and streams (e.g., water-quality flooding).

  • There are ways of handling content analysis quantitatively, however. Some practitioners therefore specify qualitative content analysis (QCA). In this chapter, all content analysis is QCA unless otherwise noted. ↵
  • Note that some qualitative software allows you to upload whole films or film clips for coding. You will still have to get access to the film, of course. ↵
  • See chapter 20 for more on the final presentation of research. ↵
  • . Actually, ATLAS.ti is an annual license, while NVivo is a perpetual license, but both are going to cost you at least $500 to use. Student rates may be lower. And don’t forget to ask your institution or program if they already have a software license you can use. ↵

A method of both data collection and data analysis in which a given content (textual, visual, graphic) is examined systematically and rigorously to identify meanings, themes, patterns and assumptions.  Qualitative content analysis (QCA) is concerned with gathering and interpreting an existing body of material.    

Introduction to Qualitative Research Methods Copyright © 2023 by Allison Hurst is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License , except where otherwise noted.

An official website of the United States government

Official websites use .gov A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS A lock ( Lock Locked padlock icon ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

  • Publications
  • Account settings
  • Advanced Search
  • Journal List

Demystifying Content Analysis

A j kleinheksel , phd, nicole rockich-winston , pharmd, huda tawfik , phd, md, tasha r wyatt , phd.

  • Author information
  • Article notes
  • Copyright and License information

Corresponding Author: A.J. Kleinheksel, The Medical College of Georgia at Augusta University, 1120 15th St., GB 3353, Augusta, GA 30912. Tel: 706-721-0105. Fax: 706-721-2030. Email: [email protected] .

Corresponding author.

Received 2018 Apr 17; Accepted 2019 Feb 20.

Objective. In the course of daily teaching responsibilities, pharmacy educators collect rich data that can provide valuable insight into student learning. This article describes the qualitative data analysis method of content analysis, which can be useful to pharmacy educators because of its application in the investigation of a wide variety of data sources, including textual, visual, and audio files.

Findings. Both manifest and latent content analysis approaches are described, with several examples used to illustrate the processes. This article also offers insights into the variety of relevant terms and visualizations found in the content analysis literature. Finally, common threats to the reliability and validity of content analysis are discussed, along with suitable strategies to mitigate these risks during analysis.

Summary. This review of content analysis as a qualitative data analysis method will provide clarity and actionable instruction for both novice and experienced pharmacy education researchers.

Keywords: content analysis, educational research, latent, manifest, qualitative data analysis

INTRODUCTION

The Academy’s growing interest in qualitative research indicates an important shift in the field’s scientific paradigm. Whereas health science researchers have historically looked to quantitative methods to answer their questions, this shift signals that a purely positivist, objective approach is no longer sufficient to answer pharmacy education’s research questions. Educators who want to study their teaching and students’ learning will find content analysis an easily accessible, robust method of qualitative data analysis that can yield rigorous results for both publication and the improvement of their educational practice. Content analysis is a method designed to identify and interpret meaning in recorded forms of communication by isolating small pieces of the data that represent salient concepts and then applying or creating a framework to organize the pieces in a way that can be used to describe or explain a phenomenon. 1 Content analysis is particularly useful in situations where there is a large amount of unanalyzed textual data, such as those many pharmacy educators have already collected as part of their teaching practice. Because of its accessibility, content analysis is also an appropriate qualitative method for pharmacy educators with limited experience in educational research. This article will introduce and illustrate the process of content analysis as a way to analyze existing data, but also as an approach that may lead pharmacy educators to ask new types of research questions.

Content analysis is a well-established data analysis method that has evolved in its treatment of textual data. Content analysis was originally introduced as a strictly quantitative method, recording counts to measure the observed frequency of pre-identified targets in consumer research. 1 However, as the naturalistic qualitative paradigm became more prevalent in social sciences research and researchers became increasingly interested in the way people behave in natural settings, the process of content analysis was adapted into a more interesting and meaningful approach. Content analysis has the potential to be a useful method in pharmacy education because it can help educational researchers develop a deeper understanding of a particular phenomenon by providing structure in a large amount of textual data through a systematic process of interpretation. It also offers potential value because it can help identify problematic areas in student understanding and guide the process of targeted teaching. Several research studies in pharmacy education have used the method of content analysis. 2-7 Two studies in particular offer noteworthy examples: Wallman and colleagues employed manifest content analysis to analyze semi-structured interviews in order to explore what students learn during experiential rotations, 7 while Moser and colleagues adopted latent content analysis to evaluate open-ended survey responses on student perceptions of learning communities. 6 To elaborate on these approaches further, we will describe the two types of qualitative content analysis, manifest and latent, and demonstrate the corresponding analytical processes using examples that illustrate their benefit.

Qualitative Content Analysis

Content analysis rests on the assumption that texts are a rich data source with great potential to reveal valuable information about particular phenomena. 8 It is the process of considering both the participant and context when sorting text into groups of related categories to identify similarities and differences, patterns, and associations, both on the surface and implied within. 9-11 The method is considered high-yield in educational research because it is versatile and can be applied in both qualitative and quantitative studies. 12 While it is important to note that content analysis has application in visual and auditory artifacts (eg, an image or song), for our purposes we will largely focus on the most common application, which is the analysis of textual or transcribed content (eg, open-ended survey responses, print media, interviews, recorded observations, etc). The terminology of content analysis can vary throughout quantitative and qualitative literature, which may lead to some confusion among both novice and experienced researchers. However, there are also several agreed-upon terms and phrases that span the literature, as found in Table 1 .

Terms and Definitions Used in Qualitative Content Analysis

graphic file with name ajpe7113-t1.jpg

There is more often disagreement on terminology in the methodological approaches to content analysis, though the most common differentiation is between the two types of content: manifest and latent. In much of the literature, manifest content analysis is defined as describing what is occurring on the surface, what is and literally present, and as “staying close to the text.” 8,13 Manifest content analysis is concerned with data that are easily observable both to researchers and the coders who assist in their analyses, without the need to discern intent or identify deeper meaning. It is content that can be recognized and counted with little training. Early applications of manifest analysis focused on identifying easily observable targets within text (eg, the number of instances a certain word appears in newspaper articles), film (eg, the occupation of a character), or interpersonal interactions (eg, tracking the number of times a participant blinks during an interview). 14 This application, in which frequency counts are used to understand a phenomenon, reflects a surface-level analysis and assumes there is objective truth in the data that can be revealed with very little interpretation. The number of times a target (ie, code) appears within the text is used as a way to understand its prevalence. Quantitative content analysis is always describing a positivist manifest content analysis, in that the nature of truth is believed to be objective, observable, and measurable. Qualitative research, which favors the researcher’s interpretation of an individual’s experience, may also be used to analyze manifest content. However, the intent of the application is to describe a dynamic reality that cannot be separated from the lived experiences of the researcher. Although qualitative content analysis can be conducted whether knowledge is thought to be innate, acquired, or socially constructed, the purpose of qualitative manifest content analysis is to transcend simple word counts and delve into a deeper examination of the language in order to organize large amounts of text into categories that reflect a shared meaning. 15,16 The practical distinction between quantitative and qualitative manifest content analysis is the intention behind the analysis. The quantitative method seeks to generate a numerical value to either cite prevalence or use in statistical analyses, while the qualitative method seeks to identify a construct or concept within the text using specific words or phrases for substantiation, or to provide a more organized structure to the text being described.

Latent content analysis is most often defined as interpreting what is hidden deep within the text. In this method, the role of the researcher is to discover the implied meaning in participants’ experiences. 8,13 For example, in a transcribed exchange in an office setting, a participant might say to a coworker, “Yeah, here we are…another Monday. So exciting!” The researcher would apply context in order to discover the emotion being conveyed (ie, the implied meaning). In this example, the comment could be interpreted as genuine, it could be interpreted as a sarcastic comment made in an attempt at humor in order to develop or sustain social bonds with the coworker, or the context might imply that the sarcasm was meant to convey displeasure and end the interaction.

Latent content analysis acknowledges that the researcher is intimately involved in the analytical process and that the their role is to actively use mental schema, theories, and lenses to interpret and understand the data. 10 Whereas manifest analyses are typically conducted in a way that the researcher is thought to maintain distance and separation from the objects of study, latent analyses underscore the importance of the researcher co-creating meaning with the text. 17 Adding nuance to this type of content, Potter and Levine‐Donnerstein argue that within latent content analysis, there are two distinct types: latent pattern and latent projective . 14 Latent pattern content analysis seeks to establish a pattern of characteristics in the text itself, while latent projective content analysis leverages the researcher’s own interpretations of the meaning of the text. While both approaches rely on codes that emerge from the content using the coder’s own perspectives and mental schema, the distinction between these two types of analyses are in their foci. 14 Though we do not agree, some researchers believe that all qualitative content analysis is latent content analysis. 11 These disagreements typically occur where there are differences in intent and where there are areas of overlap in the results. For example, both qualitative manifest and latent pattern content analyses may identify patterns as a result of their application. Though in their research design, the researcher would have approached the content with different methodological approaches, with a manifest approach seeking only to describe what is observed, and the latent pattern approach seeking to discover an unseen pattern. At this point, these distinctions may seem too philosophical to serve a practical purpose, so we will attempt to clarify these concepts by presenting three types of analyses for illustrative purposes, beginning with a description of how codes are created and used.

Creating and Using Codes

Codes are the currency of content analysis. Researchers use codes to organize and understand their data. Through the coding process, pharmacy educators can systematically and rigorously categorize and interpret vast amounts of text for use in their educational practice or in publication. Codes themselves are short, descriptive labels that symbolically assign a summative or salient attribute to more than one unit of meaning identified in the text. 18 To create codes, a researcher must first become immersed in the data, which typically occurs when a researcher transcribes recorded data or conducts several readings of the text. This process allows the researcher to become familiar with the scope of the data, which spurs nascent ideas about potential concepts or constructs that may exist within it. If studying a phenomenon that has already been described through an existing framework, codes can be created a priori using theoretical frameworks or concepts identified in the literature. If there is no existing framework to apply, codes can emerge during the analytical process. However, emergent codes can also be created as addenda to a priori codes that were identified before the analysis begins if the a priori codes do not sufficiently capture the researcher’s area of interest.

The process of detecting emergent codes begins with identification of units of meaning. While there is no one way to decide what qualifies as a meaning unit, researchers typically define units of meaning differently depending on what kind of analysis is being conducted. As a general rule, when dialogue is being analyzed, such as interviews or focus groups, meaning units are identified as conversational turns, though a code can be as short as one or two words. In written text, such as student reflections or course evaluation data, the researcher must decide if the text should be divided into phrases or sentences, or remain as paragraphs. This decision is usually made based on how many different units of meaning are expressed in a block of text. For example, in a paragraph, if there are several thoughts or concepts being expressed, it is best to break up the paragraph into sentences. If one sentence contains multiple ideas of interest, making it difficult to separate one important thought or behavior from another, then the sentence can be divided into smaller units, such as phrases or sentence fragments. These phrases or sentence fragments are then coded as separate meaning units. Conversely, longer or more complex units of meaning should be condensed into shorter representations that still retain the original meaning in order to reduce the cognitive burden of the analytical process. This could entail removing verbal ticks (eg, “well, uhm…”) from transcribed data or simplifying a compound sentence. Condensation does not ascribe interpretation or implied meaning to a unit, but only shortens a meaning unit as much as possible while preserving the original meaning identified. 18 After condensation, a researcher can proceed to the creation of codes.

Many researchers begin their analyses with several general codes in mind that help guide their focus as defined by their research question, even in instances where the researcher has no a priori model or theory. For example, if a group of instructors are interested in examining recorded videos of their lectures to identify moments of student engagement, they may begin with using generally agreed upon concepts of engagement as codes, such as students “raising their hands,” “taking notes,” and “speaking in class.” However, as the instructors continue to watch their videos, they may notice other behaviors which were not initially anticipated. Perhaps students were seen creating flow charts based on information presented in class. Alternatively, perhaps instructors wanted to include moments when students posed questions to their peers without being prompted. In this case, the instructors would allow the codes of “creating graphic organizers” and “questioning peers” to emerge as additional ways to identify the behavior of student engagement.

Once a researcher has identified condensed units of meaning and labeled them with codes, the codes are then sorted into categories which can help provide more structure to the data. In the above example of recorded lectures, perhaps the category of “verbal behaviors” could be used to group the codes of “speaking in class” and “questioning peers.” For complex analyses, subcategories can also be used to better organize a large amount of codes, but solely at the discretion of the researcher. Two or more categories of codes are then used to identify or support a broader underlying meaning which develops into themes. Themes are most often employed in latent analyses; however, they are appropriate in manifest analyses as well. Themes describe behaviors, experiences, or emotions that occur throughout several categories. 18 Figure 1 illustrates this process. Using the same videotaped lecture example, the instructors might identify two themes of student engagement, “active engagement” and “passive engagement,” where active engagement is supported by the category of “verbal behavior” and also a category that includes the code of “raising their hands” (perhaps something along the lines of “pursuing engagement”), and the theme of “passive engagement” is supported by a category used to organize the behaviors of “taking notes” and “creating graphic organizers.”

Figure 1.

The Process of Qualitative Content Analysis

To more fully demonstrate the process of content analysis and the generation and use of codes, categories, and themes, we present and describe examples of both manifest and latent content analysis. Given that there are multiple ways to create and use codes, our examples illustrate both processes of creating and using a predetermined set of codes. Regardless of the kind of content analysis instructors want to conduct, the initial steps are the same. The instructor must analyze the data using codes as a sense-making process.

Manifest Content Analysis

The first form of analysis, manifest content analysis, examines text for elements that exist on the surface of the text, the meaning of which is taken at face value. Schools and colleges of pharmacy may benefit from conducting manifest content analyses at a programmatic level, including analysis of student evaluations to determine the value of certain courses, or analysis of recruitment materials for addressing issues of cultural humility in a uniform manner. Such uses for manifest content analysis may help administrators make more data-based decisions about students and courses. However, for our example of manifest content analysis, we illustrate the use of content analysis in informing instruction for a single pharmacy educator ( Figure 2 ).

Figure 2.

A Student’s Completed Beta-blocker Case with Codes in Underlined Bold Text

In the example, a pharmacology instructor is trying to assess students’ understanding of three concepts related to the beta-blocker class of drugs: indication of the drug, relevance of family history, and contraindications and precautions. To do so, the instructor asks the students to write a patient case in which beta-blockers are indicated. The instructor gives the students the following prompt: “Reverse-engineer a case in which beta-blockers would be prescribed to the patient. Include a history of the present illness, the patients’ medical, family, and social history, medications, allergies, and relevant lab tests.” Figure 2 is a hypothetical student’s completed assignment, in which they demonstrate their understanding of when and why a beta-blocker would be prescribed.

The student-generated cases are then treated as data and analyzed for the presence of the three previously identified indicators of understanding in order to help the instructor make decisions about where and how to focus future teaching efforts related to this drug class. Codes are created a priori out of the instructor’s interest in analyzing students’ understanding of the concepts related to beta-blocker prescriptions. A codebook ( Table 2 ) is created with the following columns: name of code, code description, and examples of the code. This codebook helps an individual researcher to approach their analysis systematically, but it can also facilitate coding by multiple coders who would apply the same rules outlined in the codebook to the coding process.

Example Code Book Created for Manifest Content Analysis

graphic file with name ajpe7113-t2.jpg

Using multiple coders introduces complexity to the analysis process, but it is oftentimes the only practical way to analyze large amounts of data. To ensure that all coders are working in tandem, they must establish inter-rater reliability as part of their training process. This process requires that a single form of text be selected, such as one student evaluation. After reviewing the codebook and receiving instruction, everyone on the team individually codes the same piece of data. While calculating percentage agreement has sometimes been used to establish inter-rater reliability, most publication editors require more rigorous statistical analysis (eg, Krippendorf’s alpha, or Cohen’s kappa). 19 Detailed descriptions of these statistics fall outside the scope of this introduction, but it is important to note that the choice depends on the number of coders, the sample size, and the type of data to be analyzed.

Latent Content Analysis

Latent content analysis is another option for pharmacy educators, especially when there are theoretical frameworks or lenses the educator proposes to apply. Such frameworks describe and provide structure to complex concepts and may often be derived from relevant theories. Latent content analysis requires that the researcher is intimately involved in interpreting and finding meaning in the text because meaning is not readily apparent on the surface. 10 To illustrate a latent content analysis using a combination of a priori and emergent codes, we will use the example of a transcribed video excerpt from a student pharmacist interaction with a standardized patient. In this example, the goal is for first-year students to practice talking to a customer about an over-the-counter medication. The case is designed to simulate a customer at a pharmacy counter, who is seeking advice on a medication. The learning objectives for the pharmacist in-training are to assess the customer’s symptoms, determine if the customer can self-treat or if they need to seek out their primary care physician, and then prescribe a medication to alleviate the patient’s symptoms.

To begin, pharmacy educators conducting educational research should first identify what they are looking for in the video transcript. In this case, because the primary outcome for this exercise is aimed at assessing the “soft skills” of student pharmacists, codes are created using the counseling rubric created by Horton and colleagues. 20 Four a priori codes are developed using the literature: empathy, patient-friendly terms, politeness, and positive attitude. However, because the original four codes are inadequate to capture all areas representing the skills the instructor is looking for during the process of analysis, four additional codes are also created: active listening, confidence, follow-up, and patient at ease. Figure 3 presents the video transcript with each of the codes assigned to the meaning units in bolded parentheses.

Figure 3.

A Transcript of a Student’s (JR) Experience with a Standardized Patient (SP) in Which the Codes are Bolded in Parentheses

Following the initial coding using these eight codes, the codes are consolidated to create categories, which are depicted in the taxonomy in Figure 4 . Categories are relationships between codes that represent a higher level of abstraction in the data. 18 To reach conclusions and interpret the fundamental underlying meaning in the data, categories are then organized into themes ( Figure 1 ). Once the data are analyzed, the instructor can assign value to the student’s performance. In this case, the coding process determines that the exercise demonstrated both positive and negative elements of communication and professionalism. Under the category of professionalism, the student generally demonstrated politeness and a positive attitude toward the standardized patient, indicating to the reviewer that the theme of perceived professionalism was apparent during the encounter. However, there were several instances in which confidence and appropriate follow-up were absent. Thus, from a reviewer perspective, the student's performance could be perceived as indicating an opportunity to grow and improve as a future professional. Typically, there are multiple codes in a category and multiple categories in a theme. However, as seen in the example taxonomy, this is not always the case.

Figure 4.

Example of a Latent Content Analysis Taxonomy

If the educator is interested in conducting a latent projective analysis, after identifying the construct of “soft skills,” the researcher allows for each coder to apply their own mental schema as they look for positive and negative indicators of the non-technical skills they believe a student should develop. Mental schema are the cognitive structures that provide organization to knowledge, which in this case allows coders to categorize the data in ways that fit their existing understanding of the construct. The coders will use their own judgement to identify the codes they feel are relevant. The researcher could also choose to apply a theoretical lens to more effectively conceptualize the construct of “soft skills,” such as Rogers' humanism theory, and more specifically, concepts underlying his client-centered therapy. 21 The role of theory in both latent pattern and latent projective analyses is at the discretion of the researcher, and often is determined by what already exists in the literature related to the research question. Though, typically, in latent pattern analyses theory is used for deductive coding, and in latent projective analyses underdeveloped theory is used to first deduce codes and then for induction of the results to strengthen the theory applied. For our example, Rogers describes three salient qualities to develop and maintain a positive client-professional relationship: unconditional positive regard, genuineness, and empathetic understanding. 21 For the third element, specifically, the educator could look for units of meaning that imply empathy and active listening. For our video transcript analysis, this is evident when the student pharmacist demonstrated empathy by responding, "Yeah, I understand," when discussing aggravating factors for the patient's condition. The outcome for both latent pattern and latent projective content analysis is to discover the underlying meaning in a text, such as social rules or mental models. In this example, both pattern and projective approaches can discover interpreted aspects of a student’s abilities and mental models for constructs such as professionalism and empathy. The difference in the approaches is where the precedence lies: in the belief that a pattern is recognizable in the content, or in the mental schema and lived experiences of the coder(s). To better illustrate the differences in the processes of latent pattern and projective content analyses, Figure 5 presents a general outline of each method beginning with the creation of codes and concluding with the generation of themes.

Figure 5.

Flow Chart of the Stages of Latent Pattern and Latent Projective Content Analysis

How to Choose a Methodological Approach to Content Analysis

To determine which approach a researcher should take in their content analysis, two decisions need to be made. First, researchers must determine their goal for the analysis. Second, the researcher must decide where they believe meaning is located. 14 If meaning is located in the discrete elements of the content that are easily identified on the surface of the text, then manifest content analysis is appropriate. If meaning is located deep within the content and the researcher plans to discover context cues and make judgements about implied meaning, then latent content analysis should be applied. When designing the latent content analysis, a researcher then must also identify their focus. If the analysis is intended to identify a recognizable truth within the content by uncovering connections and characteristics that all coders should be able to discover, then latent pattern content analysis is appropriate. If, on the other hand, the researcher will rely heavily on the judgment of the coders and believes that interpretation of the content must leverage the mental schema of the coders to locate deeper meaning, then latent projective content analysis is the best choice.

To demonstrate how a researcher might choose a methodological approach, we have presented a third example of data in Figure 6 . In our two previous examples of content analysis, we used student data. However, faculty data can also be analyzed as part of educational research or for faculty members to improve their own teaching practices. Recall in the video data analyzed using latent content analysis, the student was tasked to identify a suitable over-the-counter medication for a patient complaining of heartburn symptoms. We have extended this example by including an interview with the pharmacy educator supervising the student who was videotaped. The goal of the interview is to evaluate the educator’s ability to assess the student’s performance with the standardized patient. Figure 6 is an excerpt of the interview between the course instructor and an instructional coach. In this conversation, the instructional coach is eliciting evidence to support the faculty member’s views, judgements, and rationale for the educator’s evaluation of the student’s performance.

Figure 6.

A Transcript of an Interview in Which the Interviewer (IN) Questions a Faculty Member (FM) Regarding Their Student’s Standardized Patient Experience

Manifest content analysis would be a valid choice for this data if the researcher was looking to identify evidence of the construct of “instructor priorities” and defined discrete codes that described aspects of performance such as “communication,” “referrals,” or “accurate information.” These codes could be easily identified on the surface of the transcribed interview by identifying keywords related to each code, such as “communicate,” “talk,” and “laugh,” for the code of “communication.” This would allow coders to identify evidence of the concept of “instructor priorities” by sorting through a potentially large amount of text with predetermined targets in mind.

To conduct a latent pattern analysis of this interview, researchers would first immerse themselves in the data to identify a theoretical framework or concepts that represent the area of interest so that coders could discover an emerging truth underneath the surface of the data. After immersion in the data, a researcher might believe it would be interesting to more closely examine the strategies the coach uses to establish rapport with the instructor as a way to better understand models of professional development. These strategies could not be easily identified in the transcripts if read literally, but by looking for connections within the text, codes related to instructional coaching tactics emerge. A latent pattern analysis would require that the researcher code the data in a way that looks for patterns, such as a code of “facilitating reflection,” that could be identified in open-ended questions and other units of meaning where the coder saw evidence of probing techniques, or a code of “establishing rapport” for which a coder could identify nonverbal cues such as “[IN leans forward in chair].”

Conducting latent projective content analysis might be useful if the researcher was interested in using a broader theoretical lens, such as Mezirow’s theory of transformative learning. 22 In this example, the faculty member is understood to have attempted to change a learner’s frame of reference by facilitating cognitive dissonance or a disorienting experience through a standardized patient simulation. To conduct a latent projective analysis, the researcher could analyze the faculty member’s interview using concepts found in this theory. This kind of analysis will help the researcher assess the level of change that the faculty member was able to perceive, or expected to witness, in their attempt to help their pharmacy students improve their interactions with patients. The units of meaning and subsequent codes would rely on the coders to apply their own knowledge of transformative learning because of the absence in the theory of concrete, context-specific behaviors to identify. For this analysis, the researcher would rely on their interpretations of what challenging educational situations look like, what constitutes cognitive dissonance, or what the faculty member is really expecting from his students’ performance. The subsequent analysis could provide evidence to support the use of such standardized patient encounters within the curriculum as a transformative learning experience and would also allow the educator to self-reflect on his ability to assess simulated activities.

OTHER ASPECTS TO CONSIDER

Navigating terminology.

Among the methodological approaches, there are other terms for content analysis that researchers may come across. Hsieh and Shannon 10 proposed three qualitative approaches to content analysis: conventional, directed, and summative. These categories were intended to explain the role of theory in the analysis process. In conventional content analysis, the researcher does not use preconceived categories because existing theory or literature are limited. In directed content analysis, the researcher attempts to further describe a phenomenon already addressed by theory, applying a deductive approach and using identified concepts or codes from exiting research to validate the theory. In summative content analysis, a descriptive approach is taken, identifying and quantifying words or content in order to describe their context. These three categories roughly map to the terms of latent projective, latent pattern, and manifest content analyses respectively, though not precisely enough to suggest that they are synonyms.

Graneheim and colleagues 9 reference the inductive, deductive, and abductive methods of interpretation of content analysis, which are data-driven, concept-driven, and fluid between both data and concepts, respectively. Where manifest content produces phenomenological descriptions most often (but not always) through deductive interpretation, and latent content analysis produces interpretations most often (but not always) through inductive or abductive interpretations. Erlingsson and Brysiewicz 23 refer to content analysis as a continuum, progressing as the researcher develops codes, then categories, and then themes. We present these alternative conceptualizations of content analysis to illustrate that the literature on content analysis, while incredibly useful, presents a multitude of interpretations of the method itself. However, these complexities should not dissuade readers from using content analysis. Identifying what you want to know (ie, your research question) will effectively direct you toward your methodological approach. That said, we have found the most helpful aid in learning content analysis is the application of the methods we have presented.

Ensuring Quality

The standards used to evaluate quantitative research are seldom used in qualitative research. The terms “reliability” and “validity” are typically not used because they reflect the positivist quantitative paradigm. In qualitative research, the preferred term is “trustworthiness,” which is comprised of the concepts of credibility, transferability, dependability, and confirmability, and researchers can take steps in their work to demonstrate that they are trustworthy. 24 Though establishing trustworthiness is outside the scope of this article, novice researchers should be familiar with the necessary steps before publishing their work. This suggestion includes exploration of the concept of saturation, the idea that researchers must demonstrate they have collected and analyzed enough data to warrant their conclusions, which has been a focus of recent debate in qualitative research. 25

There are several threats to the trustworthiness of content analysis in particular. 14 We will use the terms “reliability and validity” to describe these threats, as they are conceptualized this way in the formative literature, and it may be easier for researchers with a quantitative research background to recognize them. Though some of these threats may be particular to the type of data being analyzed, in general, there are risks specific to the different methods of content analysis. In manifest content analysis, reliability is necessary but not sufficient to establish validity. 14 Because there is little judgment required of the coders, lack of high inter-rater agreement among coders will render the data invalid. 14 Additionally, coder fatigue is a common threat to manifest content analysis because the coding is clerical and repetitive in nature.

For latent pattern content analysis, validity and reliability are inversely related. 14 Greater reliability is achieved through more detailed coding rules to improve consistency, but these rules may diminish the accessibility of the coding to consumers of the research. This is defined as low ecological validity. Higher ecological validity is achieved through greater reliance on coder judgment to increase the resonance of the results with the audience, yet this often decreases the inter-rater reliability. In latent projective content analysis, reliability and validity are equivalent. 14 Consistent interpretations among coders both establishes and validates the constructed norm; construction of an accurate norm is evidence of consistency. However, because of this equivalence, issues with low validity or low reliability cannot be isolated. A lack of consistency may result from coding rules, lack of a shared schema, or issues with a defined variable. Reasons for low validity cannot be isolated, but will always result in low consistency.

Any good analysis starts with a codebook and coder training. It is important for all coders to share the mental model of the skill, construct, or phenomenon being coded in the data. However, when conducting latent pattern or projective content analysis in particular, micro-level rules and definitions of codes increase the threat of ecological validity, so it is important to leave enough room in the codebook and during the training to allow for a shared mental schema to emerge in the larger group rather than being strictly directed by the lead researcher. Stability is another threat, which occurs when coders make different judgments as time passes. To reduce this risk, allowing for recoding at a later date can increase the consistency and stability of the codes. Reproducibility is not typically a goal of qualitative research, 15 but for content analysis, codes that are defined both prior to and during analysis should retain their meaning. Researchers can increase the reproducibility of their codebook by creating a detailed audit trail, including descriptions of the methods used to create and define the codes, materials used for the training of the coders, and steps taken to ensure inter-rater reliability.

In all forms of qualitative analysis, coder fatigue is a common threat to trustworthiness, even when the instructor is coding individually. Over time, the cases may start to look the same, making it difficult to refocus and look at each case with fresh eyes. To guard against this, coders should maintain a reflective journal and write analytical memos to help stay focused. Memos might include insights that the researcher has, such as patterns of misunderstanding, areas to focus on when considering re-teaching specific concepts, or specific conversations to have with students. Fatigue can also be mitigated by occasionally talking to participants (eg, meeting with students and listening for their rationale on why they included specific pieces of information in an assignment). These are just examples of potential exercises that can help coders mitigate cognitive fatigue. Most researchers develop their own ways to prevent the fatigue that can seep in after long hours of looking at data. But above all, a sufficient amount of time should be allowed for analysis, so that coders do not feel rushed, and regular breaks should be scheduled and enforced.

Qualitative content analysis is both accessible and high-yield for pharmacy educators and researchers. Though some of the methods may seem abstract or fluid, the nature of qualitative content analysis encompasses these concerns by providing a systematic approach to discover meaning in textual data, both on the surface and implied beneath it. As with most research methods, the surest path towards proficiency is through application and intentional, repeated practice. We encourage pharmacy educators to ask questions suited for qualitative research and to consider the use of content analysis as a qualitative research method for discovering meaning in their data.

  • 1. Kolbe RH, Burnett MS. Content-analysis research: an examination of applications with directives for improving research reliability and objectivity. J Consumer R. 1991;18(2):243-250. [ Google Scholar ]
  • 2. Bird E, Anderson HM, Anaya G, Moore DL. Beginning an assessment project: a case study using data audit and content analysis. Am J Pharm Educ. 2005;69(3):53. [ Google Scholar ]
  • 3. Danielson J, Craddick K, Eccles D, Kwasnik A, O’Sullivan TA. A qualitative analysis of common concerns about challenges facing pharmacy experiential education programs. Am J Pharm Educ. 2015;79(1):6. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 4. Fjortoft N. Students' motivations for class attendance. Am J Pharm Educ. 2005;69(1):15. [ Google Scholar ]
  • 5. Margolis AR, Porter AL, Pitterle ME. Best practices for use of blended learning. Am J Pharm Educ. 2017;81(3):49. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 6. Moser L, Berlie H, Salinitri F, McCuistion M, Slaughter R. Enhancing academic success by creating a community of learners. Am J Pharm Educ. 2015;79(5):70. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 7. Wallman A, Sporrong SK, Gustavsson M, Lindblad ÅK, Johansson M, Ring L. Swedish students' and preceptors' perceptions of what students learn in a six-month advanced pharmacy practice experience. Am J Pharm Educ. 2011;75(10):197. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 8. Kondracki NL, Wellman NS, Amundson DR. Content analysis: review of methods and their applications in nutrition education. J Nutr Educ Behav. 2002;34(4):224-230. [ DOI ] [ PubMed ] [ Google Scholar ]
  • 9. Graneheim UH, Lindgren BM, Lundman B. Methodological challenges in qualitative content analysis: a discussion paper. Nurse Educ Today. 2017;56:29-34. [ DOI ] [ PubMed ] [ Google Scholar ]
  • 10. Hsieh HF, Shannon SE. Three approaches to qualitative content analysis. Qual Health Res. 2005;15(9):1277-1288. [ DOI ] [ PubMed ] [ Google Scholar ]
  • 11. Julien H. Content analysis. In: Given LM, ed. The SAGE Encyclopedia of Qualitative Research Methods. Vol.2 Thousand Oaks, CA: SAGE;2008:121-123. [ Google Scholar ]
  • 12. Krippendorff K. Content Analysis: An Introduction to Its Methodology. Thousand Oaks, CA: SAGE Publications; 2012. [ Google Scholar ]
  • 13. Graneheim UH, Lundman B. Qualitative content analysis in nursing research: concepts, procedures, and measures to achieve trustworthiness. Nurse Education Today. 2004;24(2):105-112. [ DOI ] [ PubMed ] [ Google Scholar ]
  • 14. Potter WJ, Levine‐Donnerstein D. Rethinking validity and reliability in content analysis. J Applied Comm Res . 1999;27(3):258-284. [ Google Scholar ]
  • 15. Lincoln YS, Guba EG. Naturalistic Inquiry. Beverly Hills, CA: SAGE;1985. [ Google Scholar ]
  • 16. Weber RP. Basic Content Analysis. Beverly Hills, CA: SAGE;1990. [ Google Scholar ]
  • 17. Mishler EG. The analysis of interview-narratives. In Sarbin TR, ed. Narrative Psychology: The Storied Nature of Human Conduct. Westport, CT: Praeger;1986:233-255. [ Google Scholar ]
  • 18. Saldana J. The Coding Manual for Qualitative Researchers . Thousand Oaks, CA: SAGE; 2009. [ Google Scholar ]
  • 19. Krippendorff K. Reliability in content analysis. Human Communication Research. 2004;30(3):411-433. [ Google Scholar ]
  • 20. Horton N, Payne KD, Jernigan M, et al. . A standardized patient counseling rubric for a pharmaceutical care and communications course. Am J Pharm Educ . 2013;77(7):1-9. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 21. Rogers CR. Client-Centered Therapy. Washington, DC: American Psychological Association;1966. [ Google Scholar ]
  • 22. Mezirow J. Transformative Dimensions of Adult Learning. San Francisco, CA: Jossey-Bass;1991. [ Google Scholar ]
  • 23. Erlingsson C, Brysiewicz P. A hands-on guide to doing content analysis. African J Emergency Med. 2017;7(3):93-99. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • 24. Guba EG, Lincoln YS. Effective Evaluation: Improving the Usefulness of Evaluation Results Through Responsive and Naturalistic Approaches. San Francisco, CA: Jossey-Bass ;1981. [ Google Scholar ]
  • 25. Saunders B, Sim J, Kingstone T, et al. . Saturation in qualitative research: exploring its conceptualization and operationalization. Quality & Quantity. 2018;52(4):1893-1907. [ DOI ] [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • View on publisher site
  • PDF (1.4 MB)
  • Collections

Similar articles

Cited by other articles, links to ncbi databases.

  • Download .nbib .nbib
  • Format: AMA APA MLA NLM

Add to Collections

  • Search Menu

Sign in through your institution

  • Browse content in Arts and Humanities
  • Browse content in Archaeology
  • Anglo-Saxon and Medieval Archaeology
  • Archaeological Methodology and Techniques
  • Archaeology by Region
  • Archaeology of Religion
  • Archaeology of Trade and Exchange
  • Biblical Archaeology
  • Contemporary and Public Archaeology
  • Environmental Archaeology
  • Historical Archaeology
  • History and Theory of Archaeology
  • Industrial Archaeology
  • Landscape Archaeology
  • Mortuary Archaeology
  • Prehistoric Archaeology
  • Underwater Archaeology
  • Urban Archaeology
  • Zooarchaeology
  • Browse content in Architecture
  • Architectural Structure and Design
  • History of Architecture
  • Residential and Domestic Buildings
  • Theory of Architecture
  • Browse content in Art
  • Art Subjects and Themes
  • History of Art
  • Industrial and Commercial Art
  • Theory of Art
  • Biographical Studies
  • Byzantine Studies
  • Browse content in Classical Studies
  • Classical History
  • Classical Philosophy
  • Classical Mythology
  • Classical Numismatics
  • Classical Literature
  • Classical Reception
  • Classical Art and Architecture
  • Classical Oratory and Rhetoric
  • Greek and Roman Epigraphy
  • Greek and Roman Law
  • Greek and Roman Papyrology
  • Greek and Roman Archaeology
  • Late Antiquity
  • Religion in the Ancient World
  • Social History
  • Digital Humanities
  • Browse content in History
  • Colonialism and Imperialism
  • Diplomatic History
  • Environmental History
  • Genealogy, Heraldry, Names, and Honours
  • Genocide and Ethnic Cleansing
  • Historical Geography
  • History by Period
  • History of Emotions
  • History of Agriculture
  • History of Education
  • History of Gender and Sexuality
  • Industrial History
  • Intellectual History
  • International History
  • Labour History
  • Legal and Constitutional History
  • Local and Family History
  • Maritime History
  • Military History
  • National Liberation and Post-Colonialism
  • Oral History
  • Political History
  • Public History
  • Regional and National History
  • Revolutions and Rebellions
  • Slavery and Abolition of Slavery
  • Social and Cultural History
  • Theory, Methods, and Historiography
  • Urban History
  • World History
  • Browse content in Language Teaching and Learning
  • Language Learning (Specific Skills)
  • Language Teaching Theory and Methods
  • Browse content in Linguistics
  • Applied Linguistics
  • Cognitive Linguistics
  • Computational Linguistics
  • Forensic Linguistics
  • Grammar, Syntax and Morphology
  • Historical and Diachronic Linguistics
  • History of English
  • Language Acquisition
  • Language Evolution
  • Language Reference
  • Language Variation
  • Language Families
  • Lexicography
  • Linguistic Anthropology
  • Linguistic Theories
  • Linguistic Typology
  • Phonetics and Phonology
  • Psycholinguistics
  • Sociolinguistics
  • Translation and Interpretation
  • Writing Systems
  • Browse content in Literature
  • Bibliography
  • Children's Literature Studies
  • Literary Studies (Asian)
  • Literary Studies (European)
  • Literary Studies (Eco-criticism)
  • Literary Studies (Romanticism)
  • Literary Studies (American)
  • Literary Studies (Modernism)
  • Literary Studies - World
  • Literary Studies (1500 to 1800)
  • Literary Studies (19th Century)
  • Literary Studies (20th Century onwards)
  • Literary Studies (African American Literature)
  • Literary Studies (British and Irish)
  • Literary Studies (Early and Medieval)
  • Literary Studies (Fiction, Novelists, and Prose Writers)
  • Literary Studies (Gender Studies)
  • Literary Studies (Graphic Novels)
  • Literary Studies (History of the Book)
  • Literary Studies (Plays and Playwrights)
  • Literary Studies (Poetry and Poets)
  • Literary Studies (Postcolonial Literature)
  • Literary Studies (Queer Studies)
  • Literary Studies (Science Fiction)
  • Literary Studies (Travel Literature)
  • Literary Studies (War Literature)
  • Literary Studies (Women's Writing)
  • Literary Theory and Cultural Studies
  • Mythology and Folklore
  • Shakespeare Studies and Criticism
  • Browse content in Media Studies
  • Browse content in Music
  • Applied Music
  • Dance and Music
  • Ethics in Music
  • Ethnomusicology
  • Gender and Sexuality in Music
  • Medicine and Music
  • Music Cultures
  • Music and Religion
  • Music and Media
  • Music and Culture
  • Music Education and Pedagogy
  • Music Theory and Analysis
  • Musical Scores, Lyrics, and Libretti
  • Musical Structures, Styles, and Techniques
  • Musicology and Music History
  • Performance Practice and Studies
  • Race and Ethnicity in Music
  • Sound Studies
  • Browse content in Performing Arts
  • Browse content in Philosophy
  • Aesthetics and Philosophy of Art
  • Epistemology
  • Feminist Philosophy
  • History of Western Philosophy
  • Meta-Philosophy
  • Metaphysics
  • Moral Philosophy
  • Non-Western Philosophy
  • Philosophy of Science
  • Philosophy of Language
  • Philosophy of Mind
  • Philosophy of Perception
  • Philosophy of Action
  • Philosophy of Law
  • Philosophy of Religion
  • Philosophy of Mathematics and Logic
  • Practical Ethics
  • Social and Political Philosophy
  • Browse content in Religion
  • Biblical Studies
  • Christianity
  • East Asian Religions
  • History of Religion
  • Judaism and Jewish Studies
  • Qumran Studies
  • Religion and Education
  • Religion and Health
  • Religion and Politics
  • Religion and Science
  • Religion and Law
  • Religion and Art, Literature, and Music
  • Religious Studies
  • Browse content in Society and Culture
  • Cookery, Food, and Drink
  • Cultural Studies
  • Customs and Traditions
  • Ethical Issues and Debates
  • Hobbies, Games, Arts and Crafts
  • Natural world, Country Life, and Pets
  • Popular Beliefs and Controversial Knowledge
  • Sports and Outdoor Recreation
  • Technology and Society
  • Travel and Holiday
  • Visual Culture
  • Browse content in Law
  • Arbitration
  • Browse content in Company and Commercial Law
  • Commercial Law
  • Company Law
  • Browse content in Comparative Law
  • Systems of Law
  • Competition Law
  • Browse content in Constitutional and Administrative Law
  • Government Powers
  • Judicial Review
  • Local Government Law
  • Military and Defence Law
  • Parliamentary and Legislative Practice
  • Construction Law
  • Contract Law
  • Browse content in Criminal Law
  • Criminal Procedure
  • Criminal Evidence Law
  • Sentencing and Punishment
  • Employment and Labour Law
  • Environment and Energy Law
  • Browse content in Financial Law
  • Banking Law
  • Insolvency Law
  • History of Law
  • Human Rights and Immigration
  • Intellectual Property Law
  • Browse content in International Law
  • Private International Law and Conflict of Laws
  • Public International Law
  • IT and Communications Law
  • Jurisprudence and Philosophy of Law
  • Law and Politics
  • Law and Society
  • Browse content in Legal System and Practice
  • Courts and Procedure
  • Legal Skills and Practice
  • Legal System - Costs and Funding
  • Primary Sources of Law
  • Regulation of Legal Profession
  • Medical and Healthcare Law
  • Browse content in Policing
  • Criminal Investigation and Detection
  • Police and Security Services
  • Police Procedure and Law
  • Police Regional Planning
  • Browse content in Property Law
  • Personal Property Law
  • Restitution
  • Study and Revision
  • Terrorism and National Security Law
  • Browse content in Trusts Law
  • Wills and Probate or Succession
  • Browse content in Medicine and Health
  • Browse content in Allied Health Professions
  • Arts Therapies
  • Clinical Science
  • Dietetics and Nutrition
  • Occupational Therapy
  • Operating Department Practice
  • Physiotherapy
  • Radiography
  • Speech and Language Therapy
  • Browse content in Anaesthetics
  • General Anaesthesia
  • Browse content in Clinical Medicine
  • Acute Medicine
  • Cardiovascular Medicine
  • Clinical Genetics
  • Clinical Pharmacology and Therapeutics
  • Dermatology
  • Endocrinology and Diabetes
  • Gastroenterology
  • Genito-urinary Medicine
  • Geriatric Medicine
  • Infectious Diseases
  • Medical Toxicology
  • Medical Oncology
  • Pain Medicine
  • Palliative Medicine
  • Rehabilitation Medicine
  • Respiratory Medicine and Pulmonology
  • Rheumatology
  • Sleep Medicine
  • Sports and Exercise Medicine
  • Clinical Neuroscience
  • Community Medical Services
  • Critical Care
  • Emergency Medicine
  • Forensic Medicine
  • Haematology
  • History of Medicine
  • Browse content in Medical Dentistry
  • Oral and Maxillofacial Surgery
  • Paediatric Dentistry
  • Restorative Dentistry and Orthodontics
  • Surgical Dentistry
  • Browse content in Medical Skills
  • Clinical Skills
  • Communication Skills
  • Nursing Skills
  • Surgical Skills
  • Medical Ethics
  • Medical Statistics and Methodology
  • Browse content in Neurology
  • Clinical Neurophysiology
  • Neuropathology
  • Nursing Studies
  • Browse content in Obstetrics and Gynaecology
  • Gynaecology
  • Occupational Medicine
  • Ophthalmology
  • Otolaryngology (ENT)
  • Browse content in Paediatrics
  • Neonatology
  • Browse content in Pathology
  • Chemical Pathology
  • Clinical Cytogenetics and Molecular Genetics
  • Histopathology
  • Medical Microbiology and Virology
  • Patient Education and Information
  • Browse content in Pharmacology
  • Psychopharmacology
  • Browse content in Popular Health
  • Caring for Others
  • Complementary and Alternative Medicine
  • Self-help and Personal Development
  • Browse content in Preclinical Medicine
  • Cell Biology
  • Molecular Biology and Genetics
  • Reproduction, Growth and Development
  • Primary Care
  • Professional Development in Medicine
  • Browse content in Psychiatry
  • Addiction Medicine
  • Child and Adolescent Psychiatry
  • Forensic Psychiatry
  • Learning Disabilities
  • Old Age Psychiatry
  • Psychotherapy
  • Browse content in Public Health and Epidemiology
  • Epidemiology
  • Public Health
  • Browse content in Radiology
  • Clinical Radiology
  • Interventional Radiology
  • Nuclear Medicine
  • Radiation Oncology
  • Reproductive Medicine
  • Browse content in Surgery
  • Cardiothoracic Surgery
  • Gastro-intestinal and Colorectal Surgery
  • General Surgery
  • Neurosurgery
  • Paediatric Surgery
  • Peri-operative Care
  • Plastic and Reconstructive Surgery
  • Surgical Oncology
  • Transplant Surgery
  • Trauma and Orthopaedic Surgery
  • Vascular Surgery
  • Browse content in Science and Mathematics
  • Browse content in Biological Sciences
  • Aquatic Biology
  • Biochemistry
  • Bioinformatics and Computational Biology
  • Developmental Biology
  • Ecology and Conservation
  • Evolutionary Biology
  • Genetics and Genomics
  • Microbiology
  • Molecular and Cell Biology
  • Natural History
  • Plant Sciences and Forestry
  • Research Methods in Life Sciences
  • Structural Biology
  • Systems Biology
  • Zoology and Animal Sciences
  • Browse content in Chemistry
  • Analytical Chemistry
  • Computational Chemistry
  • Crystallography
  • Environmental Chemistry
  • Industrial Chemistry
  • Inorganic Chemistry
  • Materials Chemistry
  • Medicinal Chemistry
  • Mineralogy and Gems
  • Organic Chemistry
  • Physical Chemistry
  • Polymer Chemistry
  • Study and Communication Skills in Chemistry
  • Theoretical Chemistry
  • Browse content in Computer Science
  • Artificial Intelligence
  • Computer Architecture and Logic Design
  • Game Studies
  • Human-Computer Interaction
  • Mathematical Theory of Computation
  • Programming Languages
  • Software Engineering
  • Systems Analysis and Design
  • Virtual Reality
  • Browse content in Computing
  • Business Applications
  • Computer Security
  • Computer Games
  • Computer Networking and Communications
  • Digital Lifestyle
  • Graphical and Digital Media Applications
  • Operating Systems
  • Browse content in Earth Sciences and Geography
  • Atmospheric Sciences
  • Environmental Geography
  • Geology and the Lithosphere
  • Maps and Map-making
  • Meteorology and Climatology
  • Oceanography and Hydrology
  • Palaeontology
  • Physical Geography and Topography
  • Regional Geography
  • Soil Science
  • Urban Geography
  • Browse content in Engineering and Technology
  • Agriculture and Farming
  • Biological Engineering
  • Civil Engineering, Surveying, and Building
  • Electronics and Communications Engineering
  • Energy Technology
  • Engineering (General)
  • Environmental Science, Engineering, and Technology
  • History of Engineering and Technology
  • Mechanical Engineering and Materials
  • Technology of Industrial Chemistry
  • Transport Technology and Trades
  • Browse content in Environmental Science
  • Applied Ecology (Environmental Science)
  • Conservation of the Environment (Environmental Science)
  • Environmental Sustainability
  • Environmentalist Thought and Ideology (Environmental Science)
  • Management of Land and Natural Resources (Environmental Science)
  • Natural Disasters (Environmental Science)
  • Nuclear Issues (Environmental Science)
  • Pollution and Threats to the Environment (Environmental Science)
  • Social Impact of Environmental Issues (Environmental Science)
  • History of Science and Technology
  • Browse content in Materials Science
  • Ceramics and Glasses
  • Composite Materials
  • Metals, Alloying, and Corrosion
  • Nanotechnology
  • Browse content in Mathematics
  • Applied Mathematics
  • Biomathematics and Statistics
  • History of Mathematics
  • Mathematical Education
  • Mathematical Finance
  • Mathematical Analysis
  • Numerical and Computational Mathematics
  • Probability and Statistics
  • Pure Mathematics
  • Browse content in Neuroscience
  • Cognition and Behavioural Neuroscience
  • Development of the Nervous System
  • Disorders of the Nervous System
  • History of Neuroscience
  • Invertebrate Neurobiology
  • Molecular and Cellular Systems
  • Neuroendocrinology and Autonomic Nervous System
  • Neuroscientific Techniques
  • Sensory and Motor Systems
  • Browse content in Physics
  • Astronomy and Astrophysics
  • Atomic, Molecular, and Optical Physics
  • Biological and Medical Physics
  • Classical Mechanics
  • Computational Physics
  • Condensed Matter Physics
  • Electromagnetism, Optics, and Acoustics
  • History of Physics
  • Mathematical and Statistical Physics
  • Measurement Science
  • Nuclear Physics
  • Particles and Fields
  • Plasma Physics
  • Quantum Physics
  • Relativity and Gravitation
  • Semiconductor and Mesoscopic Physics
  • Browse content in Psychology
  • Affective Sciences
  • Clinical Psychology
  • Cognitive Psychology
  • Cognitive Neuroscience
  • Criminal and Forensic Psychology
  • Developmental Psychology
  • Educational Psychology
  • Evolutionary Psychology
  • Health Psychology
  • History and Systems in Psychology
  • Music Psychology
  • Neuropsychology
  • Organizational Psychology
  • Psychological Assessment and Testing
  • Psychology of Human-Technology Interaction
  • Psychology Professional Development and Training
  • Research Methods in Psychology
  • Social Psychology
  • Browse content in Social Sciences
  • Browse content in Anthropology
  • Anthropology of Religion
  • Human Evolution
  • Medical Anthropology
  • Physical Anthropology
  • Regional Anthropology
  • Social and Cultural Anthropology
  • Theory and Practice of Anthropology
  • Browse content in Business and Management
  • Business Strategy
  • Business Ethics
  • Business History
  • Business and Government
  • Business and Technology
  • Business and the Environment
  • Comparative Management
  • Corporate Governance
  • Corporate Social Responsibility
  • Entrepreneurship
  • Health Management
  • Human Resource Management
  • Industrial and Employment Relations
  • Industry Studies
  • Information and Communication Technologies
  • International Business
  • Knowledge Management
  • Management and Management Techniques
  • Operations Management
  • Organizational Theory and Behaviour
  • Pensions and Pension Management
  • Public and Nonprofit Management
  • Social Issues in Business and Management
  • Strategic Management
  • Supply Chain Management
  • Browse content in Criminology and Criminal Justice
  • Criminal Justice
  • Criminology
  • Forms of Crime
  • International and Comparative Criminology
  • Youth Violence and Juvenile Justice
  • Development Studies
  • Browse content in Economics
  • Agricultural, Environmental, and Natural Resource Economics
  • Asian Economics
  • Behavioural Finance
  • Behavioural Economics and Neuroeconomics
  • Econometrics and Mathematical Economics
  • Economic Systems
  • Economic History
  • Economic Methodology
  • Economic Development and Growth
  • Financial Markets
  • Financial Institutions and Services
  • General Economics and Teaching
  • Health, Education, and Welfare
  • History of Economic Thought
  • International Economics
  • Labour and Demographic Economics
  • Law and Economics
  • Macroeconomics and Monetary Economics
  • Microeconomics
  • Public Economics
  • Urban, Rural, and Regional Economics
  • Welfare Economics
  • Browse content in Education
  • Adult Education and Continuous Learning
  • Care and Counselling of Students
  • Early Childhood and Elementary Education
  • Educational Equipment and Technology
  • Educational Research Methodology
  • Educational Strategies and Policy
  • Higher and Further Education
  • Organization and Management of Education
  • Philosophy and Theory of Education
  • Schools Studies
  • Secondary Education
  • Teaching of a Specific Subject
  • Teaching of Specific Groups and Special Educational Needs
  • Teaching Skills and Techniques
  • Browse content in Environment
  • Applied Ecology (Social Science)
  • Climate Change
  • Conservation of the Environment (Social Science)
  • Environmentalist Thought and Ideology (Social Science)
  • Management of Land and Natural Resources (Social Science)
  • Natural Disasters (Environment)
  • Pollution and Threats to the Environment (Social Science)
  • Social Impact of Environmental Issues (Social Science)
  • Sustainability
  • Browse content in Human Geography
  • Cultural Geography
  • Economic Geography
  • Political Geography
  • Browse content in Interdisciplinary Studies
  • Communication Studies
  • Museums, Libraries, and Information Sciences
  • Browse content in Politics
  • African Politics
  • Asian Politics
  • Chinese Politics
  • Comparative Politics
  • Conflict Politics
  • Elections and Electoral Studies
  • Environmental Politics
  • Ethnic Politics
  • European Union
  • Foreign Policy
  • Gender and Politics
  • Human Rights and Politics
  • Indian Politics
  • International Relations
  • International Organization (Politics)
  • Irish Politics
  • Latin American Politics
  • Middle Eastern Politics
  • Political Methodology
  • Political Communication
  • Political Philosophy
  • Political Sociology
  • Political Behaviour
  • Political Economy
  • Political Institutions
  • Political Theory
  • Politics and Law
  • Politics of Development
  • Public Administration
  • Public Policy
  • Qualitative Political Methodology
  • Quantitative Political Methodology
  • Regional Political Studies
  • Russian Politics
  • Security Studies
  • State and Local Government
  • UK Politics
  • US Politics
  • Browse content in Regional and Area Studies
  • African Studies
  • Asian Studies
  • East Asian Studies
  • Japanese Studies
  • Latin American Studies
  • Middle Eastern Studies
  • Native American Studies
  • Scottish Studies
  • Browse content in Research and Information
  • Research Methods
  • Browse content in Social Work
  • Addictions and Substance Misuse
  • Adoption and Fostering
  • Care of the Elderly
  • Child and Adolescent Social Work
  • Couple and Family Social Work
  • Direct Practice and Clinical Social Work
  • Emergency Services
  • Human Behaviour and the Social Environment
  • International and Global Issues in Social Work
  • Mental and Behavioural Health
  • Social Justice and Human Rights
  • Social Policy and Advocacy
  • Social Work and Crime and Justice
  • Social Work Macro Practice
  • Social Work Practice Settings
  • Social Work Research and Evidence-based Practice
  • Welfare and Benefit Systems
  • Browse content in Sociology
  • Childhood Studies
  • Community Development
  • Comparative and Historical Sociology
  • Disability Studies
  • Economic Sociology
  • Gender and Sexuality
  • Gerontology and Ageing
  • Health, Illness, and Medicine
  • Marriage and the Family
  • Migration Studies
  • Occupations, Professions, and Work
  • Organizations
  • Population and Demography
  • Race and Ethnicity
  • Social Theory
  • Social Movements and Social Change
  • Social Research and Statistics
  • Social Stratification, Inequality, and Mobility
  • Sociology of Religion
  • Sociology of Education
  • Sport and Leisure
  • Urban and Rural Studies
  • Browse content in Warfare and Defence
  • Defence Strategy, Planning, and Research
  • Land Forces and Warfare
  • Military Administration
  • Military Life and Institutions
  • Naval Forces and Warfare
  • Other Warfare and Defence Issues
  • Peace Studies and Conflict Resolution
  • Weapons and Equipment

Content Analysis

  • < Previous chapter
  • Next chapter >

4 Qualitative Content Analysis

  • Published: November 2015
  • Cite Icon Cite
  • Permissions Icon Permissions

This chapter examines qualitative content analysis, a recent methodological innovation. Qualitative content analysis is defined and distinguished here from basic and interpretive approaches to content analysis. Qualitative content analysis is also distinguished from other qualitative research methods, though features and techniques overlap with other qualitative methods. Key differences in the predominant use of newly collected data and use of non-quantitative analysis techniques are detailed. Differences in epistemology and the role of researcher self-awareness and reflexivity are also discussed. Methods of graphic data presentation are illustrated. Three short exemplar studies using qualitative content analysis are described and examined. Qualitative content analysis is explored in detail in terms of its characteristic components: (1) the research purposes of content analysis, (2) target audiences, (3) epistemological issues, (4) ethical issues, (5) research designs, (6) sampling issues and methods, (7) collecting data, (8) coding and categorization methods, (9) data analysis methods, and (10) the role of researcher reflection.

Signed in as

Institutional accounts.

  • Google Scholar Indexing
  • GoogleCrawler [DO NOT DELETE]

Personal account

  • Sign in with email/username & password
  • Get email alerts
  • Save searches
  • Purchase content
  • Activate your purchase/trial code
  • Add your ORCID iD

Institutional access

Sign in with a library card.

  • Sign in with username/password
  • Recommend to your librarian
  • Institutional account management
  • Get help with access

Access to content on Oxford Academic is often provided through institutional subscriptions and purchases. If you are a member of an institution with an active account, you may be able to access content in one of the following ways:

IP based access

Typically, access is provided across an institutional network to a range of IP addresses. This authentication occurs automatically, and it is not possible to sign out of an IP authenticated account.

Choose this option to get remote access when outside your institution. Shibboleth/Open Athens technology is used to provide single sign-on between your institution’s website and Oxford Academic.

  • Click Sign in through your institution.
  • Select your institution from the list provided, which will take you to your institution's website to sign in.
  • When on the institution site, please use the credentials provided by your institution. Do not use an Oxford Academic personal account.
  • Following successful sign in, you will be returned to Oxford Academic.

If your institution is not listed or you cannot sign in to your institution’s website, please contact your librarian or administrator.

Enter your library card number to sign in. If you cannot sign in, please contact your librarian.

Society Members

Society member access to a journal is achieved in one of the following ways:

Sign in through society site

Many societies offer single sign-on between the society website and Oxford Academic. If you see ‘Sign in through society site’ in the sign in pane within a journal:

  • Click Sign in through society site.
  • When on the society site, please use the credentials provided by that society. Do not use an Oxford Academic personal account.

If you do not have a society account or have forgotten your username or password, please contact your society.

Sign in using a personal account

Some societies use Oxford Academic personal accounts to provide access to their members. See below.

A personal account can be used to get email alerts, save searches, purchase content, and activate subscriptions.

Some societies use Oxford Academic personal accounts to provide access to their members.

Viewing your signed in accounts

Click the account icon in the top right to:

  • View your signed in personal account and access account management features.
  • View the institutional accounts that are providing access.

Signed in but can't access content

Oxford Academic is home to a wide variety of products. The institutional subscription may not cover the content that you are trying to access. If you believe you should have access to that content, please contact your librarian.

For librarians and administrators, your personal account also provides access to institutional account management. Here you will find options to view and activate subscriptions, manage institutional settings and access options, access usage statistics, and more.

Our books are available by subscription or purchase to libraries and institutions.

  • About Oxford Academic
  • Publish journals with us
  • University press partners
  • What we publish
  • New features  
  • Open access
  • Rights and permissions
  • Accessibility
  • Advertising
  • Media enquiries
  • Oxford University Press
  • Oxford Languages
  • University of Oxford

Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide

  • Copyright © 2024 Oxford University Press
  • Cookie settings
  • Cookie policy
  • Privacy policy
  • Legal notice

This Feature Is Available To Subscribers Only

Sign In or Create an Account

This PDF is available to Subscribers Only

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Qualitative Text Analysis: A Systematic Approach

  • Open Access
  • First Online: 27 April 2019

Cite this chapter

You have full access to this open access chapter

qualitative content analysis in education research

  • Udo Kuckartz 4  

Part of the book series: ICME-13 Monographs ((ICME13Mo))

126k Accesses

228 Citations

3 Altmetric

Thematic analysis, often called Qualitative Content Analysis (QCA) in Europe, is one of the most commonly used methods for analyzing qualitative data. This paper presents the basics of this systematic method of qualitative data analysis, highlights its key characteristics, and describes a typical workflow. The aim is to present the main characteristics and to give a simple example of the process so that readers can assess whether this method might be useful for their own research. Special attention is paid to the formation of categories, since all scholars agree that categories are at the heart of the method.

You have full access to this open access chapter,  Download chapter PDF

Similar content being viewed by others

qualitative content analysis in education research

Qualitative Content Analysis: Theoretical Background and Procedures

qualitative content analysis in education research

Qualitative Methodology

qualitative content analysis in education research

The role of analytic direction in qualitative research

  • Qualitative data analysis
  • Text analysis
  • Qualitative methods
  • Qualitative content analysis
  • MAXQDA software

1 Introduction: Qualitative and Quantitative Data

Thematic analysis, often called Qualitative Content Analysis (QCA) in Europe, is one of the most commonly used methods for analyzing qualitative data (Guest et al. 2012 ; Kuckartz 2014 ; Mayring 2014 , 2015 ; Schreier 2012 ). This chapter presents the basics of this systematic method of qualitative data analysis, highlights its key characteristics, and describes a typical workflow.

Working with codes and categories is a proven method in qualitative research. QCA is a method that is reliable, easy to learn, transparent, and it is a method that is easily understood by other researchers. In short, it is a method that enjoys a high level of recognition and is to be highly recommended, especially in the context of dissertations.

The aim of this paper is to present the main characteristics and to give a simple example of the process so that readers can assess whether this method might be useful for their own research. Special attention is paid to the formation of categories, since all scholars agree that categories are at the heart of the method.

Let’s start with some of the basics of data analysis in empirical research: What does ‘qualitative data’ mean, and what do we mean by ‘quantitative data’? Quantitative data entail numerical information that results, for example, from the collection of data from a standardized interview. In a quantitative data matrix, each row corresponds to a case, namely, an interview with a respondent. The columns of the matrix are formed by the variables. Table  8.1 therefore shows the data of four cases, here the respondents 1–4. Six variables were collected for these individuals, on a scale of 1–6, concerning how often they perform certain household activities (laundry, small repairs etc.). Typically, these kinds of data sets are available in social research in the form of a rectangular matrix, for instance as shown in Table  8.1 .

A matrix like this that consists of numbers can be analyzed using statistical methods. For example, you can calculate univariate statistics such as mean values, variance, and standard deviations. You can also generate graphical displays such as box plots or bar charts. In addition, variables can be related to each other, for example by using methods of correlation and regression statistics. Another form of analysis tests groups for differences. In the above study, for example, the questions ‘Are women more frequently engaged in laundry than men in the household?’ and ‘Are men more frequently engaged in minor repairs than women in the household?’ can be calculated using an analysis of variance.

Qualitative data are far more diverse and complex than quantitative data. These data may comprise transcripts of face-to-face interviews or focus group discussions, documents, Twitter tweets, YouTube comments, or videos of the teacher-student interactions in the classroom.

In this chapter, I restrict the presentation of the QCA method to a specific type of data, namely qualitative interviews. This collective term can be used to describe very different forms of interviews, such as guideline-assisted interviews or narrative interviews on critical life events conducted in the context of biographical research. The latter can last several hours and comprise more than 30 pages as a transcription. A qualitative interview may also consist of a short online survey, like the one I conducted in preparation for my workshop at the International Congress on Mathematical Education (ICME-13).

Obviously, the different types of qualitative data are not as easy to analyze as the numbers in a quantitative data matrix. Numerous analytical methods have been developed in qualitative research, among them the well-proven method of qualitative content analysis.

2 Key Points of Qualitative Content Analysis

What are the key points of the qualitative content analysis method? Regardless of which variant of QCA is used, the focus will always be on working with categories (codes) and developing a category system (coding frame). What Berelson formulated in 1952 for quantitative content analysis still applies today, both to quantitative and qualitative content analysis:

Content analysis stands or falls by its categories (…) since the categories contain the substance of the investigation, a content analysis can be no better than its system of categories. (Berelson 1952 , p. 147)

Categories are therefore of crucial importance for effective research, not only in their role as analysis tools, but also insofar as they form the substance of the research and the building blocks of the theory the researchers want to develop. That raises the question ‘What are categories?’—or more precisely, ‘What are categories in the context of empirical social research?’ Answering this question is by no means easy and there are at least two ways of doing so. The first way can be described as phenomenological : Kuckartz ( 2016 , pp. 31–39) focuses on the use of this term in the practice of empirical social research, i.e., drawing attention to what is called a category in empirical social research. The result of this analysis is a very diverse spectrum, whereby several different types of categories can be distinguished in social science research literature (ibid., pp. 34–35):

Factual categories denote actual or supposed objective circumstances such as ‘length of training’ or ‘occupation’.

Thematic categories refer to certain topics, arguments, schools of thought etc. such as ‘inclusion’, ‘environmental justice’ or ‘Ukrainian conflict’.

Evaluative categories are related to an evaluation scale—usually ordinal types, for example the category ‘helper syndrome’ with the characteristics ‘not pronounced’, ‘somewhat pronounced’ and ‘pronounced’. For evaluative categories, it is the researchers who classify the data according to predefined criteria.

Analytical categories are the result of intensive analysis of the data, i.e., these categories move away from the description of the data, for example by means of thematic categories.

Theoretical categories are subspecies of analytical categories that refer to an existing theory, such as Ajzen’s theory of planned behavior, Ainsworth’s attachment theory, or Foucault’s analysis of power.

Natural categories , also called “in vivo codes” (Charmaz 2006 , p. 56; Kuckartz 2014 , p. 23), are terms used by the actors in the field.

Formal categories denote formal characteristics of an analysis unit, e.g., the length of time in an interview.

The above list is not complete; there are many more types of categories and corresponding methods of coding (Saldana 2015 ).

A second way of answering the question ‘What is a category?’ can be described as conceptual and historical; this way leads us far back into the history of philosophy. The conceptual historical view of the term, originating from ancient Greece, starts with Greek philosophy more than 2000 years ago. Plato and Aristotle already dealt with categories—Aristotle even in an elaboration of the same term (“categories”). The study of categories runs through Western philosophy from Plato and Kant to Peirce and analytical philosophy. The philosophers are by no means in agreement on the concept of categories, but a discussion of the differences between the different schools would far exceed the scope of this paper; Instead, reading the mostly very extensive contributions on the terms ‘category’ and ‘category theory’ in the various lexicons of philosophy is recommended. Categories are basic concepts of cognition; they are—generally speaking—a commonality between certain things: a term, a heading, a label that designates something similar under certain aspects. Categories also play this role in content analysis, as the following quote from the Content Analysis textbook of Früh ( 2004 ) demonstrates:

The pragmatic sense of any content analysis is ultimately to reduce complexity from a certain research-led perspective. Text sets are described in a classifying manner with regard to characteristics of theoretical interest. In this reduction of complexity, information is necessarily lost: On the one hand, information is lost due to the suppression of message characteristics that are present in the examined texts but are not of interest in connection with the present research question; on the other hand, information is lost due to the classification of the analyzed message characteristics. According to specified criteria, some of them are each considered similar to one another and assigned to a certain characteristic class or a characteristic type, which is called ‘category’ in the content analysis. The original differences in meaning of the message characteristics uniformly grouped in a category shall not be taken into account. (p. 42, translated by the author)

But how does qualitative content analysis arrive at its categories, the basic building blocks for forming theory? There are three principal ways to develop categories:

Concept-driven (‘deductive’) development of categories; in this case the categories

are derived from a theory or

derived from the literature (the current state of research) or

derived from the research question (e.g. directly related to an interview guide)

Data-driven (‘inductive’) development of categories; the characteristics here are

the step-by-step procedure,

the method of open coding until saturation occurs,

the continuous organization and systematization of the formed codes, and

the development of top-level codes and subcodes at different levels.

Mixing a concept-driven and data-driven development of codes:

The starting point here is usually a coding frame with deductively formed codes and

the subsequent inductive coding of all data coded with a specific main category.

The terms deductive and inductive are often used for the concept-driven and data-driven approaches, respectively. However, the use of the term ‘deductive’ is rather problematic in this context: In scientific logic, the term ‘inductive’ refers to the abstract conclusion from what has been observed empirically to a general rule or a law; this has little to do with the formation of categories based on empirical data. The situation is similar with the term ‘deductive’: In scientific logic, the deductive conclusion is a logical consequence of its premises; the formation of categories based on the state of research, a theory, or an advanced hypothesis is very different. Categories do not necessarily emerge from a systematic literature review or from a research question. Due to its skid resistance, however, the word pair ‘inductive-deductive’ will probably remain in the language theorem of empirical social research or the formation of categories for a long time to come. Nevertheless, I try to avoid the terms inductive and deductive, and—like Schreier ( 2012 , p. 84)—prefer the terms ‘data-driven’ and ‘concept-driven’ for these different approaches to the formation of categories.

The decisive action in QCA is the coding of the data, i.e. a precisely defined part of the material is selected, and a category is assigned. As shown in the following figure, this may be a passage from an interview. Here, paragraph 15 of the text was coded with the code Simultaneousness (Fig.  8.1 ).

figure 1

Text passage with a coded text segment

The individuals who perform this segmentation and coding of the data are referred to as coders. In this context, we also speak of “inter- and intracoder agreement” (reliability) (Krippendorff 2012 ; Kuckartz 2016 ; Schreier 2012 ). In quantitative content analysis, the units to be coded are usually defined in advance and referred to as coding units. In qualitative content analysis, on the other hand, coding units are not usually defined in advance; they are created by the coding process.

The general workflow of a qualitative content analysis is in Fig.  8.2 . In all variants the research question plays the central role in this method: It provides the perspective for the textual work necessary at the beginning, that is, the intensive reading and study of the texts (Kuckartz 2016 , p. 45). For qualitative methods, it is common for the individual analysis phases to be carried out on a circular basis. This also applies to QCA: The creation of categories and subcategories and the coding of the data can take place in several cycles. Saldana ( 2015 ) speaks of first cycle coding and second cycle coding, for example. The number of cycles is not fixed, and only in rare cases would one get by with just a single cycle.

figure 2

The five phases of qualitative content analysis

Once all the data have been coded with the final category frame, a systematization and structuring of all the relevant data in view of the research questions at hand will have been achieved. Table  8.2 illustrates a model of such a thematic matrix. It is similar to the quantitative data matrix shown in Fig.  8.2 , but instead of containing numbers, the cells of the matrix now contain text excerpts coded with the respective corresponding category.

The further analysis of the matrix can now take two directions: If you look at columns, you can examine certain topics. These forms of analysis can be described as ‘category-based’. Looking at the rows, you can focus on cases (people) and carry out a ‘case-oriented analysis’.

Category-based analyses can focus on a specific category or even consider several categories simultaneously. For example, the statements made by the research participants can be contrasted between two or across several topics. Such complex analyses can lead to very rich descriptions or to the determination of influencing factors and effects, which can then be displayed in a concept map. Case-oriented analyses allow you to identify similarities between cases, identify extreme cases, and form types. Methods of consistently comparing and contrasting cases can be used to this end. For example, if you have determined a typology, you can then visualize it as a constellation of clusters and cases.

3 The Analysis Process in Detail

The example used in the following is a short online survey conducted in preparation for the ‘Workshop on qualitative text analysis’ as part of the ICME 13. The aim of the survey was to provide an overview of the research needs of the participants and their level of knowledge. In other words, its aim was descriptive and not about the development of hypotheses or a theory. In this online interview, I asked the following five questions and asked the participants to write their responses directly below the questions. Table  8.3 contains the resulting qualitative data.

Typically, QCA consists of six steps

Step 1: Preparing the data, initiating text work

Step 2: Forming main categories corresponding to the questions asked in the interview

Step 3: Coding data with the main categories

Step 4: Compiling text passages of the main categories and forming subcategories inductively on the material; assigning text passages to subcategories

Step 5: Category-based analyses and presenting results

Step 6: Reporting and documentation.

Since the purpose of the survey in this case was to get an overview of the relevant interests of the workshop participants and to tailor the workshop to their needs, the last step was omitted. There was no need for reporting and documentation.

The first phase consists of preparing of the data and conducting an initial read-through the responses; the analysis of this short survey did not require extensive interpretation of the responses. Since respondents used different fonts and font sizes in their e-mails, these had to be standardized first when preparing the data. In addition, the overall formatting was also adjusted to render it more uniform across responses. This would not have been absolutely necessary for the analysis, but without this preparation, later compilations of coded text passages might have looked rather chaotic.

In the second phase of QCA, categories are formed. When analyzing data obtained through an online survey, it is best to create a set of main categories based on the questions asked. In this analysis, the following five categories were formed for the first coding cycle:

Motives and goals

Experience with QCA

Specific questions about QCA

Experience with QDAS ( Q ualitative d ata a nalysis s oftware)

Academic discipline.

Since the questions in the online survey were numbered, the numbers were retained for better orientation, but they could have been dispensed with without any problems.

According to the differentiation of categories laid out earlier in this paper, the categories Motives and goals and Specific questions about QCA are thematic categories. Category 5 Academic Discipline is a factual code. The other two categories Experience with QCA and Experience with QDAS are about the experiences with the method and with QDA software. If the researcher is interested in the extent of participants’ experience, both categories are evaluative categories; alternatively, if the specific type of experience is the primary point of interest, the categories are thematic. Since the aim of this survey was to get an overview of the level of knowledge and practical experience of the respondents, an overview was sufficient; detailed knowledge of the types of experience the participants had gained was not absolutely necessary. Reading the responses also demonstrated that the respondents understood the question in this sense and that in most cases no specific details were provided. In any case, working with software like MAXQDA guarantees that you can always return to the original texts should this be useful or necessary during the course of the analysis.

In the third phase of the analysis, the corresponding text segments are coded with the five main categories. Figure  8.3 shows a screenshot of the software MAXQDA after this first cycle of coding was performed on the survey responses. The assignments of the codes are displayed to the left of the corresponding text sections.

figure 3

Display of a text with code assignments after the first cycle of coding

In the following fourth phase of the analysis, the coding frame is developed further. To do this, all the text passages coded with one of the main categories are first compiled, a procedure which is also referred to as retrieval . Subcodes are then developed directly in the relation to this data—in other words, the creation of categories is data-driven. This process is described in the following with regard to the first main category Motives and goals :

The category Motives and goals coded the responses to the question regarding what the participants wanted to learn in the workshop. First, all text passages to which this category was assigned were compiled. Then each of these text passages was coded a second time. This was done with a procedure similar to that of open coding in Grounded Theory (Strauss and Corbin 1990 ). In this case, the codes were short sequences of words that described what the participants wanted to learn:

analyze mathematics textbook curricula

learn type-building analysis

analyze e-portfolios and group discussions

analyze responses to open-ended questions

learn more about different research methods

how to establish credibility in practice

learn more about rigor within the process and how to ensure its validity

the role of reliability coefficients

insight into conducting qualitative research

learn about the QCA method

how to code video transcripts

how to take the richness of data into account (not only numbers)

analyze large numbers of open questions

learn more about a few different approaches to choose from

searching for a suitable method to analyze the interviews

interesting for me to see how colleagues are working.

As part of the software MAXQDA there is a module called “Creative Coding” that allows you to visually group codes obtained through the open coding method. After arranging the open codes, seven subcategories were created for the category “Motives and Objectives”, namely

Getting an overview of qualitative research

Getting an overview of QCA

Learning basic techniques

Learning about special type of analysis

Reliability and validity

Learning to analyze special types of data

Interesting for me to see how colleagues are working.

Figure  8.4 shows a visual display of the category formation; the original statements are assigned to the respective category. It turns out that many participants in the workshop were mainly interested in obtaining an overview of qualitative content analysis and qualitative research in general. The graph also implicitly illustrates the differences between a quantitative and qualitative analysis of the responses: Four participants (a comparatively large proportion) wanted to learn how to analyze specific types of data, but a closer look at the details, that is, the qualitative dimension, reveals that the types of data the respondents had in mind were completely different.

figure 4

Visualization of the motives grouped into subcategories

Once the subcategories have been created, all the data coded with the main category Motives and goals must be coded a second time. This is also known as the second coding cycle. In this sample survey, all the coded text passages were included in the formation of the subcategories due to the relatively small sample. In the case of small sample sizes like this, the Creative Coding module automatically reassigns the subcategories. In the case of larger samples, however, category formation will usually be carried out only with a subsample and not with all the data, or the process of open coding will be performed only until the system of subcategories appears saturated and no further subcategories need to be redefined. Then, of course, the data that have not been considered up to this point must still be coded in line with the final category system.

The two categories Experience with QCA and Experience with QDAS were used to code the text passages in which the respondents reported on their experience with the QCA method and the use of QDA software. For the purposes of preparing the workshop as described above, the analysis should address only whether participants had prior experience and how extensive this experience was. An evaluative category with the values ‘yes’, ‘partial’, ‘no’ was therefore defined.

For the third main category, Specific questions about QCA , no subcategories were formed, since the questions formulated by the participants had to be retained in their wording to answer them in the workshop. However, the questions asked were sorted by topic, and essentially identical questions were summarized.

For category 5, Academic discipline , subcategories were initially formed according to the disciplines mentioned by the respondents. However, it quickly transpired that almost all participants came from the field of mathematics education and that there were only a few individual cases from other fields such as development psychology or primary school teacher (see Fig.  8.5 ). These individual cases were combined into the subcategory others for the final category system, so that ultimately only two subcategories were formed.

figure 5

Main category “Academic discipline and status” with subcategories

After the main categories have been processed in this way—five in the case of this survey—the fifth phase ‘Category-based analyses and presenting results’ can begin. However, it should be clear that in the fourth phase of the development of the category system, an extensive amount of analytical work has already being carried out. The identification of the different motive types represents an analytical achievement in itself and is, at the same time, the foundation of the corresponding category-based analysis in phase 5. The category Motives and goals was of central importance in this survey. In addition to identifying the various motives, both quantitative and qualitative analyses can now be carried out. Quantitatively, we can determine how many people expressed which motives in their statement. Of course, it is quite possible for someone to have expressed several motives. In terms of a qualitative analysis, we can ask what is behind these categories in greater detail. In relation to the subcategory Learning to analyze special types of data , for example, we could ask which special data types the respondents had in mind here.

The category-based analysis always offers the option of focusing on qualitative and/or quantitative aspects. A frequency analysis of the category Experience with QDAS shows that the vast majority of participants have not yet had any practical experience with QDA software (see Fig.  8.6 ).

figure 6

Bar chart of the category “Experiences with QDA software”

The question concerning their experience with text analysis methods presents a somewhat different picture. Quantitatively, we can see that more people are experienced in this regard, while the more detailed qualitative view reveals that this experience mainly involved the Grounded Theory method. It is interesting to compare the two categories that deal with experience. Table  8.4 contains an excerpt from such a comparison between five people.

There are also many further possibilities regarding the analysis of interrelationships that can be carried out in this fifth phase. For example, the connection between motives and goals, and previous knowledge and experience, can be examined. In relation to the specific questions asked by respondents in the survey, one could create a cross table (or “crosstab”) in which the questions asked by the experienced group are compared with the questions asked by those with no experience.

There are many other analysis options for larger studies than those presented for the small online survey. Qualitative content analysis is not a method that is always applied in the same way regardless of the data or research questions at hand. Although it is a systematic procedure, it nonetheless offers a flexibility that allows you to adapt it to the respective requirements of a project. There are other analytical possibilities in this regard, which were not mentioned in the above description. Among these, two should be highlighted in particular, namely, the possibility of paraphrasing text passages and the possibility of creating thematic summaries.

Paraphrasing passages of text can be understood in its everyday sense, namely, that researchers reformulate these text passages in their own words. This can be a very useful tool for category development. This technique is especially recommended for beginners, as it forces them to read the text line by line, interpret it to gain a thorough understanding, and then record it in their own words. It is certainly too time-consuming in most cases to edit all texts in this way but paraphrasing a selected subset of texts can sharpen your analytical view and be a valuable intermediate step in the development of a meaningful category system. Moreover, these paraphrases can then be sorted, particularly significant paraphrases can be combined, and gradually more abstract and theoretically rich categories can be formed.

In contrast to paraphrasing texts, formulating thematic summaries assumes that the texts have already been coded. In this approach, all the text passages coded in regard to a specific topic are read for each case and a thematic summary is written for each person. Usually, there is a huge gap between a category and the amount of original text assigned to it in the case of longer qualitative interviews, such as narrative interviews. On the one hand there is a relatively short code, such as ‘Environmental behavior in relation to nutrition’, and on the other there are numerous passages of varying length in which a respondent says something on this subject. A thematic summary summarizes all these passages as said by a certain person from the perspective of the research question. This means that the text is not repeated, but rather edited conceptually. Summaries thus create a second level between the original text and the categories and concepts. They also enable complex analyses to be carried out in which several categories are compared or the statements of different groups (women/men, different age groups, different schooling, etc.) are contrasted. This would be nigh impossible if the original quotations were always used since the amount of text would simply be too large, and it would consequently not be possible to create case overviews. A thematic summary, on the other hand, compresses what one person has said in such a way that it can easily be included in further analyses.

A third possibility the QCA method offers is the visualization of relationships between categories. Diagrams, in the form of concept maps, can be generated in which the influencing factors, effects, and relations are visualized.

Phase 6, ‘Reporting and documentation’, is about putting the results of your analyses on paper. The research report of a project working with the QCA method is usually divided into a descriptive and an analytical section. Depending on the method and the significance of the categories, category-based analyses will be the center of attention. The case dimension, however, which is all too often neglected, should also be taken into account in the report. It is often very valuable for the recipients of the research not only to learn something about the connections between the categories, but also something about the participants, that is, the cases that are consciously selected for such a presentation. It is particularly interesting if the cases are grouped into types and the report presents cases that are representative of these types.

The category-based presentation should be illustrated with quotes from the original material. However, you should also be aware of the danger of selective plausibility, i.e., that one mainly selects quotations that clarify the alleged connections between categories, while contradictory examples are not considered. For this reason, counterexamples should always be sought and included in the report.

Category-based analysis should not be limited to a description of the results per category but should also look at the relationships between two or more categories. In other words, you should move from the initial description to the development of a theory.

4 Summary and Conclusions

This chapter presents a method for the methodically controlled analysis of texts in empirical research. To conclude, therefore, the characteristics of the QCA method are concisely summarized:

The focus of the QCA method is on the categories with which the data are coded.

The categories of the final coding frame are described as precisely as possible and it is ensured that the coding procedure itself is reliable, i.e., that different coders concur in their coding.

The data must be coded completely. Complete in this sense means that all passages in the texts that are relevant to the research question are coded. It does, however, make sense to leave those parts of the data uncoded, which are outside the focus of the research question.

The codes and categories can be formed in different ways: empirically, i.e., based directly on the material, or conceptually, i.e., based on the current state of research or on a theory/hypothesis or, rather, as an implementation of the guidelines used in an interview or focus group.

The QCA method is carried out in several phases, ranging from data preparation, category building and coding—which may run in several cycles—to analysis, report writing and presenting the results. QCA therefore means more than just coding the data. Coding is an important step in the analysis, but it is ultimately a preparation for the subsequent analytical steps.

The actual analysis phase consists of summarizing the data, and constantly comparing and contrasting the data. The analysis techniques can be qualitative as well as quantitative. The qualitative analysis may, for example, consist of comparing the statements of certain groups (for instance according to their characteristics, e.g., socio-demographic characteristics) on certain topics. Differences and similarities are identified and summarized in a report. Quantitative analyses may, on the other hand, consist of comparing the frequency of certain categories and/or subcategories for certain groups.

Summary tables and diagrams (e.g., concept maps) can play an important role in the analysis. A good example of a presentation in table form would be a case overview of selected research participants (or groups), in which their statements on certain topics, their judgements and variable values are displayed. An example of a concept map would be a diagram of the determined causal effects of different categories.

Visualizations can also have a diagnostic function in QCA—similarly to imaging procedures in medicine. For example, a ‘cases by categories’ or ‘categories by categories’ display can help identify patterns in the data and indicate which categories are particularly frequently or particularly rarely associated with certain other categories.

When analyzing texts, you should keep in mind that you are working in the field of interpretation. It can be assumed that texts or statements could be interpreted differently. Instead of adopting a constructivist ‘anything goes’ approach, the QCA method tries to reach a consensus—as far as this is possible—on the subjective meaning of statements and tries to define the categories formed or used by it so precisely that an intersubjective agreement can be achieved in the application of the categories.

Group processes play an important role in this process of achieving the necessary level of agreement. Divergent assignments to categories are discussed as a team and should result in an improvement of the category definitions. Categories for which no agreement can be reached in the coding of relevant points in the data must be excluded from the analysis. Content analysis stands and falls by its categories. An analysis with the help of categories that are interpreted and applied differently in the research team, does not make sense.

QCA does not claim to be the best method but recognizes that it has its limits (the interpretation barrier) and that its results have to face comparison with those of competing methods.

The systematic approach of QCA is multidisciplinary and can be applied in many disciplines, including mathematics education (Schwarz 2015 ). This method is particularly appropriate when working with clearly formulated research questions, because these questions play the central role in this method. Indeed, in every phase of the analysis there is a strong reference to the questions leading the research. One strength of QCA is that it can be used both to describe social phenomena and to develop theories or test hypotheses (Hopf 2016 , pp. 155–166).

Berelson, B. (1952). Content analysis in communication research . Glencoe: Free Press.

Google Scholar  

Charmaz, K. (2006). Constructing grounded theory . Thousand Oaks: SAGE.

Früh, W. (2004). Inhaltsanalyse. Theorie und Praxis (5th ed.). Konstanz: UVK.

Guest, G., MacQueen, K. M., & Namey, E. E. (2012). Applied thematic analysis . Thousand Oaks: SAGE.

Book   Google Scholar  

Hopf, C. (2016). In W. Hopf & U. Kuckartz (Eds.), Schriften zu Methodologie und Methoden qualitativer Sozialforschung . Wiesbaden: Springer.

Krippendorff, K. H. (2012). Content analysis: An introduction to its methodology (3rd ed.). Thousand Oaks: SAGE.

Kuckartz, U. (2014). Qualitative text analysis: A guide to methods, practice and using software . Los Angeles: SAGE.

Kuckartz, U. (2016). Qualitative Inhaltsanalyse. Methoden, Praxis, Computerunterstützung (3rd ed.). Weinheim: Beltz Juventa.

Mayring, P. (2014). Qualitative content analysis: Theoretical foundation, basic procedures and software solution. Klagenfurt. http://nbn-resolving.de/urn:nbn:de:0168-ssoar-395173 .

Mayring, P. (2015). Qualitative content analysis: Theoretical background and procedures. In A. Bikner-Ahsbahs, C. Knipping, & N. Presmeg (Eds.), Approaches to qualitative research in mathematics education. Examples of methodology and methods (pp. 365–380). Dordrecht: Springer.

Saldana, J. (2015). The coding manual for qualitative researchers (3rd ed.). Thousand Oaks: SAGE.

Schreier, M. (2012). Qualitative content analysis in practice . Thousand Oaks: SAGE.

Schwarz, B. (2015). A study on professional competence of future teacher students as an example of a study using qualitative content analysis. In A. Bikner-Ahsbahs, C. Knipping, & N. Presmeg (Eds.), Approaches to qualitative research in mathematics education. Examples of methodology and methods (pp. 381–399). Dordrecht: Springer.

Strauss, A. L., & Corbin, J. M. (1990). Basics of qualitative research: Grounded theory procedures and techniques. Thousand Oaks: Sage Publications, Inc.

Download references

Author information

Authors and affiliations.

Philipps-University Marburg, Marburg, Germany

Udo Kuckartz

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Udo Kuckartz .

Editor information

Editors and affiliations.

Faculty of Education, Universität Hamburg, Hamburg, Germany

Gabriele Kaiser

Mathematics Department, Illinois State University, Normal, IL, USA

Norma Presmeg

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

© 2019 The Author(s)

About this chapter

Kuckartz, U. (2019). Qualitative Text Analysis: A Systematic Approach. In: Kaiser, G., Presmeg, N. (eds) Compendium for Early Career Researchers in Mathematics Education . ICME-13 Monographs. Springer, Cham. https://doi.org/10.1007/978-3-030-15636-7_8

Download citation

DOI : https://doi.org/10.1007/978-3-030-15636-7_8

Published : 27 April 2019

Publisher Name : Springer, Cham

Print ISBN : 978-3-030-15635-0

Online ISBN : 978-3-030-15636-7

eBook Packages : Education Education (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

COMMENTS

  1. (PDF) Qualitative content analysis - ResearchGate

    The main purpose of this article is to clarify the features of Qualitative Content Analysis (QCA) and highlighting how it is utilised in conduting educational research.

  2. Chapter 17. Content Analysis – Introduction to Qualitative ...

    This chapter will explore different approaches to content analysis and provide helpful tips on how to collect data, how to turn that data into codes for analysis, and how to go about presenting what is found through analysis.

  3. How to plan and perform a qualitative study using content ...

    Four distinct main stages are described in this paper: the decontextualisation, the recontextualisation, the categorization, and the compilation. This description of qualitative content analysis offers one approach that shows how the general principles of the method can be used.

  4. Qualitative Content Analysis: Theoretical Background and ...

    Qualitative Content Analysis designates a bundle of text analysis procedures integrating qualitative and quantitative steps of analysis, which makes it an approach of mixed methods. This contribution defines it with a background of quantitative content analysis and...

  5. Demystifying Content Analysis - PMC - PubMed Central (PMC)

    This review of content analysis as a qualitative data analysis method will provide clarity and actionable instruction for both novice and experienced pharmacy education researchers. Keywords: content analysis, educational research, latent, manifest, qualitative data analysis.

  6. Three Approaches to Qualitative Content Analysis

    Content analysis is a widely used qualitative research technique. Rather than being a single method, current applications of content analysis show three distinct approaches: conventional, directed, or summative.

  7. Qualitative Content Analysis - Oxford Academic

    Qualitative content analysis is explored in detail in terms of its characteristic components: (1) the research purposes of content analysis, (2) target audiences, (3) epistemological issues, (4) ethical issues, (5) research designs, (6) sampling issues and methods, (7) collecting data, (8) coding and categorization methods, (9) data analysis ...

  8. Qualitative Text Analysis: A Systematic Approach

    Thematic analysis, often called Qualitative Content Analysis (QCA) in Europe, is one of the most commonly used methods for analyzing qualitative data. This paper presents the basics of this systematic method of qualitative data analysis, highlights its key...

  9. Reflexive Content Analysis: An Approach to Qualitative Data ...

    Content analysis, introduced as a quantitative technique for discerning patterns in qualitative data (Graneheim & Lundman, 2004), has since been redefined into a popularly adopted qualitative method (Cho & Lee, 2014).

  10. Qualitative research in science education: A literature ...

    The researchers utilized a systematic literature review methodology by (1) creating search terms based on the research question; (2) choosing relevant databases in which to search; (3) conducting the search and gathering articles; and (4) selecting articles based on inclusion criteria.