Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, generate accurate citations for free.

  • Knowledge Base

Methodology

  • How to Write a Literature Review | Guide, Examples, & Templates

How to Write a Literature Review | Guide, Examples, & Templates

Published on January 2, 2023 by Shona McCombes . Revised on September 11, 2023.

What is a literature review? A literature review is a survey of scholarly sources on a specific topic. It provides an overview of current knowledge, allowing you to identify relevant theories, methods, and gaps in the existing research that you can later apply to your paper, thesis, or dissertation topic .

There are five key steps to writing a literature review:

  • Search for relevant literature
  • Evaluate sources
  • Identify themes, debates, and gaps
  • Outline the structure
  • Write your literature review

A good literature review doesn’t just summarize sources—it analyzes, synthesizes , and critically evaluates to give a clear picture of the state of knowledge on the subject.

Instantly correct all language mistakes in your text

Upload your document to correct all your mistakes in minutes

upload-your-document-ai-proofreader

Table of contents

What is the purpose of a literature review, examples of literature reviews, step 1 – search for relevant literature, step 2 – evaluate and select sources, step 3 – identify themes, debates, and gaps, step 4 – outline your literature review’s structure, step 5 – write your literature review, free lecture slides, other interesting articles, frequently asked questions, introduction.

  • Quick Run-through
  • Step 1 & 2

When you write a thesis , dissertation , or research paper , you will likely have to conduct a literature review to situate your research within existing knowledge. The literature review gives you a chance to:

  • Demonstrate your familiarity with the topic and its scholarly context
  • Develop a theoretical framework and methodology for your research
  • Position your work in relation to other researchers and theorists
  • Show how your research addresses a gap or contributes to a debate
  • Evaluate the current state of research and demonstrate your knowledge of the scholarly debates around your topic.

Writing literature reviews is a particularly important skill if you want to apply for graduate school or pursue a career in research. We’ve written a step-by-step guide that you can follow below.

Literature review guide

Receive feedback on language, structure, and formatting

Professional editors proofread and edit your paper by focusing on:

  • Academic style
  • Vague sentences
  • Style consistency

See an example

literature review of error analysis

Writing literature reviews can be quite challenging! A good starting point could be to look at some examples, depending on what kind of literature review you’d like to write.

  • Example literature review #1: “Why Do People Migrate? A Review of the Theoretical Literature” ( Theoretical literature review about the development of economic migration theory from the 1950s to today.)
  • Example literature review #2: “Literature review as a research methodology: An overview and guidelines” ( Methodological literature review about interdisciplinary knowledge acquisition and production.)
  • Example literature review #3: “The Use of Technology in English Language Learning: A Literature Review” ( Thematic literature review about the effects of technology on language acquisition.)
  • Example literature review #4: “Learners’ Listening Comprehension Difficulties in English Language Learning: A Literature Review” ( Chronological literature review about how the concept of listening skills has changed over time.)

You can also check out our templates with literature review examples and sample outlines at the links below.

Download Word doc Download Google doc

Before you begin searching for literature, you need a clearly defined topic .

If you are writing the literature review section of a dissertation or research paper, you will search for literature related to your research problem and questions .

Make a list of keywords

Start by creating a list of keywords related to your research question. Include each of the key concepts or variables you’re interested in, and list any synonyms and related terms. You can add to this list as you discover new keywords in the process of your literature search.

  • Social media, Facebook, Instagram, Twitter, Snapchat, TikTok
  • Body image, self-perception, self-esteem, mental health
  • Generation Z, teenagers, adolescents, youth

Search for relevant sources

Use your keywords to begin searching for sources. Some useful databases to search for journals and articles include:

  • Your university’s library catalogue
  • Google Scholar
  • Project Muse (humanities and social sciences)
  • Medline (life sciences and biomedicine)
  • EconLit (economics)
  • Inspec (physics, engineering and computer science)

You can also use boolean operators to help narrow down your search.

Make sure to read the abstract to find out whether an article is relevant to your question. When you find a useful book or article, you can check the bibliography to find other relevant sources.

You likely won’t be able to read absolutely everything that has been written on your topic, so it will be necessary to evaluate which sources are most relevant to your research question.

For each publication, ask yourself:

  • What question or problem is the author addressing?
  • What are the key concepts and how are they defined?
  • What are the key theories, models, and methods?
  • Does the research use established frameworks or take an innovative approach?
  • What are the results and conclusions of the study?
  • How does the publication relate to other literature in the field? Does it confirm, add to, or challenge established knowledge?
  • What are the strengths and weaknesses of the research?

Make sure the sources you use are credible , and make sure you read any landmark studies and major theories in your field of research.

You can use our template to summarize and evaluate sources you’re thinking about using. Click on either button below to download.

Take notes and cite your sources

As you read, you should also begin the writing process. Take notes that you can later incorporate into the text of your literature review.

It is important to keep track of your sources with citations to avoid plagiarism . It can be helpful to make an annotated bibliography , where you compile full citation information and write a paragraph of summary and analysis for each source. This helps you remember what you read and saves time later in the process.

The only proofreading tool specialized in correcting academic writing - try for free!

The academic proofreading tool has been trained on 1000s of academic texts and by native English editors. Making it the most accurate and reliable proofreading tool for students.

literature review of error analysis

Try for free

To begin organizing your literature review’s argument and structure, be sure you understand the connections and relationships between the sources you’ve read. Based on your reading and notes, you can look for:

  • Trends and patterns (in theory, method or results): do certain approaches become more or less popular over time?
  • Themes: what questions or concepts recur across the literature?
  • Debates, conflicts and contradictions: where do sources disagree?
  • Pivotal publications: are there any influential theories or studies that changed the direction of the field?
  • Gaps: what is missing from the literature? Are there weaknesses that need to be addressed?

This step will help you work out the structure of your literature review and (if applicable) show how your own research will contribute to existing knowledge.

  • Most research has focused on young women.
  • There is an increasing interest in the visual aspects of social media.
  • But there is still a lack of robust research on highly visual platforms like Instagram and Snapchat—this is a gap that you could address in your own research.

There are various approaches to organizing the body of a literature review. Depending on the length of your literature review, you can combine several of these strategies (for example, your overall structure might be thematic, but each theme is discussed chronologically).

Chronological

The simplest approach is to trace the development of the topic over time. However, if you choose this strategy, be careful to avoid simply listing and summarizing sources in order.

Try to analyze patterns, turning points and key debates that have shaped the direction of the field. Give your interpretation of how and why certain developments occurred.

If you have found some recurring central themes, you can organize your literature review into subsections that address different aspects of the topic.

For example, if you are reviewing literature about inequalities in migrant health outcomes, key themes might include healthcare policy, language barriers, cultural attitudes, legal status, and economic access.

Methodological

If you draw your sources from different disciplines or fields that use a variety of research methods , you might want to compare the results and conclusions that emerge from different approaches. For example:

  • Look at what results have emerged in qualitative versus quantitative research
  • Discuss how the topic has been approached by empirical versus theoretical scholarship
  • Divide the literature into sociological, historical, and cultural sources

Theoretical

A literature review is often the foundation for a theoretical framework . You can use it to discuss various theories, models, and definitions of key concepts.

You might argue for the relevance of a specific theoretical approach, or combine various theoretical concepts to create a framework for your research.

Like any other academic text , your literature review should have an introduction , a main body, and a conclusion . What you include in each depends on the objective of your literature review.

The introduction should clearly establish the focus and purpose of the literature review.

Depending on the length of your literature review, you might want to divide the body into subsections. You can use a subheading for each theme, time period, or methodological approach.

As you write, you can follow these tips:

  • Summarize and synthesize: give an overview of the main points of each source and combine them into a coherent whole
  • Analyze and interpret: don’t just paraphrase other researchers — add your own interpretations where possible, discussing the significance of findings in relation to the literature as a whole
  • Critically evaluate: mention the strengths and weaknesses of your sources
  • Write in well-structured paragraphs: use transition words and topic sentences to draw connections, comparisons and contrasts

In the conclusion, you should summarize the key findings you have taken from the literature and emphasize their significance.

When you’ve finished writing and revising your literature review, don’t forget to proofread thoroughly before submitting. Not a language expert? Check out Scribbr’s professional proofreading services !

This article has been adapted into lecture slides that you can use to teach your students about writing a literature review.

Scribbr slides are free to use, customize, and distribute for educational purposes.

Open Google Slides Download PowerPoint

If you want to know more about the research process , methodology , research bias , or statistics , make sure to check out some of our other articles with explanations and examples.

  • Sampling methods
  • Simple random sampling
  • Stratified sampling
  • Cluster sampling
  • Likert scales
  • Reproducibility

 Statistics

  • Null hypothesis
  • Statistical power
  • Probability distribution
  • Effect size
  • Poisson distribution

Research bias

  • Optimism bias
  • Cognitive bias
  • Implicit bias
  • Hawthorne effect
  • Anchoring bias
  • Explicit bias

A literature review is a survey of scholarly sources (such as books, journal articles, and theses) related to a specific topic or research question .

It is often written as part of a thesis, dissertation , or research paper , in order to situate your work in relation to existing knowledge.

There are several reasons to conduct a literature review at the beginning of a research project:

  • To familiarize yourself with the current state of knowledge on your topic
  • To ensure that you’re not just repeating what others have already done
  • To identify gaps in knowledge and unresolved problems that your research can address
  • To develop your theoretical framework and methodology
  • To provide an overview of the key findings and debates on the topic

Writing the literature review shows your reader how your work relates to existing research and what new insights it will contribute.

The literature review usually comes near the beginning of your thesis or dissertation . After the introduction , it grounds your research in a scholarly field and leads directly to your theoretical framework or methodology .

A literature review is a survey of credible sources on a topic, often used in dissertations , theses, and research papers . Literature reviews give an overview of knowledge on a subject, helping you identify relevant theories and methods, as well as gaps in existing research. Literature reviews are set up similarly to other  academic texts , with an introduction , a main body, and a conclusion .

An  annotated bibliography is a list of  source references that has a short description (called an annotation ) for each of the sources. It is often assigned as part of the research process for a  paper .  

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

McCombes, S. (2023, September 11). How to Write a Literature Review | Guide, Examples, & Templates. Scribbr. Retrieved April 15, 2024, from https://www.scribbr.com/dissertation/literature-review/

Is this article helpful?

Shona McCombes

Shona McCombes

Other students also liked, what is a theoretical framework | guide to organizing, what is a research methodology | steps & tips, how to write a research proposal | examples & templates, what is your plagiarism score.

  • Original Research
  • Open access
  • Published: 30 January 2018

Teaching and learning mathematics through error analysis

  • Sheryl J. Rushton 1  

Fields Mathematics Education Journal volume  3 , Article number:  4 ( 2018 ) Cite this article

42k Accesses

22 Citations

12 Altmetric

Metrics details

For decades, mathematics education pedagogy has relied most heavily on teachers, demonstrating correctly worked example exercises as models for students to follow while practicing their own exercises. In more recent years, incorrect exercises have been introduced for the purpose of student-conducted error analysis. Combining the use of correctly worked exercises with error analysis has led researchers to posit increased mathematical understanding. Combining the use of correctly worked exercises with error analysis has led researchers to posit increased mathematical understanding.

A mixed method design was used to investigate the use of error analysis in a seventh-grade mathematics unit on equations and inequalities. Quantitative data were used to establish statistical significance of the effectiveness of using error analysis and qualitative methods were used to understand participants’ experience with error analysis.

The results determined that there was no significant difference in posttest scores. However, there was a significant difference in delayed posttest scores.

In general, the teacher and students found the use of error analysis to be beneficial in the learning process.

For decades, mathematics education pedagogy has relied most heavily on teachers demonstrating correctly worked example exercises as models for students to follow while practicing their own exercises [ 3 ]. In more recent years, incorrect exercises have been introduced for the purpose of student-conducted error analysis [ 17 ]. Conducting error analysis aligns with the Standards of Mathematical Practice [ 18 , 19 ] and the Mathematics Teaching Practices [ 18 ]. Researchers posit a result of increased mathematical understanding when these practices are used with a combination of correctly and erroneously worked exercises [ 1 , 4 , 8 , 11 , 15 , 16 , 18 , 19 , 23 ].

Review of literature

Correctly worked examples consist of a problem statement with the steps taken to reach a solution along with the final result and are an effective method for the initial acquisitions of procedural skills and knowledge [ 1 , 11 , 26 ]. Cognitive load theory [ 1 , 11 , 25 ] explains the challenge of stimulating the cognitive process without overloading the student with too much essential and extraneous information that will limit the working memory and leave a restricted capacity for learning. Correctly worked examples focus the student’s attention on the correct solution procedure which helps to avoid the need to search their prior knowledge for solution methods. Correctly worked examples free the students from performance demands and allow them to concentrate on gaining new knowledge [ 1 , 11 , 16 ].

Error analysis is an instructional strategy that holds promise of helping students to retain their learning [ 16 ]. Error analysis consists of being presented a problem statement with the steps taken to reach a solution in which one or more of the steps are incorrect, often called erroneous examples [ 17 ]. Students analyze and explain the errors and then complete the exercise correctly providing reasoning for their own solution. Error analysis leads students to enact two Standards of Mathematical Practice, namely, (a) make sense of problems and persevere in solving them and (b) attend to precision [ 19 ].

Another of the Standards of Mathematical Practice suggests that students learn to construct viable arguments and comment on the reasoning of others [ 19 ]. According to Große and Renkl [ 11 ], students who attempted to establish a rationale for the steps of the solution learned more than those who did not search for an explanation. Teachers can assist in this practice by facilitating meaningful mathematical discourse [ 18 ]. “Arguments do not have to be lengthy, they simply need to be clear, specific, and contain data or reasoning to back up the thinking” [ 20 ]. Those data and reasons could be in the form of charts, diagrams, tables, drawings, examples, or word explanations.

Researchers [ 7 , 21 ] found the process of explaining and justifying solutions for both correct and erroneous examples to be more beneficial for achieving learning outcomes than explaining and justifying solutions to correctly worked examples only. They also found that explaining why an exercise is correct or incorrect fostered transfer and led to better learning outcomes than explaining correct solutions only. According to Silver et al. [ 22 ], students are able to form understanding by classifying procedures into categories of correct examples and erroneous examples. The students then test their initial categories against further correct and erroneous examples to finally generate a set of attributes that defines the concept. Exposing students to both correctly worked examples and error analysis is especially beneficial when a mathematical concept is often done incorrectly or is easily confused [ 11 ].

Große and Renkl [ 11 ] suggested in their study involving university students in Germany that since errors are inherent in human life, introducing errors in the learning process encourages students to reflect on what they know and then be able to create clearer and more complete explanations of the solutions. The presentation of “incorrect knowledge can induce cognitive conflicts which prompt the learner to build up a coherent knowledge structure” [ 11 ]. Presenting a cognitive conflict through erroneously worked exercises triggers learning episodes through reflection and explanations, which leads to deeper understanding [ 29 ]. Error analysis “can foster a deeper and more complete understanding of mathematical content, as well as of the nature of mathematics itself” [ 4 ].

Several studies have been conducted on the use of error analysis in mathematical units [ 1 , 16 , 17 ]. The study conducted for this article differed from these previous studies in mathematical content, number of teachers and students involved in the study, and their use of a computer or online component. The most impactful differences between the error analysis studies conducted in the past and this article’s study are the length of time between the posttest and the delayed posttest and the use of qualitative data to add depth to the findings. The previous studies found students who conducted error analysis work did not perform significantly different on the posttest than students who received a more traditional approach to learning mathematics. However, the students who conducted error analysis outperformed the control group in each of the studies on delayed posttests that were given 1–2 weeks after the initial posttest.

Loibl and Rummel [ 15 ] discovered that high school students became aware of their knowledge gaps in a general manner by attempting an exercise and failing. Instruction comparing the erroneous work with correctly worked exercises filled the learning gaps. Gadgil et al. [ 9 ] conducted a study in which students who compared flawed work to expertly done work were more likely to repair their own errors than students who only explained the expertly done work. This discovery was further supported by other researchers [8, 14, 24]. Each of these researchers found students ranging from elementary mathematics to university undergraduate medical school who, when given correctly worked examples and erroneous examples, learned more than students who only examined correctly worked examples. This was especially true when the erroneous examples were similar to the kinds of errors that they had committed [ 14 ]. Stark et al. [ 24 ] added that it is important for students to receive sufficient scaffolding in correctly worked examples before and alongside of the erroneous examples.

The purpose of this study was to explore whether seventh-grade mathematics students could learn better from the use of both correctly worked examples and error analysis than from the more traditional instructional approach of solving their exercises in which the students are instructed with only correctly worked examples. The study furthered previous research on the subject of learning from the use of both correctly worked examples and error analysis by also investigating the feedback from the teacher’s and students’ experiences with error analysis. The following questions were answered in this study:

What was the difference in mathematical achievement when error analysis was included in students’ lessons and assignments versus a traditional approach of learning through correct examples only?

What kind of benefits or disadvantages did the students and teacher observe when error analysis was included in students’ lessons and assignments versus a traditional approach of learning through correct examples only?

A mixed method design was used to investigate the use of error analysis in a seventh-grade mathematics unit on equations and inequalities. Quantitative data were used to establish statistical significance of the effectiveness of using error analysis and qualitative methods were used to understand participants’ experience with error analysis [ 6 , 27 ].

Participants

Two-seventh-grade mathematics classes at an International Baccalaureate (IB) school in a suburban charter school in Northern Utah made up the control and treatment groups using a convenience grouping. One class of 26 students was the control group and one class of 27 students was the treatment group.

The same teacher taught both the groups, so a comparison could be made from the teacher’s point of view of how the students learned and participated in the two different groups. At the beginning of the study, the teacher was willing to give error analysis a try in her classroom; however, she was not enthusiastic about using this strategy. She could not visualize how error analysis could work on a daily basis. By the end of the study, the teacher became very enthusiastic about using error analysis in her seventh grade mathematics classes.

The total group of participants involved 29 males and 24 females. About 92% of the participants were Caucasian and the other 8% were of varying ethnicities. Seventeen percent of the student body was on free or reduced lunch. Approximately 10% of the students had individual education plans (IEP).

A pretest and posttest were created to contain questions that would test for mathematical understanding on equations and inequalities using Glencoe Math: Your Common Core Edition CCSS [ 5 ] as a resource. The pretest was reused as the delayed posttest. Homework assignments were created for both the control group and the treatment group from the Glencoe Math: Your Common Core Edition CCSS textbook. However, the researcher rewrote two to three of the homework exercises as erroneous examples for the treatment group to find the error and fix the exercise with justifications (see Figs.  1 , 2 ). Students from both groups used an Assignment Time Log to track the amount of time which they spent on their homework assignments.

Example of the rewritten homework exercises as equation erroneous examples

Example of the rewritten homework exercises as inequality erroneous examples

Both the control and the treatment groups were given the same pretest for an equations and inequality unit. The teacher taught both the control and treatment groups the information for the new concepts in the same manner. The majority of the instruction was done using the direct instruction strategy. The students in both groups were allowed to work collaboratively in pairs or small groups to complete the assignments after instruction had been given. During the time she allotted for answering questions from the previous assignment, she would only show the control group the exercises worked correctly. However, for the treatment group, the teacher would write errors which she found in the students’ work on the board. She would then either pair up the students or create small groups and have the student discuss what errors they noticed and how they would fix them. Often, the teacher brought the class together as a whole to discuss what they discovered and how they could learn from it.

The treatment group was given a homework assignment with the same exercises as the control group, but including the erroneous examples. Students in both the control and treatment groups were given the Assignment Time Log to keep a record of how much time was spent completing each homework assignment.

At the end of each week, both groups took the same quiz. The quizzes for the control group received a grade, and the quiz was returned without any further attention. If a student asked how to do an exercise, the teacher only showed the correct example. The teacher graded the quizzes for the treatment group using the strategy found in the Teaching Channel’s video “Highlighting Mistakes: A Grading Strategy” [ 2 ]. She marked the quizzes by highlighting the mistakes; no score was given. The students were allowed time in class or at home to make corrections with justifications.

The same posttest was administered to both groups at the conclusion of the equation and inequality chapter, and a delayed posttest was administered 6 weeks later. The delayed posttest also asked the students in the treatment group to respond to an open-ended request to “Please provide some feedback on your experience”. The test scores were analyzed for significant differences using independent samples t tests. The responses to the open-ended request were coded and analyzed for similarities and differences, and then, used to determine the students’ perceptions of the benefits or disadvantages of using error analysis in their learning.

At the conclusion of gathering data from the assessments, the researcher interviewed the teacher to determine the differences which the teacher observed in the preparation of the lessons and students’ participation in the lessons [ 6 ]. The interview with the teacher contained a variety of open-ended questions. These are the questions asked during the interview: (a) what is your opinion of using error analysis in your classroom at the conclusion of the study versus before the study began? (b) describe a typical classroom discussion in both the control group class and the treatment group class, (c) talk about the amount of time you spent grading, preparing, and teaching both groups, and (d) describe the benefits or disadvantages of using error analysis on a daily basis compared to not using error analysis in the classroom. The responses from the teacher were entered into a computer, coded, and analyzed for thematic content [ 6 , 27 ]. The themes that emerged from coding the teacher’s responses were used to determine the kind of benefits or disadvantages observed when error analysis was included in students’ lessons and assignments versus a traditional approach of learning through correct examples only from the teacher’s point of view.

Findings and discussion

Mathematical achievement.

Preliminary analyses were carried out to evaluate assumptions for the t test. Those assumptions include: (a) the independence, (b) normality tested using the Shapiro–Wilk test, and (c) homogeneity of variance tested using the Levene Statistic. All assumptions were met.

The Levene Statistic for the pretest scores ( p  > 0.05) indicated that there was not a significant difference in the groups. Independent samples t tests were conducted to determine the effect error analysis had on student achievement determined by the difference in the means of the pretest and posttest and of the pretest and delayed posttest. There was no significant difference in the scores from the posttest for the control group ( M  = 8.23, SD = 5.67) and the treatment group ( M  = 9.56, SD = 5.24); t (51) = 0.88, p  = 0.381. However, there was a significant difference in the scores from the delayed posttest for the control group ( M  = 5.96, SD = 4.90) and the treatment group ( M  = 9.41, SD = 4.77); t (51) = 2.60, p  = 0.012. These results suggest that students can initially learn mathematical concepts through a variety of methods. Nevertheless, the retention of the mathematical knowledge is significantly increased when error analysis is added to the students’ lessons, assignments, and quizzes. It is interesting to note that the difference between the means from the pretest to the posttest was higher in the treatment group ( M  = 9.56) versus the control group ( M  = 8.23), implying that even though there was not a significant difference in the means, the treatment group did show a greater improvement.

The Assignment Time Log was completed by only 19% of the students in the treatment group and 38% of the students in the control group. By having such a small percentage of each group participate in tracking the time spent completing homework assignment, the results from the t test analysis cannot be used in any generalization. However, the results from the analysis were interesting. The mean time spent doing the assignments for each group was calculated and analyzed using an independent samples t test. There was no significant difference in the amount of time students which spent on their homework for the control group ( M  = 168.30, SD = 77.41) and the treatment group ( M  = 165.80, SD = 26.53); t (13) = 0.07, p  = 0.946. These results suggest that the amount of time that students spent on their homework was close to the same whether they had to do error analyses (find the errors, fix them, and justify the steps taken) or solve each exercise in a traditional manner of following correctly worked examples. Although the students did not spend a significantly different amount of time outside of class doing homework, the treatment group did spend more time during class working on quiz corrections and discussing error which could attribute to the retention of knowledge.

Feedback from participants

All students participating in the current study submitted a signed informed consent form. Students process mathematical procedures better when they are aware of their own errors and knowledge gaps [ 15 ]. The theoretical model of using errors that students make themselves and errors that are likely due to the typical knowledge gaps can also be found in works by other researchers such as Kawasaki [ 14 ] and VanLehn [ 29 ]. Highlighting errors in the students’ own work and in typical errors made by others allowed the participants in the treatment group the opportunity to experience this theoretical model. From their experiences, the participants were able to give feedback to help the researcher delve deeper into what the thoughts were of the use of error analysis in their mathematics classes than any other study provided [ 1 , 4 , 7 , 8 , 9 , 11 , 14 , 15 , 16 , 17 , 21 , 23 , 24 , 25 , 26 , 29 ]. Overall, the teacher and students found the use of error analysis in the equations and inequalities unit to be beneficial. The teacher pointed out that the discussions in class were deeper in the treatment group’s class. When she tried to facilitate meaningful mathematical discourse [ 18 ] in the control group class, the students were unable to get to the same level of critical thinking as the treatment group discussions. In the open-ended question at the conclusion of the delayed posttest (“Please provide some feedback on your experience.”), the majority (86%) of the participants from the treatment group indicated that the use of erroneous examples integrated into their lessons was beneficial in helping them recognize their own mistakes and understanding how to correct those mistakes. One student reported, “I realized I was doing the same mistakes and now knew how to fix it”. Several (67%) of the students indicated learning through error analysis made the learning process easier for them. A student commented that “When I figure out the mistake then I understand the concept better, and how to do it, and how not to do it”.

When students find and correct the errors in exercises, while justifying themselves, they are being encouraged to learn to construct viable arguments and critique the reasoning of others [ 19 ]. This study found that explaining why an exercise is correct or incorrect fostered transfer and led to better learning outcomes than explaining correct solutions only. However, some of the higher level students struggled with the explanation component. According to the teacher, many of these higher level students who typically do very well on the homework and quizzes scored lower on the unit quizzes and tests than the students expected due to the requirement of explaining the work. In the past, these students had not been justifying their thinking and always got correct answers. Therefore, providing reasons for erroneous examples and justifying their own process were difficult for them.

Often teachers are resistant to the idea of using error analysis in their classroom. Some feel creating erroneous examples and highlighting errors for students to analyze is too time-consuming [ 28 ]. The teacher in this study taught both the control and treatment groups, which allowed her the perspective to compare both methods. She stated, “Grading took about the same amount of time whether I gave a score or just highlighted the mistakes”. She noticed that having the students work on their errors from the quizzes and having them find the errors in the assignments and on the board during class time ultimately meant less work for her and more work for the students.

Another reason behind the reluctance to use error analysis is the fact that teachers are uncertain about exposing errors to their students. They are fearful that the discussion of errors could lead their students to make those same errors and obtain incorrect solutions [ 28 ]. Yet, most of the students’ feedback stated the discussions in class and the error analyses on the assignments and quizzes helped them in working homework exercises correctly. Specifically, they said figuring out what went wrong in the exercise helped them solve that and other exercises. One student said that error analysis helped them “do better in math on the test, and I actually enjoyed it”. Nevertheless, 2 of the 27 participating students in the treatment group had negative comments about learning through error analysis. One student did not feel that correcting mistakes showed them anything, and it did not reinforce the lesson. The other student stated being exposed to error analysis did, indeed, confuse them. The student kept thinking the erroneous example was a correct answer and was unsure about what they were supposed to do to solve the exercise.

When the researcher asked the teacher if there were any benefits or disadvantages to using error analysis in teaching the equations and inequalities unit, she said that she thoroughly enjoyed teaching using the error analysis method and was planning to implement it in all of her classes in the future. In fact, she found that her “hands were tied” while grading the control group quizzes and facilitating the lessons. She said, “I wanted to have the students find their errors and fix them, so we could have a discussion about what they were doing wrong”. The students also found error analysis to have more benefits than disadvantages. Other than one student whose response was eliminated for not being on topic and the two students with negative comments, the other 24 of the students in the treatment group had positive comments about their experience with error analysis. When students had the opportunity to analyze errors in worked exercises (error analysis) through the assignments and quizzes, they were able to get a deeper understanding of the content and, therefore, retained the information longer than those who only learned through correct examples.

Discussions generated in the treatment group’s classroom afforded the students the opportunity to critically reason through the work of others and to develop possible arguments on what had been done in the erroneous exercise and what approaches might be taken to successfully find a solution to the exercise. It may seem surprising that an error as simple as adding a number when it should have been subtracted could prompt a variety of questions and lead to the students suggesting possible ways to solve and check to see if the solution makes sense. In an erroneous exercise presented to the treatment group, the students were provided with the information that two of the three angles of a triangle were 35° and 45°. The task was to write and solve an equation to find the missing measure. The erroneous exercise solver had created the equation: x  + 35 + 45 = 180. Next was written x  + 80 = 180. The solution was x  = 260°. In the discussion, the class had on this exercise, the conclusion was made that the error occurred when 80 was added to 180 to get a sum of 260. However, the discussion progressed finding different equations and steps that could have been taken to discover the missing angle measure to be 100° and why 260° was an unreasonable solution. Another approach discussed by the students was to recognize that to say the missing angle measure was 260° contradicted with the fact that one angle could not be larger than the sum of the angle measures of a triangle. Analyzing the erroneous exercises gave the students the opportunity of engaging in the activity of “explaining” and “fixing” the errors of the presented exercise as well as their own errors, an activity that fostered the students’ learning.

The students participating in both the control and treatment groups from the two-seventh-grade mathematics classes at the IB school in a suburban charter school in Northern Utah initially learned the concepts taught in the equations and inequality unit statistically just as well with both methods of teaching. The control group had the information taught to them with the use of only correctly worked examples. If they had a question about an exercise which they did wrong, the teacher would show them how to do the exercise correctly and have a discussion on the steps required to obtain the correct solutions. On their assignments and quizzes, the control group was expected to complete the work by correctly solving the equations and inequalities in the exercise, get a score on their work, and move on to the next concept. On the other hand, the students participating in the treatment group were given erroneous examples within their assignments and asked to find the errors, explain what had been done wrong, and then correctly solve the exercise with justifications for the steps they chose to use. During lessons, the teacher put erroneous examples from the students’ work on the board and generated paired, small groups, or whole group discussion of what was wrong with the exercise and the different ways to do it correctly. On the quizzes, the teacher highlighted the errors and allowed the students to explain the errors and justify the correct solution.

Both the method of teaching using error analysis and the traditional method of presenting the exercise and having the students solve it proved to be just as successful on the immediate unit summative posttest. However, the delayed posttest given 6 weeks after the posttest showed that the retention of knowledge was significantly higher for the treatment group. It is important to note that the fact that the students in the treatment group were given more time to discuss the exercises in small groups and as a whole class could have influenced the retention of mathematical knowledge just as much or more than the treatment of using error analysis. Researchers have proven academic advantages of group work for students, in large part due to the perception of students having a secure support system, which cannot be obtained when working individually [ 10 , 12 , 13 ].

The findings of this study supported the statistical findings of other researchers [ 1 , 16 , 17 ], suggesting that error analysis may aid in providing a richer learning experience that leads to a deeper understanding of equations and inequalities for long-term knowledge. The findings of this study also investigated the teacher’s and students’ perceptions of using error analysis in their teaching and learning. The students and teacher used for this study were chosen to have the same teacher for both the control and treatment groups. Using the same teacher for both groups, the researcher was able to determine the teacher’s attitude toward the use of error analysis compared to the non-use of error analysis in her instruction. The teacher’s comments during the interview implied that she no longer had an unenthusiastic and skeptical attitude toward the use of error analysis on a daily basis in her classroom. She was “excited to implement the error analysis strategy into the rest of her classes for the rest of the school year”. She observed error analysis to be an effective way to deal with common misconceptions and offer opportunities for students to reflect on their learning from their errors. The process of error analysis assisted the teacher in supporting productive struggle in learning mathematics [ 18 ] and created opportunity for students to have deep discussions about alternative ways to solve exercises. Error analysis also aided in students’ discovery of their own errors and gave them possible ways to correct those errors. Learning through the use of error analysis was enjoyable for many of the participating students.

According to the NCTM [ 18 ], effective teaching of mathematics happens when a teacher implements exercises that will engage students in solving and discussing tasks that promote mathematical reasoning and problem solving. Providing erroneous examples allowed discussion, multiple entry points, and varied solution strategies. Both the teacher and the students participating in the treatment group came to the conclusion that error analysis is a beneficial strategy to use in the teaching and learning of mathematics. Regardless of the two negative student comments about error analysis not being helpful for them, this researcher recommends the use of error analysis in teaching and learning mathematics.

The implications of the treatment of teaching students mathematics through the use of error analysis are that students’ learning could be fostered and retention of content knowledge may be longer. When a teacher is able to have their students’ practice critiquing the reasoning of others and creating viable arguments [ 19 ] by analyzing errors in mathematics, the students not only are able to meet the Standard of Mathematical Practice, but are also creating a lifelong skill of analyzing the effectiveness of “plausible arguments, distinguish correct logic or reasoning from that which is flawed, and—if there is a flaw in an argument—explain what it is” ([ 19 ], p. 7).

Limitations and future research

This study had limitations. The sample size was small to use the same teacher for both groups. Another limitation was the length of the study only encompassed one unit. Using error analysis could have been a novelty and engaged the students more than it would when the novelty wore off. Still another limitation was the study that was conducted at an International Baccalaureate (IB) school in a suburban charter school in Northern Utah, which may limit the generalization of the findings and implications to other schools with different demographics.

This study did not have a separation of conceptual and procedural questions on the assessments. For a future study, the creation of an assessment that would be able to determine if error analysis was more helpful in teaching conceptual mathematics or procedural mathematics could be beneficial to teachers as they plan their lessons. Another suggestion for future research would be to gather more data using several teachers teaching both the treatment group and the control group.

Adams, D.M., McLaren, B.M., Durkin, K., Mayer, R.E., Rittle-Johnson, B., Isotani, S., van Velsen, M.: Using erroneous examples to improve mathematics learning with a web-based tutoring system. Comput. Hum. Behav. 36 , 401–411 (2014)

Article   Google Scholar  

Alcala, L.: Highlighting mistakes: a grading strategy. The teaching channel. https://www.teachingchannel.org/videos/math-test-grading-tips

Atkinson, R.K., Derry, S.J., Renkl, A., Wortham, D.: Learning from examples: instructional Principles from the worked examples research. Rev. Educ. Res. 70 (2), 181–214 (2000)

Borasi, R.: Exploring mathematics through the analysis of errors. Learn. Math. 7 (3), 2–8 (1987)

Google Scholar  

Carter, J.A., Cuevas, G.J., Day, R., Malloy, C., Kersaint, G., Luchin, B.M., Willard, T.: Glencoe math: your common core edition CCSS. Glencoe/McGraw-Hill, Columbus (2013)

Creswell, J.: Research design: qualitative, quantitative, and mixed methods approaches, 4th edn. Sage Publications, Thousand Oaks (2014)

Curry, L. A.: The effects of self-explanations of correct and incorrect solutions on algebra problem-solving performance. In: Proceedings of the 26th annual conference of the cognitive science society, vol. 1548. Erlbaum, Mahwah (2004)

Durkin, K., Rittle-Johnson, B.: The effectiveness of using incorrect examples to support learning about decimal magnitude. Learn. Instr. 22 (3), 206–214 (2012)

Gadgil, S., Nokes-Malach, T.J., Chi, M.T.: Effectiveness of holistic mental model confrontation in driving conceptual change. Learn. Instr. 22 (1), 47–61 (2012)

Gaudet, A.D., Ramer, L.M., Nakonechny, J., Cragg, J.J., Ramer, M.S.: Small-group learning in an upper-level university biology class enhances academic performance and student attitutdes toward group work. Public Libr. Sci. One 5 , 1–9 (2010)

Große, C.S., Renkl, A.: Finding and fixing errors in worked examples: can this foster learning outcomes? Learn. Instr. 17 (6), 612–634 (2007)

Janssen, J., Kirschner, F., Erkens, G., Kirschner, P.A., Paas, F.: Making the black box of collaborative learning transparent: combining process-oriented and cognitive load approaches. Educ. Psychol. Rev. 22 , 139–154 (2010)

Johnson, D.W., Johnson, R.T.: An educational psychology success story: social interdependence theory and cooperative learning. Educ. Res. 38 , 365–379 (2009)

Kawasaki, M.: Learning to solve mathematics problems: the impact of incorrect solutions in fifth grade peers’ presentations. Jpn. J. Dev. Psychol. 21 (1), 12–22 (2010)

Loibl, K., Rummel, N.: Knowing what you don’t know makes failure productive. Learn. Instr. 34 , 74–85 (2014)

McLaren, B.M., Adams, D., Durkin, K., Goguadze, G., Mayer, R.E., Rittle-Johnson, B., Van Velsen, M.: To err is human, to explain and correct is divine: a study of interactive erroneous examples with middle school math students. 21st Century learning for 21st Century skills, pp. 222–235. Springer, Berlin (2012)

Chapter   Google Scholar  

McLaren, B.M., Adams, D.M., Mayer, R.E.: Delayed learning effects with erroneous examples: a study of learning decimals with a web-based tutor. Int. J. Artif. Intell. Educ. 25 (4), 520–542 (2015)

National Council of Teachers of Mathematics (NCTM): Principles to actions: ensuring mathematical success for all. Author, Reston (2014)

National Governors Association Center for Best Practices & Council of Chief State School Officers (NGA Center and CCSSO): Common core state standards. Authors, Washington, DC (2010)

O’Connell, S., SanGiovanni, J.: Putting the practices into action: Implementing the common core standards for mathematical practice K-8. Heinemann, Portsmouth (2013)

Siegler, R.S.: Microgenetic studies of self-explanation. Microdevelopment: transition processes in development and learning, pp. 31–58. Cambridge University Press, New York (2002)

Silver, H.F., Strong, R.W., Perini, M.J.: The strategic teacher: Selecting the right research-based strategy for every lesson. ASCD, Alexandria (2009)

Sisman, G.T., Aksu, M.: A study on sixth grade students’ misconceptions and errors in spatial measurement: length, area, and volume. Int. J. Sci. Math. Educ. 14 (7), 1293–1319 (2015)

Stark, R., Kopp, V., Fischer, M.R.: Case-based learning with worked examples in complex domains: two experimental studies in undergraduate medical education. Learn. Instr. 21 (1), 22–33 (2011)

Sweller, J.: Cognitive load during problem solving: effects on learning. Cognitive Sci. 12 , 257–285 (1988)

Sweller, J., Cooper, G.A.: The use of worked examples as a substitute for problem solving in learning algebra. Cognit. Instr. 2 (1), 59–89 (1985)

Tashakkori, A., Teddlie, C.: Sage handbook of mixed methods in social & behavioral research, 2nd edn. Sage Publications, Thousand Oaks (2010)

Book   Google Scholar  

Tsovaltzi, D., Melis, E., McLaren, B.M., Meyer, A.K., Dietrich, M., Goguadze, G.: Learning from erroneous examples: when and how do students benefit from them? Sustaining TEL: from innovation to learning and practice, pp. 357–373. Springer, Berlin (2010)

VanLehn, K.: Rule-learning events in the acquisition of a complex skill: an evaluation of CASCADE. J. Learn. Sci. 8 (1), 71–125 (1999)

Download references

Acknowledgements

Not applicable.

Competing interests

The author declares that no competing interests.

Availability of data and materials

A spreadsheet of the data will be provided as an Additional file 1 : Error analysis data.

Consent for publication

Ethics approval and consent to participate.

All students participating in the current study submitted a signed informed consent form.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and affiliations.

Weber State University, 1351 Edvalson St. MC 1304, Ogden, UT, 84408, USA

Sheryl J. Rushton

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Sheryl J. Rushton .

Additional file

Additional file 1:.

Error analysis data.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/ ), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article.

Rushton, S.J. Teaching and learning mathematics through error analysis. Fields Math Educ J 3 , 4 (2018). https://doi.org/10.1186/s40928-018-0009-y

Download citation

Received : 07 March 2017

Accepted : 16 January 2018

Published : 30 January 2018

DOI : https://doi.org/10.1186/s40928-018-0009-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Error analysis
  • Correct and erroneous examples
  • Mathematics teaching practices
  • Standards of mathematical practices

literature review of error analysis

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Korean J Anesthesiol
  • v.71(2); 2018 Apr

Introduction to systematic review and meta-analysis

1 Department of Anesthesiology and Pain Medicine, Inje University Seoul Paik Hospital, Seoul, Korea

2 Department of Anesthesiology and Pain Medicine, Chung-Ang University College of Medicine, Seoul, Korea

Systematic reviews and meta-analyses present results by combining and analyzing data from different studies conducted on similar research topics. In recent years, systematic reviews and meta-analyses have been actively performed in various fields including anesthesiology. These research methods are powerful tools that can overcome the difficulties in performing large-scale randomized controlled trials. However, the inclusion of studies with any biases or improperly assessed quality of evidence in systematic reviews and meta-analyses could yield misleading results. Therefore, various guidelines have been suggested for conducting systematic reviews and meta-analyses to help standardize them and improve their quality. Nonetheless, accepting the conclusions of many studies without understanding the meta-analysis can be dangerous. Therefore, this article provides an easy introduction to clinicians on performing and understanding meta-analyses.

Introduction

A systematic review collects all possible studies related to a given topic and design, and reviews and analyzes their results [ 1 ]. During the systematic review process, the quality of studies is evaluated, and a statistical meta-analysis of the study results is conducted on the basis of their quality. A meta-analysis is a valid, objective, and scientific method of analyzing and combining different results. Usually, in order to obtain more reliable results, a meta-analysis is mainly conducted on randomized controlled trials (RCTs), which have a high level of evidence [ 2 ] ( Fig. 1 ). Since 1999, various papers have presented guidelines for reporting meta-analyses of RCTs. Following the Quality of Reporting of Meta-analyses (QUORUM) statement [ 3 ], and the appearance of registers such as Cochrane Library’s Methodology Register, a large number of systematic literature reviews have been registered. In 2009, the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [ 4 ] was published, and it greatly helped standardize and improve the quality of systematic reviews and meta-analyses [ 5 ].

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f1.jpg

Levels of evidence.

In anesthesiology, the importance of systematic reviews and meta-analyses has been highlighted, and they provide diagnostic and therapeutic value to various areas, including not only perioperative management but also intensive care and outpatient anesthesia [6–13]. Systematic reviews and meta-analyses include various topics, such as comparing various treatments of postoperative nausea and vomiting [ 14 , 15 ], comparing general anesthesia and regional anesthesia [ 16 – 18 ], comparing airway maintenance devices [ 8 , 19 ], comparing various methods of postoperative pain control (e.g., patient-controlled analgesia pumps, nerve block, or analgesics) [ 20 – 23 ], comparing the precision of various monitoring instruments [ 7 ], and meta-analysis of dose-response in various drugs [ 12 ].

Thus, literature reviews and meta-analyses are being conducted in diverse medical fields, and the aim of highlighting their importance is to help better extract accurate, good quality data from the flood of data being produced. However, a lack of understanding about systematic reviews and meta-analyses can lead to incorrect outcomes being derived from the review and analysis processes. If readers indiscriminately accept the results of the many meta-analyses that are published, incorrect data may be obtained. Therefore, in this review, we aim to describe the contents and methods used in systematic reviews and meta-analyses in a way that is easy to understand for future authors and readers of systematic review and meta-analysis.

Study Planning

It is easy to confuse systematic reviews and meta-analyses. A systematic review is an objective, reproducible method to find answers to a certain research question, by collecting all available studies related to that question and reviewing and analyzing their results. A meta-analysis differs from a systematic review in that it uses statistical methods on estimates from two or more different studies to form a pooled estimate [ 1 ]. Following a systematic review, if it is not possible to form a pooled estimate, it can be published as is without progressing to a meta-analysis; however, if it is possible to form a pooled estimate from the extracted data, a meta-analysis can be attempted. Systematic reviews and meta-analyses usually proceed according to the flowchart presented in Fig. 2 . We explain each of the stages below.

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f2.jpg

Flowchart illustrating a systematic review.

Formulating research questions

A systematic review attempts to gather all available empirical research by using clearly defined, systematic methods to obtain answers to a specific question. A meta-analysis is the statistical process of analyzing and combining results from several similar studies. Here, the definition of the word “similar” is not made clear, but when selecting a topic for the meta-analysis, it is essential to ensure that the different studies present data that can be combined. If the studies contain data on the same topic that can be combined, a meta-analysis can even be performed using data from only two studies. However, study selection via a systematic review is a precondition for performing a meta-analysis, and it is important to clearly define the Population, Intervention, Comparison, Outcomes (PICO) parameters that are central to evidence-based research. In addition, selection of the research topic is based on logical evidence, and it is important to select a topic that is familiar to readers without clearly confirmed the evidence [ 24 ].

Protocols and registration

In systematic reviews, prior registration of a detailed research plan is very important. In order to make the research process transparent, primary/secondary outcomes and methods are set in advance, and in the event of changes to the method, other researchers and readers are informed when, how, and why. Many studies are registered with an organization like PROSPERO ( http://www.crd.york.ac.uk/PROSPERO/ ), and the registration number is recorded when reporting the study, in order to share the protocol at the time of planning.

Defining inclusion and exclusion criteria

Information is included on the study design, patient characteristics, publication status (published or unpublished), language used, and research period. If there is a discrepancy between the number of patients included in the study and the number of patients included in the analysis, this needs to be clearly explained while describing the patient characteristics, to avoid confusing the reader.

Literature search and study selection

In order to secure proper basis for evidence-based research, it is essential to perform a broad search that includes as many studies as possible that meet the inclusion and exclusion criteria. Typically, the three bibliographic databases Medline, Embase, and Cochrane Central Register of Controlled Trials (CENTRAL) are used. In domestic studies, the Korean databases KoreaMed, KMBASE, and RISS4U may be included. Effort is required to identify not only published studies but also abstracts, ongoing studies, and studies awaiting publication. Among the studies retrieved in the search, the researchers remove duplicate studies, select studies that meet the inclusion/exclusion criteria based on the abstracts, and then make the final selection of studies based on their full text. In order to maintain transparency and objectivity throughout this process, study selection is conducted independently by at least two investigators. When there is a inconsistency in opinions, intervention is required via debate or by a third reviewer. The methods for this process also need to be planned in advance. It is essential to ensure the reproducibility of the literature selection process [ 25 ].

Quality of evidence

However, well planned the systematic review or meta-analysis is, if the quality of evidence in the studies is low, the quality of the meta-analysis decreases and incorrect results can be obtained [ 26 ]. Even when using randomized studies with a high quality of evidence, evaluating the quality of evidence precisely helps determine the strength of recommendations in the meta-analysis. One method of evaluating the quality of evidence in non-randomized studies is the Newcastle-Ottawa Scale, provided by the Ottawa Hospital Research Institute 1) . However, we are mostly focusing on meta-analyses that use randomized studies.

If the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) system ( http://www.gradeworkinggroup.org/ ) is used, the quality of evidence is evaluated on the basis of the study limitations, inaccuracies, incompleteness of outcome data, indirectness of evidence, and risk of publication bias, and this is used to determine the strength of recommendations [ 27 ]. As shown in Table 1 , the study limitations are evaluated using the “risk of bias” method proposed by Cochrane 2) . This method classifies bias in randomized studies as “low,” “high,” or “unclear” on the basis of the presence or absence of six processes (random sequence generation, allocation concealment, blinding participants or investigators, incomplete outcome data, selective reporting, and other biases) [ 28 ].

The Cochrane Collaboration’s Tool for Assessing the Risk of Bias [ 28 ]

Data extraction

Two different investigators extract data based on the objectives and form of the study; thereafter, the extracted data are reviewed. Since the size and format of each variable are different, the size and format of the outcomes are also different, and slight changes may be required when combining the data [ 29 ]. If there are differences in the size and format of the outcome variables that cause difficulties combining the data, such as the use of different evaluation instruments or different evaluation timepoints, the analysis may be limited to a systematic review. The investigators resolve differences of opinion by debate, and if they fail to reach a consensus, a third-reviewer is consulted.

Data Analysis

The aim of a meta-analysis is to derive a conclusion with increased power and accuracy than what could not be able to achieve in individual studies. Therefore, before analysis, it is crucial to evaluate the direction of effect, size of effect, homogeneity of effects among studies, and strength of evidence [ 30 ]. Thereafter, the data are reviewed qualitatively and quantitatively. If it is determined that the different research outcomes cannot be combined, all the results and characteristics of the individual studies are displayed in a table or in a descriptive form; this is referred to as a qualitative review. A meta-analysis is a quantitative review, in which the clinical effectiveness is evaluated by calculating the weighted pooled estimate for the interventions in at least two separate studies.

The pooled estimate is the outcome of the meta-analysis, and is typically explained using a forest plot ( Figs. 3 and ​ and4). 4 ). The black squares in the forest plot are the odds ratios (ORs) and 95% confidence intervals in each study. The area of the squares represents the weight reflected in the meta-analysis. The black diamond represents the OR and 95% confidence interval calculated across all the included studies. The bold vertical line represents a lack of therapeutic effect (OR = 1); if the confidence interval includes OR = 1, it means no significant difference was found between the treatment and control groups.

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f3.jpg

Forest plot analyzed by two different models using the same data. (A) Fixed-effect model. (B) Random-effect model. The figure depicts individual trials as filled squares with the relative sample size and the solid line as the 95% confidence interval of the difference. The diamond shape indicates the pooled estimate and uncertainty for the combined effect. The vertical line indicates the treatment group shows no effect (OR = 1). Moreover, if the confidence interval includes 1, then the result shows no evidence of difference between the treatment and control groups.

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f4.jpg

Forest plot representing homogeneous data.

Dichotomous variables and continuous variables

In data analysis, outcome variables can be considered broadly in terms of dichotomous variables and continuous variables. When combining data from continuous variables, the mean difference (MD) and standardized mean difference (SMD) are used ( Table 2 ).

Summary of Meta-analysis Methods Available in RevMan [ 28 ]

The MD is the absolute difference in mean values between the groups, and the SMD is the mean difference between groups divided by the standard deviation. When results are presented in the same units, the MD can be used, but when results are presented in different units, the SMD should be used. When the MD is used, the combined units must be shown. A value of “0” for the MD or SMD indicates that the effects of the new treatment method and the existing treatment method are the same. A value lower than “0” means the new treatment method is less effective than the existing method, and a value greater than “0” means the new treatment is more effective than the existing method.

When combining data for dichotomous variables, the OR, risk ratio (RR), or risk difference (RD) can be used. The RR and RD can be used for RCTs, quasi-experimental studies, or cohort studies, and the OR can be used for other case-control studies or cross-sectional studies. However, because the OR is difficult to interpret, using the RR and RD, if possible, is recommended. If the outcome variable is a dichotomous variable, it can be presented as the number needed to treat (NNT), which is the minimum number of patients who need to be treated in the intervention group, compared to the control group, for a given event to occur in at least one patient. Based on Table 3 , in an RCT, if x is the probability of the event occurring in the control group and y is the probability of the event occurring in the intervention group, then x = c/(c + d), y = a/(a + b), and the absolute risk reduction (ARR) = x − y. NNT can be obtained as the reciprocal, 1/ARR.

Calculation of the Number Needed to Treat in the Dichotomous table

Fixed-effect models and random-effect models

In order to analyze effect size, two types of models can be used: a fixed-effect model or a random-effect model. A fixed-effect model assumes that the effect of treatment is the same, and that variation between results in different studies is due to random error. Thus, a fixed-effect model can be used when the studies are considered to have the same design and methodology, or when the variability in results within a study is small, and the variance is thought to be due to random error. Three common methods are used for weighted estimation in a fixed-effect model: 1) inverse variance-weighted estimation 3) , 2) Mantel-Haenszel estimation 4) , and 3) Peto estimation 5) .

A random-effect model assumes heterogeneity between the studies being combined, and these models are used when the studies are assumed different, even if a heterogeneity test does not show a significant result. Unlike a fixed-effect model, a random-effect model assumes that the size of the effect of treatment differs among studies. Thus, differences in variation among studies are thought to be due to not only random error but also between-study variability in results. Therefore, weight does not decrease greatly for studies with a small number of patients. Among methods for weighted estimation in a random-effect model, the DerSimonian and Laird method 6) is mostly used for dichotomous variables, as the simplest method, while inverse variance-weighted estimation is used for continuous variables, as with fixed-effect models. These four methods are all used in Review Manager software (The Cochrane Collaboration, UK), and are described in a study by Deeks et al. [ 31 ] ( Table 2 ). However, when the number of studies included in the analysis is less than 10, the Hartung-Knapp-Sidik-Jonkman method 7) can better reduce the risk of type 1 error than does the DerSimonian and Laird method [ 32 ].

Fig. 3 shows the results of analyzing outcome data using a fixed-effect model (A) and a random-effect model (B). As shown in Fig. 3 , while the results from large studies are weighted more heavily in the fixed-effect model, studies are given relatively similar weights irrespective of study size in the random-effect model. Although identical data were being analyzed, as shown in Fig. 3 , the significant result in the fixed-effect model was no longer significant in the random-effect model. One representative example of the small study effect in a random-effect model is the meta-analysis by Li et al. [ 33 ]. In a large-scale study, intravenous injection of magnesium was unrelated to acute myocardial infarction, but in the random-effect model, which included numerous small studies, the small study effect resulted in an association being found between intravenous injection of magnesium and myocardial infarction. This small study effect can be controlled for by using a sensitivity analysis, which is performed to examine the contribution of each of the included studies to the final meta-analysis result. In particular, when heterogeneity is suspected in the study methods or results, by changing certain data or analytical methods, this method makes it possible to verify whether the changes affect the robustness of the results, and to examine the causes of such effects [ 34 ].

Heterogeneity

Homogeneity test is a method whether the degree of heterogeneity is greater than would be expected to occur naturally when the effect size calculated from several studies is higher than the sampling error. This makes it possible to test whether the effect size calculated from several studies is the same. Three types of homogeneity tests can be used: 1) forest plot, 2) Cochrane’s Q test (chi-squared), and 3) Higgins I 2 statistics. In the forest plot, as shown in Fig. 4 , greater overlap between the confidence intervals indicates greater homogeneity. For the Q statistic, when the P value of the chi-squared test, calculated from the forest plot in Fig. 4 , is less than 0.1, it is considered to show statistical heterogeneity and a random-effect can be used. Finally, I 2 can be used [ 35 ].

I 2 , calculated as shown above, returns a value between 0 and 100%. A value less than 25% is considered to show strong homogeneity, a value of 50% is average, and a value greater than 75% indicates strong heterogeneity.

Even when the data cannot be shown to be homogeneous, a fixed-effect model can be used, ignoring the heterogeneity, and all the study results can be presented individually, without combining them. However, in many cases, a random-effect model is applied, as described above, and a subgroup analysis or meta-regression analysis is performed to explain the heterogeneity. In a subgroup analysis, the data are divided into subgroups that are expected to be homogeneous, and these subgroups are analyzed. This needs to be planned in the predetermined protocol before starting the meta-analysis. A meta-regression analysis is similar to a normal regression analysis, except that the heterogeneity between studies is modeled. This process involves performing a regression analysis of the pooled estimate for covariance at the study level, and so it is usually not considered when the number of studies is less than 10. Here, univariate and multivariate regression analyses can both be considered.

Publication bias

Publication bias is the most common type of reporting bias in meta-analyses. This refers to the distortion of meta-analysis outcomes due to the higher likelihood of publication of statistically significant studies rather than non-significant studies. In order to test the presence or absence of publication bias, first, a funnel plot can be used ( Fig. 5 ). Studies are plotted on a scatter plot with effect size on the x-axis and precision or total sample size on the y-axis. If the points form an upside-down funnel shape, with a broad base that narrows towards the top of the plot, this indicates the absence of a publication bias ( Fig. 5A ) [ 29 , 36 ]. On the other hand, if the plot shows an asymmetric shape, with no points on one side of the graph, then publication bias can be suspected ( Fig. 5B ). Second, to test publication bias statistically, Begg and Mazumdar’s rank correlation test 8) [ 37 ] or Egger’s test 9) [ 29 ] can be used. If publication bias is detected, the trim-and-fill method 10) can be used to correct the bias [ 38 ]. Fig. 6 displays results that show publication bias in Egger’s test, which has then been corrected using the trim-and-fill method using Comprehensive Meta-Analysis software (Biostat, USA).

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f5.jpg

Funnel plot showing the effect size on the x-axis and sample size on the y-axis as a scatter plot. (A) Funnel plot without publication bias. The individual plots are broader at the bottom and narrower at the top. (B) Funnel plot with publication bias. The individual plots are located asymmetrically.

An external file that holds a picture, illustration, etc.
Object name is kjae-2018-71-2-103f6.jpg

Funnel plot adjusted using the trim-and-fill method. White circles: comparisons included. Black circles: inputted comparisons using the trim-and-fill method. White diamond: pooled observed log risk ratio. Black diamond: pooled inputted log risk ratio.

Result Presentation

When reporting the results of a systematic review or meta-analysis, the analytical content and methods should be described in detail. First, a flowchart is displayed with the literature search and selection process according to the inclusion/exclusion criteria. Second, a table is shown with the characteristics of the included studies. A table should also be included with information related to the quality of evidence, such as GRADE ( Table 4 ). Third, the results of data analysis are shown in a forest plot and funnel plot. Fourth, if the results use dichotomous data, the NNT values can be reported, as described above.

The GRADE Evidence Quality for Each Outcome

N: number of studies, ROB: risk of bias, PON: postoperative nausea, POV: postoperative vomiting, PONV: postoperative nausea and vomiting, CI: confidence interval, RR: risk ratio, AR: absolute risk.

When Review Manager software (The Cochrane Collaboration, UK) is used for the analysis, two types of P values are given. The first is the P value from the z-test, which tests the null hypothesis that the intervention has no effect. The second P value is from the chi-squared test, which tests the null hypothesis for a lack of heterogeneity. The statistical result for the intervention effect, which is generally considered the most important result in meta-analyses, is the z-test P value.

A common mistake when reporting results is, given a z-test P value greater than 0.05, to say there was “no statistical significance” or “no difference.” When evaluating statistical significance in a meta-analysis, a P value lower than 0.05 can be explained as “a significant difference in the effects of the two treatment methods.” However, the P value may appear non-significant whether or not there is a difference between the two treatment methods. In such a situation, it is better to announce “there was no strong evidence for an effect,” and to present the P value and confidence intervals. Another common mistake is to think that a smaller P value is indicative of a more significant effect. In meta-analyses of large-scale studies, the P value is more greatly affected by the number of studies and patients included, rather than by the significance of the results; therefore, care should be taken when interpreting the results of a meta-analysis.

When performing a systematic literature review or meta-analysis, if the quality of studies is not properly evaluated or if proper methodology is not strictly applied, the results can be biased and the outcomes can be incorrect. However, when systematic reviews and meta-analyses are properly implemented, they can yield powerful results that could usually only be achieved using large-scale RCTs, which are difficult to perform in individual studies. As our understanding of evidence-based medicine increases and its importance is better appreciated, the number of systematic reviews and meta-analyses will keep increasing. However, indiscriminate acceptance of the results of all these meta-analyses can be dangerous, and hence, we recommend that their results be received critically on the basis of a more accurate understanding.

1) http://www.ohri.ca .

2) http://methods.cochrane.org/bias/assessing-risk-bias-included-studies .

3) The inverse variance-weighted estimation method is useful if the number of studies is small with large sample sizes.

4) The Mantel-Haenszel estimation method is useful if the number of studies is large with small sample sizes.

5) The Peto estimation method is useful if the event rate is low or one of the two groups shows zero incidence.

6) The most popular and simplest statistical method used in Review Manager and Comprehensive Meta-analysis software.

7) Alternative random-effect model meta-analysis that has more adequate error rates than does the common DerSimonian and Laird method, especially when the number of studies is small. However, even with the Hartung-Knapp-Sidik-Jonkman method, when there are less than five studies with very unequal sizes, extra caution is needed.

8) The Begg and Mazumdar rank correlation test uses the correlation between the ranks of effect sizes and the ranks of their variances [ 37 ].

9) The degree of funnel plot asymmetry as measured by the intercept from the regression of standard normal deviates against precision [ 29 ].

10) If there are more small studies on one side, we expect the suppression of studies on the other side. Trimming yields the adjusted effect size and reduces the variance of the effects by adding the original studies back into the analysis as a mirror image of each study.

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 18 April 2024

Effects of human milk odor stimulation on feeding in premature infants: a systematic review and meta-analysis

  • Yangyang Qin 1 ,
  • Shu Liu 1 ,
  • Yanming Yang 1 ,
  • Yuan Zhong 1 ,
  • Danshi Hao 2 &
  • Han Han 3  

Scientific Reports volume  14 , Article number:  8964 ( 2024 ) Cite this article

Metrics details

  • Health care
  • Medical research

Previous studies suggested odor stimulation may influence feeding of premature neonates. Therefore, this systematic review and meta-analysis of randomized controlled trials was conducted to assess the effect of human milk odor stimulation on feeding of premature infants. All randomized controlled trials related to human milk odor stimulation on feeding in premature infants published in PubMed, Cochrane, Library, Medline, Embase, Web of science databases and Chinese biomedical literature databases, China National Knowledge Infrastructure, China Science and Technology Journal Database (VIP) and Wanfang Chinese databases were searched, and The Cochrane Handbook 5.1.0 was used to evaluate the quality and authenticity of the literature. Relevant information of the included studies was extracted and summarized, and the evaluation indexes were analyzed using ReviewManager5.3. The retrieval time was from the establishment of the database to July 28, 2022.12 articles were assessed for eligibility, and six randomized controlled studies were eventually included in the meta-analysis (PRISMA). A total of 6 randomized controlled studies with 763 patients were finally included in the study, and the quality evaluation of literatures were all grade B. Human milk odor stimulation reduced the transition time to oral feeding in premature infants [SMD = − 0.48, 95% CI (− 0.69, − 0.27), Z = 4.54, P  < 0.00001] and shortened the duration of parenteral nutrition [MD = − 1.01, 95% CI (− 1.70, − 0.32), Z = 2.88, P  = 0.004]. However, it did not change the length of hospitalization for premature infants [MD = − 0.03, 95% CI (− 0.41, 0.35), Z = 0.17, P  = 0.86]. The implementation of human milk odor stimulation can reduce the transition time to oral feeding and the duration of parenteral nutrition in premature infants, but further studies are needed to determine whether it can reduce the length of hospital stay in premature infants. More high-quality, large-sample studies are needed to investigate the effect of human milk odor stimulation on the feeding process and other outcomes in premature infants.

Introduction

Premature infants are babies born alive before 37 weeks of pregnancy are completed. According to the relevant World Health Organization (WHO) report, 15 million premature infants are born every year in the world 1 . Preterm birth is an important public health issue, as it is associated to a high burden of mortality and morbidities 2 . Premature infants are at high risk for aspiration due to poor coordination of sucking and swallowing 3 . Thus, they usually need tube feeding for nutritional needs with a gradual transition to oral feeding. Premature infants need to start oral feeding at the youngest possible age to improve survival and recovery 4 .

It has been shown 5 , 6 , 7 that fetal olfactory receptors begin to appear in the 8th week of pregnancy, ciliated olfactory receptors mature in the 24th week, and the nasopharyngeal epithelium can express olfactory marker proteins in the 28th week. Premature infants, just like full-term infants, possess a more advanced olfactory system at birth, enabling them to detect, selectively process, retain, and recall odor information. They are able to distinguish between different odors, including those of human milk, even without history of postpartum exposure to such odors 8 , 9 . Olfactory stimulus refers to an environmental stimulus that uses a familiar odor or aromatic odor and is transmitted to the cerebral cortex through olfactory receptors and olfactory nerves to produce an olfactory response. In recent years, an increasing number of studies have used olfactory stimulation as a non-drug intervention to improve the effects of feeding. For example, human milk odor stimulation has a sedative effect on neonates 10 , 11 and relieve the pain 12 caused by venipuncture. The milk odor can also prevent apnea 13 and improve oxygen saturation 14 in premature infants.

Nutritional status parameters, including body weight and oral feeding, are key in determining whether the premature infants can be discharged in time. At present, many reports on the application of human milk odor stimulation to improve the nutritional status of premature infants, but there are differences in the research results and a lack of comprehensive evaluation. Therefore, this systematic review and meta-analysis was conducted to comprehensively evaluate the effect of human milk odor stimulation, and to provide updated evidence for the development of nursing measures in clinical practice.

This study was conducted in conformity to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 15 .

Search strategy

Literature search was conducted from the establishment of the database to July 2022. We searched PubMed/Medline, Cochrane, Library, Embase, Web of science , Chinese biomedical literature databases, China National Knowledge Infrastructure (CNKI), China Science and Technology Journal Database (VIP ), and Wanfang Data Knowledge Service Platform to retrieve published studies on the human milk odor stimulation as an intervention to improve nutritional status in premature infants. Keyword selection and search included both medical subject headings (MeSH) and life science term indexes (EMBASE TREE; EMTREE). The relevant retrieval strategy was as follows: (“Infant newborn” OR “infant” OR “newborn” OR “neonate”) AND (“Feeding” OR “nutrition” OR “feed” OR “nourishment” OR “pabulum”) AND (“Olfactory stimulation” OR “breast milk stimulation” OR “olfactory” OR “human milk” OR “breast milk” OR “odorant” OR “odor” OR “odour” OR “smell”).

Inclusion criteria

Study characteristics used as criteria for eligibility are as follows: (1) Premature infants born at less than 37 weeks gestation who are receiving tube feeding and/or parenteral nutrition; (2) randomized controlled trials; (3) both groups of premature infants were given tube feeding and/or parenteral nutrition, with interventions involving the application of human milk odor stimulation in the intervention group and routine care in the control group; (4) evaluation metrics included transition time to oral feeding, length of stay, duration of parenteral nutrition, and/or body weight; (5) English or Chinese.

Exclusion criteria

(1) Duplicate articles; (2) Preclinical study, meta-analysis, case reports, reviews, guidelines; (3) Valid ending data unable to be extracted or calculated; (4) Full text of the study is not available; (5) The quality evaluation is grade C.

Data extraction

Two authors (YW and AP) carried out the data extraction process independently. Any disagreement was resolved with a senior researcher (CS) through discussion and consensus. Extracted contents were listed as follows: (1) Basic information of the included articles (title, the first author’s name, year of publication, geographic locations, the quality of the studies). (2) Baseline characteristics of the subjects in the eligible literature. (3) Detail of interventions or exposure factors. (4) The outcome indicators and outcome measures of interest (MD and SMD with the corresponding 95% CI).

Quality assessment

The quality of the selected studies was evaluated by two investigators using a revised tool for assessing risk of bias in Review Manager software. According to the Cochrane intervention research system evaluation manual 5.1.0, the document authenticity evaluation standard is carried out 16 . It mainly includes five aspects of bias (selection bias, performance bias, detection bias, attrition bias, reporting bias), six evaluation items: the generation method of random sequence, the concealment of random scheme allocation, the blind method of subjects and interventions, the blind method of outcome evaluators, the integrity of outcome data (loss of follow-up), and the possibility of selective reporting of research results. The single evaluation item is divided into three grades: (1) “low risk of bias” when a low risk of bias was determined for all domains, (2) “high risk of bias” when high risk of bias was reached for at least one domain or the study judgment included some concerns in multiple domains, and (3) unclear risk of bias 17 . The final quality evaluation grades of the literature are Grade A, grade B and grade C.

Statistical analysis

The main statistical software used in this study was ReviewManager5.3; Cochrane library) software. Measures such as length of hospital stay, duration of transitional oral feeding, and duration of parenteral nutrition use were statistically analyzed using the mean ± standard deviation and 95% CI. Standardized conversions could be performed with different measurement instruments to calculate MD/SMD values and 95% CI. The heterogeneity of included studies was examined by the I 2 index. If the test showed a high level of heterogeneity ( I 2  > 50%), a random effect model was used, otherwise a fixed-effect model ( I 2  < 50%) was used 18 . Sensitivity analysis was also performed to investigate the potential interference to the pooled effect size 19 . Statistical significance was set at P  < 0 0.05.

Ethics approval and consent to participate

This is a systematic review, no ethics review.

Literature search results

Initially, 322 literatures related to olfactory stimulation applied to premature infant were searched until July 28, 2022, of which 145 literatures related to human milk odor feeding of premature infants were screened. After excluding duplicate publications and those without full texts, 49 studies remained for full text screening. Reading through the full text, 26 articles were finally retained after excluding the inconsistent literature from the three aspects of study topic, overall design, and evaluation index. Then 12 articles were assessed for eligibility, and six randomized controlled studies were eventually included in the meta-analysis 20 , 21 , 22 , 23 , 24 , 25 (Fig.  1 ). General information and characteristics of the included literature are detailed (Table 1 ).

figure 1

PRISMA flow diagram.

Quality assessment of the selected studies

The qualities of the six included literatures were evaluated as Grade B (partially meeting all criteria). Most of the studies failed to demonstrate the concealment of random scheme allocation and the blind method of outcome evaluators (Fig.  2 , Table 2 ).

figure 2

Risk of bias assessment.

Effects of human milk odor stimulation on the transition time of oral feeding in premature infants

Five studies 20 , 21 , 22 , 23 , 24 evaluated the transition time of oral feeding for premature infants. Due to the small heterogeneity among studies ( P  = 0.39, I 2  = 3%), fixed-effect model analysis was conducted. The result showed that the transition time of oral feeding for premature infants in the intervention group was statistically significantly shorter than that in the routine care group,the statistical unit of the outcomes were days [SMD = − 0.48, 95% CI (− 0.69, − 0.27), Z = 4.54, P  < 0.00001] (Fig.  3 ).

figure 3

Effects of human milk odor stimulation on the transition time of oral feeding for premature infants.

Effects of human milk odor stimulation on duration of parenteral nutrition in premature infants

Two studies 24 , 25 evaluated the duration of parenteral nutrition of premature infants. Due to the small heterogeneity among studies ( P  = 0.43, I 2  = 0%), the fixed effect model was used to analyze the duration of parenteral nutrition. The result indicated a statistically significantly shorter duration of parenteral nutrition support in the intervention group than that in the routine care group, the statistical unit of the outcomes were days [MD = − 1.01, 95% CI (− 1.70, − 0.32), Z = 2.88, P  = 0.004] (Fig.  4 ).

figure 4

Effect of human milk odor stimulation on duration of parenteral nutrition for the premature infants.

Effects of human milk odor stimulation on the length of hospital stay for premature infants

Four 20 , 21 , 24 , 25 studies explored the impact of human milk odor stimulation intervention on the length of hospital stay. Random-effect model was applied given the high heterogeneity ( P  = 0.04, I 2  = 65%), which found no statistically significant difference between the intervention group and routine nursing care group [MD = − 0.28, 95% CI (− 1.19, 0.63), Z = 0.06, P  = 0.55] (Fig.  5 A). To explore the source of heterogeneity, sensitivity analysis was conducted by omitting one study at a time, which found that the study conducted by Yildiz could be have a significant impact the heterogeneity 20 . After removing this study 20 , heterogeneity was dramatically reduced ( P  = 0.50, I 2  = 0%), and fixed effect model was used to assess the effect of human milk odor stimulation intervention on the length of hospital stay. The result still showed no statistically significant difference between the two groups, the statistical unit of the outcomes were days [MD = − 0.03, 95% CI (− 0.41, 0.35), Z = 0.17, P  = 0.86] (Fig.  5 B).

figure 5

( A ) Effects of human milk odor stimulation on the hospitalization time for premature infants before removing Yildiz’s study; ( B ) Effects of human milk odor stimulation on the hospitalization time for premature infants after removing Yildiz’s study.

The systematic review and meta-analysis of six randomized controlled studies found improved outcomes of premature infants associated with human milk odor stimulation. Premature infants are difficult to be fed through bottles by mouth due to underdeveloped oral motor function and uncoordinated sucking, swallowing and respiratory movements, which usually require formula or human milk delivered through a gastric tube 26 . In addition, oral exercise by oral intake contributes to weight gain and neurological development and accelerate their recovery process, while non-oral feeding deprives premature infants of oral exercise 27 , 28 , 29 . Moreover, prolonged tube feeding affects the oral motor skills of the child, leading to reduced respiratory coordination, late sensory problems, and malnutrition 30 . Malnutrition leads to lack of stable weight gain, prolonged hospitalization, and even neurological deficits and readmission 31 . In contrast, adequate nutrition, maintenance of weight gain, and physiological stability play crucial roles in the successful recovery of premature infants from hospitalization 22 . Therefore, the transition from parenteral or tube feeding to complete oral feeding will contribute significantly to sufficient nutrition and prompt recovery of premature infants. The results of the pooled analyses in the present study showed that human milk odor stimulation was able to reduce the time required for transition to normal oral feeding in premature infants. It is well known that normal oral feeding (sucking, swallowing and respiratory coordination) is an early sign of neuromotor integrity in premature infants and an important indication for hospital discharge 28 .

Premature infants admitted to the newborn intensive care unit (NICU) for further treatment and care after birth often require controlled number, frequency, and volume of feedings. Therefore, these newborns may lack adequate stimulation and sensory experiences related to feeding, such as hunger, fullness, taste, and smell 32 . Olfactory and gustatory stimulation alone or in combination can reduce gastrointestinal-related adverse reactions and effectively improve the nutritional status of premature infants by activating complex pathways and triggering cephalic responses, which increase intestinal motility, digestive enzyme secretion, and hormone release 33 , 34 , 35 . Therefore, the application of human milk odor stimulation plays a vital role in promoting the recovery process of premature infants.

Although human milk odor stimulation was associated with reduced transition time to oral feeding and short duration of parental feeding, it did not change the length of hospitalization. Hospitalization in premature infants is affected by a variety of confounding factors, such as body weight, gestational age, the occurrence of complications, family economic status, and medical environment factors. Relevant reports found that very low birth weight infants had significantly longer hospital stays. Moreover, it also revealed that the smaller the gestational age was, the more likely it would be to have complications such as infection, cerebral hemorrhage, and pulmonary hemorrhage, thus prolonging the length of hospital stay 36 . Due to differences in medical and economic levels in different countries, there will also be inconsistencies in the length of hospitalization for premature infants 37 .

It is noteworthy that the study by Küçük 23 , Beker 25 , Khodagholi 21 , and Yildiz 20 reported the weight of premature infants at discharge, and the mean weight at discharge of the control group of these infants in the four studies was 1933.10 ± 90.50 g, 2913 ± 577 g, 1588.1 ± 84.4 g, and 1922.25 ± 230.82 g, respectively. In contrast, the mean weight of the intervention group at discharge were 1908.00 ± 87.86 g, 2986 ± 672 g, 1565.6 ± 93.6 g, and 1893.50 ± 189.04, respectively. However, the difference of the discharge weight between the intervention group and the control group in each study was not statistically significant. Because the initial weight of the premature infants at admission was different between the control group and the intervention group, a simple comparison of body weight at discharge did not yield enough information. Therefore, the effects of the weight of premature infants were not included and observed.

Several inherent limitations need to be noticed when interpreting the results of this meta-analysis. First, the number of studies included was small with overall small sample size. Second, all included studies had a quality rating of B, which may have had an impact on the evaluation results. Third, other parameters, such as weight gain in premature infants, were not available and their effects were not assessed.

This systematic review and meta-analysis found that human milk odor could reduce the transition time to oral feeding and duration of parenteral nutrition for premature infants, suggesting a cheap, effective, and easily accessible method to improve the overall outcomes of premature infants. However, the findings are limited by the number and quality of included studies, therefore, more well-designed studies are still needed to verify our findings.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

World Health Organization. Preterm birth [EB/OL] [2018-12-27].

Sadeghzadeh, M. et al. Early and late outcome of premature newborns with history of neonatal intensive care units admission at 6 years old in Zanjan, Northwestern Iran. Iran. J. Child Neural. 10 (2), 67 (2016).

Google Scholar  

Lau, C. Development of infant oral feeding skills: What do we know?. Am. J. Clin. Nutr. 103 (2), 616S-S621 (2016).

Article   ADS   CAS   PubMed   PubMed Central   Google Scholar  

Costa, J., Neves, A., Camargo, J., et al. Characterization of the transition to oral feeding in premature newborns. Caracterização da transição alimentar para via oral em recém-nascidos prematuros. CoDAS 34 (5), e20210136 (2022).

Schaal, B., Hummel, T. & Soussingnan, R. Olfaction in the fetal and premature infant: Functional status and clinical implications. Clin. Perinatol. 31 (2), 261–285 (2004).

Article   PubMed   Google Scholar  

Chuah, M. I. & Zheng, D. R. Olfactory marker protein is present in olfactory receptor cells of human fetuses. Neuroscience 23 , 363–370 (1987).

Article   ADS   CAS   PubMed   Google Scholar  

Bartocci, M. et al. Activation of olfactory cortex in newborn infants after odor stimulation: A functional near-infrared spectroscopy study. Pediatr. Res. 48 , 18–23 (2000).

Article   CAS   PubMed   Google Scholar  

Marlier, L., Gaugler, C. & Astruc, D. La sensibilité olfactive du nouveau-né prématuré [The olfactory sensitivity of the premature newborn]. Arch. De Pediatr. 14 (1), 45–53 (2007).

Article   CAS   Google Scholar  

Marlier, L. & Schaal, B. Human newborns prefer human milk: Conspecific milk odor is attractive without postnatal exposure. Child Dev. 76 (1), 155–168 (2005).

Badiee, Z., Asghari, M. & Mohammadizadeh, M. The calming effect of maternal breast milk odor on premature infants. Pediatr. Neonatol. 54 (5), 322–325 (2016).

Article   Google Scholar  

Tasci, B. & Kuzlu, A. T. The calming effect of maternal breast milk odor on term infant: A randomized controlled trial. Breastfeed Med. 15 (11), 724–730 (2020).

Baudessonde Chanville, A., Brevaut-Malaty, V., Garbi, A., et al. Analgesic effect of maternal human milk odor on premature neonates: A randomized controlled trial [published correction appears in J Hum Lact. 2017 Nov;33(4):822]. J. Hum. Lact. 33 (2), 300–308 (2017).

Kanbur, B. N. & Balci, S. Impact of the odors of vanilla extract and breast milk on the frequency of apnea in preterm neonates. Jpn J. Nurs. Sci. 17 (1), e12271 (2020).

Neshat, H. et al. Effects of breast milk and vanilla odors on premature neonate’s heart rate and blood oxygen saturation during and after venipuncture. Pediatr. Neonatol. 57 (3), 225–231 (2016).

Moher, D. et al. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Med. 6 (7), e1000097 (2009).

Article   PubMed   PubMed Central   Google Scholar  

Hu, Y., & Hao, Y. F. Evidence-based nursing. Second edition vol. 1, pp. 56–60 (People's Health Publishing House, Beijing, 2018).

Sterne, J. A. C. et al. RoB 2: A revised tool for assessing risk of bias in randomised trials. Br. Med. J. 366 , 1–8 (2019).

Higgins, J. P. & Thompson, S. G. Quantifying heterogeneity in a meta-analysis. Stat. Med. 21 (11), 1539–1558 (2002).

Haidich, A. B. Meta-analysis in medical research. Hippokratia 14 (Suppl 1), 29–37 (2010).

Yildiz, A. et al. The effect of the odor of breast milk on the time needed for transition from gavage to total oral feeding in preterm infants. J. Nurs. Scholar. 43 (3), 265–273 (2011).

Khodagholi, Z. et al. The effect of non-nutritive sucking and maternal milk odor on the independent oral feeding in preterm infants. Iran. J. Child Neurol. 12 (4), 55–64 (2018).

PubMed   PubMed Central   Google Scholar  

Lee, E. J. The effects of breast milk olfactory stimulation on physiological responses, oral feeding progression and body weight in preterm infants. J. Korean Acad. Nurs. 49 (2), 126–136 (2019).

Küçük Alemdar, D. & İnal, S. The effect of individualized developmental care practices in preterm infants. Complement Med. Res. 27 (2), 97–104 (2020).

Le, Q. et al. Effects of olfactory and taste stimulation on tube-fed premature infants. J. Nurs. 36 (19), 32–34 (2021).

Beker, F. et al. Effects on growth of smell and taste of milk during tube feeding of preterm infants: A randomized clinical trial. JAMA Pediatr. 175 (11), 1115–1123 (2021).

Lessen, B. S. Effect of the premature infant oral motor intervention on feeding progression and length of stay in preterm infants. Adv. Neonatal Care. 11 (2), 129–139 (2011).

Article   ADS   PubMed   Google Scholar  

Li, X. L. et al. Early premature infant oral motor intervention improved oral feeding and prognosis by promoting neurodevelopment. Am. J. Perinatol. 37 (6), 626–632 (2020).

Thabet, A. M. & Sayed, Z. A. Effectiveness of the premature infant oral motor intervention on feeding performance, duration of hospital stay, and weight of preterm neonates in neonatal intensive care unit: Results from a randomized controlled trial. Dimens Crit. Care Nurs. 40 (4), 257–265 (2021).

Jadcherla, S. Dysphagia in the high-risk infant: Potential factors and mechanisms. Am. J. Clin. Nutr. 103 (2), 622S-628S (2016).

Article   CAS   PubMed   PubMed Central   Google Scholar  

Toly, V. B. et al. Neonates and infants discharged home dependent on medical technology: Characteristics and outcomes. Adu Neonatal Care. 16 (5), 379–389 (2016).

Evereklian, M. & Posmontier, B. The impact of Kangaroo care on premature infant weight gain. J. Pediatr. Nurs. 34 , e10–e16 (2017).

Beker, F. et al. The effect of smell and taste of milk during tube feeding of preterm infants (the Taste trial): A protocol for a randomised controlled trial. BMJ Open. 9 (7), e027805 (2019).

Kitamura, A. et al. Role played by afferent signals from olfactory, gustatory and gastrointestinal sensors in regulation of autonomic nerve activity. Biol. Pharm. Bull. 33 (11), 1778–1782 (2010).

Power, M. L. & Schulkin, J. Anticipatory physiological regulation in feeding biology: cephalic phase responses. Appetite. 50 (2–3), 194–206 (2008).

Teff, K. Nutritional implications of the cephalic-phase reflexes: Endocrine responses. Appetite. 34 (2), 206–213 (2000).

Gupta, S. et al. Short term outcome and predictors of mortality among very low birth weight infants—a descriptive study. Indian J. Pediatr. 88 (4), 351–357 (2021).

Ahmadzadeh, N. et al. Estimation of economic burden of preterm and premature births in Iran. Med. J. Islamic Republic Iran 1 , 31–78 (2017).

Download references

1. Henan Traditional Chinese Medicine Culture and management Research Projece (NO.TCM2023001); 2. Special Scientific Research Project of Traditional Chinese Medicine in Henan Province (NO.2022JDZX075) 3. Soft Science Projece of Medical Science and Technology Research Programme of Henan Province (NO.RKX202302026); 4. Special Research Project of National TCM Inheritance and Innovation Centre of Henan Provincial Health Commission (NO2023ZXZX1102); 5. Special Research Project of National TCM Inheritance and Innovation Centre of Henan Provincial Health Commission NO.2023ZXZX1114.

Author information

Authors and affiliations.

Nursing Department, The First Affiliated Hospital of Henan University of Chinese Medicine, No. 19 Renmin Road, Jinshui District, Zhengzhou, 450099, He Nan, China

Yangyang Qin, Shu Liu, Yanming Yang & Yuan Zhong

Henan University of Chinese Medicine, No. 156 Jinshui East Road, Jinshui District, Zhengzhou, China

Obstetrics and Gynecology Department, The First Affiliated Hospital of Henan University of Chinese Medicine, No. 19 Renmin Road, Jinshui District, Zhengzhou, 450099, He Nan, China

You can also search for this author in PubMed   Google Scholar

Contributions

Yangyang Qin and Shu Liu were responsible for the topic selection, literature search, draft of the paper and data analysis. Han Han participated in literature screening, literature quality evaluation, and made the final revision of the paper. All authors conducted the search of literature, reviewed the articles, helped with data synthesis and interpretation, and played a major role in writing the manuscript. All authors agree to publish.

Corresponding author

Correspondence to Shu Liu .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Qin, Y., Liu, S., Yang, Y. et al. Effects of human milk odor stimulation on feeding in premature infants: a systematic review and meta-analysis. Sci Rep 14 , 8964 (2024). https://doi.org/10.1038/s41598-024-59175-4

Download citation

Received : 15 September 2022

Accepted : 08 April 2024

Published : 18 April 2024

DOI : https://doi.org/10.1038/s41598-024-59175-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Human milk odor stimulation
  • Premature infants
  • Meta-analysis
  • Evidence-based nursing

By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

literature review of error analysis

  • Get 7 Days Free

FNAIT Systematic Literature Review and Meta-Analysis Presented at the Academy of Managed Care Pharmacy 2024 Annual Meeting

  - Data Support HPA-1a Negative Frequency of More than 2% in Nearly 200,000 Screened Pregnant Women -

- Among HPA-1a Negative Pregnant Women, Approximately 33% are at Higher Risk for Alloimmunization -

Rallybio Corporation (Nasdaq: RLYB), a clinical-stage biotechnology company committed to identifying and accelerating the development of life-transforming therapies for patients with severe and rare diseases, today announced the presentation of results from a fetal and neonatal alloimmune thrombocytopenia (FNAIT) systematic literature review and meta-analysis at the Academy of Managed Care Pharmacy (AMCP) 2024 Annual Meeting, which is taking place in New Orleans, LA. The results of this research found that, in a pooled analysis of 198,062 pregnant women, 2.2% were HPA-1a negative and 32.3% of these women were also HLA-DRB3*01:01 positive and therefore at ~25-fold higher risk for alloimmunization. These rates are consistent with Rallybio’s current estimate of annual at-risk pregnancies and translate to tens of thousands of fetuses and newborns at risk each year for the potentially devastating consequences of FNAIT.

“We are pleased to establish a robust foundation of knowledge documenting the frequency of FNAIT risk as reported from a pooled analysis of peer-review literature, which is consistent with our current estimates,” said Stephen Uden, M.D., Chief Executive Officer of Rallybio. “This information, in combination with data from our ongoing FNAIT natural history study, will enable us to create a shared understanding of the number of pregnant women and babies at higher risk of FNAIT annually, underscoring the importance of having an effective preventative therapeutic option.”

Rallybio is developing RLYB212, a novel human monoclonal anti-HPA-1a antibody designed to prevent alloimmunization in pregnant women, thereby eliminating the risk of FNAIT and its potentially devastating consequences in their fetuses and newborns. Rallybio is on track to initiate a Phase 2 dose confirmation study for RLYB212 in pregnant women in the second half of 2024. The company is also conducting an ongoing FNAIT natural history study that will provide a contemporary dataset for HPA-1a alloimmunization frequency in a racially and ethnically diverse population, which is intended to support a future Phase 3 registration study for RLYB212. RLYB212 is the only investigational therapy currently reported to be in clinical development to address the needs of pregnant women at higher risk of FNAIT who have not alloimmunized.

The poster, titled “Fetal and Neonatal Alloimmune Thrombocytopenia: A Systematic Literature Review and Meta-analysis of Adverse Pregnancy-Related Outcomes to Support the Development of a Novel Prophylactic Therapeutic,” was presented by Andrea V. Margulis of RTI Health Solutions. Specifically, the literature review found that, of 198,062 screened pregnant women, 2.2% (95% confidence interval [CI], 2.0%-2.5%) were HPA-1a negative; 32.3% (28.6%-36.1%) of HPA-1a–negative women were HLA-DRB3*01:01 positive and therefore at even higher risk for alloimmunization. Approximately 10% of HPA-1a–negative women were already alloimmunized to HPA-1a. The meta-analysis is based on 12 observational cohort studies from Europe, Canada, and Egypt published from 1985 through 2018. A link to the poster is available here .

About FNAIT

Fetal and Neonatal Alloimmune Thrombocytopenia (FNAIT) is a potentially life-threatening rare disease that can cause uncontrolled bleeding in fetuses and newborns. FNAIT can arise during pregnancy due to an immune incompatibility between an expectant mother and her fetus in a specific platelet antigen called human platelet antigen 1, or HPA-1.

There are two predominant forms of HPA-1, known as HPA-1a and HPA-1b, which are expressed on the surface of platelets. Individuals who are homozygous for HPA-1b, meaning that they have two copies of the HPA-1b allele and no copies of the HPA-1a allele, are also known as HPA-1a negative. Upon exposure to the HPA-1a antigen, these individuals can develop antibodies to that antigen in a process known as alloimmunization. In HPA-1a-negative expectant mothers bearing a HPA-1a-positive fetus, alloimmunization can occur upon mixing of fetal blood with maternal blood. When alloimmunization occurs in an expectant mother, the anti-HPA-1a antibodies that develop in the mother can cross the placenta and destroy platelets in the fetus. The destruction of platelets in the fetus can result in severely low platelet counts, or thrombocytopenia, and potentially lead to devastating consequences including miscarriage, stillbirth, death of the newborn, or severe lifelong neurological disability in those babies who survive. There is currently no approved therapy for the prevention or prenatal treatment of FNAIT.

About Rallybio

Rallybio (Nasdaq: RLYB) is a clinical-stage biotechnology company with a mission to develop and commercialize life-transforming therapies for patients with severe and rare diseases. Rallybio has built a broad pipeline of promising product candidates aimed at addressing diseases with unmet medical need in areas of maternal fetal health, complement dysregulation, hematology, and metabolic disorders. The Company has two clinical stage programs: RLYB212, an anti-HPA-1a antibody for the prevention of fetal and neonatal alloimmune thrombocytopenia (FNAIT) and RLYB116, an inhibitor of complement component 5 (C5), with the potential to treat several diseases of complement dysregulation, as well as additional programs in preclinical development. Rallybio is headquartered in New Haven, Connecticut. For more information, please visit www.rallybio.com and follow us on LinkedIn and Twitter .

Forward-Looking Statements

This press release contains forward-looking statements that are based on our management’s beliefs and assumptions and on currently available information. All statements, other than statements of historical facts contained in this press release are forward-looking statements. In some cases, forward-looking statements can be identified by terms such as “may,” “will,” “should,” “expect,” “plan,” “anticipate,” “could,” “intend,” “target,” “project,” “contemplate,” “believe,” “estimate,” “predict,” “potential” or “continue” or the negative of these terms or other similar expressions, although not all forward-looking statements contain these words. Forward-looking statements in this press release include, but are not limited to, statements concerning the rates of women who are HPA-1a negative and HLA-DRB3*01:01 positive, the level of increased higher risk for alloimmunization among women who are HPA-1a negative and HLA-DRB3*01:01 positive, the actual rates of annual at-risk pregnancies for FNAIT and the number of potential pregnancies that could be at risk, our belief that the literature review establishes the relevant foundation of knowledge regarding such rates, our ability to create a shared understanding of the number of pregnant women and babies at higher risk of FNAIT annually, the timing of initiation of the Phase 2 dose confirmation study for RLYB212, whether the results of the natural history study and the planned Phase 2 dose confirmation study will be sufficient to support design and implementation of a Phase 3 registrational study for RLYB212, and the likelihood that Rallybio will be successful in developing RLYB212. The forward-looking statements in this press release are only predictions and are based largely on management’s current expectations and projections about future events and financial trends that management believes may affect Rallybio’s business, financial condition, and results of operations. These forward-looking statements speak only as of the date of this press release and are subject to a number of known and unknown risks, uncertainties and assumptions, including, but not limited to, our ability to successfully initiate and conduct our planned clinical studies, and complete such clinical studies and obtain results on our expected timelines, or at all, whether our cash resources will be sufficient to fund our operating expenses and capital expenditure requirements and whether we will be successful raising additional capital, our ability to enter into strategic partnerships or other arrangements, competition from other biotechnology and pharmaceutical companies, and those risks and uncertainties described in Rallybio’s filings with the U.S. Securities and Exchange Commission (SEC), including Rallybio’s Annual Report on Form 10-K for the period ended December 31, 2023, and subsequent filings with the SEC. The events and circumstances reflected in our forward-looking statements may not be achieved or occur and actual future results, levels of activity, performance and events and circumstances could differ materially from those projected in the forward-looking statements. Except as required by applicable law, we are not obligated to publicly update or revise any forward-looking statements contained in this press release, whether as a result of any new information, future events, changed circumstances or otherwise.

literature review of error analysis

Investors Samantha Tracy Rallybio Corporation (475) 47-RALLY (Ext. 282) [email protected] Hannah Deresiewicz Stern Investor Relations, Inc. (212) 362-1200 [email protected] Media Victoria Reynolds Mission North (760) 579-2134 [email protected]

View source version on businesswire.com: https://www.businesswire.com/news/home/20240417889090/en/

Market Updates

Going into earnings, is microsoft a buy, a sell, or fairly valued, big banks: high interest rates and sticky inflation haven’t dented consumer spending, is the world deglobalizing, how to position your investment portfolio before the 2024 election, q2 update: 2024 outlook for the stock market and economy, 6 stocks to sell and 6 stocks to buy instead, markets brief: will the new inflation story hurt stocks, magnificent no more apple and tesla stocks are weighing on the market, stock picks, keycorp earnings: 2024 net interest income outlook unchanged; fees strong and expenses stable, lithium americas: shares plunge on equity issuance, 4 undervalued semiconductor stocks, comerica earnings: net interest income and expense outlook steady, kinder morgan earnings: ai growth likely to add second leg to gas demand on top of us lng exports, going into earnings, is meta stock a buy, a sell, or fairly valued, united airlines earnings: strong revenue offset by ongoing investments, 19 best healthcare companies to invest in, sponsor center.

  • Open access
  • Published: 11 May 2023

Comparative efficacy and safety of PD-1/PD-L1 inhibitors in triple negative breast cancer: a systematic review and network meta-analysis of randomized controlled trials

  • Ibrahim Elmakaty 1 ,
  • Ruba Abdo 1 ,
  • Ahmed Elsabagh 1 ,
  • Abdelrahman Elsayed 1 &
  • Mohammed Imad Malki   ORCID: orcid.org/0000-0002-6801-2126 2  

Cancer Cell International volume  23 , Article number:  90 ( 2023 ) Cite this article

2680 Accesses

4 Citations

8 Altmetric

Metrics details

Triple-Negative Breast Cancer (TNBC) is a lethal subtype of breast cancer with limited treatment options. The purpose of this Network Meta-Analysis (NMA) is to compare the efficacy and safety of inhibitors of programmed cell death 1 (PD-1) and programmed cell death ligand 1 (PD-L1) in treating TNBC.

Our search strategy was used in six databases: PubMed, Cochrane Library, Cumulative Index to Nursing and Allied Health Literature database, Embase, Scopus, and Web of Science up to November 2nd, 2022, as well as a thorough search in the most used trial registries. We included phase II and III randomized controlled trials that looked at the efficacy of PD-1/PD-L1 inhibitors in the treatment of TNBC and reported either Overall Survival (OS), Progression-Free Survival (PFS), or pathological Complete Response (pCR). The risk of bias was assessed utilizing Cochrane's risk of bias 2 tool, and the statistical analysis was performed using a frequentist contrast-based method for NMA by employing standard pairwise meta-analysis applying random effects model.

12 trials (5324 patients) were included in our NMA including seven phase III trials. Pembrolizumab in a neoadjuvant setting achieved a pooled OS of 0.82 (95% Confidence Interval (CI) 0.65 to 1.03), a PFS of 0.82 (95% CI 0.71 to 0.94) and a pCR 2.79 (95% CI 1.07 to 7.24) compared to Atezolizumab’s OS of 0.92 (95% CI 0.74 to 1.15), PFS of 0.82 (95% CI 0.69 to 0.97), and pCR of 1.94 (95% CI 0.86 to 4.37). Atezolizumab had less grade ≥ 3 adverse events (OR 1.48, 95% CI 0.90 to 2.42) than Pembrolizumab (OR 1.90, 95% CI 1.08 to 3.33) in the neoadjuvant setting.

Conclusions

PD-1/PD-L1 inhibitors exhibited varying efficacy in terms of OS, PFS, and pCR. They were associated with an increase in immune-related adverse effects. When used early in the course of TNBC, PD-1/PD-L1 inhibitors exert their maximum benefit. Durvalumab as a maintenance treatment instead of chemotherapy has shown promising outcomes. Future studies should focus on PD-L1 expression status and TNBC subtypes, since these factors may contribute to the design of individualized TNBC therapy regimens.

Systematic review registration PROSPERO Identifier: CRD42022380712.

Breast cancer remains a major health burden, causing considerable morbidity and mortality worldwide [ 1 ]. It has surpassed lung cancer as the most frequently diagnosed malignancy overall and ranks the fifth leading cause of cancer-related mortality, with an estimated 2.3 million new cases (11.7% of all cancers), and 685,000 deaths in 2020 [ 2 ]. The incidence rate has been increasing at an alarming rate over the past years, especially in transitioning countries, and it is predicted that by 2040, this burden will grow further by over 40% to about 3 million new cases and 1 million deaths every year [ 2 , 3 ]. Triple-Negative Breast Cancer (TNBC) is a particularly aggressive subtype that accounts for approximately 15–20% of all cases and is characterized by a lack of expression of both estrogen and progesterone receptors as well as human epidermal growth factor receptor 2 [ 4 ]. The high molecular heterogeneity, great metastatic potential, and limited therapeutic options have all contributed to TNBC having a relatively poor prognosis with a 5-year overall survival rate of 77% [ 5 , 6 ]. Due to the absence of well-defined molecular targets, TNBC therapy predominantly relies on the administration of Taxane and Anthracycline-based regimens in both the neoadjuvant and the adjuvant settings [ 4 , 6 , 7 ]. More favorable response rates are shown to be achieved when using a combination rather than single-agent chemotherapy [ 8 , 9 ]. Although this can be effective initially, chemotherapy is often accompanied by resistance, relapse, and high toxicity [ 10 , 11 ]. Additionally, survival rates in those who develop metastatic disease have not changed over the past 20 years [ 9 ]. The median Overall Survival (OS) for those patients with the current treatment option is 16 months and the median Progression-Free Survival (PFS) is 5.6 months [ 12 ]. These results underscore the urgent need for more effective and less toxic therapies.

The introduction of immunotherapy has revolutionized the field of oncology over the past decade and has been successfully incorporated into the standard treatment paradigm of many malignancies including non-small cell lung cancer and renal cell cancer [ 13 , 14 ]. Whilst breast cancer has traditionally been considered immunogenically quiescent, several lines of evidence have demonstrated TNBC to be highly immunogenic and feature a microenvironment that is enriched with stromal Tumor Infiltrating Lymphocytes (TILs) with a relatively high tumor mutational burden as opposed to other subtypes [ 15 , 16 ]. The high levels of inhibitory checkpoint molecules expressed on the TILs led to the successful implementation of Immune Checkpoint Inhibitors (ICI) in TNBC treatment, particularly inhibitors of the Programmed Cell Death 1 (PD-1) and the Programmed Cell Death Ligand 1 (PD-L1) which have shown great promise in the field’s clinical trials [ 15 ]. The PD‑L1/PD-1 signaling pathway exerts a critical role in forming an adaptive immune resistance mechanism that mediates tumor invasion and metastasis [ 17 ]. Blocking this pathway would therefore restore the antitumor immune responses by reducing the inhibition of innate immunity and reactivating tumor-specific cytotoxic T cells [ 18 ].

Atezolizumab, an anti-PD-L1 antibody was the first Food and Drug Administration (FDA) approved ICI given along with nab-paclitaxel for patients with unresectable locally advanced or metastatic TNBC whose tumors express PD-L1 [ 19 ]. This accelerated approval was based on the results of the Impassion130 trial. Unfortunately, the designated confirmatory trial, IMpassion131 neither met the primary endpoint of PFS superiority nor achieved statistically significant overall OS leading to the withdrawal of this combination as an indication for treatment [ 12 ]. Alternatively, FDA granted approval to pembrolizumab, a PD-1 inhibitor to be used in combination with chemotherapy for patients with high-risk, early-stage TNBC, as well as those with locally recurrent unresectable or metastatic TNBC whose tumors have a PD-L1 Combined Positive Score (CPS) of ≥ 10 [ 12 ]. Nonetheless, there remain several additional clinical trials that have assessed the role of anti‑PD‑L1/PD‑1 agents in TNBC treatment with inconsistent results. The objective of this Network Meta-Analysis (NMA) is to evaluate the efficacy and safety of these agents, as well as compare them in order to determine the optimal therapeutic regimen for patients with TNBC.

Protocol and registration

This systematic review and meta-analysis is reported following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) extension for NMA Additional file 1 : (Table S1) [ 20 ]. The NMA protocol was carried in accordance with a protocol that had been registered in the International Prospective Register of Systematic Reviews (PROSPERO) online database (PROSPERO Identifier: CRD42022380712).

Search strategy

We developed our search strategy in the PubMed database using Medical Subject Headings (MeSH) that included the terms (“Immune Checkpoint Inhibitors”[MeSH] OR “programmed cell death 1 receptor/antagonists and inhibitors”[MeSH]) AND “Triple Negative Breast Neoplasms”[MeSH] AND “Randomized Controlled Trial”[Publication Type] with multiple keywords build around them. There was no date or language restriction applied to our strategy. The developed search strategy was then transferred from PubMed to five other databases by the Polyglot translator [ 21 ], namely Cochrane Library, Cumulative Index to Nursing and Allied Health Literature database, Embase, Scopus, and Web of Science. All databases were searched from the inception date until the 2nd of November 2022. The yielded studies were then exported to EndNote X7, where duplicates were identified and excluded. The remaining articles were uploaded to the Rayyan platform for screening [ 22 ]. In addition, we searched popular clinical trial registries such as ClinicalTrials.gov, EU Clinical Trials Register, International Standard Randomised Controlled Trial Number registry, International Clinical Trials Registry Platform, and breastcancertrials.org for Gery literature (unpublished trials) to ensure the comprehensiveness of our search strategy. Additional file 1 contains the complete strategy for each database and trial registries.

Eligibility criteria

We included trials that met the following criteria: (1) usage of FDA-approved PD-1/PD-L1 inhibitors, (2) phase II or III RCTs, (3) for the management of confirmed TNBC, (4) compared against a different Immune Checkpoint Inhibitors (ICIs), multiple agents’ chemotherapy regimen, single agent chemotherapy regimen or placebo (5) reported Hazard Ratios (HR) for OS, PFS or numbers of pathological Complete Response (pCR) in each both arms of the trial. We excluded review articles, non-randomized trials, quasi-randomized trials, meta-analyses and observational studies, as well as studies on animal models. We also excluded trials using non-FDA-approved immune checkpoint inhibitors.

Study selection and screening

The records obtained from applying the search strategy were evaluated on the Rayyan platform [ 22 ]. Titles and abstracts were screened independently by two reviewers either IE/RA or AhE/AbE with any disagreements were resolved by consensus among the entire team (IE, RA, AhE, AbE and MIM). The full texts of studies that were deemed potentially eligible were then retrieved and double-screened independently (IE/RA or AhE/AbE), with discrepancies dealt with through discussion with the whole team (IE, RA, AhE, AbE and MIM).

Data extraction

We extracted information from each eligible study on the first author, publication date, phase, total number of patients included, and number of patients in each arm, as well as patient demographics (median age, cancer stage), treatment given in each arm, duration of treatment, follow-up time and percentage of patients with positive PD-L1 expression at baseline defined by CPS ≥ 1. We also extracted HR values and their 95% Confidence Intervals (CI) for OS and PFS from each study, as well as the number of patients who achieved pCR in both arms. We collected data on the occurrence of common Adverse Events (AEs) in patients from each study arm. When duplicate publications were discovered, only the most recent and complete reports of RCTs were included. Two reviewers extracted all data (IE/RA or AhE/AbE), which was then summarized, discussed by the team, and compiled into an online Microsoft Excel spreadsheet accessible to all authors.

Risk of bias assessment

To assess the risk of bias, version 2 of the Cochrane Risk-Of-Bias (RoB2) assessment tool for randomized trials was used [ 23 ]. This was done independently by the reviewers (IE/RA or AhE/AbE) with disagreement being resolved by discussion and input from a third author (MIM). The RoB2 assessment tool includes five distinct domains with multiple signaling questions to aid in assessing the risk of bias. The five domains in this tool appraise bias arising from the following: randomization process, deviations from intended interventions, missing outcome data, measurement of the outcome and selection of the reported result. Accordingly, the signaling questions provided by the ROB2 tool were answered, and the two other reviewers evaluating the trial used those answers to categorize the current domain as “low risk of bias,” “some concerns,” or “high risk of bias.” The reviewer's judgment in each domain resulted in an overall risk-of-bias conclusion for the trial under consideration. The study was deemed to have a “low risk of bias” if it was judged to have a low risk of bias in all domains included in the tool, “some concerns” if it raised some concerns in at least one domain, or “high risk of bias” if it was judged to have a high risk of bias in at least one or some concerns for multiple domains, significantly lowering confidence in the result. This data for all studies was compiled in the tool's template excel sheet, which was made available to all reviewers.

As our aim is to evaluate the efficacy and safety of ICIs, we selected four different outcomes in this NMA. The first two are OS, which is defined as the time from randomization to death from any cause, and PFS, which is defined as the time from randomization to the first documented disease progression per Response Evaluation Criteria in Solid Tumors version 1.1. The HR and its 95% CI comparing the two arms of the trials in Intention-To-Treat (ITT) populations were used to generate our final effect sizes in this NMA. The third outcome is pCR, which is defined as the absence of invasive tumors in the breast and regional nodes at the time of definitive surgery (ypT0/is pN0). Finally, to assess the safety of PD-1/PD-L1 inhibitors, we estimated the likelihood of developing AEs in each arm of the ITT populations by using the number of patients who had AEs in all grades and grade 3 or higher. Both pCR and AEs were calculated using Odds Ratios (OR) and their 95% CI based on the number of reported events in each of the trial arms.

Data analysis

Our NMA used standard pairwise meta-analysis implemented in multivariate meta-analysis models using a frequentist contrast-based approach [ 24 ]. If there is no evidence of importance in transitivity, a random-effects frequentist NMA has to be performed. These models assume that direct and indirect evidence are consistent. The network meta-analysis' net evidence is a weighted average of direct and indirect evidence. For OS and PFS, we calculated the mean log HR and its standard error and entered it into the model [ 25 ], while for pCR and AEs, we entered the number of events in each arm. When the same intervention was used in both arms of an RCT, it was assumed that the effect of that intervention was cancelled out, thus we assumed that all trials used the same comparator chemotherapy, which is necessary because even within the same trial, different chemotherapy regimens were used as controls. The assumption of transitivity was tested by comparing the distribution of study and population characteristics that may act as effect modifiers across the various pairwise comparisons. If transitivity issues were present, we returned to data extraction to verify the stage of TNBC, and the type of chemotherapy regimen used. In the case of indirect evidence, inconsistency between direct and indirect evidence was investigated locally through the use of symmetrical node-splitting [ 26 ]. However, we found no head-to-head comparisons of PD-1/PD-L1 inhibitors. Visual inspection of comparison-adjusted funnel plots for NMA was used to assess publication bias [ 27 ]. Studies were expected to form an inverted funnel centred at zero in the absence of small-study effects. The Surface Under the Cumulative Ranking Curve (SUCRA) value, which represents the re-scaled mean ranking, was also calculated and summarized [ 28 ]. Where quantitative synthesis is deemed invalid due to a small number of studies using the same intervention, narrative synthesis was used to report the findings in the results section, with estimates from the original studies. For all comparisons, we adopted the network suite in Stata to perform analyses and graphs, Stata version 16 (College Station, TX, USA) [ 29 ].

Subgroup analysis

In the event of significant heterogeneity, we conducted a sensitivity analysis, removing each study and comparing its effect. In terms of the outcome of AEs, we investigated the impact of reported symptoms on AEs to check which side effects are likely to produce this effect. We performed a sensitivity analysis for NMA using the Generalized Pairwise Modelling (GPM) framework to investigate the effect of the models used [ 30 ]. The GPM framework was used to generate mixed treatment effects against a common comparator. The common comparator for all outcomes was chemotherapy. Other than transitivity, this framework requires no additional assumptions [ 30 ]. In this sensitivity analysis, the Inverse Variance Heterogeneity model was used to pool the meta-analytical estimates [ 31 ]. The H index was used to assess statistical heterogeneity across pooled direct effects, while the weighted pooled H index ( \(\overline{H }\) ) was used to examine inconsistency across the network and assess transitivity [ 30 ]. The smallest value that H and \(\overline{H }\) can take is 1, and \(\overline{H }\) <3 was thought to represent minimal inconsistency [ 32 ]. MetaXL version 5.3 was used for the GPM framework analyses (EpiGear Int Pty Ltd.; Brisbane, Australia). The results of those sensitivity analyses will be presented in the Additional file 1 .

Study selection

Figure  1 illustrates the PRISMA flow diagram of the study selection process. Our extensive database and trial registry search yielded 1583 results. 397 duplicates were automatically removed through EndNote. A total of 1186 potentially relevant articles were identified, of which 1056 were excluded after the initial review of their titles and abstracts. The full text of the remaining 130 articles was assessed for eligibility. Of those, 71 were found to be duplicate patient records, and only the most recent and inclusive records were kept. Another 31 RCTs were excluded due to a paucity of outcome measures at the time of the search. Other 16 records were similarly removed for a variety of reasons depicted in Fig.  1 . Eventually, 12 studies were eligible for inclusion in our NMA [ 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 , 42 , 43 , 44 ]. Additional file 1 : Table S2 includes all the additional information on the omitted record citations as well as full reasoning.

figure 1

PRISMA flowchart showing the number of studies at each stage of conducting this NMA

Study characteristics and data collection

Table 1 summarizes the characteristics of the included RCTs. All 12 trials included were two-arm trials that reported results from 5324 patients with median ages ranging from 48 to 59.1 years. There were seven phase III trials and five phase II trials. Six studies looked at the effect of PD-1/PD-L1 inhibitors on unresectable, invasive, or metastatic (advanced) TNBC [ 33 , 35 , 36 , 37 , 40 , 43 ], four looked at non-metastatic/early-stage TNBC [ 39 , 41 , 42 , 44 ], and two looked at treated metastatic TNBC for maintenance therapy [ 34 , 38 ]. Atezolizumab (n = 5 trials) was the most commonly studied ICI [ 33 , 36 , 39 , 40 ], followed by Pembrolizumab (n = 4 trials) [ 34 , 35 , 41 , 42 ], Durvalumab (n = 2 trials) [ 37 , 38 ], and Nivolumab (n = 1 trial) [ 43 ]. Six trials used multiple-agent chemotherapy regimens in combination with PD-1/PD-L1 inhibitors [ 36 , 37 , 39 , 41 , 42 , 44 ], and four used mono-chemotherapy regimens with PD-1/PD-L1 inhibitors, including two Taxane-based [ 33 , 40 ], one Platinum-based [ 43 ], and one Investigator's choice chemotherapy [ 35 ]. The other two trials compared PD-1/PD-L1 inhibitors alone to chemotherapy for maintenance therapy in patients with previously treated metastatic TNBC [ 34 , 38 ]. There were some minor differences in the duration of PD-1/PD-L1 inhibitors used between studies. With the exception of one trial [ 44 ], PD-1/PD-L1 inhibitors were used for four to eight cycles with a follow-up time of more than 12 months. The PD-L1 expression in TNBC tissue samples varied significantly between the included RCTs, ranging from 39 to 87% (see Table 1 ). Table 1 is to be inserted here.

Overall, five RCTs had a low risk of bias [ 33 , 35 , 37 , 40 , 41 ], six had some concerns [ 36 , 38 , 39 , 42 , 43 , 44 ], and only one had a high risk of bias [ 34 ]. When following the intended protocol and performing ITT analysis, all included trials were of high quality. Five of the six trials that raised concerns were due to the trial being non-blinded [ 36 , 38 , 42 , 43 , 44 ], which could affect the assessment of the outcome of interest. One study found a significant difference in one of the baseline parameters [ 39 ], while the high-risk study failed to report one of the secondary outcomes in the main text [ 34 ]. Figure  2 depicts the overall risk of bias across all domains (Fig.  2 A), as well as the reviewers' judgment within each domain for all included trials (Fig.  2 B).

figure 2

The results of the risk of bias assessment. A Stacked bar chart showing a summary of the risk of bias assessment overall and in each domain. B The detailed answers for all studies in each domain

  • Overall survival

The OS was reported in nine RCTs [ 33 , 34 , 35 , 37 , 38 , 39 , 40 , 41 , 43 ], three of which used Atezolizumab [ 33 , 39 , 40 ], two used Pembrolizumab [ 35 , 41 ], and one used either Durvalumab or Nivolumab as a neoadjuvant to chemotherapy (Fig.  3 A) [ 37 , 43 ]. Pembrolizumab in a neoadjuvant setting had a pooled HR of 0.82 (95% CI 0.65 to 1.03, SUCRA = 46%, n = 2 trials, 1449 patients), which was comparable to Atezolizumab’s HR of 0.92 (95% CI 0.74 to 1.15, SUCRA = 28%, n = 3 studies, 1886 patients), demonstrating a prolonged but insignificant OS in PD-1/PD-L1 inhibitors arms (see SUCRA Additional file 1 : Table S3). GeparNuevo using Durvalumab had the only significant reported prolonged OS in PD-1/PD-L1 inhibitors in neoadjuvant settings (HR 0.24, 95% CI 0.08 to 0.72) [ 37 ]. Durvalumab also improved OS when used as a monotherapy for maintenance therapy in patients with metastatic TNBC (SAFIR02-BREAST trial, HR 0.54, 95% CI 0.30 to 0.97) [ 38 ]. This outcome's results were consistent among the studies. The rest of the analysis is shown in Fig.  3 . GPM sensitivity analysis also revealed no significant differences (Additional file 1 : Figure S1).

figure 3

Overall survival network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the effectiveness of each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately

  • Progression-free survival

Only six RCTs reported PFS [ 33 , 34 , 35 , 38 , 40 , 43 ], two of which used Atezolizumab in neoadjuvant sitting [ 33 , 40 ], as shown in Fig.  4 A. In a neoadjuvant setting along with chemotherapy, Atezolizumab achieved a pooled PFS HR of 0.82 (95% CI 0.69 to 0.97, SUCRA = 76.5%, 1553 patients) (see complete SUCRA values in Additional file 1 : Table S4), whereas Pembrolizumab can also prolong PFS as reported in KEYNOTE-355 trial when combined with chemotherapy (HR 0.82, 95% CI 0.71 to 0.94) [ 35 ]. In the SAFIR02-BREAST trial, Durvalumab had similar PFS to single-agent chemotherapy (HR 0.87, 95% CI 0.54 to 1.42, 82 patients) [ 38 ], whereas Pembrolizumab alone was associated with significantly worse PFS than chemotherapy in KEYNOTE-119 trial (HR 1.60, 95% CI 1.33 to 19.2, 622 patients) [ 34 ]. The rest of the analysis is shown in Fig.  4 , and the GPM sensitivity analysis is illustrated in the Additional file 1 : (Figure S2).

figure 4

Progression-free survival network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the effectiveness of each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately

Pathologic complete response

The number of patients who achieved a complete response was reported in six trials [ 36 , 39 , 41 , 42 , 44 ]: three on Atezolizumab [ 36 , 39 , 44 ], two on Pembrolizumab [ 41 , 42 ], and one on Durvalumab [ 37 ], all in the neoadjuvant setting to chemotherapy. Pembrolizumab in combination with chemotherapy significantly increased the odds of achieving pCR compared to chemotherapy alone (OR 2.79, 95% CI 1.07 to 7.24, SUCRA = 82.1%, 2 studies, 709 patients), whereas Atezolizumab showed an insignificant increase in pCR (OR 1.94, 95% CI 0.86 to 4.37, SUCRA = 62.3, 3 studies, 674 patients) (complete SUCRA values in Additional file 1 : Table S5). In the GeparNuevo trial, the calculated OR of achieving pCR with Durvalumab and chemotherapy was 1.45 (95% CI 0.80 to 2.63) [ 37 ]. Figure  5 summarizes the results of the pCR analysis, and the GPM sensitivity analysis is presented in the Additional file 1 : Figure S3.

figure 5

Pathologic complete response network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the effectiveness of each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately

  • Adverse events

At the time of analysis, nine trials had AEs grade ≥ 3 results reported [ 33 , 34 , 35 , 36 , 39 , 40 , 41 , 42 , 44 ], the majority of which was the effect of Atezolizumab combined with chemotherapy versus chemotherapy alone (n = 5 studies) [ 33 , 36 , 39 , 40 , 44 ], followed by Pembrolizumab with chemotherapy (n = 3 studies) (Fig.  6 A) [ 35 , 41 , 42 ]. The pooled OR of Atezolizumab addition to chemotherapy causing AEs grade 3 or more compared to chemotherapy alone was 1.48 (95% CI 0.90 to 2.42, 5 studies, 2325 patients), whereas Pembrolizumab with chemotherapy showed a slightly greater risk of causing AEs grade ≥ 3 (OR 1.90, 95% CI 1.08 to 3.33, 3 studies, 2263 patients) (Fig.  6 C). Atezolizumab and Pembrolizumab achieved SUCRA values of 26.7% and 9.3% respectively compared to 64.3% for chemotherapy (Additional file 1 : Table S6). When compared to single-agent chemotherapy, the KEYNOTE-119 trial showed a significant reduction in AEs grade ≥ 3 when using Pembrolizumab alone in maintenance therapy (OR 0.29, 95% CI 0.19 to 0.43) [ 34 ].

figure 6

Grade ≥ 3 adverse events network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing adverse events for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately

In the sensitivity analysis investigating the subtype of the reported AEs, neoadjuvant Pembrolizumab to chemotherapy showed an increase in the odds of developing adrenal insufficiency (OR 26.24, 95% CI 3.50 to 197.86, Additional file 1 : Figure S4), diarrhea (OR 1.47, 95% CI 1.14 to 1.88, Additional file 1 : Figure S5), hyperthyroidism (OR 5.22, 95% CI 2.44 to 11.15, Additional file 1 : Figure S6), hypothyroidism (OR 5.23, 95% CI 3.35 to 8.16, Additional file 1 : Figure S7), infusion reaction (OR 1.64, 95% CI 1.13 to 2.37, Additional file 1 : Figure S8) and pneumonitis (OR 5.94, 95% CI 1.29 to 27.27, Additional file 1 : Figure S9). On the other hand, Atezolizumab in the neoadjuvant settings increased the odds of developing hyperthyroidism (OR 10.91, 95% CI 1.98 to 60.15, Additional file 1 : Figure S6), hypothyroidism (OR 3.77, 95% CI 2.52 to 5.63, Additional file 1 : Figure S7) and pneumonitis (OR 2.73, 95% CI 1.41 to 5.31, Additional file 1 : Figure S9) compared to chemotherapy alone. The remaining results of the sensitivity analysis according to the type of AE developed and GPM are outlined in the Additional file 1 : (Figure S10 to Figure S17).

Principle findings and existing literature

TNBC is an aggressive form of breast cancer that is often associated with poor patient outcomes, largely due to the limited treatment options available [ 6 ]. Intensive research efforts have therefore attempted to improve the efficiency of standard-of-care chemotherapy by incorporating immunotherapeutic agents, particularly ICIs, which have emerged as a novel breakthrough in cancer treatment in the past recent years [ 15 ]. The present network meta-analysis aimed to compare the published data on the efficacy and safety of ICIs in treating TNBC. Our results showed that antiPD-1/PD-L1 therapies can be used as a neoadjuvant to chemotherapy in the first-line treatment or alone in previously treated TNBC. Multiple RCTs that were conducted on this topic have demonstrated a greater benefit of adding ICIs to chemotherapy in terms of OS, PFS, and pCR [ 45 , 46 , 47 , 48 , 49 ]. As a result, existing meta-analyses evaluating those trials were successful in achieving statistical and clinical significance. For example, Zhang et al. group reported that PD-1/PD-L1 inhibitors in combination with chemotherapy improved pCR (OR 1.59, 95% CI 1.28 to 1.98), event-free survival (HR 0.66, 95% CI 0.48 to 0.91, p = 0.01), and overall survival (HR 0.72, 95% CI 0.52 to 0.99) in TNBC patients compared to chemotherapy alone [ 45 ]. Moreover, Li et al. studied the pCR of ICIs in neoadjuvant setting in TNBC and reported that the OR significantly increased in their four included study meta-analysis (OR 2.14, 95% CI 1.37–3.35, P < 0.001) and a better event-free survival (HR 0.66, 95% CI 0.48 to 0.89, P = 0.007) [ 49 ], while similar values for pCR were reported by Rizzo et al. (OR 1.95, 95% CI 1.27 to 2.99) and Xin et al. (OR 1.91, 95% CI 1.32 to 2.78) [ 46 , 48 ]. Villacampa et al. reported that patients with PD-L1-positive tumors had a significantly better PFS with ICIs (HR 0.67, 95% CI 0.58 to 0.79) and a trend towards better OS (HR 0.79, 95% CI 0.60 to 1.03), while no benefit was observed in patients with PD-L1-negative tumors [ 47 ]. This is in contrast to Zhang et al. who found that the pCR rate was almost identical in the PD-L1-positive and negative groups [ 45 ]. However, many have reported high heterogeneity in effect estimates, indicating major systematic differences between the included RCTs [ 45 , 46 , 47 , 48 , 49 ]. Although this heterogeneity has been attributed to many factors including patient population, TNBC stage, PD-L1 levels, randomization process, and type of chemotherapy regimen, these meta-analyses have failed to acknowledge the performance differences and the distinct immunologic mechanisms by which ICIs act. Contrary to our study, they have combined all agents into a large group and regarded them as one entity, assuming they have similar efficacy and safety.

Efficacy of PD-1/PD-L1 inhibitors

In our NMA, only two trials out of nine reported statistical significance in terms of OS, both of which used Durvalumab, one as a neoadjuvant (GeparNuevo phase II trial, HR 0.24, 95% CI 0.08 to 0.72, 174 patients) and the other as maintenance (SAFIR02-BREAST trial, HR 0.54, 95% CI 0.30 to 0.97) [ 37 , 38 ]. Six of the remaining seven trials reported longer, yet statistically insignificant survival. This could be attributed to the small sample size or the lack of follow-up, yet the possibility of Durvalumab having superior efficacy remains, highlighting the need for an additional large phase III RCTs investigating Durvalumab efficacy and safety in TNBC. Five of the seven trials that used neoadjuvant PD-1/PD-L1 inhibitors and reported OS were invasive or metastatic (advanced), with only GeparNuevo achieving a significant reduction in OS HR (0.24, 95% CI 0.08 to 0.72). The remaining two neoadjuvant trials (IMpassion031 trial, HR 0.69, 95% CI 0.25 to 1.87) and (KEYNOTE-522 trial, HR 0.72, 95% CI 0.51 to 1.02) were on non-metastatic or advanced and did not show any improvement in OS.

In general, PFS prolongation followed a positive trend similar to OS when ICIs were used. The IMpassion130 trial demonstrated a significant improvement in PFS with Atezolizumab (HR 0.80, 95% CI 0.69 to 0.92) [ 40 ], as opposed to the confirmatory trial Impassion131which failed to achieve statistical significance with Atezolizumab despite extending PFS (HR 0.86, 95% CI 0.70 to 1.05) [ 33 ]. An FDA review of the discordant findings between these two trials, including chemotherapy regimens, study design, conduct and population found no single component that could be responsible for this discrepancy, as a result, the reason for this is unclear at present. It is also worth mentioning that the only two trials that reported statistical significance, KEYNOTE-355 and IMpassion130, are the ones with the largest population sample, which may have accounted for their outcome.

Alternatively, the KEYNOTE-355 trial found that Pembrolizumab is effective in prolonging PFS in the neoadjuvant setting (HR 0.82, 95% CI 0.69 to 0.97) [ 35 ], while Nivolumab appears to be less effective in improving survival PFS (HR 0.98, 95% CI 0.51 to 1.88). Both KEYNOTE-522 and IMpassion031 trials found that using ICI in the neoadjuvant setting improved disease-free survival [ 39 , 41 ]. ICIs use as maintenance therapy instead of chemotherapy in treated metastatic TNBC has also shown promising results in terms of prolonging survival using Durvalumab in the SAFIR02-BREAST trial, in contrast to Pembrolizumab that showed no significant improvement in the Keynote-119 trial (PFS HR 1.6, 95% CI 1.33 to 1.92) [ 34 , 38 ]. Nonetheless, the Keynote-119 trial demonstrated a significant reduction in AEs grade ≥ 3, negating one of chemotherapy's worst attributes [ 34 ]. Furthermore, ICIs have also been shown to improve the chances of achieving pCR in TNBC patients when compared to chemotherapy alone. According to our NMA, neoadjuvant Pembrolizumab resulted in the highest pCR (OR 2.79, 95% CI 1.07 to 7.24), followed by Atezolizumab (OR 1.94, 95% CI 0.86 to 4.37, 3 studies, 674 patients), and Durvalumab, which had the lowest pCR (1.45, 95% CI 0.80 to 2.63). However, among the six trials that reported pCR, NeoTRIPaPDL1 and GeparNuevo were the only two RCTs that did not report significant improvement in pCR [ 36 , 37 ]. This can be explained by the advanced TNBC stage both studies were conducted upon, implying that using ICIs at an earlier stage of TNBC disease progression will more likely benefit patients and improve their survival. This is supported by the fact that early-stage TNBC has a greater tumor immune microenvironment than advanced TNBC, which increases the effectiveness of ICIs with the additional stimulation to the immune response provided by chemotherapy treatment [ 46 ]. Another possibility for the negative NeoTRIPaPDL1 results could be due to the insufficient immune induction effect of the chemotherapy regimens used in the study design [ 46 ].

Safety of PD-1/PD-L1 inhibitors

In regard to safety, ICIs appear to be associated with a significant toxicity burden, especially in the form of immune-related AEs [ 50 ]. Our NMA showed that Pembrolizumab generally has a worse safety profile than Atezolizumab, causing more grade ≥ 3 AEs (OR 1.90, 95% CI 1.08 to 3.33). Despite the fact that both drugs increased the risk of hyperthyroidism, hypothyroidism, and pneumonitis, Pembrolizumab caused a significant increase in adrenal insufficiency, diarrhea, and infusion reaction, making Atezolizumab a safer option. These AEs are likely to be related to drugs’ mechanism of action. The ability of ICIs to reinvigorate exhausted T-cells in an attempt to kill the tumor may destroy the immune tolerance balance and result in autoimmune and inflammatory responses in normal tissue [ 51 , 52 ]. However, the reason why certain people or specific organs are more susceptible than others is still incompletely understood [ 51 ]. Proposed hypotheses include hereditary predisposition, environmental factors and expression of shared antigens between tumors and affected tissue [ 51 ]. Whilst most of these immune-related AEs are usually manageable and reversible, some may require long-term intervention, such as endocrinopathies [ 50 ]. Of note, close monitoring of patients and early detection of any AEs is of utmost importance to ensure patients can benefit from adding PD-1/PD-L1 inhibitors to their chemotherapy regimen. Careful follow-up care is also warranted to prevent potential later onset immune-related AEs that may present after cessation of ICIs [ 50 ].

Enhancing the benefit of PD-1/PD-L1 inhibitors

It is crucial to note that the response to ICIs as well as to the combination of other agents differs significantly among patients, highlighting the importance of predictive biomarkers [ 53 ]. A multitude of promising novel biomarkers has recently gained considerable attention including the CD274 gene and TILs, but to date, PD-L1 status remains the only biomarker approved to guide patient selection in TNBC [ 53 , 54 , 55 ]. We considered PD-L1 positivity as CPS ≥ 1 in Table 1 , yet the threshold for PD-L1 positivity and at what level ICIs become more effective remains a topic of scientific debate. Analysis of the present NMA showed that IMpassion031, Keynote-522, and GeparNuevo trials have all demonstrated PD-1/PD-L1 inhibitors to improve efficacy regardless of PD-L1 status in patients with early-stage TNBC [ 33 , 41 ]. Conversely, IMpassion130 and Keynote-355 demonstrated improved efficacy in metastatic TNBC but not in early-stage TNBC [ 35 , 40 ]. Following the outcomes of the recently published IMpassion130 and KEYNOTE-355 trials, this biomarker was validated as a predictor of response to PD-1/PD-L1 inhibitors in metastatic breast cancer [ 48 ]. Even though data from a previous meta-analysis found no correlation between pCR rates and PD-L1 expression, further investigation revealed pCR rates to be higher in PD-L1-positive patients [ 46 ]. Notably, the lack of a standardized approach for PD-L1 detection in TNBC has led to inconsistent PD-L1 prevalence, thereby hampering the precise guiding of immunotherapy [ 45 , 54 ]. Another significant challenge is that TNBC is composed of numerous heterogeneous subtypes. Biomarker research on IMpassion130 samples revealed that PD-L1 is expressed higher in basal-like immune-activated subtype (75%) and immune-inflamed tumors (63%) TNBC subtypes [ 56 , 57 ]. Another exploratory study found an improved advantage in PFS in TNBC patients with immune-inflamed tumors, basal-like immune-activated and basal-like immunosuppressed subtypes, in addition to the prolonged OS in inflamed tumors and basal-like immune-activated subtypes [ 47 , 56 , 57 ]. Certainly, the identification of predictive biomarkers of efficacy will greatly aid in optimizing personalized regimens for TNBC patients, as well as predicting the long-term effectiveness of PD-1/PD-L1 inhibitors.

Future RCTs using PD-1/PD-L1 inhibitors in TNBC

Interestingly, the majority of the currently ongoing RCTs are investigating Atezolizumab and Pembrolizumab, both of which were studied the most in nine out of the 12 RCTs included in our NMA. Hoffmann-La Roche, the sponsor of IMpassion130, IMpassion131, and Impassion031, is currently funding three additional phase III RCTs on Atezolizumab. IMpassion132 is a double-blind Phase III RCT on the efficacy and safety of neoadjuvant Atezolizumab for early relapsing TNBC (NCT03371017), while IMpassion030 is planned to be the largest RCT on ICI as it is presently recruiting 2300 patients with operable TNBC to investigate the combination of neoadjuvant Atezolizumab and chemotherapy (NCT03498716). Hoffmann-La Roche’s third RCT is looking into the combination of Atezolizumab, Ipatasertib, and Paclitaxel in patients with advanced or metastatic TNBC (NCT04177108). In another phase III double-blinded RCT, GeparDouze will investigate neoadjuvant Atezolizumab followed by adjuvant Atezolizumab in patients with high-risk TNBC (NCT03281954). The National Cancer Institute (NCI) is also funding a large phase III RCT to assess the efficacy and safety of Pembrolizumab as adjuvant therapy following neoadjuvant chemotherapy (NCT02954874). Additionally, ASCENT-04 and ASCENT-05 are both ongoing phase III RCTs investigating the PFS of Pembrolizumab in combination with Sacituzumab Govitecan versus chemotherapy in either advanced or residual invasive TNBC (NCT05382286, NCT05633654). TROPION-Breast03 is similarly a new phase III RCT looking at Datopotamab Deruxtecan (DatoDXd) with or without Durvalumab in early-stage TNBC (NCT05629585). Finally, Avelumab, another PD-L1 inhibitor, is currently being studied in a phase III RCT on high-risk TNBC patients (A-Brave trial, NCT02926196).

Limitations

There are some limitations that must be addressed in this NMA. Firstly, only 12 studies were included, in addition to the limited number of reported outcomes of interest. This is primarily due to the fact that we only included phase II and phase III RCTs because our goal was to compare the efficacy of PD-1/PD-L1 inhibitors in clinical settings. With the ongoing development of neoadjuvant ICI clinical trials, there will certainly be more comprehensive data to be analyzed in future NMA. Second, the NMA comparisons were solely based on direct evidence, with no head-to-head comparisons of neoadjuvant ICIs in TNBC. Moreover, the small number of studies has caused the limited network connectivity to produce large confidence intervals for some estimates, even when effect sizes were large. It may have also resulted in an immature investigation of heterogeneity and publication bias. We would also like to point out the differences between the included studies in terms of TNBC stage, chemotherapy backbone, ICI duration, follow-up time, and PD-L1 expression status. Different chemotherapy backbone regimens used in different studies may have influenced the interpretation of the results as they could have been added to separate groups in the NMA if the number of included studies allowed. Given this heterogeneity and the limited RCTs number, further subgroup analysis based on PD-L1 expression status and nodal involvement, as well as advanced vs early-stage, was not deemed feasible. Finally, all data in this study were derived from published literature, and no individual patient data were used. Noteworthily, the meta-analysis results could potentially be biased by two of the included RCTs that were published as abstracts, which may have relatively incomplete data, missing safety data, and unclear research methods.

Our NMA found variation in efficacy and safety among PD-1/PD-L1 inhibitors used to treat TNBC, as well as significant systematic differences between the RCTs included. To better assess those variations in efficacy, head-to-head trials between those PD-1/PD-L1 inhibitors are needed. In their use as a neoadjuvant to chemotherapy, ICIs demonstrated comparable efficacy in terms of OS, PFS, and pCR. This benefit is offset by an increase in immune-related adverse events, such as hyperthyroidism, hypothyroidism, pneumonitis, and adrenal insufficiency. We also demonstrated that Atezolizumab is safer than Pembrolizumab in the neoadjuvant setting. Only trials evaluating early-stage TNBC showed a significant improvement in pCR, implying that PD-1/PD-L1 inhibitors may be most effective when started early in the disease course. Durvalumab as a maintenance therapy instead of chemotherapy in patients with metastatic TNBC has also shown promising results in terms of survival extension. Future research should focus on PD-L1 expression status and TNBC subtypes, as these parameters may aid in the optimization of personalized treatment regimens for TNBC patients.

Availability of data and materials

Data used in this study analysis is provided in the Additional file 1 : (Table S7). Further analysis data requests and inquiries can be directed to the corresponding author.

Abbreviations

Confidence interval

Combined positive score

Food and Drug Administration

Generalized pairwise modelling

Hazard ratio

  • Immune checkpoint inhibitors

Intention-to-treat

  • Network meta-analysis
  • Pathological complete response

Programmed cell death protein 1

Programmed cell death ligand 1

Preferred reporting items for systematic reviews and meta-analyses

International prospective register of systematic reviews

Cochrane risk-of-bias tool 2

Surface under the cumulative ranking curve

Tumor infiltrating lymphocytes

Triple-negative breast cancer

Huang J, Chan PS, Lok V, Chen X, Ding H, Jin Y, Yuan J, Lao XQ, Zheng ZJ, Wong MC. Global incidence and mortality of breast cancer: a trend analysis. Aging. 2021;13(4):5748–803.

Article   PubMed   PubMed Central   Google Scholar  

Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–49.

Article   PubMed   Google Scholar  

Arnold M, Morgan E, Rumgay H, Mafra A, Singh D, Laversanne M, Vignat J, Gralow JR, Cardoso F, Siesling S, et al. Current and future burden of breast cancer: global statistics for 2020 and 2040. Breast. 2022;66:15–23.

Yao H, He G, Yan S, Chen C, Song L, Rosol TJ, Deng X. Triple-negative breast cancer: is there a treatment on the horizon? Oncotarget. 2017;8(1):1913–24.

Hsu JY, Chang CJ, Cheng JS. Survival, treatment regimens and medical costs of women newly diagnosed with metastatic triple-negative breast cancer. Sci Rep. 2022;12(1):729.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Yin L, Duan JJ, Bian XW, Yu SC. Triple-negative breast cancer molecular subtyping and treatment progress. Breast Cancer Res. 2020;22(1):61.

Zaheed M, Wilcken N, Willson ML, O’Connell DL, Goodwin A. Sequencing of anthracyclines and taxanes in neoadjuvant and adjuvant therapy for early breast cancer. Cochrane Database Syst Rev. 2019;2(2):Cd012873.

PubMed   Google Scholar  

de Ruijter TC, Veeck J, de Hoon JP, van Engeland M, Tjan-Heijnen VC. Characteristics of triple-negative breast cancer. J Cancer Res Clin Oncol. 2011;137(2):183–92.

Article   CAS   PubMed   Google Scholar  

Zeichner SB, Terawaki H, Gogineni K. A review of systemic treatment in metastatic triple-negative breast cancer. Breast Cancer. 2016;10:25–36.

CAS   PubMed   PubMed Central   Google Scholar  

Nanda R, Chow LQ, Dees EC, Berger R, Gupta S, Geva R, Pusztai L, Pathiraja K, Aktan G, Cheng JD, et al. Pembrolizumab in patients with advanced triple-negative breast cancer: phase Ib KEYNOTE-012 study. J Clin Oncol. 2016;34(21):2460–7.

Manjunath M, Choudhary B. Triple-negative breast cancer: a run-through of features, classification and current therapies. Oncol Lett. 2021;22(1):512.

Abdou Y, Goudarzi A, Yu JX, Upadhaya S, Vincent B, Carey LA. Immunotherapy in triple negative breast cancer: beyond checkpoint inhibitors. NPJ Breast Cancer. 2022;8(1):121.

Pathak N, Chitikela S, Malik PS. Recent advances in lung cancer genomics: application in targeted therapy. Adv Genet. 2021;108:201–75.

Popovic M, Matovina-Brko G, Jovic M, Popovic LS. Immunotherapy: a new standard in the treatment of metastatic clear cell renal cell carcinoma. World J Clin Oncol. 2022;13(1):28–38.

Bai X, Ni J, Beretov J, Graham P, Li Y. Immunotherapy for triple-negative breast cancer: a molecular insight into the microenvironment, treatment, and resistance. J Natl Cancer Cent. 2021;1(3):75–87.

Article   Google Scholar  

Quinn KM. Innovative approaches to immunotherapy in breast cancer. J Thorac Dis. 2020;12(8):4536–40.

Han Y, Liu D, Li L. PD-1/PD-L1 pathway: current researches in cancer. Am J Cancer Res. 2020;10(3):727–42.

Jiang X, Wang J, Deng X, Xiong F, Ge J, Xiang B, Wu X, Ma J, Zhou M, Li X, et al. Role of the tumor microenvironment in PD-L1/PD-1-mediated tumor immune escape. Mol Cancer. 2019;18(1):10.

Narayan P, Wahby S, Gao JJ, Amiri-Kordestani L, Ibrahim A, Bloomquist E, Tang S, Xu Y, Liu J, Fu W, et al. FDA approval summary: atezolizumab plus paclitaxel protein-bound for the treatment of patients with advanced or metastatic TNBC whose tumors express PD-L1. Clin Cancer Res. 2020;26(10):2284–9.

Brian H, Georgia S, Deborah MC, Anna C, Christopher HS, Chris C, John PAI, Sharon S, Kristian T, Jeroen PJ, Cynthia M, Ferrán C-L, Peter CG, Kay D, Isabelle B, Douglas GA, David M. The PRISMA extension statement for reporting of systematic reviews incorporating network meta-analyses of health care interventions: checklist and explanations. Ann Intern Med. 2015;162(11):777–84.

Clark JM, Sanders S, Carter M, Honeyman D, Cleo G, Auld Y, Booth D, Condron P, Dalais C, Bateup S, et al. Improving the translation of search strategies using the polyglot search translator: a randomized controlled trial. J Med Libr Assoc. 2020;108(2):195–207.

Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A. Rayyan-a web and mobile app for systematic reviews. Syst Rev. 2016;5(1):210.

Sterne JAC, Savović J, Page MJ, Elbers RG, Blencowe NS, Boutron I, Cates CJ, Cheng H-Y, Corbett MS, Eldridge SM, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. 2019;366:l4898.

White IR, Barrett JK, Jackson D, Higgins JPT. Consistency and inconsistency in network meta-analysis: model estimation using multivariate meta-regression. Res Synth Methods. 2012;3(2):111–25.

Woods BS, Hawkins N, Scott DA. Network meta-analysis on the log-hazard scale, combining count and hazard ratio statistics accounting for multi-arm trials: a tutorial. BMC Med Res Methodol. 2010;10(1):54.

Dias S, Welton NJ, Caldwell DM, Ades AE. Checking consistency in mixed treatment comparison meta-analysis. Stat Med. 2010;29(7–8):932–44.

Chaimani A, Salanti G. Using network meta-analysis to evaluate the existence of small-study effects in a network of interventions. Res Synth Methods. 2012;3(2):161–76.

Salanti G, Ades AE, Ioannidis JP. Graphical methods and numerical summaries for presenting results from multiple-treatment meta-analysis: an overview and tutorial. J Clin Epidemiol. 2011;64(2):163–71.

White IR. Network meta-analysis. Stata J. 2015;15(4):951–85.

Doi SAR, Barendregt JJ. A generalized pairwise modelling framework for network meta-analysis. Int J Evid Based Healthc. 2018;16(4):187–94.

Doi SAR, Barendregt JJ, Khan S, Thalib L, Williams GM. Advances in the meta-analysis of heterogeneous clinical trials I: the inverse variance heterogeneity model. Contemp Clin Trial. 2015;45:130–8.

Doi SAR, Barendregt JJ. A generalized pairwise modelling framework for network meta-analysis. JBI Evid Implement. 2018;16(4):187–94.

Google Scholar  

Miles D, Gligorov J, André F, Cameron D, Schneeweiss A, Barrios C, Xu B, Wardley A, Kaen D, Andrade L, et al. Primary results from IMpassion131, a double-blind, placebo-controlled, randomised phase III trial of first-line paclitaxel with or without atezolizumab for unresectable locally advanced/metastatic triple-negative breast cancer. Ann Oncol. 2021;32(8):994–1004.

Winer EP, Lipatov O, Im SA, Goncalves A, Muñoz-Couselo E, Lee KS, Schmid P, Tamura K, Testa L, Witzel I, et al. Pembrolizumab versus investigator-choice chemotherapy for metastatic triple-negative breast cancer (KEYNOTE-119): a randomised, open-label, phase 3 trial. Lancet Oncol. 2021;22(4):499–511.

Cortes J, Cescon DW, Rugo HS, Nowecki Z, Im SA, Yusof MM, Gallardo C, Lipatov O, Barrios CH, Holgado E, et al. Pembrolizumab plus chemotherapy versus placebo plus chemotherapy for previously untreated locally recurrent inoperable or metastatic triple-negative breast cancer (KEYNOTE-355): a randomised, placebo-controlled, double-blind, phase 3 clinical trial. Lancet. 2020;396(10265):1817–28.

Gianni L, Huang CS, Egle D, Bermejo B, Zamagni C, Thill M, Anton A, Zambelli S, Bianchini G, Russo S, et al. Pathologic complete response (pCR) to neoadjuvant treatment with or without atezolizumab in triple-negative, early high-risk and locally advanced breast cancer: NeoTRIP Michelangelo randomized study. Ann Oncol. 2022;33(5):534–43.

Loibl S, Schneeweiss A, Huober J, Braun M, Rey J, Blohmer JU, Furlanetto J, Zahm DM, Hanusch C, Thomalla J, et al. Neoadjuvant durvalumab improves survival in early triple-negative breast cancer independent of pathological complete response. Ann Oncol. 2022;33(11):1149–58.

Bachelot T, Filleron T, Bieche I, Arnedos M, Campone M, Dalenc F, Coussy F, Sablin MP, Debled M, Lefeuvre-Plesse C, et al. Durvalumab compared to maintenance chemotherapy in metastatic breast cancer: the randomized phase II SAFIR02-BREAST IMMUNO trial. Nat Med. 2021;27(2):250–5.

Mittendorf EA, Zhang H, Barrios CH, Saji S, Jung KH, Hegg R, Koehler A, Sohn J, Iwata H, Telli ML, et al. Neoadjuvant atezolizumab in combination with sequential nab-paclitaxel and anthracycline-based chemotherapy versus placebo and chemotherapy in patients with early-stage triple-negative breast cancer (IMpassion031): a randomised, double-blind, phase 3 trial. The Lancet. 2020;396(10257):1090–100.

Article   CAS   Google Scholar  

Schmid P, Adams S, Rugo HS, Schneeweiss A, Barrios CH, Iwata H, Diéras V, Hegg R, Im S-A, Shaw Wright G, et al. Atezolizumab and nab-paclitaxel in advanced triple-negative breast cancer. New Engl J Med. 2018;379(22):2108–21.

Schmid P, Cortes J, Pusztai L, McArthur H, Kümmel S, Bergh J, Denkert C, Park YH, Hui R, Harbeck N, et al. Pembrolizumab for early triple-negative breast cancer. New Engl J Med. 2020;382(9):810–21.

Nanda R, Liu MC, Yau C, Shatsky R, Pusztai L, Wallace A, Chien AJ, Forero-Torres A, Ellis E, Han H, et al. Effect of pembrolizumab plus neoadjuvant chemotherapy on pathologic complete response in women with early-stage breast cancer: an analysis of the ongoing phase 2 adaptively randomized I-SPY2 trial. JAMA Oncol. 2020;6(5):676–84.

Garrido-Castro AC, Graham N, Bi K, Park J, Fu J, Keenan T, Richardson ET, Pastorello R, Lange P, Attaya V, et al. Abstract P2–14–18: A randomized phase II trial of carboplatin with or without nivolumab in metastatic triple-negative breast cancer. Cancer Res. 2022;82(4_Supplement):P2-14-18-P12-14–8.

Ademuyiwa FO, Gao F, Chen I, Northfelt DW, Wesolowski R, Arora M, Brufsky A, Dees C, Santa-Maria CA, Connolly RM, et al. Abstract PD14–09: Nci 10013—a randomized phase 2 study of neoadjuvant carboplatin and paclitaxel, with or without atezolizumab in triple negative breast cancer (TNBC). Cancer Res. 2021;81(4_Supplement):PD14-09-PD14-09.

Zhang M, Song J, Yang H, Jin F, Zheng A. Efficacy and safety of PD-1/PD-L1 inhibitors in triple-negative breast cancer: a systematic review and meta-analysis. Acta Oncol. 2022;61(9):1105–15.

Xin Y, Shen G, Zheng Y, Guan Y, Huo X, Li J, Ren D, Zhao F, Liu Z, Li Z, et al. Immune checkpoint inhibitors plus neoadjuvant chemotherapy in early triple-negative breast cancer: a systematic review and meta-analysis. BMC Cancer. 2021;21(1):1261.

Villacampa G, Tolosa P, Salvador F, Sánchez-Bayona R, Villanueva L, Dienstmann R, Ciruelos E, Pascual T. Addition of immune checkpoint inhibitors to chemotherapy versus chemotherapy alone in first-line metastatic triple-negative breast cancer: a systematic review and meta-analysis. Cancer Treat Rev. 2022;104:102352.

Rizzo A, Cusmai A, Massafra R, Bove S, Comes MC, Fanizzi A, Rinaldi L, Acquafredda S, Gadaleta-Caldarola G, Oreste D, et al. Pathological complete response to neoadjuvant chemoimmunotherapy for early triple-negative breast cancer: an updated meta-analysis. Cells. 2022;11(12):1857.

Li Y, Xing L, Li F, Liu H, Gan L, Yang D, Wang M, Yin X, Li H, Ren G. Efficacy and safety of adding immune checkpoint inhibitors to neoadjuvant chemotherapy against triple-negative breast cancer: a meta-analysis of randomized controlled trials. Front Oncol. 2021;11:657634.

Brown LC, Loi S. Immune checkpoint inhibition in the treatment of early stage triple negative breast cancer: 2021 update. Breast. 2022;62 Suppl 1(Suppl 1):S29-s33.

Wang DY, Johnson DB, Davis EJ. Toxicities associated with PD-1/PD-L1 blockade. Cancer J. 2018;24(1):36–40.

Zheng J, Cui T, Gao Y, Li T. Retrospective analysis of immune-related adverse events of the immune checkpoint inhibitors of PD-1/PD-l1 in the Fujian provincial hospital. Eur J Inflam. 2022;20:1721727X221091540.

Carretero-González A, Otero I, Lora D, Carril-Ajuria L, Castellano D, de Velasco G. Efficacy and safety of anti-PD-1/PD-L1 combinations versus standard of care in cancer: a systematic review and meta-analysis. Oncoimmunology. 2021;10(1):1878599.

Chen F, Chen N, Gao Y, Jia L, Lyu Z, Cui J. Clinical progress of PD-1/L1 inhibitors in breast cancer immunotherapy. Front Oncol. 2021;11:724424.

Leite KRM, Barrios CH, Buzaid AC, Gagliato D, Gobbi H, Soares F. Applicability of PD-L1 tests to tailor triple-negative breast cancer treatment in Brazil. Surg Exp Pathol. 2021;4(1):10.

Emens LA, Goldstein LD, Schmid P, Rugo HS, Adams S, Barrios CH, Schneeweiss A, Dieras V, Iwata H, Chang C-W, et al. The tumor microenvironment (TME) and atezolizumab + nab-paclitaxel (A+nP) activity in metastatic triple-negative breast cancer (mTNBC): IMpassion130. J Clin Oncol. 2021;39(15_suppl):1006–1006.

Emens LA, Molinero L, Loi S, Rugo HS, Schneeweiss A, Diéras V, Iwata H, Barrios CH, Nechaeva M, Nguyen-Duc A, et al. Atezolizumab and nab-paclitaxel in advanced triple-negative breast cancer: biomarker evaluation of the IMpassion130 study. J Natl Cancer Inst. 2021;113(8):1005–16.

Download references

Acknowledgements

QNL Funded the APC publication of this article.

Open Access funding provided by the Qatar National Library. This research did not receive any specific grants from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and affiliations.

College of Medicine, QU Health, Qatar University, P. O. Box: 2713, Doha, Qatar

Ibrahim Elmakaty, Ruba Abdo, Ahmed Elsabagh & Abdelrahman Elsayed

Pathology Unit, Department of Basic Medical Sciences, College of Medicine, QU Health, Qatar University, Doha, Qatar

Mohammed Imad Malki

You can also search for this author in PubMed   Google Scholar

Contributions

IE: Data curation, Formal analysis, Investigation, Methodology, Writing—original draft, Writing—review and editing, illustration of tables and figures. RA: Data curation, Formal analysis, Writing—original draft, Writing—review and editing. AE: Data curation, Formal analysis, Writing—original draft, Writing—review and editing. AE: Data curation, Formal analysis, Writing—review and editing. MIM: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Supervision, Validation, Funding Acquisition, Writing—original draft, Writing—review and editing. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mohammed Imad Malki .

Ethics declarations

Ethics approval and consent to participate.

Not applicable. All the work was developed using published data.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: table s1:.

PRISMA checklist for this network meta-analysis. Table S2: Excluded articles at full-text screening. Table S3: Overall survival treatment ranking and surface under the cumulative ranking curve. Figure S1: Overall survival using generalized pairwise modelling. Table S4: Progression free survival treatment ranking and surface under the cumulative ranking curve. Figure S2: Progression free survival using generalized pairwise modelling. Table S5: Pathologic complete response treatment ranking and surface under the cumulative ranking curve. Figure S3: Pathologic complete response using generalized pairwise modelling. Table S6: Adverse events grade ≥ 3 treatment ranking and surface under the cumulative Table. Figure S4: Adrenal insufficiency odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S5: Diarrhea odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S6: Hyperthyroidism odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S7: Hypothyroidism odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S8: Infusion reaction odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S9: Pneumonitis odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S10: Anemia odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S11: Colitis odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S12: Fatigue odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S13: Nausea odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S14: Neutropenia odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S15: Rash odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S16: Vomiting odds network meta-analysis results. A Schematic diagram showing the network map for the treatments included in the analysis. B Rankogram showing the ranking probabilities for the least odds of causing this adverse event for each treatment. C Forest plot showing each trial effect size and confidence interval as well as the pooled effect size. D Bias-adjusted funnel plot showing each treatment separately. Figure S17: Adverse events grade ≥ 3 using generalized pairwise modelling. Table S7: Extracted data used for the analysis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article.

Elmakaty, I., Abdo, R., Elsabagh, A. et al. Comparative efficacy and safety of PD-1/PD-L1 inhibitors in triple negative breast cancer: a systematic review and network meta-analysis of randomized controlled trials. Cancer Cell Int 23 , 90 (2023). https://doi.org/10.1186/s12935-023-02941-7

Download citation

Received : 11 January 2023

Accepted : 07 May 2023

Published : 11 May 2023

DOI : https://doi.org/10.1186/s12935-023-02941-7

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • PD-1/PD-L1 inhibitors
  • Systematic review

Cancer Cell International

ISSN: 1475-2867

literature review of error analysis

A Systematic Literature Review of Human Error and Machine Error in Accident Investigation

  • First Online: 01 November 2022

Cite this chapter

Book cover

  • Nathan B. Rowland Miller 6  

Part of the book series: Automation, Collaboration, & E-Services ((ACES,volume 11))

515 Accesses

The presence of error in systems has been an essential part of analyzing accidents and incidents when investigations occur. More importantly, it is imperative to look at the overall impacts of human error and machine error systems and how they impact the investigation and accident. For this study of the effects and implications of both, literature reviews were conducted, and tools like Purdue Databases, MAXQDA, Vicintas, VOSviewer, and Harzing's Publish or Perish were used to gather data. These tools analyzed the most prevalent articles in the study through co-citation analysis, and significant trends were mapped and tracked. From this, the conclusion is found that the relationship between human error and machine error is dependent on one another, as the presence of one can cause the other. Additionally, in this analysis, consideration was given when looking into the relationship that machine AI and accidents and errors have. The conclusion reached is that while a connection is evident between the two points and there are systems in the works, these systems need to be implemented in a more robust fashion and with more stability in this practice.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
  • Durable hardcover edition

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Dempsey PG (2012) Accident and incident investigation. In: Handbook of human factors and ergonomics, pp 1083–1091. Wiley, Hoboken, NJ, USA. https://doi.org/10.1002/9781118131350.ch38

Reinach S, Viale A (2006) Application of a human error framework to conduct train accident/incident investigations. Accid Anal Prev 38(2):396–406. https://doi.org/10.1016/j.aap.2005.10.013

Article   Google Scholar  

Endres A (1975) An analysis of errors and their causes in system programs. IEEE Trans Softw Eng SE-1(2):140–149. https://doi.org/10.1109/TSE.1975.6312834

Che H, Zeng S, Guo J (2019) Reliability assessment of man-machine systems subject to mutually dependent machine degradation and human errors. Reliab Eng Syst Saf 190:106504. https://doi.org/10.1016/j.ress.2019.106504

Sharit J (2012) Human error and human reliability analysis. In: Handbook of human factors and ergonomics, pp 734–800. Wiley, Hoboken, NJ, USA. https://doi.org/10.1002/9781118131350.ch26

Cacciabue PC (1988) Evaluation of human factors and man-machine problems in the safety of nuclear power plants. Nucl Eng Des 109(3):417–431. https://doi.org/10.1016/0029-5493(88)90287-7

Chauvin C, Lardjane S, Morel G, Clostermann JP, Langard B (2013) Human and organisational factors in maritime accidents: analysis of collisions at sea using the HFACS. Accid Anal Prev 59:26–37. https://doi.org/10.1016/j.aap.2013.05.006

Mendeley Reference Manager (n.d). Accessed April 30, 2021. https://www.mendeley.com/reference-manager/library/all-references

Web of Science [v.5.35]—Web of Science Core Collection Basic Search (n.d). Accessed April 29, 2021. https://apps-webofknowledge-com.ezproxy.lib.purdue.edu/WOS_GeneralSearch_input.do?product=WOS&search_mode=GeneralSearch&SID=6EKEdLuDT2UBSrhNcCA&preferencesSaved=

VOSviewer—Visualizing Scientific Landscapes (n.d). Accessed April 30, 2021. https://www.vosviewer.com/

MAXQDA | All-In-One Tool for Qualitative Data Analysis & Mixed Methods (n.d). Accessed May 1, 2021. https://www.maxqda.com/

Vicinitas : Twitter Analytics Tool for Tracking Hashtags, Keywords, and Accounts (n.d). Accessed May 1, 2021. https://www.vicinitas.io/

Rasmussen J (2021) General rights risk management in a dynamic society a modelling problem. Downloaded from Orbit.Dtu.Dk On. https://doi.org/10.1016/S0925-7535(97)00052-0

Cacciabue PC (2004) Human error risk management for engineering systems: a methodology for design, safety assessment, accident investigation and training. Reliab Eng Syst Saf 83(2):229–240. https://doi.org/10.1016/j.ress.2003.09.013

WHO | Global Status Report on Road Safety 2015 (2018). WHO. http://www.who.int/violence_injury_prevention/road_safety_status/2015/en/

Global Status Report on Road Safety 2018 (n.d). Accessed May 1, 2021. https://www.who.int/publications/i/item/9789241565684

Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature. Nature Publishing Group. https://doi.org/10.1038/nature14539

NSF Award Search: Award # 1918140—FMitF: Collaborative Research: Track I: Preventing Human Errors in Cyber-Human Systems with Formal Approaches to Human Reliability Rating and Model Repair (n.d). Accessed April 29, 2021. https://nsf.gov/awardsearch/showAward?AWD_ID=1918140&HistoricalAwards=false

Pot M, Kieusseyan N, Prainsack B (2021) Not all biases are bad: equitable and inequitable biases in machine learning and radiology. In: Insights into imaging. Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1186/s13244-020-00955-7

Download references

Author information

Authors and affiliations.

School of Industrial Engineering, Purdue University, West Lafayette, IN, 47906, USA

Nathan B. Rowland Miller

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Nathan B. Rowland Miller .

Editor information

Editors and affiliations.

School of Industrial Engineering, Department of Agricultural and Biological Engineering, Purdue University, West Lafayette, IN, USA

Vincent G. Duffy

Industrial and Manufacturing Engineering, The Pennsylvania State University, University Park, PA, USA

Steven J. Landry

Department of Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, WI, USA

John D. Lee

Human Factors in Transport within Engineering and Physical Sciences, University of Southampton, Southampton, UK

Neville Stanton

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Rowland Miller, N.B. (2023). A Systematic Literature Review of Human Error and Machine Error in Accident Investigation. In: Duffy, V.G., Landry, S.J., Lee, J.D., Stanton, N. (eds) Human-Automation Interaction. Automation, Collaboration, & E-Services, vol 11. Springer, Cham. https://doi.org/10.1007/978-3-031-10784-9_18

Download citation

DOI : https://doi.org/10.1007/978-3-031-10784-9_18

Published : 01 November 2022

Publisher Name : Springer, Cham

Print ISBN : 978-3-031-10783-2

Online ISBN : 978-3-031-10784-9

eBook Packages : Intelligent Technologies and Robotics Intelligent Technologies and Robotics (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research
  • Open access
  • Published: 15 June 2023

Association of cigarette smoking habits with the risk of prostate cancer: a systematic review and meta-analysis

  • Xiangwei Yang 1   na1 ,
  • Hong Chen 2   na1 ,
  • Shiqiang Zhang 1 ,
  • Xianju Chen 1 ,
  • Yiyu Sheng 1 &
  • Jun Pang 1  

BMC Public Health volume  23 , Article number:  1150 ( 2023 ) Cite this article

2034 Accesses

2 Citations

Metrics details

Association of cigarette smoking habits with the risk of prostate cancer is still a matter of debate. This systematic review and meta-analysis aimed to assess the association between cigarette smoking and prostate cancer risk.

We conducted a systematic search on PubMed, Embase, Cochrane Library, and Web of Science without language or time restrictions on June 11, 2022. Literature search and study screening were performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement. Prospective cohort studies that assessed the association between cigarette smoking habits and the risk of prostate cancer were included. Quality assessment was conducted using the Newcastle–Ottawa Scale. We used random-effects models to obtain pooled estimates and the corresponding 95% confidence intervals.

A total of 7296 publications were screened, of which 44 cohort studies were identified for qualitative analysis; 39 articles comprising 3 296 398 participants and 130 924 cases were selected for further meta-analysis. Current smoking had a significantly reduced risk of prostate cancer (RR, 0.74; 95% CI, 0.68–0.80; P  < 0.001), especially in studies completed in the prostate-specific antigen screening era. Compared to former smokers, current smokers had a significant lower risk of PCa (RR, 0.70; 95% CI, 0.65–0.75; P  < 0.001). Ever smoking showed no association with prostate cancer risk in overall analyses (RR, 0.96; 95% CI, 0.93–1.00; P  = 0.074), but an increased risk of prostate cancer in the pre-prostate-specific antigen screening era (RR, 1.05; 95% CI, 1.00–1.10; P  = 0.046) and a lower risk of prostate cancer in the prostate-specific antigen screening era (RR, 0.95; 95% CI, 0.91–0.99; P  = 0.011) were observed. Former smoking did not show any association with the risk of prostate cancer.

Conclusions

The findings suggest that the lower risk of prostate cancer in smokers can probably be attributed to their poor adherence to cancer screening and the occurrence of deadly smoking-related diseases, and we should take measures to help smokers to be more compliant with early cancer screening and to quit smoking.

Trial registration

This study was registered on PROSPERO (CRD42022326464).

Peer Review reports

Prostate cancer (PCa) is the second most commonly diagnosed cancer and the fifth leading cause of cancer death among males, with an estimated 1.4 million new cases and 375 000 deaths worldwide in 2020, accounting for 7.3% and 3.8% of all cancers diagnosed, respectively [ 1 ]. Various endogenous and exogenous risk factors for PCa have been discussed for decades. Several factors have been identified to be associated with an increased risk of PCa, for instance, family history [ 2 ], elevated hormone levels [ 2 ], black ethnicity [ 2 ], and high alcohol consumption [ 3 ]. Conversely, several factors have been associated with a decreased risk of PCa, such as higher intake of tomatoes [ 4 ], increased coffee consumption [ 5 ] and sexual activity [ 6 ].

Smoking is a well-established risk factor for several cancers, such as lung cancer, head and neck cancer, bladder cancer, and esophageal cancer [ 7 , 8 ]. However, the data on the association between smoking and PCa incidence are conflicting [ 9 , 10 ]. In a meta-analysis of 24 prospective cohort studies [ 11 ], M. Huncharek showed that current smokers had no increased risk of incident PCa, but in data stratified by amount smoked, a significant elevated risk was observed, and former smokers had a higher risk of PCa in comparison with never smokers. Another meta-analysis conducted in 2014 [ 12 ] revealed an inverse association between current smoking and PCa risk, while in studies completed before the prostate-specific antigen (PSA) screening era, ever smoking was positively associated with PCa. In addition, a recent pooled study of five Swedish cohorts [ 13 ] demonstrated that former smokers and current smokers had a lower risk of PCa than never smokers, and smoking intensity was inversely associated with PCa risk, especially in the PSA screening era.

Biological mechanisms underlying smoking and PCa risk have been studied for many years. Burning cigarettes can produce more than 7000 chemicals, and at least 70 carcinogens such as polycyclic aromatic hydrocarbons (PAHs) and cadmium [ 14 ]. Mutations or functional polymorphism in genes involved in PAH metabolism and detoxification may increase the risk of PCa [ 15 ]. The glutathione-S-transferases (GSTs) are a class of enzymes that can detoxify PAHs. The most common subtypes of GSTs in human prostate are GSTP and GSTM, which were reported to be associated with an increased risk of PCa in smokers [ 15 , 16 ]. Cadmium induces prostate carcinogenesis through interaction with the androgen receptor because of its androgen-like activity, and it also enhances androgen-mediated transcriptional activity when in combination with the androgen [ 17 ]. A higher level of androgen was related to increased PCa risk [ 2 , 18 ]. Smoking can increase testosterone concentrations by promoting testosterone secretion from Leydig cells or acting as an aromatase inhibitor [ 19 ]. Mutations in the p53 gene and CYP1A1 gene showed a higher risk of PCa in smokers, suggesting that smoking may have a joint effect on PCa risk when combined with susceptible genotypes [ 20 ]. Increased heme oxygenase 1 (HO-1) messenger RNA expression and upregulated HO-1 protein levels were observed in PCa cell lines DU 145 and PC3 [ 21 ], implying that HO-1 may play a role in the development of PCa for its function in promoting angiogenesis [ 22 ]. Evidence also suggested that prostatic inflammation may be involved in the development and progression of PCa [ 23 ]. Cigarette smoke augments the production of numerous pro-inflammatory cytokines, decreases the levels of anti-inflammatory cytokines, and activates macrophage and dendritic cell activity in many ways [ 24 ].

We performed this systematic review and meta-analysis to investigate the association of cigarette smoking habits with the risk of PCa. We aimed to include a larger sample of studies than previous meta-analyses and collect the latest evidence and the most comprehensive information on the association between cigarette smoking and PCa risk. Our primary objective was to assess the risk of PCa in current smokers, former smokers, and ever smokers. We hypothesized that smokers have a higher risk of PCa compared to non-smokers.

Search strategy

This systematic review and meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [ 25 ]. Two independent investigators (XWY and HC) searched PubMed, Embase, Cochrane Library, and Web of Science for publications from database inception to June 11, 2022. The following search terms were used: ("Prostate cancer") AND ("Cigarette" OR "Smoking" OR "Tobacco") AND ("Risk" OR "Incidence"). No language restrictions were applied. Reference lists of identified articles and relevant reviews were screened for additional studies. Details of the protocol for this systematic review were registered on PROSPERO and can be accessed by CRD42022326464.

Selection criteria

Prospective cohort studies investigating the association between cigarette smoking and PCa risk were included for analysis. The primary outcome was the risk of PCa. Those studies that provided an effect measure (i.e., a relative risk) quantifying the impact of smoking on the risk of PCa were considered for further quantitative synthesis (meta-analysis). The removal of duplicates and assessment of article eligibility were conducted independently by XWY and HC, and any disagreements were resolved by consulting the senior author (JP). Review articles, editorials, meeting abstracts, case‒control studies, cross-sectional studies, and those not on the topic were excluded.

Quality assessment

All included studies were independently assessed by XWY and HC for risk of bias using the Newcastle‒Ottawa Scale for cohort studies [ 26 ]. This scale assesses the selection of the study groups, the comparability of the groups, and the ascertainment of the outcome of interest. Studies with 7–9 scores were considered to be of high quality, those with 5–6 scores were classified as intermediate quality, and those with less than 4 scores were classified as low quality. Disagreements in the quality assessment were resolved by consulting JP.

Data extraction

Data were independently extracted by XWY and HC. All extracted variables were cross-checked to ensure their reliability. We recorded the total number of participants, PCa cases, and the mean or median follow-up time across all included studies. Relative risks (RRs) and the corresponding 95% confidence intervals (CIs) were retrieved or calculated using frequency distributions. Considering the prevalence rate of PCa in the public, we believed that the odds ratio was close to the RR [ 27 , 28 ]. Hazard ratios (HRs) and RRs are different, HRs contain temporal information but RRs do not [ 28 ]. We converted HRs to RRs based on the formula provided by Shor E et al. [ 29 ], and the corresponding 95% CIs were converted using the same method. RRs and 95% CIs of ever smokers were computed by combining the results for former and current smokers when these results were not reported in the original papers. In addition, we recorded the baseline characteristics, methods, adjusted confounding factors, and other important comments to establish comparability. Discrepancies were discussed and resolved by consensus.

Statistical analysis

Three authors (SZQ, XJC and YYS) performed statistical analyses using Stata software, version 16.0 (StataCorp). When both crude and adjusted RRs were provided, we used the most fully adjusted value. We calculated the pooled RRs and 95% CIs and plotted forest plots using random-effects models (DerSimonian‒Laird method) for the association of current smoking, former smoking, and ever smoking with the risk of PCa [ 30 ]. Statistical heterogeneity across the trials was assessed using the I 2 statistic and the Cochran’s Q test. Values of the I 2 statistic of approximately 25%, 50%, and 75% were interpreted as low, moderate, and high heterogeneity, respectively [ 31 ]. In the case of low heterogeneity, a fixed-effects model (Inverse variance method) was applied. We plotted funnel plots and used Egger’s test to examine publication bias. Additionally, a series of sensitivity analyses were performed to assess the robustness of our results. We stratified studies by reference status (never smoker, former smokers), completion year (pre-PSA screening era vs. PSA screening era), world region (North America vs. Europe vs. Asia vs. Australia), and the Newcastle‒Ottawa Scale score (≤ 6 points vs. > 6 points). We considered 1995 as a cutoff year of study completion to distinguish studies before and after the PSA screening era [ 12 ]. All tests were two-tailed, and P  < 0.05 was considered statistically significant.

Study population

We identified 7296 citations, and after removing duplicates, 4963 citations remained for screening. After the removal of ineligible citations, we retained 60 articles that we assessed for eligibility by reading the full text; 16 of these were excluded for specific reasons. Finally, 44 studies met our inclusion criteria for qualitative synthesis and meta-analysis (Fig.  1 ). The number of participants and PCa cases from each selected study for systematic review ranged separately from 997 to 844 455 and 54 to 40 821, with a median of 22 677 and 382, respectively. Overall, 39 studies with 3 296 398 participants and 130 924 cases were identified for meta-analysis, and 5 studies with 91 377 participants and 1364 cases were not included in meta-analysis due to lack of information (Additional file 1 ). Articles were published between 1989 and 2022 and were from studies conducted in the following geographic regions: 19 from Europe (4 from the United Kingdom, 4 from Norway, 3 from Sweden, 2 from Finland, 1 from France, 1 from the Netherlands, 1 from Denmark, 1 from Lithuania, 1 from Iceland, and one from 10 European countries), 18 from North America (17 from the United States, 1 from Canada), 5 from Asia (3 from Japan, 1 from South Korea, 1 from Singapore), and 2 from Australia. The median score of quality assessment for all eligible studies was 7, with a range of 6–9 (Additional file 2 ).

figure 1

Flow diagram of included studies

Current smoking

In total, 37 studies [ 6 , 13 , 32 , 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 53 , 54 , 55 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 63 , 64 , 65 , 66 ] reported the risk of current smoking on PCa, among which 6 studies [ 32 , 35 , 41 , 53 , 55 , 63 ] took non-smokers as the reference and the remaining 31 studies [ 6 , 13 , 33 , 34 , 36 , 37 , 38 , 39 , 40 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 54 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 64 , 65 , 66 ] took never smokers as the reference. We defined non-smokers as never smokers plus former smokers. RRs and 95% CIs of current smokers versus non-smokers were calculated using frequency distributions in never smokers and former smokers when the risk estimates were not provided in original studies. Ten studies [ 34 , 36 , 38 , 39 , 42 , 43 , 54 , 58 , 59 , 66 ] did not provide enough data on frequency distribution and were not included in analysis. Twenty-seven studies [ 6 , 13 , 32 , 33 , 35 , 37 , 40 , 41 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 53 , 55 , 56 , 57 , 60 , 61 , 62 , 63 , 64 , 65 ] were included to calculate the pooled RR and 95% CI. The results showed that current smoking at baseline was associated with a reduced risk of PCa (RR, 0.74; 95% CI, 0.68–0.80; P  < 0.001) (Fig.  2 ). The I 2 statistic and the Cochran’s Q test showed high heterogeneity (I 2  = 90.5%; P  < 0.001). Inspection of the funnel plot did not demonstrate publication bias ( P  = 0.231; Fig.  3 ).

figure 2

Forest plot for the association between current smoking and prostate cancer. RR, relative risk; CI, confidence interval; PCa, prostate cancer; US, United States; UK, United Kingdom. a Rohrmann et al. [ 49 ] had two sub-populations. b RR and 95% CI were calculated using frequency distributions. c RR and 95% CI were converted from HR and corresponding 95% CI using the formula RR ≈ (1-e HR x ln (1−P0) )/P 0 (P 0 refers to the incidence rate of PCa in the control group). d Weights were from random effects analysis

figure 3

Funnel plot for publication bias in the studies investigating current smoking and prostate cancer risk. SE, standard error. Twenty-eight dots from 27 studies. P  = 0.231

When performing sensitivity analyses (Additional file 3 ) stratified by reference status, studies using never smokers as the reference [ 6 , 13 , 33 , 34 , 36 , 37 , 38 , 39 , 40 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 54 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 64 , 65 , 66 ] showed a similar inverse association with PCa risk (RR, 0.90; 95% CI, 0.86–0.95; P  < 0.001), with the heterogeneity lower than that of analysis of studies using non-smokers as the reference (I 2  = 66.7%; P  < 0.001). Compared to former smokers, current smokers had a significant lower risk of PCa (RR, 0.70; 95% CI, 0.65–0.75; P  < 0.001) based on 21 studies [ 6 , 13 , 33 , 37 , 40 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 56 , 57 , 60 , 61 , 62 , 64 , 65 ]. In the pre-PSA screening era, current smoking showed a decreased risk of PCa (RR, 0.79; 95% CI, 0.64–0.98; P = 0.033) compared to non-smokers, while in the PSA screening era, the risk was significantly lower (RR, 0.72; 95% CI, 0.66–0.79; P  < 0.001). When stratified by world region, studies conducted in North America, Europe, Asia, and Australia showed a negative association between current smoking and PCa risk. We also performed subgroup analyses in 21 studies with quality scores ≥ 7 [ 6 , 13 , 32 , 33 , 35 , 40 , 44 , 45 , 46 , 49 , 50 , 51 , 52 , 55 , 56 , 57 , 60 , 61 , 62 , 64 , 65 ] and 6 studies with quality scores of 6 [ 37 , 41 , 47 , 48 , 53 , 63 ]. Thereupon, both demonstrated a reduced risk of PCa.

Former smoking

Meta-analysis on former smoking as a risk factor for PCa was performed in 31 studies (Fig.  4 ) [ 6 , 13 , 33 , 34 , 36 , 37 , 38 , 39 , 40 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 54 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 64 , 65 , 66 ], and the results showed no significant association between former smoking and the risk of PCa (RR, 0.98; 95% CI, 0.95–1.02; P  = 0.313). The data were heterogeneous according to the I 2 statistic and the Cochran’s Q test (I 2  = 61.5%; P  < 0.001). Inspection of the corresponding funnel plot did not show evidence of publication bias ( P  = 0.431; Fig.  5 ). Sensitivity analyses stratified by PSA screening era, world region, and quality score also demonstrated no association between former smoking and PCa risk (Additional file 3 ).

figure 4

Forest plot for the association between former smoking and prostate cancer. a Rohrmann et al. [ 49 ] had two sub-populations. b RR and 95% CI were calculated using frequency distributions. c RR and 95% CI were converted from HR and corresponding 95% CI using the formula RR ≈ (1-e HR x ln (1−P0) )/P 0 (P 0 refers to the incidence rate of PCa in the control group). d Weights were from random effects analysis

figure 5

Funnel plot for publication bias in the studies investigating former smoking and prostate cancer risk. Thirty-two dots from 31 studies. P  = 0.431

Ever smoking

Thirty-three studies were included in the meta-analysis to assess the association of ever smoking with the risk of PCa (Fig.  6 ) [ 6 , 13 , 33 , 34 , 36 , 37 , 38 , 39 , 40 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 54 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 64 , 65 , 66 , 67 , 68 ]. Two of those studies [ 67 , 68 ] provided RRs and 95% CIs in the original paper, and the risk estimates of the remaining 31 studies [ 6 , 13 , 33 , 34 , 36 , 37 , 38 , 39 , 40 , 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 , 50 , 51 , 52 , 54 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 64 , 65 , 66 ] were calculated by combining results for former and current smokers. Thereupon, the pooled RR and 95% CI showed no association with the risk of PCa (RR, 0.96; 95% CI, 0.93–1.00; P  = 0.074), with an I 2 value of 67.0% and a negative result of publication bias (( P  = 0.672; Fig.  7 ). The association was inverse when analyzing studies completed in the PSA screening era (RR, 0.95; 95% CI, 0.91–0.99; P  = 0.011), but in the pre-PSA screening era, ever smokers showed a significantly increased risk of PCa compared to never smokers (RR, 1.05; 95% CI, 1.00–1.10; P  = 0.046) (Additional file 3 ). Four studies [ 50 , 57 , 60 , 67 ] from Asia showed a pooled reduced risk of PCa in ever smokers (RR, 0.82; 95% CI, 0.74–0.91; P  < 0.001), and studies from North America, Europe, and Australia revealed no association between ever smoking and PCa incidence. In terms of subgroup analyses stratified by quality score, the studies with a quality score ≥ 7 showed a modest negative association with PCa risk (RR, 0.96; 95% CI, 0.92–1.00; P  = 0.047), while the studies with a quality score of 6 showed no association.

figure 6

Forest plot for the association between ever smoking and prostate cancer. DM, diabetes mellitus. a Rohrmann et al. [ 49 ] had two sub-populations. b RR and 95% CI were calculated using frequency distributions or risk estimates and 95% CIs in subgroups. c RR and 95% CI were converted from HR and corresponding 95% CI using the formula RR ≈ (1-e HR x ln (1−P0) )/P 0 (P 0 refers to the incidence rate of PCa in the control group). d Weights were from random effects analysis. e Onitilo et al. [ 68 ] had two sub-populations

figure 7

Funnel plot for publication bias in the studies investigating ever smoking and prostate cancer risk. Thirty-five dots from 33 studies. P  = 0.672

Studies not included in the meta-analysis

Of these 5 studies (Additional file 1 ) [ 69 , 70 , 71 , 72 , 73 ], 4 studies (involving 211 cases, 524 cases, 127 cases, and 129 cases) [ 69 , 70 , 72 , 73 ] reported no significant association between cigarette smoking and the risk of PCa, 2 of which had a smoking category increment of 10 cigarette per day [ 69 ] or cigarette pack-years per 10 years [ 72 ]. The study conducted by Karlsen et al. [ 73 ] did not differentiate cigarette, cigar, cheroot, and pipe when assessing the risk of PCa in smokers, and as a result, this study could not be included in the meta-analysis. In the study conducted by Chamie et al. [ 71 ], a reduced PCa risk was reported in participants with a smoking history (with 13,144 participants and 363 cases; RR, 0.78; 95% CI, 0.72–0.85; P  < 0.001).

In this systematic review and meta-analysis, we found that current smoking was inversely associated with the risk of PCa, especially in the PSA screening era, which was inconsistent with our hypothesis but was consistent with the results of the recent studies [ 12 , 13 ]. In studies using never smokers as the reference, current smoking revealed a similar negative correlation with PCa risk, accompanied by less heterogeneity. Current smokers had a lower risk of PCa compared to former smokers. Former smoking and ever smoking were not associated with PCa risk in the overall analyses. However, when stratified by completion year, ever smoking showed an increased risk of PCa in the pre-PSA screening era and a lower risk of PCa in the PSA screening era. Studies from North America, Europe, Asia, and Australia showed a similar reduced PCa risk in current smokers compared to non-smokers, whereas in ever smokers, only studies conducted in Asia demonstrated a decreased risk of PCa. There are several explanations for these results. Current smoking was believed to be associated with a lower likelihood of PSA testing [ 74 , 75 ], and individuals with a smoking history were less likely to undergo prostate biopsy [ 62 , 76 ]. As a consequence, the detection rate of PCa could be relatively lower among participants in the PSA screening era. The difference in the patterns of the association between ever smoking and PCa risk in Asia and other regions can be attributed to the higher proportion of studies in the PSA screening era in Asia than afterward. Additionally, the differences in race/ethnicity, socioeconomic status, educational attainment, and health literacy may also play important roles in explaining regional distinctions [ 77 , 78 , 79 ]. In a national cross-sectional survey, PSA testing was significantly higher in US-born men and older non-Hispanic White men than in foreign-born men and men from other racial categories [ 77 ]. Another study revealed that White men aged > 50 years were more likely than Black men to undergo PSA testing, and those with lower socioeconomic status were associated with less PSA testing [ 78 ]. The association of education levels with the preference for PSA screening was inconsistent [ 77 , 79 ]. Johnson JA et al. [ 77 ] declared that higher educational levels were associated with higher odds of ever having had a PSA test; however, Pickles K et al. [ 79 ] announced that the preference for PSA screening was stronger in those without tertiary education and with inadequate health literacy. The age of the participants in the selected studies varied widely, and therefore, the willingness to receive PSA screening differs considerably; older people often show poorer adherence to PSA testing guidelines [ 77 ]. On the other hand, the relationship between PSA levels and smoking is still a matter of debate. According to an Italian cross-sectional study [ 80 ], PSA accuracy was reported to be lower in smokers than in nonsmokers and former smokers, suggesting that the need for PSA-based prostate biopsy can be affected to a certain extent by smoking.

Another possible explanation is that smoking is the leading risk factor for death among males [ 81 ]. Smokers may die from smoking attributable diseases including cancers, cardiovascular diseases, and respiratory diseases before their diagnosis of PCa. The majority of cases of lung cancer [ 7 ], head and neck cancer [ 82 ], approximately 50% of bladder cancer cases [ 83 ], and 49% of esophageal squamous cell carcinoma cases [ 84 ] are caused by cigarette smoking. Furthermore, smoking was reported to cause nearly 90% of lung cancer deaths [ 7 ] and showed significant associations with poor survival in patients with head and neck cancer [ 85 ]. Moreover, the detection of asymptomatic PCa can be frequently ignored when focusing on a more aggressive cancer. In addition, smoking increases the risk for stroke and coronary heart disease by 2 to 4 times, and stroke and coronary heart disease are considered to be the leading causes of death in the United States [ 8 ], and most of these deaths are caused by smoking [ 86 ]. Smoking can also cause chronic obstructive pulmonary disease (COPD), increasing 12 to 13-fold risk of dying from COPD than nonsmokers [ 8 ], and nearly 80% of deaths from COPD can be ascribed to smoking [ 86 ].

Our study found an increased risk of PCa among ever smokers in the pre-PSA screening era, indicating that it is necessary to promote smoking cessation as early as possible. Nearly one in five deaths are caused by cigarette smoking in the United States, leading to more than 480 000 deaths each year [ 8 ]. Continued tobacco use has been shown to limit the effectiveness of major cancer treatments, increase the risk of treatment-related complications and the development of secondary cancers, and lower cancer survival rates and the quality of life of patients [ 7 ]. In patients with PCa, smokers at the time of PCa diagnosis are associated with more aggressive characteristics, and the risk of experiencing biochemical recurrence, distant metastasis, cancer-specific mortality, and overall mortality is much higher [ 9 , 10 , 12 , 87 , 88 ]. Nicotine-induced chronic prostatic inflammation [ 23 , 89 ], aberrant CpG methylations of adenomatous polyposis coli and glutathione S-transferase pi are the potential biological mechanisms responsible for these [ 90 ]. Although the effect of smoking cessation on PCa progression remains unclear, the negative impact of smoking has suggested to be maintained as long as 10 years after smoking cessation [ 10 ]. Additionally, active smoking is associated with adverse reproductive health outcomes, type 2 diabetes mellitus, and rheumatoid arthritis, harming nearly every organ of the body and resulting in significant economic costs for smokers, their families, and society [ 7 ].

Much progress has been made in promoting smoking cessation in recent decades. However, it is far from sufficient. In 2018, 13.7% of all adults (34.2 million people) in the United States were reported as current cigarette smokers [ 91 ]. Of them, 55.1% had made an attempt to quit in the past 12 months, but only 7.5% achieved success. Overcoming both physical nicotine dependence and long-standing rewarding behavior is a huge challenge, and most individuals relapse within 3 months after quitting smoking [ 92 ]. Evidence has indicated that the combination of behavioral and pharmacological interventions produces the largest cessation effects [ 7 , 8 , 92 ]. Nevertheless, fewer than one-half of tobacco users were offered cessation treatment according to a survey of oncology providers [ 93 ], and the inability to get patients to quit and patient resistance to treatment are two dominant barriers to cessation intervention. A brief intervention may be more acceptable and sustainable to help smokers quit smoking, according to a randomized clinical trial performed at emergency departments in Hong Kong [ 94 ]. Quitlines are good alternatives to interventions for both patients and clinicians because of their convenience and specialization, and their roles in improving smoking cessation rates have been confirmed [ 95 ]. For smokers with time constraints, internet-based self-help materials such as the website smokefred.gov and newer smartphone applications have also shown benefits in promoting smoking cessation and can serve as good alternatives [ 96 , 97 ].

Strengths and limitations

The key strength of this systematic review is that the study comprised a total of 44 prospective cohort studies, 39 of which were included in the meta-analysis, with the largest number of participants and PCa cases to date. Furthermore, we included all the data on current smoking, former smoking, and ever smoking in the analysis without date and language restrictions, which means that the study provides the latest evidence and the most comprehensive information on the association between cigarette smoking and risk of PCa. We assessed the quality of each selected study using the Newcastle‒Ottawa Scale for cohort studies, and the median score was 7 and the lowest score was 6, suggesting that the quality of the included studies can be guaranteed. Other strengths include applying independent literature search, quality assessment, and data extraction by two investigators; conducting several sensitivity analyses; and using Egger’s test to examine publication bias.There are some limitations of our study. Most of the information on smoking habits was obtained from self-administered questionnaires, and the definitions of current smokers and former smokers were not completely the same between different studies. Some participants may have changed their smoking habits after baseline investigations, but repeated assessment of smoking exposure was absent in primary studies. We calculated RRs and the corresponding 95% CIs using frequency distributions without adjusting confounding factors when risk estimates were not reported. We focused on the impact of cigarette smoking on the risk of PCa; second-hand cigarette smoke and the use of other tobacco products (cigars, smokeless tobacco, e-cigarettes, pipes, etc.) that have showed increased risk of many cancers in numerous studies [ 98 , 99 ] were not discussed. Alcohol consumption showed a significant dose–response relationship with PCa risk in several studies [ 3 , 100 ], and were often used concurrently with cigarette smoking [ 101 ], but we didn’t analyze the effect of concurrent use of cigarette and alcohol on risk of PCa due to lack of information on alcohol consumption in the included studies. High heterogeneity was showed by the I 2 statistics and the Cochran’s Q test, and the difference in adjusted confounding factors may be one of the reasons. We have included multivariate results as much as possible to reduce the bias, and there was no indication of publication bias. Dividing studies into pre-PSA screening era and PSA screening era based on publication year (1995 as the cut-off) may produce bias because many of the cohorts published and categorized into the PSA screening era extended into the pre-PSA screening era. Another limitation is that we failed to calculate the impact of quantitative cigarette consumption on the PCa risk due to a lack of data. However, we have to point out that the meta-regression conducted by Islami et al. [ 12 ] was methodologically wrong as including multiple data points from a single study with the same control group counts the effect of that control group multiple times (i.e., unit-of-analysis error).

To the best of our knowledge, this systematic review and meta-analysis contained the largest sample of prospective cohort studies, the latest evidence and the most comprehensive information on the association between cigarette smoking habits and the risk of PCa. The smokers’ poor adherence to cancer screening and the occurrence of smoking-related aggressive cancers as well as cardiovascular, pulmonary, and several other deadly diseases may explain the negative association. Regional distinctions can be attributed to the difference of participants in age, ethnicity, socioeconomic status, and educational levels. In addition, a correct methodology is important, the choice of different effect models should base on the heterogeneity and characteristics of enrolled studies. However, it is difficult to conclude a positive association between cigarette smoking and PCa risk as we hypothesized due to these affecting factors. We should focus on taking measures to help smokers to be more compliant with early cancer screening and to quit smoking.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files.

Abbreviations

  • Prostate cancer

Prostate-specific antigen

Polycyclic aromatic hydrocarbons

Glutathione-S-transferases

Heme oxygenase 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses

Cigarettes per day

Confidence interval

Relative risk

Not reported

United States

Blood pressure

Body mass index

United Kingdom

Diabetes mellitus

Non-significant

Chronic obstructive pulmonary disease

Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–49.

Article   PubMed   Google Scholar  

Bostwick DG, Burke HB, Djakiew D, Euling S, Ho SM, Landolph J, et al. Human prostate cancer risk factors. Cancer. 2004;101(10 Suppl):2371–490.

Article   CAS   PubMed   Google Scholar  

Dickerman BA, Markt SC, Koskenvuo M, Pukkala E, Mucci LA, Kaprio J. Alcohol intake, drinking patterns, and prostate cancer risk and mortality: a 30-year prospective cohort study of Finnish twins. Cancer Causes Control. 2016;27(9):1049–58.

Article   PubMed   PubMed Central   Google Scholar  

Rowles JL 3rd, Ranard KM, Applegate CC, Jeon S, An R, Erdman JW Jr. Processed and raw tomato consumption and risk of prostate cancer: a systematic review and dose-response meta-analysis. Prostate Cancer Prostatic Dis. 2018;21(3):319–36.

Chen X, Zhao Y, Tao Z, Wang K. Coffee consumption and risk of prostate cancer: a systematic review and meta-analysis. BMJ Open. 2021;11(2): e038902.

Perez-Cornago A, Key TJ, Allen NE, Fensom GK, Bradbury KE, Martin RM, et al. Prospective investigation of risk factors for prostate cancer in the UK Biobank cohort study. Br J Cancer. 2017;117(10):1562–71.

Karam-Hage M, Cinciripini PM, Gritz ER. Tobacco use and cessation for cancer survivors: an overview for clinicians. CA Cancer J Clin. 2014;64(4):272–90.

U.S. Department of Health and Human Services. Smoking cessation. A report of the surgeon general. Atlanta: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Office on Smoking and Health; 2020. (Accessed 30 Sep 2022).

De Nunzio C, Andriole GL, Thompson IM Jr, Freedland SJ. Smoking and prostate cancer: a systematic review. Eur Urol Focus. 2015;1(1):28–38.

Brookman-May SD, Campi R, Henríquez JDS, Klatte T, Langenhuijsen JF, Brausi M, et al. Latest evidence on the impact of smoking, sports, and sexual activity as modifiable lifestyle risk factors for prostate cancer incidence, recurrence, and progression: a systematic review of the literature by the European Association of Urology Section of Oncological Urology (ESOU). Eur Urol Focus. 2019;5(5):756–87.

Huncharek M, Haddock KS, Reid R, Kupelnick B. Smoking as a risk factor for prostate cancer: a meta-analysis of 24 prospective cohort studies. Am J Public Health. 2010;100(4):693–701.

Islami F, Moreira DM, Boffetta P, Freedland SJ. A systematic review and meta-analysis of tobacco use and prostate cancer mortality and incidence in prospective cohort studies. Eur Urol. 2014;66(6):1054–64.

Jochems SHJ, Fritz J, Häggström C, Järvholm B, Stattin P, Stocks T. Smoking and risk of prostate cancer and prostate cancer death: a pooled study. Eur Urol. 2022;3:S0302-2838(22)01804-8.

Centers for Disease Control and Prevention (US); National Center for Chronic Disease Prevention and Health Promotion (US); Office on Smoking and Health (US). How tobacco smoke causes disease: the biology and behavioral basis for smoking-attributable disease: a report of the surgeon general. Atlanta: Centers for Disease Control and Prevention (US); 2010.

Nock NL, Tang D, Rundle A, Neslund-Dudas C, Savera AT, Bock CH, et al. Associations between smoking, polymorphisms in polycyclic aromatic hydrocarbon (PAH) metabolism and conjugation genes and PAH-DNA adducts in prostate tumors differ by race. Cancer Epidemiol Biomarkers Prev. 2007;16(6):1236–45.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Mao GE, Morris G, Lu QY, Cao W, Reuter VE, Cordon-Cardo C, et al. Glutathione S-transferase P1 Ile105Val polymorphism, cigarette smoking and prostate cancer. Cancer Detect Prev. 2004;28(5):368–74.

Ye J, Wang S, Barger M, Castranova V, Shi X. Activation of androgen response element by cadmium: a potential mechanism for a carcinogenic effect of cadmium in the prostate. J Environ Pathol Toxicol Oncol. 2000;19(3):275–80.

CAS   PubMed   Google Scholar  

Watts EL, Appleby PN, Perez-Cornago A, Bueno-de-Mesquita HB, Chan JM, Chen C, et al. Low free testosterone and prostate cancer risk: a collaborative analysis of 20 prospective studies. Eur Urol. 2018;74(5):585–94.

Shiels MS, Rohrmann S, Menke A, Selvin E, Crespo CJ, Rifai N, et al. Association of cigarette smoking, alcohol consumption, and physical activity with sex steroid hormone levels in US men. Cancer Causes Control. 2009;20(6):877–86.

Quiñones LA, Irarrázabal CE, Rojas CR, Orellana CE, Acevedo C, Huidobro C, et al. Joint effect among p53, CYP1A1, GSTM1 polymorphism combinations and smoking on prostate cancer risk: an exploratory genotype-environment interaction study. Asian J Androl. 2006;8(3):349–55.

Birrane G, Li H, Yang S, Tachado SD, Seng S. Cigarette smoke induces nuclear translocation of heme oxygenase 1 (HO-1) in prostate cancer cells: nuclear HO-1 promotes vascular endothelial growth factor secretion. Int J Oncol. 2013;42(6):1919–28.

Miyake M, Fujimoto K, Anai S, Ohnishi S, Kuwada M, Nakai Y, et al. Heme oxygenase-1 promotes angiogenesis in urothelial carcinoma of the urinary bladder. Oncol Rep. 2011;25(3):653–60.

De Nunzio C, Kramer G, Marberger M, Montironi R, Nelson W, et al. The controversial relationship between benign prostatic hyperplasia and prostate cancer: the role of inflammation. Eur Urol. 2011;60(1):106–17.

Arnson Y, Shoenfeld Y, Amital H. Effects of tobacco smoke on immunity, inflammation and autoimmunity. J Autoimmun. 2010;34(3):J258–65.

Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021;372: n71.

Lo CK, Mertz D, Loeb M. Newcastle-Ottawa Scale: comparing reviewers’ to authors’ assessments. BMC Med Res Methodol. 2014;14:45.

Davies HT, Crombie IK, Tavakoli M. When can odds ratios mislead? BMJ. 1998;316(7136):989–91.

Janez S, Maucort-Boulch D. Odds ratio, hazard ratio and relative risk. Adv Methodol Stat. 2016;13(1):59–67.

Google Scholar  

Shor E, Roelfs D, Vang ZM. The “Hispanic mortality paradox” revisited: Meta-analysis and meta-regression of life-course differentials in Latin American and Caribbean immigrants’ mortality. Soc Sci Med. 2017;186:20–33.

DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7(3):177–88.

Higgins JP, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–60.

Thompson MM, Garland C, Barrett-Connor E, Khaw KT, Friedlander NJ, Wingard DL. Heart disease risk factors, diabetes, and prostatic cancer in an adult community. Am J Epidemiol. 1989;129(3):511–7.

Severson RK, Nomura AM, Grove JS, Stemmermann GN. A prospective study of demographics, diet, and prostate cancer among men of Japanese ancestry in Hawaii. Cancer Res. 1989;49(7):1857–60.

Mills PK, Beeson WL, Phillips RL, Fraser GE. Cohort study of diet, lifestyle, and prostate cancer in Adventist men. Cancer. 1989;64(3):598–604.

Le Marchand L, Kolonel LN, Wilkens LR, Myers BC, Hirohata T. Animal fat consumption and prostate cancer: a prospective study in Hawaii. Epidemiology. 1994;5(3):276–82.

Hiatt RA, Armstrong MA, Klatsky AL, Sidney S. Alcohol consumption, smoking, and other risk factors and prostate cancer in a large health plan cohort in California (United States). Cancer Causes Control. 1994;5(1):66–72.

Adami HO, Bergström R, Engholm G, Nyrén O, Wolk A, Ekbom A, et al. A prospective study of smoking and risk of prostate cancer. Int J Cancer. 1996;67(6):764–8.

Engeland A, Andersen A, Haldorsen T, Tretli S. Smoking habits and risk of cancers other than lung cancer: 28 years’ follow-up of 26,000 Norwegian men and women. Cancer Causes Control. 1996;7(5):497–506.

Veierød MB, Laake P, Thelle DS. Dietary fat intake and risk of prostate cancer: a prospective study of 25,708 Norwegian men. Int J Cancer. 1997;73(5):634–8.

Cerhan JR, Torner JC, Lynch CF, Rubenstein LM, Lemke JH, Cohen MB, et al. Association of smoking, body mass, and physical activity with risk of prostate cancer in the Iowa 65+ Rural Health Study (United States). Cancer Causes Control. 1997;8(2):229–38.

Will JC, Vinicor F, Calle EE. Is diabetes mellitus associated with prostate cancer incidence and survival? Epidemiology. 1999;10(3):313–8.

Giovannucci E, Rimm EB, Ascherio A, Colditz GA, Spiegelman D, Stampfer HJ, et al. Smoking and risk of total and fatal prostate cancer in United States health professionals. Cancer Epidemiol Biomarkers Prev. 1999;8(4):277–82.

Putnam SD, Cerhan JR, Parker AS, Bianchi GD, Wallace RB, Cantor KP, et al. Lifestyle and anthropometric risk factors for prostate cancer in a cohort of Iowa men. Ann Epidemiol. 2000;10(6):361–9.

Lund Nilsen TL, Johnsen R, Vatten LJ. Socio-economic and lifestyle factors associated with the risk of prostate cancer. Br J Cancer. 2000;82(7):1358–63.

Lotufo PA, Lee IM, Ajani UA, Hennekens CH, Manson JE. Cigarette smoking and risk of prostate cancer in the physicians’ health study (United States). Int J Cancer. 2000;87(1):141–4.

Baglietto L, Severi G, English DR, Hopper JL, Giles GG. Alcohol consumption and prostate cancer risk: results from the Melbourne collaborative cohort study. Int J Cancer. 2006;119(6):1501–4.

Gonzalez A, Peters U, Lampe JW, White E. Boron intake and prostate cancer risk. Cancer Causes Control. 2007;18(10):1131–40.

Park SY, Murphy SP, Wilkens LR, Henderson BE, Kolonel LN. Fat and meat intake and prostate cancer risk: the multiethnic cohort study. Int J Cancer. 2007;121(6):1339–45.

Rohrmann S, Genkinger JM, Burke A, Helzlsouer KJ, Comstock GW, Alberg AJ, et al. Smoking and risk of fatal prostate cancer in a prospective U.S. study. Urology. 2007;69(4):721–5.

Butler LM, Wang R, Wong AS, Koh WP, Yu MC. Cigarette smoking and risk of prostate cancer among Singapore Chinese. Cancer Causes Control. 2009;20(10):1967–74.

Watters JL, Park Y, Hollenbeck A, Schatzkin A, Albanes D. Cigarette smoking and prostate cancer in a prospective US cohort study. Cancer Epidemiol Biomarkers Prev. 2009;18(9):2427–35.

Grundmark B, Zethelius B, Garmo H, Holmberg L. Serum levels of selenium and smoking habits at age 50 influence long term prostate cancer risk; a 34 year ULSAM follow-up. BMC Cancer. 2011;11:431.

Li Q, Kuriyama S, Kakizaki M, Yan H, Nagai M, Sugawara Y, et al. History of cholelithiasis and the risk of prostate cancer: the Ohsaki Cohort Study. Int J Cancer. 2011;128(1):185–91.

Geybels MS, Verhage BA, van Schooten FJ, van den Brandt PA. Measures of combined antioxidant and pro-oxidant exposures and risk of overall and advanced stage prostate cancer. Ann Epidemiol. 2012;22(11):814–20.

Karppi J, Kurl S, Laukkanen JA, Kauhanen J. Serum β-carotene in relation to risk of prostate cancer: the Kuopio Ischaemic Heart Disease Risk Factor study. Nutr Cancer. 2012;64(3):361–7.

Shafique K, McLoone P, Qureshi K, Leung H, Hart C, Morrison DS. Coffee consumption and prostate cancer risk: further evidence for inverse relationship. Nutr J. 2012;11:42.

Bae JM, Li ZM, Shin MH, Kim DH, Lee MS, Ahn YO. Cigarette smoking and prostate cancer risk: negative results of the Seoul Male Cancer Cohort Study. Asian Pac J Cancer Prev. 2013;14(8):4667–9.

Lemogne C, Consoli SM, Geoffroy-Perez B, Coeuret-Pellicer M, Nabi H, Melchior M, et al. Personality and the risk of cancer: a 16-year follow-up study of the GAZEL cohort. Psychosom Med. 2013;75(3):262–71.

Rohrmann S, Linseisen J, Allen N, Bueno-de-Mesquita HB, Johnsen NF, Tjønneland A, et al. Smoking and the risk of prostate cancer in the European Prospective Investigation into Cancer and Nutrition. Br J Cancer. 2013;108(3):708–14.

Sawada N, Inoue M, Iwasaki M, Sasazuki S, Yamaji T, Shimazu T, et al. Alcohol and smoking and subsequent risk of prostate cancer in Japanese men: the Japan Public Health Center-based prospective study. Int J Cancer. 2014;134(4):971–8.

Everatt R, Kuzmickienė I, Virvičiūtė D, Tamošiūnas A. Cigarette smoking, educational level and total and site-specific cancer: a cohort study in men in Lithuania. Eur J Cancer Prev. 2014;23(6):579–86.

Ho T, Howard L, Vidal A, Gerber L, Moreira D, McKeever M, et al. Smoking and risk of low- and high-grade prostate cancer: results from the REDUCE study. Clin Cancer Res. 2014;20(20):5331–8.

Jacob L, Freyn M, Kalder M, Dinas K, Kostev K. Impact of tobacco smoking on the risk of developing 25 different cancers in the UK: a retrospective study of 422,010 patients followed for up to 30 years. Oncotarget. 2018;9(25):17420–9.

Viner B, Barberio AM, Haig TR, Friedenreich CM, Brenner DR. The individual and combined effects of alcohol consumption and cigarette smoking on site-specific cancer risk in a prospective cohort of 26,607 adults: results from Alberta’s Tomorrow Project. Cancer Causes Control. 2019;30(12):1313–26.

Weber MF, Sarich PEA, Vaneckova P, Wade S, Egger S, Ngo P, et al. Cancer incidence and cancer death in relation to tobacco smoking in a population-based Australian cohort study. Int J Cancer. 2021;149(5):1076–88.

Hippisley-Cox J, Coupland C. Predicting the risk of prostate cancer in asymptomatic men: a cohort study to develop and validate a novel algorithm. Br J Gen Pract. 2021;71(706):e364–71.

Allen NE, Sauvaget C, Roddam AW, Appleby P, Nagano J, Suzuki G, et al. A prospective study of diet and prostate cancer in Japanese men. Cancer Causes Control. 2004;15(9):911–20.

Onitilo AA, Berg RL, Engel JM, Stankowski RV, Glurich I, Williams GM, et al. Prostate cancer risk in pre-diabetic men: a matched cohort study. Clin Med Res. 2013;11(4):201–9.

Thune I, Lund E. Physical activity and the risk of prostate and testicular cancer: a cohort study of 53,000 Norwegian men. Cancer Causes Control. 1994;5(6):549–56.

Tulinius H, Sigfússon N, Sigvaldason H, Bjarnadóttir K, Tryggvadóttir L. Risk factors for malignant diseases: a cohort study on a population of 22,946 Icelanders. Cancer Epidemiol Biomarkers Prev. 1997;6(11):863–73.

Chamie K, DeVere White RW, Lee D, Ok JH, Ellison LM. Agent Orange exposure, Vietnam War veterans, and the risk of prostate cancer. Cancer. 2008;113(9):2464–70.

Laukkanen JA, Pukkala E, Rauramaa R, Makikallio TH, Toriola AT, Kurl S. Cardiorespiratory fitness, lifestyle factors and cancer risk and mortality in Finnish men. Eur J Cancer. 2010;46(2):355–63.

Karlsen RV, Bidstrup PE, Christensen J, Larsen SB, Tjønneland A, Dalton SO, et al. Men with cancer change their health behaviour: a prospective study from the Danish diet, cancer and health study. Br J Cancer. 2012;107(1):201–6.

Rolison JJ, Hanoch Y, Miron-Shatz T. Smokers: at risk for prostate cancer but unlikely to screen. Addict Behav. 2012;37(6):736–8.

Littlejohns TJ, Travis RC, Key TJ, Allen NE. Lifestyle factors and prostate-specific antigen (PSA) testing in UK Biobank: Implications for epidemiological research. Cancer Epidemiol. 2016;45:40–6.

Tangen C, Goodman P, Till C, Schenk J, Lucia M, Thompson I. Biases in recommendations for and acceptance of prostate biopsy significantly affect assessment of prostate cancer risk factors: results from two large randomized clinical trials. J Clin Oncol. 2016;34(36):4338–44.

Johnson JA, Moser RP, Ellison GL, Martin DN. Associations of prostate-specific antigen (PSA) testing in the US population: results from a national cross-sectional survey. J Community Health. 2021;46(2):389–98.

Moses KA, Zhao Z, Bi Y, Acquaye J, Holmes A, Blot WJ, et al. The impact of sociodemographic factors and PSA screening among low-income Black and White men: data from the Southern Community Cohort Study. Prostate Cancer Prostatic Dis. 2017;20(4):424–9.

Pickles K, Scherer LD, Cvejic E, Hersch J, Barratt A, McCaffery KJ. Preferences for more or less health care and association with health literacy of men eligible for prostate-specific antigen screening in Australia. JAMA Netw Open. 2021;4(10): e2128380.

de Nunzio C, Tema G, Trucchi A, Cicione A, Sica A, Lombardo R, et al. Smoking reduces PSA accuracy for detection of prostate cancer: Results from an Italian cross-sectional study. Minerva Urol Nefrol. 2019;71(6):583–9.

GBD 2019 Tobacco Collaborators. Spatial, temporal, and demographic patterns in prevalence of smoking tobacco use and attributable disease burden in 204 countries and territories, 1990–2019: a systematic analysis from the Global Burden of Disease Study 2019. Lancet. 2021;397(10292):2337-2360.

Sturgis EM, Wei Q, Spitz MR. Descriptive epidemiology and risk factors for head and neck cancer. Semin Oncol. 2004;31(6):726–33.

Burger M, Catto JW, Dalbagni G, Grossman HB, Herr H, Karakiewicz P, et al. Epidemiology and risk factors of urothelial bladder cancer. Eur Urol. 2013;63(2):234–41.

Cook MB, Kamangar F, Whiteman DC, Freedman ND, Gammon MD, Bernstein L, et al. Cigarette smoking and adenocarcinomas of the esophagus and esophagogastric junction: a pooled analysis from the international BEACON consortium. J Natl Cancer Inst. 2010;102(17):1344–53.

Duffy SA, Ronis DL, McLean S, Fowler KE, Gruber SB, Wolf GT, et al. Pretreatment health behaviors predict survival among patients with head and neck squamous cell carcinoma. J Clin Oncol. 2009;27(12):1969–75.

Centers for Disease Control and Prevention (CDC). Smoking-attributable mortality, years of potential life lost, and productivity losses--United States, 2000-2004. MMWR Morb Mortal Wkly Rep. 2008;57(45):1226–8.

Foerster B, Pozo C, Abufaraj M, Mari A, Kimura S, D’Andrea D, et al. Association of smoking status with recurrence, metastasis, and mortality among patients with localized prostate cancer undergoing prostatectomy or radiotherapy: a systematic review and meta-analysis. JAMA Oncol. 2018;4(7):953–61.

Darcey E, Boyle T. Tobacco smoking and survival after a prostate cancer diagnosis: a systematic review and meta-analysis. Cancer Treat Rev. 2018;70:30–40.

Prueitt RL, Wallace TA, Glynn SA, Yi M, Tang W, Luo J, et al. An immune-inflammation gene expression signature in prostate tumors of smokers. Cancer Res. 2016;76(5):1055–65.

Enokida H, Shiina H, Urakami S, Terashima M, Ogishima T, Li LC, et al. Smoking influences aberrant CpG hypermethylation of multiple genes in human prostate carcinoma. Cancer. 2006;106(1):79–86.

Creamer MR, Wang TW, Babb S, Cullen KA, Day H, Willis G, et al. Tobacco product use and cessation indicators among adults - United States, 2018. MMWR Morb Mortal Wkly Rep. 2019;68(45):1013–9.

Rigotti NA, Kruse GR, Livingstone-Banks J, Hartmann-Boyce J. Treatment of tobacco smoking: a review. JAMA. 2022;327(6):566–77.

Warren GW, Marshall JR, Cummings KM, Toll BA, Gritz ER, Hutson A, et al. Addressing tobacco use in patients with cancer: a survey of American Society of Clinical Oncology members. J Oncol Pract. 2013;9(5):258–62.

Li WHC, Ho KY, Wang MP, Cheung DYT, Lam KKW, Xia W, et al. Effectiveness of a brief self-determination theory-based smoking cessation intervention for smokers at Emergency Departments in Hong Kong: a randomized clinical trial. JAMA Intern Med. 2020;180(2):206–14.

Stead LF, Hartmann-Boyce J, Perera R, Lancaster T. Telephone counselling for smoking cessation. Cochrane Database Syst Rev. 2013;(8):CD002850.

Lancaster T, Stead LF. Self-help interventions for smoking cessation. Cochrane Database Syst Rev. 2005;(3):CD001118.

Civljak M, Sheikh A, Stead LF, Car J. Internet-based interventions for smoking cessation. Cochrane Database Syst Rev. 2010;(9):CD007078.

Kim AS, Ko HJ, Kwon JH, Lee JM. Exposure to secondhand smoke and risk of cancer in never smokers: a meta-analysis of epidemiologic studies. Int J Environ Res Public Health. 2018;15(9):1981.

Andreotti G, Freedman ND, Silverman DT, Lerro CC, Koutros S, Hartge P, et al. Tobacco use and cancer risk in the agricultural health study. Cancer Epidemiol Biomarkers Prev. 2017;26(5):769–78.

Zhao J, Stockwell T, Roemer A, Chikritzhs T. Is alcohol consumption a risk factor for prostate cancer? A systematic review and meta-analysis. BMC Cancer. 2016;16(1):845.

Sharma R, Lodhi S, Sahota P, Thakkar MM. Nicotine administration in the wake-promoting basal forebrain attenuates sleep-promoting effects of alcohol. J Neurochem. 2015;135(2):323–31.

Download references

Acknowledgements

Not applicable.

This work was supported by the Sanming Project of Medicine in Shenzhen (grant number: SZSM202011011). The funder had no role in the study design, data collection, analysis and interpretation, or writing of the report.

Author information

Xiangwei Yang and Hong Chen contributed equally to this work.

Authors and Affiliations

Department of Urology, Kidney and Urology Center, Pelvic Floor Disorders Center, The Seventh Affiliated Hospital, Sun Yat-Sen University, No.628 Zhenyuan Road, Shenzhen, 518107, China

Xiangwei Yang, Shiqiang Zhang, Xianju Chen, Yiyu Sheng & Jun Pang

School of Nursing, LKS Faculty of Medicine, University of Hong Kong, Hong Kong, China

You can also search for this author in PubMed   Google Scholar

Contributions

XWY, HC and JP conceptualized the study and developed the registered protocol for the review. XWY and HC conducted the literature search, quality assessment, data extraction, and drafted the manuscript. SQZ, XJC and YYS performed statistical analyses. JP revised the manuscript, obtained funding and supervised the project. JP is responsible for the overall content and serves as the guarantor. All authors helped refine the final version of the manuscript and approve with its submission.

Corresponding author

Correspondence to Jun Pang .

Ethics declarations

Ethics approval and consent to participate, consent for publication, competing interests.

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1..

A. Characteristics of the 39 studies included in the meta-analysis. B. Characteristics of the 5 studies not included in the meta-analysis due to lack of information.

Additional file 2.

 Results of quality assessment using the Newcastle-Ottawa Scale for cohort studies.

Additional file 3.

 Sensitivity analyses of association between smoking status and risk of prostate cancer.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article.

Yang, X., Chen, H., Zhang, S. et al. Association of cigarette smoking habits with the risk of prostate cancer: a systematic review and meta-analysis. BMC Public Health 23 , 1150 (2023). https://doi.org/10.1186/s12889-023-16085-w

Download citation

Received : 21 November 2022

Accepted : 09 June 2023

Published : 15 June 2023

DOI : https://doi.org/10.1186/s12889-023-16085-w

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Cigarette smoking
  • Systematic review
  • Meta-analysis

BMC Public Health

ISSN: 1471-2458

literature review of error analysis

IMAGES

  1. ERROR ANALYSIS

    literature review of error analysis

  2. (PDF) Error Analysis: Understanding the origins of errors in FL learning

    literature review of error analysis

  3. An Introduction to Error Analysis The Study of Uncertainties in

    literature review of error analysis

  4. (PDF) A Review Study of Error Analysis Theory

    literature review of error analysis

  5. Error analysis

    literature review of error analysis

  6. (PDF) A STUDY OF ERROR ANALYSIS IN WRITTEN PRODUCTION: A CASE STUDY OF

    literature review of error analysis

VIDEO

  1. Error Analysis 2nd Class

  2. ERROR DETECTION AND CORRECTIONS: NUMERICAL ANALYSIS

  3. Part 18: Errors

  4. Types of errors, inherent error, round off error, truncation error in error analysis #erroranalysis

  5. Adjective Common Error (Phase-2) Lec _23 |#TGT_PGT_English & 6to8

  6. Error Analysis

COMMENTS

  1. (PDF) A Review Study of Error Analysis Theory

    This review reveals that despite the criticism that this theory has received, it still plays a fundamental role in investigating, identifying and describing second language learners' errors and ...

  2. Error analysis of measurement uncertainty: a snapshot literature review

    To analyze and statistically compare common errors in the evaluation of measurement uncertainty in medicine and health field, using literature research and comparison with national standards, in order to understand the current status of measurement uncertainty evaluation in the medicine and health field. Using Chinese National Knowledge Infrastructure (CNKI) as the sample population ...

  3. A Review Study of Error Analysis Theory

    This review reveals that despite the criticism that this theory has received, it still plays a fundamental role in investigating, identifying and describing second language learners' errors and their causes, and can enable second language teachers to find out different sources of second language errors and take some pedagogical precautions towards them. Up until the late sixties, the prominent ...

  4. How to Write a Literature Review

    Examples of literature reviews. Step 1 - Search for relevant literature. Step 2 - Evaluate and select sources. Step 3 - Identify themes, debates, and gaps. Step 4 - Outline your literature review's structure. Step 5 - Write your literature review.

  5. A practical guide to data analysis in general literature reviews

    This article is a practical guide to conducting data analysis in general literature reviews. The general literature review is a synthesis and analysis of published research on a relevant clinical issue, and is a common format for academic theses at the bachelor's and master's levels in nursing, physiotherapy, occupational therapy, public health and other related fields.

  6. Failures, Errors, and Mistakes: A Systematic Review of the Literature

    Abstract. Terms such as failure, mistakes, errors, obstacles, and struggle are used interchangeably, but each carries different connotations and discipline-specific meanings. Reactions to experiencing a failure can range as well from being seen as having educative value to be debilitating. These reactions are based on criteria like environment ...

  7. PDF Error Analysis: Sources of L2 Learners Errors

    Consequently, the review of the literature that follows addresses itself to the sources of errors rather than the most frequent EFL learners' errors reported in various studies per se. At first, for this review of the literature, it is necessary

  8. Teaching and learning mathematics through error analysis

    Correctly worked examples consist of a problem statement with the steps taken to reach a solution along with the final result and are an effective method for the initial acquisitions of procedural skills and knowledge [1, 11, 26].Cognitive load theory [1, 11, 25] explains the challenge of stimulating the cognitive process without overloading the student with too much essential and extraneous ...

  9. An error-analysis study from an EFL writing context: Human and

    Evaluating written texts is believed to be a time-consuming process that can lack consistency and objectivity. Automated essay scoring (AES) can provide solutions to some of the limitations of human scoring. This research aimed to evaluate the performance of one AES system, Grammarly, in comparison to human raters. Both approaches' performances were analyzed quantitatively using Corder's ...

  10. REVIEW OF LEARNERS' ERRORS: AN ERROR ANALYSIS PERSPECTIVE

    The final purpose of doing EA is to evaluate and treat errors (Yaghi & Abdullah, 2015). They further explained that the classification of errors had been developed over time and one of the current ...

  11. PDF Chapter Two: Literature Review

    which is relevant to this study. 2.1.1 Definition. 'Errors' is an important key word in this study, which tends to be used. interchangeably with 'slips' and 'mistakes'. It is crucial to define 'errors', at the very. beginning, and distinguish 'errors' from 'slips' and 'mistakes'. "Errors are the flawed.

  12. PDF A REVIEW ON ERROR ANALYSIS STAGES IN TEACHING ENGLISH STRUCTURE

    REVIEW OF LITERATURE The Description of Errors During the learning process, learners produce errors as a characteristic of systematically and consistently at a certain point of learning, a deviant language system Errors are sections of a conversation or a work that stray from a pre-determined standard of adult language performance.

  13. (PDF) The Impact of Error Analysis and Feedback in ...

    English Literature and Language Review. ISSN(e): 2412-1703, ISSN(p): 2413-8827 ... feedback and the analysis o f errors by language teac hers are two areas that have also been criticized by some ...

  14. Eight problems with literature reviews and how to fix them

    Main. The aims of literature reviews range from providing a primer for the uninitiated to summarizing the evidence for decision making 1. Traditional approaches to literature reviews are ...

  15. Nurses' experiences in voluntary error reporting: An integrative

    2.4. Data analysis. Thematic analysis was chosen for an integrative review due to the heterogeneity of the included papers where meta-analysis would not be appropriate [26,27].Data analysis consists of four stages: data reduction, data display, comparison, conclusion drawing, and verification [26,33].Firstly, the data of each study were extracted and succinctly presented into a comprehensive ...

  16. A systematic literature review of LASA error interventions

    A systematic literature review 8 was subsequently published in 2018 on strategies to avoid look-alike labelling errors. The authors of this review 8 looked only at "original studies" published in English involving healthcare professionals or "consumers", with no restrictions on study design. Although this review and our own were carried ...

  17. PDF Error analysis of measurement uncertainty: a snapshot literature review

    Accreditation and Quality Assurance (2023) 28:245-249 247 1 3 present, was 0, 2, 9, 15, and 4, respectively. The articles were from 15 provinces and municipalities, including food

  18. Introduction to systematic review and meta-analysis

    A systematic review collects all possible studies related to a given topic and design, and reviews and analyzes their results [ 1 ]. During the systematic review process, the quality of studies is evaluated, and a statistical meta-analysis of the study results is conducted on the basis of their quality. A meta-analysis is a valid, objective ...

  19. Human reliability in railway engineering: Literature review and

    Most of the papers discovered in this literature review deal with the analysis of significant railway accidents of the recent past involving human errors by different perspectives. A recurrent topic in the accident analysis is the investigation of accident at rail level crossing.

  20. An error analysis of journal papers written by Persian authors

    This is due to the fact that Persian and English defer in terms of prepositions. For example, the word "tired" requires "of" in English and "from" in Persian. The errors in prepositions are also among the 18 type of errors extracted by Darus and Ching ( 2009) who studied the errors in the essays of 70 Chinese students.

  21. Approaches for Sampling and Sample Preparation for Microplastic ...

    The aim of this review is to outline the key steps of MPs analysis in laundry effluents and domestic washing, focusing on those steps that are underestimated in the current literature: sampling and sample preparation for analysis. Precisely these analytical steps, which can become the main source of analytical measurement system errors, ensure ...

  22. Error analysis of measurement uncertainty: a snapshot literature review

    Objective In China where social work is a fledgling profession, practice research is still a novelty. This article aims to provide an overview of the development of social work practice research ...

  23. Effects of human milk odor stimulation on feeding in premature infants

    Yangyang Qin and Shu Liu were responsible for the topic selection, literature search, draft of the paper and data analysis. Han Han participated in literature screening, literature quality ...

  24. FNAIT Systematic Literature Review and Meta-Analysis ...

    FNAIT Systematic Literature Review and Meta-Analysis Presented at the Academy of Managed Care Pharmacy 2024 Annual Meeting - Data Support HPA-1a Negative Frequency of More than 2% in Nearly ...

  25. Comparative efficacy and safety of PD-1/PD-L1 inhibitors in triple

    Triple-Negative Breast Cancer (TNBC) is a lethal subtype of breast cancer with limited treatment options. The purpose of this Network Meta-Analysis (NMA) is to compare the efficacy and safety of inhibitors of programmed cell death 1 (PD-1) and programmed cell death ligand 1 (PD-L1) in treating TNBC. Our search strategy was used in six databases: PubMed, Cochrane Library, Cumulative Index to ...

  26. A Systematic Literature Review of Human Error and Machine Error in

    2.1 Justification in Relation to Job Design. The primary purpose of this study is to perform a literature review of a variety of articles that focus on human and machine errors, using tools like Harzings's Publish or Perish or Purdue Databases to gather reports and then tools like VOSviewer, MAXQDA, and BibExcel to conduct analyses on the results.

  27. FNAIT Systematic Literature Review and Meta-Analysis ...

    NEW HAVEN, Conn.--(BUSINESS WIRE)-- Rallybio Corporation (Nasdaq: RLYB), a clinical-stage biotechnology company committed to identifying and accelerating the development of life-transforming therapies for patients with severe and rare diseases, today announced the presentation of results from a fetal and neonatal alloimmune thrombocytopenia (FNAIT) systematic literature review and meta ...

  28. Buffalo welfare: a literature review from 1992 to 2023 with a text

    This review aims to present the key research topics, the evolution and geographic distribution of literature on buffalo welfare, and to underscore potentiality and existing knowledge gaps. A comprehensive search on the Scopus bibliometric database was carried out using various welfare-related keywords such as: 'buffalo welfare', 'buffalo ...

  29. (PDF) The Significance of Error Analysis in Written ...

    The results exhibited that (1) article and preposition usages were more persistent grammatical errors; (2) run-ons and fragmentation were more reiterated syntactic errors; and (3) uses of single ...

  30. Association of cigarette smoking habits with the risk of prostate

    Background Association of cigarette smoking habits with the risk of prostate cancer is still a matter of debate. This systematic review and meta-analysis aimed to assess the association between cigarette smoking and prostate cancer risk. Methods We conducted a systematic search on PubMed, Embase, Cochrane Library, and Web of Science without language or time restrictions on June 11, 2022 ...