Last updated 10th July 2024: Online ordering is currently unavailable due to technical issues. We apologise for any delays responding to customers while we resolve this. For further updates please visit our website https://www.cambridge.org/news-and-insights/technical-incident

We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings .

Login Alert

equivalence hypothesis in psychology

  • > Journals
  • > Psychological Medicine
  • > Volume 48 Issue 11
  • > Equivalence and non-inferiority testing in psychotherapy...

equivalence hypothesis in psychology

Article contents

Equivalence and non-inferiority margins, statistical hypotheses in equivalence and non-inferiority testing, equivalence v. non-inferiority testing, assay sensitivity and constancy of study conditions, conclusions, equivalence and non-inferiority testing in psychotherapy research.

Published online by Cambridge University Press:  11 May 2018

With more than 100 non-inferiority or equivalence trials published per year in many areas of research (Piaggio et al. , Reference Piaggio 2012 ), statistical and methodological issues involved in these trials become increasingly important. A recent article by Rief and Hofmann ( Reference Rief and Hofmann 2018 ) suggests, however, that some of these issues are not sufficiently clear. For this reason, central issues will be discussed here and some misunderstandings will be addressed.

For defining a non-inferiority or equivalence margin (i.e. the minimum difference important enough to make treatments non-equivalent), no generally accepted standards exist. In 332 equivalence or non-inferiority medical trials, a median margin of 0.50 standard deviations was found (Lange and Freitag, Reference Lange and Freitag 2005 ), corresponding quite well to the value of 0.42 reported by Gladstone and Vach ( Reference Gladstone and Vach 2014 ). Only five studies used margins < 0.25 (Gladstone and Vach, Reference Gladstone and Vach 2014 ) and only 12% of studies margins ⩽0.25 (Lange and Freitag, Reference Lange and Freitag 2005 ).

In psychotherapy research, margins ranging from 0.24 to 0.60 have been proposed (e.g. Steinert et al. , Reference Steinert 2017 , p. 944). In a meta-analysis of psychodynamic therapy (PDT) including different mental disorders, Steinert et al. ( Reference Steinert 2017 ) chose a margin of g  = 0.25, which is among the smallest margins ever used in psychotherapy and medical research (Gladstone and Vach, Reference Gladstone and Vach 2014 , Figure 2, Steinert et al. , Reference Steinert 2017 , p. 944). This margin is very close to both (a) the threshold for a minimally important difference specifically suggested for depression (0.24, Cuijpers et al. , Reference Cuijpers 2014 ), and (b) the margin recommended by Gladstone and Vach ( Reference Gladstone and Vach 2014 ) to protect against degradation of treatment effects in non-inferiority trials ( d  = −0.23).

In their recent correspondence article, Rief and Hofmann ( Reference Rief and Hofmann 2018 ) make a quite different proposal, recommending margins not to fall below 90% of the uncontrolled effect size of the established treatment. This proposal, however, is associated with several problems described in more detail in Table 1 , particularly regarding the clinical significance of the suggested margin and its implications for sample size determination, rendering non-inferiority trials in psychotherapy research virtually impossible ( Table 1 ).

Table 1. Further methodological issues of equivalence and non-inferiority testing

a Paul Crits-Christoph, personal communication, 16 February 2018.

b Paul Crits-Christoph, personal communication, 26 February 2018.

In equivalence testing, the null and alternative hypotheses of superiority testing are reversed and the statistical alternative hypothesis is consistent with the assumption of equivalence (Lesaffre, Reference Lesaffre 2008 ; Walker and Nowacki, Reference Walker and Nowacki 2011 ). To test for equivalence, two one-sided tests are performed determining whether the upper and the lower boundary of the CI are included in the margin, whereas, for testing non-inferiority, one one-sided test inspecting the lower boundary is used (Lesaffre, Reference Lesaffre 2008 ; Walker and Nowacki, Reference Walker and Nowacki 2011 ). A statistically significant result implies here that the effect size and its CI are within the margin, demonstrating equivalence or non-inferiority (Walker and Nowacki, Reference Walker and Nowacki 2011 ). A recent meta-analysis testing equivalence of PDT to other approaches established in efficacy reported a significant result indicating that the effect sizes and their CIs were completely included in the margin (Steinert et al. , Reference Steinert 2017 ). Thus, the recently given interpretation by Rief and Hofmann ( Reference Rief and Hofmann 2018 , p. 2) that Steinert et al. ( Reference Steinert 2017 ) ‘… found a significant disadvantage of PDT [psychodynamic therapy] compared with other treatments (including CBT)’ is simply wrong (Lesaffre, Reference Lesaffre 2008 ; Walker and Nowacki, Reference Walker and Nowacki 2011 ).

Equivalence and non-inferiority testing need to be differentiated (Treadwell et al. , Reference Treadwell 2012 ). In non-inferiority testing, for example, the test treatment is expected to be superior to the standard treatment in measures not related to efficacy such as side effects or costs (Treadwell et al. , Reference Treadwell 2012 ). Rief and Hofmann did not make this differentiation. In fact, the meta-analysis by Steinert et al. ( Reference Steinert 2017 ), for example, was a test of equivalence, not of non-inferiority as suggested by Rief and Hofmann ( Reference Rief and Hofmann 2018 ).

Equivalence and non-inferiority testing require that the efficacy of the comparator is ensured and that the study conditions are comparable with in which the efficacy of the comparator was established (Treadwell et al. , Reference Treadwell 2012 ). In those context, Rief and Hofmann ( Reference Rief and Hofmann 2018 ) claim that specific issues of (low) study quality favour non-inferiority results, e.g. low response rates found in specific studies or low treatment integrity. Again, however, these claims are not supported by evidence ( Table 1 ). This applies to several further issues put forward by Rief and Hofmann ( Reference Rief and Hofmann 2018 ) which are briefly discussed in Table 1 , for example to the relationship between equivalence testing and the number of studies available for a specific treatment ( Table 1 ).

Equivalence and non-inferiority testing pose specific methodological problems (Piaggio et al. , Reference Piaggio 2012 ; Treadwell et al. , Reference Treadwell 2012 ), for example, in defining a margin, statistical testing, and ensuring the efficacy of the comparator or comparability of study conditions ( Table 1 ). Conclusions about equivalence and non-inferiority testing differing from Rief and Hofmann's ( Reference Rief and Hofmann 2018 ) are presented which are more consistent with the available evidence and usual standards across a range of scientific disciplines.

Crossref logo

This article has been cited by the following publications. This list is generated based on data provided by Crossref .

  • Google Scholar

View all Google Scholar citations for this article.

Save article to Kindle

To save this article to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle .

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Volume 48, Issue 11
  • Falk Leichsenring (a1) , Allan Abbass (a2) , Ellen Driessen (a3) , Mark Hilsenroth (a4) , Patrick Luyten (a5) (a6) , Sven Rabung (a7) and Christiane Steinert (a1) (a8)
  • DOI: https://doi.org/10.1017/S0033291718001289

Save article to Dropbox

To save this article to your Dropbox account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you used this feature, you will be asked to authorise Cambridge Core to connect with your Dropbox account. Find out more about saving content to Dropbox .

Save article to Google Drive

To save this article to your Google Drive account, please select one or more formats and confirm that you agree to abide by our usage policies. If this is the first time you used this feature, you will be asked to authorise Cambridge Core to connect with your Google Drive account. Find out more about saving content to Google Drive .

Reply to: Submit a response

- No HTML tags allowed - Web page URLs will display as text only - Lines and paragraphs break automatically - Attachments, images or tables are not permitted

Your details

Your email address will be used in order to notify you when your comment has been reviewed by the moderator and in case the author(s) of the article or the moderator need to contact you directly.

You have entered the maximum number of contributors

Conflicting interests.

Please list any fees and grants from, employment by, consultancy for, shared ownership in or any close relationship with, at any time over the preceding 36 months, any organisation whose interests may be affected by the publication of the response. Please also list any non-financial associations or interests (personal, professional, political, institutional, religious or other) that a reasonable reader would want to know about in relation to the submitted work. This pertains to all the authors of the piece, their spouses or partners.

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

Recommendations for applying tests of equivalence

Affiliation.

  • 1 Department of Psychology, York University, Toronto, ON, Canada. [email protected]
  • PMID: 14692005
  • DOI: 10.1002/jclp.10217

Researchers in psychology reliably select traditional null hypothesis significance tests (e.g., Student's t test), regardless of whether the research hypothesis relates to whether the group means are equivalent or whether the group means are different. Tests of equivalence, which have been popular in biopharmaceutical studies for years, have recently been introduced and recommended to researchers in psychology for demonstrating the equivalence of two group means. However, very few recommendations exist for applying tests of equivalence. A Monte Carlo study was used to compare the test of equivalence proposed by Schuirmann with the traditional Student t test for deciding if two group means are equivalent. It was found that Schuirmann's test of equivalence is more effective than Student's t test at detecting population mean equivalence with large sample sizes; however, Schuirmann's test of equivalence performs poorly relative to Student's t test with small sample sizes and/or inflated variances.

Copyright 2003 Wiley Periodicals, Inc. J Clin Psychol.

PubMed Disclaimer

Similar articles

  • A proposed framework for conducting data-based test analysis. Slaney KL, Maraun MD. Slaney KL, et al. Psychol Methods. 2008 Dec;13(4):376-90. doi: 10.1037/a0014269. Psychol Methods. 2008. PMID: 19072000
  • Bootstrap standard error and confidence intervals for the correlation corrected for range restriction: a simulation study. Chan W, Chan DW. Chan W, et al. Psychol Methods. 2004 Sep;9(3):369-85. doi: 10.1037/1082-989X.9.3.369. Psychol Methods. 2004. PMID: 15355154
  • Equivalence tests for comparing correlation and regression coefficients. Counsell A, Cribbie RA. Counsell A, et al. Br J Math Stat Psychol. 2015 May;68(2):292-309. doi: 10.1111/bmsp.12045. Epub 2014 Oct 27. Br J Math Stat Psychol. 2015. PMID: 25346486
  • The incremental validity of psychological testing and assessment: conceptual, methodological, and statistical issues. Hunsley J, Meyer GJ. Hunsley J, et al. Psychol Assess. 2003 Dec;15(4):446-55. doi: 10.1037/1040-3590.15.4.446. Psychol Assess. 2003. PMID: 14692841 Review.
  • The test validation summary. Frederick RI, Bowden SC. Frederick RI, et al. Assessment. 2009 Sep;16(3):215-36. doi: 10.1177/1073191108325005. Epub 2008 Oct 6. Assessment. 2009. PMID: 18838580 Review.
  • Negligible interaction test for continuous predictors. Jabbari Y, Cribbie R. Jabbari Y, et al. J Appl Stat. 2021 Feb 19;49(8):2001-2015. doi: 10.1080/02664763.2021.1887102. eCollection 2022. J Appl Stat. 2021. PMID: 35757595 Free PMC article.
  • Measurement Challenges in Research With Individuals With Cognitive Impairment. Kolanowski A, Gilmore-Bykovskyi A, Hill N, Massimo L, Mogle J. Kolanowski A, et al. Res Gerontol Nurs. 2019 Jan 1;12(1):7-15. doi: 10.3928/19404921-20181212-06. Res Gerontol Nurs. 2019. PMID: 30653646 Free PMC article.
  • A new way to look at the data: Similarities between groups of people are large and important. Hanel PHP, Maio GR, Manstead ASR. Hanel PHP, et al. J Pers Soc Psychol. 2019 Apr;116(4):541-562. doi: 10.1037/pspi0000154. Epub 2018 Dec 31. J Pers Soc Psychol. 2019. PMID: 30596430 Free PMC article.
  • Increasing Literacy in Quantitative Methods: The Key to the Future of Canadian Psychology. Counsell A, Cribbie RA, Harlow LL. Counsell A, et al. Can Psychol. 2016 Aug;57(3):193-201. doi: 10.1037/cap0000056. Can Psychol. 2016. PMID: 28042199 Free PMC article.
  • Exact Power and Sample Size Calculations for the Two One-Sided Tests of Equivalence. Shieh G. Shieh G. PLoS One. 2016 Sep 6;11(9):e0162093. doi: 10.1371/journal.pone.0162093. eCollection 2016. PLoS One. 2016. PMID: 27598468 Free PMC article.

Publication types

  • Search in MeSH

LinkOut - more resources

Full text sources.

full text provider logo

  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

  • Sport Psychology
  • Motor Imagery

Functional equivalence or behavioural matching? A critical reflection on 15 years of research using the PETTLEP model of motor imagery

  • February 2013
  • International Review of Sport and Exercise Psychology 6(1):105-121
  • 6(1):105-121

Caroline Wakefield at Liverpool Hope University

  • Liverpool Hope University

Dave Smith at Manchester Metropolitan University

  • Manchester Metropolitan University

Aidan Moran at University College Dublin

  • University College Dublin

Paul S. Holmes at Manchester Metropolitan University

Abstract and Figures

Schematic of common substrate.

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations

Mohsen Afrouzeh

  • Mohammad Sadegh

Rabiu MUAZU Musa

  • Baki Yılmaz
  • HUM MOVEMENT SCI

Robin Owen

  • Nicole M Fromm

Farida Gayle

  • Iryna Krivyca
  • Olena Nesen
  • Yevheniia Strelnykova
  • Krzysztof Wnorowski
  • J Imagery Res Sport Phys Activ

Zhi Li

  • Courtney N. Reed
  • Marcus Pearce
  • Andrew McPherson
  • John E Schneider
  • Maxwell Blodgett
  • Spenser Lang
  • Sally A. Santen

Riki Lindsay

  • Leigh Jones
  • Richard Mullen

Tadhg Macintyre

  • R.S. Vealey
  • C.A. Greenleaf

Daniel Gould

  • Nicole Damarjian

Christy Greenleaf

  • SPORT PSYCHOL

Dave Smith

  • Jeffrey M. Lohr
  • PSYCHOPHYSIOLOGY
  • Peter J. Lang
  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up

APS

Equivalence Testing With TOSTER

  • Experimental Psychology
  • Methodology

Any science that wants to be taken seriously needs to be able to provide support for the null hypothesis. I often see people switching over from frequentist statistics when effects are significant to the use of Bayes factors to be able to provide support for the null hypothesis. But it is possible to test if there is a lack of an effect using p values. (Why no one ever told me this in the 11 years I worked in science is beyond me). It’s as easy as doing a t test, or, more precisely, as doing two t tests.

I’ve created my first R package, TOSTER (as in Two One-Sided Tests for Equivalence in R). Don’t worry, there is also an old-fashioned spreadsheet available as well (see “TOSTER Materials,” below).

Sometimes you perform a study where you might expect the effect to be zero or very small. So how can we conclude an effect is “zero or very small”?

One approach is to specify effect sizes we consider “not small.” For example, we might decide that effects larger than d = 0.3 (or smaller than d = –0.3 in a two-sided  t test) are “not small.” Now, if we observe an effect that falls between the two equivalence bounds of d = –0.3 and d = 0.3, we can act (in the good old-fashioned Neyman–Pearson approach to statistical inferences) as if the effect is “zero or very small.” It might not be exactly zero, but it is small enough.

We can use two one-sided tests to statistically reject effects ≤ –0.3 and ≥ 0.3. This is the basic idea of the TOST (two one-sided tests) equivalence procedure.

The idea is simple, and it is conceptually similar to the traditional null-hypothesis test you probably already use to reject an effect of zero. But whereas all statistics programs will allow you to perform a normal  t test, it is not so simple to perform a TOST equivalence test.

Psychological science really needs a way to show effects are too small to matter (see Morey & Lakens, 2016). So I made a spreadsheet and R package to perform the TOST procedure. The free TOSTER package is available from the Comprehensive R Archive Network (CRAN), which means you can install it using install.packages(“TOSTER”) .

Let’s try a practical example using the vignette that comes along with the R package.

Eskine (2013) showed that participants who had been exposed to organic food were substantially harsher in their moral judgments relative to those in the control condition ( d = 0.81, 95% confidence interval: [0.19, 1.45]). A replication by Moery and Calin-Jageman (2016, Study 2) did not observe a significant effect (control: n = 95, M = 5.25, SD = 0.95; organic food: n = 89, M = 5.22,  SD = 0.83). The authors used Uri Simonsohn’s recommendation to power their study so that they had 80% power to detect an effect that the original study had 33% power to detect. This is the same as saying: We consider an effect to be “small” when it is smaller than the effect size the original study had 33% power to detect.

With n = 21 in each condition, Eskine had 33% power to detect an effect of d = 0.48. This is the effect the authors of the replication study designed their experiment to detect. The original study had shown an effect of d = 0.81, and the authors performing the replication decided that an effect size of d = 0.48 would be the smallest effect size they would aim to detect with 80% power. So we can use this effect size as the equivalence bound. We can use R to perform an equivalence test:

install.packages("TOSTER") library("TOSTER") TOSTtwo(m1=5.25, m2=5.22, sd1=0.95, sd2=0.83, n1=95, n2=89, low_eqbound_d=-0.43, high_eqbound_d=0.43, alpha = 0.05) # Which gives us the following output: Using alpha = 0.05 Student's t-test was non-significant, t(182) = 0.2274761, p = 0.8203089 Using alpha = 0.05 the equivalence test based on Student's t-test was significant, t(182) = -3.026311, p = 0.001417168 TOST results: t-value 1 p-value 1 t-value 2 p-value 2 df 1 3.481263 0.0003123764 -3.026311 0.001417168 182 Equivalence bounds (Cohen's d): low bound d high bound d 1 -0.48 0.48 Equivalence bounds (raw scores): low bound raw high bound raw 1 -0.4291159 0.4291159 TOST confidence interval: Lower Limit 90% CI raw Upper Limit 90% CI raw 1 -0.1880364 0.2480364

You see, we are just using R like a fancy calculator, entering all the numbers in a single function. But I can understand if you are a bit intimidated by R. So, you can also fill in the same info in the spreadsheet.

Using a TOST equivalence procedure with α = .05 and without assuming equal variances (because when sample sizes are unequal, you should report Welch’s t test by default), we can reject effects larger than d = 0.48, t (182) = –3.03, p = .001.

The R package also provides a graph, displaying the observed mean difference (in raw scale units), the equivalence bounds (also in raw scores), and the 90% and 95% CIs. If the 90% CI does not include the equivalence bounds, we can declare equivalence.

equivalence hypothesis in psychology

Moery and Calin-Jageman concluded from this study: “We again found that food exposure has little to no effect on moral judgments.” But what is “little to no”? The equivalence test tells us the authors successfully rejected effects of a size the original study had 33% power to reject. Instead of saying “little to no,” we can put a number on the effect size we have rejected by performing an equivalence test.

If you want to read more about equivalence tests, including how to perform them for one-sample  t tests, dependent  t tests, correlations, or meta-analyses, you can check out a practical primer on equivalence testing using the TOSTprocedure I’ve written. It’s available as a preprint on PsyArXiv. The R code is available on GitHub.

Daniel Lakens will speak at the 2017 APS Annual Convention, May 25–28, 2017, in Boston, Massachusetts. He also will speak at the International Convention of Psychological Science, March 23–25, 2017, in Vienna, Austria.

TOSTER Materials

The TOSTER spreadsheet is available here .

The TOSTER R package can be installed from CRAN using install.packages(“TOSTER”) .

The practical primer on equivalence testing using the TOST procedure is available here .

The R code is available here .

Detailed example vignettes are available here .

Eskine, K. J. (2013). Wholesome foods and wholesome morals? Organic foods reduce prosocial behavior and harshen moral judgments.  Social Psychological and Personality Science ,  4 , 251–254. doi:10.1177/1948550612447114

Lakens, D. (2015). Always use Welch’s t-test instead of student’s t-test. Retrieved from http://daniellakens.blogspot.nl/2015/01/always-use-welchs-t-test-instead-of.html

Lakens, D. (2016a). Introduction to equivalence testing with TOSTER. Retrieved from https://cran.rstudio.com/web/packages/TOSTER/vignettes/IntroductionToTOSTER.html

Lakens, D. (2016b). TOST equivalence testing R package (TOSTER) and spreadsheet. Retrieved from http://daniellakens.blogspot.com/2016/12/tost-equivalence-testing-r-package.html

Lakens, D. (in press). Equivalence tests: A practical primer for t-tests, correlations, and meta-analyses. Social Psychological and Personality Science.

Moery, E., & Calin-Jageman, R. J. (2016). Direct and conceptual Replications of Eskine (2013): Organic food exposure has little to no effect on moral judgments and prosocial behavior.  Social Psychological and Personality Science ,  7 , 312–319. doi: 10.1177/1948550616639649

Morey, R. D., & Lakens, D. (2016). Why most of psychology is statistically unfalsifiable. Retrieved from https://raw.githubusercontent.com/richarddmorey/psychology_resolution/master/paper/response.pdf

About the Author

Daniel Lakens is an experimental psychologist at the Human-Technology Interaction Group at Eindhoven University of Technology, the Netherlands. His main lines of empirical research focus on conceptual thought and meaning. In addition, he writes applied articles on statistics and teaches a free “Improving Your Statistical Inferences” MOOC on Coursera. He can be reached at [email protected] .

equivalence hypothesis in psychology

Careers Up Close: Joel Anderson on Gender and Sexual Prejudices, the Freedoms of Academic Research, and the Importance of Collaboration

Joel Anderson, a senior research fellow at both Australian Catholic University and La Trobe University, researches group processes, with a specific interest on prejudice, stigma, and stereotypes.

equivalence hypothesis in psychology

Experimental Methods Are Not Neutral Tools

Ana Sofia Morais and Ralph Hertwig explain how experimental psychologists have painted too negative a picture of human rationality, and how their pessimism is rooted in a seemingly mundane detail: methodological choices. 

APS Fellows Elected to SEP

In addition, an APS Rising Star receives the society’s Early Investigator Award.

Privacy Overview

CookieDurationDescription
__cf_bm30 minutesThis cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
CookieDurationDescription
AWSELBCORS5 minutesThis cookie is used by Elastic Load Balancing from Amazon Web Services to effectively balance load on the servers.
CookieDurationDescription
at-randneverAddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT2 yearsYouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
uvc1 year 27 daysSet by addthis.com to determine the usage of addthis.com service.
_ga2 yearsThe _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_3507334_11 minuteSet by Google to distinguish users.
_gid1 dayInstalled by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CookieDurationDescription
loc1 year 27 daysAddThis sets this geolocation cookie to help understand the location of users who share the information.
VISITOR_INFO1_LIVE5 months 27 daysA cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSCsessionYSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devicesneverYouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-idneverYouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextIdneverThis cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requestsneverThis cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Startseite der Einrichtung

  • Erweiterte Suche
  • Personen (gesamte HU)

Humboldt-Universität zu Berlin - Lebenswissen­schaftliche Fakultät - Institut für Psychologie

The functional equivalence hypothesis.

This hypothesis states that there is a perceptual overlap between emotion expressions and certain trait markers, which then influences emotion communication. Specifically, Hess and colleagues propose the notion that some aspects of facial expressive behavior and morphological cues to dominance and affiliation are equivalent in their effects on emotional attributions. The hypothesis has been tested with regard to gender related differences in morphology. Specifically, men's faces are generally perceived as more dominant, whereas women's faces are perceived as more affiliative. Mediation analyses confirmed that beliefs about the emotionality of men and women are mediated by these perceptions of dominance and affiliation (Hess, et al., 2005). In turn, judgments of emotional facial expressions by men and women are consistent with these perceptions (Hess, et al., 1997). A recent study further suggests that social role based stereotypes and facial morphology interact to create beliefs about a person's likely emotional reactions (Hess, et al., in press). This line of research has also shown that facial expressions of anger and happiness entrain perceptions of dominance and affiliation respectively, and perceptually overlap with the morphological markers for dominance and affiliation (Hess, et al., 2009a). Specifically, in a double oddball design, an increase in reaction time is found when, from a perceptual perspective, the two types of faces belong to the same category (Campanella, et al., 2002). In Hess, et al. (2009), participants had to identify neutral faces that were embedded in a series of either angry or happy faces. They were significantly slower to identify highly dominant versus affiliative faces when the faces were embedded in the angry series. The converse was the case for affiliative faces embedded in the happy series. This supports the notion that anger and dominance on one hand, and happiness and affiliation on the other share perceptual markers. Further, angry expressions on dominant male faces are perceived as more threatening than the same expressions on more affiliative female faces and conversely, smiles on more affiliative female faces are perceived as more appetitive than smiles perceived on more dominant male faces (Hess, Sabourin, et al., 2007). Hess, U., Thibault, P., Adams, R. B., Jr. & Kleck, R. E. (in press). The influence of gender, social roles and facial appearance on perceived emotionality. European Journal of Social Psychology. Hess, U. Adams, R. B. Jr., Grammer, K., & Kleck, R. E. (2009). If it frowns it must be a man: Emotion expression influences sex labeling. Journal of Vision. 9(12), Article 19, 1-8. Hess, U., Adams, Jr, R.B., Kleck, R.E. (2009). The face is not an empty canvas: How facial expressions interact with facial appearance. Philosophical Transactions of the Royal Society London B, 364, 497-3504. Hess, U., Adams, J. B., Jr., & Kleck, R. E. (2009). The categorical perception of emotions and traits. Social Cognition, 27, 319-325. Hess, U. Adams, R. B. Jr., Kleck, R. E. (2007). Looking at you or looking elsewhere: The influence of head orientation on the signal value of emotional facial expressions. Motivation and Emotion, 31, 137-144. Hess, U., Sabourin, G., Kleck, R. E. (2007). Postauricular and eye-blink startle responses to facial expressions. Psychophysiology, 44, 431-435. Hess, U., Adams, R. B. Jr., & Kleck, R.E. (2005). Who may frown and who should smile? Dominance, affiliation, and the display of happiness and anger. Cognition & Emotion, 19, 515-536. Hess, U., Adams, R. B. Jr. & Kleck, R. E. (2004). Facial appearance, gender, and emotion expression. Emotion, 4, 378-388.

Die HU im WWW

  • Die Humboldt-Universität bei Facebook
  • Die Humboldt-Universität bei BlueSky
  • Die Humboldt-Universität bei Instagram
  • Die Humboldt-Universität bei YouTube
  • Die Humboldt-Universität bei LinkedIn
  • Die RSS-Feeds der Humboldt-Universität
  • Die Humboldt-Universität bei Twitter
  • Seite bearbeiten

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • J Exp Anal Behav
  • v.91(1); 2009 Jan

Acquired Equivalence Changes Stimulus Representations

1 Dept. of Cognitive Psychology, Vrije Universiteit Amsterdam

2 Dept. of Psychology, Columbia University

3 Dept. of Psychology, Rutgers University

Acquired equivalence is a paradigm in which generalization is increased between two superficially dissimilar stimuli (or antecedents) that have previously been associated with similar outcomes (or consequents). Several possible mechanisms have been proposed, including changes in stimulus representations, either in the form of added associations or a change of feature salience. A different way of conceptualizing acquired equivalence is in terms of strategic inference: Confronted with a choice on which it has no evidence, the organism may infer from its history of reinforcement what the best option is, and that inference is observed as acquired equivalence. To test this account, we combined an incremental learning task with an episodic memory test. Drawings of faces were made equivalent through acquired equivalence training, and then paired with words in a list learning paradigm. When participants were asked to recognize specific face-word pairings, they confused faces more often when they had been made equivalent. This suggests that prior acquired equivalence training does influence how memories are coded. We also tested whether this change in coding reflected acquisition of new associations, as suggested by the associative mediation account, or whether stimuli become more similar through a reweighting of stimulus features, as assumed by some categorization theories. Results supported the associative mediation view. We discuss similarities between this view and exemplar theories of categorization performance.

In an acquired equivalence task, an organism learns that two or more stimuli are equivalent in terms of being mapped onto the same outcomes or responses. This is referred to as functional equivalence , as the grouping of stimuli is not based on stimulus characteristics, but only on a functional characteristic such as predicting the same outcome. In a typical experiment, two antecedent stimuli, A1 and A2, are first both followed by a reward, while another antecedent stimulus, B, is not. When, in a second transfer stage, A1 is paired with a shock, the conditioned fear will generalize from A1 to A2, and not to B ( Ward-Robinson & Hall, 1999 , Expt. 2). It is as if A1 and A2 have become equivalent to the animal because they predicted the same outcome (or consequent) in the first stage of the experiment; subsequent learning about A1 then transfers easily to A2. Functional equivalence is interesting in its own right, but it has also been studied as a way in which humans and nonhuman animals learn to categorize ( Urcuioli, 2001 ), and has been suggested to underlie the learning of symbolic reference ( Sidman, 1994 ; Tonneau, 2001 ).

Although often studied in animal learning paradigms (e.g., Bonardi, Rey, Richmond, & Hall, 1993 ; Honey & Hall, 1989 , 1991 ), functional equivalence can also be found in humans using, for example, the ‘Fish’ task ( Myers et al., 2003 ). On each trial of the ‘Fish’ task, participants see a face and a pair of fish, and have to learn through trial and error which of the fish goes with that face ( Figure 1 ). There are four faces (A1, A2, B1, B2), referred to as antecedents, and four possible fish (X1, X2, Y1, Y2), referred to as consequents (more common terminology in the behavior-analytic literature is ‘sample stimuli’ and ‘comparison stimuli’, respectively). In the initial training stages participants learn that, given face A1 or A2, the correct answer is to choose fish X1 over fish Y1; given face B1 or B2, the correct answer is to choose fish Y1 over fish X1. During this phase, participants learn that face A1 and A2 are equivalent with respect to the associated fish; face B1 and B2 are likewise equivalent. Next, participants learn a new set of pairs: Given face A1, choose fish X2 over Y2, and given face B1, choose fish Y2 over X2. Participants are then given a transfer test: Given face A2 or B2, will subjects choose fish X2 or Y2? Having learned that faces A1 and A2 are equivalent, participants may generalize from learning that A1 goes with Y1 to that A2 also goes with X2; the same holds for B2 (equivalent to B1) and Y2 (associated with B1). Healthy adults ( Hall, Mitchell, Graham, & Lavis, 2003 ; Myers et al., 2003 ), children ( Goyos, 2000 ; Schenk, 1994 ) and retarded individuals ( Dube, McIlvane, Maguire, Mackay, & Stoddard, 1989 ) all reliably make this type of generalization. Similar behavior can be observed in nonhuman species including rats and pigeons (e.g., Bonardi et al., 1993 ; Honey & Hall, 1991 ; Urcuioli, 2001 ).

An external file that holds a picture, illustration, etc.
Object name is jeab-91-01-08-f01.jpg

Example screen events during one trial. (A) Stimuli appear. (B) Participant responds and corrective feedback is given.

In the behavior-analytic literature, functional equivalence is interpreted as equivalence training grouping stimuli into equivalence classes based on common reinforced choices (e.g., Sidman & Tailby, 1982 ; Tonneau, 2001 ). By virtue of pairing with outcomes, antecedents A1 and A2 are altered to form one equivalence class , while B1 and B2 form another. When confronted with a new, untrained choice in the transfer test, training with other class members may generalize to this new choice. For example, when two stimuli, A1 and A2, are paired with the same outcome (e.g., X1), and A1 is subsequently paired with another outcome (X2), this training will generalize to the other member of the equivalence class. Generalization between stimuli within an equivalence class is enhanced, while generalization between stimuli in different equivalence classes is reduced. The underlying process is often referred to as transfer of function from A1 to A2. Equivalence classes can come about not only by common reinforced choices, but also when members of a class share a common reinforcer (e.g. Dube et al., 1989 ). Antecedents and consequents also readily form equivalence classes ( Markham, Dougher, & Augustson, 2002 ), as may stimuli and the responses that must be given to them (though see Urcuioli, Lionello-DeNolf, Michalek, & Vasconcelos, 2006 , for contrary evidence with pigeons).

At a cognitive level, there are two ways to understand equivalence classes. On the one hand, generalization within them may reflect a strategic inference . Since organisms do not have any evidence to base their choice on in transfer tests, they may assume that it is most advantageous to base their choice on training with other class members. In the example given above, they may infer that, in the absence of better evidence, A2 is more likely than not to require a choice of X2, given that A1 requires a choice of X2.

Alternatively, members of an equivalence class may be stored in memory in a way that increases generalization between them. Indeed, many explanations for acquired equivalence are based on the idea that, during initial training, the representations of stimuli paired with the same outcome or consequent are modified to increase subsequent generalization between them (e.g., Gluck & Myers, 2001 ; Hall et al., 2003 ; Myers & Gluck, 1996 ). Conversely, representations of stimuli that are associated with different outcomes should become less alike.

Two mechanisms have been proposed to underlie such representational change . One hypothesis is that representations of equivalent stimuli become more similar because overlapping features become more salient. In the fish task, faces have three salient features (age, gender, hair color), of which equivalent faces share one (the specific shared feature is counterbalanced over participants). For example, suppose that two faces with yellow hair are paired with the same consequent, while two brown-haired faces, paired with a different consequent, form a second equivalence class. Equivalence training might make hair color more salient, emphasizing the differences between equivalence classes, while deemphasizing other features (gender and age) that vary within an equivalence class. This would result in antecedents within one equivalence class being perceived as more similar to each other, and less similar to antecedents from other equivalence classes. Hypotheses of this kind, which emphasize selective attention to one or more features that control responding, have been proposed by theories of animal learning ( Mackintosh, 1975 ), have received experimental support (e.g., Foree & LoLordo, 1970 ; Mackintosh, 1965 ; Reynolds, 1961 ), and are common currency in the field of categorization ( Kruschke, 1979 , 1992 ; Medin & Schaffer, 1978 ). Indeed, acquired equivalence tasks are similar to categorization tasks in that stimuli are grouped into classes according to the outcome or response they evoke. In many acquired equivalence studies with human participants, the first stage of training also uses verbal labels, making it equivalent to a categorization task (e.g., participants may be asked to discover that one verbal label applies to A1 and A2, and another to B1 and B2).

This hypothesis, which we will call the feature salience account of acquired equivalence, is intuitively appealing. If, for example, hair color can be used to remember which choice to make in the presence of which face, it seems natural to suggest that participants use hair color to code the faces when remembering the choices to be made.

An alternative hypothesis was presented by Hall and colleagues ( Hall et al., 2003 ; Ward-Robinson & Hall, 1999 ), building on earlier ideas by Hull (1939) . They propose that acquired equivalence training adds an association to stimulus representations. Retrieval of this association later in training then leads to the transfer. In the Fish paradigm, participants might first learn to associate faces A1 and A2 with fish X1. In a later phase, participants are trained to pair face A1 with fish X2. During that training, the previously learned association of face A1 and fish X1 will be activated, leading fish X1 and X2 to become associated, by virtue of their pairing with the same consequent. When face A2 is now shown, it will activate its prior association with X1, which in turn activates X2; thus, the participant will choose X2 in the presence of A2.

Hall et al. (2003) found evidence for this associative mediation account of acquired equivalence. In one experiment, they linked stimuli A1 and A2 to color patches (e.g., a square in a distinct red color). They found that when A1 was coupled with a response in a second training stage, this generalized not only to A2 but also to the color patch. This is what would be expected by the associative mediation account, namely: Generalization can occur between consequents that had previously been paired with the same antecedent. However, the same research group also found evidence for the feature salience account. In an acquired equivalence task, participants learned about four antecedents: two snowflake patterns and two color patches. These were paired with consequents so as to form equivalence classes that either shared a clear feature (two snowflake patterns in one class, two color patches in the other) or did not (one snowflake and color patch in each class). As would be predicted by the feature salience account, the equivalence training had a much stronger effect on later generalization when the equivalent antecedents shared a clear feature than when they did not ( Bonardi, Graham, & Hall, 2005 ). The experiment was set up in such a way that associative mediation was excluded as an explanation for these results.

In the studies reported here, we first sought to clarify whether acquired equivalence training would lead to responses that could not be based on strategic inferences (Experiment 1). If representations are indeed being changed during training, this should also show when they are probed in different ways, such as in an explicit memory test. In particular, if representations of two stimuli have become more similar through equivalence training, then they should also become more confusable in episodic memory. We tested this by combining equivalence training with an episodic associative recognition task. If acquired equivalence changes representations, participants should make more errors on items trained for equivalence (critical lures) than on items that had not been trained for equivalence (control lures). If, on the other hand, acquired equivalence relies on inferences and not on representational change, then performance on the episodic memory test should not be impacted by acquired equivalence. This is so because in the associative recognition task, a response based on the equivalence of stimuli is an error. Since participants have no incentive to make an error, they have no incentive to infer responses from the equivalence training. This contrasts with traditional tests of transfer of function, in which responses based on equivalence training are never predefined as errors; either no information is given about which answers are correct, or participants receive feedback only after giving the response. In such contexts, responding on the basis of experience with other members of an equivalence class could be argued to be a rational strategy.

Next (Experiment 2), we reversed the order of equivalence training and episodic memory task, to determine whether the effects of acquired equivalence resided at training or testing. To preview results, Experiment 1 showed that acquired equivalence training could indeed influence performance in an episodic memory test, while Experiment 2 showed that the effects occur during training, not testing.

Having obtained support for a representational account of acquired equivalence, we wanted to contrast the two possible mechanisms for representational change (feature salience vs. associative mediation) in a design in which the two accounts would lead to opposite predictions (Experiment 3). Results were consistent with the associative mediation account, rather than the feature salience account.

Experiment 1

Experiment 1 tested whether equivalence training altered stimulus representations, by combining equivalence training with an episodic memory task (see Figure 2 ). Participants underwent acquired equivalence training on fish–face pairs, and were then given a generalization test. They then studied words in the presence of the previously trained face antecedents. Participants were only instructed to study the words, but were subsequently tested on their recognition of the word–face pairings (associative recognition). Finally, the face–fish generalization test was repeated to document that face–word training had not disrupted the equivalencies learned earlier.

An external file that holds a picture, illustration, etc.
Object name is jeab-91-01-08-f02.jpg

Stages of Experiment 1. In the first three stages, participants learn via trial and error which faces “have” which fish (shown is one set of face–fish pairings, but these were different for each participant). In transfer tests (stages 4–7), acquired equivalence is tested by letting participants decide on unseen face–fish pairings without receiving feedback. In the face–word study, faces are paired with words. Memory for these pairings is subsequently tested in an associate recognition test. In this test, the words can either be shown with the face that they were paired with in the study stage, with a face that is equivalent to the on it was paired with (left pair, a critical lure ), or with a face that was not made equivalent (right pair, a control lure ).

During the recognition test, participants were confronted with studied word–face pairs as well as with two kinds of lures: critical lures in which a word that had been studied with one face was now shown with an equivalent face (e.g., a word that had been paired with face A1 at training was shown at test with equivalent face A2), and control lures in which the face at test was not equivalent to the face at study (e.g., a word combined with face A1 at training was shown at test with nonequivalent face B1). If acquired equivalence changes representations that are also involved in episodic memory, then participants should make more errors on critical lures than on control lures. If, on the other hand, acquired equivalence relies on inferences and not on representational change, no such difference should be found.

Participants were 34 students at Rutgers University, who received class credit in an introductory psychology class in exchange for their participation. Twenty participants were female and 14 male; mean age was 19 years (range 17–27). For 2 participants, data from the second transfer test were lost due to computer failure; data from all previous stages were analyzed for these participants. All participants signed statements of informed consent before the initiation of any behavioral testing, and research procedures conformed to the regulations established by the Federal Government and by Rutgers University.

Apparatus and Stimuli

Behavioral testing was automated on a computer. The participant was seated at a comfortable viewing distance from the screen (approximately 700 mm). The keyboard was masked except for two keys, labeled “LEFT” and “RIGHT”, which the participant could press to record a response.

Four drawings of faces (man, woman, girl, boy) served as the antecedent stimuli. The boy and woman had yellow hair while the girl and man had brown hair. Thus, each antecedent had three obvious, binary-valued features: age (adult vs. child), gender (male vs. female) and hair color (yellow vs. brown); each antecedent shared exactly one feature with each other antecedent. For each participant, the four face drawings were randomly assigned to be antecedents A1, A2, B1, and B2. For each participant, drawings of a fish colored red, blue, green, and yellow were randomly assigned to X1, X2, Y1, and Y2. Faces and fish appeared about 3 cm high on the computer screen.

For face–word pairing, the 16 words used were: pencil, farmer, curtain, house, parent, garden, turkey, mountain, river, bell, coffee, nose, hat, school, moon, drum. Four words were paired with each antecedent, with face–word assignment randomized across subjects. Words were presented on the screen in black lowercase font (Arial, 24-point), about 1 cm high.

At the start of the experiment, on-screen instructions stated: “You will see drawings of people who each have some pet fish. Different people have different kinds of fish. Your job is to learn which kinds of fish each person has. At first, you will have to guess.” The experimenter read these instructions aloud to the participant.

In all, the experiment consisted of seven stages ( Figure 2 ). Stages 1–4 were the same as previously described in Myers et al. (2003) . The first three stages were face–fish training. On each trial, the screen showed a single antecedent (face), two consequents (fish) presented side-by-side in randomized left–right order, and a prompt: “Which fish does this person have? Use the LEFT or RIGHT key to choose.” An example of screen appearance at the start of a trial is shown in Figure 1 . The participant responded by pressing one of the two labeled keys. The selected consequent (fish) was circled, and corrective feedback was given. The stimuli remained on the screen until the participant responded; feedback was shown for 1 s, and there was a 1-s intertrial interval, during which the screen was blank.

There were three stages of training, each with increasing numbers of trial types as shown in Figure 2 . In the first shaping stage, trials presented one of two faces (A1 or B1) and a pair of fish (X1 and Y1), and the participant learned to pair the correct fish with each face (A1 with X1 and B1 with Y1). Since the fish could appear in either left–right order, this made four trial types in Stage 1 (A1 with X1, Y1; A1 with Y1, X1; B1 with X1,Y1; B1 with Y1,X1). A block of trials in Stage 1 consisted of one of each of these four trial types, in pseudorandom order. Shaping continued for a maximum of eight blocks (32 trials) or terminated early if the participant made eight consecutive correct responses.

The second equivalence training stage then began, without warning to the participant. In equivalence training, the previously learned trial types were intermixed with trials on which one of two new faces (A2 or B2) was presented with the familiar fish (X1 and Y1). Since, again, the fish could appear in either left–right order, there were eight trial types, four old and four new. Each block consisted of eight trials, one with each trial type, in pseudorandom order. Equivalence training continued until the participant reached a criterion of eight consecutive correct responses, or for a maximum of eight blocks (64 trials). The third “new consequents” stage then began, without warning to the participant. Here, on each trial, a familiar face (A1 or B1) was presented with two new fish (X2 and Y2). Again, the fish could appear in either left–right ordering, and these new trial types were interleaved with the previously trained ones; thus, there were 12 trial types in each block. Training with new consequents continued until the participant made 12 consecutive correct responses or until a maximum of eight blocks (96 trials) had been completed.

At the conclusion of training, the following instructions appeared: “Good! In this part of the experiment, you will need to remember what you have learned so far. You will NOT be shown the correct answers. At the end of the experiment, the computer will tell you how many you got right. Good luck!” A transfer test (stage 4) followed, which consisted of 16 trials: all 6 trial types from the acquisition stages plus the 2 new test trial types (A2 paired with X2 or Y2, and B2 paired with X2 or Y2), with the consequents in each possible left–right ordering. On each trial, the screen showed one face and two fish; the fish chosen by the participant was circled, but no corrective feedback was given. Trial order was random for each participant.

In the fifth face–word study stage, participants were instructed to study words. On each trial, a single face (A1, B1, A2, or B2) appeared in its usual position, with a single word underneath. Each pairing was shown for 5 s. Since each face was paired with four words total, there were 16 trial types. Face–word pairings included two passes through a block of 16 trial types, with order randomized within a block, for a total of 32 trials. Participants were instructed to study the words for a later “memory test”, and were not given instructions on how to process the faces.

The sixth stage was an associative recognition test, containing 16 trials—one with each of the 16 studied words. Each word appeared with one of the four face stimuli, and participants were asked whether that particular face–word pairing had occurred during study. On eight of the trials, the word–face pair had indeed been presented during training; errors here were misses . On the other eight trials, a word appeared together with a face with which it had not previously been paired; errors here were false positives . On four of these, the word appeared paired with a face that had been made equivalent with the face presented during study; these were critical lures . On the other four trials, words were paired with nonequivalent faces; these pairs were control lures .

The final stage was a second transfer test, identical to the first one. It was intended as a posttest to confirm that learning the word–face pairs had not disrupted the original face–fish associations.

All participants learned all trained face–fish pairs within the maximum allowable number of trials. Table 1 shows the results from the transfer tests. To make errors amenable to standard parametric tests, we transformed error proportions to a form suitable for analyses of variance 1 ( Winer, Brown, & Michels, 1991 ). In both transfer tests, very few errors were made on the trained pairs. There were more errors on the transfer pairs than on the old pairs in the first transfer test (two-tailed paired-samples t -test, t (34)  =  2.78, p < .01), but many fewer than the 50% that would be expected by chance, t (34)  =  6.34, p <. 001 one-tailed. On the final transfer test, there were again many fewer errors than could be expected by chance, t (31)  =  11.3, p < .001 one-tailed. There was no difference between old and new pairs, t < 1, due to slightly fewer errors on the new pairs in the final transfer test and slightly more errors on the old pairs, although neither difference was significant (first vs. final test for old pairs: t (31)  =  1.01, p  =  .32; for new pairs: t (31)  =  1.6, p  =  .12, both two-tailed). Both transfer tests show a standard acquired equivalence effect, while the second test confirms that the learned equivalencies were maintained during the episodic learning and test stages.

Results from the two transfer tests in Experiment 1 (stage 4 and stage 7) and the one transfer test in Experiment 2.

Expt 1
1st test2nd testExpt 2
old pairs0.020.030.03
transfer pairs0.150.080.16

Figure 3 shows the results from the recognition memory test (sixth stage). Consistent with our hypothesis, participants made more errors on critical lures than on control lures, t (33)  =  2.05, p  =  .024 one-tailed). Eleven subjects made only critical lure errors, while just 2 made only control lure errors and 6 made both kinds of errors.

An external file that holds a picture, illustration, etc.
Object name is jeab-91-01-08-f03.jpg

Error rates in the episodic memory test in Experiment 1 and Experiment 2. Participants had to judge whether words and faces had been shown together at study. Dashed bars give the number of intact pairs erroneously labeled as new (“misses”). “Control lures” are word–face pairings in which the face is nonequivalent to the face shown with the word at study. “Critical lures” are word–face pairings in which the face was made equivalent during equivalence training to the face shown with the word at study. Error bars give 1 SEM.

The results of Experiment 1 show that acquired equivalence may alter performance on an episodic memory test. Our explanation for this is that equivalence training alters the representations of stimuli, which in turn increases generalization between these stimuli when they are subsequently invoked during face–word training. These representational changes should increase similarity between stimuli within an equivalence class, while decreasing similarity between stimuli in different equivalence classes. These representational differences may then lead to the increased confusability we found for equivalent stimuli as compared to nonequivalent stimuli.

The results cannot be explained in terms of strategic inferences. The participants did not have a strategic incentive to respond on the basis of equivalence classes, as such responses were errors in the task. Even if they had guessed that they would be tested on word–face pairings, they would have had an incentive to code what makes the faces distinctive, not what makes them similar to one another. The results thus seem to support an explanation of acquired equivalence in terms of changed representations.

Experiment 2

The results of Experiment 1 suggest that equivalence training alters the representations of antecedent stimuli paired with the same consequent, and that these altered representations affect subsequent episodic learning about those stimuli. It is possible, however, that the effect resides not at acquisition, but at retrieval. It may be that the acquired equivalence training changes the ways in which the faces are used as cues in the recognition test. To test for this possibility, we performed a second experiment where the word–face training preceded the acquired equivalence training; the episodic memory test still followed the acquired equivalence training. If acquired equivalence training changes the way episodic memories are coded, it should not affect performance on episodic memories acquired before the equivalence training. If, on the other hand, acquired equivalence training changes the way cues are used at test, we should replicate the findings of Experiment 1.

Participants were 31 students at Rutgers University, who, as in Experiment 1, received class credit for their participation; 15 were female, and 16 male; mean age was 19 years (range 18–24). All participants signed statements of informed consent before the initiation of any behavioral testing. All research procedures conformed to the regulations established by the Federal Government and by Rutgers University.

Stimuli and stages of Experiment 2 were the same as in Experiment 1, but the ordering of the stages was changed. The face–word pairings study stage (fifth stage of Experiment 1) now came first, and thus preceded the acquired equivalence training and the transfer test (Stages 1–4). The recognition test followed the transfer test. There was no final transfer test as in Experiment 1.

As in Experiment 1, all participants learned all face–fish pairs within the maximum allowable number of trials. Table 1 shows the results from the transfer test. We again transformed error proportions to make them amenable to standard parametric tests. Very few errors were made on the trained pairs. There were more errors on the transfer pairs than on the old pairs, t (30)  =  2.44, p  =  .02 two-tailed, but many fewer than the 50% that would be expected by chance, t (30)  =  6.44, p < .001 one-tailed. The magnitude of acquired equivalence was similar to that obtained in Experiment 1.

Figure 3 shows the results from the recognition memory test. There were more errors on both studied words and lures than in Experiment 1, which is unsurprising given the longer interval between study and test. More importantly, participants did not make more errors on critical lures than on other rearranged words, one-tailed, t < 1. Just 3 subjects made only critical lure errors, with 6 making only control lure errors and 9 made both kinds of errors. Thus, the effect of acquired equivalence on episodic memory obtained in Experiment 1 was abolished if the word–face training preceded the equivalence training.

Experiment 1 showed that equivalence training could influence episodic memory involving the same stimuli, by increasing confusability among stimuli that had previously been associated with the same consequent. This effect is in a sense a memory distortion, as increased similarity led to more responses that are defined as errors by the task. Other forms of distortion are false memory effects such as those in the Deese-Roediger-McDermott paradigm ( Roediger & McDermott, 1995 ). In this paradigm, a list of words is learned that are all related to one critical lure. At test, participants tend to recall or recognize the lure as seen, even though it was itself not presented. Accounts based on both learning and retrieval have been proposed, but recent evidence suggests that all or part of the effect resides at the retrieval stage: At test, the lure activates memories of list items, and is therefore mistaken for one ( Zeelenberg, Boot, & Pecher, in press ). Experiment 2 seems to rule out such retrieval-based accounts for our results, as we did not find any effect of equivalence training on recognition performance when the training occurred after encoding but before retrieval. This suggests that equivalence training changes the representations of the equivalent stimuli, which in turn influences how the stimuli are processed during episodic encoding.

A confounding factor in the comparison between Experiments 1 and 2 is that the delay between episodic study and test was greater in Experiment 2 than in Experiment 1, and that participants therefore made more errors in Experiment 2. It is conceivable that this somehow affected the difference between control and critical lures. However, forgetting swiftly results in the confusion of similar stimuli, whereas discriminations between more distinct stimuli are relatively robust against forgetting (e.g., Kraemer, 1984 ). The difference in performance on the more similar critical lures and the more dissimilar control lures would thus be more likely to grow with time than to shrink.

From a behavioral perspective, there is no reason to expect that reversal of the order of episodic study and equivalence training would result in different performance. Urcuioli (2001) performed a traditional acquired equivalence task in pigeons, but placed the equivalence training after training with new consequents. He found that this reordering did not affect transfer of function in a transfer test. Also, both Dube et al. (1989) and Goyos (2000) found that training subjects to reassign stimuli to a different equivalence class could change how training with new consequents was applied to those reassigned stimuli. Equivalence training can thus affect already stored associations. That it did not in this experiment strengthens our interpretation that the effect of equivalence training on episodic memory performance resides at study, not at test.

Experiment 3

Experiments 1 and 2 together provide converging evidence that acquired equivalence does involve change in stimulus representations. There are two viable accounts of what form this representational change might take. The feature-salience account suggests that features that differ between equivalence classes are emphasized (made more salient), making representations of equivalent stimuli more similar. The associative-mediation account suggests that the important change occurs in associations between consequents. Experiment 3 was designed to test these two accounts.

Consider the case in which one feature, say hair color, determines which faces are equivalent in the fish task: Yellow-haired faces are associated with X1 and brown-haired faces with X2. What would happen if the hair color of one of the faces were switched? The feature-salience account would predict that participants would reassign this face to a different equivalence class, as hair color is the feature that determines antecedent equivalence. The associative-mediation account would predict the contrary: As long as the face remains recognizable, it should continue to evoke the same response as before.

Another case in which the two accounts make opposing predictions is when a new antecedent is introduced. The feature-salience account would predict that new antecedents should be automatically assigned to an equivalence class based on their features. For example, if hair color is the salient feature, a new face with yellow hair should elicit the same responses as other yellow-haired faces. In contrast, the associative-mediation account would assign no special meaning to a single feature, such as hair color; instead, the new face should either be randomly associated with a consequent (as participants simply guess to which class it might belong) or else should be assigned to the class containing stimuli with which it shares the most features overall.

In Experiment 3, we simplified the Fish task to only the shaping and equivalence-training stages. In the experimental condition, equivalence classes were based on hair color (so that yellow-haired faces were mapped to consequent X1, while brown-haired faces were mapped to consequent Y1). This was because hair color is the only feature that can be changed without changing the identity of the face. Hair color is not a weak feature, however, in that participants learn the equivalence task as readily if the grouping is based on hair color as when it is based on a different feature. All learned associations were tested in a third stage, followed by testing on a previously trained face with a hair color change, as well as testing with a new face entirely. In the control condition, equivalence classes were based on age (adult vs. child) or gender (male vs. female) but not hair color, followed by the same transfer test of a previously trained face with a hair color change (see Figure 4 for design and predictions).

An external file that holds a picture, illustration, etc.
Object name is jeab-91-01-08-f04.jpg

Design of Experiment 3, and predictions of the two accounts. In the first two stages (Training), participants learn via trial and error which faces “have” which fish (shown is one set of face–fish pairings, but these were different for each participant). In the experimental condition, equivalence is based on hair color of the face stimuli, while in the control condition it is based on either gender or age. In stage 3 (Test), participants must decide on face–fish pairings without receiving feedback. Given in the lower part of the figure are the responses as predicted by the feature-salience account, the associative-mediation (assoc.) account, and the idea that hair color is a feature that dominates responding independent of what equivalence training was given.

Both the feature-salience and mediated-associations accounts predict that, in the control condition, the hair color change will be ignored, and the face should be mapped as previously. In the experimental condition, however, predictions differ. According to the feature-salience account, a hair color change should lead the face to be classified with other faces that have the same hair color; according to the associative-mediation account, the face should be identified as it always was, regardless of a single new feature (hair color).

Participants were 40 students at Rutgers University, who received class credit in an introductory psychology class in exchange for their participation. Six participants were male, and 34 were female; mean age was 19 years (range 17–31). All participants signed statements of informed consent before the initiation of any behavioral testing, and research procedures conformed to the regulations established by the Federal Government and by Rutgers University. Participants were randomly and evenly assigned to the experimental and control conditions.

Procedurally, the shaping and equivalence training stages were similar to Experiment 1: On each trial, one of the antecedent faces was presented together with two possible consequent fish stimuli. The “new consequents” stage was omitted. The last phase was a no-feedback phase to test learning of face–fish associations.

Four additional drawings were constructed from the usual faces by switching hair color (see Figure 4 ). There were two conditions in the experiment. In the experimental condition, assignment of the face drawings to equivalence classes during Stages 1 and 2 was based on hair color; i.e., the yellow-haired woman (A1) and yellow-haired boy (A2) were associated with consequent X1, while the brown-haired girl (B1) and brown-haired man (B2) were associated with consequent Y1. The other features (gender and age) varied across and within equivalence classes. These antecedents and consequents were trained in Stages 1 and 2, exactly as in the previous experiments. In the transfer test trained pairs (A1–X1, B1–Y1) were tested, as well as presenting a new face: a brown-haired boy that could be paired with either X1 or Y1.

In the control condition, for half the participants, equivalence class was based on gender: the brown-haired girl (A1) and yellow-haired woman (A2) mapped to X1; the yellow-haired boy (B1) and yellow-haired man (B2) mapped to Y1. For the other half of participants in the control condition, equivalence class was based on age (adults vs. children): yellow-haired woman (A1) and yellow-haired man (A2) mapped to X1; brown-haired boy (B1) and yellow-haired girl (B2) mapped to Y1. Equivalence training was followed by the testing of the trained pairs (A1–X1 and B1–Y1) as well as a new face. For participants trained on gender, this was a brown-haired man who, according to both accounts, should be mapped to Y1 like the previously trained yellow-haired man. For those trained on age, it was a brown-haired woman who should be mapped to X1, like the other adults. The control condition was designed to test whether hair color was an overwhelmingly salient feature, in which case the new faces should be mapped to the opposite consequents like the only other antecedent with the same hair color.

For all subjects, the transfer test consisted of one presentation of each of the three trial types, with the fish in each possible left–right order, for a total of six trials. Trials were presented in randomized order for each subject.

Within the control condition, no difference between participants in the age and gender subgroups could be found; data from these subgroups were therefore combined. Errors were transformed as in the previous experiments. There was no difference between the conditions on the number of errors in the first or second stage of training, t (38) < 1, nor on the errors made on the “old” faces in the transfer test [mean number of errors  =  .15 and .30 for the experimental and control conditions respectively; t (38) < 1]. In other words, participants in the control and experimental conditions were equally able to learn and retain equivalencies based on hair color as on age or gender.

Because of the few trials involved, responses to the changed face stimuli were analyzed as follows. Responses could either be based on the identity of the face, or on the hair color. In the first case the response to a face would remain as it was in the training stages, in the latter it changes into what is appropriate for faces with its new hair color. Responses appropriate for faces with the new hair color will be called “hair color responses”, and were the central dependent variable. The feature- salience account predicts hair color responses on changed faces in the experimental condition, but not in the control condition. The associative-mediation account predicts no hair color responses in either condition for changed face stimuli. If hair color is the dominant feature regardless of equivalence training, changed face stimuli should generate hair color responses in both conditions.

Table 2 gives the number of participants that gave 0, 1 or 2 “hair color” responses for changed faces. In the control condition, participants tended to ignore hair color and instead responded by treating the changed face in the same way as the trained face from which it was derived. This result confirms that hair color was not so salient as to swamp all other features, and is consistent with the predictions of both the feature-salience and associative-mediation accounts. Table 2 shows a very similar pattern of behavior for the experimental condition: Most subjects classified the changed face not by hair color but by the face from which it was derived. In other words, even though these subjects had previously learned to treat faces with the same hair color as equivalent, this single diagnostic feature was not enough to determine how a changed face would be classified. There was no difference between the patterns of responding to changed faces in the experimental and control conditions, χ 2 (1)  =  0.19, p > .5 one-tailed. Taking both conditions together, fewer participants responded based on hair color to the changed faces than based on face identity, χ 2 (1)  =  3.13, p  =  .04, one-tailed.

Number of Experiment 3 participants whose responses to the face with changed hair color were based on either face identity or hair color.

Experimental conditionControl condition
all responses based on face identity1011
mixed responding44
all responses based on hair color65

The results of Experiment 3 go counter to the feature salience account. When the diagnostic feature that determines equivalence classes (i.e., hair color) was changed, the responses to the stimulus followed the equivalence class in most participants, not the new single-feature change. This shows at least that attention paid to the diagnostic feature did not outweigh associations to the stimulus as a whole. A caveat is that in the Fish task, there are two stimuli in each stimulus class. It is possible that feature salience does play a role when larger stimulus classes are used. With larger stimulus classes, shared features and not individual stimuli may become the basis for associations with outcomes.

The results are consistent with the associative mediation account. They are not strong evidence for it, however, as in both conditions a minority of participants responded in line with the hair color feature, and not face identity. This is unsurprising insofar as a trained face with changed hair color is a degraded version of the trained stimulus and, as such, would be expected to be less able to retrieve the consequent associated with the original face. Nevertheless, it is fair to say that the associative mediation account is supported more by the elimination of a rival explanation than by confirmation of its prediction.

General Discussion

In acquired equivalence, two or more stimuli become functionally equivalent through pairing with the same outcome. Experiments 1 and 2 showed that such equivalence training can alter the way stimuli are stored in memory. The training made equivalent faces more similar and therefore more confusable, as shown by the pattern of errors in an associative recognition test. Experiment 3 showed that representational change is more likely to take the form of mediated associations ( Hall et al., 2003 ) than of an increased attention to and reliance on diagnostic features.

By showing that acquired equivalence training influences the formation of episodic memories, our findings suggest that acquired equivalence training and episodic memory rely on overlapping memory systems. Gluck and Myers (2001 ; also see Hodder, Gearge, Killcross, & Honey, 2003 ; Myers & Gluck, 1996 ) proposed that such representational modifications might occur in the hippocampal region. Specifically, representations might be changed to increase generalization between stimuli that co-occur or are paired with the same outcome (consequent). Prior work showed that humans with hippocampal atrophy are able to learn the initial face–fish mappings, and are able to continue to respond correctly to trained pairs at test, but are impaired at the novel pairs in a transfer test—failing to generalize and pair A2 with X2 and B2 with Y2 ( Myers et al., 2003 ). Functional neuroimaging (fMRI) studies have also documented hippocampal region activation while healthy adult humans learn a similar task ( Preston, Shrager, Dudukovic, & Gabrieli, 2004 ). Similarly, rats with entorhinal cortex lesions are disrupted at acquired equivalence ( Coutureau, Killcross, Good, Marshall, Ward-Robinson, & Honey, 2002 ), although the same authors found that lesions to the hippocampus proper did not impair acquired equivalence. The fact that our current findings show interactions between equivalence training and episodic memory, usually presumed to rely on the hippocampal region, adds to the body of evidence implicating the hippocampal region in acquired equivalence.

On the other hand, our results from Experiment 3 contrast with the results of Bonardi et al. (2005) , who found that equivalence training has much stronger effects when equivalent stimuli share a clear feature than when they do not. Whereas we found little evidence that individual features determined generalization, Bonardi et al. found that a shared feature facilitates equivalence training. However, the fact that such shared features facilitate acquired equivalence does not necessarily imply that acquired equivalence depends on such shared features. It is only the latter idea that was disconfirmed in Experiment 3. Our results are consistent with those of Goyos (2000) . He had children identity-match eight stimuli. Correct choices were rewarded with beads of one color for four stimuli, and beads of another color for the other four stimuli. In two experiments, he found that most children who participated would spontaneously name the colors while learning new discriminations involving the stimuli. Those that did not showed no generalization in responding in a transfer test. When taught to name the bead colors during extra training, transfer was normal. This suggests that the verbal labels mediated transfer of function, consistent with the associative-mediation account (although Goyos himself did not interpret his results in such terms).

In summary, the current experiments suggest that equivalence training changes the representations of the stimuli involved. Added associations make equivalent stimuli more similar, and therefore more easily confusable. Moreover, they allow responses learned to one stimulus to generalize to other stimuli.

This view is not very different from how exemplar-based models view performance in categorization tasks (e.g., Nosofsky, 1988 ; Nosofsky & Palmeri, 1997 ; Shiffrin, 2003 ). These models assume that category assignments are stored in the memory representations of stimuli, one of which is stored for each trial. This is equivalent to the creation of an association, assumed by associative mediation, between the stimulus and the outcome or response. Then, at test, presentation of the stimulus evokes retrieval of stimulus representations from memory, and a response is given based on the category assignment stored in the retrieved representations. What would an exemplar model of a standard acquired equivalence task look like? First, stimuli A1 and A2 are coupled to response X1, leading response X1 to be incorporated in, or linked to, memory representations of A1 and A2. Then, A1 is coupled with response X2, and so X2 is incorporated in or linked to the representation of A1. What would now happen if A2 were to be presented, and a choice given between response X2 and a different response, say Z? Retrieving stimulus A2 would also retrieve response X1, which leads to the retrieval of representations of A1, which in turn leads to retrieval of X2. Unless a representation of the alternative response Z is similarly activated, A2 will evoke X2—just as observed in an acquired equivalence study. Whether or not such an account would be an improvement over a purely associative account of acquired equivalence remains an issue for further study.

Acknowledgments

This research was supported by a VENI grant to the first author from the Netherlands Society for Scientific Research (NWO).

1 x  =  2 arcsin √( p + 1/2 n ), where x is the transformed variable, p is the proportion, and n is the number of observations underlying the proportion.

  • Bonardi C, Graham S, Hall G. Acquired distinctiveness and equivalence in human distrimination learning: Evidence for an attentional process. Psychonomic Bulletin & Review. 2005; 12 :88–92. [ PubMed ] [ Google Scholar ]
  • Bonardi C, Rey V, Richmond M, Hall G. Acquired equivalence of cues in pigeon autoshaping: Effects of training with common consequences and common antecedents. Animal Learning and Behavior. 1993; 21 :369–376. [ Google Scholar ]
  • Coutureau E, Killcross A.S, Good M, Marshall V.J, Ward-Robinson J, Honey R.C. Acquired equivalence and distinctiveness of cues: II. Neural manipulations and their implications. Journal of Experimental Psychology: Animal Behavior Processes. 2002; 28 :388–396. [ PubMed ] [ Google Scholar ]
  • Dube W.V, McIlvane W.J, Maguire R.W, Mackay H.A, Stoddard L.T. Stimulus class formation and stimulus-reinforcer relations. Journal of the Experimental Analysis of Behavior. 1989; 51 :65–76. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Foree D.D, LoLordo V.M. Attention in the pigeon: Differential effects of food-getting versus shock-avoidance procedures. Journal of Comparative and Physiological Psychology. 1970; 85 :551–558. [ PubMed ] [ Google Scholar ]
  • Gluck M.A, Myers C.E. Gateway to Memory: An Introduction to Neural Network Modeling of the Hippocampus in Learning and Memory. Cambridge, MA: MIT Press; 2001. [ Google Scholar ]
  • Goyos C. Equivalence class formation via common reinforcers among preschool children. Psychological Record. 2000; 50 :629–654. [ Google Scholar ]
  • Hall G.C, Mitchell C, Graham S, Lavis Y. Acquired equivalence and distinctiveness in human discrimination learning: Evidence for associative mediation. Journal of Experimental Psychology: General. 2003; 132 :266–276. [ PubMed ] [ Google Scholar ]
  • Hodder K.I, Gearge D.N, Killcross A.S, Honey R.C. Representational blending in human conditional learning: Implications for associative theory. Quarterly Journal of Experimental Psychology B. 2003; 56 :223–238. [ PubMed ] [ Google Scholar ]
  • Honey R, Hall G. Acquired equivalence and distinctiveness of cues using a sensory-preconditioning procedure. Quarterly Journal of Experimental Psychology. 1991; 43B :121–135. [ PubMed ] [ Google Scholar ]
  • Honey R, Hall G.C. Acquired equivalence and distinctiveness of cues. Journal of Experimental Psychology: Animal Behavior Processes. 1989; 15 :338–346. [ PubMed ] [ Google Scholar ]
  • Hull C.L. The problem of stimulus equivalence in behavior theory. Psychological Review. 1939; 46 :9–30. [ Google Scholar ]
  • Kraemer P.J. Forgetting of visual discriminations by pigeons. Journal of Experimental Psychology: Animal Behavior Processes. 1984; 10 :530–542. [ PubMed ] [ Google Scholar ]
  • Kruschke J.K. Dimensional relevance shifts in category learning. 1979. Indiana University Cognitive Science Program Technical Report #79.
  • Kruschke J.K. ALCOVE: An exemplar-based connectionist model of category learning. Psychological Review. 1992; 99 :22–44. [ PubMed ] [ Google Scholar ]
  • Mackintosh N.J. Selective attention in animal discrimination learning. Psychological Bulletin. 1965; 64 :124–150. [ PubMed ] [ Google Scholar ]
  • Mackintosh N.J. A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review. 1975; 82 :276–298. [ Google Scholar ]
  • Markham M.R, Dougher M.J, Augustson E.M. Transfer of operant discrimination and respondent elicitation via emergent relations of compound stimuli. Psychological Record. 2002; 52 :325–350. [ Google Scholar ]
  • Medin D, Schaffer M. Context theory of classification learning. Psychological Review. 1978; 85 :207–238. [ Google Scholar ]
  • Myers C.E, Gluck M. Cortico-hippocampal representations in simultaneous odor discrimination learning: A computational interpretation of Eichenbaum, Mathews & Cohen (1989) Behavioral Neuroscience. 1996; 110 :685–706. [ PubMed ] [ Google Scholar ]
  • Myers C.E, Shohamy D, Gluck M, Grossman S, Kluger A, Ferris S, et al. Dissociating hippocampal vs. basal ganglia contributions to learning and transfer. Journal of Cognitive Neuroscience. 2003; 15(2) :1–9. [ PubMed ] [ Google Scholar ]
  • Nosofsky R.M. Exemplar-based accounts of relations between classification, recognition and typicality. Journal of Experimental Psychology: Learning, Memory and Cognition. 1988; 14 :700–708. [ Google Scholar ]
  • Nosofsky R.M, Palmeri T.J. An exemplar-based random walk model of speeded classification. Psychological Review. 1997; 104 :266–300. [ PubMed ] [ Google Scholar ]
  • Preston A.R, Shrager Y, Dudukovic N.M, Gabrieli J.D.E. Hippocampal contribution to the novel use of relational information in declarative memory. Hippocampus. 2004; 14 :148–152. [ PubMed ] [ Google Scholar ]
  • Reynolds G.S. Attention in the pigeon. Journal of the Experimental Analysis of Behavior. 1961; 4 :203–208. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Roediger H.L, McDermott K.B. Creating false memories: Remembering words not presented in lists. Journal of Experimental Psychology: Learning, Memory and Cognition. 1995; 21 :803–814. [ Google Scholar ]
  • Schenk J.J. Emergent relations of equivalence generated by outcome-specific consequences in conditional discrimination. Psychological Record. 1994; 44 :537–558. [ Google Scholar ]
  • Shiffrin R.M. Modeling memory and perception. Cognitive Science. 2003; 27 :342–378. [ Google Scholar ]
  • Sidman M. Equivalence relations and behavior: A research story. Boston: Authors Cooperative; 1994. [ Google Scholar ]
  • Sidman M, Tailby W. Conditional discrimination vs. matching to sample: An expansion of the testing paradigm. Journal of the Experimental Analysis of Behavior. 1982; 37 :23–44. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Tonneau F. Equivalence relations: A critical analysis. European Journal of Behavioral Analysis. 2001; 2 :1–33. [ Google Scholar ]
  • Urcuioli P. Cook R.G, editor. Categorization & acquired equivalence. Avian visual cognition [On-line]. 2001. In. Available: www.pigeon.psy.tufts.edu/avc/urcuioli/ .
  • Urcuioli P, Lionello-DeNolf K, Michalek S, Vasconcelos M. Some tests of response membership in acquired equivalence classes. Journal of the Experimental Analysis of Behavior. 2006; 86 :81–107. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Ward-Robinson J, Hall G. The role of mediated conditioning in acquired equivalence. Quarterly Journal of Experimental Psychology B. 1999; 52 :335–350. [ PubMed ] [ Google Scholar ]
  • Winer B.J, Brown D.R, Michels K.M. Statistical principles in experimental design. New York: McGraw-Hill; 1991. [ Google Scholar ]
  • Zeelenberg R, Boot I, Pecher D. Activating the critical lure during study is unnecessary for false recognition. Consciousness and Cognition. 2005; 14 :316–326. [ PubMed ] [ Google Scholar ]

Mark Shelvock RP(Q), CT, MACP, MA

What We Know About After-Death Communication Experiences

Is death really the end of everything research offers clues..

Updated July 23, 2024 | Reviewed by Devon Frye

  • Understanding Grief
  • Take our Depression Test
  • Find a therapist to heal from grief
  • Having experiences that feel as if one is interacting with a deceased person are common.
  • These experiences tend to be positive and may even be experienced as therapeutic.
  • Research finds that after-death communication experiences can occur outside of grief.

Photo by NEOM on Unsplash

Co-authored with Imants Barušs, Ph.D.

After-death communications (ADCs) refer to the apparent communication of the deceased with the living, which are sensory experiences in nature. Experiences range significantly but can include full-body visualizations, dreams , hearing footsteps, electronic devices flickering on or off, feeling a hug, or conversations with the deceased. 1 Noted psychologist Carl Jung had dreams of his father after his father’s death, which made him think more about life after death.

ADCs are surprisingly common; studies conducted on various populations have found prevalence rates ranging from 31 to 82 percent. 2 These numbers sometimes include those who were previously doubtful of the validity of such experiences—including, in one case, a skeptic who was rattled by the apparently spontaneous playing of a broken radio during his wedding. 3 Even people who identify as atheists can report these experiences, according to empirical survey data across diverse populations. 4

A robust amount of scholarly literature around ADC experiences has emerged in recent times, which explicitly describes ADCs as helpful, reassuring, transformative, and positive in some capacity for most experiencers. This is supported by many clinical experts in thanatology, psychology, end-of-life care, and medicine, who have discussed ADCs as non-pathological experiences that are often supportive, meaningful, and affirming for the bereaved. 5

Continuing Bonds Theory

Continuing bonds theory, which is observable in cross-cultural traditions across the world, demonstrates how having an authentic sense of connection to the deceased tends to be therapeutic rather than harmful. 6 This framework for understanding grief, based partially on attachment theory and cultural studies, outlines how individuals, families, or communities who are grieving do not need to disconnect from the deceased.

Continuing bonds affirms that grief doesn’t subscribe to a rigid timeline, and that people can live their lives freely without needing to repress or sever their relationship with the deceased. 7 By continuing an ongoing relationship with the deceased, people can grapple with the deeper meaning of loss over time in a way that is personally meaningful. Thus, continuing bonds theory normalizes after-death communication experiences.

Traditional continuing bonds theorists frequently emphasize symbolic, emotional, or active participation in memorial activities, which emphasizes the mental representation of the deceased within people who are bereaved. 8 As a result, some continuing bonds scholars simply convey that the "dead are dead." These theorists are correct in acknowledging a person has physically died, and thereby an observable change in a relationship has occurred.

However, after-death communication experiences challenge the notion of what death even means. The entire ADC research literature demonstrates how continuing bonds may not be merely a mental representation; there is some possibility that an interactive connection could potentially occur between the living and the deceased.

After-Death Communications Occur Outside of Grief

Research finds that some people experience anomalous experiences in which the deceased person is clearly a separate entity, and many people report that it is the deceased who "initiates" these experiences spontaneously. 4

Elsaesser et al. (2021) discovered that 27 percent of research participants who had ADC experiences identified as not being in a state of grieving or mourning; some people had never even known the deceased person they believed they were connecting with, and thus never grieved. This demonstrates how some people have after-death communication experiences outside the context of grief and bereavement . In other words, this suggests that ADCs are not merely an extension of the grieving process, and that something else may be happening here.

The Survival Hypothesis

In the eyes of some, after-death communication experiences support the survival hypothesis, which suggests that consciousness continues after physical death. Experiences like these make us question the notion that there is only oblivion upon death. It is possible that interactions with the deceased mean that memories, perceptions, and individual experiences do not stop with physical death.

equivalence hypothesis in psychology

In the West, most of us were educated and socially conditioned to believe these sorts of things aren’t supposed to happen, but they do. Personally, we encourage readers to keep an open mind around after-death communication experiences and to follow the evidence where it leads. Science is an empirical investigation of whatever one wishes, which does not have to be defined by societal norms or follow dogmatic beliefs about materialism . 9

ADC experiences provide us with rich opportunities to explore the nature of consciousness. The question of whether we continue to exist in some form after death has serious consequences for the way we live.

Dr. Imants Barušs / Used with permission.

This post was co-authored with Imants Barušs, Ph.D., a Professor in the Department of Psychology at King's University College at Western University. He is the author of 8 books, over 50 academic papers, and over 100 presentations at conferences and universities around the world.

Readers who are interested in learning more can read Death As An Altered State of Consciousness and Transcendent Mind: Rethinking the Science of Consciousness.

1. Barušs, I. (2023). Death as an altered state of consciousness: A scientific approach . American Psychological Association.

2. Kamp, K. S., Steffen, E. M., Alderson-Day, B., Allen, P., Austad, A., Hayes, J., Larøi, F., Ratcliffe, M., & Sabucedo, P. (2020). Sensory and quasi-sensory experiences of the deceased in bereavement: An interdisciplinary and integrative review. Schizophrenia Bulletin, 46 (6), 1367–1381.

3. Shermer, M. (2014). Infrequencies. Scientific American , 311(4).

4. Elsaesser, E., Roe, C. A., Cooper, C. E., & Lorimer, D. (2021). The phenomenology and impact of hallucinations concerning the deceased. BJPsych Open , 7 (5), Article e148.

5. Penberthy, J. K., Pehlivanova, M., Kalelioglu, T., Roe, C. A., Cooper, C. E., Lorimer, D., & Elsaesser, E. (2023). Factors moderating the impact of after death communications on beliefs and spirituality. OMEGA - Journal of Death and Dying , 87 (3), 884–901.

6. Klass D., & Steffen E. (2018). Introduction: Continuing bonds—20 years on. In D. Klass, & E. Steffen (Eds.), Continuing bonds in bereavement: New directions for research and practice (pp. 1–14). Routledge.

7. Shelvock, M., Kinsella, E. A., & Harris, D. (2022). Beyond the corporatization of death systems: Towards green death practices. Illness, Crisis, and Loss , 30 (4), 640-658.

8. Jahn, D. R., & Spencer-Thomas, S. (2018). A qualitative examination of continuing bonds through spiritual experiences in individuals bereaved by suicide. Religions , 9 (8), 248-.

9. Barušs, I., & Mossbridge, J. (2017). Transcendent mind: Rethinking the science of consciousness . American Psychological Association.

Mark Shelvock RP(Q), CT, MACP, MA

Mark Shelvock, MACP, is a licensed psychotherapist in Canada and a Certified Thanatologist with the Association for Death Education and Counselling. He also teaches in the Grief Education Program at OISE, University of Toronto.

  • Find a Therapist
  • Find a Treatment Center
  • Find a Psychiatrist
  • Find a Support Group
  • Find Online Therapy
  • United States
  • Brooklyn, NY
  • Chicago, IL
  • Houston, TX
  • Los Angeles, CA
  • New York, NY
  • Portland, OR
  • San Diego, CA
  • San Francisco, CA
  • Seattle, WA
  • Washington, DC
  • Asperger's
  • Bipolar Disorder
  • Chronic Pain
  • Eating Disorders
  • Passive Aggression
  • Personality
  • Goal Setting
  • Positive Psychology
  • Stopping Smoking
  • Low Sexual Desire
  • Relationships
  • Child Development
  • Self Tests NEW
  • Therapy Center
  • Diagnosis Dictionary
  • Types of Therapy

July 2024 magazine cover

Sticking up for yourself is no easy task. But there are concrete skills you can use to hone your assertiveness and advocate for yourself.

  • Emotional Intelligence
  • Gaslighting
  • Affective Forecasting
  • Neuroscience

Physical Symbol System Hypothesis

  • Living reference work entry
  • First Online: 08 June 2024
  • Cite this living reference work entry

equivalence hypothesis in psychology

  • Jing Qicheng 2 &
  • Zhang Kan 3  

The Physical Symbol System Hypothesis is an artificial intelligence hypothesis used to simulate human cognitive activities with computers. It was proposed by American researchers Allen Newell and Herbert A. Simon. The hypothesis posits that a computer is a physical symbol system, capable of manipulating symbols. In other words, anything that can be represented by symbols can be processed by a computer. Representations and concepts in the human brain, along with their information processing, are also regarded as symbol manipulation. Therefore, the human brain can also be viewed as a physical symbol system. Through this hypothesis, previously considered abstract psychological activities within the brain can be explored using the same models employed by computer systems that process physical symbols.

A physical symbol system has six functions: (1) Input of symbols: Computers input symbols through methods like punched cards or keyboards, while humans input symbols through sensory organs...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Further Reading

Peng D-L (2004) General psychology. Beijing Normal University Publishing House, Beijing

Google Scholar  

Download references

Author information

Authors and affiliations.

School of Psychological and Cognitive Sciences, Peking University, Beijing, China

Jing Qicheng

Institute of Psychology, Chinese Academy of Sciences (CAS), Beijing, China

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Zhang Kan .

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this entry

Cite this entry.

Qicheng, J., Kan, Z. (2024). Physical Symbol System Hypothesis. In: The ECPH Encyclopedia of Psychology. Springer, Singapore. https://doi.org/10.1007/978-981-99-6000-2_992-1

Download citation

DOI : https://doi.org/10.1007/978-981-99-6000-2_992-1

Received : 03 May 2024

Accepted : 10 May 2024

Published : 08 June 2024

Publisher Name : Springer, Singapore

Print ISBN : 978-981-99-6000-2

Online ISBN : 978-981-99-6000-2

eBook Packages : Springer Reference Behavioral Science and Psychology Reference Module Humanities and Social Sciences Reference Module Business, Economics and Social Sciences

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

IMAGES

  1. Equivalence Testing for Psychological Research: A Tutorial

    equivalence hypothesis in psychology

  2. Schematic illustration of the hypothesis of stress equivalence [25

    equivalence hypothesis in psychology

  3. Stimulus Equivalence Aba Examples

    equivalence hypothesis in psychology

  4. 13 Different Types of Hypothesis (2024)

    equivalence hypothesis in psychology

  5. Types of Hypotheses in Psychology Explained #Alevel #Revision

    equivalence hypothesis in psychology

  6. Stimulus Equivalence In ABA: Definition & Examples

    equivalence hypothesis in psychology

VIDEO

  1. Psychology textbook: Psychological Science

  2. Laws and Theories as Organizing Principles (Psychology textbook: Psychological Science)

  3. Hypothesis #psychology #science #research #scientificmethod #hypothesis #theory #psychologyvideos

  4. Psychology textbook: Psychological Science

  5. MORAL EQUIVALENCE

  6. Hypothesis Testing

COMMENTS

  1. Equivalence Testing for Psychological Research: A Tutorial

    Psychologists must be able to test both for the presence of an effect and for the absence of an effect. In addition to testing against zero, researchers can use the two one-sided tests (TOST) procedure to test for equivalence and reject the presence of a smallest effect size of interest (SESOI). The TOST procedure can be used to determine if an observed effect is surprisingly small, given that ...

  2. Equivalence Tests: A Practical Primer for t Tests, Correlations, and

    It is statistically impossible to support the hypothesis that a true effect size is exactly zero. What is possible in a frequentist hypothesis testing framework is to statistically reject effects large enough to be deemed worthwhile. When researchers want to argue for the absence of an effect that is large enough to be worthwhile to examine, they can test for equivalence (Wellek, 2010).

  3. Equivalence Testing for Psychological Research: A Tutorial

    Equivalence testing can be used to test whether an observed effect is surprisingly small, assuming that a meaningful effect exists in the population (see, e.g., Goertzen & Cribbie, 2010; Meyners, 2012; Quertemont, 2011; Rogers, Howard, & Vessey, 1993). The test is a simple variation of widely used null-hypothesis signifi- cance tests.

  4. Equivalence Tests

    There have been previous attempts to introduce equivalence testing to psychology (Quertemont, 2011; Rogers, Howard, & Vessey, ... This is comparable to the large sample sizes that are required to reject a true but small effect when the null hypothesis is a null effect. Equivalence tests require slightly larger sample sizes than traditional null ...

  5. Equivalence Tests: A Practical Primer for t Tests, Correlations, and

    This practical primer with accompanying spreadsheet and R package enables psychologists to easily perform equivalence tests (and power analyses) by setting equivalence bounds based on standardized effect sizes and provides recommendations to prespecify equivalence bounds. Extending your statistical tool kit with equivalence tests is an easy way ...

  6. PDF Equivalence Testing for Psychological Research: A Tutorial

    result would falsify this hypothesis. Equivalence testing can be used to test whether an observed effect is surprisingly small, assuming a meaningful effect exists in the population. The test is a simple variation of widely used null hypothesis significance tests (NHST). To understand the idea behind equivalence tests, it is useful

  7. Understanding Equivalence and Noninferiority Testing

    PROCEDURE. The simplest and most widely used approach to test equivalence is the two one-sided test (TOST) procedure 2.Using TOST, equivalence is established at the α significance level if a (1-2α) × 100% confidence interval for the difference in efficacies (new - current) is contained within the interval (-δ, δ) (Fig. 1).The reason the confidence interval is (1-2α) × 100% and not ...

  8. Equivalence and non-inferiority testing in psychotherapy research

    Statistical hypotheses in equivalence and non-inferiority testing. In equivalence testing, the null and alternative hypotheses of superiority testing are reversed and the statistical alternative hypothesis is consistent with the assumption of equivalence (Lesaffre, Reference Lesaffre 2008; Walker and Nowacki, Reference Walker and Nowacki 2011).To test for equivalence, two one-sided tests are ...

  9. Recommendations for applying tests of equivalence

    Researchers in psychology reliably select traditional null hypothesis significance tests (e.g., Student's t test), regardless of whether the research hypothesis relates to whether the group means are equivalent or whether the group means are different. Tests of equivalence, which have been popular i …

  10. Equivalence testing for linear regression.

    The null hypothesis for an equivalence test can, therefore, be defined as . In other words, equivalence implies that θ is small enough that any nonzero effect would be at most equal to Δ. ... Outside of psychology, there is also a growing literature on equivalence testing methods (e.g., Leday et al., ...

  11. PDF Theory and intervention equivalence and bias: New constructs ...

    STEP in the practice of psychology. Types of Equivalence and Bias Equivalence As others have claimed (van de Vijver, 2001; Ægisdóttir et al., 2008), equivalence is a significant concept ... tionand adaptation of the Contact Hypothesis (Allport, 1954) later known as the Intergroup Contact Theory (Pettigrew, 1998). This hypothesis stipulates ...

  12. Equivalent statistics for a one-sample t -test

    Equivalence of statistics. For a one-sample t -test, we have a null hypothesis, given as H0 : μ = μ0, where μ is a population mean and μ0 is a specific value, and an alternative hypothesis, denoted as HA : μ ≠ μ0. (One can also consider directional alternative hypotheses.)

  13. Cognitive Psychology

    There is an overlap between psychology and philosophy, epistemology in specific, in that both apply a rational approach and deductive reasoning to develop a theory of the nature and scope of knowledge. ... The functional equivalence hypothesis holds that, which visual imagery is not identical to perception, it is treated as functionally ...

  14. Equivalence test

    Equivalence test. Equivalence tests are a variety of hypothesis tests used to draw statistical inferences from observed data. In these tests, the null hypothesis is defined as an effect large enough to be deemed interesting, specified by an equivalence bound. The alternative hypothesis is any effect that is less extreme than said equivalence bound.

  15. (PDF) Functional equivalence or behavioural matching? A critical

    the original functional equivalence hypothesis (as postulated by Fink e, 1979, and Jeannerod, 1994) specified explicitl y that equivalence occurs at the mental representational level rather than ...

  16. Social Psychological and Equivalence Tests: A Practical Primer for

    research methods, equivalence testing, null hypothesis significance testing, power analysis Scientists should be able to provide support for the null hypothesis. A limitation of the widespread use of traditional significance tests, where the null hypothesis is that the true effect size is zero, is that the absence of an effect can be

  17. Noninferiority and Equivalence Designs: Issues and Implications for

    Equivalence designs, a two-sided test, pose a similar question, but also allow for the possibility that the novel intervention is no better than the standard one. ... Both confidence interval and hypothesis testing approaches will be used in the final analysis. ... Journal of Consulting and Clinical Psychology. 1992; 60:748-756. [Google Scholar]

  18. Functional equivalence or behavioural matching? A critical reflection

    Motor imagery, or the mental rehearsal of actions in the absence of physical movement, is an increasingly popular construct in fields such as neuroscience, cognitive psychology and sport psychology. Unfortunately, few models of motor imagery have been postulated to date. Nevertheless, based on the hypothesis of functional equivalence between imagery, perception and motor execution, Holmes and ...

  19. Equivalence Testing With TOSTER

    Any science that wants to be taken seriously needs to be able to provide support for the null hypothesis. I often see people switching over from frequentist statistics when effects are significant to the use of Bayes factors to be able to provide support for the null hypothesis. ... I've created my first R package, TOSTER (as in Two One-Sided ...

  20. Equivalence Tests: A Practical Primer for t Tests, Correlations, and

    A widely recommended approach within a frequentist framework is to test for equivalence. In equivalence tests, such as the two one-sided tests (TOST) procedure discussed in this article, an upper and lower equivalence bound is specified based on the smallest effect size of interest. ... Zeitschrift Für Psychologie/Journal of Psychology, 217 ...

  21. Functional equivalence revisited: Costs and benefits of priming action

    The functional equivalence (FE) hypothesis suggests motor imagery (MI) is comparable with the planning stages of action. A strong interpretation of this hypothesis suggests MI can prime subsequent actions in a way that should be indistinguishable from motor preparation (MP). Alternatively, MI could involve more richly informative motor plans than MP, producing different effects on the ...

  22. The functional equivalence hypothesis

    The functional equivalence hypothesis. This hypothesis states that there is a perceptual overlap between emotion expressions and certain trait markers, which then influences emotion communication. Specifically, Hess and colleagues propose the notion that some aspects of facial expressive behavior and morphological cues to dominance and ...

  23. Acquired Equivalence Changes Stimulus Representations

    Acquired equivalence is a paradigm in which generalization is increased between two superficially dissimilar stimuli (or antecedents) that have previously been associated with similar outcomes (or consequents). Several possible mechanisms have been proposed, including changes in stimulus representations, either in the form of added associations ...

  24. What We Know About After-Death Communication Experiences

    The Survival Hypothesis. ... a Professor in the Department of Psychology at King's University College at Western University. He is the author of 8 books, over 50 academic papers, and over 100 ...

  25. Physical Symbol System Hypothesis

    The Physical Symbol System Hypothesis is an artificial intelligence hypothesis used to simulate human cognitive activities with computers. It was proposed by American researchers Allen Newell and Herbert A. Simon. The hypothesis posits that a computer is a physical symbol system, capable of manipulating symbols.