• Daily Crossword
  • Word Puzzle
  • Word Finder
  • Word of the Day
  • Synonym of the Day
  • Word of the Year
  • Language stories
  • All featured
  • Gender and sexuality
  • All pop culture
  • Writing hub
  • Grammar essentials
  • Commonly confused
  • All writing tips
  • Pop culture
  • Writing tips

Advertisement

speech sound

  • any of the set of distinctive sounds of a given language. Compare phoneme .

“Go” contains the speech sound “o.”

  • any of the sounds of the entire phonetic system of a language.

Discover More

Word history and origins.

Origin of speech sound 1

Example Sentences

It is aptly referred to as “decoding,” because it teaches beginning readers to decipher letters into speech sounds.

Hence salubris oratio means a speech sound in matter, possessing original strength; sana, a temperate and discreet speech.

By the age of two—or less—the child should be able to imitate exactly any speech-sound.

It makes our speech sound like that of a gushing school girl, to whom everything is very, awfully sweet.

It was not a speech sound, but a kind of whine, always coupled with a deep sigh.

He had the power of making his speech sound like a deep, soft music.

Speech in Linguistics

  • An Introduction to Punctuation
  • Ph.D., Rhetoric and English, University of Georgia
  • M.A., Modern English and American Literature, University of Leicester
  • B.A., English, State University of New York

In linguistics , speech is a system of  communication  that uses spoken words  (or sound symbols ). 

The study of speech sounds (or spoken language ) is the branch of linguistics known as phonetics . The study of sound changes in a language is phonology . For a discussion of speeches in rhetoric and oratory , see Speech (Rhetoric) .

Etymology:  From the Old English, "to speak"

Studying Language Without Making Judgements

  • "Many people believe that written language is more prestigious than spoken language--its form is likely to be closer to Standard English , it dominates education and is used as the language of public administration. In linguistic terms, however, neither speech nor writing can be seen as superior. Linguists are more interested in observing and describing all forms of language in use than in making social and cultural judgements with no linguistic basis." (Sara Thorne, Mastering Advanced English Language , 2nd ed. Palgrave Macmillan, 2008)

Speech Sounds and Duality

  • "The very simplest element of speech --and by 'speech' we shall henceforth mean the auditory system of speech symbolism, the flow of spoken words--is the individual sound, though, . . . the sound is not itself a simple structure but the resultant of a series of independent, yet closely correlated, adjustments in the organs of speech." ( Edward Sapir , Language: An Introduction to the Study of Speech , 1921)
  • "Human language is organized at two levels or layers simultaneously. This property is called duality (or 'double articulation'). In speech production, we have a physical level at which we can produce individual sounds, like n , b and i . As individual sounds, none of these discrete forms has any intrinsic meaning . In a particular combination such as bin , we have another level producing a meaning that is different from the meaning of the combination in nib . So, at one level, we have distinct sounds, and, at another level, we have distinct meanings. This duality of levels is, in fact, one of the most economical features of human language because, with a limited set of discrete sounds, we are capable of producing a very large number of sound combinations (e.g. words) which are distinct in meaning." (George Yule, The Study of Language , 3rd ed. Cambridge University Press, 2006)

Approaches to Speech

  • "Once we decide to begin an analysis of speech , we can approach it on various levels. At one level, speech is a matter of anatomy and physiology: we can study organs such as tongue and larynx in the production of speech. Taking another perspective, we can focus on the speech sounds produced by these organs--the units that we commonly try to identify by letters , such as a 'b-sound' or an 'm-sound.' But speech is also transmitted as sound waves, which means that we can also investigate the properties of the sound waves themselves. Taking yet another approach, the term 'sounds' is a reminder that speech is intended to be heard or perceived and that it is therefore possible to focus on the way in which a listener analyzes or processes a sound wave." (J. E. Clark and C. Yallop, An Introduction to Phonetics and Phonology . Wiley-Blackwell, 1995)

Parallel Transmission

  • "Because so much of our lives in a literate society has been spent dealing with speech recorded as letters and text in which spaces do separate letters and words, it can be extremely difficult to understand that spoken language simply does not have this characteristic. . . . [A]lthough we write, perceive, and (to a degree) cognitively process speech linearly--one sound followed by another--the actual sensory signal our ear encounters is not composed of discretely separated bits. This is an amazing aspect of our linguistic abilities, but on further thought one can see that it is a very useful one. The fact that speech can encode and transmit information about multiple linguistic events in parallel means that the speech signal is a very efficient and optimized way of encoding and sending information between individuals. This property of speech has been called parallel transmission ." (Dani Byrd and Toben H. Mintz, Discovering Speech, Words, and Mind . Wiley-Blackwell, 2010)

Oliver Goldsmith on the True Nature of Speech

  • "It is usually said by grammarians , that the use of language is to express our wants and desires; but men who know the world hold, and I think with some show of reason, that he who best knows how to keep his necessities private is the most likely person to have them redressed; and that the true use of speech is not so much to express our wants, as to conceal them." (Oliver Goldsmith, "On the Use of Language." The Bee , October 20, 1759)

Pronunciation: SPEECH

  • Duality of Patterning in Language
  • Phonology: Definition and Observations
  • What Is Phonetics?
  • Definition and Examples of Productivity in Language
  • Spoken English
  • Definition of Voice in Phonetics and Phonology
  • Phonological Segments
  • What Are Utterances in English (Speech)?
  • Sound Symbolism in English: Definition and Examples
  • Grapheme: Letters, Punctuation, and More
  • What Is a Phoneme?
  • Phoneme vs. Minimal Pair in English Phonetics
  • Connected Speech
  • What Is Graphemics? Definition and Examples
  • Assimilation in Speech
  • 10 Titillating Types of Sound Effects in Language

Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.

3 Chapter 3: Phonetics (The Sounds of Speech)

Learning outcomes.

After studying this chapter, you should be able to discuss:

  • the three aspects of speech that make up phonetics
  • the International Phonetic Alphabet
  • the theories of speech perception

Introduction

Think about how you might describe the pronunciation of the English word cat (cat pictured below). If you had to tell someone what’s the first sound of the word, what would you tell them?

image of a cat

Read this poem out loud:

“Hints on pronunciation for foreigners ”

by Anonymous (see note)

I take it you already know of tough and bough and cough and dough. Others may stumble, but not you, On hiccough, thorough, lough* and through. Well done! And now you wish, perhaps, To learn of less familiar traps.

Beware of heard, a dreadful word That looks like beard and sounds like bird. And dead-it’s said like bed, not bead. For goodness sake, don’t call it deed! Watch out for meat and great and threat. They rhyme with suite and straight and debt.

A moth is not a moth in mother, Nor both in bother, broth in brother, And here is not a match for there, Nor dear and fear for pear and bear. And then there’s dose and rose and lose Just look them up–and goose and choose. And cork and work and card and ward. And font and front and word and sword. And do and go, then thwart and cart. Come, come I’ve hardly made a start.

A dreadful language? Man alive, I’d mastered it when I was five!

*Laugh has been changed to lough, which is pronounced “lock” and is suggested as the original spelling here.

NOTE: For interesting tidbits about the origins of this poem, see the comments on this blog post .

What does this poem show us about the need for phonetic description?

In addition to variations of pronunciation of the same groupings of letters, the same word, with the same spelling, can have different pronunciations by different people (for example, because of different dialects). The point here is that English spelling does not clearly and consistently represent the sounds of language. In fact, English writing is infamous for that problem.There are lots of subtle differences between pronunciations, and these can’t always be explained with traditional casual ways of explanation. We need a very systematic and formal way to describe how every sound is made. That’s what phonetics is for.

What is Phonetics?

Phonetics looks at human speech from three distinct but interdependent viewpoints:

  • Articulatory phonetics (The  Production of Speech)…studies how speech sounds are produced.
  • Auditory phonetics (The  Perception of Speech)…studies the way in which humans perceive sounds.
  • Acoustic phonetics (The  Physics of Speech)…studies the physical properties of speech sounds.

Articulatory Phonetics

When people play a clarinet or similar instrument, they can make different sounds by closing the tube in different places or different ways. Human speech works the same way: we make sound by blowing air through our tube (from the lungs, up the throat, and out the mouth and/or nose), and we change sounds by changing the way the air flows and/or closing the tube in different places or in different ways.

There are three basic ways we can change a sound, and they correspond to three basic phonetic “features”. (The way I am categorizing features here may be different than what’s presented in some readings; there are lots of different theories about how to organize phonetic features.) They are as follows:

  • We can change the way the air comes out of our lungs in the first place, by letting our vocal folds vibrate or not vibrate. This is called voicing . (Voicing is also closely related to aspiration , although they are realized in different ways. The complex relationship between voicing and aspiration is beyond the scope of this subject; for our purposes, you can just treat them as the same thing, and you can use the terms “voiced” and “unaspirated” interchangeably, and use the terms “voiceless” and “aspirated” interchangeably.)
  • We can change the way that we close the tube—for example, completely closing the tube will create a one kind of sound, whereas than just narrowing it a little to make the air hiss will create a different kind of sound. This aspect of how we make sound is called manner of articulation .
  • We can change the place that we close the tube — for example, putting our two lips together creates a “closure” further up the tube than touching our tongue to the top of our mouth does. This aspect of how we make sound is called  place of articulation (“articulation” means movement, and we close our tube by moving something—moving the lips to touch each other, moving the tip of the tongue to touch the top of the mouth, etc.—, so “place of articulation” means “the place that you move to close your mouth).

Articulatory phonetics investigates how speech sounds are produced. This involves some basic understanding of

  • The anatomy of speech i.e. the lungs, the larynx and the vocal tract;
  • Airstream mechanisms, that is, the mechanisms involved in initiating and producing the types of airstreams used for speech.

Adopting anatomical and physiological criteria, phoneticians define segmental (i.e. the sounds of speech) and suprasegmental (e.g. tonal phenomena).

The Anatomy of Speech

Three central mechanisms are responsible for the production of speech:

  • Respiration: The lungs produce the necessary energy in form of a stream of air.
  • Phonation: The larynx serves as a modifier to the airstream and is responsible for phonation.
  • Articulation: The vocal tract modifies and modulates the airstream by means of several articulators.

Respiration

Before any sound can be produced at all, there has to be some form of energy. In speech, the energy takes the form of a stream of air normally coming from the lungs. Lung air is referred to as pulmonic air.

The respiratory system is used in normal breathing and in speech and is contained within the chest or thorax. Within the thoracic cavity are the lungs, which provide the reservoir for pulmonic airflow in speech.

The lungs are connected to the trachea, by two bronchial tubes which join at the base of the trachea. At the lower end of the thoracic cavity we find the dome-shaped diaphragm which is responsible for thoracic volume changes during respiration. The diaphragm separates the lungs from the abdominal cavity and lower organs.

The larynx consists of a number of cartilages which are interconnected by complex joints and move about these joints by means of muscular and ligamental force. The larynx has several functions:

  • the protective function
  • the respiratory function
  • the function in speech

The primary biological function of the larynx is to act as a valve, by closing off air from the lungs or preventing foreign substances from entering the trachea. The principal example of this protective function of the larynx is the glottal closure, during which the laryngeal musculature closes the airway while swallowing.

During respiration, the larynx controls the air-flow from subglottal to supraglottal regions. Normally, humans breathe about 15 times per minute (2 sec. inhaling, 2 sec. exhaling). Breathing for speech has a different pattern than normal breathing. Speaking may require a deeper, more full breath than regular inhalation and such inhalation would be done at different intervals. A speaker’s breathing rate is no longer a regular pattern of fifteen to twenty breaths a minute, but rather is sporadic and irregular, with quick inhalation and a long, drawn out, controlled exhalation (exhaling can last 10 to 15 seconds).In speech production, the larynx modifies the air-flow from the lungs in such a way as to produce an acoustic signal. The result are various types of phonation.

  • Normal voice

The most important effect of vocal fold action is the production of audible vibration – a buzzing sound, known as voice or vibration. Each pulse of vibration represents a single opening and closing movement of the vocal folds. The number of cycles per second depends on age and sex. Average male voices vibrate at 120 cycles per second, women’s voices average 220 cycles per second.

Articulation

Once the air passes through the trachea and the glottis, it enters a long tubular structure known as the vocal tract. Here, the airstream is affected by the action of several mobile organs, the active articulators. Active articulators include the lower lip, tongue, and glottis. They are actively involved in the production of speech sounds.

image showing the articulators

The active articulators are supported by a number of passive articulators, i.e. by specific organs or locations in the vocal tract which are involved in the production of speech sounds but do not move. These passive articulators include the palate, alveola, ridge, upper and lower teeth, nasal cavity, velum, pharynx, epiglottis, and trachea.

The production of speech sounds through these organs is referred to as articulation .

Articulation of Consonants in North American English

Introduction to Articulatory Phonetics licensed CC BY .

Articulation of Vowel Sounds in North American English

International Phonetic Alphabet (IPA)

The International Phonetic Alphabet ( IPA ) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation of speech sounds in written form. The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators and translators.

The IPA is designed to represent those qualities of speech that are part of sounds in oral language: phones, phonemes, intonation, and the separation of words and syllables.

IPA symbols are composed of one or more elements of two basic types, letters and diacritics. For example, the sound of the English letter ⟨t⟩ may be transcribed in IPA with a single letter, [t] , or with a letter plus diacritics, [t̺ʰ] , depending on how precise one wishes to be. Slashes are used to signal phonemic transcription; thus /t/ is more abstract than either [t̺ʰ] or [t] and might refer to either, depending on the context and language.

This website shows the sounds from American English represented with the IPA. In addition, you can type in any English word and get the phonetic conversion!

Suprasegmental Features

Vowels and consonants are the basic segments of speech. Together, they form syllables, larger units, and eventually utterances. Superimposed on the segments are a number of additional features known as suprasegmental or prosodic features. They do not characterize a single segment but a succession of segments. The most important suprasegmental features are:

In a spoken utterance the syllables are never produced with the same intensity. Some syllables are unstressed (weaker), others stressed (stronger).

A stressed syllable is produced by an increase in respiratory activity, i.e. more air is pushed out of the lungs.

The video below suggests that chimpanzees can speak. Is that true? HInt: Think about anatomical reasons that chimpanzees may or may not be able to speak.

Auditory Phonetics

Auditory phonetics investigates the processes underlying human speech perception. The starting point for any auditory analysis of speech is the study of the human hearing system i.e. the anatomy and physiology of the ear and the brain.

Since the hearing system cannot react to all features present in a sound wave, it is essential to determine what we perceive and how we perceive it. This enormously complex field is referred to as speech perception .

This area is not only of interest to phonetics but is also the province of experimental psychology.

The Auditory System

The auditory system consists of three central components:

  • The outer ear – modifies the incoming sound signal and amplifies it at the eardrum.
  • The middle ear – improves the signal and transfers it to the inner ear.
  • The inner ear – converts the signal from mechanical vibrations into nerve impulses and transmits it to the brain via the auditory nerve.

diagram of ear

The outer ear consists of the visible part, known as the auricle or pinna, and of the interior part. The auricle helps to focus sound waves into the ear, and supports our ability to locate the source of a sound.

From here, the ear canal, a 2.5 cm long tube, leads to the eardrum.The main function of the ear canal is to filter out tiny substances that might approach the eardrum. Furthermore, it amplifies certain sound frequencies(esp.between 3, 000 and 4, 000 Hz) and protects the eardrum from changes in temperature as well as from damage.

Behind the eardrum lies the middle ear, a cavity which is filled with air via the Eustachian tube (which is linked to the back of the nose and throat).

The primary function of the middle ear is to convert the sound vibrations at the eardrum into mechanical movements. This is achieved by a system of three small bones, known as the auditory ossicles. They are named after their shape:

  • the malleus (hammer)
  • the incus (anvil)
  • the stapes (stirrup)

When the eardrum vibrates due to the varying air pressure caused by the sound waves, it causes the three small bones, the so-called ossicles, to move back and forth. These three bones transmit the vibrations to the membrane-covered oval opening of the inner ear. Together, the ossicles function as a kind of leverage system, amplifying the vibrations by a factor of over 30 dB by the time they reach the inner ear.

The inner ear contains the vestibular organ with the semi-circular canals, which control our sense of balance, and the cochlea, a coiled cavity about 35 mm long, resembling a snail’s shell. The cochlea is responsible for converting sounds which enter the ear canal, from mechanical vibrations into electrical signals. The mechanical vibrations are transmitted to the oval window of the inner ear via the stapes (stirrup). The conversion process, known as transduction, is performed by specialized sensory cells within the cochlea. The electrical signals, which code the sound’s characteristics, are carried to the brain by the auditory nerve.

The cochlea is divided into three chambers by the basilar membrane. The upper chamber is the scala vestibuli and the bottom chamber is the scala tympani. They are both filled with a clear viscous fluid called perilymph. Between these two chambers is the cochlea duct, which is filled with endolymph.

On the basilar membrane rests the organ of Corti which contains a systematic arrangement of hair cells which pick up the pressure movements along the basilar membrane where different sound frequencies are mapped onto different membrane sites from apex to base.

The hair cells bend in wave-like actions in the fluid and set off nerve impulses which then pass through the auditory nerve to the hearing center of the brain. Short hair cell fibres respond to high frequencies and longer fibres respond to lower frequencies.

Speech Perception

Since the hearing system cannot react to all features present in a sound wave, it is essential to determine what we perceive and how we perceive it. This enormously complex field is referred to as speech perception. Two questions have dominated research:

  • Acoustic cues: Does the speech signal contain specific perceptual cues?
  • Theories of Speech Perception: How can the process of speech perception be modeled?

A further important issue in speech perception, which is also the province of experimental psychology, is whether it is a continuous or – as often assumed – a categorical process.

Acoustic Cues

The speech signal presents us with far more information than we need in order to recognize what is being said. Yet, our auditory system is able to focus our attention on just the relevant auditory features of the speech signal – features that have come to be known as acoustic cues:

  • Voice Onset Time (VOT): the acoustic cue for the voiceless/voiced distinction
  • F2-transition : the acoustic cue for place of articulation
  • Frequency cues

The importance of these small auditory events has led to the assumption that speech perception is by and large not a continuous process, but rather a phenomenon that can be described as discontinuous or categorical perception .

Voice Onset Time

The voice onset time (VOT) is the point when vocal fold vibration starts relative to the release of a closure, i.e.the interval of voicing prior to a voiced sound. It is crucial for us to discriminate between clusters such as [ pa ] or [ ba ]. It is a well-established fact that a gradual delay of VOT does not lead to a differentiation between voiceless and voiced consonants. Rather, a VOT-value of around 30 msecs serves as the key factor. In other words:

  • If VOT is longer than 30 msecs, we hear a voiced sound, such as [ ba ],
  • If VOT is shorter than 30 msecs, the perceptual result is [ pa ].

F2 Transition

The formant pattern of vowels in isolation differs enormously from that of vowels embedded in a consonantal context. If a consonant precedes a vowel, e.g. ka/ba/etc, the second formant (F2) seems to emerge from a certain frequency region, the so-called F2-locus. It seems that speech perception is sensitive to the transition of F2 and that F2-transition is an important cue in the perception of speech. In other words, the F2 frequency of the vowel determines whether or not the initial consonant sound is clear.

Frequency Cues

The frequency of certain parts of the sound wave helps to identify a large number of speech sounds. Fricative consonants, such as [s], for example, involve a partial closure of the vocal tract, which produces a turbulence in the air flow and results in a noisy sound without clear formant structure spreading over a broad frequency range. This friction noise is relatively unaffected by the context in which the fricative occurs and may thus serve as a nearly invariant cue for its identification.

However, the value of frequency cues is only relative since the perception of fricatives is also influenced by the fricative’s formant transitions.

Theories of Speech Perception

Speech perception begins with a highly complex, continuously varying, acoustic signal and ends with a representation of the phonological features encoded in that signal. There are two groups of theories that model this process:

  • Passive theories: This group views the listener as relatively passive and speech perception as primarily sensory. The message is filtered and mapped directly onto the acoustic-phonetic features of language.
  • Active theories: This group views the listener as more active and postulates that speech perception involves some aspects of speech production; the signal is sensed and analysed by reference to how the sounds in the signal are produced.

Passive Theories

Passive theories of speech perception emphasize the sensory side of the perceptual process and relegate the process of speech production to a minor role. They postulate the use of stored neural patterns which may be innate. Two influential passive theories have emerged:

  • The Theory of Template Matching Templates are innate recognition devices that are rudimentary at birth and tuned as language is acquired.

  • The Feature Detector Theory Feature detectors are specialized neural receptors necessary for the generation of auditory patterns.

Active Theories

Active theories assume that the process of speech perception involves some sort of internal speech production, i.e. the listener applies his articulatory knowledge when he analyzes the incoming signal. In other words: the listener acts not only when he produces speech, but also when he receives it.

Two influential active theories have emerged:

  • The Motor Theory of Perception According to the motor theory, reference to your own articulatory knowledge is manifested via direct comparison with articulatory patterns.

  • The Analysis-by-Synthesis Theory The analysis-by-synthesis theory postulates that the reference to your own articulation is via neurally generated auditory patterns.

The McGurk Effect

The McGurk effect is a perceptual phenomenon that demonstrates an interaction between hearing and vision in speech perception. The illusion occurs when the auditory component of one sound is paired with the visual component of another sound, leading to the perception of a third sound. The visual information a person gets from seeing a person speak changes the way they hear the sound. If a person is getting poor quality auditory information but good quality visual information, they may be more likely to experience the McGurk effect.

You are invited to participate in a little experiment on perception. In the video below you see a mouth speaking four items. Your tasks are the following:

  • Watch the mouth closely, but concentrate on what you hear.
  • Now close your eyes. Play the clip again.
  • What did you perceive when you saw and heard the video clip? What did you perceive when you just heard the items?

Acoustic Phonetics

Acoustic phonetics studies the physical properties of the speech signal. This includes the physical characteristics of human speech, such as frequencies, friction noise, etc.

There are numerous factors that complicate the straightforward analysis of the speech signal, for example, background noises, anatomical and physiological differences between speakers etc.

These and other aspects contributing to the overall speech signal are studied under the heading of acoustic phonetics.

Sound Waves

Sound originates from the motion or vibration of a sound source, e.g. from a tuning fork. The result of this vibration is known as a simple sound wave, which can be mathematically modeled as a sine wave. Most sources of sounds produce complex sets of vibrations. They arise from the combination of a number of simple sound waves.

Speech involves the use of complex sound waves because it results from the simultaneous use of many sound sources in the vocal tract.

The vibration of a sound source is normally intensified by the body around it. This intensification is referred to as resonance. Depending on the material and the shape of this body, several resonance frequencies are produced.

Simple sound waves are produced by a simple source, e.g. the vibration of a tuning fork. They are regular in motion and are referred to as periodic. Two properties are central to the measurement of simple sound waves: the frequency and the amplitude.

Practically every sound we hear is not a pure tone but a complex tone; its wave form is not simple but complex. Complex wave forms are synthesized from a sufficient number of simple sound waves. There are two types of complex wave forms:

  • periodic complex sound waves
  • aperiodic complex sound waves

Speech makes use of both kinds. Vowels, for example, are basically periodic, whereas consonants range from periodic to aperiodic:

  • the vowel [ a ], periodic
  • the consonant [ n ], periodic
  • the consonant [ s ], aperiodic
  • the consonant [ t ], aperiodic

The sound wave created by a sound source is referred to as the fundamental frequency or F0 (US: “F zero”).

On a musical instrument, F0 is the result of the vibration of a string or a piece of reed. In speech, it is the result of vocal fold vibration.

In both cases, F0 is a complex sound wave which is filtered (intensified and damped) by numerous parts of the resonating body. The resulting bundles of resonance frequencies or  harmonics  are multiples of F0. They are called  formants  and are numbered F1, F2 and so on.

In speech, these formants can be associated with certain parts of the vocal tract, on a musical instrument they are multiples of F0. For example, on an oboe F0 is the result of the vibration of the reed. This fundamental frequency is intensified (and damped) by the resonating body. As a result, a number of harmonics or formant frequencies are created as multiples of the frequency of F0.

Attributions:

Content adapted from the following:

VLC102 – Speech Science by Jürgen Handke, Peter Franke, Linguistic Engineering Team under CC BY 4.0

“ International Phonetic Alphabet ” licensed under CC BY SA .

“ McGurk Effect ” licensed under CC BY SA .

Introduction to Linguistics by Stephen Politzer-Ahles . CC-BY-4.0 .

More than Words: The Intersection of Language and Culture Copyright © 2022 by Dr. Karen Palmer is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.

Share This Book

Speech Sound Disorder

Reviewed by Psychology Today Staff

Speech sound disorder (SSD) encompasses a group of communication disorders in which children have persistent difficulty articulating words or sounds correctly. Speech sound production requires both the phonological knowledge of speech sounds and the ability to coordinate the jaw, tongue, and lips with breathing and vocalizing to produce meaningful sounds. Children with speech sound disorder may have difficulty with the phonological knowledge of speech sounds or the ability to coordinate the movements necessary for speech. The communication difficulties can impede the development of children by limiting their ability to effectively participate in social, academic, or occupational activities.

While some speech sound disorders stem from physical structural anomalies, such as cleft palate, others have their origin in perceptual problems such as hearing impairment. Still others, like apraxia, in which the brain does not deliver the correct movement instructions to the target muscles, arise from neurodevelopmental problems.

Most children shorten words and syllables as they’re learning to talk, but children with speech sound disorder continue this simplification process past the age when most children can produce words clearly, generally agreed to be age 7.

The DSM-5 includes the following diagnostic criteria for speech sound disorder:

  • Persistent difficulty with the production of speech sounds that interferes with the intelligibility of one's speech or prevents verbal communication
  • Limitations on communication interfere with social participation or performance at school or work
  • The symptoms begin early in life and are not attributable to other medical or neurological conditions

Developmental experts believe that approximately half of a child’s speech is intelligible by age 2 and most speech intelligible by age 4.

According to the Child Mind Institute, other potential signs of a speech sound disorder include:

  • Leaving out sounds or substituting an incorrect sound for a correct one
  • An unusually hoarse or nasal voice or sudden changes in pitch or loudness that make understanding speech more difficult
  • Running out of air while talking

Stuttering is a common example of a speech sound disorder. Another particularly common example is lisping. Depending on the severity of the lisp or stutter, children may attempt to avoid words or phrases with which they struggle due to the anxiety of anticipating their dysfluency.

A speech-language pathologist tests a child's speech for how well he or she creates sounds and inspects how the child moves his or her lips, jaw, and tongue. The pathologist may also test the child’s hearing.

Speech sound disorder is not the same as a specific language impairment. Speech sound disorder expresses itself with a delayed ability to produce speech sounds, while specific language impairment expresses itself with an inability to incorporate the structures of grammar into speech. Only about 2 percent of children with speech sound disorder also have specific language impairment.

According to the National Institute on Deafness and Other Communication Disorders, the prevalence of speech sound disorder in young children is 8 to 9 percent of the population. By first grade, roughly 5 percent of children exhibit speech sound disorder.

The cause of speech sound disorder is not well understood in many cases. Children who develop speech sound disorder often have family members with a history of speech or language disorder, suggesting a genetic component of this condition. Speech sound disorder may occur along with other delays in the use of facial musculature, such as difficulties in chewing, maintaining a closed mouth, and blowing one’s nose

Hearing impairment or deafness do often result in abnormalities of speech sounds. This does not mean someone with a hearing impairment has a neurodevelopmental speech sound disorder. For instance, a child who receives cochlear implants may be able to catch up to peers in speech sound production due to their new ability to hear others better. 

Speech sound disorder is a neurodevelopmental disorder unrelated to anxiety. The disorder could, however, lead to anxiety in children who are unable to fully express themselves. Such anxiety could be expressed as selective mutism , where children do not speak in certain circumstances due to embarrassment .

There is substantial research suggesting the heritability of speech sound disorder. Studies have demonstrated familial aggregation for SSD, where approximately 26 percent of nuclear family members and 13.6 percent of extended family members were affected in a cohort of children with SSD.

Treatment for speech sound disorder primarily consists of speech and language therapy . A speech-language pathologist typically develops a treatment plan that helps a child identify and correct the specific sounds or words they have difficulty articulating. The speech-language pathologist may show the child how to move their tongue and lips to produce sounds correctly and provide opportunities to practice these skills.

In most cases, children with speech sound disorder respond well to speech therapy; speech difficulties improve over time. The condition appears to resolve in 75 percent of children by age 6. When a language disorder is also present, however, speech sound disorder has a poorer prognosis and may be associated with learning disorders.

According to the American Speech-Language-Hearing Association, treatment is generally done with the help of a speech pathologist and focusing on: establishing a target sounds to work on producing reliably, working up from syllables to words to sentences, and, finally, stabilizing this sound production to the point where the patient is able to self-monitor and self-correct.

  • Find a Therapist
  • Find a Treatment Center
  • Find a Psychiatrist
  • Find a Support Group
  • Find Online Therapy
  • United States
  • Brooklyn, NY
  • Chicago, IL
  • Houston, TX
  • Los Angeles, CA
  • New York, NY
  • Portland, OR
  • San Diego, CA
  • San Francisco, CA
  • Seattle, WA
  • Washington, DC
  • Asperger's
  • Bipolar Disorder
  • Chronic Pain
  • Eating Disorders
  • Passive Aggression
  • Personality
  • Goal Setting
  • Positive Psychology
  • Stopping Smoking
  • Low Sexual Desire
  • Relationships
  • Child Development
  • Therapy Center NEW
  • Diagnosis Dictionary
  • Types of Therapy

May 2024 magazine cover

At any moment, someone’s aggravating behavior or our own bad luck can set us off on an emotional spiral that threatens to derail our entire day. Here’s how we can face our triggers with less reactivity so that we can get on with our lives.

  • Emotional Intelligence
  • Gaslighting
  • Affective Forecasting
  • Neuroscience
  • More from M-W
  • To save this word, you'll need to log in. Log In

Definition of sound

 (Entry 1 of 7)

Definition of sound  (Entry 2 of 7)

transitive verb

intransitive verb

Definition of sound  (Entry 3 of 7)

Definition of sound  (Entry 4 of 7)

Definition of sound  (Entry 5 of 7)

Definition of sound  (Entry 6 of 7)

Definition of sound  (Entry 7 of 7)

Did you know?

English contains several sound homographs, all with distinct histories. For example, the sound that means "something heard" descends from Latin sonus ("sound"), whereas the sound that means "to measure the depth of water" traces to Middle French sonde ("sounding line"). Another sound , as in "of sound mind and body," is the contemporary form of Old English's gesund . Gesund is related to several words in other languages, such as Old Saxon gisund ("sound"), Old Frisian sund ("fresh, unharmed, healthy"), and Gothic swinths ("sound" or "healthy"). Another relative is Old High German's gisunt ("healthy"), which led to modern German's gesund, the root of gesundheit .

  • reverberate
  • analytical
  • well-founded
  • well-grounded

healthy , sound , wholesome , robust , hale , well mean enjoying or indicative of good health.

healthy implies full strength and vigor as well as freedom from signs of disease.

sound emphasizes the absence of disease, weakness, or malfunction.

wholesome implies appearance and behavior indicating soundness and balance.

robust implies the opposite of all that is delicate or sickly.

hale applies particularly to robustness in old age.

well implies merely freedom from disease or illness.

valid , sound , cogent , convincing , telling mean having such force as to compel serious attention and usually acceptance.

valid implies being supported by objective truth or generally accepted authority.

sound implies a basis of flawless reasoning or of solid grounds.

cogent may stress either weight of sound argument and evidence or lucidity of presentation.

convincing suggests a power to overcome doubt, opposition, or reluctance to accept.

telling stresses an immediate and crucial effect striking at the heart of a matter.

Examples of sound in a Sentence

These examples are programmatically compiled from various online sources to illustrate current usage of the word 'sound.' Any opinions expressed in the examples do not represent those of Merriam-Webster or its editors. Send us feedback about these examples.

Word History

Middle English soun , from Anglo-French son, sun , from Latin sonus , from sonare to sound; akin to Old English swinn melody, Sanskrit svanati it sounds

Middle English, from Old English gesund ; akin to Old High German gisunt healthy

Middle English, from Old English sund swimming, sea & Old Norse sund swimming, strait; akin to Old English swimman to swim

Middle English, from Middle French sonder , from Old French *sonde sounding line, probably from Old English or Middle English sund- (as in Old English sundlīne sounding line) from sund sea

French sonde , from Middle French, literally, sounding line

13th century, in the meaning defined at sense 1a

13th century, in the meaning defined at transitive sense 1a

14th century, in the meaning defined above

14th century, in the meaning defined at sense 1a

15th century, in the meaning defined at transitive sense 1

1739, in the meaning defined above

Phrases Containing sound

  • (as) sound as a bell
  • break the sound barrier
  • not like the sound of (something)
  • of sound mind
  • raise / sound the alarm
  • sound and fury
  • sound - and - light show
  • sound barrier
  • sound board
  • sound check
  • sound effects
  • sound pressure
  • sound / ring / toll the death knell
  • sound / signal the retreat
  • sound sleeper
  • sound system
  • sound track
  • sound truck
  • surround sound
  • the sound barrier

Articles Related to sound

linking verbs copulas how to use

To Be or Not To Be: A Look at Other...

To Be or Not To Be: A Look at Other Linking Verbs

Watch as we go rogue.

Theme music by Joshua Stamper ©2006 New Jerusalem Music/ASCAP

Get Word of the Day delivered to your inbox!

Dictionary Entries Near sound

Cite this entry.

“Sound.” Merriam-Webster.com Dictionary , Merriam-Webster, https://www.merriam-webster.com/dictionary/sound. Accessed 16 May. 2024.

Kids Definition

Kids definition of sound.

 (Entry 1 of 6)

Kids Definition of sound  (Entry 2 of 6)

Kids Definition of sound  (Entry 3 of 6)

Kids Definition of sound  (Entry 4 of 6)

Kids Definition of sound  (Entry 5 of 6)

Kids Definition of sound  (Entry 6 of 6)

Old English gesund "free from injury or disease"

Middle English soun "a sound, something that can be heard," from early French son (same meaning), from Latin sonus "a sound" — related to unison

Old English sund "a sea, swimming"

Middle English sounden "to measure the depth of," from early French sonder (same meaning), from sonde "a line used for measuring depths"; probably of Germanic origin

Medical Definition

Medical definition of sound.

 (Entry 1 of 4)

Medical Definition of sound  (Entry 2 of 4)

Medical Definition of sound  (Entry 3 of 4)

Medical Definition of sound  (Entry 4 of 4)

Legal Definition

Legal definition of sound.

 (Entry 1 of 2)

Legal Definition of sound  (Entry 2 of 2)

More from Merriam-Webster on sound

Nglish: Translation of sound for Spanish Speakers

Britannica English: Translation of sound for Arabic Speakers

Subscribe to America's largest dictionary and get thousands more definitions and advanced search—ad free!

Play Quordle: Guess all four words in a limited number of tries.  Each of your guesses must be a real 5-letter word.

Can you solve 4 words at once?

Word of the day.

See Definitions and Examples »

Get Word of the Day daily email!

Popular in Grammar & Usage

More commonly misspelled words, your vs. you're: how to use them correctly, every letter is silent, sometimes: a-z list of examples, more commonly mispronounced words, how to use em dashes (—), en dashes (–) , and hyphens (-), popular in wordplay, birds say the darndest things, a great big list of bread words, 10 scrabble words without any vowels, 12 more bird names that sound like insults (and sometimes are), 8 uncommon words related to love, games & quizzes.

Play Blossom: Solve today's spelling word game by finding as many words as you can using just 7 letters. Longer words score more points.

U.S. flag

A .gov website belongs to an official government organization in the United States.

A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

  • Signs and Symptoms

Signs of Noise-Induced Hearing Loss

What to know.

  • Don’t wait until you show signs of hearing loss.
  • If you have any signs of or if you are at risk for noise-induced hearing loss, get your hearing tested.

woman gets ear exam

What to Look For

Prevention and early detection of noise-induced hearing loss are important. It is also important to pay attention to signs of hearing loss and to get your hearing tested. Signs include:

  • Speech and other sounds seem muffled.
  • Trouble hearing high-pitched sounds.
  • Trouble understanding conversations when in a noisy place, such as a restaurant.
  • Trouble understanding speech over the phone.
  • Trouble hearing certain sounds such as the difference between s and f in speech.
  • Asking others to speak more slowly and clearly.
  • Asking someone to speak more loudly or repeat what they said.
  • Turning up the volume of the television or radio.
  • Ringing in the ears.
  • Certain sounds are very bothersome or create pain.

Loud Noises Can Cause Hearing Loss

Learn about causes, signs, and prevention of noise-induced hearing loss.

IMAGES

  1. PPT

    definition of speech sound

  2. Speech Sounds by Age (and How to Teach Them)

    definition of speech sound

  3. Speech & sound disorders in children

    definition of speech sound

  4. SPEECH SOUNDS SYSTEM of ENGLISH CONTENT Definition

    definition of speech sound

  5. What Is Sound

    definition of speech sound

  6. PPT

    definition of speech sound

VIDEO

  1. What Did a Public Speech Sound Like in Ancient Rome

  2. Definition Speech Self-Reflection

  3. Definition Speech

  4. Definition speech 483A1FC4 6359 4055 A24D EC009D56F773

  5. Self Definition #shorts #jordanpeterson

  6. What is the Speech Sound Pics SSP Approach ? Speech to Print Phonics with Miss Emma

COMMENTS

  1. SPEECH SOUND Definition & Meaning

    Speech sound definition: any of the set of distinctive sounds of a given language.. See examples of SPEECH SOUND used in a sentence.

  2. Phoneme

    phoneme, in linguistics, smallest unit of speech distinguishing one word (or word element) from another, as the element p in "tap," which separates that word from "tab," "tag," and "tan." A phoneme may have more than one variant, called an allophone (q.v.), which functions as a single sound; for example, the p's of "pat," "spat," and "tap" differ slightly ...

  3. Phonetics

    phonetics, the study of speech sounds and their physiological production and acoustic qualities. It deals with the configurations of the vocal tract used to produce speech sounds (articulatory phonetics), the acoustic properties of speech sounds (acoustic phonetics), and the manner of combining sounds so as to make syllables, words, and ...

  4. Phonology

    phonology, study of the sound patterns that occur within languages. Some linguists include phonetics, the study of the production and description of speech sounds, within the study of phonology. Diachronic (historical) phonology examines and constructs theories about the changes and modifications in speech sounds and sound systems over a period ...

  5. Speech

    Speech is a human vocal communication using language.Each language uses phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if they are the same word, e.g., "role" or "hotel"), and using those words in their semantic character as words in the lexicon of a language according to the syntactic ...

  6. 1

    Most speech is produced by an air stream that originates in the lungs and is pushed upwards through the trachea (the windpipe) and the oral and nasal cavities. During its passage, the air stream is modified by the various organs of speech. Each such modification has different acoustic effects, which are used for the differentiation of sounds.

  7. Speech sound Definition & Meaning

    speech sound: [noun] any one of the smallest recurrent recognizably same constituents of spoken language produced by movement or movement and configuration of a varying number of the organs of speech in an act of ear-directed communication.

  8. Speech Sounds: Consonants (Chapter 2)

    Phonetics describes a sound in terms of the movements of the organs of speech, the physical properties of the sound produced and the features perceived by the listener. But such an exhaustive description is very complex and is beyond the scope of this book. Since our purpose is to teach the pronunciation of English, our analysis will be based ...

  9. Speech (Linguistics) Definition and Examples

    Richard Nordquist. Updated on July 03, 2019. In linguistics, speech is a system of communication that uses spoken words (or sound symbols ). The study of speech sounds (or spoken language) is the branch of linguistics known as phonetics. The study of sound changes in a language is phonology. For a discussion of speeches in rhetoric and oratory ...

  10. What Is Speech? What Is Language?

    Speech is how we say sounds and words. Speech includes: How we make speech sounds using the mouth, lips, and tongue. For example, we need to be able to say the "r" sound to say "rabbit" instead of "wabbit.". How we use our vocal folds and breath to make sounds. Our voice can be loud or soft or high- or low-pitched.

  11. Speech Sound Disorders-Articulation and Phonology

    The incidence of speech sound disorders refers to the number of new cases identified in a specified period. The prevalence of speech sound disorders refers to the number of children who are living with speech problems in a given time period.. Estimated prevalence rates of speech sound disorders vary greatly due to the inconsistent classifications of the disorders and the variance of ages studied.

  12. What are Speech Sounds?

    Speech sounds are the vocal sounds we use to make up the words of the English language. We use them every time we say a word out loud. Saying the right sounds in the right order is what allows us to communicate with other people and understand what they are saying. It can help to differentiate speech sounds from the alphabet.

  13. Chapter 3: Phonetics (The Sounds of Speech)

    Articulatory phonetics investigates how speech sounds are produced. This involves some basic understanding of. The anatomy of speech i.e. the lungs, the larynx and the vocal tract; Airstream mechanisms, that is, the mechanisms involved in initiating and producing the types of airstreams used for speech.

  14. Speech

    Speech is the faculty of producing articulated sounds, which, when blended together, form language. Human speech is served by a bellows-like respiratory activator, which furnishes the driving energy in the form of an airstream; a phonating sound generator in the larynx (low in the throat) to transform the energy; a sound-molding resonator in ...

  15. What are Speech Sounds?

    Speech sounds are the vocal sounds we use to make up the words of the English language. We use them every time we say a word out loud. Saying the right sounds in the right order is what allows us to communicate with other people and understand what they are saying. It can help to differentiate speech sounds from the alphabet.

  16. Speech Sound Disorder

    Speech sound production requires both the phonological knowledge of speech sounds and the ability to coordinate the jaw, tongue, and lips with breathing and vocalizing to produce meaningful sounds.

  17. Speech sound

    speech sound: 1 n (phonetics) an individual sound unit of speech without concern as to whether or not it is a phoneme of some language Synonyms: phone , sound Types: show 24 types... hide 24 types... phoneme (linguistics) one of a small set of speech sounds that are distinguished by the speakers of a particular language vowel , vowel sound a ...

  18. Speech Sound Disorders

    Signs and Symptoms of Speech Sound Disorders. Your child may substitute one sound for another, leave sounds out, add sounds, or change a sound. It can be hard for others to understand them. It is normal for young children to say the wrong sounds sometimes. For example, your child may make a "w" sound for an "r" and say "wabbit" for "rabbit."

  19. Pronunciation

    pronunciation, in a most inclusive sense, the form in which the elementary symbols of language, the segmental phonemes or speech sounds, appear and are arranged in patterns of pitch, loudness, and duration.In the simplest model of the communication process in language—encoding, message, decoding—pronunciation is an activity, shaping the output of the encoding stage, and a state, the ...

  20. Sound Definition & Meaning

    sound: [adjective] free from injury or disease. free from flaw, defect, or decay.

  21. Developmental Norms for Speech and Language

    Developmental Norms for Speech and Language. This is a collective resource of norms and milestones for speech-language development. SLPs are often asked questions regarding typical age of sound acquisition and development of language. This information will help to answer those questions and provides resources to share with parents and colleagues.

  22. Signs of Noise-Induced Hearing Loss

    Signs include: Speech and other sounds seem muffled. Trouble hearing high-pitched sounds. Trouble understanding conversations when in a noisy place, such as a restaurant. Trouble understanding speech over the phone. Trouble hearing certain sounds such as the difference between s and f in speech. Asking others to speak more slowly and clearly.