Bayley Scales of Infant and Toddler Development

Original Editor - Romy Hageman
Top Contributors - Romy Hageman and Lauren Heydenrych

Introduction[edit | edit source]

The Bayley Scales of Infant and Toddler Development (BSID) is designed to assess and/or monitor very young children at risk of developmental delays or disorders.[1][2] It provides an indication of the child's current developmental level and can also offer a cautious prognosis regarding expected development. The BSID stands out as the most frequently utilized instrument for assessing development during the initial two years of a child's life.[3][4] The current version of the BSID is the third edition.[5] It has proven to be a reliable diagnostic method for recognizing children with early developmental delays .[6][7][8]

Objective[edit | edit source]

The primary purpose of the BSID is to evaluate various areas of a child's development to gain insight into their strengths and potential challenges.

The key objectives of using the BSID include:[1][9]

  • Early identification: Detecting developmental delays at an early age, allowing for timely intervention and support.
  • Monitoring progress: Assessing developmental progress over time, particularly for children at risk of developmental delays or disorders.
  • Research: Providing researchers and clinicians with a standardized measure for studying infant and toddler development across different populations.
  • Intervention planning: Assisting in the development of targeted intervention plans for children with identified developmental concerns.
  • Outcome measurement: Evaluating the effectiveness of interventions or treatments by comparing pre- and post-intervention developmental scores.

Intended population[edit | edit source]

Children at the age of 16 days - 42 months and 15 days.[1]

Clinical indications for administration include:[5]

  • Prenatal complications and congenital disorders: such as brain malformation and anomalies.
  • Neonatal complications: such as preterm birth and neonatal encephalopathy.
  • Neurodevelopmental disorders: such as developmental disorders and Cerebral palsy.

Target examiner population: Physical therapists, Occupational therapists, Paediatricians, Research psychologists, Speech and language therapists, Neonatologists, early childhood specialists.[1][9].

Method of use[edit | edit source]

The test BSID measures the following domains:[10][11]

  • Cognition
  • Language (receptive and expressive language abilities)
  • Motor skills (Fine and Gross motor skills)
  • Socio-emotional behavior
  • Adaptive behavior

Administration time[edit | edit source]

Time to administer the whole test[5][9]:

  • Children 0-6 months old: 30 minutes
  • Children 6-12 months old: 50 minutes
  • Children 13 months and above: 90 minutes

Time to administer only the motor skills[5]:

  • Children 0-6 months old: 10 minutes
  • Children 6-12 months old: 20 minutes
  • Children 13 months and above: 30 minutes

Administration procedure[edit | edit source]

The BSID is an observation list that must be administered in a specific order. Furthermore, administration of the BSID requires training.[1][9]

Administration process[edit | edit source]

  1. Preparation: before initiating the BSID, the therapist should familiarize themselves with the test components and ensure a suitable testing environment. This includes minimizing distractions and creating a comfortable setting to facilitate the child's engagement.[1]
  2. Introduction to tasks: Begin the assessment by introducing the tasks in a child-friendly manner, fostering a positive and cooperative atmosphere.
  3. Sequential task administration: The BSID is structured to be administered in a specific order to maintain consistency and reliability. Follow the sequence of tasks, starting with the age-appropriate entry level.[12]
  4. Observation and scoring: During task administration, keen observation is essential. Note the child's responses, behaviors and any challenges encountered.
  5. Collaboration with caregivers: Incorporate collaboration with caregivers throughout the assessment process. They can help the child to cooperate. It is important to instruct the caregiver to be encouraging, but not to interfere with their child's performance[1].
  6. Documentation and reporting: Accurate documentation, scores, and any noteworthy behaviors is crucial. The therapist should use standardized forms and procedures for consistency.


The following video is an example of an administration of the BSID:

[13]

Scoring[edit | edit source]

Scores are derived by converting raw scores from completed tasks into scale scores and composite scores. These scores are then utilized to assess the child's performance in relation to norms established based on typically developing children of the same age. The BSID provides four variants of norm-referenced scores[12]:

  • Scaled subtest scores (range 1-19)
  • Index scores (range: 40-160)
  • Percentile scores (range: 0-100)
  • Growth scores (range: 200-800)

The Bayley-III composite score is determined by comparing the child's performance to a normative age-matched sample defined by the assessment. Composite scores are calculated for the cognitive, language, and motor scales, and are interpreted as follows[14]:

  • Mean score of 100 (SD=15) at the 50th percentile signifies mid-average functioning.
  • Scores below 85 (1 SD below the mean), at the 16th percentile, indicate mild impairment of being 'at risk' of developmental delay. Monitoring is recommended, along with advice to parents on techniques to enhance development or referral to a therapist based on the level of impairment.
  • Score below 70 (2 SD below the mean), at the second percentile, indicate moderate to severe impairment.

In general, scores falling in the lowest 10th percentile indicate developmental delay. Children with scores in this range need additional assessment from the healthcare professional.[14]

Evidence[edit | edit source]

The psychometric properties of the Bayley-III are:[9][12][15]

  • Test-retest[16]:
    • 2-4 months (n=50): FM r=0.67: the correlation coefficient (r) for fine motor skills (FM) between the first and second test periods is 0.67. This indicates the extent to which children's fine motor performances are consistent in children aged 2-4 months. GM r=0.77: the correlation coefficient (r) for gross motor skills (GM) between the first and second test periods is 0.77. This indicates the extent to which children's gross motor performances are consistent in children aged 2-4 months.
    • 9-13 months (n=50): FM r=0.86: the correlation coefficient (r) for fine motor skills (FM) between the first and second test periods is 0.86. This indicates a strong level of consistency in fine motor performances in children aged 9-13 months between two measurement points. GM r=0.86: the correlation coefficient (r) for gross motor skills (GM) between the first and second test periods is 0.86. This suggests that gross motor performances in children aged 9-13 months are also remarkably consistent.
    • In both age groups, the high correlation coefficients indicate a significant level of agreement between the scores of the first and second test, indicating that the Bayley-III is reliable for measuring fine and gross motor skills in these age groups.
  • Internal consistency:
    • 1-12 months old normal population (n=1700): FM r=0.77-0.89: the correlation coefficients for fine motor (FM) range from 0.77 to 0.89. This indicates how consistent the scores on different fine motor items are within the same testing period for the normal population of 1-12 months old children. GM r=0.86-0.94: the correlation coefficients for gross motor (GM) range from 0.86 to 0.94. This indicates how consistent the scores on different gross motor items are within the same testing periode for the normal population of 1-12 months old children.
    • 1-12 months old atypical population (n=688): FM r=0.90-0.92: the correlation coefficients for fine motor (FM) range from 0.90 to 0.92. This indicates how consistent the scores on different fine motor items are within the same testing periode for the atypical population of 1-12 months old children. GM r=0.93-0.96: the correlation coefficients for gross motor (GM) range from 0.93-0.96. This indicates how consistent the scores on different gross motor items are within the same testing period for the atypical population of 1-12 months old children.
    • Overall, the high correlation coefficients in both normal and atypical populations suggest strong consistency within the fine and gross motor subscales of the Bayley-III. This indicates that the items within each subscale reliably cohere, demonstrating the consistency and stability of measurements over time.
  • Reliability: Fair[17]
  • Measurement error: Good[17]
  • Content validity: Excellent[17][18]
  • Structural validity: Good[17]
  • Criterion validity: Good[17][19]
  • Minimal detectable change: SEM gross motor subtest 0.85-1.08 of motor component =3.00-474 (24-42 months)[9]

Criticisms of the BISID-III[edit | edit source]

Criticism of the Bayley-III often revolves around the duration required to administer the test. It can sometimes take too much time for families to set aside.

The Bayley-III is also seen as a somewhat broad assessment tool because it evaluates each developmental domain on a global level. Consequently, it might overlook specific abilities and subtle deficits within particular scales.

It has been observed that the Bayley-III tends to artificially increase scores in typically developing children while simultaneously underestimating developmental delay. In comparison to the earlier Bayley-II, children tend to achieve notably higher scores in the Bayley-III. This score inflation is partly attributed to inconsistencies in the normative sample and inaccuracies in administering the tool[20].

Strengths of the BSID-III[edit | edit source]

The Bayley-III is shown to be valid and reliable in different cultures after appropriate translation considering cultural, historical and social features: Vietnam[21], Suriname[22], Egypt[23] and Russia[24].

The Bayley scales have been commonly employed in multiple countries, including the Netherlands[11], Iran[25], Australia[26], and various Asian countries[8].

References[edit | edit source]

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 Del Rosario C, Slevin M, Molloy EJ, Quigley J, Nixon E. How to use the Bayley Scales of Infant and Toddler Development. Archives of Disease in Childhood - Education & Practice. 2020; 319063
  2. Yue A, Jiang Q, Wang B, Abbey C, Medina A, Shi Y, et al. Concurrent validity of the ages and stages questionnaire and the bayley scales of infant development III in China. PLoS One. 2019:14
  3. Anderson PJ, de Luca CR, Hutchinson E, Roberts G, Doyle LW. Underestimation of the developmental delay by the new Bayley-III Scale. Arch Pediatr adole. 2010;164:352–6.
  4. Mansson J, Stjernqvist K, Serenius F, Adén U, Källen K. Agreement between Bayley-III measurements and WISC-IV measurements in typically developing children. J Psychoeduc. 2018;309.
  5. 5.0 5.1 5.2 5.3 Bayley N. Bayley Scales of infant and toddler development: administration manual. San Antonio: Harcourt Assessment; 2006.
  6. Harris SR, Megens AM, Backman CL, Hayes VE. Stability of the Bayley II Scales of Infant Development in a sample of low-risk and high-risk infants. Developmental Medicine & Child Neurology. 2005; 47(12):820-3
  7. Wu YT, Tsou KI, Hsu CH, Fang LJ, Yao G, Jeng SF. Brief report: Taiwanese infants mental and motor development - 6-24 months. Journal of Pediatric Psychology. 2008;33(1):102-8
  8. 8.0 8.1 Godamunne P, Liyanage C, Wimaladharmasooriya N, Pathmeswaran A, Wickremasinghe AR, Patterson C, et al. Comparison of performance of Sri Lankan and US children on cognitive and motor scales of the Bayley scales of infant development. BCM research notes. 2014; 7(1):1-5
  9. 9.0 9.1 9.2 9.3 9.4 9.5 Griffiths A, Toovey R, Morgan PE, Spittle AJ. Psychometric properties of gross motor assessment tools for children: a systematic review. BJM Open. 2018;8(10)
  10. Soleimani F, Azari N, Vameghi R, Barekati SH, Lomejad H, Kraskian A. Standardization of the Bayley Scales of Infant and Toddler Development for Persian Children. Archives of Rehabilitation. 2022; 23(1):8-31
  11. 11.0 11.1 Steenis LJ, Verhoeven M, Hessen DJ, Van Baar AL. Performance of Dutch children on the Bayley III: a comparison study of US and Dutch norms. PloS one. 2015;10(8)
  12. 12.0 12.1 12.2 Spittle AJ, Doyle LW, Boyd RN. A systematic review of the clinimetric properties of neuromotor assessments for preterm infants during the first year of life. Dev Med Child Neurol. 2008:50(4):254-66
  13. Dr Heather Johnson. Part 1 Bayley Infant &Toddler III Screen. Available from: https://www.youtube.com/watch?v=_njdDWG8AuY
  14. 14.0 14.1 Johnson S, Moore T, Marlow N. Using the Bayley-III to assess neurodevelopmental delay: which cut-off should be used? Pediatr Res. 2014:75:670-4
  15. Morgan C, Honan I, Allsop A, Novak I, Badawi N. Psychometric Properties of Assessments of Cognition in Infants With Cerebral Palsy or Motor Impairment: A systematic review. J Pediatr Psychol. 2019;44(2):238-252
  16. Bayley N. Bayley Scales of Infant and Toddler Development. 3rd ed. San Antonio. Harcourt Assessment, 2005.
  17. 17.0 17.1 17.2 17.3 17.4 Bayley N. Bayley scales of infant development and toddler development: technical manual. The PsychCorp, 2006.
  18. Visser L, Ruiter SA, Van der Meulen BF, et al. Low verbal assessment with the Bayley-III. Res Dev Disabil. 2015;36C:230-43
  19. Spittle AJ, Spencer-Smith MM, Eeles AL, et al. Does the Bayley-III Motor Scale at 2 years predict motor outcome at 4 years in very preterm children? Dev Med Child Neurol. 2012;55:448-52
  20. Anderson PJ, De Luca CR, Hutchinson E, Et al. Underestimation of the developmental delay by the new Bayley-III scale. Arch Pediatr Adolesc Med. 2010; 164:352-6
  21. Sun L, Sabanathan S, Thanh PN, Kim A, Doa TTM, Thwaites CL, et al. Bayley III in Vietnamese children: lessons for cross-cultural comparisons. Wellcome Open Res. 2019:4:98
  22. McLester-Davis LWY, Shankar A, Kataria LA, Hidalgo AG, van Eer ED, Koendjbiharie AP, et al. Validity, reliability, and transcultural adaptations of the bayley scales of infant and toddler development (BSID-III-NL) for children in Suriname. Early Hum. Dev. 2021;160:105416
  23. Salah El-Din EM, Monir ZM, Shehata MA, Abouelnaga MW, Abushady MM, Youssef MM, et al. A comparison of the performance of normal middle social class Egyptian infants and toddlers with the reference norms of the bayley scales-(Bayley III): a pilot study. PloS One. 2021;16
  24. Pavlova P, Maksimov D, Chegodaev D, Kiselev S. A psychometric study of the Russian-language version of the Bayley Scales of Infant and Toddler Development - third edition: An assessment of reliability and validity. Frontiers in Psychology. 2022; 13.
  25. Azari N, Soleimani F, Vameghi R, Sajedi F, Shahshahani S, Karimi H, et al. A psychometric study of the Bayley scales of infant and toddler development in Persian language children. Iranian journal of child neurology. 2017;11(1):50
  26. Msall ME. Measuring outcomes after extreme prematurity with the Bayley-III Scales of infant and toddler development: a cautionary tale from Australia. Archives of pediatrics & adolescent medicine. 2010;164(4):391-3