Icon Legend

This session is not in your schedule.

This session is in your schedule. Click again to remove it.

Presentation Icons

Additional registration fee required

Faculty have requested this content not be shared outside of the session

CME Credit Offered

Abstract Award

Recording available 2/16-5/2

AIUM Credit

Foundation Awardee

Poster Icons

Abstract Award

Foundation Awardee

Poster Session 1

Category: Digital Health Technologies (DHT)

Poster Session 1

(386) Highly Reliable Data Collection by a Large Language Model in Fetal Ultrasound Reports

Wednesday, February 11, 2026

10:30 AM - 12:00 PM

Presenting Author(s)

Thomas A. Reynolds, MFA, MBA (he/him/his)

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States

Coauthor(s)

SL

Sierra D. Land, MA

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States
JL

Jake Ligouri, MS

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States
RT

Ryan Treftz, MS

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States
NA

N Scott Adzick, MD

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States
HH

Holly L. Hedrick, MD

Richard D. Wood, Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States

Submitting Author(s)

Juliana S. Gebb, MD

Associate Professor
Richard D. Wood Jr Center for Fetal Diagnosis and Treatment, Children's Hospital of Philadelphia
Children's Hospital of Philadelphia, Pennsylvania, United States

Objective:

To develop a reliable and efficient data collection process for unstructured fetal ultrasound reports involving single- and higher-order gestations. At our institution, documentation of middle cerebral artery (MCA) Doppler often includes more than one value and range per fetus, contributing to variability in report structure and complexity. We evaluated the performance of an off-the-shelf Large Language Model (LLM) integrated into our Clinical Outcomes Data Archive (CODA) platform, aiming to reduce human effort while maintaining high data fidelity.

Study Design:

A subset of fetal ultrasound reports was processed using an LLM in CODA. For each fetus, we abstracted whether an MCA Doppler was performed and extracted the peak systolic velocity (PSV), and up to two multiple-of-the-median (MoM) values when a range was reported. All outputs were reviewed by a human validator. Discrepancies were recorded, and agreement between the LLM and final validated values were calculated for each variable.

Results:

Of 494 fetal ultrasound reports processed using an LLM in CODA, the LLM correctly identified 349 reports that had at least one MCA Doppler study. These reports included 630 fetus-level evaluations, yielding 1,249 datapoints for analysis. Among 244 studies ultimately deemed not performed, the LLM incorrectly labeled 10 as performed, though it accurately marked all three study values as missing in these cases. All three target values for the MCA Doppler parameters performed reached >99% accuracy.

Conclusion:

An off-the-shelf LLM integrated into the CODA platform demonstrated high reliability in abstracting structured data from unstructured fetal ultrasound reports. The model achieved >99% accuracy for all extracted MCA Doppler values and maintained perfect performance in marking missing data. These findings support the feasibility of LLM-assisted abstraction to reduce manual workload while preserving data quality in complex clinical documentation, including higher-order gestations.