K Number
K242800
Date Cleared
2024-11-15

(59 days)

Product Code
Regulation Number
892.1550
Panel
RA
Reference & Predicate Devices
AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
Intended Use

The intended use of the 5000 Compact series ultrasound systems is diagnostic ultrasound imaging and fluid flow analysis of the human body with the following Indications for Use:Abdominal, Cardiac Adult, Cardiac Pediatric, Carotid, Cerebral Vascular, Cephalic (Adult),Cephalic (Neonatal), Fetal Echo, Fetal/Obstetric, Gynecological, Intraoperative (Vascular), Lung, Musculoskeletal (Conventional), Musculoskeletal (Superficial), Ophthalmic, Pediatric, Peripheral Vessel, Small Parts, Transesophageal (Cardiac), Transrectal, Transvaginal, and Urology.

Device Description

The purpose of this Traditional 510(k) Pre-Market Notification is to introduce the Auto Measure Artificial Intelligence-Machine Learning software feature onto the 5000 Compact Series Ultrasound Systems.

The Auto Measure feature utilizes machine learning to provide a subset of semi-automated and editable measures during an echocardiography or when reviewing an already acquired echocardiography. When Auto Measure Version 2 is enabled, the healthcare professional performs an echocardiography with a workflow that provides the user with a semi-automated measurement that can be edited, accepted, or rejected.

Philips has designed Auto Measure as a "locked" algorithm prior to marketing. As defined by FDA in the discussion paper Proposed Requlatory Framework for Modifications to AI/ML Based Software as a Medical Device (SaMD) published April 2, 2019, this "locked" algorithm provides the same result each time the same input is applied to it and does not change with use.

The Auto Measure software feature does not introduce new modes, presets, measurements, or system components (e.g. transducers) to the Philips 5000 Compact Series Ultrasound Systems K222648.

No hardware changes to the 5000 Compact Series Ultrasound Systems K222648 are required when using the Auto Measure feature, and existing, commercialized Philips transducers are used for the Auto Measure feature.

5000 Compact Series Ultrasound Systems are part of the VM platform product family, The Auto Measure Version 1 feature was originally cleared (K211597) on EPIQ and Affiniti models running software version 9.0 (VM9.0). The Auto Measure feature is also available to all software releases following VM9.0.

Since the initial Auto Measure feature initial clearance (Version 1.0), a subset of integrated measurement detectors has been trained with additional training data in the Auto Measure feature (Version 2) which is scope of this submission

Auto Measure for this premarket notification utilizes the same software version platform VM as the Reference Device, Affiniti Diagnostic Ultrasound Systems K211597.

AI/ML Overview

This document describes the Philips 5000 Compact Series Ultrasound Systems with the new Auto Measure Version 2 AI-Machine Learning software feature. The information provided focuses on the performance data to demonstrate substantial equivalence.

1. Table of Acceptance Criteria and Reported Device Performance

The acceptance criteria for each measurement detector are based on the Checked Limits of Agreement (LoAs), derived from clinical literature on human interobserver variability. The reported device performance is the Measured Limits of Agreement (LoA) between the detector's prediction and the manual ground truth. Each detector met its specified acceptance criteria.

ParameterDetectorN (Sample Size)Checker Limits of Agreement (LoAs)Measured Limits of Agreement (LoA) (detector prediction vs manual ground truth)
Ao Sinus DiameterDETECTOR_ID_BMOD E_AO_AOSV308[-35.0%, 35.0%][-11.0022%, 11.2816%]
Ao STJ DiameterDETECTOR_ID_BMOD E_AO_AOSTJ301[-35.0%, 35.0%][-11.4181%, 13.1139%]
Asc Ao DiameterDETECTOR_ID_BMOD E_AO_AOASC204[-35.0%, 35.0%][-14.7794%, 15.9598%]
IVSdDETECTOR_ID_BMOD E_LV_LVDISTANCE_S AME_LINE305[-35.0%, 35.0%][-33.2114%, 28.1632%]
LVIDdDETECTOR_ID_BMOD E_LV_LVDISTANCE_S AME_LINE457[-35.0%, 35.0%][-14.1564%, 12.4223%]
LVIDsDETECTOR_ID_BMOD E_LV_LVID_ES469[-35.0%, 35.0%][-23.4752%, 26.0242%]
LVOT DiameterDETECTOR_ID_BMOD E_LV_LVOT453[-35.0%, 35.0%][-17.729%, 16.1588%]
LVPWdDETECTOR_ID_BMOD E_LV_LVDISTANCE_S AME_LINE305[-35.0%, 35.0%][-33.1364%, 29.9544%]
RV BaseDETECTOR_ID_BMOD E_RV_RVD_BASE302[-35.0%, 35.0%][-16.1373%, 25.9079%]
RV MidDETECTOR_ID_BMOD E_RV_RVD_MID243[-35.0%, 35.0%][-25.0913%, 30.2573%]
RV LengthDETECTOR_ID_BMOD E_RV_RVL117[-35.0%, 35.0%][-14.6089%, 13.3871%]
TV AnnulusDETECTOR_ID_BMOD E_RV_TVANN53[-35.0%, 35.0%][-19.3628%, 18.0347%]
MV Decel. TimeDETECTOR_ID_DOPPLER_MV_DECEL_E_DURATION136[-25.0%, 25.0%][-23.5717%, 22.8591%]
MV Peak A VelDETECTOR_ID_DOPPLER_MV_VMAX_A_VELOCITY229[-24.0%, 24.0%][-12.3694%, 15.2363%]
MV Peak E VelDETECTOR_ID_DOPPLER_MV_VMAX_E_VELOCITY136[-24.0%, 24.0%][-9.2081%, 9.1223%]
AV VTIDETECTOR_ID_DOPPLER_AV_VTI247[-22.0%, 22.0%][-21.5393%, 19.8925%]
LVOT VTIDETECTOR_ID_DOPPLER_LVOT_VTI234[-22.0%, 22.0%][-17.267%, 17.2093%]
PV VTIDETECTOR_ID_DOPPLER_PV_VTI66[-22.0%, 22.0%][-20.6140%, 21.0567%]

2. Sample Sizes Used for the Test Set and Data Provenance

The test set consisted of a total of 500 studies, with one study per subject. This included 200 known normal subjects and 300 patients with a confirmed pathology.

The data provenance is described as:

  • Anonymized transthoracic echocardiography DICOM data and metadata.
  • Data recorded by multiple sonographers and physicians qualified in echocardiography.
  • Large sample of adults of various ethnicities.
  • Echocardiographic recordings acquired according to guideline-standard echocardiographic procedures between 2010 and 2021.
  • Data collected from 20 centers and 16 countries.

The number of studies used for validation (N) per specific detector varied, as not all measurements are performed in all studies, ranging from 53 to 469 as shown in the table above.

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts

The ground truth for the test set was established by clinical experts during routine care. While the exact number is not specified for the test set's ground truth, it is stated that "All detector models were trained from the ground up... Training of the detectors was based on manual measurements performed by human experts done for diagnostic purpose." This implies the ground truth for both training and testing was established by qualified human experts performing manual measurements for diagnostic purposes. The data was collected by "multiple sonographers and physicians qualified in echocardiography."

4. Adjudication Method for the Test Set

The adjudication method is not explicitly described as a formal 'X+Y' consensus method. The document states that the Auto Measure results were "compared to ground truth measurements established by clinical experts during routine care." This suggests that the established clinical measurements served as the ground truth directly for comparison, rather than a separate adjudication process of multiple readers for the test set.

5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done, and the Effect Size of How Much Human Readers Improve with AI vs. Without AI Assistance

A Multi-Reader Multi-Case (MRMC) comparative effectiveness study was not explicitly detailed in this summary. The study focuses on evaluating the standalone performance of the Auto Measure feature against expert manual measurements, using human-human interobserver variability data from clinical studies as a benchmark for acceptance criteria, not for direct comparison of human performance with and without AI assistance. The document mentions that the feature is a "workflow improvement" and that "the operator is responsible for the final result and has to apply manual edits to the automated output whenever required."

6. If a Standalone (Algorithm Only Without Human-in-the-Loop Performance) Was Done

Yes, a standalone performance study was done. The "Detector performance results" table directly compares the "detector prediction" against "manual ground truth." The text further clarifies: "Auto Measure analyzed a set of new, previously images, and its automated results were compared to ground truth measurements established by clinical experts during routine care. Both the manual and automated measurements were performed on the same images without any adjustments to the software's output or the clinical data used as ground truth." This indicates a direct evaluation of the algorithm's output without human intervention in the Auto Measure readings during the test phase.

7. The Type of Ground Truth Used

The ground truth used was expert consensus / manual measurements performed by clinical experts during routine care for diagnostic purposes. These manual measurements are considered the "current gold standard" against which the Auto Measure output is compared.

8. The Sample Size for the Training Set

The total training pool comprises more than 6000 studies.

9. How the Ground Truth for the Training Set Was Established

The ground truth for the training set was established through manual measurements performed by human experts for diagnostic purposes. The data was collected by "qualified medical [personnel] in echocardiography using TTA software (versions TTA2.31.00 and TTA2.50.00) or various Philips ultrasound systems annotation and following established guidelines (Mitchel et al 2019)." These data were then used for training and tuning in a cross-validation framework.

§ 892.1550 Ultrasonic pulsed doppler imaging system.

(a)
Identification. An ultrasonic pulsed doppler imaging system is a device that combines the features of continuous wave doppler-effect technology with pulsed-echo effect technology and is intended to determine stationary body tissue characteristics, such as depth or location of tissue interfaces or dynamic tissue characteristics such as velocity of blood or tissue motion. This generic type of device may include signal analysis and display equipment, patient and equipment supports, component parts, and accessories.(b)
Classification. Class II.