Search Results

Aurora is a Software as a Medical Device (SaMD) that establishes sleep quality. Aurora automatically analyzes, displays, and summarizes Photoplethysmogram (PPG) data collected during sleep using compatible devices. Aurora is intended for use by and by order of a healthcare professional to aid in the diagnosis of sleep disorders including sleep apnea in adults.

The Aurora output, including automatically detected respiratory events and parameters, may be displayed and edited by a qualified healthcare professional. The Aurora output is not intended to be interpreted or clinical action taken without consultation of a qualified healthcare professional.

Aurora is not intended for use with polysomnography devices.

Device Description

Aurora is a Class II Software as a Medical Device (SaMD), intended to aid in the evaluation of sleep disorders, where it may inform or drive clinical management. Aurora is a software application that is indicated for use on a general-purpose computing platform. It is a cloud-based software-as-a-medicaldevice (SaMD) with a user interface that runs in a web browser.

Aurora automatically analyzes and displays photoplethsmography (PPG) signal data including SPO2 and pulse/heart rate only from compatible FDA-cleared medical purpose pulse oximeters that meet Aurora's data acquisition requirements for sampling rate, digital resolution, measurement range, and accuracy range.

Following upload of a compatible PPG study to the cloud software, the algorithm functions by verifying minimum signal quality, study length, and technical adequacy requirements, preprocessing the data including normalization, digital filtration, and artifact detection/rejection procedures, applying machine learning algorithms including multiple deep neural network machine learning models, statistical signal processing analyses including time-domain and time-frequency domain analyses over multiple time and resolution scales, and other analyses output a detected set of events and derived signals for the PPG study that are post-processed and logically filtered according to algorithm rules based on the American Academy of Sleep Medicine (AASM) recommended scoring event, desaturation, and association rules. Aurora algorithm outputs, including scored respiratory events, sleep stages, Aurora Apnea-Hypopnea Index (eAHI), Total Sleep Time (TST), Sleep Efficiency (SE), Sleep Latency (SL), Wake After Sleep Onset (WASO), and Oxygen Desaturation Events Index (ODI) measures, are stored and made available for display, editing, and review in Aurora by qualified healthcare professionals.

Aurora reports results of the automated data analysis based on AASM guidelines, including the Aurora output Apnea-Hypopnea Index (eAHI) and total sleep time (TST). The algorithm outputs are graphical and numerical displays and reports of sleep latency, sleep quality, and sleep pathologies including sleep disordered breathing. The Aurora displays and reports are for the order of physicians, trained technicians, or other healthcare professionals to evaluate sleep disorders where it may inform or drive clinical management taking into consideration other factors that normally are considered for clinical management of sleep disorders for adults.

The clinician can view raw data for interpretation, edit events, write clinical notes, and customize sleep reports for the patient.

Aurora output is not intended to be interpreted or clinical action taken without consultation of a qualified healthcare professional.

AI/ML Overview

The document provides detailed information about the performance evaluation of the Aurora device, a Software as a Medical Device (SaMD) intended to aid in the diagnosis of sleep disorders.

Here's a breakdown of the acceptance criteria and the study that proves the device meets them:

1. Acceptance Criteria and Reported Device Performance

The acceptance criteria for Aurora are implied by the performance metrics reported, demonstrating its accuracy in detecting Apnea Hypopnea Index (eAHI) and performing sleep staging against polyomnography (PSG) ground truth. While explicit numerical "acceptance criteria" tables are not provided, the reported sensitivity, specificity, and regression/Bland-Altman statistics serve as the evidence of meeting performance expectations for substantial equivalence.

Table of Performance Data (Implied Acceptance Criteria)

Metric	Acceptance Criteria (Implied)	Reported Device Performance (Aurora)
Apnea Hypopnea Index (eAHI) - 3% Desaturation	High Sensitivity and Specificity at AHI >= 5 cutoff, comparable to predicate.	Sensitivity: 92.6% (87.2%, 97.2%) Specificity: 71.6% (59.2%, 83.7%)
Apnea Hypopnea Index (eAHI) - 4% Desaturation	High Sensitivity and Specificity at AHI >= 5 cutoff, comparable to predicate.	Sensitivity: 89.4% (81.6%, 96.1%) Specificity: 76.8% (67.1%, 85.4%)
Sleep Staging - Wake	High Sensitivity and Specificity for Wake epoch detection.	Sensitivity: 86.7% (86.5%, 87.0%) Specificity: 93.5% (93.4%, 93.7%)
Sleep Staging - Light NREM	High Sensitivity and Specificity for Light NREM epoch detection.	Sensitivity: 80.9% (80.6%, 81.2%) Specificity: 85.5% (85.2%, 85.7%)
Sleep Staging - Deep NREM	Reasonably high Sensitivity and Specificity for Deep NREM epoch detection, balancing known challenges in this stage.	Sensitivity: 63.4% (62.4%, 64.3%) Specificity: 95.9% (95.7%, 96.0%)
Sleep Staging - REM	High Sensitivity and Specificity for REM epoch detection.	Sensitivity: 83.6% (83.0%, 84.2%) Specificity: 97.5% (97.4%, 97.5%)
Sleep Profile & Oxygen Saturation Accuracy (eAHI 3%)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 0.936 (0.853, 1.033), Intercept: 0.023 (-1.185, 1.122) Bland-Altman: Mean Difference: 1.000 (0.630, 1.367), ULOA: 14.575 (13.779, 15.363), LLOA: -12.574 (-13.371, -11.786)
Sleep Profile & Oxygen Saturation Accuracy (eAHI 4%)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 0.982 (0.903, 1.130), Intercept: 1.219 (0.116, 1.985) Bland-Altman: Mean Difference: -1.039 (-1.326, -0.749), ULOA: 9.307 (8.692, 9.931), LLOA: -11.386 (-12.001, -10.763)
Sleep Profile & Oxygen Saturation Accuracy (TST)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 1.159 (1.035, 1.318), Intercept: -0.695 (-1.576, -0.005) Bland-Altman: Mean Difference: -0.093 (-0.132, -0.059), ULOA: 1.145 (1.060, 1.216), LLOA: -1.330 (-1.414, -1.259)
Sleep Profile & Oxygen Saturation Accuracy (SE)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 1.154 (1.031, 1.317), Intercept: -0.088 (-0.205, 0.003) Bland-Altman: Mean Difference: -0.011 (-0.017, -0.007), ULOA: 0.163 (0.151, 0.173), LLOA: -0.185 (-0.198, -0.176)
Sleep Profile & Oxygen Saturation Accuracy (SL)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 1.114 (0.997, 1.290), Intercept: -0.023 (-0.185, 0.090) Bland-Altman: Mean Difference: -0.129 (-0.154, -0.089), ULOA: 0.884 (0.831, 0.970), LLOA: -1.143 (-1.196, -1.057)
Sleep Profile & Oxygen Saturation Accuracy (WASO)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 1.073 (0.938, 1.219), Intercept: -0.271 (-0.436, -0.121) Bland-Altman: Mean Difference: 0.167 (0.140, 0.196), ULOA: 1.131 (1.073, 1.193), LLOA: -0.797 (-0.855, -0.735)
Sleep Profile & Oxygen Saturation Accuracy (ODI)	Deming Regression slope near 1, intercept near 0; Bland-Altman Mean Difference near 0, narrow limits.	Deming Regression: Slope: 0.962 (0.896, 1.056), Intercept: 1.667 (0.330, 2.847) Bland-Altman: Mean Difference: -1.046 (-1.417, -0.677), ULOA: 13.223 (12.426, 14.015), LLOA: -15.315 (-16.111, -14.522)

2. Sample Size Used for the Test Set and Data Provenance

Test Set Sample Size:
- For eAHI performance (sensitivity/specificity): 158 adult patients.
- For Sleep Staging:
  - Wake: 52,622 epochs
  - Light NREM: 69,438 epochs
  - Deep NREM: 10,195 epochs
  - REM: 14,459 epochs
Data Provenance: The document does not explicitly state the country of origin but implies clinical settings where PSG (Polysomnography) and HSAT (Home Sleep Apnea Test) recordings are collected. The study involved simultaneous PSG and HSAT recordings, suggesting a prospective collection of data for testing purposes to facilitate direct comparison.

3. Number of Experts and Qualifications for Ground Truth

Number of Experts: Three registered polysomnographic technologists were used for manual scoring, and one board-certified sleep physician reviewed each PSG.
Qualifications of Experts:
- Scorers: Registered polysomnographic technologists.
- Reviewer/Confirmer: Board-certified sleep physician.

4. Adjudication Method for the Test Set

Adjudication Method: A 2+1 consensus method. For an event to be officially scored or reported, a consensus of at least two-thirds among the three scorers was required. Additionally, each PSG was reviewed by a board-certified sleep physician to provide clinical confirmation of scoring and technical adequacy, serving as a final adjudication layer.

5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

The document does not indicate that a Multi-Reader Multi-Case (MRMC) comparative effectiveness study was done to assess how much human readers improve with AI vs. without AI assistance. The study focuses on the standalone performance of the Aurora algorithm against expert-scored ground truth. The device output may be displayed and edited by a qualified healthcare professional, suggesting a human-in-the-loop workflow, but the study described does not quantify the effect of AI assistance on human reader performance.

6. Standalone Performance Study

Yes, a standalone performance study was done. The reported sensitivity, specificity, Deming regression, and Bland-Altman analyses directly evaluate the algorithm's performance (Aurora) against the expert-scored PSG as ground truth, without a human in the loop for the performance metrics themselves.

7. Type of Ground Truth Used

The type of ground truth used was expert consensus from manual scoring of Polysomnography (PSG) data. Specifically, PSG recordings were manually scored by three registered polysomnographic technologists using guidelines following the 3% desaturation guidance. This was further reviewed and confirmed by a board-certified sleep physician.

8. Sample Size for the Training Set

The document does not specify the sample size for the training set. The provided details pertain exclusively to the test set used for performance validation.

9. How the Ground Truth for the Training Set Was Established

The document does not specify how the ground truth for the training set was established. Information regarding the training data, its collection, or annotation methods is not included in this summary.

Ask a Question

Ask a specific question about this device

K Number

K210034

Device Name

EnsoSleep

Manufacturer

EnsoData, Inc.

Date Cleared

2021-06-16

(161 days)

Product Code

Regulation Number

Type

Panel

Reference & Predicate Devices

K112102,K162627

Predicate For

N/A

Intended Use

EnsoSleep is intended for use in the diagnostic evaluation by a physician to assess sleep quality and as an aid for physicians in the diagnosis of sleep disorders and respiratory related sleep disorders in pediatric as follows:

· Pediatric patients 13 years and older with polysomnography (PSG) tests obtained in a Hospital or Sleep Clinic
· Adult patients with PSGs obtained in a Hospital or Sleep Clinic
· Adult patients with Home Sleep Tests
EnsoSleep is a software-only medical device to be used under the supervision of a clinician to analyze physiological signals and automatically score sleep study results, including the staging of arousals, leg movements, and sleep disordered breathing events including obstructive apneas (OSA), central sleep apneas (CSA), and hypopneas.
All automatically scored events and physiological signals which are retrieved, analyzed, displayed, and summarized are subject to verification by a qualified clinician. Central sleep apneas (CSA) should be manually reviewed and modified as appropriate by a clinician.
All events can be manually marked or edited within records during review.
Photoplethysmography (PPG) total sleep time is not intended for use when electroencephalograph (EEG) data is recorded. PPG total sleep time is not intended to be used as the sole or primary basis for diagnosing any sleep related breathing disorder, prescribing treatment, or determining whether additional diagnostic assessment is warranted.

Device Description

EnsoSleep is a software-only medical device that analyzes previously recorded physiological signals obtained during sleep. Users of EnsoSleep are consistent with the roles required to run a sleep clinic: sleep physicians, sleep technicians, clinic operations managers, and IT administrators. EnsoSleep can analyze at-home and in-lab sleep studies for both adult and pediatric patients who are at least 13 years old. Automated algorithms are applied to the raw signals in order to derive additional signals and interpret the raw and derived signal information. The software automates recognition of the following: respiratory events, sleep staging events, arousal events, movement events, cardiac events, derived signals, and calculated indices. EnsoSleep does not interpret the results, nor does it suggest a diagnosis. The device only marks events of interest for review by a physician who is responsible for diagnoses. The device does not analyze data that are different from those analyzed by human scorers.
The signals and automated analyses can be visually inspected and edited prior to the results being integrated into a sleep study report.
The software consists of 4 major components:

The Application Platform runs on local clinic workstations and manages the detection, upload, and download of study records and scoring to and from the Storage Platform
The Processing Platform accepts raw physiological signals as inputs in order to recognize events, derive signals, and calculate indices
. The Storage Platform facilitates file and database storage in the EnsoSleep cloud through an API
The Dashboard is a web-based user interface to support configuration, clinic management, and sleep study scoring

AI/ML Overview

Here's a breakdown of the acceptance criteria and the study that proves the device meets them, based on the provided text:

1. Table of Acceptance Criteria and Reported Device Performance

The text describes that "the acceptance criteria were selected based on PA, NA, OA, MAE, Deming Regression coefficient, Bland-Altman mean difference and limits of agreement performance criteria that validate performance substantially equivalent to, or greater than, but not lesser than by more than 10% or similarly defined criteria in any category of the predicate 510(k) reported device performance across all endpoints respectively."

For detailed performance metrics, the document presents tables for specific endpoints. Below is a structured representation focusing on the key performance indicators mentioned and a comparison to the predicate device where available.

Endpoint 1: Sleep Staging Event Detection (EnsoSleep K210034 vs. Predicate K162627)

Metric	Acceptance Criteria (Conceptual)	EnsoSleep (K210034) Adult Sample (Reported)	EnsoSleep (K210034) Pediatric Sample (Reported)	Predicate (K162627) Adult Sample (Reported)
Positive Agreement (PA)	≥ Predicate PA - 10% (Ideally ≥ Predicate PA)	Wake: 93.5%	Wake: 93.1%	Wake: 86%
		N1: 37.0%	N1: 43.2%	N1: 41%
		N2: 88.3%	N2: 92.6%	N2: 77%
		N3: 80.0%	N3: 92.3%	N3: 81%
		REM: 90.9%	REM: 80.9%	REM: 79%
Negative Agreement (NA)	≥ Predicate NA - 10% (Ideally ≥ Predicate NA)	Wake: 97.2%	Wake: 99.2%	Wake: 97%
		N1: 98.3%	N1: 98.8%	N1: 94%
		N2: 89.3%	N2: 89.4%	N2: 87%
		N3: 96.3%	N3: 97.5%	N3: 93%
		REM: 99.3%	REM: 99.1%	REM: 99%
Overall Agreement (OA)	≥ Predicate OA - 10% (Ideally ≥ Predicate OA)	Wake: 96.1%	Wake: 97.9%	Wake: 94%
		N1: 95.0%	N1: 97.0%	N1: 91%
		N2: 88.8%	N2: 90.9%	N2: 83%
		N3: 95.0%	N3: 96.6%	N3: 92%
		REM: 98.3%	REM: 97.0%	REM: 96%
Total (Overall) OA (Pooled)	≥ Predicate Total OA - 10% (Ideally ≥ Predicate Total OA)	Adult: 86.6%	Pediatric: 89.7%	Adult: 78%
Conclusion on Acceptance Criteria (Endpoint 1)	All 3 EnsoSleep PA, NA, and OA point-estimates vs 2/3 Majority Scoring were observed to be greater than the predicate device PA, NA, and OA point-estimates in some events, with statistically significant results in terms of higher agreement for several stages (e.g., Adult REM, Pediatric Wake, N3, Total). None were observed 10% or lower than 2/3 Majority.

Endpoint 2: Sleep Apnea Diagnostic Agreement (Per-Patient AHI)

Metric	Acceptance Criteria (Conceptual)	EnsoSleep (K210034) Adult Sample (Reported)	EnsoSleep (K210034) Pediatric Sample (Reported)	Predicate (K162627) Adult Sample (Reported)
Positive Percent Agreement (PA)	≥ Predicate PA - 10% (Ideally ≥ Predicate PA)	AHI ≥ 5: 94.4%	AHI ≥ 1: 94.4%	AHI ≥ 5: 91%
		AHI ≥ 15: 94.0%	AHI ≥ 5: 90.5%	AHI ≥ 15: 95%
		REM AHI ≥ 5: 86.7%	AHI ≥ 10: 78.6%	REM AHI ≥ 5: 83%
		REM AHI ≥ 15: 81.5%	AHI ≥ 15: 85.7%	REM AHI ≥ 15: 79%
Negative Percent Agreement (NA)	≥ Predicate NA - 10% (Ideally ≥ Predicate NA)	AHI ≥ 5: 89.7%	AHI ≥ 1: 77.8%	AHI ≥ 5: 76%
		AHI ≥ 15: 96.3%	AHI ≥ 5: 100.0%	AHI ≥ 15: 98%
		REM AHI ≥ 5: 83.0%	AHI ≥ 10: 94.9%	REM AHI ≥ 5: 89%
		REM AHI ≥ 15: 93.3%	AHI ≥ 15: 100.0%	REM AHI ≥ 15: 96%
Overall Percent Agreement (OA)	≥ Predicate OA - 10% (Ideally ≥ Predicate OA)	AHI ≥ 5: 93.0%	AHI ≥ 1: 89.4%	AHI ≥ 5: 85%
		AHI ≥ 15: 95.0%	AHI ≥ 5: 95.7%	AHI ≥ 15: 97%
		REM AHI ≥ 5: 85.0%	AHI ≥ 10: 91.5%	REM AHI ≥ 5: 86%
		REM AHI ≥ 15: 90.0%	AHI ≥ 15: 97.9%	REM AHI ≥ 15: 92%
Conclusion on Acceptance Criteria (Endpoint 2)	EnsoSleep PA, NA, and OA point-estimates vs 2/3 Majority Scoring were observed to be greater than the predicate device for some OSA severity categories in both adult and pediatric samples. Only one instance (Pediatric AHI > 10 PA) was within 10% of the predicate. All met or exceeded objective performance goals.

Endpoint 3: Sleep Scoring Event Detection (EnsoSleep K210034 vs. Predicate K162627 and Reference K112102)

Metric	Acceptance Criteria (Conceptual)	EnsoSleep (K210034) Adult Sample (Reported)	EnsoSleep (K210034) Pediatric Sample (Reported)	Predicate (K162627) Adult Sample (Reported)	Reference (K112102) Adult Sample (Reported)
Positive Agreement (PA)	≥ Reference PA - 10% (Ideally ≥ Reference PA)	SDB: 75.4%	SDB: 72.7%	SDB: 67%	N/A (for SDB)
		Hypopnea: 66.3%	Hypopnea: 68.8%	Hypopnea: 60.3%	Hypopnea: 60.3%
		Obstructive Apnea: 74.1%	Obstructive Apnea: 45.5%	Obstructive Apnea: 53%	N/A (for Obstructive Apnea)
		Central Apnea: 65.3%	Central Apnea: 68.9%	Central Apnea: 63.8%	Central Apnea: 63.8%
		Arousal: 73.6%	Arousal: 78.6%	Arousal: 66%	N/A (for Arousal)
		Leg Movement: 82.0%	Leg Movement: 66.0%	Leg Movement: 71%	N/A (for Leg Movement)
Negative Agreement (NA)	≥ Reference NA - 10% (Ideally ≥ Reference NA)	SDB: 97.0%	SDB: 98.6%	SDB: 93%	N/A (for SDB)
		Hypopnea: 97.1%	Hypopnea: 98.9%	Hypopnea: 97.6%	Hypopnea: 97.6%
		Obstructive Apnea: 99.3%	Obstructive Apnea: 99.7%	Obstructive Apnea: 97%	N/A (for Obstructive Apnea)
		Central Apnea: 99.5%	Central Apnea: 99.7%	Central Apnea: 99.6%	Central Apnea: 99.6%
		Arousal: 95.6%	Arousal: 97.0%	Arousal: 90%	N/A (for Arousal)
		Leg Movement: 92.4%	Leg Movement: 95.5%	Leg Movement: 90%	N/A (for Leg Movement)
Overall Agreement (OA)	≥ Reference OA - 10% (Ideally ≥ Reference OA)	SDB: 94.9%	SDB: 97.6%	SDB: 91%	N/A (for SDB)
		Hypopnea: 95.5%	Hypopnea: 98.0%	Hypopnea: N/R	Hypopnea: N/R
		Obstructive Apnea: 98.8%	Obstructive Apnea: 99.5%	Obstructive Apnea: 96%	N/A (for Obstructive Apnea)
		Central Apnea: 98.9%	Central Apnea: 99.5%	Central Apnea: N/R	Central Apnea: N/R
		Arousal: 93.2%	Arousal: 95.5%	Arousal: 87%	N/A (for Arousal)
		Leg Movement: 91.7%	Leg Movement: 94.5%	Leg Movement: 89%	N/A (for Leg Movement)
Conclusion on Acceptance Criteria (Endpoint 3)	EnsoSleep PA, NA, and OA point-estimates observed to be greater than or within 5% of the reference device performance for all event types, with statistically significant differences (greater performance) in a majority of cases. All met or exceeded objective performance goals.

Endpoint 4: Total Sleep Time (TST) and Respiratory Rate (RR)

Metric	Acceptance Criteria (Conceptual)	EnsoSleep PPG-TST (K210034) RR Sample (Reported)	EnsoSleep EEG-TST (K210034) RR Sample (Reported)	EnsoSleep EEG-TST (K210034) Adult Sample (Reported)	EnsoSleep EEG-TST (K210034) Pediatric Sample (Reported)
Deming Regression Slope ($β$1)	Near unity (0.90 < $β$1 < 1.10)	0.964	0.984	1.037	1.006
Deming Regression Intercept ($β$0) [hours]	Near zero ($β$0 < 15 minutes / e.g., < 0.25 hours)	0.089	0.156	-0.181	0.021
Bland-Altman Mean Difference (MD) [minutes]	Near zero (MD within ≤15 minutes)	5.380	-4.785	0.515	-3.255
Bland-Altman Upper Limit of Agreement (ULOA) [min]	Within <90 minutes	73.463	32.922	57.750	10.654
Bland-Altman Lower Limit of Agreement (LLOA) [min]	Within <90 minutes	-62.703	-42.492	-56.720	-17.164
RR Performance	≥ 90% percent epochs within ≤2 brpm of reference; MAE ≤ 2 brpm of reference	Met acceptance criteria (no specific numerical values provided in the table)	Met acceptance criteria (no specific numerical values provided in the table)	N/A	N/A
Conclusion on Acceptance Criteria (Endpoint 4)	EnsoSleep PPG-TST and EEG-TST demonstrated statistically similar performance to the predicate device in all samples, meeting or exceeding all acceptance criteria for Deming regression and Bland-Altman analysis, demonstrating no clinically significant deviations.

Overall Conclusion on Study Meeting Acceptance Criteria:
"The subject EnsoSleep device event detection and diagnostic agreement performance were observed to meet or exceed the PA, NA, and OA performance acceptance in the 26 total experiments (26/26) across all 4 experimental endpoints evaluated..." indicating that the device successfully met all predefined acceptance criteria.

2. Sample Sizes Used for the Test Set and Data Provenance

Test Set Sample Sizes:
- Adult Sample: N=100 adult subjects (for Sleep Staging, Sleep Apnea Diagnostic, Event Detection, and TST/RR)
- Pediatric Sample: N=47 pediatric subjects (for Sleep Staging, Sleep Apnea Diagnostic, and Event Detection, and TST/RR)
- Respiratory Rate (RR) Sample: N=100 adult subjects (specifically for Respiratory Rate and TST analysis)
Data Provenance:
- "Archived collection of retrospective diagnostic clinical PSG subject data."
- The data was collected from "five (5) clinical testing laboratories" where "two (2) AASM Accredited Sleep Testing Facilities were selected each with two (2) regional sleep testing centers."
- No specific country of origin is mentioned, but the regulatory submission is to the FDA (U.S. Food & Drug Administration). Given the use of AASM (American Academy of Sleep Medicine) accreditation, it is highly likely the data originated from the United States.
- The data is explicitly stated as retrospective.

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts

Number of Experts: A panel of 9 total registered scoring technologists (RPSGTs) was used to establish the ground truth. From this panel, any given subject was assigned to 3 additional, prospective scorers.
Qualifications of Experts: The RPSGTs had "5 to 20+ years clinical experience". They were verified to meet "all defined study scoring-acquisition, scoring-blind, and rater-quality certification controls."

4. Adjudication Method for the Test Set

The ground truth was established using a "2/3 Majority Scoring consensus reference". This means that for any given event or stage, at least two out of the three independent scorers had to agree for it to be considered part of the ground truth.

5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was Done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

The study described is not a Multi-Reader Multi-Case (MRMC) comparative effectiveness study evaluating human readers' improvement with AI assistance.
Instead, it's a standalone performance study comparing the EnsoSleep algorithm's performance against expert consensus (human-scored ground truth) and demonstrating substantial equivalence to a predicate device.
The study design focused on the agreement of the device with human scorers, not on how the device assists human readers or changes their performance.

6. If a Standalone (i.e., algorithm only without human-in-the-loop performance) was Done

Yes, a standalone performance study was done. The described tests for "event detection and diagnostic agreement performance" of the EnsoSleep device relate to its direct output compared to the ground truth established by human experts. The system "automatically score sleep study results" and "does not interpret the results, nor does it suggest a diagnosis." It marks events for a clinician's review and verification, implying its standalone detection capability is what was assessed.

7. The Type of Ground Truth Used (Expert Consensus, Pathology, Outcomes Data, etc.)

The type of ground truth used was expert consensus, specifically a "2/3 Majority Scoring consensus reference" established by independent, registered sleep technologists (RPSGTs).

8. The Sample Size for the Training Set

The document does not explicitly state the sample size for the training set. The sample sizes provided (N=100 Adult, N=47 Pediatric, N=100 RR) refer specifically to the test sets used for clinical validation, described as "study sample[s]... to construct the final study sample."

9. How the Ground Truth for the Training Set Was Established

Since the training set size and characteristics are not detailed, the method for establishing its ground truth is also not described in this document. It is implied that the algorithm was trained on prior data, but the specifics of that process are outside the scope of this 510(k) summary, which focuses on the validation of the final product.

Ask a Question

Ask a specific question about this device

K Number

K162627

Device Name

EnsoSleep

Manufacturer

EnsoData, Inc.

Date Cleared

2017-03-31

(192 days)

Product Code

Regulation Number

Type

Panel

Reference & Predicate Devices

K112102,K153412

Predicate For

K210034,K221179,K223922,K250058

Intended Use

EnsoSleep is intended for use for the diagnostic evaluation by a physician to assess sleep quality and as an aid for the diagnosis of sleep and respiratory related sleep disorders in adults only. EnsoSleep is a software-only medical device to be used under the supervision of a clinician to analyze physiological signals and automatically score sleep study results, including the staging of sleep, detection of arousals, leg movements, and sleep disordered breathing events including obstructive apneas. All automatically scored events are subject to verification by a qualified clinician. Central apneas, mixed apneas, and hypopneas must be manually marked within records.

Device Description

EnsoSleep is a software application that analyzes previously recorded physiological signals obtained during sleep. The EnsoSleep software can analyze any EDF or EDF+ files.

Automated algorithms are applied to the raw signals in order to derive additional signals and interpret the raw and derived signal information. The software automates recognition of:

Sleep Stage Events

Wake
Stage N1
Stage N2
Stage N3
Stage REM

Respiratory Events

Sleep disordered breathing (apneas and hypopneas)
Apneas detected with airflow signal are classified as obstructive apnea (OSA), and can be edited to be central or mixed appeas
Sleep disordered breathing events not detected to be apneas are marked as hypopnea
Central apneas, mixed apneas, and hypopneas must be manually marked within records

Arousal Events

Arousals

Movement Events

Periodic Leg Movements during Sleep (PLMS)

The EnsoSleep software can be used as a stand-alone application for use on Microsoft Windows 7 & 8 operating system platforms. All processing, scoring, and analysis of signal data occurs on the EnsoSleep cloud servers.

AI/ML Overview

Here's a breakdown of the acceptance criteria and study details for the EnsoSleep device, based on the provided FDA 510(k) summary:

1. Table of Acceptance Criteria and Reported Device Performance

The acceptance criteria were established competitively based on the performance reported in the predicate device (Advanced Brain Monitoring, Inc. Sleep Profiler, K153412) and the reference device (Younes Sleep Technologies MICHELE Sleep Scoring System, K112102) 510(k) documentation. The study aimed to demonstrate that EnsoSleep's performance was statistically similar to or greater than these benchmarks.

Endpoint 1: Sleep Staging Performance

Sleep Stage (overall-epochs)	EnsoSleep Performance (Bootstrapped Point Estimate & 95% CI)	Predicate Device (Sleep Profiler K153412) Performance (Point Estimate)
Wake	PA: 86% (82%, 88%), NA: 97% (95%, 98%), OA: 94% (92%, 95%)	PA: 73%, NA: 94%, OA: 89%
N1	PA: 41% (33%, 48%), NA: 94% (93%, 96%), OA: 91% (90%, 93%)	PA: 25%, NA: 93%, OA: 89%
N2	PA: 77% (73%, 81%), NA: 87% (85%, 90%), OA: 83% (80%, 85%)	PA: 77%, NA: 84%, OA: 81%
N3	PA: 81% (74%, 88%), NA: 93% (91%, 95%), OA: 92% (90%, 94%)	PA: 76%, NA: 94%, OA: 91%
REM	PA: 79% (72%, 84%), NA: 99% (98%, 99%), OA: 96% (96%, 97%)	PA: 74%, NA: 97%, OA: 95%
Total (Overall Epochs)	PA: 78% (77%, 80%), NA: 95% (94%, 95%), OA: 91% (91%, 92%)	PA: 73%, NA: 93%, OA: 87%

Conclusion on Endpoint 1: EnsoSleep showed no statistically significant differences or was significantly greater than the predicate device in all comparisons for sleep staging performance.

Endpoint 2: Sleep Apnea Diagnostic Agreement Performance

Diagnostic Agreement (Per-Patient)	EnsoSleep Performance (Bootstrapped Point Estimate & 95% CI)	Predicate Device (Sleep Profiler K153412) Performance (Point Estimate)
AHI >= 5 (overall-mild)	PA: 91% (82%, 98%), NA: 76% (61%, 90%), OA: 85% (77%, 92%)	PA: 100%, NA: 85%
AHI >= 15 (overall-moderate)	PA: 95% (83%, 100%), NA: 98% (94%, 100%), OA: 97% (93%, 100%)	PA: 100%, NA: 97%
REM AHI >= 5 (REM-mild)	PA: 83% (72%, 94%), NA: 89% (79%, 97%), OA: 86% (79%, 93%)	PA: 84%, NA: 90%
REM AHI >= 15 (REM-moderate)	PA: 79% (56%, 94%), NA: 96% (90%, 100%), OA: 92% (85%, 97%)	PA: 73%, NA: 96%

Conclusion on Endpoint 2: EnsoSleep showed no statistically significant differences in PA and NA compared to the predicate device, with one exception for overall-mild PA (a 2% difference in EnsoSleep's CI upper bound vs. predicate point estimate). EnsoSleep's point estimates for PA, NA, and OA exceeded, were equivalent to, or were within 10% of the predicate device's.

Endpoint 3: Event Detection Performance

Event Type (Overall-Epochs)	EnsoSleep Performance (Bootstrapped Point Estimate & 95% CI)	Reference Device (MICHELE Sleep Scoring K112102) Performance (Point Estimate)
SDB	PA: 67% (58%, 75%), NA: 93% (92%, 94%), OA: 91% (90%, 92%)	PA: 75.5%, NA: 98.1%, OA: 93.0%
Apnea	PA: 56% (41%, 70%), NA: 96% (96%, 97%), OA: 95% (95%, 96%)	N/A (did not provide PA/NA)
Obstructive Apnea	PA: 53% (35%, 71%), NA: 97% (96%, 97%), OA: 96% (95%, 97%)	PA: 55.9%, NA: 99.3%
Arousal	PA: 66% (61%, 71%), NA: 90% (88%, 91%), OA: 87% (85%, 88%)	PA: 60.0%, NA: 94.1%, OA: 89.9%
Leg Movement	PA: 71% (60%, 80%), NA: 90% (89%, 92%), OA: 89% (87%, 90%)	PA: 78.4%, NA: 97.6%, OA: 95.7%

Conclusion on Endpoint 3: EnsoSleep's point-estimates for PA and NA event detection performance exceeded, were equivalent to, or were within 10% of the reference device, with statistically significant differences observed in a minority of cases.

2. Sample Size for Test Set and Data Provenance

Sample Size (Test Set): N=72 subjects (59719 epochs for epoch-based analyses, 72 for per-patient analyses).
Data Provenance: Retrospective clinical PSG data. The data constituted an "archived collection of retrospective diagnostic clinical PSG data collected from an AASM Accredited" facility. The country of origin is not explicitly stated but implied to be the US given the FDA submission.

3. Number of Experts for Ground Truth and Qualifications

Number of Experts: Three (3) independent registered sleep technologists (RPSGT).
Qualifications: Registered sleep technologists (RPSGT) who met "all acquisition, scoring-bind, and rater controls." Specific years of experience are not mentioned.

4. Adjudication Method for Test Set

Adjudication Method: 2/3 Majority Scoring was used to establish the designated comparative reference (ground truth).

5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

No MRMC comparative effectiveness study was done to evaluate human readers with and without AI assistance. The study described is a standalone performance evaluation of the AI algorithm against expert consensus.

6. Standalone (Algorithm Only) Performance Study

Yes, a standalone study was performed. The entire clinical performance testing described evaluates the EnsoSleep algorithm's performance (without human-in-the-loop) against a human expert consensus ground truth.

7. Type of Ground Truth Used

Type of Ground Truth: Expert consensus (2/3 Majority Scoring by three independent RPSGTs).

8. Sample Size for Training Set

The document does not specify the sample size for the training set. It only describes the validation/test set.

9. How Ground Truth for Training Set Was Established

The document does not provide information on how the ground truth for the training set was established. It focuses solely on the clinical performance testing (validation/test set).

Ask a Question

Ask a specific question about this device

Page 1 of 1