(311 days)
-
encevis is intended for the review, monitoring and analysis of EEG recordings made by electroencephalogram (EEG) devices using scalp electrodes and to aid neurologists in the assessment of EG. This device is intended to be used by qualified medical practitioners who will exercise professional judgment in using the information.
-
The seizure detection component of encevis is intended to mark previously acquired sections of adult (greater than or equal to 18 years) EEG recordings that may correspond to electrographic seizures, in order to assist qualified clinical practitioners in the assessment of EEG traces. EEG recordings should be obtained with a full scalp montage according to the standard 10/20-system.
-
The spike detection component of encevis is intended to mark previously acquired sections of the patient's EEG recordings that may correspond to spikes, in order to assist qualified clinical practitioners in the assessment of EEG traces. The Spike Detection component is intended to be used in adult patients greater than or equal to 18 years. encevis Spike Detection performance has not been assessed for intracranial recordings.
-
encevis includes the calculation and display of a set of quantitative measures intended to monitor and analyze the EEG waveform. These include frequency bands, rhythmic and periodic patterns and burst suppression. These quantitative EEG measures should always be interpreted in conjunction with review of the original EEG waveforms.
-
The aEEG functionality included in encevis is intended to monitor the state of the brain.
-
encevis provides notifications on an on-screen display for seizure detection, spike detection, quantitative EEG and a EG that can be used when processing a record during acquisition. Delays of up to several minutes can occur between the beginning of a seizure, the occurrence of a spike or detection of quantitative EEG features and when the encevis notifications will be shown to a user. encevis notifications cannot be used as a substitute for real time monitoring of the underlying EEG by a trained expert.
-
encevis PureEEG (Artifact Reduction) is intended to reduce artifacts in a standard 10-20 EEG recording. PureEEG does not remove the entiract signal, and is not effective for other types of artifacts. PureEEG may modify portions of waveforms representing cerebral activity. Waveforms must still be read by a qualified medical practitioner trained in recognizing artifact, and any interpretation or diagnosis must be made with reference to the original waveforms.
-
This device does not provide any diagnostic conclusion about the patient's condition to the user.
encevis combines several modalities for viewing and analyzing EEG data in one integrated software package. Encevis consists of the following modalities: encevis EEG-Viewer, encevis artifact reduction (PureEEG), encevis seizure detection (EpiScan), encevis spike detection (EpiSpike), encevis rhythmic and periodic patterns, encevis aEEG, encevis frequency bands, encevis Burst Suppression.
Here's a breakdown of the acceptance criteria and the studies performed to prove the device meets these criteria, based on the provided document. The information is organized according to your requested points.
encevis Device Performance Study Summary
1. Table of Acceptance Criteria and Reported Device Performance
The document presents performance metrics for various components of the encevis device, often comparing them to a predicate device (Persyst). The acceptance criteria are implicitly defined by the non-inferiority margins used in statistical testing.
Feature (Component) | Acceptance Criterion (Implicit) | Reported Device Performance (encevis) | Comparison to Predicate (Persyst) |
---|---|---|---|
Seizure Detection (EpiScan) | Non-inferiority to predicate device (Persyst) with margins: |
- PPA: 10%
- NDR: 1 (false detections/24 hours) | Average PPA: 75.40% (95% CI=[64.5, 86.3])
Average NDR: 7.01 false detections in 24 hours (95% CI=[5.9, 8.2]) | PPA is non-inferior (Predicate PPA: 75.94% (95% CI=[65.5, 86.4]))
NDR is non-inferior (Predicate NDR: 10.61 false detections in 24 hours (95% CI=[6.8, 14.5])) |
| Spike Detection (EpiSpike) | Non-inferiority to predicate device (Persyst) with margins: - PPA: 3%
- NPA: 3%
- PLPA: 3% | Average PPA: 61.02% (95% CI=[47.9-74.1])
Average NPA: 98.58% (95% CI=[97.9-99.3])
Average PLPA: 96.09% (95% CI=[88.8-103.4]) | PPA is non-inferior (Predicate PPA: 8.7% (95% CI=[4.4-13.0]))
NPA is non-inferior (Predicate NPA: 99.69% (95% CI=[99.4-99.9]))
PLPA is non-inferior (Predicate PLPA: 93.97% (95% CI=[83.6-104.3])) |
| Artifact Reduction (PureEEG) | Non-inferiority to predicate device (Persyst) with margins: - Relative suppression of true EEG: 1 dB (delta CI = [-0.09, -0.04])
- Signal-to-noise ratio (SNR) after artifact removal: 0.10 (delta CI = [4.78, 6.60]) | Relative suppression of clean EEG lower (better) than predicate in 82/111 cases.
SNR after artifact removal higher (better) than predicate in 75/80 cases. | Both parameters found non-inferior. |
| Rhythmic and Periodic Patterns (NeuroTrend) | Sensitivity and Specificity for pattern detection:
(No explicit acceptance criteria stated as thresholds, but performance shown relative to expert annotations) | ANY (all patterns):
Sensitivity: 77.59% (95% CI=[75.5-79.7])
Specificity: 86.50% (95% CI=[85.8-87.2])
PD:
Sensitivity: 63.37% (95% CI=[60.7-66.0])
Specificity: 96.57% (95% CI=[96.2-96.9])
ARA (RTA, RAA, SW):
Sensitivity: 92.72% (95% CI=[88.2-97.2])
Specificity: 94.76% (95% CI=[94.4-95.2])
RDA:
Sensitivity: 92.56% (95% CI=[87.5-97.7])
Specificity: 90.44% (95% CI=[89.9-91.0]) | Not directly compared to a predicate for these metrics. Inter-reader agreement for localization was moderate (Kappa=0.45, CI=0.43-0.48). |
| aEEG (NeuroTrend) | Functional equivalence to proposed method (Zhang and Ding, 2013) and good accordance with predicate (Persyst) | Frequency response very similar to published version; both hemispheres show same characteristic; stop band suppression -30dB or higher; slope in pass band approx. -12dB/decade.
Good accordance with Persyst and raw EEG values. | Demonstrated good accordance with predicate (Persyst) |
| Frequency Bands (NeuroTrend) | Correct assignment to frequency bands (Delta, Theta, Alpha, Beta) with measurement error for amplitudes below 5%; correctly identifies globally dominant background frequency. | Correctly assigns test signals to corresponding frequency bands; measurement error for amplitudes below 5%.
Relative proportion corresponding to true frequency band > 50%. | Not directly compared to a predicate. |
| Burst Suppression (NeuroTrend) | Validated by sensitivity, specificity, PPV, NPV relative to expert consensus. | Sensitivity: 87.28% (95% CI=[90.8, 92.3])
Specificity: 92.16% (95% CI=[91.3, 92.9])
PPV: 61.09% (95% CI=[57.9, 64.2])
NPV: 98.09% (95% CI=[97.6, 98.5])
ACC: 91.56% (95% CI=[90.8,92.3]) | Not directly compared to a predicate. |
2. Sample Size Used for the Test Set and Data Provenance
-
Seizure Detection:
- Sample Size: 55 subjects. 50 patients with seizure events, 5 subjects without epilepsy.
- Data Provenance: Scalp-EEG recordings from video-EEG monitoring in an epilepsy monitoring unit. Patients were 18 years of age or older. The specific country of origin is not stated but context suggests a European context (AIT Austrian Institute of Technology). The data is retrospective.
- Total EEG data reviewed: 1619 hours of EEG, with a maximum of 30 hours per subject.
-
Spike Detection:
- Sample Size: 23 patients. 18 subjects (>=18 years) with spike events, 5 subjects (>=18 years) diagnosed with no epilepsy.
- Data Provenance: Scalp-EEG recordings from video-EEG monitoring in an epilepsy monitoring unit. The specific country of origin is not stated but context suggests a European context. The data is retrospective.
-
Artifact Reduction:
- Sample Size: 128 EEG data records. 60 records from epilepsy monitoring units (31 seizure segments from 31 subjects, 33 spike segments from 6 subjects) and 65 from ICU patients (65 subjects).
- Data Provenance: From different patient groups, covering adult patients in epilepsy monitoring and in critical care. The specific country of origin is not stated but context suggests a European context. The data is retrospective. Each record was 10 seconds of data.
-
Rhythmic and Periodic Patterns:
- Sample Size: 83 long-term EEGs.
- Data Provenance: Prospectively recorded from ICU-patients at two different centers. The specific country of origin is not stated but context suggests a European context.
- Total Annotation Segments: 11935 common annotation segments (first minute of each hour, split into three 20-second segments).
-
aEEG (NeuroTrend):
- Sample Size: Not explicitly stated for real EEG data comparison with Persyst, beyond "real EEG data were used."
- Data Provenance: Not explicitly stated.
-
Frequency Bands (NeuroTrend):
- Sample Size: Not explicitly stated for manually selected EEGs from epilepsy or ICU patients, beyond "Each of these EEG samples is representative..."
- Data Provenance: Manually selected from epilepsy or ICU patients.
-
Burst Suppression (NeuroTrend):
- Sample Size: 83 long-term EEGs.
- Data Provenance: Recorded from intensive care patients from two different centers. The specific country of origin is not stated but context suggests a European context.
- Total Annotation Segments: 3978 valid annotation segments (first minute of each hour).
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
- Seizure Detection: 3 independent neurologists for blinded review. Qualifications not explicitly detailed (e.g., years of experience).
- Spike Detection: 3 independent neurologists for blinded review. Qualifications not explicitly detailed.
- Artifact Reduction: 3 independent epileptologists or neurologists for blinded review. Qualifications not explicitly detailed.
- Rhythmic and Periodic Patterns: 2 clinical neurophysiologists who were naive to these EEGs. Qualifications not explicitly detailed.
- Burst Suppression: 2 clinical neurophysiologists who were naive to these EEGs. Qualifications not explicitly detailed.
4. Adjudication Method for the Test Set
- Seizure Detection: An event was considered a "true seizure" if the time interval of two out of three reviewers overlapped by at least 1 second. The seizure epoch was defined as the overlapping time range of these two reviewers. This is a 2+1 consensus method.
- Spike Detection: An event was considered a "true spike" only if the time interval of two out of three reviewers overlapped. For localization, the 3D-coordinates of the electrode next to the spike maximum, averaged over reviewers, were used. This implies a 2+1 consensus for detection and a form of consensus/averaging for localization.
- Artifact Reduction: The document states that "annotations of clean EEG recordings without any artifacts, and moreover annotations of artifacts that can be superimposed to the clean recordings" were needed, and "three independent epileptologists or neurologists for blinded review of the EEG data" were engaged. It doesn't explicitly state the adjudication method (e.g., 2+1, 3+1) for these annotations, but likely implies a consensus approach to establish ground truth about "clean" vs "artifact" segments.
- Rhythmic and Periodic Patterns: Annotations had to be consistent between both reviewers to be used in the sensitivity and specificity measurement. This implies 100% agreement between 2 reviewers.
- Burst Suppression: The performance was analyzed for consensus annotations of the two reviewers, meaning annotation segments where both reviewers showed the same decision. This implies 100% agreement between 2 reviewers.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done
No, an MRMC study comparing human readers with AI assistance versus without AI assistance was not explicitly conducted or reported. The studies focused on the standalone performance of the AI components and their non-inferiority to a predicate device (another software), not on the improvement of human readers with AI assistance.
6. If a Standalone Performance (Algorithm Only Without Human-in-the-Loop Performance) Was Done
Yes, standalone performance was assessed for all tested components. The clinical performance data presented (seizure detection, spike detection, artifact reduction, rhythmic and periodic patterns, aEEG, frequency bands, burst suppression) evaluated the algorithm's output against established ground truth without a human in the loop interpreting the AI's results. For instance, seizure detection compared the algorithm's detected seizure time points to the expert consensus.
7. The Type of Ground Truth Used
- Seizure Detection: Expert consensus (2 out of 3 neurologists).
- Spike Detection: Expert consensus (2 out of 3 neurologists).
- Artifact Reduction: Expert annotations (epileptologists/neurologists) for clean EEG and artifact patterns.
- Rhythmic and Periodic Patterns: Expert consensus (2 clinical neurophysiologists with 100% agreement).
- aEEG: Based on the proposed method of Zhang and Ding (2013) for frequency response, and comparison to another FDA-approved software (Persyst) for real EEG data.
- Frequency Bands: Based on predefined frequency borders (Delta, Theta, Alpha, Beta) and manual selection of EEG recordings representative of specific background-EEG-frequency bands by experts.
- Burst Suppression: Expert consensus (2 clinical neurophysiologists with 100% agreement) for burst suppression patterns.
8. The Sample Size for the Training Set
The document does not explicitly provide the sample size for the training set for any of the algorithms. It often refers to "large amount of EEG data from different centers" for bench testing, but does not distinguish between training and test data or specify training set sizes.
9. How the Ground Truth for the Training Set Was Established
Similarly, the document does not explicitly describe how ground truth for the training set was established, as it does not detail the training process or the data used for it. The focus is on the validation/test set and its ground truth establishment.
§ 882.1400 Electroencephalograph.
(a)
Identification. An electroencephalograph is a device used to measure and record the electrical activity of the patient's brain obtained by placing two or more electrodes on the head.(b)
Classification. Class II (performance standards).