(52 days)
BriefCase-Triage is a radiological computer-aided triaqe and notification software indicated for use in the analysis of CTPA images in adults or transitional adolescents aged 18 and older. The device is intended to assist hospital networks and appropriately trained medical specialists in workflow triage by flagging and communicating suspected positive cases of Central Pulmonary Embolism (Central PE).
BriefCase-Triage uses an artificial intelligence algorithm to analyze images and highlight cases with detected findings in parallel to the ongoing standard of care image interpretation. The user is presented with notifications for cases with suspected Central PE findings. Notifications include compressed preview images that are meant for informational purposes only and not intended for diagnostic use beyond notification. The device does not alter the original medical image and is not intended to be used as a diagnostic device.
The results of BriefCase-Triage are intended to be used in conjunction with other patient information and based on their professional judgment, to assist with triage/prioritization of medical images. Notified clinicians are responsible for viewing full images per the standard of care.
BriefCase-Triage is a radiological computer-assisted triage and notification software device. The software is based on an algorithm programmed component and is intended to run on a linux-based server in a cloud environment.
The BriefCase-Triage receives filtered DICOM images, and processes them chronologically by running the algorithms on each series to detect suspected cases. Following the Al processing, the output of the algorithm analysis is transferred to an image review software (desktop application). When a suspected case is detected, the user receives a pop-up notification and is presented with a compressed, low-quality, grayscale image that is captioned "not for diagnostic use, for prioritization only" which is displayed as a preview function. This preview is meant for informational purposes only, does not contain any marking of the findings, and is not intended for primary diagnosis beyond notification.
Presenting the users with worklist prioritization facilitates efficient triage by prompting the user to assess the relevant original images in the PACS. Thus, the suspect case receives attention earlier than would have been the case in the standard of care practice alone.
The algorithm was trained during software development on images of the pathology. As is customary in the field of machine learning, deep learning algorithm development consisted of training on manually labeled ("tagged") images. In that process, critical findings were tagged in all CTs in the training data set.
Here's a breakdown of the acceptance criteria and study details for the Aidoc BriefCase-Triage device, based on the provided FDA 510(k) summary:
1. Table of Acceptance Criteria and Reported Device Performance
The FDA 510(k) summary explicitly states the primary performance goals and the achieved results.
Acceptance Criteria (Performance Goal) | Reported Device Performance |
---|---|
Sensitivity ≥ 80% | 89.2% (95% CI: 82.5%, 93.9%) |
Specificity ≥ 80% | 94.5% (95% CI: 90.3%, 97.2%) |
Note on Secondary Endpoints: While time-to-notification, PPV, NPV, PLR, and NLR were assessed as secondary endpoints, the document does not state explicit acceptance criteria for them, but rather presents them as comparative data or additional performance metrics.
2. Sample Size and Data Provenance
- Test Set Sample Size: 328 cases from unique patients.
- Data Provenance: Retrospective, multi-center study with data from 6 US-based clinical sites. The cases collected for the pivotal dataset were distinct in time or center from the cases used to train the algorithm.
3. Number of Experts and Qualifications for Ground Truth
- Number of Experts: 3 senior board-certified radiologists.
- Qualifications of Experts: Senior board-certified radiologists. (No specific years of experience are detailed, but "senior" implies extensive experience).
4. Adjudication Method for the Test Set
- Adjudication Method: Majority voting among the three senior board-certified radiologists was used to establish the ground truth.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
- Was an MRMC study done? No, a comparative effectiveness study involving human readers with and without AI assistance (MRMC) was not performed as the primary evaluation for this device. The study compared the algorithm's performance to ground truth, and the device is intended for workflow triage/notification, not as a diagnostic tool replacing human interpretation. The time-to-notification comparison was done between the device and a predicate device (PETN), not human readers.
6. Standalone (Algorithm Only) Performance Study
- Was a standalone study done? Yes. The pivotal study directly evaluated the BriefCase-Triage software's performance (sensitivity and specificity) in identifying Central PE by comparing its output against the established ground truth. This is a standalone performance evaluation.
7. Type of Ground Truth Used
- Type of Ground Truth: Expert consensus (majority voting by three senior board-certified radiologists).
8. Sample Size for the Training Set
- The document states that the algorithm was "trained during software development on images of the pathology." However, the exact sample size for the training set is not specified in the provided text.
9. How Ground Truth for the Training Set Was Established
- Method: "Critical findings were tagged in all CTs in the training data set." This implies manual labeling/tagging of findings by experts. The specific number or qualifications of these "tagging" experts are not detailed, but it's consistent with a machine learning development process.
§ 892.2080 Radiological computer aided triage and notification software.
(a)
Identification. Radiological computer aided triage and notification software is an image processing prescription device intended to aid in prioritization and triage of radiological medical images. The device notifies a designated list of clinicians of the availability of time sensitive radiological medical images for review based on computer aided image analysis of those images performed by the device. The device does not mark, highlight, or direct users' attention to a specific location in the original image. The device does not remove cases from a reading queue. The device operates in parallel with the standard of care, which remains the default option for all cases.(b)
Classification. Class II (special controls). The special controls for this device are:(1) Design verification and validation must include:
(i) A detailed description of the notification and triage algorithms and all underlying image analysis algorithms including, but not limited to, a detailed description of the algorithm inputs and outputs, each major component or block, how the algorithm affects or relates to clinical practice or patient care, and any algorithm limitations.
(ii) A detailed description of pre-specified performance testing protocols and dataset(s) used to assess whether the device will provide effective triage (
e.g., improved time to review of prioritized images for pre-specified clinicians).(iii) Results from performance testing that demonstrate that the device will provide effective triage. The performance assessment must be based on an appropriate measure to estimate the clinical effectiveness. The test dataset must contain sufficient numbers of cases from important cohorts (
e.g., subsets defined by clinically relevant confounders, effect modifiers, associated diseases, and subsets defined by image acquisition characteristics) such that the performance estimates and confidence intervals for these individual subsets can be characterized with the device for the intended use population and imaging equipment.(iv) Stand-alone performance testing protocols and results of the device.
(v) Appropriate software documentation (
e.g., device hazard analysis; software requirements specification document; software design specification document; traceability analysis; description of verification and validation activities including system level test protocol, pass/fail criteria, and results).(2) Labeling must include the following:
(i) A detailed description of the patient population for which the device is indicated for use;
(ii) A detailed description of the intended user and user training that addresses appropriate use protocols for the device;
(iii) Discussion of warnings, precautions, and limitations must include situations in which the device may fail or may not operate at its expected performance level (
e.g., poor image quality for certain subpopulations), as applicable;(iv) A detailed description of compatible imaging hardware, imaging protocols, and requirements for input images;
(v) Device operating instructions; and
(vi) A detailed summary of the performance testing, including: test methods, dataset characteristics, triage effectiveness (
e.g., improved time to review of prioritized images for pre-specified clinicians), diagnostic accuracy of algorithms informing triage decision, and results with associated statistical uncertainty (e.g., confidence intervals), including a summary of subanalyses on case distributions stratified by relevant confounders, such as lesion and organ characteristics, disease stages, and imaging equipment.