Search Results
Found 1 results
510(k) Data Aggregation
(232 days)
The MAGNETOM system is indicated for use as a magnetic device (MRDD) that produces transverse, sagittal, coronal and oblique cross sectional images, and that displays the internal structure and/or function of the head or extremities. Other physical parameters derived from the images may also be produced. Additionally, the MAGNETOM system is intended to produce Sodium images for the head and Phosphorus spectroscopic images and/or spectra for whole body, excluding the head. These images and/or spectra and the physical parameters derived from the images and/or spectra when interpreted by a trained physician yield information that may assist in diagnosis.
MAGNETOM Terra and MAGNETOM Terra.X with software syngo MR XA60A include new and modified hardware and software compared to the predicate device, MAGNETOM Terra with software syngo MR E12U. A high level summary of the new and modified hardware and software is provided below: Hardware: New Hardware (Combiner (pTx to sTx), MC-PALI, GSSU control unit, 8Tx32Rx Head coil), Modified Hardware (Main components such as: Upgrade of GPA, New Host computer hardware, New MaRS computer hardware, Upgrade the SEP, The new shim cabinet ASC5 replaces two ACS4 shim cabinets; Other components such as: RFPA, Use of a common MR component which provides basic functionality that is required for all MAGNETOM system types, The multi-nuclear (MNO) option has been modified, OPS module, Cover with UI update on PDD). Software: New Features and Applications (Static B1 shimming, TrueForm (1ch compatibility mode), Deep Resolve Boost, Deep Resolve Gain, Deep Resolve Sharp, Bias field correction (marketing name: Deep RxE), The new BEAT pulse sequence type, BLADE diffusion, The PETRA pulse sequence type, TSE DIXON, The Compressed Sensing (CS) functionality is now available for the SPACE pulse sequence type, The Compressed Sensing (CS) functionality is now available for the TFL pulse sequence type, IDEA, The Scientific Suite), Modified Features and Applications (EP2D DIFF and TSE with SliceAdjust, The Turbo Flash (TFL)), Modified Software / Platform (Stimulation monitoring, "dynamic research labeling"), Other Modifications and / or Minor Changes (Intended use, SAR Calculation and Weight limit reduction for 31P/1H TxRx Flex Loop Coil, X-upgrade for MAGNETOM Terra to MAGNETOM Terra.X, Provide secure MR scanner setup for DoD (Department of Defense) -Information Assurance compliance).
The provided text describes the acceptance criteria and supporting study for the AI features (Deep Resolve Boost, Deep Resolve Sharp, and Deep RxE) within the MAGNETOM Terra and MAGNETOM Terra.X devices.
Here's a breakdown of the requested information:
1. Table of Acceptance Criteria and Reported Device Performance
| AI Feature | Acceptance Criteria | Reported Device Performance |
|---|---|---|
| Deep Resolve Boost | Characterization by several quality metrics such as peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). Visual inspection to ensure potential artifacts are detected. Successful passing of quality metrics tests. Work-in-progress packages delivered and evaluated in clinical settings. (Implicit: No misinterpretation, alteration, suppression, or introduction of anatomical information, and potential for faster image acquisition and significant time savings). | The impact of the network has been characterized by several quality metrics such as peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). Additionally, images were inspected visually to ensure that potential artifacts are detected that are not well captured by the metrics listed above. After successful passing of the quality metrics tests, work-in-progress packages of the network were delivered and evaluated in clinical settings with cooperation partners. In a total of seven peer-reviewed publications, the investigations covered various body regions (prostate, abdomen, liver, knee, hip, ankle, shoulder, hand, and lumbar spine) on 1.5T and 3T systems. All publications concluded that the work-in-progress package and the reconstruction algorithm can be beneficially used for clinical routine imaging. No cases have been reported where the network led to a misinterpretation of the images or where anatomical information has been altered, suppressed, or introduced. Significant time savings are reported. |
| Deep Resolve Sharp | Characterization by several quality metrics such as peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and perceptual loss. Verification and validation by in-house tests including visual rating and evaluation of image sharpness by intensity profile comparisons. (Implicit: Increased edge sharpness). | The impact of the network has been characterized by several quality metrics such as peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and perceptual loss. In addition, the feature has been verified and validated by in-house tests. These tests include visual rating and an evaluation of image sharpness by intensity profile comparisons of reconstruction with and without Deep Resolve Sharp. Both tests show increased edge sharpness. |
| Deep RxE | 1. During training, the loss (difference to ground truth) is monitored, and the training step with the lowest test loss is taken as the final trained network. 2. Automated unit-tests are set up to test the consistency of the generated output to a previously defined reference output. 3. During verification, the performance of the network is tested on a phantom against the ground truth with a maximal allowed NRMSE of 11% (for 2D network) and 8.7% (for 3D network). 4. The trained final network was used in the clinical study. (Implicit: Increases image homogeneity in a reproducible way on the receive profile, and images acquired with Deep RxE are rated better for image quality in the clinical study). | 1. During training, the loss, as the difference to a ground truth, is monitored and the training step with the lowest test loss is taken as the final trained network. 2. Automated unit-tests are set up to test the consistency of the generated output to a previously defined reference output. 3. During verification, the performance of the network is tested on a phantom against the ground truth with a maximal allowed NRMSE of 11% (11% for the 2D network and 8.7% for the 3D network were achieved). 4. The trained final network was used in the clinical study. The tests show that Deep RxE increases image homogeneity in a reproducible way on the receive profile. Images acquired with Deep RxE (DL bias field correction) are rated better for image quality than the ones acquired without it in the clinical study that was conducted. |
Note on Acceptance Criteria: The document directly states acceptance criteria for Deep RxE (e.g., NRMSE < 11%). For Deep Resolve Boost and Deep Resolve Sharp, the "acceptance criteria" are more implicitly derived from the described validation and evaluation metrics and outcomes (e.g., "successful passing of quality metrics tests," "increased edge sharpness," "no misinterpretation").
2. Sample Size Used for the Test Set and Data Provenance
| AI Feature | Test Set Sample Size | Data Provenance |
|---|---|---|
| Deep Resolve Boost | 1,874 2D slices (from validation set) | In-house measurements and collaboration partners. (Retrospective, as input data was retrospectively created from ground truth by data manipulation and augmentation). |
| Deep Resolve Sharp | 2,057 2D slices (from validation set) | In-house measurements. (Retrospective, as input data was retrospectively created from ground truth by data manipulation). |
| Deep RxE | 23,992 2D slices / 404 3D volumes (validation and test set) | All data from two 7T MR systems (MAGNETOM Terra and MAGNETOM Terra.X). (Implied retrospective, as data was separated into independent sets). |
Patient characteristics (Gender/Age) were recorded for Deep RxE: female: 56%, male: 41%, phantom: 3%. Age range 20-80 years. Not recorded for other features.
Ethnicity was not recorded for any feature. The document states that due to network architecture, attributes like gender, age, and ethnicity are not relevant to training data.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications
- Deep Resolve Boost & Deep Resolve Sharp: The document does not mention the use of experts for ground truth establishment for the test set regarding these features. Images were visually inspected and quality metrics were used.
- Deep RxE: The document mentions that images acquired with Deep RxE were "rated better for image quality than the ones acquired without it in the clinical study that was conducted." This implies expert evaluation, but the number of experts or their qualifications for the test set ground truth for Deep RxE is not explicitly stated.
- Separately, for the overall device clearance, "radiologist's evaluation reports from two U.S. board-certified radiologists have been provided" for software modifications and new hardware. This is a general statement for the device and not specifically linked to the ground truth of the AI features' test sets.
4. Adjudication Method for the Test Set
The document does not explicitly describe an adjudication method (like 2+1, 3+1) for establishing ground truth for the test sets of these AI features. For Deep Resolve Boost and Sharp, the ground truth was derived from the acquired datasets themselves, which were then manipulated to create input data. For Deep RxE, the ground truth for phantom testing was a "previously defined reference output" or the acquired data, and for the clinical study, images were "rated better" but the adjudication process for this rating isn't detailed.
5. Multi Reader Multi Case (MRMC) Comparative Effectiveness Study
No MRMC comparative effectiveness study is directly mentioned specifically for the AI features (Deep Resolve Boost, Deep Resolve Sharp, or Deep RxE) that compares human readers with vs. without AI assistance. The document alludes to radiologists evaluating images with new software features or comparing images from subject/predicate devices, and for Deep Resolve Boost, it mentions clinical settings and "seven peer-reviewed publications" concluding beneficial use for clinical routine, with reports of "significant time savings." For Deep RxE, images were "rated better for image quality," which implies a reader study, but no details on methodology, number of readers, or specific effect size are provided to quantify human reader improvement with AI assistance.
6. Standalone (Algorithm Only) Performance
Yes, standalone performance was conducted for all three AI features:
- Deep Resolve Boost: Characterized by PSNR and SSIM, and visual inspection.
- Deep Resolve Sharp: Characterized by PSNR, SSIM, perceptual loss, visual rating, and intensity profile comparisons.
- Deep RxE: Performance of the network tested on a phantom against ground truth (maximal allowed NRMSE of 11% for 2D, 8.7% for 3D achieved). Unit-tests and a two-step test procedure involving validation on unseen data and RMS error calculation against ground truth.
These indicate evaluation of the algorithm's performance without a human in the loop, beyond initial visual inspections by evaluators.
7. Type of Ground Truth Used
- Deep Resolve Boost: The acquired datasets themselves, retrospectively manipulated through data manipulation and augmentation (under-sampling, lowering SNR, mirroring k-space data) to create input data.
- Deep Resolve Sharp: The acquired datasets themselves, retrospectively manipulated through data manipulation (cropping k-space data) to create corresponding low-resolution input and high-resolution output/ground truth.
- Deep RxE: For phantom testing, a "previously defined reference output" was used, and for other evaluations, the acquired datasets themselves were used for comparison against bias field correction methods (homodyne filtering, N4, UNICORN).
8. Sample Size for the Training Set
| AI Feature | Training Set Sample Size |
|---|---|
| Deep Resolve Boost | 24,599 2D slices |
| Deep Resolve Sharp | 11,920 2D slices |
| Deep RxE | 119,955 2D slices / 2007 3D volumes |
9. How the Ground Truth for the Training Set Was Established
- Deep Resolve Boost: The "acquired datasets represent the ground truth for the training and validation." Input data was "retrospectively created from the ground truth by data manipulation and augmentation."
- Deep Resolve Sharp: The "acquired datasets represent the ground truth for the training and validation." Input data was "retrospectively created from the ground truth by data manipulation."
- Deep RxE: The document states that "During training the loss, as the difference to a ground truth, is monitored." The method of establishing this initial ground truth is implicitly the raw acquired data from the 7T MRI scanners, as the network aims to correct for B1 inhomogeneities. It also states "All data from the two MR systems were separated into independent training, validation and test datasets," implying the raw or processed raw data served as reference for training.
Ask a specific question about this device
Page 1 of 1