Search Results

Transpara software is intended for use as a concurrent reading aid for physicians interpreting screening full-field digital mammography exams and digital breast tomosynthesis exams from compatible FFDM and DBT systems, to identify regions suspicious for breast cancer and assess their likelihood of malignancy. Output of the device includes locations of calcifications groups and soft-tissue regions, with scores indicating the likelihood that cancer is present, and an exam score indicating the likelihood that cancer is present in the exam. Patient management decisions should not be made solely on the basis of analysis by Transpara.

Device Description

Transpara is a software only application designed to be used by physicians to improve interpretation of full-field digital mammography (FFMD) and digital breast tomosynthesis (DBT). Deep learning algorithms are applied to images for recognition of suspicious calcifications and soft tissue lesions (including densities, masses, architectural distortions, and asymmetries). Algorithms are trained with a large database of biopsy-proven examples of breast cancer, benign abnormalities, and examples of normal tissue.

Transpara offers the following functions which may be used at any time in the reading process, to improve detection and characterization of abnormalities and enhance workflow:

AI findings for display in the images to highlight locations where the device detects suspicious calcifications or soft tissue lesions, along with region scores per finding on a scale ranging from 1-100, with higher scores indicating a higher level of suspicion.
Links between corresponding regions in different views of the breast, which may be utilized to enhance user interfaces and workflow.
An exam-based score which categorizes exams with increasing likelihood of cancer on a scale of 1-10 or in three risk categories labeled as 'low', 'intermediate' or 'elevated'.

The concurrent use indication implies that it is up to the users to decide how to use Transpara in the reading process. Transpara functions can be used before, during or after visual interpretation of an exam by a user.

Results of Transpara are computed in a standalone processing appliance which accepts mammograms in DICOM format as input, processes them, and sends the processing output to a destination using the DICOM protocol in a standardized mammography CAD DICOM format. Common destinations are medical workstations, PACS and RIS. The system can be configured using a service interface. Implementation of a user interface for end users in a medical workstation is to be provided by third parties.

AI/ML Overview

The provided text describes the acceptance criteria and a study that proves the device, Transpara (2.1.0), meets these criteria.

Here's an organized breakdown of the information requested:

Acceptance Criteria and Reported Device Performance

The acceptance criteria are implicitly defined by the reported performance metrics. The study aims to demonstrate non-inferiority and superiority to the predicate device, Transpara 1.7.2. The key metrics reported are sensitivity at various specificity levels and Exam-based Area Under the Receiver Operating Characteristic Curve (AUC).

Table 1: Acceptance Criteria (Implied by Performance Goals) and Reported Device Performance (Standalone without Temporal Analysis)

Metric	Acceptance Criteria (Implied/Target)	Reported Performance (FFDM)	Reported Performance (DBT)
Sensitivity (Sensitive Mode @ 70% Specificity)	Non-inferior & Superior to Predicate Device 1.7.2 (quantitative value not specified, but implied by comparison)	97.4% (96.3 - 98.5)	96.9% (95.5 - 98.3)
Sensitivity (Specific Mode @ 80% Specificity)	Non-inferior & Superior to Predicate Device 1.7.2	95.2% (93.7 - 96.7)	95.1% (93.3 - 96.8)
Sensitivity (Elevated Risk @ 97% Specificity)	Non-inferior & Superior to Predicate Device 1.7.2	80.8% (78.0 - 83.6)	78.4% (75.1 - 81.7)
Exam-based AUC	Non-inferior & Superior to Predicate Device 1.7.2	0.960 (0.953 - 0.966)	0.955 (0.947 - 0.963)

Table 2: Acceptance Criteria (Implied by Performance Goals) and Reported Device Performance (Standalone with Temporal Analysis - TA)

Metric	Acceptance Criteria (Implied/Target)	Reported Performance (FFDM with TA)	Reported Performance (DBT with TA)
Sensitivity (Sensitive Mode @ 70% Specificity)	Superior to performance without temporal comparison	95.7% (93.7 - 97.6)	94.6% (91.2 - 98.0)
Sensitivity (Specific Mode @ 80% Specificity)	Superior to performance without temporal comparison	95.4% (93.4 - 97.4)	91.0% (86.7 - 95.4)
Sensitivity (Elevated Risk @ 97% Specificity)	Superior to performance without temporal comparison	82.7% (79.1 - 86.4)	74.9% (68.3 - 81.4)
Exam-based AUC	Superior to performance without temporal comparison	0.958 (0.946 - 0.969)	0.941 (0.921 - 0.958)

Study Details

Sample Size Used for the Test Set and Data Provenance:
- Main Test Set (without temporal analysis): 10,207 exams (5,730 FFDM, 4,477 DBT).
  - Normal: 8,587 exams
  - Benign: 270 exams
  - Cancer: 1,350 exams (750 FFDM, 600 DBT)
- Temporal Analysis Test Set: 5,724 exams (4,266 FFDM, 1,458 DBT).
  - Normal: 4,998 exams
  - Benign: 83 exams
  - Cancer: 643 exams (471 FFDM, 172 DBT)
- Data Provenance: Independent dataset acquired from multiple centers in seven EU countries and the US. Retrospective in nature, as it was acquired and not used for algorithm development and included normal exams with at least one year follow-up. The data included images from various manufacturers (Hologic, GE, Philips, Siemens, Fujifilm).
Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of those Experts:
- The document states that the cancer cases in the test set were "biopsy-proven cancer." It does not specify the number or qualifications of experts used to establish the ground truth for the entire test set (including normal and benign cases, and detailed lesion characteristics). The mechanism for establishing the "normal" and "benign" status is not explicitly detailed beyond "normal follow-up of at least one year."
Adjudication Method for the Test Set:
- The document does not explicitly describe an adjudication method involving multiple readers for establishing ground truth for the test set. The ground truth for cancer cases is stated as "biopsy-proven."
If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was done:
- No, the document does not describe a Multi-Reader Multi-Case (MRMC) comparative effectiveness study. The performance assessment is a standalone evaluation of the algorithm's performance, not a human-in-the-loop study comparing human readers with and without AI assistance.
If a Standalone (i.e., algorithm only without human-in-the-loop performance) was done:
- Yes, standalone performance tests were conducted. The results presented in Tables 2 and 5 are for the algorithm's performance only.
The Type of Ground Truth Used:
- The primary ground truth for cancer cases is biopsy-proven cancer. For normal exams within the test set, the ground truth was established by "a normal follow-up of at least one year," implying outcomes data (absence of diagnosed cancer over a follow-up period).
The Sample Size for the Training Set:
- The document does not explicitly state the sample size of the training set. It mentions "Deep learning algorithms are applied to images for recognition of suspicious calcifications and soft tissue lesions... Algorithms are trained with a large database of biopsy-proven examples of breast cancer, benign abnormalities, and examples of normal tissue."
How the Ground Truth for the Training Set Was Established:
- The ground truth for the training set was established using a "large database of biopsy-proven examples of breast cancer, benign abnormalities, and examples of normal tissue." This implies a similar methodology to the test set for cancer cases (biopsy verification) and likely clinical follow-up or expert consensus for benign/normal cases, though not explicitly detailed for the training set.

Ask a Question

Ask a specific question about this device

Page 1 of 1