Search Results
Found 11 results
510(k) Data Aggregation
(52 days)
The XRpad2 4336 HWC-M, when used with a radiographic imaging system, is indicated for use in generating radiographic images of human anatomy for diagnostic X-ray procedures, wherever conventional screen-film (SF), digital radiography (DR), or computed radiography (CR) systems may be used. It is not intended for mammographic use.
The XRpad2 4336 HWC-M is a lightweight, cassette-sized, flat panel X-ray detector for digital radiography. The X-ray detector consists of an amorphous silicon flat panel with a directly deposited CsI:T1 scintillator and dedicated read-out, scan, and control electronics, all packaged in a carbon-fiber and aluminum enclosure. The outside dimensions of the detector are 460.0 mm × 383.6 mm × 15.5 mm, which fits into a standard X-ray cassette Bucky.
The detector can be integrated into a fixed room X-ray system to enable digital radiography.
This document describes the 510(k) summary for the PerkinElmer XRpad2 4336 HWC-M Flat Panel Detector, which seeks to establish substantial equivalence to a predicate device.
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state "acceptance criteria" in a quantitative table format with corresponding "reported device performance" values for the XRpad2 4336 HWC-M. Instead, it relies on demonstrating equivalence to a predicate device (PerkinElmer XRpad 4336 MED, K140551) through non-clinical testing and stating that the physical values are "comparable."
However, a comparison chart for device characteristics is provided (Page 5), which implicitly indicates the performance metrics considered for demonstrating equivalence. The "acceptance criteria" for the new device are essentially that its performance is equivalent to the predicate.
| Characteristic | Predicate Device Performance (PerkinElmer XRpad 4336 MED, K140551) | Proposed Device Performance (PerkinElmer XRpad2 4336 HWC-M) | Reference/Comment (Implicit Acceptance Criteria) |
|---|---|---|---|
| Intended Use / Indications for Use | As described for predicate | Same | Must be the same as the predicate. |
| Panel | Single substrate amorphous silicon active TFT/diode array | Same | Must be the same. |
| Scintillator | Direct deposition CsI:Tl | Same | Must be the same. |
| Pixel pitch | 100 µm | Same | Must be the same. |
| Pixel matrix | 3556 × 4320 pixels | 3524 × 4288 pixels | Slightly different, but not noted as impacting image quality or substantial equivalence. |
| Limiting resolution | 5 lp/mm | Same | Must be the same. |
| Binning capable | 2 × 2 binning for 200 µm | Same | Must be the same. |
| Data transmit area | 355 mm × 432 mm | 352 mm × 429 mm | Slightly different, corresponding to pixel matrix change, but not noted to impact image quality. |
| External dimensions (w×l×h) | 384 mm × 460 mm × 15 mm | 384 mm × 460 mm × 15.5 mm | Minimal difference, attributed to housing update. |
| Weight | 3.8 kg | 3.1 kg | Different, due to housing update, and stated not to impact image quality. |
| Housing material | Aluminum with carbon-fiber | Same | Must be the same. |
| Finish | Matte carbon-fiber front with white silk-screen active area markers | Same | Must be the same. |
| Communication interface | Gb Ethernet or 802.11n WiFi | Same | Must be the same. |
| WiFi band | 5.1-5.3 GHz | 5.1-5.9 GHz | Expanded, but considered equivalent for networking. |
| X-ray synchronization interface | Dedicated trigger in/out signal lines or Automatic Exposure Detection | Same | Must be the same. |
| Power | External power supply or battery | Same | Must be the same. |
| Software library | Windows OS | Windows and Linux OS | Expanded OS support, but functionality considered equivalent. |
| DQE and MTF | Not explicitly stated, but established for predicate | Comparable to predicate device | Must be comparable to the predicate device to demonstrate substantial equivalence (non-clinical test). |
2. Sample Size Used for the Test Set and Data Provenance
A new clinical study was not conducted for the XRpad2 4336 HWC-M device. The clinical data supporting the predicate device, XRpad 4336 MED, was derived from testing for K122495 (XRD 1622 AP3 MED). Therefore, no specific "test set" sample size or data provenance is provided for the current device's clinical performance. The current device relies on demonstrating that its differences "do not invalidate the applicability of the clinical study data submitted in K122495."
3. Number of Experts Used to Establish Ground Truth for the Test Set and Qualifications of Those Experts
Since a new clinical study was not conducted for this device, and it relies on previous clinical data, this information is not provided in the document. The document refers to the predicate clearance (K140551) which itself references K122495 for clinical data. Details on experts and ground truth for K122495 are not in this document.
4. Adjudication Method for the Test Set
As no new clinical study test set was used for this device, a specific adjudication method is not described.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
No MRMC comparative effectiveness study was done for the XRpad2 4336 HWC-M. The submission focuses on non-clinical equivalence to a predicate device.
6. Standalone (Algorithm Only Without Human-in-the-Loop Performance) Study
This device is a flat panel X-ray detector, not an AI algorithm. Therefore, a standalone (algorithm only) performance study as typically understood for AI/CAD systems is not applicable or performed. Its performance is measured by image quality metrics (like DQE, MTF) and compared to a predicate device.
7. Type of Ground Truth Used
For the current device, direct clinical ground truth (e.g., pathology, outcomes data) was not established in a new study. The device's "ground truth" for clinical performance is indirectly established by demonstrating its equivalence to a predicate device whose clinical effectiveness was previously established, likely through methods typical for X-ray detectors (e.g., image quality assessment, possibly expert consensus on diagnostic images from K122495).
8. Sample Size for the Training Set
This device is an X-ray detector, not a machine learning algorithm that requires a "training set" in the conventional sense. The document describes engineering, design, and performance validation, not algorithmic training.
9. How the Ground Truth for the Training Set Was Established
As this is not an AI/ML device requiring a training set, this question is not applicable.
Ask a specific question about this device
(48 days)
The XRpad2 3025 HWC-M, when used with a radiographic imaging system, is indicated for use in generating radiographic images of human anatomy for diagnostic X-ray procedures, wherever conventional screen-film (SF), digital radiography (DR), or computed radiography (CR) systems may be used. It is not intended for mammonfions .
standard X-ray cassette Bucky
- topad IPU-2 (Interface and Power Un
- KRpad LPT2 Detector Cable
- XRpad Protective Insert 307
- OV Cable IEC 60320 C13
- ........ Cable IEC 60320 CLS
- frigger Cable 5 m/16.5
- ger Cable 20 m/65
- GigE Interface Cable 15.25 m/ Interface Cable 7.6 m/2
- E Interface Cable 30.5 m/100
The provided text is a 510(k) premarket notification for the PerkinElmer XRpad2 3025 HWC-M Flat Panel Detector. It seeks to prove substantial equivalence to a predicate device (PerkinElmer XRpad 4336 MED, K140551) rather than demonstrating that the device meets specific acceptance criteria through a clinical study with a test set of human images.
Therefore, many of the requested elements regarding acceptance criteria, study design, and ground truth establishment for a clinical performance study on human images are not explicitly present or applicable in this document because the submission focuses on non-clinical performance and technical equivalence for an X-ray detector, not an AI/CADe device that interprets images. This document describes the device itself and its technical performance, not its diagnostic accuracy in a clinical setting with human readers.
However, I can extract information related to the technical performance of the device, which serves as the "acceptance criteria" and "proof" in the context of this 510(k) submission.
Summary of Device Acceptance Criteria and Performance (Based on Technical Equivalence):
This submission is a 510(k) for an X-ray flat panel detector, not an AI or CADe device. The "acceptance criteria" here are based on demonstrating that the new device's technical specifications and performance are substantially equivalent to a previously cleared predicate device, ensuring it performs as intended for generating radiographic images.
1. Table of Acceptance Criteria and Reported Device Performance:
| Characteristic (Acceptance Criteria - implicitly comparable to predicate) | Predicate Device (XRpad 4336 MED, K140551) | Proposed Device (XRpad2 3025 HWC-M) | Fulfilled? (Based on "Same" or acceptable differences) |
|---|---|---|---|
| Intended Use / Indications for Use | Generating radiographic images of human anatomy for diagnostic X-ray procedures. Not for mammographic use. | Same | Yes |
| Scintillator | Direct deposition CsI:Tl | Same | Yes |
| Pixel pitch | 100 µm | Same | Yes |
| Limiting resolution | 5 lp/mm | Same | Yes |
| Binning capable | 2x2 binning for 200 µm | Same | Yes |
| Communication interface | Gb Ethernet or 802.11n WiFi | Same | Yes |
| WiFi band | 5.1-5.3 GHz | 5.1-5.9 GHz | Yes (Wider band acceptable) |
| X-ray synchronization interface | Dedicated trigger in/out signal lines or Automatic Exposure Detection | Same | Yes |
| Power | External power supply or battery | Same | Yes |
| Software library | Windows OS | Windows and Linux OS | Yes (Expanded OS support acceptable) |
| Key Physical Differences (Not direct "acceptance criteria" but noted) | |||
| Pixel matrix | 3556 x 4320 pixels | 2508 x 3004 pixels | Different (Smaller size, noted as "decrease in dimensions") |
| Data transmit area | 355 mm x 432 mm | 250 mm x 300 mm | Different (Smaller size, noted as "decrease in dimensions") |
| External dimensions (w x l x h) | 384 mm x 460 mm x 15 mm | 282 mm x 332 mm x 15.5 mm | Different (Smaller size) |
| Weight | 3.8 kg | 1.8 kg | Different (Lighter) |
| Housing material | Aluminum with carbon-fiber | Same | Yes |
| Finish | Matte carbon-fiber front with white silk-screen active area markers | Same | Yes |
Note: The document states "The modifications are related to the decrease in dimensions and does not impact image quality." This implies that while the dimensions are different, the critical image quality parameters, which are the 'acceptance criteria' for this type of device, are maintained.
2. Sample size used for the test set and the data provenance:
- This is a non-clinical performance study. The "test set" consists of the device itself undergoing various physical and electrical performance tests.
- The document explicitly states: "PerkinElmer has performed internal non-clinical testing to demonstrate the safety and effectiveness of the XRpad2 3025 HWC-M Flat Panel Detector."
- There is no mention of a sample size of human images or data provenance (country of origin, retrospective/prospective) because this submission is about the technical performance of the X-ray detector hardware, not its diagnostic performance on patient images assessed by humans or AI.
3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts (e.g. radiologist with 10 years of experience):
- Not applicable. This submission relies on engineering and physics measurements to establish the performance of the X-ray detector, not diagnostic interpretations by medical experts. The ground truth for this device's performance would be derived from quantitative measurements like DQE, MTF, resolution, etc., performed in a lab setting.
- The document mentions "BE and MTF are comparable to the predicate device. The non-clinics [sic] findings show that the detector complies with standards and regulations." This refers to objective physical measurements.
4. Adjudication method (e.g. 2+1, 3+1, none) for the test set:
- Not applicable. There is no human image interpretation in this submission, so no adjudication of expert opinions is required.
5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
- Not applicable. This is an X-ray detector, not an AI or CADe device. No MRMC study was performed or is relevant for this type of 510(k) submission.
6. If a standalone (i.e. algorithm only without human-in-the-loop performance) was done:
- Not applicable. This is an X-ray detector, not an algorithm, so "standalone performance" in the context of AI is not relevant. Its "standalone performance" is its technical image acquisition capabilities.
7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.):
- The ground truth for this device's "performance" is based on quantitative physical measurements (e.g., Modulation Transfer Function (MTF) to assess spatial resolution, Detective Quantum Efficiency (DQE) to assess dose efficiency, electrical and mechanical tests) according to recognized industry standards (e.g., IEC standards, as implied by "complies with standards and regulations").
- The document specifically mentions "BE and MTF are comparable to the predicate device." BE likely refers to "Beam Energy" or some other physical characteristic.
8. The sample size for the training set:
- Not applicable. This is a hardware device. It does not use a "training set" in the machine learning sense. Its design and validation are based on engineering principles and physical testing.
9. How the ground truth for the training set was established:
- Not applicable. As above, no training set for machine learning is involved.
Ask a specific question about this device
(31 days)
The XRoad 4343 F MED, when used with a radiographic imaging system, is indicated for use in generating radiographic images of human anatomy for diagnostic X-ray procedures, wherever conventional screen-film (SF), digital radiography (DR), or computed radiography (CR) systems may be used. It is not intended for mammographic use.
The XRpad 4343 F MED is a lightweight, cassette-sized, flat panel X-ray detector for digital radiography. The X-ray detector consists of an amorphous silicon flat panel with a directly deposited CsI:T1 scintillator and dedicated read-out, scan, and control electronics, all packaged in a carbon-fiber and aluminum enclosure. The outside dimensions of the detector are 460 mm × 15 mm, which fits into a standard X-ray cassette Bucky. The active area is 430 mm × 430mm at a pixel pitch of 100um. The detector can be integrated into a fixed room X-ray system to enable digital radiography.
The PerkinElmer XRpad 4343 F MED Flat Panel Detector is a digital radiography device. The acceptance criteria and supporting study details are as follows:
1. Table of Acceptance Criteria and Reported Device Performance
The provided document does not explicitly list quantitative acceptance criteria in a dedicated table format with corresponding reported performance for the XRpad 4343 F MED. Instead, it relies on demonstrating equivalence to a predicate device (XRpad 4336 MED, K140551) through comparable non-clinical performance values and the applicability of the predicate's clinical data.
However, key comparative performance characteristics are presented:
| Characteristic | Acceptance Criteria (Predicate Device K140551) | Reported Device Performance (XRpad 4343 F MED) |
|---|---|---|
| Intended Use / | Same as predicate | Same |
| Indications for Use | ||
| Customer Applications | Single energy, single shot RAD in 100 µm | Same |
| Panel | Single substrate amorphous silicon active TFT/diode array | Same |
| Scintillator | Direct deposition CsI:T1 | Same |
| Pixel pitch | 100 µm | Same |
| Limiting resolution | 5 lp/mm | Same |
| Binning capable | 2 × 2 binning for 200 µm | Same |
| Housing material | Aluminum with carbon-fiber | Same |
| X-ray synchronization interface | Dedicated trigger in/out signal lines or Automatic Exposure Detection | Same |
| Software | Windows OS | Same |
| DQE (Detective Quantum Efficiency) | Comparable to predicate device | Comparable to predicate device |
| MTF (Modulation Transfer Function) | Comparable to predicate device | Comparable to predicate device |
Differences noted for the proposed device (not framed as acceptance criteria but as distinguishing features from the predicate):
- Pixel matrix: 3556 × 4320 pixels (Predicate) vs. 4318 × 4320 pixels (Proposed)
- Active area: 355 mm × 430 mm (Predicate) vs. 430 mm × 430 mm (Proposed)
- External dimensions: 384 mm × 460 mm × 15 mm (Predicate) vs. 460 mm × 460 mm × 15 mm (Proposed)
- Weight: Approximately 4 kg (Predicate) vs. Approximately 5 kg (Proposed)
- Communication interface: Gb Ethernet or 802.11n WiFi (Predicate) vs. Gb Ethernet, No wireless capability (Proposed)
- Power: External power supply or battery (Predicate) vs. External power supply (Proposed)
- Battery capacity: 53Wh (Predicate) vs. No battery option (Proposed)
The foundational acceptance criterion is "Substantial Equivalence" to the predicate device, demonstrated by showing that the proposed device has comparable performance and operational standards, produces images of equivalent diagnostic quality, and that potential hazards have been studied and controlled.
2. Sample size used for the test set and the data provenance:
- The document states that a new clinical study was not required for the XRpad 4343 F MED device.
- Instead, the predicate device, XRpad 4336 MED, was cleared using clinical data derived from testing to support K122495 (XRD 1622 AP3 MED).
- The document implies that the "clinical study data submitted in K122495" served as the basis for the clinical evaluation, but it does not provide details on the sample size or data provenance (e.g., country of origin, retrospective/prospective nature) for K122495. It only states that this data was deemed "applicable" to the XRpad 4336 MED and, through non-clinical testing, also applicable to the XRpad 4343 F MED.
3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts:
- The document does not provide information on the number of experts or their qualifications, as it refers to a prior 510(k) submission (K122495) for clinical data.
4. Adjudication method (e.g., 2+1, 3+1, none) for the test set:
- The document does not provide information on the adjudication method, as it refers to a prior 510(k) submission (K122495) for clinical data.
5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
- No MRMC comparative effectiveness study was done or reported in this 510(k) summary for the XRpad 4343 F MED, nor does it involve AI assistance. This device is a flat panel x-ray detector, not an AI-powered diagnostic tool. The document relies on demonstrating non-inferiority/equivalence to a predicate physical device.
6. If a standalone (i.e. algorithm only without human-in-the loop performance) was done:
- This question is not applicable as the XRpad 4343 F MED is a hardware device (flat panel detector) for capturing radiographic images, not a standalone algorithm.
7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.):
- The document does not explicitly state the type of ground truth used for the clinical data referenced from K122495.
- However, given the nature of radiographic imaging devices, it is highly probable that the ground truth for the clinical data in K122495 would have been established through expert radiological interpretation and potentially confirmed by pathology or clinical outcomes, but this is not confirmed in the provided text.
8. The sample size for the training set:
- The document does not mention any training set for the XRpad 4343 F MED, as it is a hardware device and not an algorithm requiring a training set in the AI/machine learning sense.
9. How the ground truth for the training set was established:
- This question is not applicable as no training set (in the context of algorithms) is mentioned or used for this device.
Ask a specific question about this device
(150 days)
The XRpad 4336 MED, when used with a radiographic imaging system, is indicated for use in generating radiographic images of human anatomy for diagnostic X-ray procedures, wherever conventional screen-film (SF), digital radiography (DR), or computed radiography (CR) systems may be used. It is not intended for mammographic use.
The XRpad 4336 MED is a flat panel X-ray detector consisting of an amorphous silicon panel with a directly deposited CsI:Tl scintillator. The XRpad 4336 MED detector has an active area of 43.2cm x 35.5cm at a pixel pitch of 100μm. Data and control communication is accomplished via a Gigabit Ethernet interface or 802.11n WiFi. The detector can be integrated into a fixed room X-ray system to enable digital radiography. The XRpad 4336 MED detector is designed to work with any X-ray system (consisting of an X-ray source, generator, collimator, and positioner) intended for use in generating radiographic images of human anatomy for diagnostic X-ray procedures. Applicable detector parameters, such as dynamic range, exposure time range, energy range, image size, resolution, detective quantum efficiency, etc are designed to support the necessary compatibility.
The provided text does not contain detailed information about the acceptance criteria or a specific study proving the device meets those criteria, as typically found in a clinical performance study. This document is a 510(k) summary, which focuses on demonstrating substantial equivalence to a predicate device rather than presenting a detailed clinical study with performance metrics.
However, based on the available information, here's what can be inferred and what is explicitly stated:
1. A table of acceptance criteria and the reported device performance:
The document states that the device successfully completed "internal nonclinical testing" and "complies with standards and regulations such as UL and IEC." It also mentions that the proposed device and predicate device "have been shown to produce images of equivalent diagnostic quality."
-
Acceptance Criteria (Inferred from regulatory compliance and substantial equivalence claim):
- Compliance with UL and IEC standards (general safety and performance).
- Produce images of equivalent diagnostic quality to the predicate device.
- Meet applicable detector parameters such as dynamic range, exposure time range, energy range, image size, resolution, and detective quantum efficiency (implied as supporting compatibility).
-
Reported Device Performance:
- Successfully completed internal nonclinical testing.
- Complies with UL and IEC standards.
- Produces images of equivalent diagnostic quality to the predicate device.
- (Specific numerical performance metrics for acceptance criteria like DQE, resolution, etc., are not provided in this summary.)
2. Sample size used for the test set and the data provenance (e.g. country of origin of the data, retrospective or prospective):
The document explicitly states: "No clinical studies were conducted in support of the XRpad 4336 MED as agreed upon during Pre-Submission discussions with the Agency for the predicate device (K122495). The conduct of a clinical concurrence study was deemed unnecessary to demonstrate substantial equivalence."
Therefore, there is no test set, sample size, or data provenance from a clinical study to report. The evaluation was based on nonclinical testing and comparison to engineering specifications and performance of the predicate device.
3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts:
Since no clinical studies were conducted and no test set with human-interpreted ground truth was used for a direct performance claim, this information is not applicable and not provided.
4. Adjudication method (e.g. 2+1, 3+1, none) for the test set:
Not applicable, as no clinical test set was used.
5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
Not applicable. The device is a flat panel X-ray detector, not an AI-powered diagnostic tool. No MRMC study was performed, and no AI assistance is mentioned.
6. If a standalone (i.e. algorithm only without human-in-the loop performance) was done:
Not applicable, as the device is hardware (an X-ray detector) and not an algorithm.
7. The type of ground truth used (expert consensus, pathology, outcomes data, etc):
For the nonclinical testing mentioned, the "ground truth" would be engineering specifications and measurements against established physical standards and benchmarks, rather than clinical ground truth like pathology or expert consensus. The primary "ground truth" for showing equivalence was the performance data and specifications of the predicate device (XRD 1622 AP3 MED X-ray system).
8. The sample size for the training set:
Not applicable, as this is hardware, not an algorithm requiring a training set.
9. How the ground truth for the training set was established:
Not applicable.
Ask a specific question about this device
(259 days)
The PerkinElmer 226 Sample Collection Devices are intended to be used as a medium to collect and transport whole blood specimen spots to a laboratory, in newborn screening. The device includes a tear-apart form for the collection of demographic information.
PerkinElmer 226 Sample Collection Device is designed to provide a uniform surface for the collection of blood spots. A drop of blood is applied to the filter paper and allowed to soak through the paper. The sample is then air dried and sent to a laboratory for further analysis. The PerkinElmer 226 Sample Collection contains Ahlstrom 226 filter paper that is made from 100% pure cotton linters with no wet-strength additives added and conforms to the Recognized Standard CLSI LA4-A4. The Ahlstrom 226 filter paper has four performance characteristics that can be assessed with lysed or intact red blood cells; blood absorption time, blood spot diameter, serum absorption volume and homogeneity. Physical properties of the Ahlstrom 226 filter paper monitored during manufacturing are basis weight, pH and ash content. Basis weight should be 110 lb ± 5% per ream (179 g/m2 ± 5%). A ream is defined as 500 sheets 24" x 36" (ASTM D646-96). The pH should be 5.7 to 7.5 (Test method ISO 6599:1981). Ash percent limit is a maximum of 0.1% (Test method A of ASTM D586-97a). Ahlstrom (manufacturer of paper) name and lot number appears on the PerkinElmer 226 Sample Collection Device along with a PerkinElmer specific lot identifier.
1. Table of acceptance criteria and reported device performance:
| Performance Characteristic | Acceptance Criteria (from CLSI LA4-A4) | Reported Performance (PerkinElmer 226 Sample Collection Device) |
|---|---|---|
| Serum retention volume (1.8-inch paper punch) | 1.37 - 1.71 µL | Ahlstrom paper lot 1: 1.477 µLAhlstrom paper lot 2: 1.443 µL |
| Homogeneity (spot-to-spot and sheet-to-sheet variability) | p-value > 0.05 (F-test for equivalence of mean values) | Ahlstrom paper lot 1: p=0.937Ahlstrom paper lot 2: p=0.607 |
| Diameter of dried blood aliquot circle | 15 - 17 mm (left to right and top to bottom) | Ahlstrom paper lot 1: 15.98 mmAhlstrom paper lot 2: 16.75 mm |
| Absorption time for 100 µL blood aliquot | 5 - 30 seconds | Ahlstrom paper lot 2: 12.74 seconds (Lot 1 data missing from document) |
| Printing ink and printing process interference | No clinically significant interference | No clinically significant interference observed with analytic test procedures of four representative newborn screening assays and with spiked samples for specific kits. |
| Clinical performance (relative to predicate) | Non-significant clinical difference in population median values of 2-10% for all 12 analytes tested. | Achieved: "non-significant clinical difference in population median values of 2-10% for all 12 analytes tested." |
2. Sample size used for the test set and data provenance:
- Physical Properties (Absorption capacity, homogeneity, diameter, absorption time): The exact sample size for each physical test (e.g., number of punches, number of blood aliquots) is not explicitly stated. The tests were performed on "Ahlstrom paper lot 1" and "Ahlstrom paper lot 2." The provenance is internal testing by the manufacturer or an associated entity (Ahlstrom).
- Clinical Performance (Comparison to predicate):
- Sample Size: N=2000 per device type (meaning 2000 samples collected with the PerkinElmer 226 and presumably 2000 with the Whatman 903 for comparison, though this is implied rather than explicitly stated as a direct paired comparison).
- Data Provenance: Retrospective data provided by a U.S. newborn screening laboratory. The data was gathered "during the process of transitioning from the Whatman 903 Specimen Collection Paper to the PerkinElmer 226 Sample Collection Device."
3. Number of experts used to establish the ground truth for the test set and their qualifications:
This device is a sample collection medium, not an AI or diagnostic device that requires expert interpretation for ground truth establishment. The "ground truth" for its performance is based on established physical properties and analytical performance without human subjective assessment.
4. Adjudication method for the test set:
Not applicable. This is not a study requiring adjudication of expert interpretations. The device's performance is assessed based on objective physical measurements and analytical results.
5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
Not applicable. This is a medical device for blood specimen collection, not an AI system or diagnostic tool requiring human reader involvement for performance assessment in an MRMC study.
6. If a standalone (i.e. algorithm only without human-in-the-loop performance) was done:
Not applicable. This is a physical sample collection device, not an algorithm.
7. The type of ground truth used:
- Physical Properties: The ground truth for the physical properties (absorption capacity, homogeneity, diameter, absorption time) is based on quantitative measurements against the pre-defined ranges and statistical criteria specified in the CLSI LA4-A4 standard.
- Clinical Performance: The "ground truth" for clinical performance was based on the analytical results of specific newborn screening assays (NeoGram Amino Acids and Acylcarnitines Tandem Mass Spectrometry test system, AutoDELFIA 17OHP, IRT and T4 kits, Neonatal GALT kit, Neonatal Biotinidase kit, Hemoglobinopathy data) from a U.S. newborn screening laboratory, compared between the new device and the predicate. The outcome was the measured concentration/value of the analytes.
8. The sample size for the training set:
Not applicable. This device does not involve machine learning or AI, so there is no "training set."
9. How the ground truth for the training set was established:
Not applicable, as there is no training set for this type of device.
Ask a specific question about this device
(104 days)
The XRD 1622 AP3 MED, when used with a radiographic imaging system, is indicated for use in generating radiographic images of human anatomy for diagnostic X-ray procedures, wherever conventional screen-film (SF), digital radiography (DR), or computed radiography (CR) systems may be used. It is not intended for mammographic use.
The XRD 1622 AP3 MED is a flat panel x-ray detector consisting of an amorphous silicon panel with a directly deposited CsI:Tl scintillator. The XRD 1622 AP3 MED detector has an active area of 41cm x 41cm at a pixel pitch of 200μm. Data and control communication is accomplished via a Gigabit Ethernet interface. The detector can be integrated into a fixed room x-ray system to enable digital radiography.
Here's an analysis of the provided text, focusing on the acceptance criteria and study information for the PerkinElmer XRD 1622 AP3 MED Flat Panel Detector:
Acceptance Criteria and Device Performance Study
The primary study presented aims to demonstrate substantial equivalence to a predicate device, focusing on "equivalent diagnostic quality" of the generated images.
1. Acceptance Criteria and Reported Device Performance
| Acceptance Criteria Category | Acceptance Criteria | Reported Device Performance |
|---|---|---|
| Image Quality / Diagnostic Equivalence | Produce images of equivalent diagnostic quality compared to the predicate device. | "The PerkinElmer XRD 1622 AP3 MED flat panel detector has successfully completed external testing in actual user testing facility and was found to produce images of equivalent diagnostic quality in a study of 30 image pairs of different anatomical regions reviewed by three board certified radiologists." "The proposed device and predicate device (flat panel detector of the predicate) both utilize similar technology and materials, are similar in design and construction, and have been shown to produce images of equivalent diagnostic quality in a clinical setting." |
| Non-Clinical Testing / Specifications | Meet internal specifications, standards, and regulations (UL, IEC). Conform to product safety, radiology, and imaging standards. | "The PerkinElmer XRD 1622 AP3 MED flat panel detector has successfully completed internal nonclinical testing, complies with standards and regulations such as UL and IEC. The device has completed verification and validation testing to confirm it meets the specifications and operates as planned. Tests included image quality test with internal experts. The product, manufacturing and development processes have been showen to conform to product safety, radiology and imaging standards." |
Note: The provided document is a 510(k) summary, which focuses on demonstrating substantial equivalence to a predicate device rather than setting explicit quantifiable performance metrics as "acceptance criteria" in the same way a de novo submission might. The key acceptance criterion here is the subjective assessment of "equivalent diagnostic quality."
2. Sample Size and Data Provenance for the Test Set
- Sample Size for Test Set: 30 image pairs.
- Data Provenance: "external testing in actual user testing facility." This implies prospective data collection, possibly within a clinical environment, specifically for the purpose of this evaluation. The country of origin is not explicitly stated.
3. Number of Experts and Qualifications for Ground Truth
- Number of Experts: Three.
- Qualifications of Experts: "board certified radiologists." The number of years of experience is not specified.
4. Adjudication Method for the Test Set
The adjudication method is not explicitly stated. It mentions the images were "reviewed by three board certified radiologists," but it doesn't detail how their opinions were combined or resolved if there were disagreements (e.g., 2+1, 3+1, majority vote, etc.). It simply concludes they "found to produce images of equivalent diagnostic quality."
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
- Was an MRMC study done? Not explicitly in the sense of comparing human readers with vs. without AI assistance. The study described is a comparison of two different imaging devices (the new device vs. the predicate device), with human readers assessing the image quality from both. It evaluates the diagnostic equivalence of the devices, not the improvement of human readers with AI assistance.
- Effect Size of Human Readers' Improvement with AI vs. without AI assistance: Not applicable, as this type of study was not conducted or reported.
6. Standalone (Algorithm Only) Performance Study
This device is a hardware component (Flat Panel Detector), not an AI algorithm. Therefore, a "standalone algorithm only" performance study is not applicable. The device's performance is intrinsically linked to its ability to capture images that can be interpreted by humans.
7. Type of Ground Truth Used
The ground truth for the comparison was expert consensus / subjective evaluation of diagnostic quality by three board-certified radiologists. They assessed the "equivalent diagnostic quality" of images produced by the new device compared to the predicate device.
8. Sample Size for the Training Set
- Training Set Sample Size: Not applicable. This device is a hardware component (a flat panel detector), not an AI-driven software or algorithm that requires a "training set" in the conventional machine learning sense. The device's design and engineering are based on established physics and imaging principles, not trained on a dataset.
9. How the Ground Truth for the Training Set Was Established
- Ground Truth for Training Set Establishment: Not applicable for the same reasons as #8. The "training" for such a device is its engineering, calibration, and adherence to physical principles and manufacturing specifications.
Ask a specific question about this device
(244 days)
The Neobase Non-derivatized MSMS reagent kit (for use on the PerkinElmer TQD MSMS Screening System) is intended for the measurement and evaluation of amino acids, succinylacetone, free carnitine, and acylcarnitine concentrations from newborn heel prick blood samples dried on filter paper. Quantitative analysis of these analytes (Table 1) and their relationship with each other is intended to provide analyte concentration profiles that may aid in screening newborns for metabolic disorders.
The measurement of amino acids, succinylacetone, free carnitine, and acylcarnitines with the NeoBase assay involves extraction of dried blood spots from newborns with a solution containing stable-isotope labeled internal standards and analysis using a tandem mass spectrometry (MSMS) system. The each analyte relative to their response of internal stable-isotope labeled corresponding standard is proportional to analyte concentration.
The device being described is the NeoBase Non-derivatized MSMS Kit, intended for the measurement and evaluation of amino acid, succinylacetone, free carnitine, and acylcarnitine concentrations from newborn heel prick blood samples dried on filter paper. This quantitative analysis aids in screening newborns for metabolic disorders.
The study presented is a non-clinical study comparing the performance of the NeoBase Non-derivatized MSMS kit on the PerkinElmer TQD Triple Quadrupole Mass Spectrometer System (PerkinElmer TQD platform) against its predicate devices, the MS2 and PerkinElmer Quattro Micro platforms (QMicro). The goal was to demonstrate substantial equivalence.
Here's the breakdown of the acceptance criteria and study information:
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state "acceptance criteria" as a set of predefined thresholds. Instead, it demonstrates equivalence to predicate devices by showing comparable performance characteristics. The key performance metrics evaluated were:
- Precision (Imprecision Percent Coefficient of Variation - %CV): Lower %CV indicates higher precision.
- Recovery (Mean % Recovery and 95% Confidence Interval): Indicates the accuracy of analyte measurement.
- Measurable Ranges: The range over which the device can accurately quantify analytes, ensuring coverage of clinically significant levels.
- Method Correlation (Ratio of Measured Concentration): Comparing the TQD platform with predicate devices (MS2 and QMicro). A ratio of 1.0 indicates equivalent concentration measurements.
- Clinical Correlation (Percent Agreement in clinical determinations): How well the TQD platform agrees with predicate platforms in classifying samples above or below clinical cutoffs.
- Detection of True Positive Samples: The ability of the device to correctly identify known positive cases.
Here's a summary of the reported device performance, focusing on the TQD platform's comparison to predicate devices:
| Performance Characteristic | Acceptance Criteria (Implied: Comparable to Predicate) | NeoBase Non-derivatized MSMS Kit (TQD Platform) Performance (as compared to MS2/QMicro) |
|---|---|---|
| Precision (Average Total Imprecision %CV) | Should be comparable to or better than predicate devices. | Amino Acids: Generally around 10-18% for TQD, similar to or slightly better than MS2/QMicro (e.g., ALA 10, ARG 10, MET 18, TYR 8, VAL 12). Carnitines/Acylcarnitines: Not explicitly provided for all, but overall implied to be adequate based on predicate comparison. |
| Recovery (Mean % Recovery) | Should be comparable to predicate devices. | Amino Acids: Ranges from 57% (SA) to 104% (C0) for TQD, generally comparable to MS2/QMicro. 95% CI also presented. |
| Measurable Ranges | Should cover all clinically significant ranges. | For all analytes, the TQD range (µM) includes or extends beyond the "Cutoff Range (µM)", demonstrating sufficiency for clinical use. |
| Method Correlation (Mean Ratio of Measured Concentration) | Ratios close to 1.0 (indicating statistical equivalence). | MS2/TQD: Ratios ranged from 0.89 to 1.09, with small variation, indicating statistical equivalence. QMicro/TQD: Ratios ranged from 0.92 to 1.08, with small variation, indicating statistical equivalence. |
| Clinical Correlation (% Agreement) | High percentage agreement with predicate devices. | All Analytes: Ranged from 99.2% to 100.0% agreement between TQD and MS2/Sciex platforms in clinical determinations. |
| Detection of True Positive Samples | 100% agreement with predicate devices in detection. | 100% agreement for all 17 true positive samples (representing 14 disorders) between TQD and MS platforms. |
2. Sample Size Used for the Test Set and Data Provenance
- Non-clinical (Analytical) Test Set:
- Precision and Recovery: The tables provided (5.3, 5.4, 5.5) show averaged data, but the explicit number of samples/replicates isn't detailed for each specific test. However, the method correlation section states that "enriched samples (five levels) was analyzed (as singlicates of each level) for 16 runs to provide a total of 80 individual measurements" for each analyte on each platform.
- Method Correlation: Data from "dried blood spots enriched with the analytes of interest," specifically "5 levels times 5 runs per analyte" resulted in 25 means per platform for each analyte (80 individual measurements total, as above).
- Clinical Test Set:
- Clinical Correlation (Percent Agreement): 2499 random newborn screening specimens (presumptive negative data set) and 17 specimens with true positive diagnoses. Some analytes specify 2598 total observations (2499 + 80 individual measurements from enriched samples for other tests?), while others specify 2518* (2499 presumptive negatives + 19 true positives, including newly acquired NKH and H-ALA samples mentioned in footnote).
- True Positive Samples: 17 samples with true positive diagnoses representing 14 disorders.
- Data Provenance: The document does not specify the country of origin for the samples. It mentions "newborn heel prick blood samples dried on filter paper," which is a standard collection method. The data is retrospective in the sense that these were pre-existing biological samples used for evaluation. It's not a prospective collection of new patients for this specific study.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications
This study focuses on diagnostic device performance (quantitative measurements of analytes) rather than interpretive tasks that would typically require human expert adjudication of images or complex clinical scenarios.
- Analytical Performance (Precision, Recovery, Measurable Ranges, Method Correlation): Ground truth is established by the known concentrations of analytes in the spiked/enriched samples, and the quantitative measurements determined by specialized laboratory equipment (Mass Spectrometers). No human experts are involved in establishing this type of ground truth.
- Clinical Correlation and True Positive Samples: The "ground truth" for the 17 true positive samples is referred to as "true positive diagnoses." The document does not specify how these diagnoses were established (e.g., whether by pathology, genetic testing, or clinical consensus) nor does it mention the number or qualifications of experts involved in these initial diagnoses.
4. Adjudication Method for the Test Set
Not applicable. This is an analytical/quantitative device performance study rather than an interpretive study requiring human adjudication. The "agreement" for clinical correlation refers to the concordance between the numerical results of the TQD platform and the predicate platforms against established clinical cutoffs, not human expert consensus on a diagnosis.
5. If a Multi Reader Multi Case (MRMC) Comparative Effectiveness Study was done
No, an MRMC comparative effectiveness study was not done. This study is evaluating the analytical performance and clinical correlation of a laboratory diagnostic assay, not a device that requires human interpretation of outputs.
6. If a Standalone Study (algorithm only without human-in-the-loop performance) was done
Yes, this is essentially a standalone (algorithm only) study. The "device" is a reagent kit used on an automated mass spectrometry system. The study compares the quantitative results generated by the TQD platform (with the NeoBase kit) directly to the predicate MS2 and QMicro platforms (also using the NeoBase kit) without a human-in-the-loop interpretation step being evaluated as part of the primary outcome for device clearance. The output is a numerical concentration.
7. The Type of Ground Truth Used
- Analytical Performance: Ground truth is based on known concentrations in control and enriched samples. These are prepared by spiking analytes into a matrix (dried blood spots) at specific, verifiable concentrations.
- Clinical Correlation and True Positive Samples:
- For the 2499 random newborn screening specimens, the "ground truth" for clinical "agreement" is whether the analyte concentrations fall above or below their respective clinical cutoffs (as determined by the predicate device).
- For the 17 true positive samples, the ground truth is established clinical diagnoses of metabolic disorders ("true positive diagnoses").
8. The Sample Size for the Training Set
The document describes evaluation of the device performance, not the training of an AI algorithm requiring a specific "training set." This device is a reagent kit for a mass spectrometry system, not an AI/ML software device in the typical sense. Therefore, there is no explicit "training set" in the context of an AI model.
Historically, the predicate devices (MS2 and QMicro) and their associated kits would have undergone extensive validation and optimization (which could be conceptually analogous to a training phase, but for analytical chemistry rather than AI). The current study is demonstrating the equivalence of the NeoBase kit on a new platform (TQD) to these previously validated systems.
9. How the Ground Truth for the Training Set was Established
As there is no distinct "training set" for an AI algorithm in this context, this question is not applicable. The methods used in developing and validating the NeoBase kit and its use on mass spectrometry platforms would involve many iterations of experiments to establish linearity, accuracy, precision, and other analytical specifications. This process relies on robust analytical chemistry principles and reference materials with known concentrations, rather than a "ground truth" established by human experts for AI training.
Ask a specific question about this device
(149 days)
The GSP Neonatal GALT kit is intended for the quantitative determination of r ne Sose - 1-phosphate uridyl transferase (GALT) activity in blood specimens dried on filter paper as an aid in screening newborns for classical galactosemia caused by GALT deficiency using the GSP™ instrument.
The GSPTM Neonatal GALT assay is an adaptation of the quantitative enzymatic assay of Beutler and Baluda. The fluorescence is measured with the GSP Instrument using an excitation wavelength of 355 nm and an emission wavelength of 460 nm. The GSP Neonatal GALT assay uses prompt fluorescence technology.
Here's a breakdown of the acceptance criteria and study information for the GSP Neonatal GALT kit, based on the provided 510(k) summary:
Acceptance Criteria and Device Performance
The provided document does not explicitly state pre-defined acceptance criteria in terms of specific thresholds for precision, linearity, detection limits, or agreement with the predicate device. Instead, it presents the results of these performance characteristics studies and a comparison with the predicate device.
However, based on the intent of the comparison study with the predicate device, we can infer some implicit acceptance goals relating to agreement with the established method. The screening performance tables directly compare the classification of samples between the new device and the predicate.
Inferred Acceptance Criteria & Reported Device Performance:
| Performance Metric | Inferred Acceptance Criteria (Implicit) | Reported Device Performance |
|---|---|---|
| Precision (CV%) | Values within acceptable clinical laboratory ranges for quantitative assays, typically demonstrating good reproducibility. (Specific quantitative targets for CV% are not stated, but results are presented to demonstrate low variability across multiple conditions.) | Within-Run Variation: CV% range from 3.0% to 12.8%Within-Lot Variation: CV% range from 4.9% to 14.3%Total Variation: CV% range from 5.2% to 15.9% |
| Linearity | Demonstrated linearity across the expected measuring range for clinical utility. | Linear throughout the measuring range of 2.5 U/dL to 25 U/dL. Maximum observed difference between linear and 3rd order regression models is -2.6% for GALT activities >4 U/dL, and max absolute difference of 0.07 U/dL for GALT activities ≤ 4 U/dL. |
| Detection Limit (LoQ) | Ability to accurately quantify GALT activity at low concentrations relevant for screening GALT deficiency. | LoB: 1.6 U/dLLoD: 2.5 U/dLLoQ: 2.5 U/dL (with total CV ≤ 20%) |
| Analytical Specificity | No significant interference from common endogenous substances or therapeutic compounds (e.g., icteric, lipemic samples, ascorbic acid, galactose). Minimal or explainable interference from other relevant substances (e.g., glutathione, GAL-1-P) at clinically relevant concentrations. | No interference from icteric (bilirubin ≤ 40 mg/dL), lipemic (Intralipid ≤ 1000 mg/dL), ascorbic acid (≤ 3 mg/dL), or galactose (≤ 50 mg/dL). Glutathione: Interference (decrease up to 63%) above 18.8, 37.5, 56.3 mg/dL at 3, 6, 12 U/dL GALT activity respectively.GAL-1-P: No effect on 3 U/dL samples; interference (decrease up to 37%) on 6 and 12 U/dL samples at 12.5 mg/dL.Total protein (HSA): No effect on 12 U/dL samples; interference (increase up to 30%) on 3 and 6 U/dL samples above 3000 mg/dL. |
| Comparison with Predicate - Screening Performance (using 0.5th percentile cut-off) | High overall agreement, positive percent agreement, and negative percent agreement with the predicate device (NG-1100/4100), demonstrating comparable screening classification. | Overall percent agreement: 99.6% (CI 99.3%-99.9%)Positive percent agreement: 92.9% (CI 83.9%-100%)Negative percent agreement: 99.8% (CI 99.5%-100%) |
| Comparison with Predicate - Screening Performance (using 1.0st percentile cut-off) | Similarly high agreement metrics at a different cut-off. | Overall percent agreement: 99.3% (CI 98.9%-99.7%)Positive percent agreement: 84.9% (CI 74.3%-95.5%)Negative percent agreement: 99.6% (CI 99.3%-99.9%) |
| Comparison with Predicate - Screening Performance (using 1.5th percentile cut-off) | Similarly high agreement metrics at a different cut-off. | Overall percent agreement: 98.9% (CI 98.4%-99.4%)Positive percent agreement: 83.6% (CI 73.5%-93.7%)Negative percent agreement: 99.3% (CI 99.0%-99.7%) |
Study Information
-
Sample size used for the test set and the data provenance:
- Precision Study: 8 samples (S1-S8), each tested with 206-216 replicates (n values).
- Detection Limit Study:
- LoB: 83 GALT deficient samples
- LoD: 351 determinations of five low-level samples
- LoQ: 209 replicates
- Analytical Specificity Study: Whole blood samples with three different GALT activities (approx. 3, 6, and 12 U/dL) were tested. Hematocrit effect was tested on three whole blood samples (approx. <1, 6, and 15 U/dL). Number of replicates (n) for hematocrit testing ranged from 10 to 12 for each hematocrit level.
- Comparison Studies (Screening Performance): A total of 2205 infants.
- Routine screening specimens: 2146 samples.
- Retrospective low GALT activity specimens: 33 samples.
- Data Provenance: Not explicitly stated, but the comparison study was performed "in a routine screening laboratory," implying real-world clinical samples. The study does not specify the country of origin, but the submission is from Finland with a US contact, suggesting potential international collaboration or a study done in a US lab, though this is not definitively stated. The use of "routine" and "retrospective" specimens indicates it's a mix of prospective (routine) and retrospective (low GALT activity) data.
-
Number of experts used to establish the ground truth for the test set and the qualifications of those experts (e.g., radiologist with 10 years of experience):
- The ground truth for the comparison study was established by the predicate device (NG-1100/4100 Neonatal GALT kit). Therefore, no human experts were involved in establishing the ground truth for the comparison study. For the underlying clinical truth, the predicate device's performance would have been validated against clinical outcomes or confirmed diagnoses (e.g., pathology, genetic testing), but this information is not provided for the predicate device within this summary.
-
Adjudication method (e.g., 2+1, 3+1, none) for the test set:
- Not applicable. The ground truth was based on the measurement from the predicate device, not on expert adjudication.
-
If a multi reader multi case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
- Not applicable. This is a medical device for quantitative determination of a biomarker, not an AI-assisted diagnostic imaging or interpretation system requiring human readers.
-
If a standalone (i.e. algorithm only, without human-in-the-loop performance) was done:
- Yes, the performance characteristics (precision, linearity, detection limit, analytical specificity) and the comparison study represent the standalone performance of the GSP Neonatal GALT kit (algorithm only). The device's intended use is for quantitative determination using the GSP instrument, which is an automated process without direct human-in-the-loop interpretation that would alter the quantitative result provided by the device. The "screening performance" discussed refers to the device's ability to classify samples based on its quantitative output and a defined cut-off, not a human-AI interaction.
-
The type of ground truth used (expert consensus, pathology, outcomes data, etc.):
- For the comparison study, the "ground truth" (or reference standard) was the predicate device (NG-1100/4100 Neonatal GALT kit). The predicate device's classifications were used to assess the agreement of the new device. For the analytical studies, the ground truth was based on known concentrations or properties of the samples used (e.g., GALT deficient samples for LoB, known GALT activities for linearity and specificity).
-
The sample size for the training set:
- Not applicable. This device is a diagnostic assay kit, not an AI/machine learning model that typically requires a distinct "training set." The performance characteristics and comparison studies are conducted on test sets to validate the device's analytical performance.
-
How the ground truth for the training set was established:
- Not applicable, as there is no training set in the context of an AI/machine learning model. For the assay, the "ground truth" in development would refer to the standards and controls used to establish the assay's biochemical principles and calibrate its measurements.
Ask a specific question about this device
(408 days)
The Neonatal Biotinidase kit is intended for the semiquantitative determination of biotinidase activity in blood specimens dried on filter paper as an aid in screening newborns for biotinidase deficiency.
Biotinidase is found in the blood sample itself. Filter paper disks from newborn dried blood spot samples, calibrators and controls are punched into the wells of a microplate. When biotin substrate reagent containing biotin 6-aminoquinoline (6-AQ) is added to a well containing a punched dried blood spot, the reagent extracts and reconstitutes the proteins and enzymes in the spot. The biotinidase enzyme in the sample cleaves the substrate to biotin and fluorescent 6-AQ The addition of the ethanol stops the reaction and precipitates the proteins to cover the bottom of the well and the extracted spot. The fluorescent product (6-AQ) formed during the reaction is measured with a fluorometer. The biotinidase activity is defined against a calibration curve. The biotinidase activity of the sample is determined by comparing the fluorescence intensity of the sample to a calibration curve.
The Neonatal Biotinidase kit is intended for the semi-quantitative determination of biotinidase activity in blood specimens dried on filter paper as an aid in screening newborns for biotinidase deficiency. The study aimed to demonstrate substantial equivalence to the Astoria-Pacific SPOTCHECK® Biotinidase 50-Hour Reagent Kit (K010844).
1. Table of Acceptance Criteria and Reported Device Performance:
The document describes a method comparison study to establish substantial equivalence rather than explicit acceptance criteria with pre-defined thresholds. However, based on the provided results, the implicit acceptance criteria would involve high positive percent agreement (PPA) and overall percent agreement (OPA) with the predicate device.
| Metric | Acceptance Criteria (Implicit) | Reported Device Performance |
|---|---|---|
| Positive Percent Agreement (PPA) | High agreement with predicate for positive cases | 90.9% (20/22) |
| Overall Percent Agreement (OPA) | High agreement with predicate for all cases | 99.8% ((20+1493)/1516) |
2. Sample Size Used for the Test Set and Data Provenance:
- Test Set Sample Size: 1516 samples.
- Data Provenance: The samples included 1516 newborn dried blood spot specimens representing the US population. The study included:
- 1496 routine screening specimens (prospective, from the US population).
- 20 retrospective specimens diagnosed positive for biotinidase deficiency (provenance not explicitly stated but implies confirmed cases, likely from a US population or referral lab).
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications:
The document does not specify the number or qualifications of experts used to establish the ground truth. It states that 20 retrospective specimens were "diagnosed positive for biotinidase deficiency," implying a clinical diagnosis, but details on who made these diagnoses are not provided. For the routine screening samples, the ground truth is established by the comparison to the predicate device and clinical outcome (diagnosed biotinidase deficiency or not).
4. Adjudication Method for the Test Set:
Not applicable. This study is a method comparison between two assays, not a diagnostic study requiring adjudication against a clinical outcome based on expert review of ambiguous cases. The "ground truth" for the deficient samples appears to be established by prior clinical diagnosis, and routine samples are compared against the predicate as well as clinical diagnosis outcomes.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study:
No, an MRMC comparative effectiveness study was not done. This study is a comparison of two diagnostic assays, not an assessment of human reader performance with or without AI assistance.
6. Standalone (Algorithm Only) Performance Study:
Yes, a standalone study was performed. The device (Neonatal Biotinidase kit) was tested on its own and its results were compared to those from a commercially available predicate device. The performance metrics (PPA, OPA) are based on the direct output of the device.
7. Type of Ground Truth Used:
The ground truth for the 20 positive cases was established by clinical diagnosis ("retrospective specimens diagnosed positive for biotinidase deficiency"). For the routine samples, the "ground truth" for comparison purposes within the study was intrinsically linked to the results of the predicate device and the absence of a biotinidase deficiency diagnosis.
8. Sample Size for the Training Set:
The document does not provide information on a specific training set or its sample size for the Neonatal Biotinidase kit. Kits for in vitro diagnostic (IVD) use typically involve extensive assay development and validation, but specific "training sets" in the machine learning sense are not usually reported in this context. The calibration curve and internal controls are part of the assay's design and standardization.
9. How the Ground Truth for the Training Set Was Established:
As no specific training set in the AI/ML sense is mentioned, information on how its ground truth was established is not provided. The kit's calibration curve is established using six levels of dried blood spots prepared from porcine blood, representing different biotinidase activity levels, which serves as a form of internal reference material.
Ask a specific question about this device
(157 days)
This kit is intended for the quantitative determination of human thyroid stimulating hormone (hTSH) in blood specimens dried on filter paper as an aid in screening newborns for congenital (neonatal) hypothyroidism using the GSP instrument.
The GSPTM Instrument is a fully automated, high throughput batch analyzer for time resolved analysis of samples in microtitration plates. It is in intended for in vitro quantitative / qualitative determination of analytes in body fluids.
The GSP instrument (genetic screening processor) is a fully automated, high throughput batch analyzer for timeresolved and prompt fluorescence analysis of samples in microtitration plates. It is intended for in vitro quantitative and qualitative determination of analytes in body fluids. The GSP instrument and GSP chemistries are for professional use only.
The GSPTM Neonatal hTSH assay is a solid phase, twosite fluoroimmunometric assay based on the direct sandwich technique in which two monoclonal antibodies (derived from mice) are directed against two separate antigenic determinants on the hTSH molecule. Calibrators, controls and test specimens containing hTSH are reacted simultaneously with immobilized monoclonal antibodies directed against a specific antigenic site on the ß hTSH subunit and europium-labeled monoclonal antibodies (directed against a different antigenic site located partly on the B subunit and partly on the a subunit) in assay buffer. The assay buffer elutes hTSH from the dried blood spots on the filter paper disks. The complete assay requires only one incubation step.
DELFIA Inducer dissociates europium ions from the labeled antibody into solution where they form highly fluorescent chelates with components of DELFIA Inducer. The fluorescence in each well is then measured. The fluorescence of each sample is proportional to the concentration of hTSH in the sample.
Here's a summary of the acceptance criteria and study information for the GSP Neonatal hTSH kit, based on the provided 510(k) summary:
1. Table of Acceptance Criteria and Reported Device Performance
| Performance Characteristic | Acceptance Criteria (Implied) | Reported Device Performance |
|---|---|---|
| Precision (TSH) | "Total variation (% CV)" should be acceptably low for assay use. | Using full calibration curve on each plate:- Sample 1 (10.5 µU/mL): 10.1% CV- Sample 2 (23.2 µU/mL): 8.9% CV- Sample 3 (102 µU/mL): 8.5% CV- Sample 4 (241 µU/mL): 8.7% CVUsing one calibration curve valid for 24h:- Sample 1 (10.6 µU/mL): 9.9% CV- Sample 2 (23.4 µU/mL): 8.3% CV- Sample 3 (102 µU/mL): 7.7% CV- Sample 4 (241 µU/mL): 7.9% CV |
| Linearity (TSH) | Should be linear across the intended reportable range. | Linear from 0.66 uU/mL to 375 uU/mL blood. |
| Limit of Blank (LoB) | Acceptably low to distinguish from blank. | 0.96 uU/mL blood. |
| Limit of Detection (LoD) | Acceptably low for clinical application. | 1.31 uU/mL blood. |
| Limit of Quantitation (LoQ) | Lowest concentration with total CV < 20%. | 1.31 uU/mL blood (with total CV < 20%). |
| Analytical Specificity | No significant interference from common interferents or cross-reactants. | No interference from icteric, lipemic, or high hemoglobin samples. Minimal to no cross-reactivity with hFSH, hLH, and hCG at high concentrations. |
| Method Comparison (Overall Agreement) | Good agreement (>95-97%) with predicate device. | Site 1: 98.4% (95% CI: 97.9%-99.0%)Site 2: 98.4% (95% CI: 97.8%-98.9%) |
| Method Comparison (Positive Agreement) | Good agreement (>70%) for positive samples with predicate device. | Site 1: 75.9% (95% CI: 66.1%-85.7%)Site 2: 75.3% (95% CI: 66.0%-84.6%) |
| Method Comparison (Negative Agreement) | Good agreement (>98-99%) for negative samples with predicate device. | Site 1: 99.4% (95% CI: 99.0%-99.8%)Site 2: 99.5% (95% CI: 99.1%-99.8%) |
| Internal Method Comparison (Deming Regression) | Slope near 1, intercept near 0, demonstrating good correlation. | Slope: 0.97 (95% CI: 0.94, 1.01)Intercept: -0.21 (95% CI: -0.37, -0.16) |
Note: The document provides specific performance results but often implies the acceptance criteria through the presentation of these results in the context of predicate device comparison and clinical guidelines (e.g., AAP recommendations for TSH cut-offs).
2. Sample Sizes Used for the Test Set and Data Provenance
-
Precision Study:
- Sample Size: 4 spiked dry whole blood spot samples, run over 23 days in 27 runs, each consisting of 2 plates with 4 replicates per sample. (Specific total number of individual measurements for precision across all samples is not explicitly stated but is substantial: 4 samples * 27 runs * 2 plates * 4 replicates = 864 individual measurements.)
- Data Provenance: The document does not specify the country of origin for these spiked samples. It is implied to be laboratory-generated per NCCLS (CLSI) guidelines. Retrospective or prospective is not specified, but typically, these are prospective internal lab studies.
-
Detection Limit Study:
- LoB Sample Size: 216 blank samples.
- LoD Sample Size: 432 determinations (72 blank and 216 low-level samples included).
- Data Provenance: Not specified, but implied to be prospective internal lab studies following CLSI guidelines.
-
Analytical Specificity (Cross-reactivity) Study:
- Sample Size: Not explicitly stated how many individual samples were used for each interferent, but presented for two different hTSH concentrations for each cross-reactant (hFSH, hLH, hCG).
- Data Provenance: Not specified, likely prospective internal lab studies.
-
Comparison Studies (Site 1 & Site 2):
- Site 1 Test Set Sample Size: 2053 samples (total). 20 diagnosed positive samples.
- Site 2 Test Set Sample Size: 2104 samples (total). 26 known positive samples.
- Data Provenance: Not explicitly stated, but these are likely clinical samples from the sites where the studies were performed. The terms "routine screening and spiked blood spot samples" are used in the Internal Method Comparison, suggesting a mix, but for Site 1 and Site 2 comparisons, they appear to be real-world samples. Retrospective or prospective is not specified, but comparison studies like this often use retrospective collections or samples run in a prospective manner against a standard.
-
Internal Method Comparison:
- Sample Size: N=162 samples.
- Data Provenance: "routine screening and spiked blood spot samples". Not specified by country.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
- No information is provided regarding the number or qualifications of experts used to establish ground truth for the test set.
- For the comparison studies, "diagnosed positive samples" and "known positive samples" are mentioned, suggesting a clinical diagnosis as the implicit ground truth, but the method of diagnosis is not detailed, nor are the experts involved.
4. Adjudication Method for the Test Set
- No adjudication method is described. The comparison studies simply compare the GSP device's classification with that of the predicate device. For "diagnosed positive samples," the diagnosis itself serves as a form of ground truth, but how conflicting diagnoses (if any) were resolved is not stated.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was Done
- No, an MRMC comparative effectiveness study was not done. This device is an in-vitro diagnostic (IVD) test, not an image-reading or human-interpretation device. The studies described focus on the analytical performance of the instrument and kit, and its agreement with a predicate IVD device. Therefore, the concept of "human readers improve with AI vs without AI assistance" does not apply.
6. If a Standalone (Algorithm Only Without Human-in-the-Loop Performance) was Done
- Yes, the studies presented are all standalone (algorithm only) performance. The GSP Instrument and GSP Neonatal hTSH kit are automated systems for quantitative determination of hTSH. The results are generated directly by the instrument and its associated software/algorithm, without human interpretation as part of the primary measurement. Human users operate the instrument, but their "reading" of the result is simply recording the quantitative value provided by the system.
7. The Type of Ground Truth Used
- For the Precision, Linearity, Detection Limit, and Analytical Specificity studies:
- The ground truth is reference values based on known dilutions or spiked concentrations in samples, following recognized laboratory standards (e.g., CLSI documents).
- For the Comparison Studies (Site 1 & Site 2) and Internal Method Comparison:
- The ground truth is primarily based on the results obtained from the predicate device (AutoDELFIA Neonatal hTSH kit), which is an already legally marketed and established method for hTSH screening.
- Additionally, for a subset of samples, "diagnosed positive samples" or "known positive samples" are mentioned, implying clinical diagnosis of congenital hypothyroidism (likely based on follow-up and confirmatory tests) served as a form of clinical ground truth for these specific cases.
8. The Sample Size for the Training Set
- No information is provided about a specific "training set" for the GSP Neonatal hTSH kit or instrument. This is typical for traditional IVD assays, which are developed and validated using analytical methods and comparison to established predicate devices, rather than machine learning algorithms that require distinct training and test sets in the same way. The development and optimization of the assay would involve various experiments, but these are not typically referred to as a "training set" in this context.
9. How the Ground Truth for the Training Set Was Established
- As a specific "training set" is not mentioned in the context of an AI/ML algorithm development, this question is not applicable based on the provided document. The development of such an IVD kit involves extensive analytical characterization and optimization, but not in the framework of machine learning training data.
Ask a specific question about this device
Page 1 of 2