Search Results
Found 4 results
510(k) Data Aggregation
(91 days)
02451
Re: K193103
Trade/Device Name: NeoBase 2 Non-derivatized MSMS Kit Regulation Number: 21 CFR 862.1055
freecarnitine, and acylcarnitines using tandem mass spectrometry |
| Regulation: | 21 CFR 862.1055
The NeoBase™ 2 Non-derivatized MSMS kit is intended for the measurement and evaluation of amino acid, succinylacetone, free carnitine, acylcarnitine, nucleoside and lysophospholipid concentrations (Table 1) with a tandem mass spectrometer from newborn heel prick blood specimens dried on filter paper. Quantitative analytes and their relationship with each other is intended to provide analyte concentration profiles that may aid in screening newborns for metabolic disorders.
Each NeoBase 2 Non-derivatized MSMS kit contains reagents for 960 assays. The kit is designed to be used with NeoBase 2 Non-derivatized Assay Solutions consisting of Neo MSMS Flow Solvent and NeoBase 2 Extraction Solution and NeoBase 2 Succinylacetone Assay Solution.
- NeoBase 2 Internal Standards - 1 vial
- NeoBase 2 Controls Low, High - 3 filter paper cassettes (Whatman, no. 903) containing 3 spots of each level per cassette
- Microplate, U-bottomed - 20 plates
- Adhesive microplate covers - 20 sheets
- Barcode labels for the plates - 30 pcs (10 different barcodes, 3 pcs of each)
- Lot-specific quality control certificate
This kit contains components manufactured from human blood. The source materials have been tested by FDA-approved methods for hepatitis B surface antigen, anti-hepatitis C and anti-HIV 1 and 2 antibodies and found to be negative.
Instruments used: - . QSight® 210 MD Screening System is comprised of:
- QSight® 210 MD Mass Spectrometer
- QSight® HC Autosampler MD
- QSight® Binary Pump MD ●
- Simplicity™ 3Q MD Software
- PerkinElmer MSMS Workstation software ●
Here's a breakdown of the acceptance criteria and the study that proves the device meets them, based on the provided text.
Note: This document describes a medical device for newborn screening. Performance is demonstrated through analytical studies (precision, sensitivity, linearity, interference) and screening performance data comparing it to a predicate device, rather than human-in-the-loop studies common for AI/imaging devices. Thus, several sections typically relevant to AI-based devices (e.g., number of experts, adjudication methods, MRMC studies) are not applicable here.
Acceptance Criteria and Device Performance
The acceptance criteria for the NeoBase 2 Non-derivatized MSMS kit are implied by the comprehensive analytical and screening performance evaluation studies conducted. The goal is to demonstrate that the device performs equivalently to the legally marketed predicate device (NeoBase 2 Non-derivatized MSMS kit, K173568) and provides reliable results for newborn screening.
The reported device performance below highlights key analytical and screening metrics. Specific numerical acceptance criteria are not explicitly stated as hard thresholds (e.g., "CV must be < X%"), but rather the results of the studies are presented to demonstrate acceptable performance comparable to a cleared predicate.
Table of Acceptance Criteria (Implied) and Reported Device Performance
| Acceptance Criterion Area | Study Conducted / Metric | Reported Device Performance and Observation |
|---|---|---|
| Precision | Repeatability, Within-Laboratory, Between-Lot, Between-Instrument Variation (measured as SD and CV%) for all analytes. | Detailed tables provided for each analyte showing: - Repeatability CV%: Generally low (e.g., Ala: 5.3-6.5%, Arg: 3.3-6.0%, C0: 4.3-5.6%, C26:0-LPC: 5.0-16%). Higher CV% for lower concentrations (e.g., Asa Sample 1: 29%, Met Sample 1: 18%, SA Sample 1: 17%, ADO Sample 1: 19%, C26:0-LPC Sample 1: 16%). - Within-Lab CV%: Slightly higher than repeatability, still low (e.g., Ala: 6.3-9.8%, Arg: 5.0-8.3%, C0: 5.0-6.5%). - Between-lot and Between-instrument CV%: Generally low, indicating good consistency. - Total Variation CV%: Overall low, with higher CV% at lower concentrations (e.g., Asa Sample 1: 32%, Met Sample 1: 20%, SA Sample 1: 13%, ADO Sample 1: 17%, C26:0-LPC Sample 1: 23%). |
| Analytical Sensitivity | Lowest measurable analyte concentrations (Limit of Quantitation, LOQ). | Specific analytical sensitivity limits in µmol/L are provided for each analyte. (e.g., Ala: 3.66 µmol/L, Arg: 0.64 µmol/L, C0: 0.18 µmol/L, SA: 0.24 µmol/L). Some entries indicate <LOQ for percentiles in screening performance tables. |
| Linearity | Linear range (lower and upper limits, µmol/L) for all analytes. | Specific linear ranges defined for each analyte (e.g., Ala: 163-1450 µmol/L, Arg: 1.84-359 µmol/L, C0: 7.80-407 µmol/L, SA: 0.24-88.2 µmol/L, C26:0-LPC: 0.22-7.08 µmol/L). |
| Interference | Evaluation of potential interfering substances (endogenous and exogenous). | A list of 12 substances were not found to interfere. Identified Interferents: Sarcosine (Ala), Creatine (Ala, Glu, Leu), L-Asparagine (Orn), L-Lysine (Arg, Gln, Glu), L-Glutamic acid (Met), L-Methionine sulfone (Tyr), Verapamil metabolite D617 (ASA), L-Ornithine (Pro), Albumin (ADO), Intralipid (C24:0-LPC), Chlorhexidine digluconate (C24:0-LPC, C26:0-LPC), Hemoglobin (SA, C24:0-LPC, ADO, Val), Hematocrit (Arg). Note on Interferents: The report concludes that most identified interferences are unlikely to impact routine testing due to physiological concentration ranges or by incorporation into user-established cut-offs. Specific caution provided for Benzocaine (Phe), C5 isomer pivalylcarnitine (C5), C8 isomer valproylcarnitine (C8), and cross-reactivity of isomers/isobars (e.g., C3DC/C4OH, C16:1OH/C17). |
| Reproducibility | Between-Site Reproducibility (measured as SD and CV%) across 2 external sites and 1 internal site. | Detailed tables provided for each analyte showing: - Within-lab CV%: Generally low (e.g., Ala: 5.4-5.9%, Arg: 4.2-6.0%, C0: 5.0-5.7%, C26:0-LPC: 7.9-24%). - Between-lab CV%: Generally low (e.g., Ala: 1.0-3.5%, Arg: 1.2-3.8%, C0: 0.66-2.3%). - Reproducibility CV%: Overall low (e.g., Ala: 5.2-6.5%, Arg: 5.2-6.3%, C0: 5.1-6.1%). Higher CV% for lower concentrations (e.g., Asa Sample 1: 28%, SA Sample 1: 22%, C26:0-LPC Sample 1: 54%). |
| Screening Performance | Comparison of newborn population distributions and positive case detection against predicate device (TQD) using defined percentiles. | Population Distribution (n=2530): Mean, Median, 1st, 10th, 99th, and 99.5th percentiles are provided for all 47 analytes, demonstrating the device's ability to measure concentrations across a typical newborn population range. Confirmed Positive Specimens: - Amino Acid Disorders (AA): QSight detected either all 19 (99th percentile) or 18/19 (99.5th percentile) confirmed cases that were also detected by TQD. - Fatty Acid Oxidation (FAO): QSight detected all 9 (99th/99.5th percentile) or all 4 (low percentile) confirmed cases that were also detected by TQD. - Organic Acid Condition (OA): QSight detected all 16 (99th/99.5th percentile) confirmed cases that were also detected by TQD. - ADA-SCID: QSight detected both 2 confirmed cases (99th/99.5th percentile) that were also detected by TQD. - X-ALD: QSight detected both 2 confirmed cases (99th/99.5th percentile) that were also detected by TQD, with notes on borderline samples. |
Study Details
-
Sample Size and Data Provenance:
- Test Set (Screening Performance): 2530 routine newborn screening specimens.
- Country of Origin: United States (one routine screening laboratory in the US).
- Retrospective/Prospective: The nature of "routine screening specimens" usually implies retrospective use of collected samples for evaluating a new method, but explicit detail (e.g., prospective collection specifically for this study) is not given. However, the study focuses on comparing the new device (QSight) to the predicate (TQD) on these samples, suggesting an evaluation of existing data or new data collected in a routine setting.
- Analytical Studies (Precision, Linearity, Interference):
- Precision: 80 determinations (40 plates x 2 replicates) for repeatability and within-laboratory; 75 determinations (15 plates x 5 replicates x 3 lots) for between-lot; 50 determinations (10 plates x 5 replicates x 2 QSights) for between-instrument.
- Reproducibility: 75 determinations (5 plates x 5 working days x 5 replicates) across 3 labs (1 internal, 2 external).
- Linearity, Sensitivity, Interference: Not explicitly quantified by sample size in the same way as precision. These studies typically use a controlled set of samples spiked with known concentrations or interfering substances.
- Test Set (Screening Performance): 2530 routine newborn screening specimens.
-
Number of Experts and Qualifications for Ground Truth:
- This device is a quantitative assay for measuring analyte concentrations, not an AI/imaging device requiring expert interpretation of images.
- The "ground truth" for the screening performance study are confirmed positive specimens for various metabolic disorders (Amino Acid, Fatty Acid Oxidation, Organic Acid, ADA-SCID, X-ALD). These confirmations are typically established through gold standard clinical diagnostic methods (e.g., genetic testing, enzyme assays, or further biochemical analysis), not through expert panel review of the screening test results themselves. The document states "confirmed positive specimens," implying these are definitive diagnoses.
- Therefore, the concept of "experts" establishing ground truth for the test set, in the way a radiologist reads an image, does not directly apply here.
-
Adjudication Method for the Test Set:
- Not applicable as this is a quantitative diagnostic assay. "Adjudication" typically refers to resolving disagreements among human readers/interpreters, which is not the primary function of this device or study design. The study compares the new device's quantitative results and detection rates against those of a predicate device and known confirmed positive cases.
-
Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study:
- Not applicable. This is not a study comparing human readers with and without AI assistance. It is a comparison of two analytical instruments/kits (the new NeoBase 2 on QSight vs. predicate NeoBase 2 on TQD).
-
Standalone Performance (Algorithm-only without human-in-the-loop):
- Yes, this study is a standalone performance evaluation. The device (NeoBase 2 kit run on QSight MD Screening System) provides quantitative measurements of analytes. The performance tables directly show the device's analytical capabilities (precision, sensitivity, linearity, reproducibility) and its ability to detect known positive cases in a patient population without human interpretation as part of the core measurement. Human expert judgment comes in for confirming the disease in patient specimens (the ground truth), but not in interpreting the raw output of the device itself.
-
Type of Ground Truth Used:
- Confirmed positive specimens: For the screening performance study, the ground truth was based on specimens from individuals clinically confirmed to have the targeted metabolic disorders (Amino Acid Disorders, Fatty Acid Oxidation, Organic Acid Conditions, ADA-SCID, X-ALD). This likely involved definitive diagnostic tests beyond initial screening, such as genetic testing, enzyme activity assays, or more specific quantitative metabolite analysis. This is a form of outcomes data/definitive diagnosis.
-
Sample Size for the Training Set:
- This is a medical device for quantitative measurement, not a machine learning or AI model that requires a "training set" in the conventional sense (i.e., data used to teach an algorithm to learn patterns). The device's performance validation relies on analytical studies to demonstrate its accuracy, precision, and consistency, and then clinical performance by comparing its results to a predicate device and confirmed clinical cases. Therefore, the concept of a training set as understood in AI/ML is not relevant here.
-
How Ground Truth for the Training Set Was Established:
- Not applicable, as there is no "training set" in the context of an AI/ML model for this type of device. The device's performance is intrinsically linked to its chemical reagents and mass spectrometry technology, which are developed and calibrated through standard analytical chemistry procedures.
Ask a specific question about this device
(288 days)
02451
Re: K173568
Trade/Device Name: NeoBase 2 Non-derivatized MSMS Kit Regulation Number: 21 CFR 862.1055
nucleosides, and lysophospholipidsusing tandem mass spectrometry |
| Regulation: | 21 CFR 862.1055
The NeoBase™ 2 Non-derivatized MSMS kit is intended for the measurement and evaluation of amino acid, succinylacetone, free carnitine, acylcarnitine, nucleoside and lysophospholipid concentrations (Table 1) with a tandem mass spectrometer from newborn heel prick blood specimens dried on filter paper. Quantitative analytis of these analytes and their relationship with each other is intended to provide analyte concentration profiles that may aid in screening newborns for metabolic disorders.
Not Found
The provided text describes the acceptance criteria and study results for the NeoBase 2 Non-derivatized MSMS kit.
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state numerical acceptance criteria for the screening performance studies (e.g., minimum sensitivity or specificity targets). Instead, it states that "All verification studies were successfully concluded and met the respective study's predetermined acceptance criteria." The clinical studies for screening performance are presented as agreement between the new device (NeoBase 2) and the predicate device (NeoBase). The agreement is presented as contingency tables (e.g., "Screening positive" vs "Screening negative" for both devices).
The performance is demonstrated by the agreement between the NeoBase 2 Non-derivatized MSMS kit and the predicate device, NeoBase Non-derivatized MSMS kit, in detecting various metabolic disorders in newborn screening. The results are presented in terms of the number of positive and negative screens detected by each device, along with the number of confirmed positive specimens.
Summary of Device Performance (from Tables A, B, C, D):
| Disorder Group | Cut-off Type (Percentile) | NeoBase 2 Screening Positive (with Predicate Positive) | NeoBase 2 Screening Negative (with Predicate Negative) | Total Specimens | Confirmed Positive Specimens (detected by both methods) |
|---|---|---|---|---|---|
| Study 1 | |||||
| Amino acid disorders | 99th | 621 | 1591 | 1751 | 15 |
| Amino acid disorders | 99.5th | 452 | 1645 | 1751 | 15 |
| Amino acid disorders | 1st | 161 | 1687 | 1737 | 1 (OTCD) |
| Fatty acid oxidation | 99th | 801 | 1581 | 1746 | 10 |
| Fatty acid oxidation | 99.5th | 451 | 1661 | 1746 | 10 |
| Fatty acid oxidation | Low Percentile | 1732 | 1386 | 1738 | 2 (CUD) |
| Organic acid condition | 99th | 571 | 1660 | 1751 | 15 |
| Organic acid condition | 99.5th | 361 | 1697 | 1751 | 15 |
| ADA-SCID | 99th | 2 | 1661 | 1738 | 2 |
| ADA-SCID | 99.5th | 2 | 1700 | 1738 | 2 |
| X-ALD | 99th | 2 | 1724 | 1738 | 2 |
| X-ALD | 99.5th | 2 | 1731 | 1738 | 2 |
| Study 2 | |||||
| Amino acid disorders | 99th | 1161 | 2353 | 2648 | 19 |
| Amino acid disorders | 99.5th | 782 | 2474 | 2648 | 18 |
| Amino acid disorders | 1st | 141 | 2571 | 2631 | 2 (OTCD) |
| Fatty acid oxidation | 99th | 1601 | 2326 | 2641 | 12 |
| Fatty acid oxidation | 99.5th | 1081 | 2442 | 2641 | 12 |
| Fatty acid oxidation | Low Percentile | 1581 | 2363 | 2632 | 3 |
| Organic acid condition | 99th | 861 | 2479 | 2642 | 13 |
| Organic acid condition | 99.5th | 422 | 2561 | 2642 | 12 |
| ADA-SCID | 99th | 2 | 2563 | 2631 | 2 |
| ADA-SCID | 99.5th | 2 | 2578 | 2631 | 2 |
| X-ALD | 99th | 2 | 2626 | 2631 | 2 |
| X-ALD | 99.5th | 2 | 2628 | 2631 | 2 |
2. Sample Size Used for the Test Set and Data Provenance
-
Study 1 Sample Size:
- Amino acid disorders, Fatty acid oxidation, Organic acid conditions: 1751 samples (for 99th and 99.5th percentile cut-offs) and 1737-1746 samples (for 1st and low percentile cut-offs).
- ADA-SCID and X-ALD: 1738 samples.
-
Study 2 Sample Size:
- Amino acid disorders, Fatty acid oxidation, Organic acid conditions: 2631-2648 samples.
- ADA-SCID and X-ALD: 2631 samples.
-
Data Provenance: The data was obtained from "routine newborn screening" in "two CLIA-certified state laboratories." The confirmed positive specimens were described as "retrospective" for Study 2. This suggests a retrospective study design using existing samples and accompanying diagnostic information. The country of origin is not explicitly stated but is implied to be the US due to "CLIA-certified state laboratories."
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts
The document does not specify the number or qualifications of experts used to establish the ground truth for the test set. It mentions "confirmed positive specimens," implying a definitive diagnostic process was followed to establish the true disease status of these samples, but details on the experts involved are not provided.
4. Adjudication Method for the Test Set
The document does not describe an adjudication method for the test set, such as 2+1 or 3+1. The acceptance is based on the agreement between the new device and the predicate device, using established cut-offs derived from routine newborn screening data.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
No MRMC comparative effectiveness study was done. This device is a diagnostic kit measuring analyte concentrations, not an AI system assisting human readers. Therefore, the concept of "how much human readers improve with AI vs without AI assistance" is not applicable.
6. Standalone (Algorithm Only Without Human-in-the-Loop Performance) Study
The study described is a comparison of the new device (NeoBase 2) to a predicate device (NeoBase) in obtaining analyte concentrations. While not explicitly stated as an "algorithm only" study, it's a standalone performance comparison of two test kits. The results (analyte concentrations and screening positive/negative classifications) are derived directly from the kit's operation with a tandem mass spectrometer, without human interpretation being part of the primary measurement process itself. The interpretation of the analyte profiles to aid in screening for metabolic disorders would typically involve medical professionals, but the performance data presented is on the analytical and classification output of the device.
7. Type of Ground Truth Used
The ground truth for the test set was based on "confirmed positive specimens." This implies that the true disease status of these specimens was established through clinical diagnosis and follow-up, which would typically involve a combination of clinical outcomes, biochemical testing, and/or genetic testing, ultimately confirmed by clinical experts. For ADA-SCID and X-ALD, it explicitly states "comparing the result... to the clinical condition."
8. Sample Size for the Training Set
The document does not explicitly mention a "training set" in the context of machine learning or AI. The term "cut-offs for both methods were determined by calculating the 99.5th and 99th percentile for all analytes" using "data from routine newborn screening." This large volume of routine newborn screening data could be considered analogous to a training or reference population used to establish the operating characteristics of the screening test. The specific sample size for this cut-off determination is not given, but it is implied to be a large dataset from the "two CLIA-certified state laboratories."
9. How the Ground Truth for the Training Set Was Established
As discussed in point 8, there isn't a traditional "training set" for an AI model. However, the cut-off values (e.g., 99th, 99.5th, 1st, 10th percentiles) used to define "screening positive" or "screening negative" were established using "data from routine newborn screening." This means the ground truth for establishing these cut-offs would inherently come from the statistical distribution of analyte levels in a large, presumably healthy and general newborn population, along with the understanding of what analyte levels are indicative of various metabolic disorders. The document states that the cut-off values "only apply to these studies."
Ask a specific question about this device
(244 days)
acids, free Classification Name: carnitine, and acylcarnitines using tandem mass spectrometry (21 CFR § 862.1055
2010
Re: K093916
Trade Name: NeoBase Non-derivatized MSMS reagent kit Regulation Number: 21 CFR 862.1055
The Neobase Non-derivatized MSMS reagent kit (for use on the PerkinElmer TQD MSMS Screening System) is intended for the measurement and evaluation of amino acids, succinylacetone, free carnitine, and acylcarnitine concentrations from newborn heel prick blood samples dried on filter paper. Quantitative analysis of these analytes (Table 1) and their relationship with each other is intended to provide analyte concentration profiles that may aid in screening newborns for metabolic disorders.
The measurement of amino acids, succinylacetone, free carnitine, and acylcarnitines with the NeoBase assay involves extraction of dried blood spots from newborns with a solution containing stable-isotope labeled internal standards and analysis using a tandem mass spectrometry (MSMS) system. The each analyte relative to their response of internal stable-isotope labeled corresponding standard is proportional to analyte concentration.
The device being described is the NeoBase Non-derivatized MSMS Kit, intended for the measurement and evaluation of amino acid, succinylacetone, free carnitine, and acylcarnitine concentrations from newborn heel prick blood samples dried on filter paper. This quantitative analysis aids in screening newborns for metabolic disorders.
The study presented is a non-clinical study comparing the performance of the NeoBase Non-derivatized MSMS kit on the PerkinElmer TQD Triple Quadrupole Mass Spectrometer System (PerkinElmer TQD platform) against its predicate devices, the MS2 and PerkinElmer Quattro Micro platforms (QMicro). The goal was to demonstrate substantial equivalence.
Here's the breakdown of the acceptance criteria and study information:
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state "acceptance criteria" as a set of predefined thresholds. Instead, it demonstrates equivalence to predicate devices by showing comparable performance characteristics. The key performance metrics evaluated were:
- Precision (Imprecision Percent Coefficient of Variation - %CV): Lower %CV indicates higher precision.
- Recovery (Mean % Recovery and 95% Confidence Interval): Indicates the accuracy of analyte measurement.
- Measurable Ranges: The range over which the device can accurately quantify analytes, ensuring coverage of clinically significant levels.
- Method Correlation (Ratio of Measured Concentration): Comparing the TQD platform with predicate devices (MS2 and QMicro). A ratio of 1.0 indicates equivalent concentration measurements.
- Clinical Correlation (Percent Agreement in clinical determinations): How well the TQD platform agrees with predicate platforms in classifying samples above or below clinical cutoffs.
- Detection of True Positive Samples: The ability of the device to correctly identify known positive cases.
Here's a summary of the reported device performance, focusing on the TQD platform's comparison to predicate devices:
| Performance Characteristic | Acceptance Criteria (Implied: Comparable to Predicate) | NeoBase Non-derivatized MSMS Kit (TQD Platform) Performance (as compared to MS2/QMicro) |
|---|---|---|
| Precision (Average Total Imprecision %CV) | Should be comparable to or better than predicate devices. | Amino Acids: Generally around 10-18% for TQD, similar to or slightly better than MS2/QMicro (e.g., ALA 10, ARG 10, MET 18, TYR 8, VAL 12). Carnitines/Acylcarnitines: Not explicitly provided for all, but overall implied to be adequate based on predicate comparison. |
| Recovery (Mean % Recovery) | Should be comparable to predicate devices. | Amino Acids: Ranges from 57% (SA) to 104% (C0) for TQD, generally comparable to MS2/QMicro. 95% CI also presented. |
| Measurable Ranges | Should cover all clinically significant ranges. | For all analytes, the TQD range (µM) includes or extends beyond the "Cutoff Range (µM)", demonstrating sufficiency for clinical use. |
| Method Correlation (Mean Ratio of Measured Concentration) | Ratios close to 1.0 (indicating statistical equivalence). | MS2/TQD: Ratios ranged from 0.89 to 1.09, with small variation, indicating statistical equivalence. QMicro/TQD: Ratios ranged from 0.92 to 1.08, with small variation, indicating statistical equivalence. |
| Clinical Correlation (% Agreement) | High percentage agreement with predicate devices. | All Analytes: Ranged from 99.2% to 100.0% agreement between TQD and MS2/Sciex platforms in clinical determinations. |
| Detection of True Positive Samples | 100% agreement with predicate devices in detection. | 100% agreement for all 17 true positive samples (representing 14 disorders) between TQD and MS platforms. |
2. Sample Size Used for the Test Set and Data Provenance
- Non-clinical (Analytical) Test Set:
- Precision and Recovery: The tables provided (5.3, 5.4, 5.5) show averaged data, but the explicit number of samples/replicates isn't detailed for each specific test. However, the method correlation section states that "enriched samples (five levels) was analyzed (as singlicates of each level) for 16 runs to provide a total of 80 individual measurements" for each analyte on each platform.
- Method Correlation: Data from "dried blood spots enriched with the analytes of interest," specifically "5 levels times 5 runs per analyte" resulted in 25 means per platform for each analyte (80 individual measurements total, as above).
- Clinical Test Set:
- Clinical Correlation (Percent Agreement): 2499 random newborn screening specimens (presumptive negative data set) and 17 specimens with true positive diagnoses. Some analytes specify 2598 total observations (2499 + 80 individual measurements from enriched samples for other tests?), while others specify 2518* (2499 presumptive negatives + 19 true positives, including newly acquired NKH and H-ALA samples mentioned in footnote).
- True Positive Samples: 17 samples with true positive diagnoses representing 14 disorders.
- Data Provenance: The document does not specify the country of origin for the samples. It mentions "newborn heel prick blood samples dried on filter paper," which is a standard collection method. The data is retrospective in the sense that these were pre-existing biological samples used for evaluation. It's not a prospective collection of new patients for this specific study.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications
This study focuses on diagnostic device performance (quantitative measurements of analytes) rather than interpretive tasks that would typically require human expert adjudication of images or complex clinical scenarios.
- Analytical Performance (Precision, Recovery, Measurable Ranges, Method Correlation): Ground truth is established by the known concentrations of analytes in the spiked/enriched samples, and the quantitative measurements determined by specialized laboratory equipment (Mass Spectrometers). No human experts are involved in establishing this type of ground truth.
- Clinical Correlation and True Positive Samples: The "ground truth" for the 17 true positive samples is referred to as "true positive diagnoses." The document does not specify how these diagnoses were established (e.g., whether by pathology, genetic testing, or clinical consensus) nor does it mention the number or qualifications of experts involved in these initial diagnoses.
4. Adjudication Method for the Test Set
Not applicable. This is an analytical/quantitative device performance study rather than an interpretive study requiring human adjudication. The "agreement" for clinical correlation refers to the concordance between the numerical results of the TQD platform and the predicate platforms against established clinical cutoffs, not human expert consensus on a diagnosis.
5. If a Multi Reader Multi Case (MRMC) Comparative Effectiveness Study was done
No, an MRMC comparative effectiveness study was not done. This study is evaluating the analytical performance and clinical correlation of a laboratory diagnostic assay, not a device that requires human interpretation of outputs.
6. If a Standalone Study (algorithm only without human-in-the-loop performance) was done
Yes, this is essentially a standalone (algorithm only) study. The "device" is a reagent kit used on an automated mass spectrometry system. The study compares the quantitative results generated by the TQD platform (with the NeoBase kit) directly to the predicate MS2 and QMicro platforms (also using the NeoBase kit) without a human-in-the-loop interpretation step being evaluated as part of the primary outcome for device clearance. The output is a numerical concentration.
7. The Type of Ground Truth Used
- Analytical Performance: Ground truth is based on known concentrations in control and enriched samples. These are prepared by spiking analytes into a matrix (dried blood spots) at specific, verifiable concentrations.
- Clinical Correlation and True Positive Samples:
- For the 2499 random newborn screening specimens, the "ground truth" for clinical "agreement" is whether the analyte concentrations fall above or below their respective clinical cutoffs (as determined by the predicate device).
- For the 17 true positive samples, the ground truth is established clinical diagnoses of metabolic disorders ("true positive diagnoses").
8. The Sample Size for the Training Set
The document describes evaluation of the device performance, not the training of an AI algorithm requiring a specific "training set." This device is a reagent kit for a mass spectrometry system, not an AI/ML software device in the typical sense. Therefore, there is no explicit "training set" in the context of an AI model.
Historically, the predicate devices (MS2 and QMicro) and their associated kits would have undergone extensive validation and optimization (which could be conceptually analogous to a training phase, but for analytical chemistry rather than AI). The current study is demonstrating the equivalence of the NeoBase kit on a new platform (TQD) to these previously validated systems.
9. How the Ground Truth for the Training Set was Established
As there is no distinct "training set" for an AI algorithm in this context, this question is not applicable. The methods used in developing and validating the NeoBase kit and its use on mass spectrometry platforms would involve many iterations of experiments to establish linearity, accuracy, precision, and other analytical specifications. This process relies on robust analytical chemistry principles and reference materials with known concentrations, rather than a "ground truth" established by human experts for AI training.
Ask a specific question about this device
(259 days)
system for amino acids, free carnitine, and acylcarnitines using tandem mass spectrometry (21 CFR § 862.1055
- 9 2009**
Re: K083130
Trade Name: NeoBase Non-Derivatized MSMS Kit Regulation Number: 21 CFR §862.1055
The NeoBase Non-derivatized MSMS reagent kit is intended for the measurement and evaluation of amino acids, succinylacetone, free carnitine, and acylcarnitine concentrations from newborn heel prick blood samples dried on filter paper. Quantitative analysis of these analytes (Table 1) and their relationship with each other is intended to provide analyte concentration profiles that may aid in screening newborns for metabolic disorders.
The measurement of amino acids, succinylacetone, free carnitine, and acylcarnitines with the NeoBase assay involves extraction of dried blood spots from newborns with a solution containing stable-isotope labeled internal standards and analysis using a tandem mass spectrometry (MSMS) system. The response of each analyte relative to their corresponding stable-isotope labeled internal standard is proportional to analyte concentration.
The provided 510(k) summary describes the NeoBase Non-derivatized MSMS Kit, intended for newborn screening of metabolic disorders by measuring amino acids, succinylacetone, free carnitine, and acylcarnitine concentrations from dried blood spots. The device's performance was compared to a legally marketed predicate device, the NeoGram Amino Acids and Acylcarnitines Tandem Mass Spectrometry Kit (K031878).
Here's a breakdown of the requested information based on the provided text:
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state "acceptance criteria" as numerical thresholds for precision, recovery, or correlation that the device must meet. Instead, it presents the device's performance characteristics and compares them to those reported for the predicate device to demonstrate substantial equivalence. The implication is that performance comparable to the predicate device is considered acceptable.
| Performance Characteristic | Acceptance Criteria (Implicit) | NeoBase Non-derivatized MSMS Kit Performance (Reported) | Predicate Device (NeoGram) Performance (Reported) |
|---|---|---|---|
| Precision (Averaged Total Imprecision %CV for Amino Acids) | Comparable to predicate | < 10% (median across analytes, approximated from table) | < 20% (median across analytes, approximated from table) |
| Precision (Averaged Total Imprecision %CV for Carnitines & Acylcarnitines) | Comparable to predicate | < 10% (median across analytes, approximated from table) | < 20% (median across analytes, approximated from table) |
| Recovery (Average % for Amino Acids) | Comparable to predicate, ideally close to 100% | 89-101% | 68-96% |
| Recovery (Average % for Carnitines & Acylcarnitines) | Comparable to predicate, ideally close to 100% | 93-102% | 67-139% |
| Measurable Ranges | Must cover clinically significant ranges | All NeoBase analyte ranges cover or extend beyond "Normal" and "Cutoff" clinical ranges. | All Predicate analyte ranges cover or extend beyond "Normal" and "Cutoff" clinical ranges. |
| Method Correlation (R for Amino Acids) | "Correlated very well" with predicate (R-values close to 1) | Most R values ≥ 0.95 (approximated from Table 5.7) | Most R values ≥ 0.95 (approximated from Table 5.7) |
| Method Correlation (R for Carnitines & Acylcarnitines) | "Correlated very well" with predicate (R-values close to 1) | Most R values ≥ 0.95 (approximated from Table 5.8) | Most R values ≥ 0.95 (approximated from Table 5.8) |
| Clinical Agreement (% Agreement for all analytes) | High percentage agreement with predicate (implied close to 100%) | 97.2% to 100.0% | N/A (agreement between methods) |
| Detection of True Positive Samples (Disorders) | Comparable sensitivity to predicate for common disorders; ability to detect Tyrosinemia Type I | Detected 107/108 true positive samples (excluding 2 samples decayed due to storage) | Detected 103/108 true positive samples (excluding 2 samples decayed due to storage, and 4 Tyrosinemia Type I samples) |
| Detection of Tyrosinemia Type I specifically | Successful detection of Tyrosinemia Type I (using SA) | Detected 4/4 Tyrosinemia Type I samples via SA | Detected 0/4 Tyrosinemia Type I samples |
2. Sample Size Used for the Test Set and Data Provenance
- Non-clinical (Precision, Recovery, Measurable Ranges): The document doesn't specify a distinct "test set" for these parameters in terms of a separate sample size. The data for these characteristics is generated through laboratory experiments likely using controlled samples (e.g., spiked samples, internal controls) rather than clinical patient samples. The provenance is implied to be laboratory-generated.
- Method Correlation:
- Sample Size: 158 samples.
- Data Provenance: Not explicitly stated, but samples were "prepared in duplicates" and assayed using both devices. Implied to be laboratory-controlled or clinical samples processed for comparison, but specific origin (e.g., country) is not mentioned. Given the context of newborn screening, they would likely be dried blood spots.
- Clinical Correlation Studies:
- Sample Size:
- 9416 random neonatal samples
- 104 samples with true positive diagnoses
- 320 artificially enriched dried blood spots
- Data Provenance: From two different US newborn screening laboratories. The studies evaluated the NeoBase kit "in parallel to the predicate device (identical specimens were analyzed as paired samples by both methods)." This suggests a retrospective collection of these samples, as they are referred to as "random neonatal samples" and "samples with true positive diagnoses."
- True Positive Samples (Specific breakdown): 108 total true positive samples were analyzed (104 from screening sites + 4 Tyrosinemia Type I samples analyzed by PerkinElmer R&D). The two samples that weren't detected by either assay were CPT-2 and VLCAD cases which had degraded due to storage.
- Sample Size:
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
The document does not specify the number of experts or their qualifications used to establish the ground truth for the clinical correlation studies or the true positive samples.
For the true positive samples, the text refers to them as "samples with true positive diagnoses" and later lists "Disorder Full name" for each, implying that the diagnoses were established clinically through standard diagnostic procedures. However, the exact method of ground truth confirmation or number of experts involved is not detailed.
4. Adjudication Method for the Test Set
The document does not describe an adjudication method for the test set.
For the "Clinical Correlation Studies," it states that "Clinical correlation was established by assessing whether or not the methods were concordant in determining the paired samples to have analyte concentration values above or below their corresponding cutoffs." This implies a direct comparison of the readings from the NeoBase and predicate device against predetermined clinical cutoffs, rather than an expert adjudication process determining the ultimate "true" status of each sample for the purpose of the study.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done
No, a Multi-Reader Multi-Case (MRMC) comparative effectiveness study was not done.
This device is an in vitro diagnostic (IVD) kit for quantitative measurement of analytes using tandem mass spectrometry. Its performance evaluation focuses on analytical characteristics (precision, recovery, measurable range) and method correlation with a predicate device, as well as clinical correlation of quantitative measurements to established cutoffs. MRMC studies are typically used for imaging devices or other diagnostic tools where human readers interpret results, and the study would then compare human performance with and without AI assistance. This is not applicable here.
6. If a Standalone (i.e. algorithm only without human-in-the-loop performance) Was Done
Yes, a standalone performance evaluation was done.
The entire study described in the 510(k) refers to the performance of the device kit (NeoBase Non-derivatized MSMS Kit) itself. This is an assay system that produces quantitative values. The precision, recovery, measurable ranges, and direct comparisons of analyte concentrations to clinical cutoffs are all measures of the kit's standalone analytical performance. Human involvement is limited to operating the instrument and following the assay protocol, not to interpreting ambiguous results in a way that would alter the quantitative output.
7. The Type of Ground Truth Used
- Non-clinical (Precision, Recovery, Measurable Ranges): The ground truth for these studies would be based on known concentrations of analytes in controlled, laboratory-prepared samples (e.g., spiked samples, calibrators).
- Method Correlation: The "ground truth" here is the measurement from the predicate device. The study compares the new device's measurements against those from the established predicate device.
- Clinical Correlation Studies:
- For the 9416 random neonatal samples, the ground truth was effectively the clinical cutoff values for each analyte. Samples were categorized as "above or below their corresponding cutoffs."
- For the 104 (plus 4 Tyrosinemia Type I) "true positive samples," the ground truth was the established clinical diagnosis of specific metabolic disorders (e.g., 3-Methylcrotonyl-CoA Carboxylase Deficiency, Phenylketonuria, Tyrosinemia Type I). This implies outcomes data or a strong clinical consensus for the natural diagnosis itself.
8. The Sample Size for the Training Set
The document does not explicitly mention a "training set" for the NeoBase Non-derivatized MSMS Kit. This is an in vitro diagnostic assay, not a machine learning algorithm that typically undergoes a distinct training phase on a large dataset. The development of such a kit involves analytical validation studies to optimize reagents and protocols, which is different from "training" an algorithm.
9. How the Ground Truth for the Training Set Was Established
As no training set is explicitly described in the context of an algorithm or machine learning, this question is not applicable based on the provided text. The "ground truth" in the development of an IVD kit is established through standard analytical chemistry principles, reference methods, and clinical validation.
Ask a specific question about this device
Page 1 of 1