Search Filters

Search Results

Found 43 results

510(k) Data Aggregation

    K Number
    K241176
    Date Cleared
    2025-01-16

    (262 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The Alere NT-proBNP for Alinity i assay is a chemiluminescent microparticle immunoassay (CMIA) used for the in vitro quantitative determination of N-terminal pro B-type natriuretic peptide (NT-proBNP) in human serum and plasma on the Alinity i system.

    In the emergency department, measurements of NT-proBNP are used as an aid in the diagnosis of heart failure (HF) in patients with clinical suspicion of new onset or worsening HF.

    Device Description

    The Alere NT-proBNP for Alinity i assay is an automated, two-step immunoassay for the in vitro quantitative determination of NT-proBNP in human serum and plasma using chemiluminescent microparticle immunoassay (CMIA) technology. Sample and anti-NT-proBNP coated paramagnetic microparticles are combined and incubated. The NT-proBNP present in the sample binds to the anti-NT-proBNP coated microparticles. The mixture is washed. Anti-NT-proBNP acridinium-labeled conjugate is added to create a reaction mixture and incubated. Following a wash cycle, Pre-Trigger and Trigger Solutions are added. The resulting chemiluminescent reaction is measured as a relative light unit (RLU). There is a direct relationship between the amount of NT-proBNP in the sample and the RLU detected by the system optics.

    AI/ML Overview

    Despite the request for acceptance criteria and study proving the device meets said criteria, the provided document is a 510(k) summary for a diagnostic test (Alere NT-proBNP for Alinity i Reagent Kit). This type of document focuses on demonstrating substantial equivalence to a predicate device, and thus does not explicitly list "acceptance criteria" for performance in the same way one might find for a new medical device claiming superiority or non-inferiority.

    Instead, the document details various performance characteristics of the device, comparing them to relevant standards (CLSI guidelines) and providing statistical data. It aims to show that the new device performs acceptably and similarly to a previously cleared device.

    Therefore, I cannot extract a table of "acceptance criteria" as such a table is not explicitly presented. However, I can infer the implied acceptance criteria from the reported performance, specifically from the "No Significant Interference" and "within acceptable performance" statements in the nonclinical performance section, and the effectiveness of the cutoffs for diagnosis in the clinical performance. The "reported device performance" will be the actual numbers provided in the document.

    Here's a summary of the available information, structured to address your points as much as possible given the document type:


    Implied Acceptance Criteria and Reported Device Performance

    As this is a 510(k) submission, explicit quantitative acceptance criteria are not stated in a dedicated table format. Instead, the device's performance characteristics are presented as evidence of substantial equivalence to a predicate device and adherence to recognized standards. The implied acceptance criteria are that the device demonstrates acceptable accuracy, precision, and clinical utility for its stated indications for use.

    Here's a table summarizing key performance indicators that would implicitly serve as acceptance criteria given standard diagnostic device requirements:

    Performance CharacteristicImplied Acceptance CriterionReported Device Performance
    Analytical Measuring Interval (AMI)The range over which results can be reliably quantified.15.8 to 35,000.0 pg/mL (1.9 to 4130.0 pmol/L). Extended Measuring Interval (EMI) up to 350,000 pg/mL (41,300.0 pmol/L) for diluted samples.
    LinearityDevice should demonstrate linear response across AMI.Linear across the AMI of 15.8 to 35,000.0 pg/mL.
    Within-Laboratory Precision (Overall CV)Low variability; specific CV targets for different concentration levels.Low Control: 6.2% CVMedium Control: 4.1% CVHigh Control: 4.0% CVPanels A-F: 3.6% - 10.0% CVPanel G: 4.0% CVPanel H (Supplemented): 7.7% CV
    Reproducibility (Overall CV)Low variability across sites, days, and lots.Low Control: 4.7% CVMedium Control: 4.8% CVHigh Control: 6.7% CVPanel 1: 18.9% CVPanels 2-6: 4.3% - 6.0% CVPanel 7 (Supplemented): 6.6% CVPanel 8 (Supplemented): 7.2% CV
    Lower Limits of Measurement (LoQ)Detect and quantify analyte at low concentrations with acceptable precision.LoQ: 15.8 pg/mL (1.9 pmol/L) (defined as lowest concentration at which 20% CV was met).LoB: 0.1 pg/mLLoD: 3.6 pg/mL (0.4 pmol/L)
    Analytical Specificity (Interference)Interference within ±10.0% for listed substances/drugs.No significant interference (within ±10.0%): Bilirubin, Biotin, Cholesterol, HAMA, Hemoglobin, IgG, Intralipid, RF (up to 600 IU/mL), Total Protein (up to 12.6 g/dL), and a comprehensive list of 50+ drugs at specified concentrations. Interference beyond ±10.0% observed for: RF at 1520 IU/mL (-8.9% to -11.4%), Total Protein at 15.2 g/dL (-12.7%).
    Cross-Reactivity% recovery within 100% ± 10% for listed cross-reactants.All evaluated cross-reactants (e.g., Adrenomedullin, Aldosterone, Angiotensin I/II/III, ANP, BNP, CNP, Endothelin, NT-proANP, Renin, Urodilatin) showed % recovery within 100% ± 10%.
    High Dose HookNo hook effect up to a specified high concentration.No hook effect observed up to 372,620 pg/mL.
    Clinical Performance (Posttest Probability for HF)Positive test result to show high posttest probability of HF; Negative test result to show high posttest probability of Non-HF.All Subjects (Positive): 75.2% (708/942) posttest probability of HF.All Subjects (Negative): 94.0% (794/845) posttest probability of Non-HF.Grayzone: 35.6% posttest probability of HF.Similar detailed results provided for various age groups, sexes, eGFR, BMI, and comorbidity subgroups.
    Clinical Performance (Likelihood Ratios for HF)High LR (Positive), Low LR (Negative).All Subjects (Positive): 4.29 (3.80, 4.83)All Subjects (Negative): 0.09 (0.07, 0.12)Grayzone: 0.78 (0.64, 0.96) Similar detailed results provided for various age groups, sexes, eGFR, BMI, and comorbidity subgroups.

    Study Details:

    1. Sample sizes used for the test set and the data provenance:

      • Clinical Performance Study (test set): 2127 Emergency Department (ED) subjects.
        • Provenance: Multi-center prospective study across 17 collection sites in the US.
        • Demographics: 1030 (48.4%) female, 1097 (51.6%) male, age 19-97 years. Predominantly White (53.1%) and Black/African American (39.5%). 90.9% non-Hispanic/Latino.
      • Nonclinical Performance (examples):
        • Within-Laboratory Precision: 240 replicates (controls/panels).
        • Reproducibility: 360 replicates (controls/panels) per assay (across 3 sites).
        • Lower Limits of Measurement: n ≥ 60 replicates for LoB, LoD, LoQ.
        • Analytical Specificity/Interference: Each substance tested at 2 analyte levels (approximately 125 pg/mL and 2000 pg/mL).
    2. Number of experts used to establish the ground truth for the test set and the qualifications of those experts:

      • The ground truth for the clinical study was an "adjudicated diagnosis" determined by a panel of board-certified cardiologists. The exact number of cardiologists on the panel is not specified in the provided text.
    3. Adjudication method (e.g. 2+1, 3+1, none) for the test set:

      • The document states "An adjudicated diagnosis was determined by a panel of board-certified cardiologists." It does not specify the exact adjudication method (e.g., majority vote, sequential review, etc.).
    4. If a multi reader multi case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:

      • No, this document describes the validation of a quantitative in vitro diagnostic (IVD) reagent kit for measuring NT-proBNP levels using an automated chemiluminescent immunoassay (CMIA) system. It is not an AI-assisted diagnostic imaging device, so an MRMC study is not relevant to this submission. The "readers" are the automated analyzers and laboratory personnel interpreting numerical results.
    5. If a standalone (i.e., algorithm only without human-in-the-loop performance) was done:

      • This device is a standalone diagnostic test in the sense that it provides a quantitative NT-proBNP result. The assay itself is a fully automated process on the Alinity i system. The performance data presented (precision, linearity, limits, specificity, clinical performance tables) represent the performance of the device "standalone" in generating these quantitative results, which are then used by clinicians as an "aid in diagnosis." There isn't a "human-in-the-loop" component to the measurement itself, though medical professionals interpret the results in a clinical context.
    6. The type of ground truth used (expert consensus, pathology, outcomes data, etc.):

      • For the clinical performance study, the ground truth for Heart Failure (HF) diagnosis was established by expert consensus (adjudicated diagnosis by a panel of board-certified cardiologists).
    7. The sample size for the training set:

      • This document describes a 510(k) submission for an in vitro diagnostic reagent kit. Unlike AI/ML software, such devices typically undergo analytical and clinical validation studies with defined test sets but do not have a "training set" in the sense of machine learning algorithms. The development and optimization of the assay would have involved various internal samples and experiments, but these are not explicitly termed "training sets" and their size is not reported in this context.
    8. How the ground truth for the training set was established:

      • As explained above, the concept of a "training set" with established ground truth, as typically applied to machine learning or AI models, does not directly apply to the regulatory submission type for this diagnostic reagent kit. The assay is based on chemical and biological principles (CMIA) rather than learned algorithms from large datasets.
    Ask a Question

    Ask a specific question about this device

    K Number
    K232164
    Device Name
    Access NT-proBNP
    Date Cleared
    2024-04-12

    (266 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The Access NT-proBNP assay is a paramagnetic particle, chemiluminescent immunoassay for the quantitative determination of N-terminal pro B-type natriuretic peptide levels in human serum and plasma using the automated DxI Access Immunoassay Analyzers to aid in the following:

    1. diagnosis of patients suspected of having acute heart failure in the Emergency Department
    2. assessment of heart failure severity
    3. risk stratification of patients with heart failure
    4. risk stratification of patients with acute coronary syndrome
    Device Description

    The Access NT-proBNP assay is a paramagnetic particle, chemiluminescent immunoassay for the quantitative determination of N-terminal pro B-type natriuretic peptide levels in human serum and plasma using the automated Dxl 9000 Access Immunoassay Analyzers to aid in the following: 1) diagnosis of patients suspected of acute heart failure in the Emergency Department, 2) assessment of heart failure severity, 3) risk stratification of patients with heart failure, 4) risk stratification of patients with acute coronary syndrome.

    The Access NT-proBNP is a two-site immunoenzymatic (sandwich) assay. Paramagnetic particles coated with monoclonal anti-NT-proBNP antibody and monoclonal anti-NTproBNP antibody conjugated to alkaline phosphatase are added to a reaction vessel along with a surfactant-containing buffer and serum or plasma sample. The human NTproBNP binds to the anti-NT-proBNP antibody on the solid phase, while the anti-NTproBNP antibody-alkaline phosphatase conjugate reacts with a different antigenic site on the NT-proBNP molecule. After incubation, materials bound to the solid phase are held in a magnetic field while unbound materials are washed away. Then the chemiluminescent substrate is added to the vessel and light generated by the reaction is measured with a luminometer. The light production is directly proportional to the concentration of analyte in the sample. Analyte concentration is automatically determined from a stored calibration.

    Other items required to use the assay include calibrators, Lumi-Phos PRO, and wash buffer. The Access NT-proBNP reagent packs. Access NT-proBNP calibrators, along with the Access wash buffer, and Lumi-Phos PRO are designed for use on the Dxl 9000 Access Immunoassay Analyzers in a clinical laboratory setting.

    AI/ML Overview

    Here's a breakdown of the acceptance criteria and study details for the Beckman Coulter Access NT-proBNP device, based on the provided text:

    Acceptance Criteria and Reported Device Performance

    The acceptance criteria are implied by the clinical and non-clinical studies conducted, with the device performance needing to meet expected ranges and exhibit substantial equivalence to predicate devices. Specific quantitative acceptance criteria are given for imprecision, LoB/LoD/LoQ, and linearity.

    Acceptance Criteria CategorySpecific Criteria (Implied or Stated)Reported Device Performance
    Imprecision- ≤ 4.0 ng/L SD at concentrations ≤ 50 ng/L - ≤ 8.0% CV at concentrations > 50 ng/LSample 1 (Mean 31 ng/L): SD 1.1 ng/L (3.5% CV). Meets criterion. Sample 2 (Mean 129 ng/L): SD 7.8 ng/L (6.0% CV). Meets criterion. Sample 3 (Mean 266 ng/L): SD 16.7 ng/L (6.3% CV). Meets criterion. Sample 4 (Mean 377 ng/L): SD 26.4 ng/L (7.0% CV). Meets criterion. Sample 5 (Mean 1,777 ng/L): SD 89.5 ng/L (5.0% CV). Meets criterion. Sample 6 (Mean 12,076 ng/L): SD 676.7 ng/L (5.6% CV). Meets criterion. Sample 7 (Mean 26,126 ng/L): SD 1516.1 ng/L (5.8% CV). Meets criterion.
    High Dose Hook EffectNo observed high dose hook effect within a specified high concentration range (e.g., up to 300,000 pg/mL, similar to predicate)No high dose hook effect observed up to 400,000 ng/L. Meets implied criterion (exceeds predicate device's demonstrated hook effect range).
    Limit of Blank (LoB)< 10.0 ng/L (pg/mL)Maximum Observed Result: 1.1 ng/L. Meets criterion.
    Limit of Detection (LoD)≤ 10.0 ng/L (pg/mL)Maximum Observed Result: 4.8 ng/L. Meets criterion.
    Limit of Quantitation (LoQ)≤ 10.0 ng/L (pg/mL) with ≤ 20% within-lab CVMaximum Observed Result: 4.8 ng/L. Meets criterion (implied that CV was ≤20%).
    Linearity- Within ± 10% for values > 50 ng/L - Within ± 5.0 ng/L for values ≤ 50 ng/LDemonstrated acceptable non-linearity across the analytical measuring range (10.0 - 35,000 ng/L) and meets the specified ±10% and ±5.0 ng/L criteria.
    Matrix ComparisonAll indicated sample types (serum, lithium heparin plasma, EDTA plasma) are suitable for use.Study with 68 matched samples showed all sample types are suitable. Meets criterion.
    Interfering SubstancesNo significant interference (defined as > 10% shift in dose) by common substances at specified concentrations.None of the tested compounds caused significant interference (>10% shift). Meets criterion.
    Cross ReactivityNo significant cross-reactivity (>10%) with structurally similar substances.No significant cross-reactivity (>10%) observed. Meets criterion.
    Clinical Performance (AHF Diagnosis)Aid in diagnosis of acute heart failure with comparable diagnostic accuracy to predicate device (Elecsys proBNP II), as evidenced by ROC AUC.AUC for Access NT-proBNP was 0.8536 (95% CI: 0.8362 - 0.8710), comparable to Elecsys proBNP II at 0.8562 (95% CI: 0.8361 - 0.8762). Meets criterion of comparability.
    Clinical Performance (NYHA Correlation)Significant trend relationship between NT-proBNP values and NYHA classification for all subjects, females, and males.JT test of trending resulted in statistically significant p-values (<0.0001 for all subjects, 0.0005 for females, 0.0033 for males), indicating a significant trend relationship. Meets criterion.
    Clinical Performance (Risk Stratification)Demonstrated prognostic utility for risk stratification in patients with heart failure and acute coronary syndrome (supported by peer-reviewed literature).Supported by analysis of peer-reviewed literature and guidelines for the same analyte, confirming correlation between NT-proBNP and cardiovascular events/mortality. Meets criterion.

    Study Details

    1. Sample Size and Data Provenance:

      • Test Set (Clinical Study for AHF Diagnosis): 2,384 patients presenting to the Emergency Department with clinical suspicion of acute heart failure.
        • Provenance: This was a "multicenter prospective study." While specific countries are not mentioned, regulatory submissions to the FDA typically involve studies conducted in the US or other regions following ICH GCP guidelines. The study type is explicitly prospective.
      • Test Set (Reference Interval Study): 675 apparently healthy adults for the overall ULN, broken down into 306 males and 369 females.
        • Provenance: This was a "multicenter prospective reference interval study." Again, specific countries are not mentioned but it was prospective.
      • Test Set (Imprecision): Number of runs (minimum 20 days) and replicates (duplicate) are specified for samples 1-7, with N ranging from 83 to 86 tests per sample.
      • Test Set (LoB, LoD, LoQ): Multiple reagents lots and 3 instruments over 3-5 days.
      • Test Set (Linearity): Native patient samples.
      • Test Set (Matrix Comparison): Sixty-eight (68) matched serum, lithium heparin, and EDTA plasma samples.
      • Test Set (Interfering Substances/Cross Reactivity): Lithium heparin plasma samples (concentrations approx. 125 ng/L and 1,800 ng/L) spiked with various substances. Number of individual samples not specified, but the number of substances tested is large.
    2. Number of Experts and Qualifications for Ground Truth:

      • For the clinical sensitivity study (AHF diagnosis), "Final diagnoses were adjudicated by an independent committee of medical doctors."
      • Number of Experts: Not explicitly stated (e.g., "3 independent physicians"). It mentions "an independent committee," which implies more than one.
      • Qualifications: "Medical doctors." No specific specialties (e.g., cardiologists, emergency physicians) or years of experience are listed in the provided text.
      • For risk stratification, the basis for effectiveness comes from "an objective and systematic analysis of recent peer-reviewed literature and clinical practice guidelines," implying consensus from the broader medical community rather than a specific set of experts for this particular study.
    3. Adjudication Method for the Test Set:

      • For the AHF diagnosis study, the text states: "Final diagnoses were adjudicated by an independent committee of medical doctors who decided on presence of acute heart failure. Adjudicators were blinded to the Access NT-proBNP assay results."
      • The specific method (e.g., 2+1, 3+1) is not explicitly detailed. It simply states "adjudicated by an independent committee," which could imply consensus, majority vote, or a specific tie-breaking rule, but the exact mechanism is not provided.
    4. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study:

      • No, this device is an in-vitro diagnostic (IVD) assay, not an AI/imaging device. Therefore, an MRMC study and human reader improvement with AI assistance are not applicable. The comparison made is against another IVD assay (Elecsys proBNP II) as a standalone algorithm/test.
    5. Standalone Performance:

      • Yes, the primary performance shown is standalone (algorithm only/test only) performance. The Access NT-proBNP assay is a quantitative determination of NT-proBNP levels. Its performance (e.g., analytical performance, diagnostic accuracy via ROC curves) is measured and reported as the output of the assay itself. The clinical performance section directly presents the assay's ability to diagnose AHF and correlates its results with NYHA classification and risk stratification, without human interpretation of the assay's output as an input to a further AI system.
    6. Type of Ground Truth Used:

      • Clinical Diagnosis (Expert Consensus/Adjudication): For the acute heart failure diagnosis study, the ground truth was established by an "independent committee of medical doctors" adjudicating the final diagnoses. This falls under expert consensus based on clinical information.
      • Clinical Outcomes/Literature Review: For risk stratification, the ground truth is based on the correlation of NT-proBNP with "increased incidence of cardiovascular events, mortality and composite outcomes" as established in existing peer-reviewed literature and clinical practice guidelines.
      • Reference Intervals for Healthy Population: Established by recruiting "apparently healthy adults" based on defined exclusion criteria and screening with additional biomarkers (eGFR, hsTnl) to ensure the health status.
    7. Sample Size for the Training Set:

      • The provided document describes the validation studies (test set) for the Access NT-proBNP device. It does not mention a separate "training set" in the context of an AI/machine learning model. This is an IVD assay, not an AI device trained on data. The development process for such an assay involves R&D, optimization, and internal verification stages, but the term "training set" doesn't directly apply in the same way it would for AI products.
    8. How Ground Truth for the Training Set Was Established:

      • As this is not an AI/machine learning device, the concept of a "training set" with established ground truth as per AI/ML typically doesn't apply. The development of the assay itself would involve internal studies and optimization based on known reference materials and clinical samples, but these are part of the assay's design and analytical verification, not a training phase for a learning algorithm.
    Ask a Question

    Ask a specific question about this device

    K Number
    K220265
    Date Cleared
    2023-09-24

    (601 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The ADVIA Centaur® NT-proBNPII (PBNPII) assay is for in vitro diagnostic use in the quantitative determination of N-terminal pro-brain natriuretic peptide (NT-proBNP) in human serum and plasma (EDTA and lithium heparin) using the ADVIA Centaur® XP system.

    In the Emergency Department (ED) and Outpatient (OP) populations, measurements of NT-proBNP are used as an aid in the diagnosis of heart failure (HF) in patients with clinical suspicion of new onset or worsening HF.

    Device Description

    The ADVIA Centaur® NT-proBNPII (PBNPII) assay kit includes the Primary Reagent ReadyPack and the Calibrator. The Primary Reagent ReadyPack contains Lite Reagent, Solid Phase Reagent, and Ancillary Well Reagent. The Calibrator includes Low and High Calibrators which are lyophilized.

    AI/ML Overview

    The provided document discusses the ADVIA Centaur® NT-proBNPII (PBNPII) assay, an in vitro diagnostic device for measuring N-terminal pro-brain natriuretic peptide (NT-proBNP) to aid in the diagnosis of heart failure in Emergency Department (ED) and Outpatient (OP) populations. The submission aims to demonstrate substantial equivalence to the predicate device, the Roche Elecsys proBNP II assay (K072437).

    Here's an analysis of the acceptance criteria and the studies that support them:


    1. Table of Acceptance Criteria and Reported Device Performance

    The document doesn't explicitly state "acceptance criteria" as a single, consolidated table with pass/fail values for all performance characteristics. Instead, it presents various performance studies with their results. Based on the "Comparison of Technological Characteristics with the Predicate Device" and the "Performance Characteristics" sections, we can infer some criteria and compare the device's performance.

    Performance CharacteristicAcceptance Criteria (Implied/Predicate)Reported Device Performance (ADVIA Centaur PBNPII)
    Intended UseAid in diagnosis of suspected congestive heart failure.Aid in diagnosis of HF in ED and OP populations with clinical suspicion of new onset or worsening HF.
    MeasurementQuantitativeQuantitative
    TechnologyChemiluminescence immunoassayChemiluminescence immunoassay (1-step sandwich)
    Sample TypePlasma and SerumHuman serum, plasma (EDTA and lithium heparin)
    Assay Range5-35,000 pg/mL (Predicate)35-35,000 pg/mL
    Hook EffectNo hook effect up to 300,000 pg/mL (Predicate)No hook effect up to 300,000 pg/mL (will report >35,000 pg/mL)
    Precision (Total CV)For Serum: e.g., <4.6% @ 44 pg/mL, <1.8% @ 2,410 pg/mL (Predicate)For Serum: 6.7% @ 116 pg/mL, 4.0% @ 271 pg/mL, 2.3% @ 380 pg/mL, 2.1% @ 806 pg/mL, 2.0% @ 1,597 pg/mL, 2.8% @ 25,073 pg/mL. QCs also within acceptable limits (presumably).
    ReproducibilityNot explicitly stated, typically within acceptable CV limits across sites.Total CVs ranging from 3.0% to 5.0% for serum samples (141-10428 pg/mL) and 3.3% to 4.5% for controls.
    LinearityN/A (implied to be demonstrated across assay range)Linear for 35-35,000 pg/mL
    Limit of Blank (LoB)LoB < LoD (Predicate)13 pg/mL
    Limit of Detection (LoD)5.00 pg/mL (Predicate)20 pg/mL
    Limit of Quantitation (LoQ)50.0 pg/mL (Predicate)35 pg/mL
    Interference (HIL & Other)Bias not to exceed 10% (Implied)Bias due to hemoglobin, bilirubin, lipemia <10%; various other substances also do not interfere (bias <10%).
    Cross-ReactivityNot Detectable / <1.0% for most (Implied, similar to predicate)Most cross-reactants <1.0% or not detectable. ProBNP (glycosylated) 1.0%-19.0%, ProBNP (non-glycosylated) 13.0%-30.0%.
    Specimen EquivalencyRegression equation close to y=x+0, r close to 1 (Implied)Plasma (EDTA, Lithium heparin), SST, RST vs. Serum: Regression equations y=1.00x +/- small constant, r ~0.999-1.000.
    Clinical Performance (ED Pop.)Likelihood Ratios (LR+, LR-) and Post-test Risk (Implied to be clinically useful for diagnosis)Age-dependent rule-in/out cutoffs provided with LR+ and LR- (0.06-0.09 for Negative LR-, suggesting good rule-out).
    Clinical Performance (OP Pop.)Sensitivity, Specificity, PPV, NPV (Implied to be clinically useful for diagnosis)Overall: Sensitivity 86.0%, Specificity 64.3%, PPV 34.4%, NPV 95.5% (at 125 pg/mL cutoff). Detailed breakdown by age/sex also provided.

    Note: The "Acceptable Criteria" values for precision were primarily taken directly from comments in the tables (indicated by "<" followed by a number), which appear to be the applicant's internal criteria rather than directly from the predicate. The predicate's precision data is shown for comparison purposes in the "Comparison of Technological Characteristics."

    2. Sample Size Used for the Test Set and Data Provenance

    • Precision and Reproducibility:
      • Precision: Assayed in duplicate in 2 runs per day for 20 days. Number of samples not explicitly stated but implied to be multiple levels of serum and QCs (6 serum, 3 QC samples listed).
      • Reproducibility: N=90 samples assayed in duplicate in 2 runs per day for 5 days at 3 sites.
    • Linearity: Not specified.
    • Detection Limit (LoB, LoD, LoQ): Not specified.
    • Interference (HIL & Other Substances): Not specified.
    • Cross-Reactivity: Not specified.
    • Specimen Equivalency: N=50 samples for each tube type comparison to serum.
    • Expected Values (Reference Study Group): 723 apparently healthy subjects (362 females, 361 males).
    • Clinical Performance (ED Population): 3128 subjects with signs and symptoms of acute HF who presented to the ED. 1148 adjudicated as acute HF, 1980 as without HF.
    • Clinical Performance (OP Population): 1033 OP subjects with signs and symptoms of new onset HF. 185 adjudicated as new onset HF, 848 as without HF.

    Data Provenance: The document does not explicitly state the country of origin for the clinical study data or if it was retrospective or prospective. However, the term "prospectively enrolled" is used for both the ED and OP clinical performance studies, indicating these were prospective studies.

    3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications

    • Clinical Performance (ED Population): An "independent central adjudication panel of expert clinicians (Cardiologists)" determined the diagnosis and severity of HF. The exact number of experts is not stated, nor are their specific qualifications (e.g., number of years of experience).
    • Clinical Performance (OP Population): An "independent central adjudication panel of expert clinicians (Cardiologists)" determined the diagnosis of new onset HF. Similar to the ED population, the exact number and specific qualifications are not detailed.

    4. Adjudication Method (for the test set)

    The adjudication method used by the "independent central adjudication panel of expert clinicians (Cardiologists)" is not explicitly described (e.g., 2+1, 3+1, none). It only states they "determined" the diagnosis.

    5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

    No MRMC comparative effectiveness study was done. This device is an in vitro diagnostic assay (laboratory test), not an imaging or interpretive device that typically involves human readers in the same way an MRMC study would apply. Therefore, there is no effect size reported for human readers improving with or without AI assistance.

    6. Standalone Performance Study

    Yes, a standalone (algorithm only) performance study was performed for the assay. The entire "Performance Characteristics" section (9.1 through 9.12) describes the analytical and clinical performance of the ADVIA Centaur PBNPII assay in isolation, without human interpretation of the assay results in the context of the study design. The clinical utility is assessed by how accurately the assay's quantitative results correlate with the adjudicated clinical diagnosis.

    7. Type of Ground Truth Used

    • Clinical Performance (ED and OP Populations): Clinical diagnosis adjudicated by an "independent central adjudication panel of expert clinicians (Cardiologists)." This falls under expert consensus clinical diagnosis.
    • Analytical Performance (Precision, Linearity, Detection Limits, Interference, Cross-Reactivity, Specimen Equivalency): These are quantitative measurements against established reference methods, standards, or known concentrations, serving as the ground truth for analytical validity.

    8. Sample Size for the Training Set

    The document describes an in vitro diagnostic assay for measuring biomarkers. These types of assays typically do not have a "training set" in the same sense as machine learning algorithms, which require labeled data for model development. Instead, they involve method development and validation experiments. The document describes studies for:

    • Establishing expected values (reference study group, N=723).
    • Establishing disease study groups for statistical analysis of correlation (ED population N=1148, OP population N=185).
    • Various analytical validation studies such as precision, linearity, and detection limits.

    Therefore, the concept of a "training set" for the assay, in the context of algorithm development, is not directly applicable or discussed. The studies mentioned above contribute to the overall validation and characterization of the assay's performance.

    9. How the Ground Truth for the Training Set Was Established

    As mentioned above, the concept of a "training set" for an assay like this is not directly applicable. If we consider the data used to characterize the assay's performance as analogous to "training data" (i.e., the data used to define the assay's characteristics and clinical cut-offs), then:

    • Clinical Cut-offs: The clinical cut-offs for the ED and OP populations were established based on the measured NT-proBNP values in patient populations (ED and OP disease study groups) where the diagnosis of heart failure was determined by an "independent central adjudication panel of expert clinicians (Cardiologists)." This expert consensus clinical diagnosis serves as the ground truth for establishing the clinical utility of the various cut-off values.
    • Reference Intervals: The reference interval was established using 723 apparently healthy subjects. The "healthy" status would have been determined through standard medical screening and criteria, serving as the ground truth for defining the normal range.
    • Analytical Studies: Ground truth for analytical studies (e.g., linearity, detection limit) is established through reference materials, spiked samples with known concentrations, and comparison to established methods or gold standard instrumentation.

    Ask a Question

    Ask a specific question about this device

    K Number
    K223637
    Manufacturer
    Date Cleared
    2023-07-21

    (228 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    Immunoassay for the in vitro quantitative determination of N-terminal pro-Brain natriuretic peptide in human serum and plasma. This assay is used as an aid in the diagnosis of individuals suspected of having heart failure. It can be used as an aid in the diagnosis of acute decompensated heart failure (ADHF) in patients presenting with signs and symptoms of ADHF to the emergency department (ED). The test is further indicated for the risk stratification of patients with acute coronary syndrome and heart failure. The test may also serve as an aid in the assessment of increased risk of cardiovascular events and mortality in patients at risk for heart failure who have stable coronary artery disease.

    Device Description

    Elecsys proBNP II and proBNP II STAT are second-generation assays by Roche Diagnostics for the in vitro quantitative determination of N-terminal pro-Brain natriuretic peptide (NT-proBNP) in human serum and plasma. The STAT and the 18 Minute assays are intended for use on the cobas e 601. The cobas e family of analyzers employs the electrochemiluminescence immunoassay "ECLIA" technology. The assays are sandwich principle methods using two monoclonal antibodies which are specifically directed against NT-proBNP. For the neutralization of free biotin in serum and plasma, Roche developed an antibody, which binds to free biotin. The antibodies are specific for free biotin and do not bind to or interact with the biotin-linker conjugates.

    AI/ML Overview

    The provided document describes the Elecsys proBNP II and Elecsys proBNP II STAT assays, which are in vitro quantitative determination tests for N-terminal pro-Brain natriuretic peptide (NT-proBNP) used as an aid in the diagnosis of heart failure, specifically acute decompensated heart failure (ADHF). The document details the analytical and clinical performance evaluations of these devices.

    Since the device is an in vitro diagnostic (IVD) test and not an AI-powered medical device for image analysis or similar, the acceptance criteria and study that proves the device meets them are focused on analytical performance (precision, sensitivity, linearity, interference) and clinical performance (diagnostic accuracy based on cut-points), rather than typical AI/ML metrics like AUC, sensitivity, specificity for image interpretation, or MRMC studies.

    Therefore, many of the requested AI/ML specific information points (e.g., number of experts to establish ground truth for test set, adjudication method, MRMC study, sample size for training set, how ground truth for training set was established) are not directly applicable in the context of this IVD device's evaluation as described.

    Here's an analysis based on the available information:

    1. Table of Acceptance Criteria and Reported Device Performance

    The document does not explicitly present a single table of "acceptance criteria" alongside "reported device performance" for the clinical diagnostic accuracy in a pass/fail format. Instead, it provides detailed analytical performance results and clinical likelihood ratios for various patient subgroups. However, for analytical performance, conclusions are stated:

    Analytical Performance Acceptance Criteria and Results:

    TestAcceptance CriteriaReported Device PerformanceConclusion
    Precision (CLSI EP05-A3)Not explicitly stated as numerical criteria, but implied "met predetermined acceptance criteria."Detailed CV% and SD values across various NT-proBNP concentrations (e.g., Repeatability CV%: 1.4-2.9%, Intermediate precision CV%: 3.3-4.8%)Met
    Limit of Blank (LoB) (CLSI EP17-A2)LoB ≤ 8 pg/mL (as per labeling claim)LoB = 1.48 pg/mLMet
    Limit of Detection (LoD) (CLSI EP17-A2)LoD ≤ 10 pg/mL (as per labeling claim)LoD = 2.57 pg/mLMet
    Limit of Quantitation (LoQ) (CLSI EP17-A2)Intermediate precision of 20% CVThe lowest concentration meeting 20% CV was at 10.8 pg/mL (15.7% CV for Sample_1)The LoQ claim in labeling is 36 pg/mL, implying samples at or above this value met the 20% CV criterion.
    Linearity/Reportable Range (CLSI EP06-Ed2)"Linearity specifications were met" across the measured range.Linear regression analysis passed between 24.3 - 35902 pg/mL, meeting precision and allowed deviation specifications.Met
    Endogenous Interferences (CLSI EP07-A3)No interference up to specified concentrations for Bilirubin, Hemoglobin, Lipemia, Biotin, Rheumatoid Factor.Reported specific non-interference levels for each substance.Met

    Clinical Performance:
    For clinical performance, the criterion is generally that the device provides "adequate performance when aiding in the diagnosis of acutely decompensated heart failure" and supports a "substantial equivalent decision" to the predicate. This is demonstrated through likelihood ratios (LR+ and LR-) for the various age and gender groups. While no explicit "acceptance criteria" for these LR values are given (e.g., LR+ > X and LR- < Y), the presentation of these values, along with post-test probabilities, is intended to show clinical utility and substantial equivalence.

    2. Sample Size Used for the Test Set and Data Provenance

    • Clinical Performance Test Set (ADHF Diagnosis Study):

      • Sample Size: 1485 subjects.
      • Data Provenance: Multi-center trial performed in the United States (US subjects).
      • Retrospective/Prospective: The description "multi-center trial was performed" suggests a prospective study design. Patients were enrolled with suspected ADHF and their ADHF status was subsequently adjudicated.
    • Analytical Performance Test Sets:

      • Precision: 8 serum samples and 2 controls (n=84 determinations for each sample/control).
      • LoB: 60 determinations of an analyte-free sample.
      • LoD: 5 low-level human serum samples, 60 determinations per reagent lot (total of three reagent lots tested).
      • LoQ: 10 native, unaltered serum samples, 5 replicates per sample, over 5 days (n=25 measured values per sample).
      • Linearity: 10 levels of diluted high analyte human native serum.
      • Endogenous Interferences: Three different analyte concentration levels for serum samples.

    3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications

    • Clinical Performance Test Set (ADHF Diagnosis Study):
      • Ground truth for ADHF diagnosis was established "based on adjudication by a clinical events committee."
      • The number of experts on this committee and their specific qualifications (e.g., radiologist with X years of experience) are not explicitly specified in the provided text.

    4. Adjudication Method for the Test Set

    • Clinical Performance Test Set:
      • Ground truth was based on "adjudication by a clinical events committee."
      • The specific adjudication method (e.g., 2+1, 3+1, majority vote, etc.) for this committee is not explicitly detailed in the document.

    5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

    • No, this is not applicable. The device is an in vitro diagnostic (IVD) test (immunoassay), not an AI-powered diagnostic imaging tool. Therefore, an MRMC study and the concept of "human readers improving with AI assistance" are not relevant to this type of device and its evaluation as described. The clinical performance evaluation focuses on the assay's ability to aid in diagnosis using predefined cut-points.

    6. If a Standalone (i.e. algorithm only without human-in-the-loop performance) was done

    • Yes, in essence. The "standalone performance" of the Elecsys proBNP II and Elecsys proBNP II STAT assays is demonstrated by their analytical performance metrics (precision, LoB, LoD, LoQ, linearity, interference) and their diagnostic performance (likelihood ratios) when applied to patient samples. There isn't an "algorithm" in the typical AI sense, but rather a direct measurement of NT-proBNP concentration against established cut-points. The interpretation relies on these quantitative measurements.

    7. The Type of Ground Truth Used (expert consensus, pathology, outcomes data, etc.)

    • Clinical Performance Test Set (ADHF Diagnosis Study): The ground truth for Acute Decompensated Heart Failure (ADHF) status was established by adjudication of a clinical events committee. This implies a panel of medical experts reviewed patient data (e.g., symptoms, signs, other diagnostic tests, clinical course) to determine the true ADHF status. This falls under a form of expert consensus on clinical diagnosis.

    8. The Sample Size for the Training Set

    • Not Applicable in the AI/ML sense. This is an in vitro diagnostic immunoassay, not an AI/ML device that requires a separate "training set" to develop an algorithm. The assay's analytical characteristics and clinical utility are established through the studies mentioned (analytical and clinical performance evaluation). The "training" of the device is inherent in its chemical and biological design and manufacturing, not a machine learning process.

    9. How the Ground Truth for the Training Set was Established

    • Not Applicable. As explained in point 8, there isn't a "training set" for an AI/ML algorithm in the context of this immunoassay. The analytical performance is validated against known standards and controls, and clinical performance is validated against clinical diagnoses adjudicated by a committee.
    Ask a Question

    Ask a specific question about this device

    K Number
    K210546
    Manufacturer
    Date Cleared
    2022-03-31

    (399 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    Immunoassay for the in vitro quantitative determination of N terminal pro Brain natriuretic peptide in human serum and plasma. This assay is used as an aid in the diagnosis of individuals suspected of having heart failure. The test is further indicated for the risk stratification of patients with acute coronary syndrome and heart failure. The test may also serve as an aid in the assessment of increased risk of cardiovascular events and mortality in patients at risk for heart failure who have stable coronary artery disease.

    The electrochemiluminescence immunoassay "ECLIA" is intended for use on cobas e immunoassay analyzers.

    Device Description

    Elecsys proBNP II (updated assay) is a second-generation assay by Roche Diagnostics for the in vitro quantitative determination of N-terminal pro-Brain natriuretic peptide (NT-proBNP) in human serum and plasma with increased biotin tolerance. The electrochemiluminescence immunoassay "ECLIA" is intended for use on cobas e immunoassay analyzers.

    The cobas e family of analyzers employs the electrochemiluminescence immunoassay "ECLIA" technology. The assays are an 18-minute (Elecsys proBNP II) and 9 minute (Elecsys proBNP II STAT) application following a sandwich principle using two monoclonal antibodies which are specifically directed against NT-proBNP.

    AI/ML Overview

    Acceptance Criteria and Device Performance Study for Elecsys proBNP II and Elecsys proBNP II STAT Assays

    The document describes the analytical performance studies conducted for the Elecsys proBNP II and Elecsys proBNP II STAT assays, which are in vitro diagnostic devices. These assays are intended for the quantitative determination of N-terminal pro-Brain natriuretic peptide (NT-proBNP) in human serum and plasma, aiding in the diagnosis of heart failure, risk stratification, and assessment of cardiovascular event risk. The assays have been modified to include increased biotin tolerance.

    The studies aim to demonstrate that the updated assays meet predetermined acceptance criteria for various analytical parameters, ensuring their safety and effectiveness and substantial equivalence to their predicate devices.

    1. Table of Acceptance Criteria and Reported Device Performance

    The provided document doesn't explicitly list "acceptance criteria" alongside "reported device performance" in a single, dedicated table for all parameters. However, the "Conclusion" sections for each analytical study implicitly state whether the acceptance criteria were met. Based on the provided text, a table can be constructed as follows:

    Acceptance Criteria CategorySpecific MetricPredetermined Acceptance Criterion (Implicitly Met)Reported Device Performance (Elecsys proBNP II)Reported Device Performance (Elecsys proBNP II STAT)
    Analytical SensitivityLimit of Blank (LoB)≤ 8 pg/mLAll lots met ≤ 8 pg/mLAll lots met ≤ 8 pg/mL
    Limit of Detection (LoD)≤ 10 pg/mLAll lots met ≤ 10 pg/mLAll lots met ≤ 10 pg/mL
    Limit of Quantitation (LoQ) (Intermediate precision 20% CV)≤ 36 pg/mL (Inferred from reported LoQ values)32.7 - 35.7 pg/mL7.28 - 13.6 pg/mL
    PrecisionRepeatability (Within-run precision)Low CV% (Specific numerical thresholds not stated)See tables on pages 8-9 for detailed CVSee tables on pages 9-10 for detailed CV
    Intermediate Precision (Within-laboratory precision)Low CV% (Specific numerical thresholds not stated)See tables on pages 8-9 for detailed CVSee tables on pages 9-10 for detailed CV
    Inter-Instrument Variability (Inter-laboratory precision)Low CV% (Specific numerical thresholds not stated)See table on page 10 for detailed CVSee table on page 11 for detailed CV
    Linearity/Reportable RangeMeasurements across claimed measuring range are linearNot explicitly quantified, but demonstratedLinearity data on page 15Linearity data on page 15
    InterferenceBilirubin (Conjugated & Unconjugated)No interference up to 25.0 mg/dLNo interference up to 25.0 mg/dLNo interference up to 25.0 mg/dL
    HemoglobinNo interference up to 1000 mg/dLNo interference up to 1000 mg/dLNo interference up to 1000 mg/dL
    LipemiaNo interference up to 1500 mg/dLNo interference up to 1500 mg/dLNo interference up to 1500 mg/dL
    BiotinNo interference up to 3500 ng/mLNo interference up to 5000 ng/mLNo interference up to 5000 ng/mL
    Rheumatoid FactorNo interference up to 1500 IU/mLNo interference demonstratedNo interference demonstrated
    AlbuminNo interference up to 7 g/dLNo interference demonstratedNo interference demonstrated
    Cross-ReactivityAbsence of significant cross-reactivity with various substancesRecovery % close to 100% (Implied)See tables on pages 19-20See tables on pages 19-20
    Exogenous InterferenceNo interference with listed common and special therapeutic drugsNo significant interference (Implied)No interference seen with tested drugsNo interference seen with tested drugs
    Matrix ComparisonsEquivalence between Serum, Li-Heparin, and K2-EDTA plasmaSlope close to 1, Intercept close to 0, High 'r'Slope 0.990-1.01, Intercept -0.985-0.898, r≥0.998Not explicitly detailed for STAT
    Method ComparisonSubstantial equivalence to predicate deviceHigh Pearson's r, Passing-Bablok Slope close to 1, Intercept close to 0Pearson's r ≥ 0.999, Slope 0.98, Intercept -2.88Pearson's r ≥ 0.999, Slope 1.01, Intercept -1.60

    2. Sample Sizes and Data Provenance

    • Test Set Sample Sizes:

      • Precision (Repeatability & Intermediate): 8 human serum samples and 2 controls (n=10 total) for each assay, measured in 84 determinations over 21 operating days. The exact number of individual patient samples contained within the "8 human serum samples" is not specified but appears to be 8 distinct pools/matrices.
      • Precision (Inter-Instrument): 8 native human serum sample pools and 2 quality control levels (n=10 total) for each assay, with 75 determinations per sample/QC level (due to 5 days, 3 laboratories, 25 determinations/site, 5x5=25, 3 sites x 25 = 75 total reported).
      • Analytical Sensitivity (LoB): 60 determinations of an analyte-free sample.
      • Analytical Sensitivity (LoD): 5 low-level human serum samples, with 60 determinations per reagent lot.
      • Analytical Sensitivity (LoQ): 9 native, unaltered serum samples for Elecsys proBNP II and 10 native, unaltered serum samples for Elecsys proBNP II STAT, each tested with 25 measured values.
      • Linearity/Assay Reportable Range: One high analyte human, native serum sample diluted to 11 concentrations.
      • Endogenous Interference (Bilirubin, Hemoglobin, Lipemia, Biotin, Rheumatoid Factor, Albumin): Three different analyte concentration levels (low, medium, high) in human native serum samples. Specific number of samples at each level not explicitly stated but implied to be several.
      • Cross-Reactivity: Two human native serum samples (low and high analyte levels) spiked with potential cross-reactants.
      • Exogenous Interference (Drugs): Two human native serum samples (low and high analyte concentrations).
      • Matrix Comparisons: Single donor samples drawn into Serum (reference), Li-Heparin (98 pairs), and K2-EDTA plasma (111 pairs).
      • Method Comparison: 1928 subjects for Elecsys proBNP II STAT and 1940 subjects for Elecsys proBNP II.
    • Data Provenance: The document generally refers to "human serum samples" and "native human serum samples" without specifying the country of origin. The studies are described as internal (e.g., "one internal site" for precision). There is no explicit mention of the data being retrospective or prospective, but the nature of in vitro diagnostic device performance studies (analytical validation) typically involves prospective testing of samples under controlled laboratory conditions, simulating diagnostic use.

    3. Number of Experts and Qualifications for Ground Truth

    The document does not describe the use of human experts to establish "ground truth" for the test set in the context of diagnostic interpretation (e.g., radiologists, cardiologists). This document details the analytical performance of an in vitro diagnostic assay, not an AI/imaging diagnostic device that requires expert adjudication of images. The "ground truth" in this context refers to the true analytical concentration of NT-proBNP in the samples, established through well-defined laboratory methodologies and traceable standards, or comparison to a previously cleared predicate device.

    4. Adjudication Method for the Test Set

    Not applicable. As this is an analytical validation of an in vitro diagnostic assay, there is no "adjudication method" in the sense of multiple human readers or experts resolving discrepancies in diagnostic interpretation. The methods involve quantitative analytical measurements of biochemical markers.

    5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

    Not applicable. This is not an imaging or AI-assisted diagnostic device that would typically undergo an MRMC study. The study focuses on the analytical performance of a laboratory immunoassay.

    6. Standalone (Algorithm Only) Performance

    Not applicable in the typical sense for medical imaging AI. The "algorithm" here is the chemical reaction and measurement process of the immunoassay itself. The analytical performance metrics (precision, sensitivity, linearity, interference, matrix comparison) are effectively the "standalone performance" of the device. The method comparison study directly compares the performance of the updated device against its predicate (older version) without human intervention in the result generation.

    7. Type of Ground Truth Used

    The ground truth used for these analytical studies is primarily measured analytical concentration, established through:

    • Reference materials: Calibrators and controls with known concentrations.
    • Spiking experiments: Adding known amounts of analyte or interfering substances to samples.
    • Dilutions: Creating samples with predictable concentrations.
    • Comparison to predicate device: The method comparison studies compare the new device's results against the results from the previously cleared Elecsys proBNP II and Elecsys proBNP II STAT (older versions) as the reference.
    • Consensus laboratory methods/standards: Studies like LoB, LoD, LoQ, and precision follow CLSI (Clinical and Laboratory Standards Institute) guidelines, which are established consensus standards for analytical validation.

    There is no mention of pathology, clinical outcomes data, or expert consensus interpretation of results as a primary ground truth in this analytical performance section.

    8. Sample Size for the Training Set

    Not applicable. This document describes the analytical validation of laboratory assays, not a machine learning model that requires a "training set" in the same sense. The assay works based on established biochemical principles (electrochemiluminescence immunoassay, ECLIA, utilizing specific antibodies) and wet-lab procedures, not on learned patterns from a large dataset. Reagent formulation and process optimization might involve internal development data, but it's not a "training set" in the AI/ML context.

    9. How the Ground Truth for the Training Set was Established

    Not applicable, as there is no "training set" for an AI/ML model in this context. The "ground truth" for the development of the assay itself would be based on fundamental analytical chemistry principles, extensive laboratory testing, control materials, and established reference methods for quantifying NT-proBNP.

    Ask a Question

    Ask a specific question about this device

    K Number
    K211199
    Date Cleared
    2021-11-08

    (200 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The Tosoh ST AIA-PACK BNP assay is designed for IN VITRO DIAGNOSTIC USE ONLY for the quantitative measurement of BNP in human (K2EDTA) plasma on Tosoh AIA System Analyzers. BNP is used as an aid in the diagnosis of heart failure (HF) in patients presenting to the emergency department (ED) with clinical suspicion of new onset HF, acutely decompensated or exacerbated HF.

    Device Description

    The ST AIA-PACK BNP is a two-site immunoenzymometric assay which is performed entirely in the ST AIA-PACK BNP test cups. BNP present in the test sample is bound with monoclonal antibody immobilized on magnetic beads and enzyme-labeled monoclonal antibody. The magnetic beads are washed to remove unbound enzyme-labeled monoclonal antibody and are then incubated with a fluorogenic substrate, 4-methylumbelliferyl phosphate (4MUP). The amount of enzyme-labeled monoclonal antibody that binds to the beads is directly proportional to the BNP concentration in the test sample. A standard curve is constructed, and unknown sample concentrations are calculated using the curve.

    AI/ML Overview

    The Tosoh ST AIA-PACK BNP Assay aims to extend its measuring interval beyond 2,000 pg/mL through manual and automated 1:5 and 1:10 dilutions. This is a special 510(k) submission, meaning it refers to a device that has already been cleared (K192380) and modifications were made to it that do not significantly alter its performance or safety (e.g., modified firmware, revised labeling, minor material changes, etc.). Since this is a Special 510(k) and not a de novo submission, this document is a justification for substantial equivalence to its predicate device (K192380), and does not contain detailed information for how the predicate device was evaluated.

    Therefore, the study summary below only refers to the performance of the modified device compared to its predicate device, and not a full evaluation of the predicate device's performance.


    1. Acceptance Criteria and Reported Device Performance

    Acceptance CriteriaReported Device Performance
    Manual Dilution
    Recovery value102% for 1:5 dilution
    Recovery value100% for 1:10 dilution
    On-board (automated) Dilution
    Recovery value95% for 1:5 dilution
    Recovery value95% for 1:10 dilution

    2. Sample size used for the test set and data provenance

    The document does not specify the sample sizes used for the manual and automated dilution studies, nor does it specify the country of origin or whether the data was retrospective or prospective.

    3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts

    Not applicable. The ground truth method does not involve human experts; it relies on direct measurement.

    4. Adjudication method for the test set

    Not applicable. The ground truth method does not involve human experts.

    5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, if so, what was the effect size of how much human readers improve with AI versus without AI assistance

    Not applicable. This is not an AI-assisted diagnostic device, but an in vitro diagnostic assay.

    6. If a standalone (i.e., algorithm only without human-in-the-loop performance) was done

    Not applicable. This device is an in vitro diagnostic assay, not an algorithm.

    7. The type of ground truth used

    The ground truth used for both manual and automated dilution studies is the recovery value of the BNP concentration after dilution, indicating the accuracy of the dilution process. This is a direct measurement based on the expected concentration after dilution.

    8. The sample size for the training set

    Not applicable. This device is an in vitro diagnostic assay and does not involve machine learning or a training set.

    9. How the ground truth for the training set was established

    Not applicable. This device is an in vitro diagnostic assay and does not involve machine learning or a training set.

    Ask a Question

    Ask a specific question about this device

    K Number
    K201312
    Date Cleared
    2021-10-04

    (504 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    For in vitro diagnostic use only.

    For the quantitative measurement of N-terminal pro Brain Natriuretic Peptide (NT-proBNP) in human serum and plasma (K2 EDTA or Lithium Heparin) using the VITROS 3600 Immunodiagnostic System to aid in the diagnosis of heart failure. The test can also be used in the assessment of heart failure severity in patients diagnosed with heart failure.

    Device Description

    The VITROS NT-proBNP II test is performed using the VITROS VITROS NT-proBNP II Reagent Pack and the VITROS NT-proBNP II Calibrators on the VITROS Systems.

    The VITROS NT-proBNP II test utilizes a one-step immunometric bridging assay design. A well is pushed from the pack and patient sample is dispensed into the antibody coated well. The assay reagent and the conjugate reagent are then dispensed into the well with the patient sample. NT-proBNP present in the sample binds with horseradish peroxidase (HRP)-labeled antibody conjugate which is captured by biotinylated anti-NT-proBNP capture antibody which is bound to Streptavidin coated microwells. The well is incubated for 8 minutes, before unbound materials are removed by washing.

    The bound HRP conjugate is measured by a luminescent reaction. A reagent containing luminogenic substrate (a luminol derivative and a peracid salt) and an electron transfer agent, is added to the wells. The HRP in the bound conjugate catalyzes the oxidation of the luminol derivative, producing light. The electron transfer agent (a substituted acetanilide) increases the level of light produced and prolongs its emission. The light signals are read by the System. The amount of HRP conjugate bound is directly proportional to the concentration of NT-proBNP present.

    AI/ML Overview

    The provided document describes the analytical and clinical performance of the VITROS Immunodiagnostic Products NT-proBNP II Reagent Pack, an in vitro diagnostic device used to aid in the diagnosis and assessment of heart failure.

    Here's an analysis of the acceptance criteria and the study proving the device meets them:

    1. Table of acceptance criteria and the reported device performance

    The document does not explicitly present a "table of acceptance criteria" in terms of pre-defined thresholds for performance metrics that the device must meet for clearance. Instead, it describes the design goals and then reports the observed performance. For clarity, I will create a table summarizing the reported performance, which implicitly indicates the criteria were met or exceeded for FDA clearance.

    Test CategorySpecific Test / MetricAcceptance Criteria (Implicit/Design Goal from predicate or general IVD standards)Reported Device Performance
    Analytical Performance
    PrecisionRepeatability (Within-run) %CVTypically < 10% for quantitative assays; lower for low concentrations.Range: 1.0% - 2.1% (SD 0.52 to 370)
    Within-calibration %CVTypically < 15%Range: 1.9% - 5.0% (SD 1.63 to 710)
    Within-lab %CVTypically < 20%Range: 2.4% - 5.7% (SD 1.91 to 730)
    Total Precision %CV (additional analysis)Consistent with product performance expectations.Range: 2.62% - 8.96% (SD 2.86 to 761)
    Limit of Detection (LoD)Designed to be <= 30.0 pg/mL0.46 pg/mL (Observed); 0.49 pg/mL (Claimed)
    Limit of Quantitation (LoQ)Designed to be <= 30.0 pg/mL at 20% CV0.46 pg/mL at 20% CV (Observed); 20.0 pg/mL (Claimed, to maintain linearity)
    LinearityAcross measuring rangeLinear over the measuring rangeLinear from 20.0 to 30,000 pg/mL
    Matrix ComparisonSerum vs. EDTA plasma vs. Lithium Heparin plasmaNo significant effect, meet acceptance criteria.All three tube types suitable for use (met acceptance criteria).
    Analytical SpecificityBias from common substances < 10%No bias > 10% observed for most tested compounds.Specific interferents (Cefoxitin sodium, Sodium Azide) showed >10% bias.
    Cross-ReactivityVarious related peptides (e.g., ANP, proBNP, BNP32)Low cross-reactivity desired.Ranges from <1.0% to 39.1% (non-glycosylated proBNP). Most are <1%.
    High Dose Hook EffectNo effect up to X concentrationNo high dose hook effect up to 300,000 pg/mL.
    Clinical Performance
    Aid in HF Diagnosis (ED Setting)AUC (Overall)Not explicitly stated, but typically high (e.g., > 0.85 or 0.90 for this type of test)0.920 (95% CI: 0.909-0.931)
    AUC (Age-stratified)Ranged from 0.904 to 0.954
    AUC (Clinical Subgroups)Ranged from 0.899 to 0.945
    Posttest Probability of HF (Positive result)High positive predictive value/posttest probability for rule-in.Range: 80.4% - 85.7% across age groups
    Posttest Probability of non-HF (Negative result)High negative predictive value/posttest probability for rule-out.Range: 96.5% - 98.3% across age groups
    Likelihood Ratio Positive (LR+)High (e.g., > 5-10 for strong rule-in)Range: 4.52 - 6.84 across age groups
    Likelihood Ratio Negative (LR-)Low (e.g., < 0.1-0.2 for strong rule-out)Range: 0.01 - 0.05 across age groups
    Aid in HF Diagnosis (Outpatient Setting)AUC (Overall)0.880 (95% CI: 0.822 to 0.937)
    AUC (Clinical Subgroups)Ranged from 0.838 to 0.940
    Sensitivity (Rule-out cutoff 125 pg/mL)High (for rule-out, e.g., >90%)91.7% (44/48)
    Specificity (Rule-out cutoff 125 pg/mL)Reasonable (for rule-out, may be lower)67.2% (490/729)
    NPV (Rule-out cutoff 125 pg/mL)High (for rule-out, e.g., >90%)99.2% (490/494)
    PPV (Rule-out cutoff 125 pg/mL)(for rule-out, may be lower)15.6% (44/283)
    Correlation with NYHAStatistical significance of relationship with HF severityStatistically significant trend.Jonckheere-Terpstra test p < 0.0001 (statistically significant correlation).

    2. Sample size used for the test set and the data provenance

    • ED Setting (Diagnosis of Heart Failure):
      • Sample Size: 2200 subjects.
      • Data Provenance: Multi-center prospective study, 20 collection sites across the United States.
    • Outpatient Setting (Diagnosis of Heart Failure):
      • Sample Size: 777 subjects.
      • Data Provenance: Multi-center prospective study, 10 collection sites across the United States.
    • Correlation with NYHA Functional Classification:
      • Sample Size: 1143 subjects with heart failure.
      • Data Provenance: Not explicitly stated if this was a separate collection or a subset of the ED/outpatient studies, but given it's "subjects with heart failure," it likely derives from similar clinical populations.

    3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts

    • For both the ED and Outpatient settings: The final clinical diagnosis (ground truth) was adjudicated by independent cardiologists or ED physicians experienced in diagnosing HF.
    • Number of Experts: Not specified. The document only mentions "independent cardiologists or ED physicians," implying more than one, but not a precise number.
    • Qualifications: "Experienced in diagnosing HF." No specific number of years of experience or board certifications are mentioned.

    4. Adjudication method (e.g. 2+1, 3+1, none) for the test set

    • The document states that the final clinical diagnosis was "adjudicated by independent cardiologists or ED physicians experienced in diagnosing HF." It does not specify a quantitative adjudication method like "2+1" or "3+1." This suggests that the adjudication process was qualitative and relied on the clinical expertise of the adjudicators to reach a consensus diagnosis for each patient, rather than a fixed number of readers and a tie-breaking rule.

    5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

    • No, a MRMC comparative effectiveness study was not done. This document describes the performance validation of an in vitro diagnostic (IVD) immunoassay kit (VITROS NT-proBNP II Reagent Pack), which measures blood biomarkers. This is not an AI/imaging device where human readers would typically be assisted by AI. The product is a laboratory test, and its performance is evaluated based on its accuracy in measuring the biomarker and its diagnostic utility compared to clinical truth.

    6. If a standalone (i.e. algorithm only without human-in-the loop performance) was done

    • Yes, this is effectively a standalone performance study. The VITROS NT-proBNP II Reagent Pack provides a quantitative measurement of NT-proBNP. Its performance is assessed purely on the analytical accuracy of the measurement and its direct correlation with clinical outcomes (diagnosis of HF, NYHA classification) as determined by the expert adjudicators, without human interpretation of the device's numerical output for diagnosis (i.e., it's a direct measurement, not an interpretative AI). The output values are then interpreted against established cutoffs to aid in diagnosis, but the device itself does not involve a human-in-the-loop for its direct performance.

    7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.)

    • The primary ground truth for heart failure diagnosis was expert clinical judgment/adjudication. For the diagnostic studies, "The final clinical diagnosis was adjudicated by independent cardiologists or ED physicians experienced in diagnosing HF."
    • For the assessment of heart failure severity, the ground truth was New York Heart Association (NYHA) Functional Classification, which is a clinical classification based on patient symptoms and physical activity levels.

    8. The sample size for the training set

    • This document describes the validation of an in vitro diagnostic device (a reagent pack and system) for measuring a biomarker, not a machine learning or AI algorithm in the typical sense that would involve a "training set" for model development. The assays (immunoassays) are based on chemical reactions rather than statistical models trained on large datasets.
    • The "training" of such a device primarily involves rigorous analytical development and characterization, ensuring the chemical processes are precise and accurate. Therefore, the concept of a separate "training set" with ground truth data, as used in AI/ML, is not directly applicable here. The document focuses on the performance validation of the developed assay.

    9. How the ground truth for the training set was established

    • As noted above, the concept of a "training set" with ground truth for an AI/ML algorithm doesn't directly apply to this type of IVD immunoassay. The analytical methods and performance characteristics (precision, linearity, LoD, LoQ, analytical specificity, etc.) are established through laboratory experiments and characterization studies, not by training on a clinical dataset with ground truth in the AI/ML sense. Clinical performance is then validated against independently established clinical diagnoses (expert adjudication).
    Ask a Question

    Ask a specific question about this device

    K Number
    K192380
    Device Name
    ST AIA-PACK BNP
    Date Cleared
    2020-08-24

    (360 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The Tosoh ST AIA-PACK BNP assay is designed for IN VITRO DIAGNOSTIC USE ONLY for the quantitative measurement of BNP in human K2EDTA plasma on Tosoh AIA System analyzers. BNP is used as an aid in the diagnosis of heart failure (HF) in patients presenting to the emergency department (ED) with clinical suspicion of new onset HF, acutely decompensated or exacerbated HF.

    Device Description

    The ST AIA-PACK BNP is a two-site immunoenzymometric assay which is performed entirely in the ST AIA-PACK BNP test cups. BNP present in the test sample is bound with monoclonal antibody immobilized on magnetic beads and enzyme-labeled monoclonal antibody. The magnetic beads are washed to remove unbound enzyme-labeled monoclonal antibody and are then incubated with a fluorogenic substrate, 4-methylumbelliferyl phosphate (4MUP). The amount of enzyme-labeled monoclonal antibody that binds to the beads is directly proportional to the BNP concentration in the test sample. A standard curve is constructed, and unknown sample concentrations are calculated using the curve.

    AI/ML Overview

    The provided text describes the performance characteristics and clinical study results for the ST AIA-PACK BNP assay, an in vitro diagnostic device. This device is intended for the quantitative measurement of BNP in human K2EDTA plasma as an aid in the diagnosis of heart failure (HF).

    Here's an analysis of the acceptance criteria and the study proving the device meets these criteria, based on the provided document:

    1. A table of acceptance criteria and the reported device performance

    The document does not explicitly state "acceptance criteria" in a table format with pass/fail thresholds. Instead, it presents performance data from various analytical and clinical studies. We can infer performance parameters that would typically be subject to acceptance criteria in such a submission.

    Performance CharacteristicAcceptance Criteria (Inferred)Reported Device Performance
    Analytical Performance
    PrecisionCV% within acceptable range across various concentrations and sources of variationCombined Lots (n=240):K2EDTA Plasma-1 (mean 10.588 pg/mL): Total CV 10.8%K2EDTA Plasma-2 (mean 49.873 pg/mL): Total CV 3.6%K2EDTA Plasma-3 (mean 106.718 pg/mL): Total CV 3.0%K2EDTA Plasma-4 (mean 519.429 pg/mL): Total CV 5.1%K2EDTA Plasma-5 (mean 1050.712 pg/mL): Total CV 6.1%(Detailed SD and CV% for within run, between run, between day, between lot are provided for each of 3 lots and combined data, generally showing good precision.)
    Linearity/Reportable RangeAssay demonstrated to be linear over the stated rangeLinear from 4.0 to 2000 pg/mL
    Detection Limit (LoD)LoD within acceptable clinical rangeLoD = 1.9 pg/mL
    Quantitation Limit (LoQ)LoQ within acceptable clinical rangeLoQ = 3.5 pg/mL
    Analytical Specificity (Interference)Interference due to common substances and cross-reactants < +/- 10% recovery (or other relevant thresholds)Common Substances: Hemoglobin, Bilirubin (Unconjugated/Conjugated), Lipemia, Total Protein, Ascorbic Acid, Rheumatoid Factor, Human IgG, Cholesterol, Creatinine, Alkaline Phosphatase, HAMA IgG showed no interference (within +/- 10% recovery) at specified concentrations.Cross Reactivity: Reported % Cross Reactivity for various related peptides, generally very low.Therapeutics: 50+ therapeutic agents tested, % recovery for each was within 100+/-10% of the control.
    TraceabilityDemonstrated traceability to internal reference standardsCompared to internal reference standards; no international consensus reference method/material exists.
    StabilitySupport for stated shelf-lifeSupported 12-month shelf life.
    Clinical Performance
    Overall Performance (at 100 pg/mL cutoff)High sensitivity and specificity for HF diagnosisSensitivity: 88.4% (95% CI: 84.5-91.5%)Specificity: 70.6% (95% CI: 66.0-74.9%)PPV: 71.5%NPV: 88.0%Concordance: 78.7%AUC: 0.881

    2. Sample sized used for the test set and the data provenance

    • Test Set Sample Size:
      • Analytical Performance:
        • Precision: 6 K2EDTA plasma samples tested, each at n=80 per lot across 3 lots (total n=240 for combined lot analysis).
        • Linearity: 14 samples ranging from 3.1 - 2271.5 pg/mL.
        • Detection/Quantitation Limit: 11 low-level samples tested in 10 replicates each (2 replicates over 5 days).
        • Interference: K2EDTA samples with known BNP concentrations spiked with various interferents/cross-reactants. Specific N for each interferent test not explicitly stated but implied multiple samples.
      • Clinical Performance:
        • Number of samples assayed: 825
        • Number of samples used for analysis (test set): 724 (101 excluded due to hemolysis, severe renal insufficiency, etc.)
    • Data Provenance:
      • Country of Origin: Not explicitly stated for analytical samples. For clinical study: Samples were collected from patients presenting to Emergency Departments (EDs) at 8 clinical sites. The specific country is not mentioned, but given the FDA submission, it is likely the US or a region adhering to similar clinical trial standards.
      • Retrospective or Prospective:
        • Analytical performance: Appears to be prospective testing (e.g., precision study conducted at one site with specified reagents/analyzers, linearity study explicitly designed).
        • Clinical Performance: Prospective study, as it states "The prospective study enrolled male and female patients from 8 clinical sites comprised of Emergency Departments (ED)."

    3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts

    • Number of Experts: An independent central adjudication panel comprised of 4 expert cardiologists and 1 ER physician.
    • Qualifications of Experts: Expert Cardiologists and an ER Physician. Their specific years of experience are not stated, but their designation as "expert" and the role in an adjudication panel for a clinical trial implies significant experience and qualifications in their respective fields related to heart failure diagnosis.

    4. Adjudication method for the test set

    • Adjudication Method: Diagnosis of HF or non-HF was determined by an independent central adjudication panel. The panel had access to patient CRFs and clinical information (including echocardiography, other cardiac/thoracic imaging, and standard of care BNP or NT-proBNP results if available). They were blinded to the attending physician's final diagnosis and NYHA classification. The document states that the adjudication was done "to ensure standardization and accuracy of diagnosis per 2013 ACCF/AHA Guidelines for Management of HF." This indicates a consensus-based approach based on comprehensive clinical data, supervised by multiple blinded experts.

    5. If a multi reader multi case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

    • No, an MRMC comparative effectiveness study was not done. This submission is for an in vitro diagnostic (IVD) device (a blood test for BNP), not an AI-assisted diagnostic imaging device. Therefore, the concept of "human readers improving with AI vs without AI assistance" does not apply in this context. The study evaluates the performance of the assay itself compared to a ground truth established by clinical adjudication.

    6. If a standalone (i.e. algorithm only without human-in-the-loop performance) was done

    • Yes, a standalone performance was done. The clinical study evaluates the performance of the "ST AIA-PACK BNP assay" itself to quantitatively measure BNP and its ability to aid in the diagnosis of HF. The results (sensitivity, specificity, AUC) are presented as the performance of the device at the given cutoff, without human interpretation of the device's output influencing the performance metrics being reported. The human element (adjudication panel) was used to establish the ground truth for diagnosis, against which the device's measurements were compared.

    7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.)

    • Type of Ground Truth: The ground truth for the clinical study was established by expert consensus based on comprehensive clinical information and per established guidelines. Specifically, the "Diagnosis of HF or non- HF was determined by an independent central adjudication panel (comprised of 4 expert cardiologists and 1 ER physician) ... per 2013 ACCF/AHA Guidelines for Management of HF." They had access to patient CRFs, echocardiography, other cardiac/thoracic imaging, and standard of care BNP/NT-proBNP results. This represents a robust form of expert consensus based on extensive clinical data.

    8. The sample size for the training set

    • The document does not explicitly mention a separate "training set" for the device's development. The "Performance Characteristics" section details analytical and clinical validation studies. For a quantitative IVD like this, the "training" aspect is typically related to the assay's development, calibration, and optimization of reagents and protocols, rather than an explicit "training set" used in machine learning. The clinical study described is a validation study (test set).

    9. How the ground truth for the training set was established

    • As a specific "training set" for algorithmic learning is not mentioned (as this is a biochemical assay), the concept of ground truth for a training set in that sense does not apply directly. However, the development of such an assay involves extensive work to ensure its accuracy and reliability across the dynamic range, linearity, precision, and minimizing interference. This often uses well-characterized, sometimes synthetic or spiked, samples with known concentrations to inform the assay's design and calibration. The "Traceability" section mentions that "The ST AIA-PACK BNP CALIBRATOR SET contains assigned concentrations of BNP. The assigned value is determined on a lot-by-lot basis and is designed to provide an assay calibration range of 4.0 to 2,000 pg/mL of BNP. The calibrators in this set are prepared gravimetrically and are compared to internal reference standards." This implies the "ground truth" for the internal calibration and ongoing quality control of the assay is established through gravimetric preparation and comparison to internal reference standards.
    Ask a Question

    Ask a specific question about this device

    K Number
    K092649
    Date Cleared
    2010-02-04

    (160 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    Immunoassay for the in vitro quantitative determination of N-terminal pro-brain natriuretic peptide in human serum and plasma. The Elecsys proBNP II STAT assay is used as an aid in the diagnosis of individuals suspected of having congestive heart failure. The test is further indicated for the risk stratification of patients with acute coronary syndrome and congestive heart failure. The test may also serve as an aid in the assessment of increased risk of cardiovascular events and mortality in patients at risk for heart failure who have stable coronary artery disease.

    The electrochemiluminescence immunoassay "ECLIA" is intended for use on the cobas e 601 immunoassay analyzer.

    Device Description

    The proBNP II Assay is a sandwich immunoassay with two antibodies directed towards epitopes within the N-terminal portion of the proBNP molecule. The capture antibody is biotinylated to react with streptavidin-coated microparticles. The signal antibody is tagged with ruthenium. Both antibodies are monoclonal. The electrochemiluminescence immunoassay "ECLIA" is intended for use on the cobas e 601 immunoassay analyzer.

    AI/ML Overview

    Acceptance Criteria and Device Performance Study for Elecsys proBNP II STAT Immunoassay

    This document describes the acceptance criteria and the study that demonstrates the Elecsys proBNP II STAT Immunoassay meets these criteria, based on the provided 510(k) summary.

    1. Table of Acceptance Criteria and Reported Device Performance

    The acceptance criteria for the Elecsys proBNP II STAT Immunoassay are primarily demonstrated by showing substantial equivalence to the predicate device, the Elecsys proBNP II Assay (K072437), with key performance characteristics either matching or falling within acceptable ranges. The primary difference is the faster "STAT" application time and limited instrument platform.

    FeatureAcceptance Criteria (Predicate Device K072437)Reported Device Performance (Elecsys proBNP II STAT Assay)
    Intended Use/Indications for UseImmunoassay for the in vitro quantitative determination of N-terminal pro-brain natriuretic peptide in human serum and plasma. Aid in diagnosis of individuals suspected of congestive heart failure. Risk stratification of patients with acute coronary syndrome and congestive heart failure. Aid in assessment of increased risk of cardiovascular events and mortality in patients at risk for heart failure with stable coronary artery disease. Intended for use on Elecsys and cobas e immunoassay analyzers.Same, except: Assay name changed to "Elecsys proBNP II STAT assay". Intended for use only on the cobas e 601 immunoassay analyzer.
    Assay ProtocolSandwich assaySame
    Detection ProtocolElectrochemiluminescent ImmunoassaySame
    Application Time18 MinuteSTAT (9 Minute) - This is a key improvement/change.
    Instrument PlatformRoche Elecsys 2010/cobas e 411 and MODULAR ANALYTICS E170/cobas e 601cobas e 601 - Limited to this specific platform.
    Sample Volume15 µLSame
    Sample TypeHuman serum and plasma treated with K2-EDTA, K3-EDTA, lithium heparin and Na-heparin plasma.Same
    ReagentsSandwich immunoassay with two monoclonal antibodies (biotinylated capture (streptavidin-coated microparticles) and ruthenium-tagged signal) directed towards epitopes within the N-terminal portion of the proBNP molecule.Same antibodies and same epitopes.
    Traceability/StandardizationStandardized against the Elecsys proBNP assay (K022516), which was standardized against reference standards by weighing pure synthetic NT-proBNP (1-76 amino acids) into equine serum matrix.Same.
    CalibratorElecsys proBNP II CalSet (K072437)Elecsys proBNP II STAT CalSet (different material number, identical stability, value assignment, and matrix).
    Calibration IntervalOnce per reagent lot (fresh reagent, <24h since registration). Renewed calibration recommended: MODULAR ANALYTICS E170, Elecsys 2010 and cobas e analyzers: After 1 month (28 days) when using same reagent lot; After 7 days (when using same reagent kit). Elecsys 1010 analyzer: With every reagent kit; After 7 days (ambient 20-25°C); After 3 days (ambient 25-32°C).Once per reagent lot (fresh reagent, <24h since registration). Renewed calibration recommended: cobas e 601: After 1 month (28 days) when using same reagent lot; After 7 days (when using same reagent kit).
    ControlsElecsys PreciControl Cardiac II (K072437).Same.
    Reagent Stability/StorageUnopened: 2-8°C until expiration. After opening: 2-8°C—12 weeks. On MODULAR ANALYTICS E170 and cobas e 601—8 weeks. On Elecsys2010 and cobas e 411—8 weeks.Same, but only reported on the cobas e 601.
    Measuring Range5-35,000 pg/mLSame
    Precision (cobas e 601 - predicate)Within-run (Repeatability): 1.9% CV @ 64.0 pg/mL, 1.5% CV @ 124.0 pg/mL, 1.3% CV @ 14142.0 pg/mL, 1.8% CV @ 77.0 pg/mL, 1.2% CV @ 2105.0 pg/mL. Total (Intermediate): 3.1% CV @ 46.0 pg/mL, 2.7% CV @ 125.0 pg/mL, 2.7% CV @ 32805.0 pg/mL, 2.7% CV @ 77.0 pg/mL, 2.7% CV @ 2170.0 pg/mL.Precision (cobas e 601 - STAT assay)Within-run (Repeatability): 2.4% CV @ 130.0 pg/mL, 2.2% CV @ 4942.0 pg/mL, 3.5% CV @ 59.0 pg/mL, 2.0% CV @ 142.0 pg/mL, 1.8% CV @ 522.0 pg/mL, 1.9% CV @ 934.5 pg/mL, 2.0% CV @ 6552.0 pg/mL, 2.9% CV @ 30,870.0 pg/mL. Total (Intermediate): 2.5% CV @ 130.0 pg/mL, 2.6% CV @ 4942.0 pg/mL, 3.5% CV @ 59.0 pg/mL, 2.5% CV @ 142.0 pg/mL, 2.0% CV @ 522.0 pg/mL, 2.5% CV @ 934.5 pg/mL, 2.3% CV @ 6552.0 pg/mL, 5.4% CV @ 30,870.0 pg/mL. Performance is comparable or improved across the measuring range, especially considering the STAT nature.
    Analytical SensitivityLimit of Detection (LoD): 5.00 pg/mL. Functional Sensitivity (LoQ): 50.00 pg/mL. (LoB not reported)Limit of Blank (LoB): 5.00 pg/mL. Limit of Detection (LoD): 5.00 pg/mL. Functional Sensitivity (LoQ): 50.00 pg/mL. This demonstrates equivalent or improved analytical sensitivity.
    Hook EffectNo high-dose hook effect up to 300,000 pg/mL (300 ng/mL).Same.
    Heterophilic Antibody/InterferenceUnaffected by: Hemoglobin ≤ 0.1 g/dL, Bilirubin up ≤ 25 mg/dL, Triglycerides ≤ 1,500 mg/dL, Biotin ≤ 30 ng/mL, Rheumatoid factors ≤ 1,500 IU/mL. No interference with 51 commonly used pharmaceuticals. Minimized risk of interference from immunological interactions with rare sera and monoclonal mouse antibodies. Measures taken to minimize interference from high titers of antibodies to streptavidin and ruthenium.Same.
    Method Comparison (Correlation between STAT and Predicate)N/A (this is the study's objective)Passing/Bablok: Slope 0.992, Intercept -2.141, Tau/r/rho 0.987. Linear Regression: Slope 0.986, Intercept 13.964, r 0.998.Deming Regression: Slope 0.988, Intercept 8.83, rho 1.00.These results demonstrate strong correlation between the two assays, indicating substantial equivalence in measurement.

    2. Sample Size Used for the Test Set and Data Provenance

    • Test Set Sample Size: n = 120 patient samples were used for the method comparison study.
    • Data Provenance: The document does not explicitly state the country of origin or whether the data was retrospective or prospective. However, based on the context of a 510(k) submission seeking equivalence to a previously approved device, these data are typically generated in controlled laboratory settings (often prospective testing of patient samples) by the manufacturer or authorized agents.

    3. Number of Experts Used to Establish Ground Truth and Qualifications

    • Number of Experts: Not applicable. This device is an immunoassay measuring a biomarker (NT-proBNP) quantitatively. The "ground truth" for the test set values is considered the measurements obtained by the predicate device itself, not an expert consensus on a diagnosis.
    • Qualifications of Experts: Not applicable.

    4. Adjudication Method for the Test Set

    • Adjudication Method: Not applicable. As this is a quantitative immunoassay comparison, there is no "adjudication" in the sense of independent expert review of cases. The comparison is statistical, comparing the results of the new device against the predicate device.

    5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

    • MRMC Study: No, a Multi-Reader Multi-Case (MRMC) comparative effectiveness study was not done. This type of study is relevant for diagnostic imaging or subjective interpretation tasks where human readers' performance is being evaluated, possibly with AI assistance. For a quantitative immunoassay, the performance is assessed by comparing its measurements to a reference method (the predicate device) and evaluating its analytical characteristics (precision, sensitivity, range).
    • Effect Size of Human Readers with/without AI assistance: Not applicable.

    6. Standalone Performance Study

    • Standalone Study: Yes, a standalone performance study was done for the Elecsys proBNP II STAT Assay. The "method comparison" section directly shows the performance of the new device alone against the predicate device on patient samples (n=120) and demonstrates its analytical performance characteristics (precision, analytical sensitivity, measuring range, hook effect, limitations) independently. The core of this 510(k) summary is to demonstrate that the device itself performs equivalently to the predicate to warrant a "substantially equivalent" classification.

    7. Type of Ground Truth Used

    • Type of Ground Truth: The "ground truth" for the method comparison study was the measurements obtained from the existing, legally marketed predicate device (Elecsys proBNP II Assay, K072437). This is a common approach in 510(k) submissions for in vitro diagnostic (IVD) devices seeking substantial equivalence where a well-established predicate device exists. The predicate device's measured values are treated as the reference standard for comparison.

    8. Sample Size for the Training Set

    • Training Set Sample Size: The document does not explicitly mention a separate "training set" or its size in the context of typical machine learning models. This is an immunoassay, not an AI/ML-based device in the common understanding. The development and optimization of such assays typically involve iterative laboratory testing and refinement of reagents, protocols, and instrument parameters, which could be considered an internal "training" process, but not in the sense of a distinct, labeled dataset for algorithm training.

    9. How the Ground Truth for the Training Set Was Established

    • Ground Truth for Training Set: Not applicable in the conventional sense of AI/ML training. The "ground truth" for developing and optimizing the assay itself would be based on established analytical chemistry principles, characterization of assay components (antibodies, calibrators), and performance against known reference materials or other validated laboratory methods. This iterative process leads to the final assay formulation and protocol, which is then formally validated using the test set described above against the predicate.
    Ask a Question

    Ask a specific question about this device

    K Number
    K082645
    Date Cleared
    2008-09-22

    (11 days)

    Product Code
    Regulation Number
    862.1117
    Reference & Predicate Devices
    Predicate For
    N/A
    AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
    Intended Use

    The NTP method is an in vitro diagnostic assay for the quantitative measurement of N-terminal pro-brain natriuretic peptide (NT-proBNP) in human serum and plasma on the Dimension® EXLTM integrated chemistry system with LOCI® Module. In individuals suspected of having congestive heart failure (CHF), measurements of NT-proBNP are used as an aid in the diagnosis and assessment of severity. The test is further indicated for the risk stratification of patients with acute coronary syndrome and heart failure.

    Device Description

    The EXL NTP method is a one-step sandwich chemiluminescent immunoassay based on LOCI® technology, LOCI® reagents include two synthetic bead reagents and a biotinylated monoclonal antibody fragment which recognize an epitope located in the N-terminal part of proBNP. The first bead reagent (Sensibeads) is coated with streptavidin and contains photosensitive dye. The second bead reagent (Chemibeads) is coated with a second antibody specific for a second independent epitope on NTproBNP and contains chemiluminescent dye. Sample is incubated with Chemibeads and biotinylated antibody to form a particle/NT-proBNP/biotinylated antibody sandwich. Sensibeads then are added and bind to the biotin to form a bead-aggregated immunocomplex. Illumination of the complex by light at 680 nm generates singlet oxygen from Sensibeads, which diffuses to the Chemibeads and triggers a chemiluminescent reaction. The resulting chemiluminescent signal is measured at 612 nm and is directly proportional to the concentration of NT-proBNP in the sample.

    AI/ML Overview

    Here's a breakdown of the acceptance criteria and study information for the Dimension® EXL™ NTP Flex® Reagent Cartridge (RF623), based on the provided text:

    1. Table of Acceptance Criteria and Reported Device Performance:

    The document establishes substantial equivalence by comparing the performance characteristics of the Dimension® EXL™ NTP Flex® Reagent Cartridge (RF623) to its predicate device, the Dimension Vista® PBNP Flex® reagent cartridge (K6423A). For each characteristic, the acceptance criterion is implicitly the performance of the predicate device, which the new device matches.

    FeatureAcceptance Criteria (Predicate Device Performance)Reported Device Performance (Dimension® EXL™ NTP Flex® RF623)
    Intended UseQuantitative measurement of N-terminal pro-brain natriuretic peptide (NT-proBNP) in human serum and plasma for diagnosis and assessment of CHF severity, and risk stratification for acute coronary syndrome and heart failure.Matches predicate's intended use (see detailed description in the document).
    Device TechnologyChemiluminescentChemiluminescent
    Measuring Range5 - 35,000 pg/mL5 - 35,000 pg/mL
    AntibodyMonoclonal Sheep AntibodyMonoclonal Sheep Antibody
    Cut-off125 pg/mL for < 75 years old; 450 pg/mL for ≥ 75 years old125 pg/mL for < 75 years old; 450 pg/mL for ≥ 75 years old
    Analytical Sensitivity≤5 pg/mL≤5 pg/mL
    Functional Sensitivity≤30 pg/mL≤30 pg/mL
    Analytical SpecificityNo significant cross-reactivity with Natrecor® (0 or 125 pg/mL NT-PBNP) and sixteen other substances.No significant cross-reactivity with Natrecor® (0 or 125 pg/mL NT-PBNP) and sixteen other substances.
    InterferencesNo significant interference from: Bilirubin conjugated up to 60 mg/dL, Bilirubin unconjugated up to 60 mg/dL, Hemoglobin up to 1000 mg/dL, Triglyceride up to 3000 mg/dL.No significant interference from: Bilirubin conjugated up to 60 mg/dL, Bilirubin unconjugated up to 60 mg/dL, Hemoglobin up to 1000 mg/dL, Triglyceride up to 3000 mg/dL.
    Hook EffectNo effect up to 400,000 pg/mLNo effect up to 400,000 pg/mL
    Calibration Interval30 days, same reagent lot30 days, same reagent lot
    Sample Volume8 µL8 µL

    Study Proving Device Meets Acceptance Criteria:

    The document states: "Comparative testing performed by Siemens Healthcare Diagnostics Inc. demonstrates substantial equivalent performance." This indicates that a study was conducted to show the new device performs comparably to the predicate device across the characteristics listed above.

    2. Sample Size Used for the Test Set and Data Provenance:

    The document does not explicitly state the sample size used for the comparative testing. It also does not specify the data provenance (e.g., country of origin, retrospective or prospective nature of the sample collection).

    3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts:

    This information is not provided in the document. For an in vitro diagnostic (IVD) test measuring a biomarker like NT-proBNP, the "ground truth" for the test results themselves would typically be the measurement obtained from a calibrated reference method or the predicate device, not necessarily established by human experts in the same way as, for example, image interpretation. However, the ground truth for the clinical conditions (CHF, acute coronary syndrome) mentioned in the intended use would be established by medical experts (e.g., cardiologists, emergency physicians) based on clinical evaluation, imaging, and other diagnostic tests. The document does not detail how the clinical ground truth for any patient samples used in the "comparative testing" was established, nor the qualifications of those experts.

    4. Adjudication Method for the Test Set:

    This information is not provided in the document. Given that this is a comparison to a predicate device for an IVD, the "adjudication method" would likely refer to how discrepancies between the new device and predicate device measurements (if any) were resolved, or how the clinical diagnosis as "ground truth" was confirmed, but this is not detailed.

    5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done, and Effect Size:

    No, a Multi-Reader Multi-Case (MRMC) comparative effectiveness study was not done. This type of study is relevant for devices involving human interpretation (e.g., imaging AI). This device is an in vitro diagnostic assay with quantitative output, so an MRMC study is not applicable. The device's "improvement" is in its performance matching the predicate, not in improving human reader performance.

    6. If a Standalone (i.e., algorithm only without human-in-the-loop performance) Was Done:

    Yes, the performance evaluated is inherently "standalone" in the context of an IVD. The "comparative testing" would have assessed the performance of the Dimension® EXL™ NTP Flex® Reagent Cartridge (RF623) directly against the Dimension Vista® PBNP Flex® reagent cartridge (K6423A) without human intervention in the measurement process itself, beyond operating the instruments. The output is a quantitative value from the device.

    7. The Type of Ground Truth Used:

    The ground truth for the device's performance is implicitly the results obtained from the legally marketed predicate device (Dimension Vista® PBNP Flex® reagent cartridge (K6423A)). The document doesn't mention the use of pathology, or direct outcomes data as the primary ground truth for establishing equivalence in analytical performance but rather the established performance of the predicate device.

    8. The Sample Size for the Training Set:

    The document does not specify a training set sample size. For an IVD like this, "training set" is not typically a concept that applies in the same way it does for machine learning algorithms that learn from data. The device's formulation (reagents, antibodies) and measurement principles are established through standard assay development, manufacturing, and validation processes, not through a "training set" in the common AI sense.

    9. How the Ground Truth for the Training Set Was Established:

    As mentioned above, the concept of a "training set" and associated "ground truth" for this type of IVD device is not applicable in the same way it is for AI/ML devices. The "ground truth" for the development and validation of the assay would be based on known concentrations of NT-proBNP standards and controls, as well as samples characterized by a reference method or the predicate device itself during its initial development. The current document focuses on demonstrating substantial equivalence to an existing device, which is a different regulatory pathway than initial device development.

    Ask a Question

    Ask a specific question about this device

    Page 1 of 5