Search Results
Found 27 results
510(k) Data Aggregation
(158 days)
hormone test system | Free thyroxine test system |
| Regulation Number | 21 CFR 862.1690 | 21 CFR 862.1695
The TSHL method is an in vitro diagnostic test for the quantitative measurement of Thyroid Stimulating Hormone (TSH, thyrotropin) in human serum and plasma on the Dimension® EXL™ integrated chemistry system with LOCI® Module. Measurements of TSH are used in the diagnosis and monitoring of thyroid disease.
The FT4L method is an in vitro diagnostic test for the quantitative measurement of Free Thyroxine in human serum and plasma on the Dimension® EXL™ integrated chemistry system with LOCI® Module. Measurements of free thyroxine are used in the diagnosis and monitoring of thyroid disease.
The Dimension® LOCI® Thyroid Stimulating Hormone Flex® reagent cartridge (TSHL) and Dimension® LOCI® Free Thyroxine Flex® reagent cartridge (FT4L) assays were cleared under K081074 and K073604, respectively. The components of the cleared assays were modified to reduce biotin interference.
The modified Assays are comprised of the following components:
Dimension® LOCI® Thyroid Stimulating Hormone Flex® reagent cartridge (TSHL): prepackaged liquid reagents in a plastic eight-well cartridge. Wells 1-2 contain Biotinylated TSH antibody (7.5 µg/mL mouse monoclonal), wells 3-4 contain TSH antibody coated Chemibeads (200 µg/mL mouse monoclonal), and wells 5-6 contain Streptavidin Sensibeads (1400 µg/mL recombinant E. coli). Wells 1-6 contain buffers, stabilizers and preservatives. Wells 7-8 are empty.
Dimension® LOCI® Free Thyroxine Flex® reagent cartridge (FT4L): prepackaged liquid reagents in a plastic eight-well cartridge. Wells 1-2 contain Streptavidin Sensibeads (225 µg/mL recombinant E. coli), wells 3-4 contain T3 Chemibeads (200 µg/mL), and wells 5-6 contain FT4 Biotinylated antibody (50 ng/mL mouse monoclonal). Wells 1-6 contain buffers, stabilizers and preservatives. Wells 7-8 are empty.
Test Principle: Both devices use a homogeneous chemiluminescent immunoassay based on LOCI® technology.
For TSHL, it's a sandwich immunoassay where sample is incubated with biotinylated antibody and Chemibeads to form bead-TSH-biotinylated antibody sandwiches. Sensibeads are added and bind to the biotin to form bead-pair immunocomplexes. Illumination at 680 nm generates singlet oxygen from Sensibeads which diffuses into Chemibeads, triggering a chemiluminescent reaction. The resulting signal is measured at 612 nm and is a direct function of TSH concentration.
For FT4L, it's a sequential immunoassay where sample is incubated with biotinylated antibody. T3 Chemibeads are added and form bead/biotinylated antibody immunocomplexes with the non-saturated fraction of the biotinylated antibody. Sensibeads are then added and bind to the biotin to form bead pair immunocomplexes. Illumination at 680 nm generates singlet oxygen from Sensibeads which diffuses into the Chemibeads, triggering a chemiluminescent reaction. The resulting signal is measured at 612 nm and is an inverse function of FT4 concentration.
The document provided is a 510(k) clearance letter from the FDA for two in-vitro diagnostic (IVD) devices: Dimension® LOCI® Thyroid Stimulating Hormone Flex® reagent cartridge (TSHL) and Dimension® LOCI® Free Thyroxine Flex® reagent cartridge (FT4L). It describes the devices, their intended use, and the performance characteristics tested to demonstrate substantial equivalence to previously cleared predicate devices.
However, it's crucial to understand that this document describes a reagent cartridge, which is a laboratory assay, not an AI/ML-driven device or an imaging device. Therefore, many of the requested criteria (e.g., sample size for training/test sets for AI, data provenance like country of origin for AI, ground truth establishment by experts, adjudication methods, MRMC studies, standalone AI performance) are not applicable to this type of device. The document details the performance of the assay itself in measuring biomarker concentrations, not an AI's ability to interpret images or assist human readers.
I will interpret the request based on the information provided for this specific IVD device, noting where certain requested details are not relevant to the nature of the device.
Acceptance Criteria and Study to Prove Device Meets Criteria (for an IVD Reagent Cartridge)
The device in question, a reagent cartridge for quantitative measurement of TSH and FT4, is a laboratory assay, not an AI/ML or imaging interpretation device. Therefore, the "acceptance criteria" and "study" are focused on analytical performance characteristics (accuracy, precision, linearity, interference, detection limits, etc.) compared to a predicate device, rather than diagnostic accuracy metrics of an AI.
1. Table of Acceptance Criteria and Reported Device Performance
For an IVD reagent cartridge, "acceptance criteria" are typically defined by ranges, limits, or statistical agreementsdemonstrating analytical performance comparable or superior to the predicate device and meeting relevant clinical or analytical standards (e.g., CLSI guidelines). The reported performance demonstrates that the modified devices meet these standards.
Performance Characteristic | Acceptance Criteria (Implicit from CLSI Guidelines/Predicate Comparison) | Reported Device Performance (TSHL) | Reported Device Performance (FT4L) |
---|---|---|---|
Detection Limits | Meet/Be comparable to predicate; within acceptable analytical ranges. | LoB: 0.003 µIU/L | |
LoD: 0.005 µIU/L | |||
LoQ: 0.007 µIU/L | LoB: 0.03 ng/dL | ||
LoD: 0.05 ng/dL | |||
LoQ: 0.06 ng/dL | |||
Linearity / Measuring Interval | Linear across the claimed measuring range with acceptable bias. | 0.007 – 100 µIU/mL | 0.1 – 8.0 ng/dL |
Method Comparison (vs. Predicate) | High correlation (r close to 1), slope close to 1, small y-intercept. | N=145 Serum samples | |
y = 0.99x + 0.039 µIU/mL | |||
(Correlation (r) implicitly high, as regression equation suggests strong agreement) | N=146 Serum samples | ||
y = 1.02x + 0.03 ng/dL | |||
(Correlation (r) implicitly high, as regression equation suggests strong agreement) | |||
Precision (Repeatability) | Within-run and total precision (SD/CV) within acceptable clinical laboratory limits. | TSHL: | |
Levels 0.110-88.676 µIU/mL | |||
Within-Run %CV: 2.6-4.4% | |||
Total %CV: 1.1-3.0% (Note: Table 5 "Total" %CV for Level 1 is 2.6%, matching within-run %CV, but for others, it's lower. This might be a typo in the table, typically Total CV > Within-Run CV). | FT4L: | ||
Levels 0.81-6.41 ng/dL | |||
Within-Run %CV: 2.2-2.6% | |||
Total %CV: 0.9-1.1% | |||
Precision (Reproducibility) | Total reproducibility (SD/CV) across lots and systems within acceptable clinical laboratory limits. | TSHL: | |
Levels 0.094-81.372 µIU/mL | |||
Reproducibility %CV: 4.6-7.6% | FT4L: | ||
Levels 0.70-6.49 ng/dL | |||
Reproducibility %CV: 1.8-2.4% | |||
Recovery (Dilution) | For TSHL, diluted samples should show recovery close to 100% of the true value. | TSHL: | |
Recovery ranged from 100% to 106% for various samples diluted 5x. | N/A (FT4L not described for dilution recovery) | ||
Interference (Biotin) | Modified assay shows significantly reduced interference compared to predicate. | TSHL & FT4L: Specimens with biotin up to 1200 ng/mL demonstrate ≤10% change in results (significant improvement from predicate's 250 ng/mL for TSHL and 100 ng/mL for FT4L). | TSHL & FT4L: Specimens with biotin up to 1200 ng/mL demonstrate ≤10% change in results. |
Reference Range Verification | Results from healthy samples confirm the established reference intervals. | TSHL: Verified for adults (0.358-3.74 µIU/mL) and pediatric populations. | FT4L: Verified for adults (0.76-1.46 ng/dL) and pediatric populations. |
Matrix Comparison | Comparable performance across different sample matrices. | Comparable values to serum samples for lithium heparin, sodium heparin, and K2-EDTA plasma. | Same as TSHL. |
Hook Effect | No significant hook effect within specified range. | No hook effect observed up to 30,000 µIU/mL. | N/A (FT4L not described for hook effect) |
2. Sample Sizes and Data Provenance for the Test Set
The concept of a "test set" in the context of an IVD reagent cartridge refers to the set of samples used for various analytical performance studies. These are not typically split into "training" and "test" sets as in AI/ML.
- Method Comparison:
- TSHL: 145 patient samples (serum)
- FT4L: 146 patient samples (serum)
- Precision (Repeatability): 5 serum samples (TSHL), 3 serum samples (FT4L)
- Precision (Reproducibility): 5 serum samples (TSHL), 3 serum samples (FT4L)
- Linearity: Low and high human serum pools used to create dilution series (TSHL: 12 levels, FT4L: 10 levels)
- Interference (Biotin and HIL): Samples spiked with interferents, specific TSH/FT4 levels tested.
- Dilution Recovery: 7 samples (TSHL)
- Reference Range Verification: "Apparently healthy samples" (specific N not provided, but typically a statistically significant number for verification per CLSI EP28-A3C).
- Matrix Comparison: Samples of various tube types (Serum, lithium heparin, sodium heparin, K2-EDTA plasma)
Data Provenance: The document does not specify the country of origin of the patient samples. The studies are explicitly described as analytical performance studies rather than clinical outcome studies, and they are retrospective (samples tested in the lab, not followed prospectively).
3. Number of Experts and Qualifications for Ground Truth
This is not applicable as the device is a quantitative IVD assay (reagent cartridge), not an AI/ML device requiring expert interpretation of complex clinical data or images. The "ground truth" for this device is the actual concentration of TSH or FT4 in the sample, typically established either by:
- Reference methods (e.g., mass spectrometry, although not explicitly stated as the ground truth method here).
- The predicate device itself (as used in method comparison studies, where the predicate is the "comparison assay").
- Spiking known concentrations into matrices.
4. Adjudication Method for the Test Set
This is not applicable for a quantitative IVD reagent. Adjudication methods (e.g., 2+1, 3+1) are typically used in scenarios where human experts interpret data (like medical images), and their disagreements need to be resolved to establish a definitive ground truth for AI model evaluation.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
This is not applicable. An MRMC study is designed to evaluate the diagnostic performance of human readers, often with and without AI assistance, on a set of cases. This device is a reagent cartridge that provides a quantitative measurement, not an AI that assists human interpretation.
6. Standalone Performance (Algorithm Only Without Human-in-the-Loop)
This is not applicable. This device is a reagent cartridge that runs on an automated system, providing a quantitative result. It's inherently "standalone" in providing the measurement, but it's not an "algorithm only" in the sense of an AI interpreting complex data. The performance metrics listed (precision, accuracy relative to predicate, linearity, etc.) are its "standalone" performance.
7. Type of Ground Truth Used
The "ground truth" for this type of quantitative diagnostic test is based on:
- Comparison to a legally marketed predicate device: The current, FDA-cleared versions of the TSHL and FT4L assays (K081074 and K073604) acted as the "gold standard" or comparison method for the method comparison studies.
- Known concentrations: For linearity, recovery, and interference studies, samples were prepared with known concentrations or spiked with known amounts of analytes or interferents.
- Analytically verified samples: Samples used for precision studies have mean values derived from repeated measurements.
8. Sample Size for the Training Set
This is not applicable as the device is a non-AI/ML IVD reagent cartridge. There is no concept of a "training set" for this type of product. The development and optimization of the reagent formulation are internal processes, but they don't involve "training" a model on a dataset in the AI sense.
9. How Ground Truth for the Training Set Was Established
This is not applicable for the same reason as point 8.
Ask a specific question about this device
(35 days)
Drive Chaska, Minnesota 55318
Re: K240273
Trade/Device Name: Access Free T4 Regulation Number: 21 CFR 862.1695
Free Thyroxine Assay Classification Name: Free Thyroxine Test system Classification Regulation: 21 CFR 862.1695
The Access Free T4 assay is a paramagnetic particle, chemiluminescent immunoassay for the quantitative determination of free thyroxine levels in human serum and plasma (heparin) for the diagnosis and treatment of thyroid diseases using the Access Immunoassay Systems.
Assay type: two-step, competitive
The Access Free T4 assay is a two-step enzyme immunoassay. The Access Free T4 assay consists of the reagent pack and calibrators. Other items needed to run the assay include substrate and wash buffer. The Access Free T4 reagent pack, Access Free T4 calibrators, along with Wash Buffer II are designed for use with the Access Immunoassay Systems in a clinical laboratory setting.
The Access Free T4 contains the following components:
- R1a: Dynabeads paramagnetic particles coated with streptavidin and mouse monoclonal anti-Thyroxine (T4) coupled to biotin; preservative
- R1b: TRIS buffered saline with protein (avian), surfactant, preservative
- R1c: TRIS buffered saline with protein (avian), surfactant, preservative.
- R1d: Triiodothyronine-alkaline phosphatase (bovine) conjugate in a TRIS buffer with protein (avian), surfactant, preservative.
- R1e: TRIS buffer with protein (avian and murine), surfactant, preservative
The provided 510(k) summary describes the Access Free T4 assay, a paramagnetic particle, chemiluminescent immunoassay for the quantitative determination of free thyroxine levels in human serum and plasma for the diagnosis and treatment of thyroid diseases.
Here's an analysis of the acceptance criteria and the study that proves the device meets them:
1. Table of Acceptance Criteria and Reported Device Performance
The document outlines acceptance criteria and performance for several studies.
Study Type | Acceptance Criteria | Reported Device Performance (Access 2 / Dxl 9000 Immunoassay Analyzer) |
---|---|---|
Method Comparison | Slope 1.00 ± 0.12 | Access 2: Slope = 1.02 (95% CI: 1.00 to 1.04), Correlation Coefficient R = 0.98. Dxl 9000: Slope = 1.02 (95% CI: 0.99 to 1.05), Correlation Coefficient R = 0.95. Both devices met the acceptance criteria. |
Imprecision | Within-laboratory (total) %CV ≤ 10.0% for values ≥ 0.61 ng/dL. Within-laboratory (total) SD ≤ 0.06 for concentrations =0.61 ng/dL, but it is below 0.61 ng/dL so the SD criteria applies, and 0.05 is below 0.06).** Samples 2-5 (0.92 - 4.3 ng/dL): Within-lab %CVs 3.3% - 4.6%. Overall, the performance seems to meet the criteria for all samples based on the appropriate metric (SD for low conc., %CV for high conc). | |
Linearity | The assay should demonstrate linearity across the measuring interval. Measuring interval: Access 2: 0.40 - 6.0 ng/dL. Dxl 9000: 0.32 - 6.0 ng/dL. | A study based on CLSI EP06-Ed2 determined the assay demonstrated linearity across the measuring interval for both instruments. No specific quantitative linearity results (e.g., coefficient of determination) are provided, but the statement indicates success. |
Analytical Specificity | Change in concentration between control and test sample within specified percentages for various cross-reactants (e.g., D-T4: ≤ 100%, L-T3: ≤ 2%, R-T3: ≤ 25%, etc.). | Testing was performed with various cross-reactants. The document states that the observed changes in concentration met the specified acceptance criteria for all tested cross-reactants. |
Interference (Common Substances) | Change in concentration between the control sample and the test sample within ± 10%. | No significant interference (± 10%) was observed for any of the tested interferents at their highest concentrations (e.g., Albumin, Aspirin, Bilirubin, Biotin, Hemoglobin, Lipemia, Methimazole, Phenylbutazone, Phenytoin, Prealbumin, Sodium Salicylate, Thiouracil, Thyroxine Binding Globulin). The specific concentration for Biotin interference was 3,510 ng/mL (previous device was ≤ 10 ng/mL). |
Sample Type Comparison | For determination of equivalency between sample types, the results should be comparable. (Implied acceptance based on Passing-Bablok regression with slopes near 1 and tight CIs). | Access 2 (Serum vs. LiHep Plasma): N=41, Estimate (slope) = 0.99, 95% CI: 0.94 - 1.04. Dxl 9000 (Serum vs. LiHep Plasma): N=43, Estimate (slope) = 0.97, 95% CI: 0.93 - 0.99. Both instruments show good agreement between serum and lithium heparin plasma. |
Detection Capability (LoB, LoD, LoQ) | Established limits based on CLSI guideline EP17-A2. | Access 2: LoB = 0.25 ng/dL, LoD = 0.40 ng/dL, LoQ = 0.40 ng/dL. Dxl 9000: LoB = 0.25 ng/dL, LoD = 0.32 ng/dL, LoQ = 0.32 ng/dL. The results are listed and implicitly met the established limits for these studies. |
2. Sample Sizes Used for the Test Set and Data Provenance
- Method Comparison: 163 serum samples.
- Imprecision: For both Access 2 and Dxl 9000, "multiple samples" were tested in duplicate in 2 runs per day for a minimum of 20 days. The tables show
N
values per sample type ranging from 80 to 84 (representing the total number of replicates over the 20+ days for each control sample). - Detection Capability (LoB, LoD, LoQ): "multiple reagent lots and 3 instruments over a minimum of 3 days" (LoB) and "multiple reagent lots and 3 instruments over a minimum of 5 days" (LoD, LoQ).
- Analytical Specificity: Serum samples with two concentrations of Free T4, tested in replicates of six each.
- Interference: Patient serum samples with two levels of Free T4, with 6 to 12 replicates tested for each control sample preparation.
- Sample Type Comparison: A minimum of 40 matched sets of patient samples were tested with each reagent lot. Specifically, 41 matched sets for Access 2 and 43 matched sets for Dxl 9000 were reported.
Data Provenance: The document does not explicitly state the country of origin of the data or whether it was retrospective or prospective. Given the nature of these clinical performance studies for an in vitro diagnostic device, it is typically prospective, involving controlled testing in a laboratory setting with collected human samples. The term "patient serum samples" suggests human origin.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications
This section is not applicable to this type of device (in vitro diagnostic immunoassay). The "ground truth" for an immunoassay is typically established by reference methods, consensus methods, or highly characterized samples, not by expert human readers. The predicate device itself acts as a comparative standard in the method comparison study.
4. Adjudication Method
This section is not applicable for this type of device as there is no human interpretation or decision-making process that would require adjudication.
5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study
This section is not applicable. This device is an automated immunoassay for quantitative determination of a specific analyte (Free T4), not an imaging or diagnostic device that relies on human reader interpretation (with or without AI assistance).
6. Standalone Performance Study
Yes, the entire set of studies described (Method Comparison, Imprecision, Detection Capability, Linearity, Analytical Specificity, Interference, Sample Type Comparison) represents the standalone performance of the modified device (Access Free T4) when compared against the predicate device or a recognized standard (e.g., CLSI guidelines). The performance metrics provided (slope, correlation, %CV, SD, LoB, LoD, LoQ, % difference) quantify the device's accuracy, precision, and analytical characteristics.
7. Type of Ground Truth Used
The ground truth used depends on the specific study:
- Method Comparison: The predicate device (Access Free T4 Assay on the Access Immunoassay Analyzer K982250) served as the comparator or "reference" for comparison.
- Imprecision: Statistical methods define precision (e.g., repeatability, within-laboratory variability). Control samples with known target values serve as the basis for evaluation.
- Detection Capability (LoB, LoD, LoQ): These limits are derived statistically from measurements of blank samples and low-concentration samples.
- Linearity: Expected proportional changes in analyte concentration across a range are compared against measured values.
- Analytical Specificity and Interference: Known concentrations of potential cross-reactants or interferents added to serum samples with known Free T4 levels are used. The "true" effect is zero interference/cross-reactivity.
- Sample Type Comparison: Matched patient samples (serum vs. plasma) are used, where the expectation is agreement between results from different sample matrices from the same individual.
8. Sample Size for the Training Set
This document only describes analytical performance studies for the modified device. There is no mention of a "training set" in the context of machine learning or AI, as this is an immunoassay device, not a software algorithm that would typically undergo a training phase.
9. How the Ground Truth for the Training Set Was Established
This is not applicable as there is no training set for this type of device.
Ask a specific question about this device
(414 days)
Road Indianapolis, IN 46250
Re: K220456
Trade/Device Name: Elecsys FT4 IV Regulation Number: 21 CFR 862.1695
Assay for the in vitro quantitative determination of free thyroxine in human serum and plasma. Measurements obtained by this device are used in the diagnosis and treatment of thyroid disease.
The electrochemiluminescence immunoasay "ECLIA" is intended for use on the cobas e immunoassay analyzers.
The Elecsys FT4 IV immunoassay a fourth generation FT4 assay by Roche Diagnostics for the for the in vitro quantitative determination of free thyroxine in human serum and plasma. It is intended for use on the cobas e immunoassay analyzers. The cobas e family of analyzers uses electrochemiluminescence immunoassay "ECLIA" technology. The assay is an 18 minute assay utilizing a competition principle using a monoclonal antibody which is specifically directed against free thyroxine. Results are determined via a calibration curve which is instrument specifically generated by 2-point calibration against the master curve for that reagent lot.
Here's a breakdown of the acceptance criteria and the study information for the Elecsys FT4 IV device, based on the provided text:
Device: Elecsys FT4 IV (Free Thyroxine Test System)
Predicate Device: Elecsys FT4 II (K131244)
Intended Use: In vitro quantitative determination of free thyroxine in human serum and plasma for the diagnosis and treatment of thyroid disease, intended for use on cobas e immunoassay analyzers.
1. Table of Acceptance Criteria and Reported Device Performance
The document does not explicitly state acceptance criteria in a dedicated section with specific numerical targets for each performance characteristic. Instead, it generally states that "All predefined acceptance criteria was met" for various studies. The reported device performance is presented as the results obtained from these studies.
Performance Characteristic | Reported Device Performance (Elecsys FT4 IV) | Acceptance Criteria (Implicitly Met) |
---|---|---|
Precision | "All predefined acceptance criteria was met" (CLSI guideline EP05-A3) | |
- Repeatability (CV%) | Ranges from 0.9% to 2.5% | Implicitly met for various sample levels |
- Intermediate Precision (CV%) | Ranges from 1.7% to 5.6% | Implicitly met for various sample levels |
Lot-to-lot Reproducibility | Not explicitly stated, but "All predefined acceptance criteria was met" | Implicitly met (CLSI guideline EP05-A3) |
Analytical Sensitivity | "All predefined acceptance criteria was met" (CLSI EP17-A2) | |
- Limit of Blank (LoB) | 0.02 ng/dL (0.3 pmol/L) | Implicitly met, this is the derived claim |
- Limit of Detection (LoD) | 0.04 ng/dL (0.5 pmol/L) | Implicitly met, this is the derived claim |
- Limit of Quantitation (LoQ) | 0.101 ng/dL (1.3 pmol/L) at ≤ 20% intermediate precision CV | Implicitly met, this is the derived claim |
Linearity/Assay Reportable Range | Linear in range 0.098-8.13 ng/dL (1.26-105 pmol/L) | Implicitly met (CLSI EP6-Ed2) |
- Measuring Range Claim | 0.101-7.77 ng/dL (1.3-100 pmol/L) | Implicitly met based on linearity study |
Human Anti-Mouse Antibodies (HAMA) | Not Applicable (No mouse antibodies used) | N/A |
Endogenous Interferences | No significant interference for tested compounds (e.g., Bilirubin ≤ 701 µmol/L, Biotin ≤ 1200 ng/mL) | "All predefined acceptance criteria was met" (CLSI guideline EP07-A3) |
Analytical Specificity/Cross-Reactivity | Low cross-reactivity for tested compounds (e.g., L-T3 0.005%, D-T3 0.003%, rT3 0.002%) | Implicitly met |
Exogenous Interferences (Drugs) | No significant interference for most common therapeutic drugs, specific drugs like Furosemide, Carbamazepine, Phenytoin, and Levothyroxine Sodium (L-T4) caused elevated FT4 findings at daily therapeutic dosage levels. | Implicitly met for non-interfering drugs; specific findings reported for interfering drugs. |
Sample Matrix Comparison | Serum, Li-Heparin, K2-EDTA, K3-EDTA plasma are acceptable sample types | "All predefined acceptance criteria was met" |
Method Comparison to Predicate | Passing Bablok: y = 1.03x - 0.025, τ = 0.967; Linear Regression: y = 1.04x - 0.034, r = 0.999 | Implicitly demonstrates substantial equivalence to predicate |
Reagent Stability After First Opening | Up to 84 days when stored at 2-8°C | Implicitly met |
Reagent On-board Stability | Up to 28 days | Implicitly met |
Lot Calibration Stability | Recommended every 28 days | Implicitly met |
On-board Calibration Stability | Up to 7 days without new calibration | Implicitly met |
Expected Values/Reference Range | 0.92 – 1.68 ng/dL (11.9 – 21.6 pmol/L) (95% CI of 2.5th: 0.81-0.96 ng/dL; 95% CI of 97.5th: 1.51-2.00 ng/dL) | Established from 150 healthy subjects in the United States |
2. Sample Sizes Used for the Test Set and Data Provenance
- Precision (Repeatability and Intermediate Precision): Not explicitly stated how many individual samples were used, but multiple human serum samples (6 levels) and PreciControl Universal (2 levels) were tested. Repeatability involved measurements over 21 days and 5 days, and intermediate precision over the same periods.
- Analytical Sensitivity (LoB, LoD, LoQ):
- LoB: One blank sample (fT4 depleted human serum pool) with ten replicates per run, across six runs over three or more days, evaluated on three reagent lots. Total 60 determinations.
- LoD: Five low-level human serum sample pools (diluted) with two replicates/sample/run, across six runs over three or more days, evaluated on three reagent lots.
- LoQ: At least five low-level samples of serum, with five replicates per sample per run, over five days (total 25 replicates/sample/reagent lot), evaluated on three reagent lots.
- Linearity/Assay Reportable Range: Three individual human serum samples (spiked and diluted) to prepare dilution series. 13 concentrations (levels) prepared. Samples assayed in 4-fold determination within a single run.
- Endogenous Interferences: Nine endogenous substances evaluated. The number of samples for each is not specified, but typically involves spiked samples.
- Analytical Specificity/Cross-Reactivity: For each potential cross-reacting compound, two human serum samples (low and slightly elevated FT4 levels) were tested.
- Exogenous Interferences (Drugs): 17 commonly and 15 specially used pharmaceutical compounds evaluated.
- Sample Matrix Comparison: Samples (native single donors and pools, spiked or diluted) drawn into Serum, Li-Heparin, K2-and K3-EDTA plasma primary tubes. Number of samples not specified.
- Method Comparison to Predicate: 121 serum samples.
- Expected Values/Reference Range: 150 apparently healthy subjects.
Data Provenance:
- For the Expected Values/Reference Range study, serum samples were obtained from a "commercial vendor" and collected from "health donors in the United States."
- Other studies generally refer to "human serum" or "human serum samples," implying samples of human origin, but specific countries or retrospective/prospective nature are not typically detailed for non-clinical analytical performance studies unless relevant regulations require it. However, given the context of an IVD, these are typically laboratory-based studies using banked or ethically sourced human samples.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
The document describes the analytical performance of an in vitro diagnostic (IVD) device. For IVDs measuring biomarkers, "ground truth" is typically established by existing reference methods, certified reference materials, or by the analytical properties of the substance itself (e.g., purified analytes for spiking studies).
- No human experts are explicitly mentioned for establishing ground truth in the context of diagnostic accuracy for the Elecsys FT4 IV performance studies.
- The ground truth for FT4 concentration in samples used for studies like linearity or precision is based on the expected concentration determined by established laboratory practices, dilutions, spiking with known amounts of analyte, or values obtained from a reference method (in the case of method comparison).
- For the Reference Range study, classification as "apparently healthy subjects" serves as a form of ground truth for establishing normal ranges, likely based on medical history or screening criteria provided by the commercial vendor.
4. Adjudication Method for the Test Set
Not applicable. This is an IVD device measuring a quantitative biomarker (free thyroxine). The performance evaluation involves analytical studies (precision, sensitivity, linearity, interference, method comparison) against established analytical standards or a predicate device, not subjective interpretation requiring adjudication by multiple readers or experts.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done
No. MRMC studies are typically performed for imaging devices or devices that involve human interpretation of results. This is an IVD that quantitatively measures a biomarker on an automated analyzer. The device produces a numerical result, and there is no "human reader" in the loop for interpreting device output in a way that would necessitate an MRMC study for comparative effectiveness.
6. If a Standalone (Algorithm Only Without Human-in-the-Loop Performance) Was Done
Yes, this is implicitly a standalone study for the device. The entire non-clinical performance evaluation (Sections 4.1 to 4.11) describes the performance of the Elecsys FT4 IV assay itself, as an automated system on the cobas e immunoassay analyzers. It evaluates the device's accuracy, precision, sensitivity, and resistance to interference independently of human interpretation of the final numerical result. The output (a quantitative FT4 value) is what clinicians interpret in the context of a patient's condition.
7. The Type of Ground Truth Used
The ground truth used for these analytical studies primarily consists of:
- Reference Standards/Known Concentrations: For studies like linearity, analytical sensitivity (LoB, LoD, LoQ), and interference, samples are often prepared with known concentrations of the analyte or interfering substances.
- Comparative Reference Method: For the method comparison study, the predicate device (Elecsys FT4 II) served as the reference for comparison.
- Absence of Analyte: For LoB determination, "fT4 depleted human serum sample pool" was used.
- Clinically Defined Status: For the reference range study, samples from "apparently healthy donors" were used to establish normal values, which serves as a clinical ground truth for a non-diseased population.
8. The Sample Size for the Training Set
The document describes an analytical device and its performance verification, not a machine learning or AI algorithm in the traditional sense that requires a "training set" for model development.
- The Elecsys FT4 IV assay is an ECLIA (Electrochemiluminescence Immunoassay), which is a chemical reaction-based detection system, not a software algorithm that learns from data like an AI model.
- Therefore, the concept of a "training set" in the context of AI/ML is not directly applicable here. The device's calibration curve is generated using calibrators ("CalSet FT4 IV") against a master curve for each reagent lot, which is a standard procedure for IVDs, not an AI training process.
9. How the Ground Truth for the Training Set Was Established
As explained in point 8, the concept of a "training set" for an AI/ML model is not applicable to an immunoassay device like the Elecsys FT4 IV.
The "ground truth" for the device's operational parameters (like its calibration curve) is established through:
- Reference materials/calibrators: The device uses specific "CalSet FT4 IV" calibrators. These calibrators have assigned values traceable to reference methods or primary standards, which serve as the "ground truth" for calibrating the device for quantitative measurements.
- Master curve: The instrument-specific calibration curve is generated by 2-point calibration against a master curve for that reagent lot. The master curve itself is derived from extensive characterization of a reagent lot using precisely known reference materials across the measuring range.
Ask a specific question about this device
(28 days)
Gaithersburg, MD 20877
Re: K182423
Trade/Device Name: MAGLUMI 2000 FT4 Regulation Number: 21 CFR 862.1695
|--------------|----------|-------------------------|----------------------------|
| CEC | 862.1695
The MAGLUMI 2000 FT4 assay is for in vitro diagnostic use in the quantitative determination of free thyroxine (FT4) in human serum. The measurement of FT4 is used in the diagnosis of thyroid disorders.
MAGLUMI 2000 FT4 kit consists of the following reagents: Magnetic Microbeads- coated with T4 antigen, BSA, NaN3(
Here's an analysis of the acceptance criteria and study findings for the MAGLUMI 2000 FT4 device, based on the provided text:
1. Table of Acceptance Criteria and Reported Device Performance
Performance Characteristic | Acceptance Criteria (Implicit) | Reported Device Performance |
---|---|---|
Precision | CV% within acceptable ranges for clinical chemistry devices at various concentrations (implied by CLSI EP5-A2). For example, Total CV for Control 1: ≤ 10% (often a general guideline for low-concentration analytes). Total CV for other controls/samples: ≤ 8-10% depending on concentration. Specific pre-defined limits are not explicitly stated, but the study aims to demonstrate acceptable precision. | Control 1 (1.006 ng/dL): Total CV 9.34% |
Control 2 (1.9952 ng/dL): Total CV 7.81% | ||
Control 3 (3.9978 ng/dL): Total CV 5.04% | ||
Calibrator low (0.3333 ng/dL): Total CV 9.99% | ||
Calibrator high (7.1929 ng/dL): Total CV 3.20% | ||
Serum Pools/Patient Pools: Total CVs ranged from 3.76% to 9.96% | ||
Linearity | The assay should demonstrate linearity across its claimed measuring range (0.19 to 10.0 ng/dL) with a strong correlation (R² close to 1) and a slope close to 1, and y-intercept close to 0 (implied by CLSI EP6-A). | Linear between 0.19 and 10.0 ng/dL. Observed = 0.9898 (Expected) + 0.01364, R² = 0.9991. |
Stability (Reagents) | Reagents, calibrators, and controls should be stable for a specified shelf life at recommended storage conditions (e.g., 2-8°C for 12 months for accelerated stability, confirmed by real-time studies). | Accelerated stability at 37°C showed controls, calibrators, and reagents are stable for 12 months at 2-8°C. Real-time stability is on-going. |
Detection Limit (Limit of Blank - LOB) | 95th percentile value from analyte-free samples (implied by CLSI EP17-A guidelines). A low LOB is desired. | 0.087 ng/dL (highest of 3 lots). |
Detection Limit (Limit of Detection - LOD) | Lowest analyte concentration that can be detected (implied by CLSI EP17-A guidelines). A low LOD is desired. | 0.143 ng/dL (highest of 3 lots). |
Detection Limit (Limit of Quantitation - LOQ) | Lowest analyte concentration that can be reproducibly measured with an intermediate precision CV of ≤ 20% (as per CLSI EP17-A guidelines). | 0.190 ng/dL (highest of 3 lots). |
Interference (Cross-Reactivity) | % Cross-reactivity for relevant compounds should be very low (e.g., 0.95 or >0.98), a slope close to 1, and a y-intercept close to 0 (implied by CLSI EP9-A2). | Y = 1.0451X - 0.05375, R² = 0.9919, for MAGLUMI FT4 (y) vs. ADVIA CENTAUR FT4 (x). |
Note on Acceptance Criteria: The document often refers to CLSI guidelines (e.g., EP5-A2, EP6-A, EP17-A, EP9-A2) for performance characteristic studies. While explicit numerical acceptance criteria are not always stated directly in the text, compliance with these guidelines implicitly means that the performance falls within industry-accepted ranges and statistical thresholds. For precision, for instance, a total CV of less than 10% (or even lower for higher concentrations) is generally considered acceptable in clinical chemistry. For linearity and method comparison, an R² value close to 1, a slope near 1, and a y-intercept near 0 are standard indicators of good performance.
2. Sample Size Used for the Test Set and Data Provenance
- Precision Study:
- Sample Size: 240 measurements per control/calibrator/pool (N=240, from 80 samples analyzed per level on each of 3 instruments).
- Data Provenance: Not explicitly stated, but implied to be patient serum pools and native patient samples from clinical settings, likely domestic to the manufacturer's testing location or within industry standard practices for clinical evaluation. The study was conducted on "three controls, two calibrators, four spiked patient serum pools and four native patient sample pools."
- Linearity Study:
- Sample Size: 11 levels with FT4 concentrations from 0.19 to 10.0 ng/dL, each measured in quadruplicate on 3 lots of reagent.
- Data Provenance: Samples prepared by spiking T4 USP Standard into T4-free human serum samples.
- Detection Limit Study (LOB, LOD, LOQ):
- LOB: 80 measurements of T4 free human serum samples.
- LOD: Four levels of low samples, measured in 80 replicates over 5 days per sample.
- LOQ: Six low serum samples, in six replicates per run, one run per day, over 5 days.
- Data Provenance: T4 free human serum samples and low serum samples.
- Interference Study:
- Sample Size: Variable, involves preparing solutions, and for common drugs/interfering substances, three serum samples (low, medium, high FT4) per substance. For HAMA/RF/Protein, specific FT4 human serum samples.
- Data Provenance: Spiked solutions, human serum pools, FT4 human serum samples.
- Method Comparison Study:
- Sample Size: 224 human serum samples.
- Data Provenance: Human serum samples. Provenance (e.g., country of origin, retrospective/prospective) is not explicitly stated.
- Expected Values/Reference Range Study:
- Sample Size: 157 serum samples.
- Data Provenance: Normal, apparently healthy adult individuals (22 years and older). Provenance (e.g., country of origin, retrospective/prospective) is not explicitly stated.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
This information is not provided in the document. For an in vitro diagnostic device like the MAGLUMI 2000 FT4, the "ground truth" for analytical performance tests (precision, linearity, detection limits, interference) is typically established by the reference methods, spiked concentrations, or consensus values derived from the testing procedures themselves, rather than human expert opinion. For method comparison, the "ground truth" is typically the result from the established predicate device.
4. Adjudication Method for the Test Set
The concept of an "adjudication method" (like 2+1 or 3+1 for clinical diagnoses based on expert review) is not applicable to the analytical performance studies described for this in vitro diagnostic device. The studies rely on quantitative measurements and statistical analysis rather than subjective expert interpretation of results for ground truth.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was done
No, an MRMC comparative effectiveness study was not done. This type of study is primarily relevant for medical imaging devices or other diagnostic tools where human readers interpret results, and the AI system acts as an aid to improve human performance. For an automated in vitro diagnostic assay like MAGLUMI 2000 FT4, the performance is evaluated through direct analytical comparisons.
6. If a Standalone (i.e., algorithm only without human-in-the-loop performance) was done
Yes, the studies described are for the standalone performance of the MAGLUMI 2000 FT4 device. These are analytical performance studies (precision, linearity, detection limits, interference, method comparison) that evaluate the device's ability to accurately and precisely measure FT4 concentrations in human serum, independent of human interpretation or intervention in the measurement process itself. The "algorithm" here refers to the immunoassay's chemistries and the instrument's measurement principles.
7. The Type of Ground Truth Used
- Precision: Reference materials (controls, calibrators) with established target values, and pooled human serum samples where the mean serves as a statistical "ground truth" for variability assessment.
- Linearity: Gravimetrically or volumetrically prepared "expected" concentrations of T4 in serum.
- Stability: Initial measurements at time 0, with subsequent measurements compared to these baselines.
- Detection Limits (LOB, LOD, LOQ): Analyte-free samples and low-concentration samples measured extensively to statistically determine minimum detectable/quantifiable levels.
- Interference (Cross-Reactivity): Known amounts of potential interferents/cross-reactants.
- Method Comparison: Results from the predicate device (Siemens ADVIA Centaur FT4) are used as the reference "ground truth" for comparison.
- Expected Values/Reference Range: Statistical distribution of measurements from a reference population (157 apparently healthy adults) to establish a normal range.
8. The Sample Size for the Training Set
This information is not provided. The document describes performance characteristic studies (test set), but does not detail the development or "training" specific to an AI algorithm for this device. For in vitro diagnostics, "training set" would typically refer to samples used during the assay development and optimization phase to establish parameters like calibration curves, reagent formulations, or instrument settings. The document focuses on validation studies.
9. How the Ground Truth for the Training Set Was Established
This information is not provided as a "training set" isn't explicitly defined in the context of this submission. If referring to the development/optimization phase of a typical immunoassay, ground truth would be established through:
- Reference materials: Highly purified FT4 standards.
- Known concentrations: Samples prepared with precise, known concentrations of FT4.
- Comparative data: Initial comparisons against existing reference methods or well-characterized assays during the research and development phase.
- Clinical samples: Use of a large number of clinical samples to optimize the assay's dynamic range and specificity.
Ask a specific question about this device
(132 days)
Road Indianapolis, IN 46250
Re: K181233
Trade/Device Name: Elecsvs FT4 III Regulation Number: 21 CFR 862.1695
Assay for the in vitro quantitative determination of free thyroxine in human serum and plasma. Measurements obtained by this device are used in the diagnosis and treatment of thyroid disease.
The electrochemiluminescence immunoassay "ECLIA" is intended for use on the cobas e 411 immunoassay analyzer.
The Elecsys FT4 III immunoassay is used for the in vitro quantitative determination of free thyroxine in human serum and plasma. It is intended for use on the cobas e 411 immunoassay analyzer. The cobas e family of analyzers uses electrochemiluminescence immunoassay "ECLIA" technology. The reagent working solutions include: Rackpack (kit placed on analyzer) M: Streptavidin-coated microparticles. R1: Anti-T4-Abbiotin. R2: Anti-T4-AbRu(bpy).
This document describes the premarket notification for the Elecsys FT4 III device, an immunoassay for the quantitative determination of free thyroxine. The submission details the non-clinical performance evaluation of the device as it relates to its substantial equivalence to a predicate device.
Here's an analysis of the provided information regarding acceptance criteria and supporting studies:
1. Table of Acceptance Criteria and Reported Device Performance
The document provides a comparative assay table between the Elecsys FT4 II (predicate device) and Elecsys FT4 III (candidate device), and also presents detailed studies for various performance characteristics. I will synthesize the acceptance criteria (often implied by the "same" or within a stated range compared to the predicate, or by specific criteria in the study descriptions) and the reported performance as presented in the document.
Feature | Acceptance Criteria (Implied/Stated) | Reported Device Performance (Elecsys FT4 III) |
---|---|---|
Intended Use | Same as predicate. | "Assay for the in vitro quantitative determination of free thyroxine in human serum and plasma. Measurements obtained by this device are used in the diagnosis and treatment of thyroid disease. The electrochemiluminescence immunoassay "ECLIA" is intended for use on the cobas e 411 immunoassay analyzer." (Matches predicate, with specified analyzer). |
Detection Method | Same as predicate. | Same: Electrochemiluminescent Assay. |
Applications/Test Time | Same as predicate. | Same: 18 minute application. |
Sample/Reagent Ratio | Same as predicate. | Same: 15 µL. |
Sample Type/Matrix | Same as predicate. | Same: Serum and plasma (Li-heparin, K2-EDTA, K3-EDTA). |
Calibration Method | Same as predicate. | Same: 2-point calibration. |
Calibration Interval | Same as predicate. | Same: Once per reagent lot (renewed after 1 month/28 days when using same lot, after 7 days using same kit, or as required by QC). |
Controls | Same as predicate. | Same: PreciControl Universal. |
Reagent Stability | Same as predicate: Unopened up to expiration date; After opening at 2-8°C, 84 days (12 weeks); On the analyzers, 28 days (4 weeks); or 56 days (8 weeks) with alternative storage, total onboard not exceeding 120 hours. | Studies executed to support these claims. Study 1: "Reagent stability after first opening at 2-8°C (84 days)"; Study 2: "On board reagent stability (28 days)"; Study 3: "Alternate storage in the refrigerator / on the analyzer (56 days, not exceeding 120 hours onboard)". Real-time stability study ongoing for shelf-life claim. Performance for early time-points supports stability. |
Measuring Range | Same as predicate. | Same: 0.101-7.77 ng/dL (1.3-100 pmol/L). |
LoB (Limit of Blank) | Values within acceptable limits compared to predicate (predicate: 0.03 ng/dL). | Reported: 0.02 ng/dL (0.3 pmol/L). (Better than predicate). |
LoD (Limit of Detection) | Values within acceptable limits compared to predicate (predicate: 0.05 ng/dL). | Reported: 0.04 ng/dL (0.5 pmol/L). (Better than predicate). |
LoO (Limit of Quantitation) | Same as predicate. | Same: 0.101 ng/dL (1.3 pmol/L). |
**Analytical Specificity | ||
(Cross-reactivity)** | Cross-reactivity percentages for various substances should be comparable to or better than the predicate. | The reported cross-reactivity percentages for Elecsys FT4 III are identical to those of the predicate device for L-T3, D-T3, rT3, 3-iodo-L-tyrosine, 3,5-diiodo-L-tyrosine, 3,3',5-triiodothyracetic acid, and 3,3',5,5'-tetraiodothyroacetic acid, at similar tested concentrations. |
Endogenous Interferences | Criterion: Recovery of ≤ ± 0.6 pmol/L of initial value ≤ 6 pmol/L and ± 10 % of initial value > 6 pmol/L for Bilirubin, Hemoglobin, Intralipid, Rheumatoid factors, IgG, IgA, IgM. Biotin interference should be improved compared to the predicate, with a specific threshold. | Bilirubin: ≤ 701 µmol/L or ≤ 41 mg/dL; Hemoglobin: ≤ 0.621 mmol/L or ≤ 1000 mg/dL; Intralipid: ≤ 2000 mg/dL; Rheumatoid factors: ≤ 1200 IU/mL; IgG: ≤ 7 g/dL; IgA: ≤ 1.6 g/dL; IgM: ≤ 1 g/dL. |
Biotin Interference: Specimens with biotin concentrations up to 104 ng/mL demonstrated ≤ 10% bias. Biotin concentrations > 104 ng/mL can lead to higher positive bias. This is an improvement from predicate's limitation of "> 5 mg/day" and "until at least 8 hours following the last biotin administration". The new device quantifies the effect and states "Do not test samples from patients who take biotin." | ||
Special Thyroid Drugs | No interference with the assay found for specified drugs at given concentrations. Criterion: Recovery within ± 10 % of initial value. | No interference was found for Iodide, Carbimazole, Thiamazole, Propylthiouracil, Perchlorate, Propranolol, Amiodarone, Prednisolone, Hydrocortisone, Flurocortolone, Octreotide at the specified concentrations. (These are the same drugs and concentrations as the predicate, and "no interference" finding is consistent). |
Precision | Meeting CLSI guideline EP05-A3 for repeatability and intermediate precision. | Precision was evaluated on a single cobas e 411 analyzer according to CLSI guideline EP05-A3. Results supporting this are expected to be presented in the full study report (not explicitly tabularized here, but stated that it was evaluated). |
Sample Matrix Comparison | Acceptable comparison values between serum and various plasma types. | Comparison study performed between Serum, K2-EDTA plasma, and K3-EDTA plasma. Findings indicate acceptable performance, as it is presented as a completed study without issues. |
Method Comparison to Predicate | "Assess the bias" between the two assays. Substantial equivalence to predicate expected. | 141 serum samples were measured. FT4 values ranged between 2.31 and 91.9 pmol/L for the Reference Method (X). The study aims to support substantial equivalence. (Specific statistics on bias are not provided in this summary, but the study execution implies findings were acceptable). |
Lot Calibration Stability | Calibration of a reagent lot is recommended every 28 days (4 weeks), and fresh reagent kits of the same lot can use the Day 0 calibration curve within this period. Recovery of samples within acceptable limits after 36 days using a new kit of the same lot with Day 0 calibration curve. | Elecsys FT4 III was calibrated with a fresh reagent kit on day 0. After 36 days a new reagent kit of the same lot was used and recovery of samples was determined using the calibration curve of day 0. (The study implies acceptable recovery, as "Lot calibration study" is listed as successfully completed). |
On-board Cal Stability | Reagent kits can be stored on board for up to 7 days without a new calibration. | Samples measured on Day 0 and Day 8 (on-board condition) using the Day 0 calibration curve. (The study implies acceptable results, as "On-board Calibration Stability" is listed as successfully completed). |
2. Sample Size Used for the Test Set and the Data Provenance
-
Precision (Repeatability and Intermediate Precision):
- Sample Size: Two levels of control (PreciControl Universal) and five human sera (native single donors, native serum pools, spiked serum pools). Tested in 2 replicates per run, 2 runs per day for 21 days.
- Data Provenance: Not explicitly stated, but "human sera" and "single donors" suggest human samples, likely retrospective pools or prospectively collected for the study. No country of origin mentioned.
-
Endogenous Interferences (Biotin):
- Sample Size: Three human serum samples containing low, mid, and high concentrations of fT4, tested in accordance with CLSI EP07-A2.
- Data Provenance: Human serum samples (single donors, native as well as spiked).
-
Sample Matrix Comparison:
- Sample Size: 53 serum/plasma pairs per sample material (serum, K2-EDTA plasma, K3-EDTA plasma).
- Data Provenance: Native single donors and pools, as well as spiked samples; human.
-
Method Comparison to Predicate:
- Sample Size: 141 serum samples (138 native human serum samples and 3 spiked sample pools).
- Data Provenance: Human serum samples.
-
Reagent Stability (After First Opening, On-board, Alternate Storage):
- Sample Size: Six human serum (HS) samples (native single donor serum, native serum pools, and spiked serum pools) and two controls (PreciControl Universal). Tested in duplicate (for "first opening" and "alternate storage") or two-fold determination (for "on-board").
- Data Provenance: Human serum samples.
-
Reagent Real-time Stability:
- Sample Size: Three human serum samples.
- Data Provenance: Human serum samples (stored at -20°C).
-
Calibration Stability (Lot and On-board):
- Sample Size: Five human serum (HS) samples (single donor serum, native serum pools, and spiked serum pools) and two controls (PreciControl Universal). Tested in two-fold determination.
- Data Provenance: Human serum samples.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and the Qualifications of Those Experts
This document describes the performance evaluation of an in vitro diagnostic (IVD) device, specifically an immunoassay for free thyroxine. For such devices, "ground truth" is typically established by reference methods, consensus values for control materials, or by testing against well-characterized clinical samples in comparison to an established predicate device. It is not generally established by a panel of human experts reviewing individual cases in the same way an imaging AI algorithm might be.
Therefore, the concept of "number of experts used to establish the ground truth" as it applies to imaging or clinical decision support AI is not applicable here. The ground truth for these types of studies is based on biochemical measurements and analytical comparisons.
4. Adjudication Method for the Test Set
Again, for an IVD device evaluating a biomarker, traditional "adjudication" by human experts in the context of clinical interpretation of individual cases (e.g., 2+1, 3+1 for discordances in imaging) is not applicable. The performance is adjudicated against analytical measurements and statistical criteria defined in CLSI guidelines and internal validation plans.
5. If a Multi Reader Multi Case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance
Not Applicable. This is an in vitro diagnostic device (immunoassay), not an AI-powered clinical decision support or imaging interpretation tool that "assists human readers." Therefore, no MRMC study with human readers improving with AI assistance would be performed.
6. If a standalone (i.e. algorithm only without human-in-the-loop performance) was done
The Elecsys FT4 III is an automated immunoassay intended for use on the cobas e 411 analyzer. Its performance is inherently "standalone" in the sense that the analyzer and reagents perform the measurement autonomously. A human laboratory professional loads samples and reagents, and reviews the results. The studies described (precision, interference, method comparison) are all evaluating the standalone analytical performance of this device.
Therefore, yes, the studies described inherently evaluate the "standalone" performance of the algorithm/device system in quantifying free thyroxine in samples.
7. The Type of Ground Truth Used
The ground truth for the various studies is established through:
- Reference methods / Predicate Device: For method comparison, the Elecsys FT4 II assay (K131244) served as the reference method ("X").
- Known concentrations: For interference studies (e.g., biotin, cross-reactivity), samples were spiked with known concentrations of interfering substances or analytes.
- Industry Standards / Guidelines: Adherence to CLSI guidelines (e.g., EP05-A3 for precision, EP07-A2 for interferences) implies that the ground truth for acceptability criteria is based on established laboratory practice and industry consensus for robust analytical performance.
- Controls and Calibrators: PreciControl Universal and CalSet FT4 III are used, which have established target values and ranges for quality control and calibration, respectively.
8. The Sample Size for the Training Set
This document describes the performance validation of a commercially manufactured in vitro diagnostic device, not the development of a machine learning algorithm. Therefore, the concept of a "training set" in the context of AI model development is not directly applicable here.
The device's underlying principles are based on established electrochemiluminescence immunoassay (ECLIA) technology and biochemically derived reagent formulations. While there is a development process that optimizes reagents and assay parameters, this is not typically referred to as "training" in the AI sense, nor does the document provide details about such a "training set" if it were analogous.
9. How the Ground Truth for the Training Set Was Established
Given that "training set" as understood in AI/ML model development is not directly applicable to this IVD device's validation data, this question is not applicable in the context of the provided document. The "ground truth" for the device's development (the underlying chemical and physical principles on which it operates) would have been established through extensive biochemical research, assay development, and optimization processes, rather than a discrete "training set" with established ground truth labels in the AI sense.
Ask a specific question about this device
(28 days)
ROAD ABBOTT PARK IL 60064
Re: K173122
Trade/Device Name: ARCHITECT Free T4 Regulation Number: 21 CFR 862.1695
Abbott ARCHITECT Free T4 Common Name: Radioimmunoassay, Free Thyroxine Governing Regulation: 21 CFR 862.1695
The ARCHITECT Free T4 (FT4) assay is a Chemiluminescent Microparticle Immunoassay (CMIA) for the quantitative determination of free thyroxine (Free T4) in human serum and plasma.
ARCHITECT Free T4 (FT4) assay is to be used as an aid in the assessment of thyroid status.
The ARCHITECT Free T4 assay is a two-step immunoassay for the quantitative determination of free thyroxine (Free T4) in human serum and plasma using CMIA technology with flexible assay protocols, referred to as Chemiflex.
In the first step, sample and anti-T4 coated paramagnetic microparticles are combined. Free T4 (unbound) present in the sample binds to the anti-T4 coated microparticles. After washing, Ts acridinium-labeled conjugate is added to create a reaction mixture. Following another wash cycle, pre-trigger and trigger solutions are added to the reaction mixture. The resulting chemiluminescent reaction is measured as relative light units (RLUs). An inverse relationship exists between the amount of Free T4 in the sample and the RLUs detected by the ARCHITECT i System optics.
The provided text describes a 510(k) premarket notification for a modification to the ARCHITECT Free T4 assay, not a study proving device meets acceptance criteria in the context of AI/ML or comparative effectiveness. The device is an in-vitro diagnostic (IVD) test, and the information requested about AI/ML models, human readers, and ground truth types (pathology, outcomes data) is not applicable to this type of device or documentation.
However, I can extract the acceptance criteria and performance data related to the nonclinical performance verification for the modified ARCHITECT Free T4 assay, as presented in the document.
Acceptance Criteria and Reported Device Performance
The device under review is an in-vitro diagnostic (IVD) assay, not an AI/ML device. Therefore, acceptance criteria and performance are based on analytical performance studies. The document states that the "device passed all of the tests based on pre-determined acceptance criteria." While the specific numerical acceptance criteria are not detailed in this summary, the types of tests performed and the general outcome are reported.
Acceptance Criteria Category | Reported Device Performance |
---|---|
Limit of Blank/Detection/Quantitation | Device passed based on pre-determined acceptance criteria. |
Precision (20-Day) | Device passed based on pre-determined acceptance criteria. |
Precision at Limits of Measuring Interval | Device passed based on pre-determined acceptance criteria. |
Accuracy by Correlation | Device passed based on pre-determined acceptance criteria. |
Accelerated Life Testing (ALT) Stability | Device passed based on pre-determined acceptance criteria. |
Reagent On Board Stability | Device passed based on pre-determined acceptance criteria. |
Linearity | Device passed based on pre-determined acceptance criteria. |
1. Sample sized used for the test set and the data provenance:
The document does not specify the exact sample sizes used for each individual nonclinical performance study (Limit of Blank/Detection/Quantitation, Precision, Accuracy, Stability, Linearity). It also does not explicitly state the country of origin of the samples or if they were retrospective or prospective, though for IVD analytical performance, these would typically be control samples and patient samples collected for various analytical evaluations to represent the intended use population.
2. Number of experts used to establish the ground truth for the test set and the qualifications of those experts:
Not applicable. For an immunoassay, the "ground truth" for analytical performance studies is established by the assay's ability to accurately and precisely measure the analyte (Free T4) in samples against known concentrations or reference methods, not by expert interpretation of images or clinical data.
3. Adjudication method (e.g. 2+1, 3+1, none) for the test set:
Not applicable. Adjudication methods like 2+1 are used for expert consensus on subjective interpretations (e.g., radiology reads), which is not relevant for an IVD analytical performance study.
4. If a multi reader multi case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance:
Not applicable. This is an IVD assay, not an AI/ML diagnostic tool involving human readers.
5. If a standalone (i.e., algorithm only without human-in-the-loop performance) was done:
Not applicable. This is an IVD immunoassay, not an algorithm or AI model. Its performance is inherent to the assay chemistry and instrument.
6. The type of ground truth used:
For the analytical performance studies, the "ground truth" is established through:
- Known concentrations: For studies like linearity, limits, and precision, samples with known concentrations of Free T4 would be used.
- Reference methods/comparative methods: For accuracy by correlation, the results of the ARCHITECT Free T4 assay would be compared against a legally marketed predicate device (as referenced, K123379) or a recognized reference method.
7. The sample size for the training set:
Not applicable. This is an immunoassay, not an AI/ML model, so there is no "training set" in the computational sense. The development of the assay reagents and protocols is based on chemical and biological research and optimization.
8. How the ground truth for the training set was established:
Not applicable, as there is no "training set" in the context of an AI/ML model. The development of the assay components (reagents, microparticles, conjugate) and the associated manufacturing process enhancement (reduced microparticle percent solids) are validated through the nonclinical performance studies listed.
Ask a specific question about this device
(85 days)
|
| Regulation Number | 862.1690 | 862.1695
for use (i.e. for use in the diagnosis of thyroid or pituitary disorders), as described in 21 CFR §862.1695
For in vitro diagnostic use with the IMMULITE® 2000 Systems Analyzers — for the quantitative measurement of thyrotropin (TSH) in serum. Measurements of thyroid stimulating hormone produced by the anterior pituitary are used in the diagnosis of thyroid or pituitary disorders.
For in vitro diagnostic use with the IMMULITE® 2000 Systems Analyzers — for the quantitative measurement of non-protein-bound thyroxine (free T4) in serum and heparinized plasma. Measurements of free thyroxine are used in the diagnosis and treatment of thyroid disease.
Not Found
The provided text describes a 510(k) submission for the IMMULITE 2000 Third Generation TSH and IMMULITE 2000 Free T4 assays. The submission primarily focuses on adding pediatric reference intervals to the device's labeling and asserts that no modifications to the device's design or manufacturing processes are required, thus the predicate devices and subject devices are the same. This means that no new device performance studies were conducted for this specific submission, as the changes are limited to labeling updates based on existing assay capabilities and establishing new reference intervals for a specific population.
Therefore, the requested information regarding acceptance criteria and device performance for a new device study is largely not applicable in the context of this 510(k) summary, as it explicitly states that additional analytical performance data is not needed. The document emphasizes that existing performance characteristics continue to apply.
However, I can extract information related to the establishment of the pediatric reference intervals, which is a form of study performed to define normative values for a specific population for the device.
1. Table of Acceptance Criteria and Reported Device Performance
Since this submission is about establishing new reference intervals for an already approved device and not evaluating the device's analytical performance against new acceptance criteria, the "acceptance criteria" here refer to the statistical methodology used for defining these reference intervals. The "reported device performance" is the established reference interval.
Parameter | Acceptance Criteria (Methodology for Reference Interval Establishment, per CLSI EP28-A3c) | Reported Device Performance (Established Pediatric Reference Intervals) |
---|---|---|
IMMULITE 2000 Free T4 | ||
Infants (01 – 23 months) | Robust Symmetric (90% CI for Lower/Upper Limit) | 0.80 – 1.27 ng/dL (10.3 – 16.3 pmol/L) |
Children (02 – 12 years) | Non-Parametric | 0.74 – 1.28 ng/dL (9.5 – 16.5 pmol/L) |
Adolescents (13 – 20 years) | Non-Parametric | 0.75 – 1.27 ng/dL (9.7 – 16.3 pmol/L) |
IMMULITE 2000 Third Gen TSH | ||
Infants (01 – 23 months) | Robust Symmetric after Log Transform (90% CI for Lower/Upper Limit) | 0.83 – 6.5 µIU/mL (mIU/L) |
Children (02 – 12 years) | Non-Parametric | 0.58 – 4.1 µIU/mL (mIU/L) |
Adolescents (13 – 20 years) | Non-Parametric | 0.39 – 4.0 µIU/mL (mIU/L) |
Note: The primary "acceptance criteria" for this submission are that the methods used for establishing the reference intervals conform to CLSI EP28-A3c guidance and that the established pediatric reference intervals fall within the existing analytical measuring capability of the assay.
2. Sample size used for the test set and the data provenance
IMMULITE 2000 Free T4:
- Sample Size: A total of 426 patients were analyzed:
- Infants (01 – 23 months): 81
- Children (02 – 12 years): 197
- Adolescents (13 – 20 years): 148
- Data Provenance: The document does not specify the country of origin of the data. It is prospective testing, as it states "Data from a total of [N] patients... tested with the IMMULITE 2000 Free T4 assay were analyzed to establish the reference intervals."
IMMULITE 2000 Third Generation TSH:
- Sample Size: A total of 433 patients were analyzed:
- Infants (01 – 23 months): 90
- Children (02 – 12 years): 195
- Adolescents (13 – 20 years): 148
- Data Provenance: The document does not specify the country of origin of the data. Similar to Free T4, this appears to be prospective testing based on the phrasing "Data from a total of [N] patients... tested with the IMMULITE 2000 Third Generation TSH assay were analyzed to establish the reference intervals."
3. Number of experts used to establish the ground truth for the test set and the qualifications of those experts
For reference interval studies, the "ground truth" is typically defined by statistical methods applied to a population defined as "healthy" or "normal" for the analyte in question, rather than through expert consensus on individual cases. The document states that the reference intervals were established per the CLSI EP28-A3c guideline: "Defining, Establishing and Verifying Reference Intervals in the Clinical Laboratory." This guideline outlines statistical procedures for determining reference intervals. Therefore, independent experts for case-by-case ground truth establishment are not typically involved in this type of study. The expertise lies in the clinical chemists or statisticians who apply the CLSI guideline.
4. Adjudication method for the test set
Not applicable. Reference interval studies do not typically involve adjudication of individual cases in the way diagnostic accuracy studies do. The process involves identifying a healthy reference population and then statistically determining the range of values for that population.
5. If a multi-reader multi-case (MRMC) comparative effectiveness study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance
Not applicable. This submission is for an in-vitro diagnostic (IVD) assay (a lab test), not an AI-based diagnostic imaging or interpretive device that would involve human readers or AI assistance in that context.
6. If a standalone (i.e. algorithm only without human-in-the-loop performance) was done
Not applicable. This is an IVD assay, which generates quantitative results. The device itself is the "standalone" component in the sense that it performs the measurement. The establishment of reference intervals is for interpreting these quantitative results within a specific population.
7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.)
The "ground truth" for the pediatric reference interval studies relies on the selection of a "healthy" pediatric population (presumably free from known thyroid or pituitary disorders) from whom samples were collected and tested. The reference intervals are then statistically derived from the measurements of these healthy individuals, following CLSI EP28-A3c guidelines. This is a statistical definition of "normal range" within a defined population rather than a case-specific ground truth like pathology or expert consensus for individual diagnoses. The assumption is that the participants in these groups represented a healthy pediatric population for thyroid function.
8. The sample size for the training set
Not applicable in the typical sense of machine learning. The data described (426 for Free T4, 433 for TSH) is used to establish the reference intervals (similar to a development/validation set in traditional statistics), rather than "training" an algorithm that would then be separately tested. The entire dataset is used to directly calculate the intervals.
9. How the ground truth for the training set was established
Not applicable as a "training set." The "ground truth" for the reference interval establishment relies on the statistical methodology (Robust Symmetric, Non-Parametric) applied to a population described as healthy, as per CLSI EP28-A3c. The selection criteria for this "healthy" pediatric population are implied to be part of the study design to appropriately define the "normal" range.
Ask a specific question about this device
(175 days)
RIVER, NJ 08753
Re: K152422
Trade/Device Name: FREND™ Free T4 Test System Regulation Number: 21 CFR 862.1695 |
---|
Generic Name: |
Regulation Number: |
The FREND™ Free T4 Test System is a rapid indirect competitive immunoassay for the quantitation of free thyroxine (FT4) in human serum and lithium heparinized plasma specimens using the FREND™ Free T4 system. Measurements of free thyroxine (FT4) are used in the diagnosis of thyroid disorders. The FREND™ Free T4 Test System is intended for use in clinical laboratories. For in vitro diagnostic use only. The test is not intended for point-of-care facilities.
The FREND™ Free T4 is a rapid fluorescence immunoassay that measures Free T4 in human serum and in lithium heparinized plasma using the FREND™ Free T4 Test System. The FREND™ Free T4 is a single use fluorescence immunoassay designed to quantify the concentration of free thyroxine in serum and lithium heparinized plasma samples, The FREND™ Free T4 test is a two-step competitive immunoassay with gold nanoparticles labeled with T4-specific monoclonal anti-T4- antibody (mouse), T3-BSA labeled with fluorescent nanoparticles, and fluorescence detection by the FREND™ System. The FREND™ Free T4 Test utilizes microfluidic technology and detects immunecomplexes bound to Free T4. A 70ul Sample is first incubated during Step 1 for five minutes at 37 degrees C in the Free T4 Gold AB Tube with monoclonal anti-T4 antibody conjugated with gold nanoparticles. In Step 2, 35 ul of the mixture from Step 1 is manually loaded into the inlet of the cartridge, where it hydrates a T3-BSA fluorescent bead conjugate and migrates along the test strip. During migration the bound Free T4 in the sample and the fluorescent bead conjugates of T3-BSA compete to form antigenantibody complex in the test zone. Unbound T3-BSA fluorescent conjugates flow through and bind to the anti-T4 antibody that is fixed on the surface in the reference zone. Step 2 takes approximately four minutes after which the fluorescent signals in the test and reference zones are measured. Free T4 quantification is based upon the ratio of the intensity of the test and reference zones. A lower ratio of fluorescence is indicative of a higher Free T4 concentration, in other words, the maqnitude of the fluorescence ratio is inversely proportional to the amount of Free T4 in the sample. The free T4 detection range of the FREND™ Free T4 Test System is 0.4 to 6.0 ng/dL. Results are determined via a lot-specific calibration curve which is generated by the manufacturer using a six-point calibration determined from values averaged from five replicates at each level. The established curve is uploaded to the FREND™ System via the Free T4 Code-chip and is valid until the lot expiration date. The established curve is saved in the code-chip and valid until the expiration date of the test cartridge lot. The FREND™ Free T4 Test cartridge is a disposable plastic device that houses the reagents and contains a port or opening (inlet) where the sample is applied. Once the sample is applied, it will mix with the reagents and travel towards the detection area via capillary action. The FREND™ System is a portable, automated FREND™ cartridge reader. The FREND™ System is based on quantitative immunoassay technology capable of quantifying single or multiple analytes by measuring laser-induced fluorescence in a single-use disposable reagent cartridge. The FREND™ cartridge utilizes micro-fluidics lateral flow technology where the analyte of interest in the sample forms immune complexes while moving through the fluidics pathway in the cartridge. The concentration of the analyte of interest in an unknown sample is calculated using the ratio of the fluorescent intensity of the test zone and the reference zone. The FREND™ System is a bench top fluorescence reader containing a touch screen user interface. The System has a slot that accepts the FREND™ Free T4 Test Cartridge (which contains the reagents and sample), and is programmed to analyze the Test when the sample has fully reacted with the on-board in-cartridge reagents. Results of the test are displayed on the screen and can be printed on an optional printer. The FREND™ System software controls the graphical user interface, communication with hardware, database manaqement and data analysis. The software also controls the functions of the mechanical components including the motor, laser, printer control and acquisition of data from the sensor. The user can set the time and date and enter patient ID through the graphic user interface. The user cannot make any changes to the software.
Here's a summary of the acceptance criteria and study details for the FREND™ Free T4 Test System, based on the provided 510(k) submission:
1. Table of Acceptance Criteria and Reported Device Performance
The acceptance criteria are not explicitly stated as pass/fail values in many sections, but rather implied by the statistical analyses meeting CLSI guidelines and demonstrating comparability to the predicate device. The performance data is presented against these implicit standards.
Performance Metric | Acceptance Criteria (Implied) | Reported Device Performance |
---|---|---|
Precision | Repeatability, Between-run, Between-day, Within-laboratory CV% within acceptable limits for a diagnostic assay. | Sample Pool 1 (0.917 ng/dL): Repeatability CV% 7.3, Within-lab CV% 8.1 |
(Single lot imprecision) | (Based on CLSI EP5-A3 protocol) | Sample Pool 2 (1.850 ng/dL): Repeatability CV% 5.6, Within-lab CV% 6.7 |
Sample Pool 3 (3.979 ng/dL): Repeatability CV% 4.7, Within-lab CV% 6.5 | ||
Linearity/Reportable Range | Acceptable linearity across the measuring range. | Linear across 0.11 ~ 7.5 ng/dL (Slope = 0.978, Intercept = -0.0881, R² = 0.9938). Measuring range (0.4 ~ 6 ng/dL) is within this linear range. |
Detection Limit (LoD) | LoD to support the stated measuring range. | 0.32 ng/dL |
Limit of Quantitation (LoQ) | LoQ to support the stated measuring range. | 0.36 ng/dL |
Interference Studies | Recovery between 90% to 110% of expected Free T4. | All tested endogenous substances and pharmaceuticals showed recovery within 90-110%, except for a few instances (e.g., Biotin 94%, Hydrocortisone 90.1% at high Free T4, Iodide 86.8% at high Free T4). RF, HAMA, Avidin, Au-nanoparticles also within range. |
Cross-Reactivity | No significant cross-reactivity with related substances, except for L-Thyroxine itself. | No significant cross-reactivity (below 0.03%) with tested substances other than L-Thyroxine (T4), which showed 99.44-99.57%. |
Method Comparison with Predicate Device | Difference between test device and predicate device concentrations less than allowable difference; good correlation. | N=358 samples. Slope: 1.010 (95% CI: 0.992 to 1.028), y-Intercept: 0.057 (95% CI: 0.021 to 0.094), r: 0.986. Range tested: 0.43 ~ 5.99 ng/dL. Concluded to compare favorably. |
Matrix Comparison | Comparability between serum and lithium heparin plasma. | N=48 paired samples. Passing-Bablok regression: Slope: 1.017 (95% CI: 0.991 to 1.044), y-Intercept: -0.008 (95% CI: -0.055 to 0.0451). Range tested: 0.44 ~ 5.63 ng/dL. Indicated measurement equally well in both matrices. |
2. Sample Size Used for the Test Set and Data Provenance
- Precision Study: 3 serum pools
- Linearity Study: 1 serum base pool diluted to 11 levels
- Interference Studies: Two levels of free T4 for various endogenous substances and pharmaceuticals.
- Cross-Reactivity Study: Two concentrations for various cross-reactants.
- Method Comparison: 358 samples
- Matrix Comparison: 48 paired serum and lithium heparin samples
- Expected Values/Reference Range: 196 normal, apparently healthy adult individuals
The specific country of origin for the test set data is not explicitly stated, but the precision, linearity, interference, and matrix comparison studies were performed at the NanoEnTek laboratory. The method comparison study was performed at a CLIA-certified laboratory testing facility. The clinical samples for the method comparison were evaluated at that CLIA laboratory. The reference range study used samples from 196 normal, apparently healthy adult individuals. The nature of "clinical samples" for the method comparison implies they would be patient samples, and "normal, apparently healthy adult individuals" for the reference range are also patient-like samples. The studies appear to be prospective in nature, as they involve testing the device with specific protocols (e.g., CLSI guidelines).
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications
- None stated for the analytical performance studies. The ground truth for analytical studies (precision, linearity, LoD, LoQ, interference, cross-reactivity) is generally based on the inherent properties of the samples and the performance of established reference methods or spiked concentrations.
- For Method Comparison: The ground truth was established by a predicate device, the Abbott ARCHITECT Free T4 Assay. The number of human experts involved in interpreting results from either the predicate device or the FREND™ Free T4 Test System is not specified, as this is a quantitative immunoassay where results are read by a machine.
4. Adjudication Method for the Test Set
Not applicable for this type of quantitative immunoassay performance study. Ground truth is established by reference methods or defined concentrations, not human adjudication.
5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done, and Effect Size
No, a multi-reader multi-case (MRMC) comparative effectiveness study was not done. This device is a quantitative immunoassay, not an imaging device requiring human interpretation, so MRMC studies are not relevant.
6. If a Standalone (i.e., algorithm only without human-in-the-loop performance) Was Done
Yes, the studies described are standalone performance studies for the FREND™ Free T4 Test System. The device is intended for use in clinical laboratories, and the results are quantitatively measured by the FREND™ System directly. There is no human-in-the-loop performance evaluation described beyond standard laboratory procedures for operating the instrument and handling samples.
7. The Type of Ground Truth Used
- Analytical Performance:
- Precision, Linearity, LoD, LoQ, Interference, Cross-reactivity: Ground truth was established by prepared samples with known concentrations, spiking experiments, and measurements against established laboratory protocols and standards (CLSI guidelines).
- Method Comparison: Ground truth was established by the predicate device, the Abbott ARCHITECT Free T4 Assay (K123379). This is a comparison against an existing, legally marketed device.
- Matrix Comparison: Ground truth (or comparative standard) derived from measuring the same analyte in different matrices (serum vs. plasma), with the expectation of comparable results.
8. The Sample Size for the Training Set
The document does not explicitly mention a "training set" in the context of an AI/machine learning algorithm. For this immunoassay, the system's "training" involves the manufacturer-generated lot-specific calibration curve (six-point calibration determined from values averaged from five replicates at each level), which is then uploaded to the FREND™ System via a Code-chip.
9. How the Ground Truth for the Training Set Was Established
As described above, the "ground truth" for the device's operational curve (its calibration) is established by:
- Internal standards prepared according to CLSI C45-A Measurement of Free Thyroid Hormones.
- Gravimetric methods based on L-Thyroxine (Sigma T1775, Cell culture grade).
- Confirmation of calibrator Free T4 levels by measurement on the ARCHITECT i free T4 assay (K123379).
This process ensures that the device's internal calibration accurately reflects known Free T4 concentrations.
Ask a specific question about this device
(705 days)
CRAPONNE 69290 FRANCE
June 8, 2015
Re: K132058
Trade/Device Name: VIDAS® FT4 Regulation Number: 21 CFR 862.1695
|
| Classification Name: | 21 CFR 862.1695
VIDAS® FT4 is an automated quantitative enzyme immunoassay for use on the instruments of the VIDAS® family, for the determination of free thyroxine (FT4) in human serum or plasma (lithium heparin), using the ELFA technique (Enzyme Linked Fluorescent Assay). Measurement of Free Thyroxin is intended for use as an aid in the diagnosis and treatment monitoring of thyroid disorders.
VIDAS® FT4 is an automated quantitative enzyme immunoassay for use on the instruments of the VIDAS® family, for the determination of free thyroxine (FT4) in human serum or plasma (lithium heparin), using the ELFA technique (Enzyme Linked Fluorescent Assay). Measurement of free thyroxine is intended for use as an aid in the diagnosis and treatment monitoring of thyroid disorders.
The assay principle combines a one-step enzyme immunoassay competition method with a final fluorescent detection (ELFA).
The Solid Phase Receptacle (SPR®) serves as the solid phase as well as the pipetting device for the assay. Reagents for the assay are ready-to-use and pre-dispensed in the sealed reagent strips. All of the assay steps are performed automatically by the instrument. The reaction medium is cvcled in and out of the SPR several times.
The sample is collected and transferred into the well containing an alkaline phosphataselabeled anti-T4 antibody (conjugate). The antiqen present in the sample and the T4 antigen coated on the interior of the SPR compete for the available sites on the specific anti-T4 antibody conjugated to alkaline phosphatase.
During the final detection step, the substrate (4-Methyl-umbellifery) phosphate) is cvcled in and out of the SPR. The conjugate enzyme catalyzes the hydrolysis of this substrate into a fluorescent product (4-Methyl-umbelliferone) the fluorescence of which is measured at 450 nm. The intensity of the fluorescence is inversely proportional to the concentration of free thyroxine present in the sample. At the end of the assay, results are automatically calculated by the instrument in relation to the calibration curve stored in memory, and then printed out.
The provided text describes the performance of the VIDAS® FT4 device, an automated quantitative enzyme immunoassay for free thyroxine. However, it does not explicitly state "acceptance criteria" as a separate, pre-defined set of thresholds. Instead, it presents various performance study results (e.g., precision, linearity, correlation, analytical sensitivity, specificity) which implicitly demonstrate that the device meets the necessary performance expectations for its intended use and for demonstrating substantial equivalence to its predicate device.
1. Table of Acceptance Criteria and Reported Device Performance
As explicit acceptance criteria are not tabulated in the document, I will infer the performance metrics considered crucial from the studies presented. For "Acceptance Criteria," I will describe the objective assessed in each study.
Performance Metric (Implied Acceptance Criteria) | Reported Device Performance (VIDAS® FT4) |
---|---|
Analytical Specificity | |
Interference with Bilirubin | No significant interference up to 22.5 mg/dL. |
Interference with Lipids | No significant interference up to 750 mg/dL. |
Interference with Hemoglobin | No significant interference up to 500 mg/dL. |
Interference with HAMA | No significant interference up to 0.2 mg/dL. |
Interference with Albumin | No significant interference up to 6300 mg/dL. |
Interference with Drugs | No significant interference observed up to specified concentrations for various drugs (e.g., Acetylsalicylic Acid: 32.61 mg/dL, Amiodarone: 0.58 mg/dL, Carbamazepine: 1.5 mg/dL, etc.). |
Cross-reactivity (Structurally related molecules) | No significant interference observed at specified concentrations for 3,5-diiodotyrosine (16.770 µg/dL, 0.00095% cross-reactivity), 3,5-diodothyronine (27.310 µg/dL, 0.00040% cross-reactivity), and L-triiodothyronine (0.678 µg/dL, 0.02655% cross-reactivity). |
Analytical Sensitivity | |
Limit of Blank (LoB) | 0.02 ng/dL |
Limit of Detection (LoD) | 0.07 ng/dL |
Limit of Quantitation (LoQ) | 0.13 ng/dL (defined as lowest analyte concentration reproducibly measured with a within-laboratory precision CV of ≤ 20%). |
Measurement Range | 0.13 ng/dL to 6.61 ng/dL. |
Linearity | Linear over the whole measurement range (0.13 ng/dL to 6.61 ng/dL) on both VIDAS® and miniVIDAS® instruments. |
Matrix Comparison | High correlation coefficients (r > 0.99) and regression analyses (slopes close to 1, intercepts close to 0) indicating equivalence between different blood collection tube types (plastic tube with clot activator, plastic tube with separation gel, plastic tube with lithium heparin, plastic tube with lithium heparin and separation gel) and the reference silicone-coated glass tube. E.g., Plastic tube with clot activator: Y = 0.98X + 0.01, r = 0.999. |
Precision | VIDAS® instruments: |
Repeatability CVs ranged from 2.3% to 6.3%. | |
Reproducibility CVs ranged from 5.1% to 13.4%. | |
miniVIDAS® instruments: | |
Repeatability CVs ranged from 2.0% to 11.2%. | |
Reproducibility CVs ranged from 2.8% to 15.4%. | |
Correlation | VIDAS®: Slope 1.03 (0.99-1.06 CI), Intercept -0.02 (-0.13-0.05 CI), Correlation coefficient (r) 0.988 (0.980-0.993 CI) when compared to a commercially available Free T4 EIA. |
miniVIDAS®: Slope 1.04 (1.00-1.08 CI), Intercept 0.01 (-0.06-0.12 CI), Correlation coefficient (r) 0.987 (0.977-0.992 CI) when compared to a commercially available Free T4 EIA. | |
Reference Range / Expected Values | Established reference interval: 0.77 - 1.51 ng/dL (2.5th and 97.5th percentiles). 90% CI for lower limit: 0.70 - 0.79 ng/dL. 90% CI for upper limit: 1.41 - 1.59 ng/dL. This was determined from a population of 544 healthy subjects. |
2. Sample Sizes Used for the Test Set and Data Provenance
Due to the nature of an in vitro diagnostic (IVD) device, the "test set" is not a singular set of patient cases for a diagnostic decision, but rather refers to various samples used across different analytical and clinical performance studies. The data provenance is generally not explicitly stated as "country of origin" for these types of analytical validation studies but implies laboratory testing within the manufacturer's R&D facilities or contracted labs, primarily related to the device itself.
- Analytical Specificity (Interferences): For common interferences (bilirubin, lipids, hemoglobin, HAMA, albumin), the study involved serum samples at FT4 analyte levels close to the lower and higher limits of the euthyroid range. The number of samples for each interference type is not specified beyond "samples." For drugs, the number of samples is not specified. For structurally related molecules, the study involved tested FT4 concentrations of approximately 1.2 ng/dL and 2.5 ng/dL, but the number of samples is not explicitly given. This data is likely generated prospectively in a laboratory setting.
- Analytical Sensitivity (LoB, LoD, LoQ):
- LoB: One blank sample, N=60 measures per lot on each instrument (2 lots on one VIDAS®, 1 lot on one miniVIDAS®).
- LoD: Low concentration samples (number not specified for VIDAS®, 4 samples for miniVIDAS®), N=150 measures per lot on VIDAS®, N=100 measures per lot on miniVIDAS®.
- LoQ: 9 low concentration samples, N=40 measures per sample and per lot on each instrument (2 lots on one VIDAS®, 1 lot on one miniVIDAS®).
- This data is prospectively generated in a laboratory setting.
- Linearity: The study was conducted on the VIDAS® and miniVIDAS® instruments. The number of samples or points tested is not explicitly stated. This data is prospectively generated in a laboratory setting.
- Matrix Comparison: 31 sample sets (each set from one donor during one draw, consisting of a reference tube type and four different blood collection tubes). Samples ranged from 0.13 ng/dL to 6.45 ng/dL. This data is prospectively generated.
- Precision: Panel members covering the measuring range were tested. 6 samples for VIDAS® instruments, and 5 samples for miniVIDAS® instruments. Total N=240 measures for each sample/instrument type. This data is prospectively generated in a laboratory setting.
- Correlation: 54 samples were included in the study for comparison to a commercially available Free T4 EIA. Range tested: VIDAS® (0.41 - 6.42 ng/dL), miniVIDAS® (0.52 - 6.35 ng/dL). This data is prospectively generated.
- Reference Range / Expected Values: 544 apparently healthy subjects. The population characteristics are described (45.5% males, 55.5% female, 83.8% Caucasian, 6.1% African-American, 9.6% Hispanic and 0.6% Asian). The country of origin is not specified but the demographic breakdown suggests a Western population. This is prospective data collection.
3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts
For an IVD device like VIDAS® FT4, "ground truth" is typically established by:
- Using reference materials with assigned values.
- Comparing to a legally marketed predicate device or a well-established, validated method (e.g., mass spectrometry, if applicable, though not explicitly mentioned here as a primary reference method).
- For clinical studies like reference range determination, it relies on clinical assessment of health status (e.g., "apparently healthy subjects").
In this documentation:
- Metrological traceability: The VIDAS® FT4 assay is standardized against the Elecsys® FT4 assay (Roche Diagnostic), which serves as a clinical reference method (predicate device).
- Value assignment procedure for calibrator (S1) and control (C1): Reference Calibrators are assigned values via a method comparison between VIDAS® FT4 and Roche Elecsys FT4.
- Correlation study: The VIDAS® FT4 assay was compared to a commercially available Free T4 EIA (likely the predicate Elecsys® FT4 Assay or similar). This serves as the ground truth for that specific study.
- Reference Range / Expected Values: This ground truth is based on the clinical status of "apparently healthy subjects." There is no mention of "experts" in the sense of clinicians or radiologists establishing ground truth for individual cases. The health status of the subjects would be determined by standard medical screening procedures, presumably overseen by medical professionals (e.g., physicians, lab personnel) but not explicitly detailed as "experts."
Therefore, there isn't a stated number of experts or specific qualifications in the traditional sense of consensus reading for image or pathology-based diagnostics. The ground truth relies on established analytical methods and clinical definitions of health.
4. Adjudication Method for the Test Set
Not applicable in the context of this IVD device's analytical and clinical performance studies as described. Adjudication (e.g., 2+1, 3+1) is typically used for resolving discrepancies in expert interpretations in studies involving subjective assessments (like radiology or pathology image analysis). For quantitative assays, the "ground truth" is determined by reference methods or validated values, not by expert consensus on individual results.
5. If a Multi Reader Multi Case (MRMC) Comparative Effectiveness Study was Done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance
Not applicable. This is an in vitro diagnostic (IVD) device for quantitative measurement of free thyroxine (FT4). It is not an AI-assisted diagnostic imaging or pathology device that involves human readers interpreting cases.
6. If a Standalone (i.e. algorithm only without human-in-the-loop performance) was Done
Yes, the studies presented are all standalone performance evaluations of the VIDAS® FT4 assay system (instrument + reagents + algorithm), which is an automated system designed to provide quantitative results directly. The results are "automatically calculated by the instrument in relation to the calibration curve stored in memory, and then printed out." Human involvement is limited to operating the instrument, performing calibration, and reviewing the generated reports, not to interpreting raw data points or making subjective diagnostic decisions the device is intended to automate.
7. The type of ground truth used (expert consensus, pathology, outcomes data, etc.)
The ground truth used primarily consists of:
- Reference Method Comparison: For several studies, the predicate Elecsys® FT4 assay or another commercially available, validated Free T4 EIA served as the comparative "ground truth" or reference method for demonstrating substantial equivalence.
- Reference Materials: For analytical sensitivity (LoB, LoD, LoQ) and linearity, the ground truth is based on gravimetrically prepared or otherwise certified reference materials with known concentrations.
- Clinical Definition: For the reference range determination, the ground truth for the healthy population was based on subjects deemed "apparently healthy."
8. The sample size for the training set
This document describes the validation of an IVD assay, not a machine learning or AI algorithm in the context of "training data." The assay relies on a pre-defined chemical reaction and detection method, with instrument calibration. Therefore, there is no "training set" in the sense of data used to train a predictive model. The "calibration curve" is established using a set of calibrators with known concentrations, which is a standard procedure for quantitative assays and not an AI training process.
9. How the ground truth for the training set was established
As there is no "training set" for an AI algorithm in this context, this question is not applicable. For the calibration of the assay, the "ground truth" (assigned values for calibrators) is established through reference method comparison with the Elecsys® FT4 assay and potentially other established metrological traceability chains, as described under "Value assignment procedure for calibrator (S1) and control (C1)."
Ask a specific question about this device
(238 days)
TSHL Regulation Number: 21 CFR 862.1695 Regulation Name: Free thyroxine test system Regulatory Class:
|
| Regulation Number | 862.1695
established indications for use in the diagnosis and treatment of thyroid disease), as described in 21 CFR §862.1695
The FT4L method is an in vitro diagnostic test for the quantitative measurement of Free Thyroxine in human serum and plasma on the Dimension® EXL™ integrated chemistry system with LOCI® Module. Measurements of free thyroxine are used in the diagnosis and monitoring of thyroid disease.
The TSHL method is an in vitro diagnostic test for the quantitative measurement of Thyroid Stimulating Hormone (TSH, thyrotropin) in human serum and plasma on the Dimension® EXL™ integrated chemistry system with LOCI® Module. Measurements of TSH are used in the diagnosis and monitoring of thyroid disease.
The Dimension® LOCI Free Thyroxine Flex® reagent cartridge, FT4L and Dimension® LOCI Thyroid Stimulating Hormone Flex® reagent cartridge, TSHL are in vitro diagnostic tests for the quantitative measurement of Free Thyroxine and Thyroid Stimulating Hormone, respectively, in human serum and plasma on the Dimension® EXL™ integrated chemistry system with LOCI® Module. The submission is for the inclusion of pediatric reference intervals to the labeling (Package Inserts) of these assays. No changes were made to the reagents, device design, or manufacturing process.
The provided document is a 510(k) summary for the Siemens Healthcare Diagnostics Dimension® LOCI Free Thyroxine Flex® Reagent Cartridge (FT4L) and Dimension® LOCI Thyroid Stimulating Hormone Flex® Reagent Cartridge (TSHL). The focus of this submission is the addition of pediatric reference intervals to the existing devices, not a primary submission for a new device. Therefore, the information provided primarily pertains to the establishment of these pediatric reference intervals.
Here's an analysis based on the provided text:
1. Table of Acceptance Criteria and Reported Device Performance
The acceptance criteria for the pediatric reference intervals are defined by the statistical methods used to establish them and the resulting intervals. The performance reported is the calculated pediatric reference intervals.
Acceptance Criteria | Reported Device Performance (FT4L) | Reported Device Performance (TSHL) |
---|---|---|
Infant (01-23 months): Reference interval established using robust symmetrical method due to sample size |
Ask a specific question about this device
Page 1 of 3