Search Results

The Safecare Urinary Tract Infection Test is for the qualitative detection of Leukocytes (LEU,white blood cells) and nitrite (NIT) in urine as an aid in the screening of a Urinary Tract infection (UTI). It is intended for over-the-counter home use.

Device Description

Urinary Tract Infection Test is in vitro diagnostic test device for qualitative detection of leukocyte and nitrite in urine. The device is composed of two-color pads aligned on a strip. One pad is employed for testing leukocyte and the other for nitrite by visually reading the color change of the pad and comparing with the corresponding blocks on a color chart. The Safecare Urinary Tract Infection Test are for the qualitative detection of Leukocytes (white blood cells) and nitrite in urine as an aid in the screening of a Urinary Tract infection (UTI). It is intended for over-the-counter home use only.

AI/ML Overview

This document, particularly the "510(k) Summary" section, details the performance characteristics of the Safecare Urinary Tract Infection Test. While it doesn't present a formal "acceptance criteria" table in the typical sense of a pre-defined set of numerical thresholds for device performance, the data provided in the "Lay user Study" serves as the proof that the device meets the necessary performance for its intended use, especially for over-the-counter home use, by demonstrating high agreement with a predicate device.

Here's a breakdown of the requested information based on the provided text:

Table of Acceptance Criteria and Reported Device Performance

The acceptance criteria are implicitly defined by the agreement rates deemed acceptable for an over-the-counter device intended for screening, where high agreement with a physician-used predicate device signifies adequate performance for lay users. The study aimed to demonstrate that lay users could achieve results comparable to healthcare professionals using a predicate device.

Analyte (Color Grade)	Implicit Acceptance Criteria (High Agreement with Predicate)	Reported Device Performance (Agreement with Predicate - Exact Match)
Leukocytes:	High % agreement
+++	(Not explicitly stated, but implies close to 100%)	90.00%
++	(Not explicitly stated, but implies close to 100%)	90.9%
+	(Not explicitly stated, but implies close to 100%)	91.18%
Trace	(Not explicitly stated, but implies close to 100%)	88.89%
- (Negative)	(Not explicitly stated, but implies close to 100%)	100.0%
Nitrite:
Positive	(Not explicitly stated, but implies close to 100%)	100%
Negative	(Not explicitly stated, but implies close to 100%)	100%
Leukocytes:	(Not explicitly stated, but implies close to 100%)	100% (% Agreement +/- Color Block for all grades)

Note: For the Leukocytes, "Agreement (+/- Color Block)" was 100% for all grades, indicating excellent performance when allowing for slight variations in visual interpretation around the color block, which is common in visual assays.

2. Sample Size Used for the Test Set and Data Provenance

Sample Size: 154 lay users with UTI symptoms.
Data Provenance: The document does not explicitly state the country of origin for the lay user study data. It was conducted at "Three (3) sites." Given the applicant is Safecare Biotech (Hangzhou) Co., Ltd. in China, it is plausible the study was conducted in China, though this is not confirmed. The study was prospective, as it involved recruiting participants to test their own urine samples in real-time.

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts

Number of Experts: The ground truth was established by "healthcare professionals" using the predicate device. The exact number of individual healthcare professionals is not specified, but it implies a standard clinical practice setting where trained personnel perform the testing.
Qualifications of Experts: They are referred to as "healthcare professionals." Their specific qualifications (e.g., medical technologists, nurses, physicians) and years of experience are not detailed in this summary.

4. Adjudication Method for the Test Set

The document does not describe a formal adjudication method (e.g., 2+1, 3+1). The ground truth was established by "healthcare professionals using the predicate device." It appears to be a direct comparison between the lay user's result on the Safecare device and the healthcare professional's result on the predicate device, implying the predicate device's result, as performed by a healthcare professional, was taken as the reference.

5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

Not Applicable. This document describes a diagnostic test strip for visual interpretation (Urinary Tract Infection Test). It does not involve AI assistance or a comparison of human reader performance with and without AI. It focuses on the ability of lay users to correctly interpret the visual results compared to healthcare professionals using a predicate device.

6. If a Standalone (i.e., algorithm only without human-in-the-loop performance) was done

Not Applicable. This device is a visual test strip intended for human (lay user) interpretation. There is no automated algorithm or standalone performance without human input.

7. The Type of Ground Truth Used

The ground truth for the lay user study was established by comparison to a legally marketed predicate device (Healgen URS Test Strips, K231045) as interpreted by healthcare professionals. This serves as a clinical reference standard for diagnostic performance in the context of this 510(k) submission.

8. The Sample Size for the Training Set

The document does not explicitly mention a "training set" in the context of a machine learning model, as this is a traditional in-vitro diagnostic test strip. However, if "training set" refers to the data used for internal development, optimization, or early analytical studies, the following information is provided:

Precision and Reproducibility Study: "A total of forty-five (45) assays results on each of eight levels of control were obtained." This involved three (3) clinical sites, two (2) operators per site, three (3) replicate assays over five (5) days.
Analytical Specificity Interference: "Potentially interfering substances were added to negative urine with different leukocyte and nitrite concentrations. These samples were tested with three lots of the Safecare Urinary Tract by three different operators (one operator per lot)."
Assay Cut-off/Sensitivity Study: "Urine samples were spiked to known concentrations of each analyte. These samples were then diluted to the lowest positive concentrations...Each sample was tested in 30 replicates with three (3) different operators."

These analytical studies use controlled samples and protocols to define the device's inherent performance characteristics, prior to the lay user study which demonstrates real-world applicability.

9. How the Ground Truth for the Training Set Was Established

For the analytical studies (which might be considered analogous to a "training/development" phase for traditional IVDs):

Precision and Reproducibility: Ground truth involved preparing "eight levels of control" which are presumably well-characterized, spiked samples with known concentrations.
Analytical Specificity Interference: Ground truth involved "negative urine with different leukocyte and nitrite concentrations" and then spiking them with "potentially interfering substances" at known concentrations.
Assay Cut-off/Sensitivity: Ground truth involved "Urine samples...spiked to known concentrations of each analyte." This means the true concentration of the analytes was known by design. Readings were taken by multiple operators and likely compared against these known concentrations to define the lower limits of detection and appropriate cut-offs.

In summary, the "ground truth" for the analytical and development phases was established through controlled laboratory experiments using precisely prepared and characterized samples with known analyte concentrations.

Ask a Question

Ask a specific question about this device

K Number

K240654

Device Name

SAFECARE Fentanyl Urine Test Cassette; SAFECARE FYL Urine Test Cassette

Manufacturer

Safecare Biotech (Hangzhou) Co., Ltd.

Date Cleared

2024-04-10

(34 days)

Product Code

Regulation Number

Type

Panel

Reference & Predicate Devices

K233417

Predicate For

N/A

Intended Use

SAFECARE Fentanyl Urine Test Cassette is a competitive binding, lateral flow immunochromatographic assay for qualitative detection of fentanyl in human urine at the cutoff concentrations of 1 ng/mL. The test provides only preliminary test results. A more specific alternative chemical method must be used in order to obtain a confirmed analytical result. GC/MS or LC/MS is the preferred confirmatory method. The test is intended for over-the-counter use and for in vitro diagnostic use only.

Device Description

The SAFECARE Fentanyl Urine Tests are immunoassays intended for the qualitative detection of fentanyl in human urine. Each SAFECARE Fentanyl Urine Test device consists of a Test Cassette, a Dropper and a package insert. Each Test Cassette is sealed with sachets of desiccant in an aluminum pouch.

AI/ML Overview

Here's a breakdown of the acceptance criteria and the study details for the SAFECARE Fentanyl Urine Test Cassette, based on the provided FDA 510(k) summary:

1. Table of Acceptance Criteria and Reported Device Performance

The document doesn't explicitly state "acceptance criteria" percentages (e.g., "must achieve X% sensitivity"). Instead, it presents performance data that implies the device met sufficient standards for clearance. The "performance" column below summarizes the key findings.

Category	Description of Performance (Implicit Acceptance Criteria)	Reported Device Performance
Precision	Demonstrate consistent and accurate results across different concentrations relative to the cutoff, ideally showing 100% agreement for concentrations significantly below or above the cutoff, and a mix around the cutoff.	Lot 1: 100% negative for -100%, -75%, -50% cutoff; 98% negative (1 positive) for -25% cutoff; 52% positive/48% negative for cutoff; 100% positive for +25%, +50%, +75%, +100% cutoff. Lot 2: 100% negative for -100%, -75%, -50% cutoff; 98% negative (1 positive) for -25% cutoff; 50% positive/50% negative for cutoff; 100% positive for +25%, +50%, +75%, +100% cutoff. Lot 3: 100% negative for -100%, -75%, -50% cutoff; 96% negative (2 positive) for -25% cutoff; 54% positive/46% negative for cutoff; 100% positive for +25%, +50%, +75%, +100% cutoff.
Stability	Device must remain functional and accurate for a specified shelf life under defined storage conditions.	Devices are stable for 24 months at 36-86°F based on accelerated stability studies.
Interference	No interference from a list of common physiological, pathological, or drug substances at specified concentrations when fentanyl is absent or at known concentrations (50% below/above cutoff).	No interference observed from a comprehensive list of compounds (e.g., Acetaminophen, Albumin, Glucose, Ibuprofen, Nicotine) at 100µg/mL or specified concentrations when fentanyl was at ±50% cutoff levels.
Specificity (Cross-Reactivity)	Minimal to no cross-reactivity with structurally similar compounds or other common opioids/drugs, especially for compounds listed as "no cross-reactivity." For fentanyl analogs, a defined cross-reactivity profile.	Detected various fentanyl analogs with varying cross-reactivity percentages (e.g., Acetyl fentanyl: 100%, Ocfentanil: 40%). Showed no cross-reactivity for a list of over 30 opioid compounds (e.g., Codeine, Morphine, Buprenorphine) tested at 100 µg/mL.
Effect of Urine Specific Gravity & pH	Device performance should not be significantly affected by variations in urine specific gravity (1.000-1.035) or pH (4-9) at critical fentanyl concentrations (50% below and 50% above cutoff).	All results were positive for samples at and above +50% Cut-Off and all negative for samples at and below -50% Cut-Off, regardless of specific gravity or pH within the tested ranges.
Method Comparison (Clinical Samples)	Demonstrated agreement with a confirmatory method (LC/MS) for both negative and positive clinical urine samples across a range of fentanyl concentrations, including those near the cutoff. A low number of discordant results is expected, especially near the cutoff.	Operator 1: - 39/40 (97.5%) agreement for Negative/Low Negative/Near Cutoff Negative. - 38/40 (95%) agreement for Near Cutoff Positive/High Positive. - Discordant (1 positive at 0.985 ng/mL LC/MS; 2 negative at 1.055 and 1.097 ng/mL LC/MS). Operator 2: - 40/40 (100%) agreement for Negative/Low Negative/Near Cutoff Negative. - 39/40 (97.5%) agreement for Near Cutoff Positive/High Positive. - Discordant (2 positive at 0.954 and 0.993 ng/mL LC/MS; 1 negative at 1.055 ng/mL LC/MS). Operator 3: - 40/40 (100%) agreement for Negative/Low Negative/Near Cutoff Negative. - 39/40 (97.5%) agreement for Near Cutoff Positive/High Positive. - Discordant (2 positive at 0.980 and 0.985 ng/mL LC/MS; 1 negative at 1.055 ng/mL LC/MS).
Lay-User Study	High percentage of correct results by lay users when following instructions, particularly for concentrations clearly below or above the cutoff. User instructions should be easily understood.	Correct result percentages: 100% for -100%, -75%, -50% cutoff; 95% for -25% cutoff; 100% for +25%, +50%, +75% cutoff. All lay users indicated instructions were easily followed. Flesch-Kincaid score indicated reading grade level < 7.

2. Sample Sizes Used for the Test Set and Data Provenance

Precision Studies:
- Test Set Size: For each of 3 lots, 50 tests were performed at each of 9 concentrations (-100%, -75%, -50%, -25%, cutoff, +25%, +50%, +75%, +100%). This totals 3 lots * 9 concentrations * 50 tests/concentration = 1350 tests.
- Data Provenance: Samples were prepared by spiking fentanyl into negative samples. The document does not specify the origin of the "negative samples." It states the fentanyl concentration was confirmed by LC/MS.
Interference & Specificity Studies:
- Test Set Size: Not explicitly stated as a number of individual tests, but each interfering substance/cross-reactant was tested using "three batches of each device" with drug-free urine and target drug fentanyl urine (at ±50% cutoff).
- Data Provenance: Samples were prepared by spiking test substances into drug-free urine or urine containing fentanyl.
Effect of Urine Specific Gravity and Urine pH:
- Test Set Size: Not explicitly stated as an exact number of tests, but samples within specific gravity (1.000-1.035) and pH (4-9) ranges were spiked with fentanyl at 50% below and 50% above cutoff. These were then tested using "three lots of device."
- Data Provenance: Samples were prepared by altering the specific gravity/pH and spiking fentanyl into urine.
Method Comparison (Clinical Samples):
- Test Set Size: 80 unaltered clinical samples (40 negative and 40 positive).
- Data Provenance: "Unaltered clinical samples" - implies naturally occurring samples from a clinical population. The country of origin is not specified, but it's generally understood to be within the scope of where such tests would be used.
Lay-user Study:
- Test Set Size: 140 lay persons, each testing one blind-labeled sample. A total of 140 tests were performed. The samples themselves were 20 for each of 7 concentrations (-100%, -75%, -50%, -25%, +25%, +50%, +75% cutoff), so 7 * 20 = 140 unique prepared samples were used.
- Data Provenance: Samples were prepared by spiking fentanyl into "drug free-pooled urine specimens."

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications

Method Comparison (Clinical Samples): The ground truth was established by LC/MS (Liquid Chromatography/Mass Spectrometry), which is a highly accurate and routinely used laboratory method for drug confirmation. This is a scientific instrument, not an "expert" in the human sense. The operators running the LC/MS would be trained laboratory personnel.
Precision, Interference, Specificity, Effect of Urine Specific Gravity & pH, Lay-User Study: For these studies, the "ground truth" was established by spiking known concentrations of fentanyl (or other substances) into urine samples and confirming by LC/MS. This process relies on the accuracy of the LC/MS method and the precision of the spiking process.

4. Adjudication Method for the Test Set

Method Comparison (Clinical Samples): The document states samples were "blind labeled" and compared directly to "LC/MS results." This implies a direct comparison without an explicit adjudication process involving multiple human readers beyond the initial device reading by each of the three operators. Discordant results were simply listed for each operator.
Lay-user Study: Samples were "blind-labeled and randomized." Each participant provided a result, which was then compared to the known LC/MS-confirmed concentration. There was no mention of an adjudication process among lay users or reviewers.

5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

No MRMC comparative effectiveness study was done comparing human performance with and without AI assistance.
This device is a rapid diagnostic test (lateral flow immunoassay), not an AI-powered diagnostic imaging or interpretation system. Therefore, an MRMC study in the context of AI assistance is not applicable. The study involved three operators for the method comparison and 140 lay users for usability, but these were for direct device performance and ease of use, not AI-assisted interpretation.

6. Standalone Performance

Yes, a standalone (algorithm only without human-in-the-loop) performance study was done implicitly. This device is the standalone "algorithm" (the immunoassay itself). The precision, interference, and specificity studies evaluate the inherent performance of the device without a human interpreting a complex image or data set. The human involvement is limited to performing the test procedure and reading a clear positive/negative line, which is a direct output of the device's chemistry.

7. The Type of Ground Truth Used

LC/MS (Liquid Chromatography/Mass Spectrometry): This was the primary method used to establish the ground truth for all studies involving fentanyl concentrations (precision, method comparison, lay-user study, and confirming spiked concentrations for interference/specificity/pH/SG studies). LC/MS is a highly accurate and quantitative analytical chemistry technique considered the "gold standard" for confirming drug concentrations in toxicology.

8. The Sample Size for the Training Set

This document describes performance studies for a finished medical device (an immunoassay rapid test). Immunochromatographic assays do not have a "training set" in the machine learning sense. Their performance is inherent to their design, reagents, and manufacturing process. Therefore, there is no training set mentioned or applicable for this type of device.

9. How the Ground Truth for the Training Set Was Established

As there is no "training set" for this type of device, this question is not applicable. The "ground truth" (LC/MS confirmation) is used for validation of the device's performance, not for training it.

Ask a Question

Ask a specific question about this device

K Number

K200133

Device Name

hCG Urine Test Strip, hCG Urine Test Cassette, hCG Urine Test Midstream

Manufacturer

Safecare Biotech (Hangzhou) Co., Ltd.

Date Cleared

2020-08-13

(205 days)

Product Code

Regulation Number

Type

Panel

Reference & Predicate Devices

k043443

Predicate For

K240242

Intended Use

The hCG Urine Test Strip is a rapid one step assay designed for qualitative detection of human chorionic gonadotropin (hCG) in urine for early detection of pregnancy.

The hCG Urine Test Cassette is a rapid one step assay designed for qualitative detection of human chorionic gonadotropin (hCG) in urine for early detection of pregnancy.

The hCG Urine Test Midstream is a rapid one step assay designed for qualitative detection of human chorionic gonadotropin (hCG) in urine for early detection of pregnancy.

Device Description

The hCG Urine Test will be sold in three formats: cassette, test strip, and midstream. The test strip and midstream kits consist of one test device and a package insert. The cassette kit consists of one test device and a disposable plastic dropper, and a package insert. Each test device contains mouse monoclonal anti-βhCG antibody coated membrane and a pad containing mouse monoclonal anti-α-hCG antibody colloidal gold conjugate. The control antibodies are goat anti-rabbit IgG.

AI/ML Overview

The provided document is an FDA 510(k) summary for a set of hCG Urine Test devices. This document focuses on demonstrating analytical performance and user comprehension/accuracy for an In-Vitro Diagnostic (IVD) device, specifically a pregnancy test, not an AI-powered medical device. Therefore, many of the requested criteria related to AI/MRMC studies, expert ground truth, and training sets are not applicable to this type of device and submission.

However, I can extract the relevant information regarding the acceptance criteria and study proving the device meets these for an IVD device.

Acceptance Criteria and Device Performance for hCG Urine Test Strip, Cassette, and Midstream

This submission primarily focuses on analytical performance characteristics and comparison studies to demonstrate substantial equivalence to a predicate device.

1. Table of Acceptance Criteria and Reported Device Performance

Performance Characteristic	Acceptance Criteria (Implicit from Study Design/FDA Guidance)	Reported Device Performance
Analytical Sensitivity (Detection Limit)	Must accurately detect hCG at 25 mIU/mL (claimed sensitivity, matches predicate).	100% positive detection for samples at 25 mIU/mL, 50 mIU/mL, and 100 mIU/mL across all formats and lots.
Analytical Precision/Reproducibility	Consistent results (positive/negative) across different lots, operators, and days for given hCG concentrations.	100% agreement (positive/negative) for all tested concentrations (0 mIU/mL, 12.5 mIU/mL, 18.75 mIU/mL, 25 mIU/mL, 50 mIU/mL, 100 mIU/mL) across 3 lots, 3 operators, over 10 days for all three formats (strip, cassette, midstream via dip and simulated midstream).
Analytical Specificity (Cross-reactivity)	No interference from closely related hormones (LH, FSH, TSH) at specified concentrations.	No cross-reaction observed with LH at 500 mIU/mL, FSH at 1000 mIU/mL, and TSH at 1000 μIU/mL.
Analytical Specificity (Interfering Substances)	No interference from common exogenous compounds or endogenous substances at specified concentrations.	No interference observed from 14 listed substances (e.g., Acetaminophen, Aspirin, Glucose, Albumin, Bilirubin, etc.) at their specified challenge concentrations for both 0 mIU/mL and 25 mIU/mL hCG samples.
Analytical Specificity (hCG β-core fragment)	No interference from hCG β-core fragment at likely physiological levels.	No interference observed at β-core fragment concentrations up to 1,000,000 pmol/mL.
Analytical Specificity (pH)	Performance not affected by varying urine pH within a physiological range.	No interference observed for urine pH range of 3 to 9.
Analytical Specificity (Specific Gravity)	Performance not affected by varying urine specific gravity within a physiological range.	No interference observed for specific gravity between 1.001-1.039.
High Dose Hook Effect	Device should accurately detect high concentrations of hCG without false negatives (hook effect).	Positive results obtained for HCG concentrations ranging from 5,000 to 850,000 mIU/ml.
Comparison (Professional Use)	High agreement with the predicate device when interpreted by professionals.	100% agreement (53 positive, 67 negative) with the predicate device for all three formats using urine samples from 120 women.
Comparison (Lay User Comprehension/Accuracy)	High agreement between lay user interpretation and professional interpretation (ground truth).	100% conformity between lay user interpretation and professional interpretation for all three formats (strip, cassette, midstream dip, midstream actual) across 360 women.
OTC User Suitability	Untrained users can accurately use and interpret the test.	100% agreement between lay user results and professional laboratory results for masked spiked samples (30 positive at 31.25 mIU/mL, 30 negative at 18.75 mIU/mL) for all formats. Participants rated the test as "very easy" or "easy" to read and interpret.
Shelf-life Stability	Maintain performance characteristics over the claimed shelf life.	Data supports a 30-month shelf life when stored at 4-30℃.

2. Sample Sizes and Data Provenance

Analytical Performance Studies (Precision, Specificity, Hook Effect, pH, Specific Gravity):
- Sample Size: Varies by test. For precision, it's 3 lots x 3 runs/day x 10 days x 6 concentrations = 540 tests per format per operator x 3 operators (though total tests are reported per concentration per lot, e.g., 30 results). For other analytical tests, specific numbers are not always given but imply multiple runs with spiked samples.
- Data Provenance: Fresh urine samples from normal, non-pregnant females, spiked with HCG or interfering substances. The origin of the samples (e.g., country) is not specified but the manufacturer is in Hangzhou, China. The studies are retrospective in the sense that they are laboratory-controlled experiments designed to test specific analytical characteristics.
Comparison Studies (Professional Method):
- Sample Size: 120 women's urine samples.
- Data Provenance: Urine samples collected from women at a hospital laboratory. Origin not specified (likely China). Retrospective collection for testing by both devices.
Comparison Studies (Lay User Method):
- Sample Size: 360 women.
- Data Provenance: Urine samples collected from women at a hospital laboratory. Origin not specified (likely China). Retrospective collection.
OTC User Performance (Spiked Masked Samples):
- Sample Size: 30 positive samples (31.25 mIU/mL hCG) and 30 negative samples (18.75 mIU/mL hCG) were tested by lay users for each of the four test methods (strip, cassette, midstream dip, midstream simulated). So, 60 samples per format/method, totaling 240 masked spiked samples.
- Data Provenance: Spiked urine samples prepared in a laboratory setting. This is a prospective experimental design for evaluating lay user performance.

3. Number of Experts and Qualifications for Ground Truth (Test Set)

Analytical Studies: Ground truth for these studies (e.g., specific hCG concentrations, presence/absence of interfering substances) is established by the preparation of the samples themselves in a laboratory setting using traceable reference materials (e.g., WHO 5th IS for hCG). The "experts" here are essentially the laboratory personnel setting up and verifying these conditions. No specific number or qualifications are provided beyond general lab personnel.
Professional Method Comparison: The "predicate device professional" interpretation serves as a comparator, implying professional users (e.g., lab technicians, medical staff) were involved in interpreting the predicate device results. The document does not specify the number of individuals or their specific qualifications.
Lay User Method Comparison & OTC User Performance: The "professional interpretation of the hCG Test Strip/cassette/midstream" (Safecare device professional) appears to serve as the ground truth against which lay user interpretation/performance is compared. This implies that trained professionals used the new device to establish the correct result for each sample before the lay users interpreted their results. The number and qualifications of these "professionals" are not specified.

4. Adjudication Method for the Test Set

For analytical studies, the "ground truth" is defined by the prepared samples (e.g., 25 mIU/mL hCG is positive, 0 mIU/mL is negative). No formal adjudication method is mentioned for reconciling conflicting results, as the expectation is 100% conformity based on the controlled sample preparation.
For comparison studies (professional and lay user), the document states "the agreement of hCG Test Strip/cassette/midstream with the predicate device was 100%" or "the conformity between the user interpretation of the and the professional interpretation... is 100%." This implies that there were no discrepancies that required adjudication, or if there were, they are not reported. The "professional interpretation" acts as the reference standard.

5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

No, an MRMC study was not done. This type of study is primarily relevant for AI-powered medical imaging devices where the AI assists human readers, and the goal is to show improved diagnostic performance with AI assistance. For a simple in-vitro diagnostic test like a pregnancy strip, this framework does not apply.
The comparison studies involved both professional and lay users, but it was to compare the new device's performance against a predicate (professional) or a professional's interpretation of the new device (lay user), not to evaluate the effect of AI assistance on human readers.

6. Standalone (Algorithm-Only) Performance

Yes, in the context of an IVD device, the "standalone" performance is assessed through the analytical performance studies. The device (the test kits) is the "algorithm," and its ability to correctly detect or not detect hCG at various concentrations (sensitivity, specificity, precision, hook effect, etc.) is its standalone performance demonstrated in the "Analytical performance" section. There is no separate digital algorithm being evaluated here.

7. Type of Ground Truth Used

Analytical Studies: Lab-defined ground truth based on precisely prepared samples using traceable standards (e.g., WHO 5th IS for hCG) at specified concentrations. This is a form of spike-in ground truth.
Professional Method Comparison: The results from the predicate device served as the comparator/reference.
Lay User Method Comparison & OTC User Performance: The professional interpretation of the new device's results served as the ground truth.

8. Sample Size for the Training Set

Not applicable. This document describes the validation of a manufactured in-vitro diagnostic test kit, not an AI/Machine Learning model that requires a training set. The device itself is a chemical-based test.

9. How the Ground Truth for the Training Set Was Established

Not applicable. (As above, no training set for an AI/ML model here).

Ask a Question

Ask a specific question about this device

K Number

K191924

Device Name

SAFECARE® THC Urine Strip Test

Manufacturer

Safecare Biotech (Hangzhou) Co., Ltd.

Date Cleared

2019-08-16

(29 days)

Product Code

Regulation Number

Type

Panel

Reference & Predicate Devices

N/A

Predicate For

N/A

Intended Use

SAFECARE® THC Urine Strip Test is a competitive binding, lateral flow immunochromatographic assay for qualitative detection of Marijuana in human urine at the cutoff concentrations of 50 ng/mL.

The test provides only preliminary test results. A more specific alternative chemical must be used in order to obtain a confirmed analytical result. GC/MS or LC/MS is the preferred confirmatory method.

The test is intended for over-the-counter use.

Device Description

SAFECARE® THC Urine Strip Test devices are immunochromatographic assays for the qualitative detection of 11-nor-A9-THC-9 COOH (target analyte) in human urine. The product is a single-use in vitro diagnostic device. It contains a Test Device and a package insert. Each test device is sealed with a desiccant in an aluminum pouch.

AI/ML Overview

The provided document outlines the acceptance criteria and performance data for the SAFECARE® THC Urine Strip Test, which is a qualitative lateral flow immunoassay for detecting Marijuana in human urine.

Here's the breakdown of the requested information:

1. Table of Acceptance Criteria and Reported Device Performance

The document does not explicitly state formal "acceptance criteria" for precision or accuracy using specific numerical thresholds (e.g., >95% agreement). Instead, it presents the results of various performance studies. The overall acceptance is inferred from the device being deemed "substantially equivalent" to a predicate device.

However, based on the performance characteristics, we can infer performance goals for accuracy at different concentrations relative to the cutoff.

Acceptance Criteria Category	Specific Metric (Inferred)	Acceptance Threshold (Inferred/Observed)	Reported Device Performance	Comments on Performance
Analytical Performance
Precision	Agreement at various	High agreement expected, especially
	concentrations	at concentrations further from cutoff.
	-100% Cut-off	100% Negative (0% Positive)	Lot 1, 2, 3: 50-/0+	Meets expectation
	-75% Cut-off	100% Negative (0% Positive)	Lot 1, 2, 3: 50-/0+	Meets expectation
	-50% Cut-off	100% Negative (0% Positive)	Lot 1, 2, 3: 50-/0+	Meets expectation
	-25% Cut-off	100% Negative (0% Positive)	Lot 1, 2, 3: 50-/0+	Meets expectation
	Cut-off (50 ng/mL)	~50% Positive, ~50% Negative	Lot 1, 2: 25-/25+; Lot 3:	Meets expectation
			26-/24+
	+25% Cut-off	100% Positive (0% Negative)	Lot 1, 2, 3: 50+/0-	Meets expectation
	+50% Cut-off	100% Positive (0% Negative)	Lot 1, 2, 3: 50+/0-	Meets expectation
	+75% Cut-off	100% Positive (0% Negative)	Lot 1, 2, 3: 50+/0-	Meets expectation
	+100% Cut-off	100% Positive (0% Negative)	Lot 1, 2, 3: 50+/0-	Meets expectation
Interference	No interference from	No false positives or negatives due to	Compounds at 100µg/mL	No interference reported
	common substances	specified interfering substances when	(list provided) showed no
		drug is absent or present above cutoff	interference.
Specificity	Cross-reactivity with	Low or no cross-reactivity with	Provided specific cross-	Demonstrated specificity
	related compounds	non-target compounds.	reactivity percentages for
			related cannabinoids.
Urine Specific Gravity	Correct results across	Correct classification (negative for	Samples from 1.000 to 1.035	Device performs reliably
	range of SG	-25% cutoff, positive for +25% cutoff).	with THC at +/-25% cutoff	across range of urine SG
			showed expected results.
Urine pH	Correct results across	Correct classification (negative for	Samples from pH 4 to 9 with	Device performs reliably
	range of pH	-25% cutoff, positive for +25% cutoff).	THC at +/-25% cutoff	across range of urine pH
			showed expected results.
Comparison Studies	Agreement with LC/MS	High agreement between device results
	for clinical samples	and LC/MS, particularly for samples
		not near the cutoff.
	Overall Agreement	Not explicitly quantified, but inferred	Viewer A: 78/80 (97.5%)	High agreement for
		from discordant results.	Viewer B: 78/80 (97.5%)	human readers
			Viewer C: 79/80 (98.75%)
Lay-User Study	Ease of use	Instructions are easily understood.	All lay users indicated	Instructions are clear
			instructions were easy to
	Accuracy of self-testing	High percentage of correct results,
		decreasing near the cutoff.
	-100% Cutoff	100% agreement	100% Correct	Excellent
	-75% Cutoff	100% agreement	100% Correct	Excellent
	-50% Cutoff	100% agreement	100% Correct	Excellent
	-25% Cutoff	>90% agreement, with expected	90% Correct	Acceptable, slight dip
		increase in discordance.		near cutoff.
	+25% Cutoff	>90% agreement, with expected	95% Correct	Acceptable, slight dip
		increase in discordance.		near cutoff.
	+50% Cutoff	100% agreement	100% Correct	Excellent
	+75% Cutoff	100% agreement	100% Correct	Excellent

2. Sample Size Used for the Test Set and Data Provenance

Precision Study:
- Sample Size: 9 different concentrations, with 50 tests performed for each concentration per lot (2 runs per day for 25 days). Since 3 lots were tested, this amounts to 9 concentrations * 50 tests/lot * 3 lots = 1350 tests in total for the precision study itself. The samples were prepared by spiking drug in "negative samples." The provenance is not explicitly stated but implies laboratory-prepared samples.
Comparison Studies (Clinical Samples):
- Sample Size: 80 unaltered clinical samples (40 negative and 40 positive).
- Data Provenance: The document states "unaltered clinical samples." The country of origin for these clinical samples is not specified. It is a retrospective analysis as the samples were collected and then tested.
Lay-User Study:
- Sample Size: 140 lay persons tested individual samples. There were 7 concentration levels, with 20 samples per concentration. So, 7 concentrations * 20 samples/concentration = 140 samples in total.
- Data Provenance: The samples were "spiked drug THC into drug free-pooled urine specimens." The country of origin is not specified. It is a prospective study in the sense that the lay users tested the devices with prepared samples, but the samples themselves were laboratory-prepared.

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Their Qualifications

Precision Study: The ground truth was established by laboratory preparation of samples with known concentrations confirmed by LC/MS. No human experts are explicitly mentioned for ground truth.
Comparison Studies (Clinical Samples): The ground truth was established by LC/MS results. "LC/MS is the preferred confirmatory method." The number of LC/MS operators or analysts is not specified, nor are their qualifications.
Lay-User Study: The ground truth for the prepared urine samples was established by LC/MS. The number of LC/MS operators or analysts is not specified, nor are their qualifications.

4. Adjudication Method for the Test Set

Precision Study: Not applicable, as results were quantitative (known concentrations) rather than subjective interpretations needing adjudication.
Comparison Studies (Clinical Samples): The document mentions "three different laboratory assistants" performing the tests. Their results were compared to LC/MS. There is no explicit adjudication method stated for discrepancies between human readers or between human readers and LC/MS. The discordant results table simply lists them.
Lay-User Study: Not explicitly stated. Each lay person provided their own result. Their individual results were then compared to the LC/MS confirmed concentration.

5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study Was Done, and the Effect Size of How Much Human Readers Improve with AI vs without AI Assistance

No, an MRMC comparative effectiveness study was not done.
This device is a standalone diagnostic strip test intended for visual interpretation, not an AI-assisted diagnostic tool. Therefore, there is no AI component or human readers improving with/without AI assistance.

6. If a Standalone (i.e. algorithm only without human-in-the-loop performance) Was Done

Yes, in essence, the "device performance" in all studies represents a standalone performance relative to the ground truth (LC/MS). The device itself (the strip test) produces a visual result. While a human reads the result, the performance studies assess the accuracy of the device's output (presence/absence of a line) against known concentrations.
For the "Comparison Studies," the performance of the "Safecare THC Urine Strip Test" is measured for each viewer by comparing their reading of the device against LC/MS. This measures the combined device-human performance.
For the "Lay-User Study," it also measures the combined device-lay user performance.

7. The Type of Ground Truth Used

For all performance studies (Precision, Comparison, Lay-User), the ground truth was primarily established by LC/MS (Liquid Chromatography-Mass Spectrometry). This is a highly accurate and quantitative laboratory method.
For the precision study, it specifies "THC drug concentration was confirmed by LC/MS."
For the comparison studies, it states "The samples were blind labeled and compared to LC/MS results. LC/MS is the preferred confirmatory method."
For the lay-user study, it states "The concentrations of the samples were confirmed by LC/MS."

8. The Sample Size for the Training Set

Based on the provided document, this is a 510(k) submission for a diagnostic device. Such submissions typically focus on analytical and clinical performance validation rather than explicitly detailing a "training set" for an algorithm.
There is no information provided regarding a "training set" as this is not an AI/machine learning device. The immunoassay device relies on chemical reactions, not on data training.

9. How the Ground Truth for the Training Set Was Established

As there is no mention of a "training set" or an algorithm that requires training, this question is not applicable based on the provided document.

Ask a Question

Ask a specific question about this device

K Number

K182654

Device Name

SAFECARE Multi-Drug Urine Test Cup, SAFECARE Multi-Drug Urine Test Dip Card

Manufacturer

Safecare Biotech (Hangzhou) Co., Ltd.

Date Cleared

2018-11-19

(55 days)

Product Code

NGG,NFT,NFV,NFW,NFY,NGI,NGL,NGM,PTG,PTH,QAW,QBF

Regulation Number

862.3610

Type

Traditional

Panel

Toxicology

Reference & Predicate Devices

k181968,k153646,K142396

Predicate For

K201120

Intended Use

SAFECARE® Multi-Drug Urine Test Dip Card is competitive binding, lateral flow immunochromatographic assays for qualitative and simultaneous detection of Amphetamine, Oxazepam, Marijuana, Methamphetamine, Morphine, Oxycodone, Secobarbital, Buprenorphine, Methylenedioxymethamine, Phencyclidine, Methadone, Nortriotyline and d-Propoxyphene in human urine at the cutoff concentrations of:

Drug(Identifier)	Cut-off level
Amphetamine	1000 ng/mL
Oxazepam	300 ng/mL
Cocaine	300 ng/mL
Marijuana	50 ng/mL
Methamphetamine	1000 ng/mL
Morphine	2000 ng/mL
Oxycodone	100 ng/mL
Secobarbital	300 ng/mL
Buprenorphine	10 ng/mL
Methylenedioxymethamphetamine	500 ng/mL
Phencyclidine	25 ng/mL
Methadone	300 ng/mL
Nortriptyline	1000 ng/mL
d-Propoxyphene	300 ng/mL

Configuration of SAFECARE® Multi-Drug Urine Test Dip Card can consist of any combination of the above listed drug analytes.

The test may yield positive results for the prescription drugs Buprenorphine. Oxazepam, Secobarbital. Propoxyphene and Oxycodone when taken at or above prescribed doses. It is not intended to distinguish between prescription use or abuse of these drugs. Clinical consideration and professional judgment should be exercised with any drug of abuse test result, particularly when the preliminary result is positive. The test provides only preliminary test results. A more specific alternative chemical must be used in order to obtain a confirmed analytical result. GCMS or LC/MS is the preferred confirmatory method.

The tests are intended for over-the-counter use.

SAFECARE® Multi-Drug Urine Test Cup is competitive binding, lateral flow immunichromatographic assays for qualitative and simultaneous detection of Amphetanine, Oxazepam, Methamphetamine, Morghine, Oxycodone, Secobarbital, Buprenorphine, Methylenedioxymethamphetamine, Phencyclidine, Methadone, Nortriptyline and d-Propoxyphene in human urine at the cutoff concentrations of:

Drug(Identifier)	Cut-off level
Amphetamine	1000 ng/mL
Oxazepam	300 ng/mL
Cocaine	300 ng/mL
Marijuana	50 ng/mL
Methamphetamine	1000 ng/mL
Morphine	2000 ng/mL
Oxycodone	100 ng/mL
Secobarbital	300 ng/mL
Buprenorphine	10 ng/mL
Methylenedioxymethamphetamine	500 ng/mL
Phencyclidine	25 ng/mL
Methadone	300 ng/mL
Nortriptyline	1000 ng/mL
d-Propoxyphene	300 ng/mL

Configuration of SAFECARE® Multi-Drug Urine Test Cup contination of the above listed drug analytes.

The test may yield positive results for the prescription drugs Buprenorphine. Oxazenam, Secobarbital, Propoxyphene and Oxycodone when taken at or above prescribed doses. It is not intended to distinguish between prescription use or abuse of these drugs. Clinical consideration and professional judgment should be exercised with any drug of abuse test result, particularly when the preliminary result is positive. The test provides only preliminary test results. A more specific alternative chemical method must be used in order to obtain a confirmed analytical result. GCMS or LC/MS is the preferred confirmatory method.

The tests are intended for over-the-counter use.

Device Description

The SAFECARE Dip Card Tests and SAFECARE Cup Tests are immunochromatographic assays that use a lateral flow system for the qualitative detection of Amphetamine, Oxazepam, Cocaine, Cannabinoids, Methamphetamine, Morphine, Secobarbital, Methadone, Methylenedioxymethamphetamine, Oxycodone, Buprenorphine, Phencyclidine, Nortriptyline and Propoxyphene (target analytes) in human urine. The products are single-use in vitro diagnostic devices, which come in the formats of Dip Cards or Cups. Each test kit contains a Test Device (in one of the two formats), a package insert and a urine cup for sample collection. Each test device is sealed with a desiccant in an aluminum pouch.

AI/ML Overview

The document describes the performance characteristics and studies for the SAFECARE Multi-Drug Urine Test Dip Card and SAFECARE Multi-Drug Urine Test Cup.

Here's an analysis of the acceptance criteria and study that proves the device meets them:

1. Table of Acceptance Criteria and Reported Device Performance

The document does not explicitly state "acceptance criteria" in a separate section. However, the performance data presented implies a very high standard for accuracy, particularly around the established cutoff concentrations. The "precision" studies demonstrate the device's ability to consistently produce correct results for samples at varying concentrations relative to the cutoff. The "comparison studies" compare the device's qualitative results to a quantitative gold standard (LC/MS) to establish concordance. The "lay-user study" assesses the ability of untrained individuals to correctly use and interpret the device.

Based on the provided data, the implied acceptance criteria for qualitative immunoassay devices for drug testing would typically be:

100% agreement for samples significantly below the cutoff (Negative).
100% agreement for samples significantly above the cutoff (Positive).
High percentage agreement for samples near the cutoff (typically >90%).

For the Precision Study (Analytical Performance - "Test Set" Data):

The table below summarizes the reported performance for Methylenedioxymethamphetamine (MDMA), Oxycodone, Buprenorphine, Phencyclidine, Nortriptyline, and Propoxyphene (results from 3 lots, 50 tests per concentration per lot, per device type). The format is (Number of Negative Results)/(Number of Positive Results).

Drug (Identifier)	Cut-off Level	Concentration	Reported Performance (Typical across 3 lots)	Implied Acceptance Criteria Met?
Methylenedioxymethamphetamine (MDMA) (Dip Card & Cup)	500 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	25-/25+ or 26-/24+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)
Oxycodone (Dip Card & Cup)	100 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	24-/26+ or 26-/24+ or 25-/25+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)
Buprenorphine (Dip Card & Cup)	10 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	27-/23+, 25-/25+, 24-/26+, 26-/24+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)
Phencyclidine (Dip Card & Cup)	25 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	26-/24+, 25-/25+, 24-/26+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)
Nortriptyline (Dip Card & Cup)	1000 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	25-/25+, 26-/24+, 24-/26+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)
Propoxyphene (Dip Card & Cup)	300 ng/mL	-100% Cut-off	50-/0+	Yes (100% negative)
		-75% Cut-off	50-/0+	Yes (100% negative)
		-50% Cut-off	50-/0+	Yes (100% negative)
		-25% Cut-off	50-/0+	Yes (100% negative)
		Cut-off	23-/27+, 26-/24+, 24-/26+, 25-/25+	Yes (near 50/50 split at cutoff)
		+25% Cut-off	50+/0-	Yes (100% positive)
		+50% Cut-off	50+/0-	Yes (100% positive)
		+75% Cut-off	50+/0-	Yes (100% positive)
		+100% Cut-off	50+/0-	Yes (100% positive)

For the Lay-user Study:

The acceptance criteria for the lay-user study would typically be a high percentage of correct results, especially for samples clearly negative or positive, and a reasonable proportion around the cutoff. The document shows the "percentage of correct results (%)" for each concentration level. The implied acceptance criteria for OTC urine drug screens are usually very high concordance (e.g., >90%) with the ground truth for samples away from the cutoff, and some level of agreement for near-cutoff samples (where variability is expected). The device performance meets these generally implied high standards, especially for samples clearly below or above the cutoff (100% for most categories). For samples near +/-25% of cutoff, the accuracy ranges from 85-95%, which is typical for qualitative tests and acceptable for over-the-counter use where confirmation is recommended for positive results.

2. Sample Size Used for the Test Set and Data Provenance

Precision Studies (Test Set):
- For each drug, device type (Dip Card/Cup), and each of the 9 concentration levels: 50 tests were performed per concentration.
- This was repeated for 3 different lots for each device type.
- Therefore, for a single drug and device type, the test set size for precision was: 9 concentrations * 50 tests/concentration * 3 lots = 1350 tests.
- Data Provenance: The samples were prepared by "spiking drug in negative samples" and confirmed by LC/MS. This suggests controlled, prospective laboratory-prepared samples rather than retrospective clinical samples from specific countries. The testing was "in-house."
Comparison Studies (Test Set):
- For each drug and device type (Dip Card/Cup): 80 unaltered clinical samples (40 negative and 40 positive) were used.
- Data Provenance: "unaltered clinical samples." The document does not specify the country of origin, but given the manufacturer (China) and submission to US FDA, the samples could be from various global sources or acquired for the study. It's a retrospective analysis against independently confirmed samples.
Lay-user Study (Test Set):
- For each device format (presumably Dip Card and Cup, though only combined results are shown in the given table): 300 lay persons participated.
- The total number of samples processed for each drug across the different concentration levels in the lay-user study tables is: 20 + 20 + 160 + 20 + 20 + 40 + 20 = 300 samples. (It seems 300 samples were tested for each drug type, probably with each of the 300 lay users testing one sample)
- Data Provenance: Urine samples were "prepared at the following concentrations; negative, +/-75%, +/-50%, +/-25% of the cutoff by spiking drugs into drug free-pooled urine specimens." These were laboratory-prepared samples. The study involved "three intended user sites."

3. Number of Experts Used to Establish the Ground Truth for the Test Set and Qualifications of Those Experts

Precision Studies: Ground truth was established by LC/MS (Liquid Chromatography-Mass Spectrometry), which is a definitive quantitative analytical method. It's an instrumental method, so human "experts" in interpretation are typically not specified in the same way as for imaging or clinical diagnosis. The samples were "blindly labeled by the person who prepared the samples," which indicates blinding to the LC/MS results during the device testing.
Comparison Studies: Ground truth was established by LC/MS results. The document states, "The samples were blind labeled and compared to LC/MS results." These are definitive chemical analysis results.
Lay-user Study: Ground truth for the samples was established by LC/MS confirmation of the spiked drug concentrations.

4. Adjudication Method for the Test Set

Precision Studies: No explicit human adjudication method mentioned, as the device results are quantitative categories (positive/negative) and compared to definitive LC/MS values. The results are reported as counts (e.g., 25-/25+).
Comparison Studies: The device results were read by "three laboratory assistants." For discordant results, the individual viewer's result is listed against the LC/MS truth, but no specific human adjudication process (e.g., 2+1 majority voting) is described for resolving discrepancies among the assistants or between the assistants and LC/MS. The LC/MS is the definitive ground truth here.
Lay-user Study: The results are presented as aggregated counts of positive/negative readings by the lay users compared to the known ground truth (LC/MS confirmed concentrations). No human adjudication process is mentioned among the lay users.

5. If a Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study was done, If so, what was the effect size of how much human readers improve with AI vs without AI assistance

No, an MRMC comparative effectiveness study was not done. This type of study is relevant for AI-assisted diagnostic devices where human readers interpret images or complex data, and the AI serves as an aid.
This device is a qualitative, over-the-counter urine drug test. Its performance is evaluated against chemical analytical ground truth, and its usability is assessed with lay users, not by evaluating how it assists expert readers.

6. If a Standalone (i.e. algorithm only without human-in-the loop performance) was done

This device is a physical, qualitative immunoassay test kit, not a standalone algorithm/AI for analysis. The "performance" of the device itself (the reaction on the dip card/cup) is evaluated.
However, the Comparison Studies and Precision Studies could be considered the "standalone" performance of the device when read by trained laboratory personnel (3 laboratory assistants), as it's the device's output itself being compared to LC/MS.
The Lay-user study evaluates the device performance when interpreted by the intended "human-in-the-loop" (a lay user).

7. The Type of Ground Truth Used (expert consensus, pathology, outcomes data, etc.)

The primary ground truth used for all performance studies (Precision, Comparison, Lay-user) is quantitative chemical analysis by LC/MS (Liquid Chromatography-Mass Spectrometry). This is a highly accurate and widely accepted method for confirming drug concentrations in urine.
For the precision and lay-user studies, urine samples were spiked with known concentrations of drugs, and these concentrations were confirmed by LC/MS.
For the comparison studies, "unaltered clinical samples" were used, with LC/MS providing the definitive truth for the presence and concentration of drugs.

8. The Sample Size for the Training Set

This document describes premarket notification (510(k)) for a drug test kit, which is a physical diagnostic device based on immunochromatography, not an AI/Machine Learning algorithm.
Therefore, there is no "training set" in the context of machine learning model development. The device's performance relies on its physical and chemical design, not on a trained algorithm.

9. How the Ground Truth for the Training Set Was Established

As there is no AI/ML algorithm or training set, this question is not applicable.

Ask a Question

Ask a specific question about this device

Page 1 of 1