K Number
K243851
Device Name
CHLOE BLAST
Manufacturer
Date Cleared
2025-08-15

(242 days)

Product Code
Regulation Number
884.6195
Reference & Predicate Devices
Predicate For
N/A
AI/MLSaMDIVD (In Vitro Diagnostic)TherapeuticDiagnosticis PCCP AuthorizedThirdpartyExpeditedreview
Intended Use

CHLOE BLAST is indicated to provide adjunctive information on events occurring during embryo development that may predict further development to the blastocyst stage on Day 5 of development. This adjunctive information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple embryos deemed suitable for transfer or freezing.

CHLOE BLAST is to be used only for the analysis of images captured by the EmbryoScope version D incubator system.

Device Description

CHLOE BLAST is a decision support tool designed to automatically analyze time lapse videos of developing embryos, retrieved from EmbryoScope (version D) Time Lapse Incubators (TLI) system. It is intended to provide adjunctive information on developmental events up to Day 3 that may predict progression to the blastocyst stage by Day 5.

CHLOE BLAST is a cloud-based software as a medical device (SaMD) that uses a convolutional neural network (CNN) to analyze TLI videos from insemination to Day 3. The output is the "CHLOE Score", which is a blastocyst development prediction value associated with the likelihood of the embryo reaching blastocyst stage at Day 5.

This information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple normally fertilized embryos deemed suitable for transfer or freezing. In a clinical setting, the CHLOE score is intended to be used by the embryologist as adjunctive information, to be used only after the embryologists complete their independent morphological assessments based on the lab's standard of care (e.g., Istanbul Consensus Grading).

The main user interaction is via the graphic user interface (GUI) available via Chrome browsers. It includes screens for treatments overview, manual embryo assessment, and score presentation, and integrates with the day-to-day normal operation in IVF clinics using TLI.

AI/ML Overview

Here's a breakdown of the acceptance criteria and the study proving the device meets those criteria, based on the provided FDA clearance letter:

Acceptance Criteria and Device Performance

1. Table of Acceptance Criteria and Reported Device Performance

Note: The document presents acceptance criteria primarily as "AUC lower bound >0.8" for various performance metrics. It also establishes an Odds Ratio (OR) greater than 1 as the primary endpoint for clinical utility.

Metric / TestAcceptance CriterionReported Device PerformanceMeets Criterion?
Non-Clinical Performance - Algorithm Validation
Morphokinetic Events Detection Accuracy (Overall)N/A (Accuracy reported, not AUC)0.82 (95% CI: 0.81, 0.84)N/A
Morphokinetic Events Detection Accuracy (2PNs)N/A (Accuracy reported, not AUC)0.84 (95% CI: 0.83, 0.85)N/A
Morphokinetic Events Detection (Overall AUC)AUC lower bound >0.8N/A (Accuracy reported, not AUC for overall)Yes (Implicitly, as sub-model AUCs are mentioned in relation to this criterion)
Morphokinetic Events Detection (2PNs Sub-model AUC)AUC lower bound >0.80.84 (95% CI: 0.83, 0.85) - This appears to be the accuracy value, not AUC. The text states "Accuracy of the sub-model... was 0.84". However, it immediately follows the criterion "AUC lower bound >0.8 were met." This is a slight inconsistency in the document's reporting. Assuming the 0.84 is indeed AUC, then: YesYes (Assuming 0.84 refers to AUC)
Morphokinetic Events Detection (Sub-groups: Age <35, 41≤)AUC lower bound >0.8Not met (Performance was not consistent, indicating some subgroups might not have met the criterion, though specific AUC values for these subgroups are not provided)No (Stated in text)
Morphokinetic Events Detection (Sub-groups: Underweight, Obese BMI)AUC lower bound >0.8Not met (Performance was not consistent, indicating some subgroups might not have met the criterion, though specific AUC values for these subgroups are not provided)No (Stated in text)
Blast Prediction (Overall AUC)AUC lower bound >0.80.88 (95% CI: 0.86, 0.90)Yes
Blast Prediction (All Subgroups except Obese BMI)AUC lower bound >0.8AUC similar and higher than 0.8Yes
Blast Prediction (Obese BMI Subgroup AUC)AUC lower bound >0.8Not met (However, specific AUC for this subgroup is not provided, only that it "was not met")No (Stated in text)
Blast Prediction (2PN embryos AUC)N/A (Reduction in AUC observed, but no specific criterion for this subgroup)0.81 (95% CI: 0.78, 0.83)N/A (But still > 0.8)
Blast Prediction (Good/Fair embryos AUC)N/A (Reduction in AUC observed, but no specific criterion for this subgroup)0.74 (95% CI: 0.69, 0.78)N/A (Lower than 0.8, but explanation given for clinical study focusing on this subgroup)
Non-Clinical Performance - Reproducibility Test
AUC with Optical AugmentationsAUC lower bound >0.8All AUCs > 0.89, CI lower bound > 0.87Yes
Clinical Performance - Primary Endpoint
Odds Ratio (OR) for Good/Fair Embryos (CHLOE-assisted)OR > 15.67 (95% CI: 4.6, 6.99)Yes
Clinical Performance - Secondary Endpoints (Highlights)
OR for All Embryos (CHLOE-assisted)N/A (Secondary endpoint)8.51 (95% CI: 6.97, 10.38)N/A
Sensitivity (CHLOE-assisted)N/A (Performance measure)0.846N/A
Specificity (CHLOE-assisted)N/A (Performance measure)0.444N/A
PPV (CHLOE-assisted)N/A (Performance measure)0.629N/A
NPV (CHLOE-assisted)N/A (Performance measure)0.721N/A
OR for Individual Embryologists (CHLOE-assisted)OR > 1Improved and > 1 for all embryologistsYes
OR in Subgroups (Age and BMI) (CHLOE-assisted)OR > 1OR > 1 in all subgroups (lower bound of CI > 1 in all but one age and one BMI category)Yes (Mostly)
Subject-level Sensitivity (CHLOE-assisted)N/A (Performance measure)87.50% to 92.86%N/A
Top 2 Embryo Analysis OR (CHLOE-assisted)N/A (Performance measure)10.73 (95% CI: 6.19, 18.60)N/A

Study Details Proving Device Meets Acceptance Criteria

2. Sample Sizes and Data Provenance

  • Non-Clinical Performance (Algorithm Validation):
    • Morphokinetic Events Detection: 1,094 embryos from 143 slides. Collected from two sites: one in the US and one in Norway. The data provenance is retrospective, as it's a "test dataset... entirely independent from the dataset utilized in the CHLOE BLAST clinical study."
    • Blast Prediction: 1,726 embryos from 233 slides. Collected from two sites: one in the US and one in Norway. The data provenance is retrospective.
  • Clinical Performance (CHLOE BLAST Clinical Study):
    • 703 embryos from 59 mothers.
    • Data collected from three different sites located in the United States.
    • Data provenance: Prospective collection for the purpose of this study (described as a "pivotal, multicenter, single arm, observational, prospective assessment study").

3. Number of Experts and Qualifications for Ground Truth

  • Non-Clinical Performance (Algorithm Validation):
    • Morphokinetic Stages and Blast Annotations: Three independent embryologists.
    • Qualifications: "The annotators were not involved in the training or tuning of the model and were blinded to each other's labels." No explicit years of experience are stated for these annotators.
  • Clinical Performance (CHLOE BLAST Clinical Study):
    • Morphology Grading (Assessors): Three embryologists.
    • Qualifications: "blinded to CHLOE information," and performed grading according to SART standards. No explicit years of experience are stated.
    • Clinical Assessment (Panelists): Five independent embryologists.
    • Qualifications: All "in practice during the study period and from a range of geographical areas within the United States." 3 were senior embryologists with over 10 years of clinical embryology experience each, and the other 2 were junior embryologists with less than 3 years of clinical embryology experience.

4. Adjudication Method for the Test Set

  • Non-Clinical Performance (Algorithm Validation):
    • Ground Truth: "Each embryo video was viewed by three independent embryologists who provided their morphokinetic stages and Blast annotations based on the time-lapse videos. The annotators were not involved in the training or tuning of the model and were blinded to each other's labels." It implies a consensus-based approach, but directly states, "The TLI videos were annotated at a frame level with the ground truth of one of the morphokinetic stages and at a video level with blastulation results."
  • Clinical Performance (CHLOE BLAST Clinical Study):
    • Morphology Grading (Assessors): "Then, the following parameters were categorized by majority agreement (at least 2 of 3 Assessors): Severe asymmetry (yes/no), Fragmentation > 25% (yes/no), Number of cells (1 through 8, 9≤)." This is a clear 2 out of 3 (2+1) consensus method for specific parameters.
    • Clinical Assessment (Panelists): No explicit adjudication method is stated for the Panelists' predictions. Each Panelist performed their own independent predictions, and the study analyzed the collective performance as well as individual improvements.

5. Multi-Reader Multi-Case (MRMC) Comparative Effectiveness Study

  • Yes, an MRMC comparative effectiveness study was done as part of the clinical performance study.
  • Effect Size of Human Readers Improvement with AI vs. without AI assistance:
    • The primary endpoint focused on the Odds Ratio (OR) for predicting blastocyst formation in Good/Fair embryos.
    • Without AI assistance (Morphology Only): OR = 3.77 (95% CI: 2.97, 4.79)
    • With AI assistance (Morphology + CHLOE Score): OR = 5.67 (95% CI: 4.6, 6.99)
    • This represents an improvement in the Odds Ratio from 3.77 to 5.67 for the primary endpoint.
    • For all embryos, the OR improved from 6.93 (without CHLOE) to 8.51 (with CHLOE).
    • For subject-level sensitivity, it improved from 80.36%-83.93% (traditional morphology) to 87.50%-92.86% (with CHLOE).
    • For Top 2 Embryo analysis, the OR improved from 3 (without CHLOE) to 10.73 (with CHLOE).

6. Standalone (Algorithm Only without Human-in-the-Loop) Performance

  • Yes, a standalone performance assessment was done as part of the "Non-Clinical Performance – Algorithm Validation" section.
  • The algorithm's performance in predicting blastocyst formation was assessed independently, yielding an AUC of 0.88 (95% CI: 0.86, 0.90). This demonstrates the algorithm's capability on its own.

7. Type of Ground Truth Used

  • For Non-Clinical Performance (Algorithm Validation):
    • Expert Consensus: Morphokinetic stages and blast annotations were established by three independent embryologists.
    • Outcomes Data: The "blastulation results" (blastocyst Yes/No) are actual outcomes.
  • For Clinical Performance (CHLOE BLAST Clinical Study):
    • Expert Consensus: Morphology grading by three "Assessors" with majority agreement (2 out of 3).
    • Outcomes Data: The "actual blastocyst outcome" (Yes/No) which the algorithm and human readers are predicting.

8. Sample Size for the Training Set

  • The document states: "The study dataset included data collected specifically for the purpose of this study according to the predefined inclusion and exclusion criteria and was segregated from algorithm training and verification datasets."
  • "The dataset used for the performance test was entirely independent from the dataset utilized in the CHLOE BLAST clinical study described in section 9, and the clinics that provided data for the performance dataset were not used to collect data for the clinical study."
  • The specific sample size for the training set is NOT PROVIDED in this document. It only clearly states that the various test sets were independent from the training data.

9. How Ground Truth for the Training Set Was Established

  • The document implies that the training data exists and was used to develop the CNN, but it does NOT specify how the ground truth for the training set was established. It only focuses on how ground truth was established for the independent testing and clinical validation sets.
# U.S. Food & Drug Administration FDA Clearance Letter ## Page 1 **U.S. Food & Drug Administration** 10903 New Hampshire Avenue Silver Spring, MD 20993 www.fda.gov **Doc ID # 04017.08.00** August 15, 2025 Fairtility Ltd. ℅ Susan Alpert Official Correspondent SFADC LLC 2425 L Street NW, Apt 307 Washington, District of Columbia 20037 **Re: K243851** Trade/Device Name: CHLOE BLAST Regulation Number: 21 CFR 884.6195 Regulation Name: Assisted Reproduction Embryo Image Assessment System Regulatory Class: II Product Code: PBH Dated: July 13, 2025 Received: July 14, 2025 Dear Susan Alpert: We have reviewed your section 510(k) premarket notification of intent to market the device referenced above and have determined the device is substantially equivalent (for the indications for use stated in the enclosure) to legally marketed predicate devices marketed in interstate commerce prior to May 28, 1976, the enactment date of the Medical Device Amendments, or to devices that have been reclassified in accordance with the provisions of the Federal Food, Drug, and Cosmetic Act (the Act) that do not require approval of a premarket approval application (PMA). You may, therefore, market the device, subject to the general controls provisions of the Act. Although this letter refers to your product as a device, please be aware that some cleared products may instead be combination products. The 510(k) Premarket Notification Database available at https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfpmn/pmn.cfm identifies combination product submissions. The general controls provisions of the Act include requirements for annual registration, listing of devices, good manufacturing practice, labeling, and prohibitions against misbranding and adulteration. Please note: CDRH does not evaluate information related to contract liability warranties. We remind you, however, that device labeling must be truthful and not misleading. If your device is classified (see above) into either class II (Special Controls) or class III (PMA), it may be subject to additional controls. Existing major regulations affecting your device can be found in the Code of Federal Regulations, Title 21, Parts 800 to 898. In addition, FDA may publish further announcements concerning your device in the Federal Register. Additional information about changes that may require a new premarket notification are provided in the FDA guidance documents entitled "Deciding When to Submit a 510(k) for a Change to an Existing Device" ## Page 2 K243851 - Susan Alpert Page 2 (https://www.fda.gov/media/99812/download) and "Deciding When to Submit a 510(k) for a Software Change to an Existing Device" (https://www.fda.gov/media/99785/download). Your device is also subject to, among other requirements, the Quality System (QS) regulation (21 CFR Part 820), which includes, but is not limited to, 21 CFR 820.30, Design controls; 21 CFR 820.90, Nonconforming product; and 21 CFR 820.100, Corrective and preventive action. Please note that regardless of whether a change requires premarket review, the QS regulation requires device manufacturers to review and approve changes to device design and production (21 CFR 820.30 and 21 CFR 820.70) and document changes and approvals in the device master record (21 CFR 820.181). Please be advised that FDA's issuance of a substantial equivalence determination does not mean that FDA has made a determination that your device complies with other requirements of the Act or any Federal statutes and regulations administered by other Federal agencies. You must comply with all the Act's requirements, including, but not limited to: registration and listing (21 CFR Part 807); labeling (21 CFR Part 801); medical device reporting (reporting of medical device-related adverse events) (21 CFR Part 803) for devices or postmarketing safety reporting (21 CFR Part 4, Subpart B) for combination products (see https://www.fda.gov/combination-products/guidance-regulatory-information/postmarketing-safety-reporting-combination-products); good manufacturing practice requirements as set forth in the quality systems (QS) regulation (21 CFR Part 820) for devices or current good manufacturing practices (21 CFR Part 4, Subpart A) for combination products; and, if applicable, the electronic product radiation control provisions (Sections 531-542 of the Act); 21 CFR Parts 1000-1050. All medical devices, including Class I and unclassified devices and combination product device constituent parts are required to be in compliance with the final Unique Device Identification System rule ("UDI Rule"). The UDI Rule requires, among other things, that a device bear a unique device identifier (UDI) on its label and package (21 CFR 801.20(a)) unless an exception or alternative applies (21 CFR 801.20(b)) and that the dates on the device label be formatted in accordance with 21 CFR 801.18. The UDI Rule (21 CFR 830.300(a) and 830.320(b)) also requires that certain information be submitted to the Global Unique Device Identification Database (GUDID) (21 CFR Part 830 Subpart E). For additional information on these requirements, please see the UDI System webpage at https://www.fda.gov/medical-devices/device-advice-comprehensive-regulatory-assistance/unique-device-identification-system-udi-system. Also, please note the regulation entitled, "Misbranding by reference to premarket notification" (21 CFR 807.97). For questions regarding the reporting of adverse events under the MDR regulation (21 CFR Part 803), please go to https://www.fda.gov/medical-devices/medical-device-safety/medical-device-reporting-mdr-how-report-medical-device-problems. For comprehensive regulatory information about medical devices and radiation-emitting products, including information about labeling regulations, please see Device Advice (https://www.fda.gov/medical-devices/device-advice-comprehensive-regulatory-assistance) and CDRH Learn (https://www.fda.gov/training-and-continuing-education/cdrh-learn). Additionally, you may contact the Division of Industry and Consumer Education (DICE) to ask a question about a specific regulatory topic. See ## Page 3 K243851 - Susan Alpert Page 3 the DICE website (https://www.fda.gov/medical-devices/device-advice-comprehensive-regulatory-assistance/contact-us-division-industry-and-consumer-education-dice) for more information or contact DICE by email (DICE@fda.hhs.gov) or phone (1-800-638-2041 or 301-796-7100). Sincerely, **Monica D. Garcia -S** Monica D. Garcia, Ph.D. Assistant Director DHT3B: Division of Reproductive, Gynecology, and Urology Devices OHT3: Office of Gastrorenal, ObGyn, General Hospital, and Urology Devices Office of Product Evaluation and Quality Center for Devices and Radiological Health Enclosure ## Page 4 **DEPARTMENT OF HEALTH AND HUMAN SERVICES** Food and Drug Administration **Form Approved: OMB No. 0910-0120** **Expiration Date: 07/31/2026** **See PRA Statement below.** # Indications for Use **Submission Number (if known)** K243851 **Device Name** CHLOE BLAST **Indications for Use (Describe)** CHLOE BLAST is indicated to provide adjunctive information on events occurring during embryo development that may predict further development to the blastocyst stage on Day 5 of development. This adjunctive information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple embryos deemed suitable for transfer or freezing. CHLOE BLAST is to be used only for the analysis of images captured by the EmbryoScope version D incubator system. **Type of Use (Select one or both, as applicable)** ☒ Prescription Use (Part 21 CFR 801 Subpart D) ☐ Over-The-Counter Use (21 CFR 801 Subpart C) **CONTINUE ON A SEPARATE PAGE IF NEEDED.** This section applies only to requirements of the Paperwork Reduction Act of 1995. ***DO NOT SEND YOUR COMPLETED FORM TO THE PRA STAFF EMAIL ADDRESS BELOW.*** The burden time for this collection of information is estimated to average 79 hours per response, including the time to review instructions, search existing data sources, gather and maintain the data needed and complete and review the collection of information. Send comments regarding this burden estimate or any other aspect of this information collection, including suggestions for reducing this burden, to: Department of Health and Human Services Food and Drug Administration Office of Chief Information Officer Paperwork Reduction Act (PRA) Staff PRAStaff@fda.hhs.gov *"An agency may not conduct or sponsor, and a person is not required to respond to, a collection of information unless it displays a currently valid OMB number."* ## Page 5 # 510k Summary – CHLOE BLAST **K243851** ## 1. Submitter Information Fairtility Ltd. 8 Ha'Melacha St Tel-Aviv 6721508 Israel **Contact Person** Maya Baranes Zeevi maya.baranes@fairtility.com +975-52-4697725 **Correspondent** SFADC LLC 2425 L Street NW, Apt 307 Washington, DC WA 20037 United States 612-202-7019 Dr. Susan Alpert drsusanalpert@gmail.com ## 2. Date Prepared: August 14, 2025 ## 3. Device Information **Device Trade Name:** CHLOE BLAST **Common Name:** Assisted Reproduction Embryo Image Assessment System **Regulatory Class** II **Regulation Number:** 21 CFR 884.6195 **Regulation Name:** Assisted Reproduction Embryo Image Assessment System **Product Code:** PBH (Embryo Image Assessment System, Assisted Reproduction) ## 4. Predicate Device EEVA 2.0, DEN120015, Auxogyn Inc. The predicate device has not been subject to a design-related recall. ## 5. Device Description CHLOE BLAST is a decision support tool designed to automatically analyze time lapse videos of developing embryos, retrieved from EmbryoScope (version D) Time Lapse Incubators (TLI) system. It is intended to provide adjunctive information on developmental events up to Day 3 that may predict progression to the blastocyst stage by Day 5. CHLOE BLAST is a cloud-based software as a medical device (SaMD) that uses a convolutional neural network (CNN) to analyze TLI videos from insemination to Day 3. The output is the ## Page 6 "CHLOE Score", which is a blastocyst development prediction value associated with the likelihood of the embryo reaching blastocyst stage at Day 5. This information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple normally fertilized embryos deemed suitable for transfer or freezing. In a clinical setting, the CHLOE score is intended to be used by the embryologist as adjunctive information, to be used only after the embryologists complete their independent morphological assessments based on the lab's standard of care (e.g., Istanbul Consensus Grading). The main user interaction is via the graphic user interface (GUI) available via Chrome browsers. It includes screens for treatments overview, manual embryo assessment, and score presentation, and integrates with the day-to-day normal operation in IVF clinics using TLI. ## 6. Intended Use/Indications for Use CHLOE BLAST is indicated to provide adjunctive information on events occurring during embryo development that may predict further development to the blastocyst stage on Day 5 of development. This adjunctive information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple embryos deemed suitable for transfer or freezing. CHLOE BLAST is to be used only for the analysis of images captured by the EmbryoScope version D incubator system. ## 7. Comparison of Intended Use and Technological Characteristics of the Subject and Predicate Device ### 7.1. Intended Use | **Subject device (K243851) – CHLOE BLAST** | **Predicate device (DEN120015) – EEVA 2.0** | |---|---| | **Indications for Use** | | | CHLOE BLAST is indicated to provide adjunctive information on events occurring during embryo development that may predict further development to the blastocyst stage on Day 5 of development. This adjunctive information aids in the selection of embryo(s) for transfer on Day 3, when, following morphological assessment, there are multiple embryos deemed suitable for transfer or freezing.<br><br>CHLOE BLAST is to be used only for the analysis of images captured by the EmbryoScope version D incubator system. | The Eeva System is indicated to provide adjunctive information on events occurring during the first two days of development that may predict further development to the blastocyst stage on Day 5 of development. This adjunctive information aids in the selection of embryo(s) for transfer on Day 3 when, following morphological assessment on Day 3, there are multiple embryos deemed suitable for transfer or freezing. The device may also be used to collect additional time-lapse images until Day 5 of development for embryos not selected for transfer, to allow monitoring of continued embryo development. | The intended use of the subject device is the same as the predicate device – both are intended to provide adjunctive information (decision support) to the embryologist/clinician at Day 3 of culture to aid the user in selecting embryos for transfer/freezing during assisted reproduction procedures. ### 7.2. Technological Characteristics ## Page 7 | **Characteristics** | **Subject Device K243851** | **Predicate Device DEN120015** | **Comparison** | |---|---|---|---| | **Manufacturer** | Fairtility | Auxogyn Inc. | N/A | | **Trade name** | CHLOE BLAST | EEVA 2.0 | N/A | | **Device Design** | Software that automatically analyzes and identifies embryo development events from images captured by Embryoscope version D (K111715) for use in a blastocyst prediction model. | Time-lapse system used in conventional incubator with analysis software that automatically identifies embryo development events for use in a blastocyst prediction model. | **Different:** The subject device is a software accessory to be used with the Embryoscope imaging incubator to provide adjunctive information regarding blastocyst quality. The predicate device is a time lapse imaging system to be used in a conventional incubator that utilizes automated image analysis to provide adjunctive blastocyst quality information. These differences do not raise different questions of safety and effectiveness. | | **Algorithm** | Software<br>Cell tracking and event inference used as algorithm inputs. | Software<br>Cell tracking and event inference used as algorithm inputs. | Same | | **Annotation method** | Image analysis software automatically identifies embryo development events and timing. | Image analysis software automatically identifies embryo development events and timing. | Same | The technological characteristics of the subject and predicate device are different – the subject device has different predictive algorithms for blastocyst quality, and different software and annotation functions. However, different questions of safety and effectiveness are not raised by these differences in technological characteristics. ## 8. Summary of Non-Clinical Performance Testing Devices classified under 21 CFR 884.6195 (Assisted Reproduction Embryo Image Assessment System) and product code PBH must address several non-clinical special controls, including software validation, verification, and hazard analysis, an assessment of light exposure and output, simulated use, cleaning and disinfection, package integrity and transit testing, electrical safety and electromagnetic compatibility testing, and prediction algorithm reproducibility. As CHLOE BLAST is a software, some of the special controls do not apply to this submission. Software was evaluated at the Basic Documentation level as recommended in the 2023 FDA guidance document "Content of Premarket Submissions for Device Software Functions." ### 8.1. Performance Testing – Algorithm Validation ## Page 8 The purpose of the test was to assess the performance of the CHLOE BLAST algorithm in predicting an embryo's likelihood to form a blastocyst. The algorithm uses Time-Lapse Incubator (TLI) videos from insemination to Day 3 as an input to make its predictions. The test included both the verification of the accuracy of the morphokinetic event detection and the overall performance of the algorithm in predicting blastocyst formation. The dataset used for the performance test was entirely independent from the dataset utilized in the CHLOE BLAST clinical study described in section 9, and the clinics that provided data for the performance dataset were not used to collect data for the clinical study. Each embryo video was viewed by three independent embryologists who provided their morphokinetic stages and Blast annotations based on the time-lapse videos. The annotators were not involved in the training or tuning of the model and were blinded to each other's labels. The TLI videos were annotated at a frame level with the ground truth of one of the morphokinetic stages and at a video level with blastulation results. Number of pronuclei (PNs) and embryo quality (according to SART) were also annotated to allow subgroup analysis. The test dataset for the morphokinetic events detection comprised 1,094 embryos collected from two sites, one located in the US and one in Europe (Norway). The data distribution is described in Table 1 below: | **Clinic Location** | **US** | **Norway** | **Total** | |---|---|---|---| | Number of slides | 88 (62%) | 55 (38%) | 143 | | Number of embryos | 671 (61%) | 423 (39%) | 1094 | | Number of embryos per slide Mean (SD) Min, Max | 7.6 (2.9) 1, 12 | 7.7 (3.0) 1, 12 | 7.7 (2.9) 1, 12 | **Table 1 - MKS Detection Performance Test Data Inventory** The mean age was 36.5 years old (with standard deviation of 4.8) and the mean BMI was 24.1 (with standard deviation of 4.6). The Age and BMI categories distribution is presented in Table 2 below: | **Category** | **Total** | |---|---| | **Age** | | | Age < 35 | 44 (30.8%) | | 35 ≤ Age < 38 | 29 (20.3%) | | 38 ≤ Age < 41 | 45 (31.5%) | | 41 ≤ Age | 25 (17.5%) | | **BMI** | | | Underweight (<18.5) | 5 (3.5%) | | Normal (18.5-24.9) | 97 (67.8%) | | Overweight (25-29.9) | 24 (16.8%) | | Obese (≥30) | 17 (11.9%) | **Table 2 - MKS Detection Performance Test Data Distribution** The overall accuracy of the morphokinetic events detection was 0.82 (95% CI: 0.81, 0.84). The Accuracy of the sub-model when used on the sub-population of embryos with 2PNs was 0.84 (95% CI: 0.83, 0.85). The validation acceptance criteria (AUC lower bound >0.8) were met. When assessing the performance of the morphokinetic events detection across various sub-groups, including age and BMI categories, it was found that the sub-model maintained consistent performance throughout these different categories with the exception of two age groups (<35 and 41≤) and two BMI categories (Underweight and Obese) that didn't meet the predefined acceptance criteria (AUC lower bound >0.8). ## Page 9 The test dataset for the Blast prediction comprised 1,726 embryos collected from two sites, one located in the US and one in Europe (Norway). The data distribution is described in Table 3 below: | **Clinic Location** | **US** | **Norway** | **Total** | |---|---|---|---| | Number of slides | 216 (92.7%) | 17 (7.3%) | 233 | | Number of embryos | 1608 (93%) | 118 (7%) | 1726 | | Number of embryos per slide Mean (SD) Min, Max | 7.4 (3.0) 1, 12 | 6.9 (3.2) 1, 11 | 7.4 (3.0) 1, 12 | **Table 3 - Blast Prediction Performance Test Data Inventory** The mean age was 37.7 years old (with standard deviation of 4.4) and the mean BMI was 23.3 (with standard deviation of 4.1). The Age and BMI categories distribution is presented in Table 4 below: | **Category** | **Total** | |---|---| | **Age** | | | Age < 35 | 49 (21.1%) | | 35 ≤ Age < 38 | 55 (23.7%) | | 38 ≤ Age < 41 | 57 (24.6%) | | 41 ≤ Age | 71 (30.6%) | | **BMI** | | | Underweight (<18.5) | 10 (4.3%) | | Normal (18.5-24.9) | 169 (72.5%) | | Overweight (25-29.9) | 36 (15.5%) | | Obese (≥30) | 18 (7.7%) | **Table 4 - Blast Prediction Performance Test Data Distribution** The AUC of the overall performance of the Blast prediction was 0.88 (95% CI: 0.86, 0.90), and the study acceptance criterion (AUC lower bound >0.8) was met. The AUC-ROC analysis was repeated for the different Maternal Age and BMI categories. In all subgroups, the AUC was similar and higher than 0.8, demonstrating the model performance is preserved over the different subgroups. For one BMI category (Obese), the predefined acceptance criteria (AUC lower bound >0.8) was not met. A reduction in AUC was observed with AUC of 0.81 (95% CI: 0.78, 0.83) for 2PN embryos and 0.74 (95% CI: 0.69, 0.78) for Good/Fair embryos. The effectiveness of CHLOE BLAST for its intended use as an adjunctive tool to support embryo selection following morphological assessment was further evaluated in the company's clinical study. The study was designed to evaluate the clinical utility of CHLOE BLAST in the population of Good/Fair quality embryos. Overall, the results of the performance tests indicate that CHLOE is effective in predicting blastocyst formation and that its performance is consistent across various subgroups. ### 8.2. Reproducibility Test The test validated the robustness of the algorithm to different inputs, including several different optical augmentations of an embryo video and simulating several technical occlusions that may occur in the TLI (that are not related to CHLOE). The simulated technical occlusions included corruptions involving occasional issues related to mechanical movement and corruptions related to image acquisition by the EmbryoScope Version D. This was done by comparing the algorithm's performance in terms of AUC-ROC against the ground truth (blastocyst Yes/No) on the original (non-augmented videos) with its performance on videos that were augmented to simulate clinical technical errors. Performance was calculated for augmented videos with each type of corruption and for a combination of all corruption types. ## Page 10 AUCs for all augmentations were higher than 0.89, with CI lower bound above 0.87. Thus, the acceptance criterion (AUC lower bound >0.8) was met. It was therefore concluded that the reproducibility of the algorithm under clinical conditions was validated. The non-clinical performance data demonstrate that CHLOE BLAST meets the applicable Special Controls for Embryo Image Assessment System, Assisted Reproduction. ## 9. Summary of Clinical Performance Testing ### 9.1. CHLOE BLAST Clinical Study Overview To evaluate the safety and effectiveness of CHLOE BLAST, a pivotal, multicenter, single arm, observational, prospective assessment study was conducted to assess the performance of the CHLOE BLAST algorithm for the prediction of blastocyst formation in women undergoing In-Vitro Fertilization. This was a non-interventional clinical study in which CHLOE BLAST was not used during patient treatment. ### 9.2. CHLOE BLAST Clinical Study Protocol **Study Summary:** The study evaluated the performance of the CHLOE BLAST algorithm (Odds Ratio and other measures) in the prediction of blastocyst formation using embryo assessment conducted based on Day 3 morphology alone, and on Day 3 morphology with CHLOE score as adjunct information. The study consisted of 4 parts: 1. **Retrospective Data Collection:** Time-lapse videos of embryo development and associated maternal and embryo information were collected. 2. **Morphology Grading:** Three independent embryologists ("Assessors") evaluated embryo development until 68 hours (Day 3) using standard grading techniques, based on time-lapse videos. 3. & 4. **Blinded Clinical Assessments:** Five independent embryologists ("Panelists") performed two assessments - In the first, they predicted blastocyst formation based on the morphology grading of the Assessors alone, and in the second, they predicted blastocyst formation based on the morphology grading in addition to the CHLOE score. A two-week washout period between parts 3 and 4 was used to ensure unbiased evaluations. **Study Data:** The study dataset included data collected specifically for the purpose of this study according to the predefined inclusion and exclusion criteria and was segregated from algorithm training and verification datasets. The data was collected from three different sites located in the United States from patients who underwent In-Vitro Fertilization (IVF) during 2020-2021. 703 embryos from 59 mothers were included in the study. Study data contained coded imaging data of embryos cultured at least to Day 5 in an EmbryoScope version D time-lapse incubator set to default settings (i.e., Incubator temperature: 37.0°C, CO₂ concentration: 5%-6%, O₂ concentration: 5% O₂ (balance N₂), Humidity: High (≥90%), Imaging Interval: Every 5-20 min, Focus Mode: Autofocus on each well, Light Intensity: Very low, Image Capture: Multiple focal planes (1-17) per timepoint), as well as the maternal and embryo information. ## Page 11 **Morphology Grading and Clinical Assessments:** - **Morphology Grading:** Three embryologists ("Assessors"), blinded to CHLOE information, performed Day 3 morphological grading according to Society for Assisted Reproductive Technology (SART) embryo morphology grading. They reviewed embryo time-lapse videos up to 68 hours and assessed the following parameters: - Number of cells (1 through 8, 9≤) - Fragmentation (0%, <10%, 11-25%, >25%) - Symmetry (Perfect, Moderately Asymmetric, Severely Asymmetric). Then, the following parameters were categorized by majority agreement (at least 2 of 3 Assessors): - Severe asymmetry (yes/no) - Fragmentation > 25% (yes/no) - Number of cells (1 through 8, 9≤) - **Clinical Assessment:** A panel of 5 embryologists ("Panelists") participated in the prediction of blastocyst status. The Panelists in the study were all in practice during the study period and from a range of geographical areas within the United States. Out of the 5 Panelists, 3 were senior embryologists with over 10 years of clinical embryology experience each, and the other 2 were junior embryologists with less than 3 years of clinical embryology experience. The Panelists were not involved in embryo culturing or morphological grading and were blinded to imaging data. There was no overlap between "Assessors" and "Panelists." In Clinical Assessment Part 1, each Panelist received the morphological assessments done by the Assessors, along with the mother's age. Based on this information, they completed the following tasks: - Assign an embryo category (A: Good, B: Fair+, C: Fair–, D: Poor) - Predict developmental outcome (Blastocyst or Arrested) - Select the top 2 embryos from each subject's full cohort In this part, the Panelists were blinded to the CHLOE score. After a two-week washout period, in Clinical Assessment Part 2, the Panelists were given the same information as in Part 1, with the addition of CHLOE score. They then repeated the same three tasks. In this part, the Panelists were blinded to the results of part 1. ### 9.3. CHLOE BLAST Clinical Study Endpoint **Primary Endpoint:** Association between the adjunct prediction using CHLOE BLAST of blastocyst outcome and the actual blastocyst outcome for Good/Fair embryos, measured by an Odds Ratio (OR) greater than 1. This was used to demonstrate that adjunctive use of CHLOE BLAST leads to embryologist predictions for Day 5 blastulation that are informative for outcome (i.e. blastocyst formation: Yes/No). **Secondary Endpoints:** - OR for predicting blastocyst formation based on adjunct prediction with CHLOE for all embryos. ## Page 12 - OR for predicting blastocyst formation using traditional morphology only, for all embryos as well as for the subset of embryos graded as Good/Fair. - OR for Good/Fair embryos predicting blastocyst formation for each individual embryologist, using traditional morphology only and using morphology plus CHLOE. - The following embryo-level diagnostic performance measures: Specificity, Sensitivity, Negative predictive value (NPV), Positive predictive value (PPV), Negative likelihood ratio, Positive likelihood ratio. - Subgroup analyses (Good/Fair embryos) were conducted for several factors, including age group, BMI category, Race and Ethnicity categories, and clinical site (with CHLOE score). - Subject-level analysis: a subject-level performance of CHLOE. Of note, the following definitions were used during the analysis: - **True Positive (TP)** - 1-2 embryos from the subject are predicted as "Blastocyst" and one of these forms a blastocyst, or - >2 embryos are predicted as "Blastocyst" and at least one of the Top 2, as selected by the panelist, forms a blastocyst. - **False Negative (FN)** - None of the embryos from the subject are predicted as "Blastocyst" but at least one forms a blastocyst, or - 1-2 embryos are predicted as "Blastocyst" but none forms a blastocyst, and at least one other embryo forms a blastocyst, or - >2 embryos are predicted as "Blastocyst" but none in the Top 2, as selected by the panelist, forms a blastocyst, and at least one not in the Top 2 forms a blastocyst. - **False Positive (FP)** - Some embryos from the subject are predicted as "Blastocyst" but none of the embryos from the subject forms a blastocyst. - **True Negative (TN)** - None of the embryos from the subject are predicted as "Blastocyst" and none form a blastocyst. - Top 2 embryo analysis: analysis of embryos that were selected as one of the top two by at least one panelist based on morphology alone. The analysis was conducted on Good/Fair embryos, following the same methodology used for the analysis of the primary endpoint (Odd Ratio **Note:** this was a retrospective, image-based study with no subject interaction. As such, no safety endpoints were applicable or collected, and no adverse events could occur. ### 9.4. CHLOE BLAST Clinical Study Results The primary endpoint of the study was met, with blastocyst prediction for Good/Fair embryos using CHLOE achieving an Odds Ratio (OR) of 5.67 (95% CI: 4.6, 6.99), surpassing the prediction without CHLOE (OR=3.77; 95% CI: 2.97, 4.79). ## Page 13 When performing the analysis for all embryos, the prediction was also higher, with OR of 8.51 (95% CI: 6.97, 10.38) for the prediction using CHLOE, compared to 6.93 (95% CI: 5.58, 8.61) for the prediction without CHLOE. Below is a summary of the performance for Good/Fair embryos across various descriptive diagnostic measures, comparing the use of CHLOE as adjunct information to the use of traditional morphology alone (Clinical Assessment Part 2 / Clinical Assessment Part 1): - Odd Ratio (95% CI): 5.67 (4.6-6.99) / 3.77 (2.97-4.79) - Sensitivity: 0.846 / 0.893 - Specificity: 0.444 / 0.246 - PPV (Positive Predictive Value): 0.629 / 0.569 - NPV (Negative predictive Value): 0.721 / 0.675 - Positive likelihood ratio: 1.52 / 1.185 - Negative likelihood ratio: 0.348 / 0.433 To determine the consistency of the Day 3 morphology followed by adjunct use of CHLOE score among individual Panelists, assessments among the Good/Fair morphology embryos were further evaluated per Panelist using the Odds Ratio. When the CHLOE score was used adjunctively to morphology, the Odds Ratio was improved and was greater than 1 for all embryologists. Repeating the analysis on the subgroups of Age and BMI categories and by site for clinical part 2, showed OR >1 in all subgroups. In all but one age category (≥41) and one BMI category (underweight), the lower bound of the CI was also greater than 1. Analysis of Race and Ethnicity subgroups was found to not support definitive conclusions regarding performance in these groups due to the small number of participants in the non-White and Hispanic subgroups. A subject level performance analysis presented sensitivity (for the 56 subjects who had at least one embryo reach the blastocyst stage) ranging from 80.36% to 83.93% for traditional morphology, and from 87.50% to 92.86% when using CHLOE (for all panelists). The calculation of specificity was limited due to the small sample size of subjects with all embryos arrested. An analysis was performed on the subgroup of embryos that were selected as one of the top two by at least one panelist based on morphology alone (Good/Fair embryos), demonstrating OR of 10.73 (95% CI: 6.19, 18.60) for clinical part 2, compared to 3 (95% CI: 0.96, 9.32) for clinical part 1. ### 9.5. Safety and Adverse Events This study was retrospective, involving the use of de-identified embryo images collected from completed IVF cycles. The subject device was applied solely to existing image data and was not used for clinical decision-making, did not involve patient contact, and was not deployed in real-time. As such, there was no potential for device-related adverse events to occur within the context of the study and hence adverse events were not collected. ### 9.6. CHLOE BLAST Clinical Study Discussion and Conclusion ## Page 14 The study met its primary endpoint, demonstrating a statistically significant association between the adjunctive prediction using CHLOE BLAST and actual blastocyst outcomes for Good/Fair embryos, as indicated by an Odds Ratio (OR) greater than 1. The OR calculated for Good/Fair morphology embryos considered for selection, increased with the addition of CHLOE. Further analysis by maternal Age and BMI categories and by site demonstrated that the OR remained greater than 1 across all subgroups (with only one BMI category and one age category having a lower bound of the CI lower than 1). For all panelists, CHLOE's adjunct prediction showed higher subject-level sensitivity compared to traditional morphology. The top two embryos analysis demonstrated that CHLOE was predictive among embryos considered promising by traditional criteria. The clinical performance data demonstrate that CHLOE BLAST meets the Special Controls for Embryo Image Assessment System, Assisted Reproduction. ## 10. Conclusion The results of the performance testing described above demonstrate that CHLOE BLAST is as safe and effective as the predicate device and supports a determination of substantial equivalence.

§ 884.6195 Assisted Reproduction Embryo Image Assessment System.

(a)
Identification. An Assisted Reproduction Embryo Image Assessment System is a prescription device that is designed to obtain and analyze light microscopy images of developing embryos. This device provides information to aid in the selection of embryo(s) for transfer when there are multiple embryos deemed suitable for transfer or freezing.(b)
Classification. Class II (special controls). The special control(s) for this device are:(1) Clinical performance testing must demonstrate a reasonable assurance of safety and effectiveness of the device to predict embryo development. Classification performance (sensitivity and specificity) and predictive accuracy (Positive Predictive Value and Negative Predictive Value) must be assessed at the subject and embryo levels.
(2) Software validation, verification, and hazard analysis must be provided.
(3) Non-clinical performance testing data must demonstrate the performance characteristics of the device. Testing must include the following:
(i) Total light exposure and output testing;
(ii) A safety analysis must be performed based on maximum (worst-case) light exposure to embryos, which also includes the safety of the light wavelength(s) emitted by the device;
(iii) Simulated-use testing;
(iv) Mouse Embryo Assay testing to assess whether device operation impacts growth and development of mouse embryos to the blastocyst stage;
(v) Cleaning and disinfection validation of reusable components;
(vi) Package integrity and transit testing;
(vii) Hardware fail-safe validation;
(viii) Electrical equipment safety and electromagnetic compatibility testing; and
(ix) Prediction algorithm reproducibility.
(4) Labeling must include the following:
(i) A detailed summary of clinical performance testing, including any adverse events;
(ii) Specific instructions, warnings, precautions, and training needed for safe use of the device
(iii) Appropriate electromagnetic compatibility information;
(iv) Validated methods and instructions for cleaning and disinfection of reusable components; and
(v) Information identifying compatible cultureware and explain how they are used with the device.