Identifying and optimizing human endometrial gene expression signatures for endometrial dating

P Diaz-Gimeno, P Sebastian-Leon, JM Sanchez-Reyes, K Spath, A Aleman, C Vidal…
Human Reproduction, 2022 - academic.oup.com
STUDY QUESTION
What are the key considerations for developing an enhanced transcriptomic method for secretory endometrial tissue dating?
SUMMARY ANSWER
Multiple gene expression signature combinations can serve as biomarkers for endometrial dating, but their predictive performance is variable and depends on the number and identity of the genes included in the prediction model, the dataset characteristics and the technology employed for measuring gene expression.
WHAT IS KNOWN ALREADY
Among the new generation of transcriptomic endometrial dating (TED) tools developed in the last decade, there exists variation in the technology used for measuring gene expression, the gene makeup and the prediction model design. A detailed study, comparing prediction performance across signatures for understanding signature behaviour and discrepancies in gene content between them, is lacking.
STUDY DESIGN, SIZE, DURATION
A multicentre prospective study was performed between July 2018 and October 2020 at five different centres from the same group of clinics (Spain). The study recruited 281 patients, of whom 225 Caucasian patients who underwent IVF treatment were included in the gene expression analysis. After preprocessing and batch effect filtering, gene expression measurements from 217 patients were combined with artificial intelligence algorithms (support vector machine, random forest and k-nearest neighbours), allowing evaluation of different prediction models. In addition, secretory-phase endometrial transcriptomes from Gene Expression Omnibus (GEO) datasets were analysed for 137 women, to study the endometrial dating capacity of genes independently and grouped by signatures. This provided data on the consistency of prediction across different gene expression technologies and datasets.
PARTICIPANTS/MATERIALS, SETTING, METHODS
Endometrial biopsies were analysed using a targeted TruSeq (Illumina) custom RNA expression panel called the endometrial dating panel (ED panel). This panel included 301 genes previously considered relevant for endometrial dating as well as new genes selected for their anticipated value in detecting the secretory phase. Final samples (n = 217) were divided into a training set for signature discovery and an independent testing set for evaluation of predictive performance of the new signature. In addition, secretory-phase endometrial transcriptomes from GEO were analysed for 137 women to study endometrial dating capacity of genes independently and grouped by signatures. Predictive performance among these signatures was compared according to signature gene set size.
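The training/testing design described above can be illustrated with a minimal sketch. It uses k-nearest neighbours, one of the three algorithm families named in the study design, on purely simulated data. The stage labels, the number of genes, the 70/30 split, the value of k and all function names are assumptions made for illustration; none of them are taken from the study, and this is not the authors' pipeline.

```python
import math
import random
from collections import Counter


def make_synthetic_samples(n_per_stage, n_genes, rng):
    """Simulated data only: each stage shifts the mean expression of every gene."""
    stages = ["early", "mid", "late"]  # hypothetical stage labels
    samples = []
    for shift, stage in enumerate(stages):
        for _ in range(n_per_stage):
            profile = [rng.gauss(3.0 * shift, 1.0) for _ in range(n_genes)]
            samples.append((profile, stage))
    return samples


def knn_predict(train, profile, k=5):
    """Majority vote among the k training samples closest in Euclidean distance."""
    nearest = sorted(train, key=lambda s: math.dist(s[0], profile))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def evaluate(train, test, k=5):
    """Fraction of held-out samples whose stage is predicted correctly."""
    hits = sum(knn_predict(train, profile, k) == label for profile, label in test)
    return hits / len(test)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
data = make_synthetic_samples(n_per_stage=30, n_genes=10, rng=rng)
rng.shuffle(data)

# Hold out 30% of samples; the model never sees them during "training".
split = len(data) * 7 // 10
train_set, test_set = data[:split], data[split:]

accuracy = evaluate(train_set, test_set)
print(f"held-out accuracy: {accuracy:.2f}")
```

Keeping the test samples out of signature discovery is what makes the reported accuracy an estimate of performance on unseen patients rather than a measure of fit to the training data.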
MAIN RESULTS AND THE ROLE OF CHANCE
Testing of the ED panel allowed development of a model based on a new signature of 73 genes, which we termed ‘TED’ and which delivers an enhanced tool for the consistent dating of secretory phase progression, especially during the mid-secretory endometrium (3–8 days after progesterone (P) administration (P + 3–P + 8) in a hormone replacement therapy cycle). This new model showed the best predictive capacity in an independent test set for staging the endometrial tissue in the secretory phase, especially in the expected window of implantation (average of 114.5 ± 7.2 h of progesterone administered; range in our patient population of 82–172 h). Published sets of genes in current use for endometrial dating, and the new TED genes, were evaluated in parallel in whole-transcriptome datasets and in the ED panel dataset. TED …
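The hour and day figures quoted in this section can be related by simple arithmetic. The sketch below assumes that P + n denotes n × 24 h of progesterone administration; that convention, and the helper name `hours_to_p_day`, are assumptions for illustration and are not stated in the abstract.

```python
def hours_to_p_day(hours):
    """Convert hours of progesterone administration to days (assumes P + n = n * 24 h)."""
    return hours / 24.0


# Figures quoted in the abstract: mean 114.5 h, range 82-172 h.
for label, hours in [("range start", 82), ("mean", 114.5), ("range end", 172)]:
    print(f"{label}: {hours} h = P + {hours_to_p_day(hours):.2f} days")
```

Under that assumption, the mean and both ends of the reported range fall inside the P + 3 to P + 8 interval given for the mid-secretory endometrium.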
Oxford University Press