Skip to main content

Cornell University

Dissertation Seminar: Li Yao (Yu Lab)

Thesis Defense Li Yao

Graduate Field of Computational Biology.

MEMO. 

To: Members of the Graduate Field of Computational Biology.

FROM: Philipp Messer, Director of Graduate Studies DATE: April 2, 2025.

Li Yao has scheduled his B exam on Thursday, April 10, 2025, at 2:30 p.m., in G01 Biotechnology. Li’s title is ” Genome-Wide Identification of Cell-Type-Specific Active Enhancers.” His dissertation advisor is Haiyuan Yu, Professor, Computational Biology. 

Abstract: 

Enhancers play an essential role in regulating gene expression, and their identification is crucial for understanding molecular mechanisms and disease etiology. Previous studies relied on epigenomic marks, such as chromatin accessibility and histone modifications (e.g., H3K27ac, H3K4me1), which may not offer the specificity necessary for precise enhancer detection. Recent advances have revealed that enhancers can transcribe their own RNAs, known as eRNAs, providing a novel approach for identifying active enhancers through RNA sequencing. With a variety of sequencing assays available, selecting one that adequately addresses the low abundance and short half-life of eRNAs is critical for accurately identifying active enhancers genome-wide. 

This dissertation compared 13 RNA sequencing assays for detecting eRNAs, demonstrating that run-on assays such as GRO/PRO-cap excel in sensitivity and resolution due to their ability to capture nascent transcripts. To better identify transcribed enhancers, the computational tool Peak Identifier for Nascent Transcription Starts (PINTS) was developed, designed specifically for libraries generated by assays profiling transcription initiation sites of the nascent transcriptome. Application of PINTS led to a comprehensive compendium of transcribed enhancers across diverse tissues and cell types, providing an invaluable resource for studying gene regulation. 

Despite its strengths, GRO/PRO-cap requires substantial input material similar to many other bulk assays, limiting its utility in dissecting cell-type-specific regulatory patterns in tissue samples. To overcome this limitation, the quasi-supervised cross-modality deconvolution framework, Deep-learning-based DEconvolution of Tissue profiles with Accurate Interpretation of Locus-specific Signals (DeepDETAILS), was introduced. This model reconstructs cell-type-specific profiles from bulk sequencing libraries using a reference scATAC-seq library. DeepDETAILS can deconvolve signals captured from various regulatory steps, including transcription initiation, pause-release, and histone modifications, demonstrating its broad applicability. 

By employing DeepDETAILS, a comprehensive compendium of cell-type-specific regulatory maps was created, encompassing 39 human tissues and 86 distinct cell types. This resource facilitated the identification of a risk variant potentially contributing to the etiology of primary sclerosing cholangitis. Together, this dissertation identified cell-type-specific enhancers genome-wide, advancing our understanding of transcriptional regulation and its implications for cellular function and disease mechanisms. 

Publication list: 

1. Yao, L., Shah, S. R., Ozer, A., Zhang, J., Pan, X., Xia Leung, A. K.-Y., Lis, J. T., & Yu, H. (2025). High-resolution reconstruction of cell-type specific transcriptional regulatory processes from bulk sequencing samples. bioRxiv. 

2. Yao, L., Liang, J., Ozer, A., Leung, A. K.-Y., Lis, J. T.†, & Yu, H.† (2022). A comparison of experimental assays and analytical methods for genome-wide identification of active enhancers. Nature Biotechnology, 40(7), Article 7. 

3. Chen, Y.*, Paramo, M. I.*, Zhang, Y.*, Yao, L.*, Shah, S. R., Jin, Y., Zhang, J., Pan, X., & Yu, H. (2023). Finding Needles in the Haystack: Strategies for Uncovering Noncoding Regulatory Variants. Annual Review of Genetics, 57(1), 201–222. 

4. Leung, A. K.-Y.*, Yao, L.*, & Yu, H. (2022). Functional genomic assays to annotate enhancer–promoter interactions genome wide. Human Molecular Genetics, 31(R1), R97–R104. 

5. Wang, N., Pachai, M. R., Li, D., Lee, C. J., Warda, S., Khudoynazarova, M. N., Cho, W. H., Xie, G., Shah, S. R., Yao, L., Qian, C., Wong, E. W. P., Yan, J., Tomas, F. V., Hu, W., Kuo, F., Gao, S. P., Luo, J., Smith, A. E., … Chen, Y. (2025). Loss of Kmt2c or Kmt2d primes urothelium for tumorigenesis and redistributes KMT2A–menin to bivalent promoters. Nature Genetics, 1–15. 

6. Zhang, J., Leung, A. K.-Y., Zhu, Y., Yao, L., Willis, A., Pan, X., Ozer, A., Zhou, Z., Liang, J., Tippens, N. D., Siklenka, K., Barrera, A., Reddy, T. E., ENCODE consortium, J. T.†, & Yu, H.† (2025). Comprehensive Evaluation of Diverse Massively Parallel Reporter Assays to Functionally Characterize Human Enhancers Genome-wide. bioRxiv. 

7. Paramo, M. I., Leung, A. K.-Y., Shah, S. R., Zhang, J., Tippens, N. D., Liang, J., Yao, L., Jin, Y., Pan, X., Ozer, A., Lis, J. T.†, & Yu, H.† (2025). Simultaneous measurement of intrinsic promoter and enhancer potential reveals principles of functional duality and regulatory reciprocity (p. 2025.03.14.643265). bioRxiv. 

8. Cotter, K. A., Shah, S. R., Paramo, M. I., Lou, S., Yao, L., Rubin, P. D., Chen, Y., Gerstein, M., Rubin, M. A., & Yu, H. (2022). Capped nascent RNA sequencing reveals novel therapy-responsive enhancers in prostate cancer (p. 2022.04.08.487666). bioRxiv. 

9. Fragoza, R., Das, J., Wierbowski, S. D., Liang, J., Tran, T. N., Liang, S., Beltran, J. F., Rivera-Erick, C. A., Ye, K., Wang, T.-Y., Yao, L., Mort, M., Stenson, P. D., Cooper, D. N., Wei, X., Keinan, A., Schimenti, J. C., Clark, A. G., & Yu, H. (2019). Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations. Nature Communications, 10(1), Article 1. 

Start Date: April 10, 2025
Start Time: 2:30 pm
Location: Biotech
Room: G01

Skip to toolbar