Publications
The best thing about being a statistician is that you get to play in everyone's backyard. By John Tukey.
2025
- Integrating Multi-Omics Data to Uncover Prostate Tissue DNA Methylation Biomarkers and Target Genes for Prostate Cancer RiskMolecular Carcinogenesis, 2025
- Integrative Multi-Omics Approach for Improving Causal Gene IdentificationGenetic Epidemiology, 2025
- The Role of Double-Zero-Event Studies in Evidence Synthesis: Evaluating Robustness Using the Fragility IndexJournal of Evaluation in Clinical Practice, 2025
- Transcriptome-Wide Association Study Identified Novel Blood Tissue Gene Biomarkers for Prostate Cancer RiskThe Prostate, 2025
2024
- Associations between genetically predicted plasma protein levels and Alzheimer’s disease risk: a study using genetic prediction modelsAlzheimer’s Research & Therapy, 2024
- A Deep Learning Approach to Nonparametric Propensity Score Estimation with Optimized Covariate BalancearXiv preprint arXiv:2404.04794, 2024
- Regional analysis to delineate intrasample heterogeneity with RegionalSTBioinformatics, 2024
- SUMMIT-FA: a new resource for improved transcriptome imputation using functional annotationsHuman Molecular Genetics, 2024
- Sensitivity analysis with iterative outlier detection for systematic reviews and meta-analysesStatistics in Medicine, 2024
- GigaScienceProteome-wide association study and functional validation identify novel protein markers for pancreatic ductal adenocarcinomaGigaScience, 2024
- omicsMIC: a comprehensive benchmarking platform for robust comparison of imputation methods in mass spectrometry-based omics dataNAR Genomics and Bioinformatics, 2024
- MIMOSA: a resource consisting of improved methylome prediction models increases power to identify DNA methylation-phenotype associationsEpigenetics, 2024
- An atlas of genetic effects on the monocyte methylome across European and African populationsmedRxiv, 2024
- Benchmarking DNA Foundation Models for Genomic Sequence ClassificationbioRxiv, 2024
- Multi-View Integrative Approach For Imputing Short-Chain Fatty Acids and Identifying Key factors predicting Blood SCFAbioRxiv, 2024
- Plos Med.Variability in performance of genetic-enhanced DXA-BMD prediction models across diverse ethnic and geographic populations: A risk prediction studyPLoS Medicine, 2024
- OpencanSARchem: chemistry registration and standardization pipeline for FAIR integration of bioassay data2024
- Multi-scale variational autoencoder for imputation of missing values in untargeted metabolomics using whole-genome sequencing dataComputers in Biology and Medicine, 2024
- DiabetologiaIdentification of proteins associated with type 2 diabetes risk in diverse racial and ethnic populationsDiabetologia, 2024
- Age-Related Hearing Impairment: Genome and Blood Methylome Data Integration Reveals Candidate Epigenetic BiomarkersOMICS: A Journal of Integrative Biology, 2024
- A Zero-Inflated Hierarchical Generalized Transformation Model to Address Non-Normality in Spatially-Informed Cell-Type DeconvolutionbioRxiv, 2024
2023
- Ann. Stat.Breaking the winner’s curse in Mendelian randomization: Rerandomized inverse variance weighted estimatorThe Annals of Statistics, 2023
- BiometricsEfficient targeted learning of heterogeneous treatment effects for multiple subgroupsBiometrics, 2023
- Splicing transcriptome-wide association study to identify splicing events for pancreatic cancer riskCarcinogenesis, 2023
- A splicing transcriptome-wide association study identifies candidate altered splicing for prostate cancer riskOMICS: A Journal of Integrative Biology, 2023
- JASAAssessing the most vulnerable subgroup to type II diabetes associated with statin usage: Evidence from electronic health record dataJournal of the American Statistical Association, 2023
- A splicing transcriptome-wide association study identifies novel altered splicing for Alzheimer’s disease susceptibilityNeurobiology of Disease, 2023
- The effect direction should be taken into account when assessing small-study effectsJournal of Evidence-Based Dental Practice, 2023
- Transl. PsychiatryIdentification of candidate DNA methylation biomarkers related to Alzheimer’s disease risk by integrating genome and blood methylome dataTranslational psychiatry, 2023
- Identification of blood protein biomarkers associated with prostate cancer risk using genetic prediction models: analysis of over 140,000 subjectsHuman Molecular Genetics, 2023
- Mediation Analysis with Mendelian Randomization and Efficient Multiple GWAS IntegrationarXiv preprint arXiv:2312.10563, 2023
- Winner’s Curse Free Robust Mendelian Randomization with Summary DataarXiv preprint arXiv:2309.04957, 2023
- Large-scale imputation models for multi-ancestry proteome-wide association analysisbioRxiv, 2023
2022
- A transcriptome-wide association study identifies novel candidate susceptibility genes for prostate cancer riskInternational Journal of Cancer, 2022
- A transcriptome-wide association study identifies novel blood-based gene biomarker candidates for Alzheimer’s disease riskHuman Molecular Genetics, 2022
- Using R for Cell-Type Composition Imputation in Epigenome-Wide Association StudiesIn Epigenome-Wide Association Studies: Methods and Protocols, 2022
- An autoencoder-based deep learning method for genotype imputationFrontiers in Artificial Intelligence, 2022
- BMC Med.Polygenic risk score improves the accuracy of a clinical risk score for coronary artery diseaseBMC medicine, 2022
2021
- Genet. Med.An integrative multiomics analysis identifies putative causal genes for COVID-19 severityGenetics in Medicine, 2021
- Cancer Comm.Novel strategy for disease risk prediction incorporating predicted gene expression and DNA methylation data: a multi-phased study of prostate cancerCancer Communications, 2021
- Ann. Stat.
- A gene-level methylome-wide association analysis identifies novel Alzheimer’s disease genesBioinformatics, 2021
- Genome Med.A transcriptome-wide association study of Alzheimer’s disease using prediction models of relevant tissues identifies novel candidate susceptibility genesGenome Medicine, 2021
- InTACT: An adaptive and powerful framework for joint-tissue transcriptome-wide association studiesGenetic Epidemiology, 2021
- Associations between genetically predicted protein levels and COVID-19 severityThe Journal of Infectious Diseases, 2021
- Nat. Comm.Accurate recognition of colorectal cancer with semi-supervised deep learning on pathological imagesNature communications, 2021
- Systematic identification of risk factors and drug repurposing options for Alzheimer’s diseaseAlzheimer’s & Dementia: Translational Research & Clinical Interventions, 2021
- Cancer Res.Associations of genetically predicted blood protein biomarkers with pancreatic ductal adenocarcinoma risk: a study using comprehensive protein genetic prediction modelsCancer Research, 2021
- BMC Med.Accurate diagnosis of colorectal cancer based on histopathology images using artificial intelligenceBMC Medicine, 2021
- A machine-learning approach for detection of local brain networks and marginally weak signals identifies novel AD/MCI differentiating connectomic neuroimaging biomarkersbioRxiv, 2021
- Sharp inference on selected subgroups in observational studiesarXiv preprint arXiv:2102.11338, 2021
- Optimized intracellular staining reveals heterogeneous cytokine production ability of murine and human hematopoietic stem and progenitor cellsFrontiers in Immunology, 2021
2020
- JLMRA Regularization-Based Adaptive Test for High-Dimensional GLMsJournal of Machine Learning Research, 2020
- Associations between genetically predicted blood protein biomarkers and pancreatic cancer riskCancer Epidemiology, Biomarkers & Prevention, 2020
- Leveraging existing GWAS summary data of genetically correlated and uncorrelated traits to improve power for a new GWASGenetic Epidemiology, 2020
- Cancer Res.A transcriptome-wide association study identifies candidate susceptibility genes for pancreatic cancer riskCancer Research, 2020
- Nat. Comm.An integrative multi-omics analysis to identify candidate DNA methylation biomarkers related to prostate cancer riskNature Communications, 2020
- Integrating DNA sequencing and transcriptomic data for association analyses of low-frequency variants and lipid traitsHuman Molecular Genetics, 2020
- An adaptive test for meta-analysis of rare variant association studiesGenetic Epidemiology, 2020
- A powerful fine-mapping method for transcriptome-wide association studiesHuman Genetics, 2020
- A review of integrative imputation for multi-omics datasetsFrontiers in Genetics, 2020
- Mendelian randomization analysis to characterize causal association between coronary artery disease and COVID-19medRxiv, 2020
- Multi-trait genome-wide analyses of the brain imaging phenotypes in UK BiobankGenetics, 2020
2019
- An adaptive test on high-dimensional parameters in generalized linear modelsStatistica Sinica, 2019
- Integration of methylation QTL and enhancer-target gene maps with schizophrenia GWAS summary results identifies novel genesBioinformatics, 2019
- Cancer Res.Identification of novel susceptibility loci and genes for prostate cancer risk: a transcriptome-wide association study in over 140,000 European descendantsCancer Research, 2019
2018
- GeneticsIntegration of enhancer-promoter interactions with GWAS summary results identifies novel schizophrenia-associated genes and pathwaysGenetics, 2018
- Adaptive SNP-set association testing in generalized linear mixed models with application to family studiesBehavior Genetics, 2018
- Comparison between two post-dentin bond strength measurement methodsScientific Reports, 2018
- Integrating eQTL data with GWAS summary statistics in pathway-based analysis with application to schizophreniaGenetic Epidemiology, 2018
- An adaptive gene-level association test for pedigree dataBMC Genetics, 2018
- An adaptive gene-based test for methylation dataIn BMC Proceedings, 2018
- Epigenome-Wide Association Study of Moderate-Vigorous Physical Activity in Adult African Americans Identifies Loci Near HCCA2In CIRCULATION, 2018
2017
- AOASA novel and efficient algorithm for de novo discovery of mutated driver pathways in cancerThe Annals of Applied Statistics, 2017
- NeuroimageImaging-wide association study: integrating imaging endophenotypes in GWASNeuroimage, 2017
- A powerful framework for integrating eQTL and GWAS summary dataGenetics, 2017
2016
- JLMRA new algorithm and theory for penalized regression-based clusteringJournal of Machine Learning Research, 2016
- Imputation of missing covariate values in epigenome-wide analysis of DNA methylation dataEpigenetics, 2016
2014
- Evaluation of microarray-based DNA methylation measurement using technical replicates: the Atherosclerosis Risk In Communities (ARIC) StudyBMC Bioinformatics, 2014