Kei-Hoi Cheung, PhD

Professor of Biomedical Informatics & Data Science

DownloadHi-Res Photo

Appointments

Biomedical Informatics & Data Science

Primary

Biostatistics

Secondary

Additional Titles

Professor, Biostatistics

Contact Info

kei.cheung@yale.edu

About

Titles

Professor of Biomedical Informatics & Data Science

Professor, Biostatistics

Biography

Kei-Hoi Cheung, PhD has distinguished himself as a researcher and educator in the field of Biomedical Informatics with a growing national and international reputation. A particular strength is Dr. Cheung’s ability to forge strong, productive collaborations with a range of different bioscience researchers at Yale, in which his contributions include the development of complex databases and informatics tools that are critical for the research projects being performed. In the context of these collaborations, Dr. Cheung is simultaneously able to carry out his own informatics research on issues involved in robust interoperation and integration of databases and tools in the biosciences. In addition to giving talks and presentations at national and international meetings, he has published his own informatics research in peer-reviewed journals and conference proceedings, as well as contributing to publications focused on his collaborators’ research. He has established a broad base of collaborations with faculty in different departments at Yale, including Genetics, Pathology, Computer Science, Biostatistics, Molecular Biophysics and Biochemistry, and Biology. He was Director of Biostatistics and Bioinformatics Core of the NIDA Proteomics Center, focused on collaborative informatics support of neuroproteomics research at Yale. In addition to being a collaborator on numerous grants, Dr. Cheung has been PI on several federal grants (NIH and NSF). Dr. Cheung is also a core faculty member of Yale's Ph.D. Program in Computational Biology and Bioinformatics.

Dr. Cheung’ s research interests include the semantic web using the next generation of web technologies to integrate life science data and tools, and is co-editor of two books for Springer-Verlag titled: “Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences” and “Semantic e-Science”, respectively. Dr. Cheung also led the BioRDF task force (2008-2010) of the Semantic Web for Health Care and Life Sciences Interest Group that is an international community engaging in the creative use of Semantic Web in biomedicine. In addition, Dr. Cheung has recently embarked on natural language processing (NLP) projects in annotating, extracting and retrieving information from clinical text as part of the Veteran Administration (VA) electronic medical records. In summary, Dr. Cheung’s biomedical informatics expertise in database/semantic web research and NLP tool development, his national and international recognition as a researcher/educator, and his research contributions in these areas exemplify the attributes of a prominent researcher in biomedical informatics.

Last Updated on December 05, 2023.

Appointments

Biomedical Informatics & Data Science
Professor
Primary
Biomedical Informatics & Data Science
Biostatistics
Professor
Secondary
Biostatistics

Alzheimer's Disease Research Center (ADRC)
Biomedical Informatics & Data Science
Biostatistics
Computational Biology and Biomedical Informatics
Computational Biology and Bioinformatics
Emergency Medicine York Street Campus Faculty
NIDA Neuroproteomics Center
Yale Combined Program in the Biological and Biomedical Sciences (BBS)
Yale School of Public Health
Yale Superfund Research Center
Yale Ventures
Yale-BI Biomedical Data Science Fellowship

Education & Training

PhD: University of Connecticut, Computer Science (1998)

Research

Overview

Ongoing Projects:

Yale Protein Expression Database (YPED). YPED is an institution-wide database for use by proteomics researchers at Yale and outside of Yale
Human Immunology Project Consortium (HIPC). HIPC was established by NIAID, which generates a wide variety of phenotypic and molecular data from well-characterized patient cohorts, including genome-wide
expression profiling, high-dimensional flow cytometry and serum cytokine concentrations. The adoption and adherence
to data standards is critical to enable data integration across HIPC centers, and facilitate data re-use by the wider scientific community. One key component of HIPC involves data standardization effort, along with the infrastructure that has been developed.
Center for Expanded Data Annotation and Retrieval (CEDAR). CEDAR is part of the Big Data to Knowledge (BD2K) initiative funded by NIH. It studies the creation of comprehensive and expressive metadata for biomedical datasets to facilitate data discovery, data interpretation, and data reuse.
Clinical Natural Language Processing (NLP). To extract and retrieve information from large amounts of clinical notes (unstructured data) for facilitating clinical research, a variety of NLP techniques including the incorporation of ontologies have been explored in different domains including lung/colon cancer, post-traumatic stress disorder, psychogenic nonepileptic seizure, and chronic pain.

Medical Research Interests

Anesthesiology; Databases, Genetic; Emergency Medicine; Medical Informatics; Natural Language Processing; Technology

ORCID
0000-0001-6432-9372
Biomedical Informatics & Data Science
View Lab Website

Research at a Glance

Yale Co-Authors

Frequent collaborators of Kei-Hoi Cheung's published research.

Mihaela Aslan, PhD
View Full Profile
View 2 Common Publications
Nallakkandi Rajeevan, PhD
View Full Profile
View 2 Common Publications
Caroline Zeiss, DACVP, DACLAM
View Full Profile
View Common Publication
Hongyu Zhao, PhD
View Full Profile
View Common Publication
Michael Krauthammer
Former YSM
View Common Publication
Michael Wininger, PhD
View Full Profile
View Common Publication

Publications

Academic Achievements & Community Involvement

Activities

activity
SenseLab Project
01/01/2006 - 01/01/2006Research
Details

News & Links

News

Related Links

Get In Touch

Contacts

kei.cheung@yale.edu

Appointments

Additional Titles

Contact Info

Titles

Biography

Appointments

Biomedical Informatics & Data Science

Biostatistics

Other Departments & Organizations

Education & Training

Overview

Medical Research Interests

ORCID

Biomedical Informatics & Data Science

Research at a Glance

Yale Co-Authors

Mihaela Aslan, PhD

Nallakkandi Rajeevan, PhD

Caroline Zeiss, DACVP, DACLAM

Hongyu Zhao, PhD

Michael Krauthammer

Michael Wininger, PhD

Publications

2026

2025

2024

2013

2012

2010

2007

2005

Activities

SenseLab Project

News

BIDS Showcases at the 2025 AI at Yale Symposium

Cheung Receives NIH Grant to Research Water Contaminants and Human Health

What Does Natural Language Processing Mean for Biomedicine?

Related Links

Contacts