Research – Kuang Lab

Our lab is particularly interested in large-scale genomic and biomedical data analysis with machine learning and network-based methods for research problems in health-related and biological science. The two broad areas for my research are 1) phenome-genome association analysis and 2) cancer outcome prediction and biomarker identification. In the first area, we performed large-scale association analysis between all genes and the complete collection of phenotypes (phenome) by network-based machine learning methods. In the second area, we developed graph-based learning models and kernel methods to capture the structures in single-cell RNA sequencing data, high-dimensional gene (isoform) expressions and DNA copy number variations for improved cancer outcome prediction and robust biomarker identification. In addition, we also developed kernel methods for protein classification. Our current projects center around the following topics,

Spatial and single-cell transcriptomics: Spatial transcriptomics technologies have enabled spatially-resolved RNA profiling of single cells with cell identities and localizations for understanding cells’ organizations and functions. Our group develops new machine learning methods for mining RNA profiles collected from single cells and their spatial locations.
6 entries « ‹ 2 of 2 › »
Zhang, Huanan; Lee, Catherine A. A.; Li, Zhuliu; Garbe, John R.; Eide, Cindy R.; Petegrosso, Raphael; Kuang, Rui; Tolar, Jakub
A Multitask Clustering Approach for Single-cell RNA-Seq Analysis in Recessive Dystrophic Epidermolysis Bullosa Journal Article
In: PLOS Computational Biology, vol. 14, no. 4, 2018.
Abstract | Links | BibTeX
@article{multitask_zhang,
title = {A Multitask Clustering Approach for Single-cell RNA-Seq Analysis in Recessive Dystrophic Epidermolysis Bullosa},
author = {Huanan Zhang and Catherine A. A. Lee and Zhuliu Li and John R. Garbe and Cindy R. Eide and Raphael Petegrosso and Rui Kuang and Jakub Tolar},
url = {http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006053},
doi = {https://doi.org/10.1371/journal.pcbi.1006053},
year = {2018},
date = {2018-04-05},
journal = {PLOS Computational Biology},
volume = {14},
number = {4},
abstract = {Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells.
Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale Drop-seq dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry.
},
keywords = {},
pubstate = {published},
tppubtype = {article}
}

Close
Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells.
Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale Drop-seq dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry.

Close
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006053
doi:https://doi.org/10.1371/journal.pcbi.1006053
Close
6 entries « ‹ 2 of 2 › »

Cancer genomics: Development of graph-based learning algorithms, sequence alignment algorithms and association rule-mining algorithms for building predictive models and mining biomarkers of cancer phenotypes from microarray or sequencing transcriptome data, DNA copy number variations, SNPs and protein-protein interactions.

13 entries « ‹ 2 of 3 › »

Chien, Jeremy; Kuang, Rui; Landen, Charles; Shridhar, Viji

Platinum-sensitive recurrence in ovarian cancer: the role of tumor microenvironment Journal Article

In: Frontiers in oncology, vol. 3, pp. 251, 2013.