2023
Statistical Methodologies for Analyzing Genomic Data
Duan F, Zhang H. Statistical Methodologies for Analyzing Genomic Data. Springer Handbooks 2023, 621-634. DOI: 10.1007/978-1-4471-7503-2_32.Peer-Reviewed Original ResearchLinear discriminant analysisEmpirical Bayesian approachDifferent clustering methodsModel-based clusteringNeural networkStatistical methodologyK-meansVector machineMicroarray data analysisColon cancer datasetBayesian approachClassification methodRand indexStatistical issuesClustering methodMultiple comparison issuesMicroarray dataCancer datasetsComparison issuesHierarchical clusteringT-statisticAlgorithmClassificationClusteringGenomic data
2012
Simulating Realistic Genomic Data With Rare Variants
Xu Y, Wu Y, Song C, Zhang H. Simulating Realistic Genomic Data With Rare Variants. Genetic Epidemiology 2012, 37: 163-172. PMID: 23161487, PMCID: PMC3543480, DOI: 10.1002/gepi.21696.Peer-Reviewed Original Research
2009
Willows: a memory efficient tree and forest construction package
Zhang H, Wang M, Chen X. Willows: a memory efficient tree and forest construction package. BMC Bioinformatics 2009, 10: 130. PMID: 19416535, PMCID: PMC2683818, DOI: 10.1186/1471-2105-10-130.Peer-Reviewed Original ResearchConceptsMassive genotype dataUser-friendly interfaceExcessive memory demandsHigh-dimensional dataNew software packageGenomic dataFriendly interfaceUse of memoryDimensional dataMassive amountsRandom forestPartitioning techniquesHigh-throughput genomic dataPowerful bioinformatics toolsEfficient treeComputer memorySoftware packageMassive sizeForest methodMemory demandsConstruction packagesSingle nucleotide polymorphismsBioinformatics toolsSNP dataGenotyping platforms
2006
Statistical Methodologies for Analyzing Genomic Data
Duan F, Zhang H. Statistical Methodologies for Analyzing Genomic Data. Springer Handbooks 2006, 607-621. DOI: 10.1007/978-1-84628-288-1_33.Peer-Reviewed Original ResearchLinear discriminant analysisDifferent clustering methodsEmpirical Bayesian approachModel-based clusteringNeural networkK-meansVector machineMicroarray data analysisColon cancer datasetClassification methodRand indexStatistical methodologyClustering methodBayesian approachMicroarray dataStatistical issuesMultiple comparison issuesComparison issuesHierarchical clusteringT-statisticAlgorithmClassificationClusteringGenomic dataClassification analysis