国产成人久久777777-国产农村妇女毛片精品久久-精品少妇人妻AV一区二区-少妇人妻精品一区二区三区-无码人妻精品一区二区

Selections of data preprocessing met

時(shí)間:2023-04-26 10:13:56 自然科學(xué)論文 我要投稿
  • 相關(guān)推薦

Selections of data preprocessing methods and similarity metrics for gene cluster analysis

Clustering is one of the major exploratory techniques for gene expression data analysis. Only with suitable similarity metrics and when datasets are properly preprocessed, can results of high quality be obtained in cluster analysis. In this study, gene expression datasets with external evaluation criteria were preprocessed as normalization by line, normalization by column or logarithm transformation by base-2, and were subsequently clustered by hierarchical clustering, k-means clustering and self-organizing maps (SOMs) with Pearson correlation coefficient or Euclidean distance as similarity metric. Finally, the quality of clusters was evaluated by adjusted Rand index. The results illustrate that k-means clustering and SOMs have distinct advantages over hierarchical clustering in gene clustering, and SOMs are a bit better than k-means when randomly initialized. It also shows that hierarchical clustering prefers Pearson correlation coefficient as similarity metric and dataset normalized by line. Meanwhile, k-means clustering and SOMs can produce better clusters with Euclidean distance and logarithm transformed datasets. These results will afford valuable reference to the implementation of gene expression cluster analysis.

作 者: YANG Chunmei WAN Baikun GAO Xiaofeng   作者單位: YANG Chunmei,WAN Baikun(Department of Biomedical Engineering and Scientific Instrumentations, Tianjin University, Tianjin 300072, China)

GAO Xiaofeng(Motorola (China) Electronics Ltd., Tianjin 300457, China) 

刊 名: 自然科學(xué)進(jìn)展(英文版)  SCI 英文刊名: PROGRESS IN NATURAL SCIENCE  年,卷(期): 2006 16(6)  分類號(hào): N1  關(guān)鍵詞: gene expression   cluster analysis   data preprocessing   similarity metrics   Rand index  

【Selections of data preprocessing met】相關(guān)文章:

I met you online05-04

知識(shí)管理系統(tǒng)Data Solution研發(fā)日記之三 文檔解決方案04-28

主站蜘蛛池模板: 六盘水市| 墨脱县| 响水县| 商丘市| SHOW| 安庆市| 麟游县| 平顺县| 太原市| 铁岭县| 铁岭市| 黄浦区| 霍林郭勒市| 庆阳市| 永清县| 黄陵县| 红安县| 闻喜县| 四会市| 门头沟区| 聂荣县| 乐安县| 礼泉县| 灵石县| 湘乡市| 正蓝旗| 郸城县| 越西县| 永昌县| 闽侯县| 东安县| 施秉县| 灵台县| 新巴尔虎右旗| 东阿县| 阳春市| 永州市| 威远县| 丽江市| 疏勒县| 射洪县|