Abstract
Substantial experimental evidence indicates that the selection of HIV integration sites is not random and is closely linked to host genome function and chromosomal structural features. However, the specific selection mechanism remains unclear. In this report, bioinformatics methods such as clustering analysis, feature extraction, and classification analysis were applied to HIV integration site sequences in order to mine the relationships among these sequences and to explore the rules governing integration site selection. Through experiments and computation, a vector set containing 6 feature vectors was extracted from the collection of HIV integration sites. This set correlates highly with the feature vectors of most integration sites, which points to regularity in HIV integration site selection: host DNA sequences that fit the vector set may serve as HIV integration sites. The results provide a reference for further study of the selection mechanism of HIV integration sites.
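The abstract does not state which sequence features were used or how the correlation was computed. As a rough illustration of the general idea only, the sketch below assumes dinucleotide (k = 2) frequency vectors as features and Pearson correlation as the similarity measure; the names `kmer_vector`, `pearson`, and `best_match`, and the choice of k, are hypothetical and not taken from the paper.

```python
from itertools import product
from math import sqrt

K = 2  # assumed k-mer length; the abstract does not specify the features used
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]


def kmer_vector(seq):
    """Return the normalized k-mer frequency vector of a DNA sequence."""
    seq = seq.upper()
    counts = dict.fromkeys(KMERS, 0)
    for i in range(len(seq) - K + 1):
        kmer = seq[i:i + K]
        if kmer in counts:  # skip windows containing N or other symbols
            counts[kmer] += 1
    total = sum(counts.values())
    return [counts[k] / total if total else 0.0 for k in KMERS]


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def best_match(seq, feature_set):
    """Return (index, correlation) of the feature vector most correlated with seq."""
    v = kmer_vector(seq)
    scores = [pearson(v, f) for f in feature_set]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]
```

In this sketch, a candidate host sequence would be scored with `best_match(candidate, feature_set)`, where `feature_set` stands in for the six extracted feature vectors; a high correlation would mark the sequence as fitting the set. How the actual vector set was derived by clustering is not reproduced here.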
Source
Chinese Journal of Bioinformatics (《生物信息学》)
2010, Issue 3, pp. 194-197 (4 pages)
Funding
Guangzhou Science and Technology Program Project (2006z1-10061)
Keywords
HIV
integration site
selection mechanism
clustering analysis
feature extraction
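About the authors
Author: Sun Hanshun (孙汉顺), male, from Yancheng, Jiangsu; master's student; research interests: bioinformatics and systems biology. Email: sunhanshun@hotmail.com
Corresponding author: Zheng Wenling (郑文岭), male, Ph.D., professor, doctoral supervisor; research interests: bioinformatics, molecular biology, and gene chips. Email: wenli668@gmail.com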