公共文化服务平台

一种改进的六联体使用频率编码测度被引量：2: 2005年; 在基因预测软件中常用的编码测度得到的序列编码潜力大小往往与序列的C+G含量紧密相关,从而影响了对蛋白编码区的识别效果.研究发现六联体使用偏好与其自身C+G含量存在一种近似线性的相关性,据此提出了一种改进的六联体使用偏好模型,通过综合考虑六联体使用频率与六联体的C+G含量,可简便有效地减小序列编码潜力大小对序列C+G含量的依赖性.测试表明,与分类建模策略相比,该方法所需的训练数据较少,而且具有更好的蛋白编码区识别效果,因此可用于基因预测软件中以提高蛋白编码区与基因结构的预测精度.; 周艳红毕然唐睿

基于隧道机制实现H.323中防火墙和NAT的穿越被引量：5: 2005年; 针对H.323协议与防火墙和NAT设备共处时的问题及现有解决方案的不足,提出了一种建立在运输层协议上的隧道穿越技术的解决方案:隧道机制逻辑上由客户和服务器端两部分组成,采用SocksV5协议在客户端和服务器之间建立一条或多条TCP/UDP隧道,并制订了客户端和服务器端的工作规则,从而使H.323通信中的各种信令和媒体数据流能透明的穿越防火墙和NAT设备.仿真结果表明,相比其他解决方案,本方案可以更好地解决H.323通信中穿越防火墙和NAT设备这一问题.; 刘怀兰匡松柏陈光亮黄若宏; 关键词：网络地址转换防火墙隧道 SOCKS V5

基于有效性测度的基因表达数据的模糊聚类分析被引量：6: 2005年; 本文讨论了模糊聚类中的模糊C均值算法和聚类有效性测度。结合基因微阵列的特点,设计并实现了一种基于聚类有效性函数的模糊C均值模型。将该种模型运用于公开的白血病基因表达数据,取得了与实际情况相吻合的实验结果。; 刘青邓庆山; 关键词：基因表达数据模糊聚类 FCM 聚类有效性

基于强泛化神经网络的大规模基因表达数据分析被引量：3: 2005年; DNA微阵列技术使人们可同时观测成千上万个基因的表达水平,对其数据的分析已成为生物信息学研究的焦点。针对微阵列基因表达数据维数高、样本小、非线性的特点,设计并实现了一种基因表达数据分类识别方法,针对结肠数据集的实验表明其泛化效果有所增强。; 刘青周鹏; 关键词：基因表达数据结肠 DNA微阵列技术神经网络

基于支持向量机识别真核生物DNA中的翻译起始位点被引量：3: 2003年; 翻译起始位点(TIS)的识别是真核生物基因预测的关键步骤之一,近年来一直得到研究人员的高度重视。基于TIS附近序列的统计特性,出现了一些辨识TIS的判别方法,但识别精度还有待进一步提高。针对传统支持向量机(SVM)方法中存在的不足,提出了基于数据优化法的SVM,它通过其它统计学模型优化训练数据集,进而提高分类器的辨识精度。实验结果表明基于数据优化法的SVM分类器在翻译起始位点的辨识上可获得比其他判别方法更好的效果。; 詹泳周艳红卢正鼎; 关键词：支持向量机数据优化敏感度 DNA

A Contact Energy Function Considering Residue Hydrophobic Environment and Its Application in Protein Fold Recognition被引量：1: 2005年; The three-dimensional （3D） structure prediction of proteins ：is an important task in bioinformatics. Finding energy functions that can better represent residue-residue and residue-solvent interactions is a crucial way to improve the prediction accu- racy. The widely used contact energy functions mostly only consider the contact frequency between different types of residues; however, we find that the contact frequency also relates to the residue hydrophobic environment. Accordingly, we present an improved contact energy function to integrate the two factors, which can reflect the influence of hydrophobic interaction on the stabilization of protein 3D structure more effectively. Furthermore, a fold recognition （threading） approach based on this energy function is developed. The testing results obtained with 20 randomly selected proteins demonstrate that, compared with common contact energy functions, the proposed energy function can improve the accuracy of the fold template prediction from 20% to 50%, and can also improve the accuracy of the sequence-template alignment from 35% to 65%.; Mo-Jie Duan Yan-Hong Zhou

Prediction and Classification of Human G-protein Coupled Receptors Based on Support Vector Machines被引量：2: 2005年; A computational system for the prediction and classification of human G-protein coupled receptors （GPCRs） has been developed based on the support vector machine （SVM） method and protein sequence information. The feature vectors used to develop the SVM prediction models consist of statistically significant features selected from single amino acid, dipeptide, and tripeptide compositions of protein sequences. Furthermore, the length distribution difference between GPCRs and non-GPCRs has also been exploited to improve the prediction performance. The testing results with annotated human protein sequences demonstrate that this system can get good performance for both prediction and classification of human GPCRs.; Yun-Fei Wang Huan Chen Yan-Hong Zhou; 关键词：GPCR PREDICTION CLASSIFICATION SVM

Predicting the Coupling Specificity of G-protein Coupled Receptors to G-proteins by Support Vector Machines: 2005年; G-protein coupled receptors （GPCRs） represent one of the most important classes of drug targets for pharmaceutical industry and play important roles in cellular signal transduction. Predicting the coupling specificity of GPCRs to G-proteins is vital for further understanding the mechanism of signal transduction and the function of the receptors within a cell, which can provide new clues for pharmaceutical research and development. In this study, the features of amino acid compositions and physiochemical properties of the full-length GPCR sequences have been analyzed and extracted. Based on these features, classifiers have been developed to predict the coupling specificity of GPCRs to G-protelns using support vector machines. The testing results show that this method could obtain better prediction accuracy.; Cui-Ping Guan Zhen-Ran Jiang Yan-Hong Zhou; 关键词：GPCR G-PROTEIN

水稻基因数据集的构建与特征分析被引量：1: 2007年; 水稻基因序列的特征分析解水稻基因组的组成与结构规律具有重要意义。以常用的基因结构特征为基础,进一步分析功能位点信号(翻译起始位点、翻译终止位点、供体位点和受体位点)附近序列各位的保守性、密码子使用、剪接位点上、下游区域的碱基组成等特征,并且发现了这些特征与序列C+G含量的依赖关系。; 朱建丽; 关键词：水稻基因组

支持向量机方法预测离子通道蛋白被引量：1: 2007年; 讨论一种基于蛋白质结构域的方法预测离子通道蛋白。通过将蛋白质的结构域转化成为固定长度的向量,使用支持向量机方法进行离子通道蛋白的预测,并将预测结果与线性判别分析以及利用InterPro与GO映射规则进行预测的结果进行了比较。通过留一法交叉验证,取得最好的预测效果,敏感度为95.9%,专一性为98.3%。; 涂白毕然; 关键词：离子通道结构域支持向量机基因本体

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(90203011)