Essential proteins are crucial for biological processes and can be identified through both experimental and computational methods.While experimental approaches are highly accurate,they often demand extensive time and ...Essential proteins are crucial for biological processes and can be identified through both experimental and computational methods.While experimental approaches are highly accurate,they often demand extensive time and resources.To address these challenges,we present a computational ensemble learning framework designed to identify essential proteins more efficiently.Our method begins by using node2vec to transform proteins in the protein–protein interaction(PPI)network into continuous,low-dimensional vectors.We also extract a range of features from protein sequences,including graph-theory-based,information-based,compositional,and physiochemical attributes.Additionally,we leverage deep learning techniques to analyze high-dimensional position-specific scoring matrices(PSSMs)and capture evolutionary information.We then combine these features for classification using various machine learning algorithms.To enhance performance,we integrate the outputs of these algorithms through ensemble methods such as voting,weighted averaging,and stacking.This approach effectively addresses data imbalances and improves both robustness and accuracy.Our ensemble learning framework achieves an AUC of 0.960 and an accuracy of 0.9252,outperforming other computational methods.These results demonstrate the effectiveness of our approach in accurately identifying essential proteins and highlight its superior feature extraction capabilities.展开更多
Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have b...Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein–protein interaction(PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.展开更多
Three steam distillation devices (D-1, D-2 and D-4) or one simultaneous distillation (D-3, water-diethyl ether) as well as the process of CO2-SFE (Supercritical fluid extraction) were adopted in extraction of essentia...Three steam distillation devices (D-1, D-2 and D-4) or one simultaneous distillation (D-3, water-diethyl ether) as well as the process of CO2-SFE (Supercritical fluid extraction) were adopted in extraction of essential oils from Chinese-fir (Cunninghamia lancedata (Lamb) Hook.) and the chemical components of the extracted essential oil were analyzed by Gas chromatograph-MS analyses. The results showed that the essential oil could be almost extracted out within 2 hours and the device-3 had the highest extraction efficiency. The major chemical component of the oil was cedrol. The yield of the extracted essential oils from Chinese fir decreased gradually with the increase of the distillation time. The best condition for extraction by means of CO2-SFE is 100 kg·cm?2 in pressure and 40°C in temperature for. Keywords Chinese fir - Essential oil - Cedrol - Supercritical fluid extraction CLC number S781.4 Document code A Foundation item: This paper was support by the Key Foundation Research Project (G1999016001) of China and the Japan International Cooperation AgencyBiography: HUANG Luo-hua (1957-), male, Research associate, Research Institute of Wood Industry, Chinese Academy of forestry, Beijing 100091, P. R. ChinaResponsible editor: Song Funan展开更多
优化了乳清分离蛋白-果胶纳米复合物(whey protein isolate-pectin,W-P)的制备工艺,并考察了W-P稳定的百里香精油纳米乳液(thyme essential oil nanoemulsion,TEON)的稳定性及抑菌效果。结果表明:当乳清分离蛋白(whey protein isolate,W...优化了乳清分离蛋白-果胶纳米复合物(whey protein isolate-pectin,W-P)的制备工艺,并考察了W-P稳定的百里香精油纳米乳液(thyme essential oil nanoemulsion,TEON)的稳定性及抑菌效果。结果表明:当乳清分离蛋白(whey protein isolate,WPI)与果胶(pectin,PEC)的质量比为1∶2、pH为5.0、加热温度为80℃、加热时间为15 min、W-P的质量浓度为25 g/L时,制得的W-P的平均粒径为(504.45±1.53)nm,乳化活性和乳化稳定性分别为(28.63±0.34)m^(2)/g和(1034.16±13.64)min,其稳定的TEON在pH 3.0~7.0、温度30~90℃、离子浓度0~200 mmol/L和储藏15 d(4℃)时稳定性良好。同时,TEON对大肠杆菌和金黄色葡萄球菌表现出明显的抑菌效果。W-P展现出优良的乳化性能,显示出其在食品体系中作为乳化剂的应用潜力。展开更多
基金financially supported by the National Key R&D Program of China(Grant No.2022YFF1202600)the National Natural Science Foundation of China(Grant No.82301158)+4 种基金Science and Technology Innovation Action Plan of Shanghai Science and Technology Committee(Grant No.22015820100)Two-hundred Talent Support(Grant No.20152224)Translational Medicine Innovation Project of Shanghai Jiao Tong University School of Medicine(Grant No.TM201915)Clinical Research Project of Multi-Disciplinary Team,Shanghai Ninth People’s Hospital,Shanghai Jiao Tong University School of Medicine(Grant No.201914)China Postdoctoral Science Foundation(Grant No.2023M742332)。
文摘Essential proteins are crucial for biological processes and can be identified through both experimental and computational methods.While experimental approaches are highly accurate,they often demand extensive time and resources.To address these challenges,we present a computational ensemble learning framework designed to identify essential proteins more efficiently.Our method begins by using node2vec to transform proteins in the protein–protein interaction(PPI)network into continuous,low-dimensional vectors.We also extract a range of features from protein sequences,including graph-theory-based,information-based,compositional,and physiochemical attributes.Additionally,we leverage deep learning techniques to analyze high-dimensional position-specific scoring matrices(PSSMs)and capture evolutionary information.We then combine these features for classification using various machine learning algorithms.To enhance performance,we integrate the outputs of these algorithms through ensemble methods such as voting,weighted averaging,and stacking.This approach effectively addresses data imbalances and improves both robustness and accuracy.Our ensemble learning framework achieves an AUC of 0.960 and an accuracy of 0.9252,outperforming other computational methods.These results demonstrate the effectiveness of our approach in accurately identifying essential proteins and highlight its superior feature extraction capabilities.
基金Project supported by the Gansu Province Industrial Support Plan (Grant No.2023CYZC-25)the Natural Science Foundation of Gansu Province (Grant No.23JRRA770)the National Natural Science Foundation of China (Grant No.62162040)。
文摘Essential proteins are inseparable in cell growth and survival. The study of essential proteins is important for understanding cellular functions and biological mechanisms. Therefore, various computable methods have been proposed to identify essential proteins. Unfortunately, most methods based on network topology only consider the interactions between a protein and its neighboring proteins, and not the interactions with its higher-order distance proteins. In this paper, we propose the DSEP algorithm in which we integrated network topology properties and subcellular localization information in protein–protein interaction(PPI) networks based on four-order distances, and then used random walks to identify the essential proteins. We also propose a method to calculate the finite-order distance of the network, which can greatly reduce the time complexity of our algorithm. We conducted a comprehensive comparison of the DSEP algorithm with 11 existing classical algorithms to identify essential proteins with multiple evaluation methods. The results show that DSEP is superior to these 11 methods.
基金The Key Foundation Research Project (G1999016001) of China and the Japan International Cooperation Agency
文摘Three steam distillation devices (D-1, D-2 and D-4) or one simultaneous distillation (D-3, water-diethyl ether) as well as the process of CO2-SFE (Supercritical fluid extraction) were adopted in extraction of essential oils from Chinese-fir (Cunninghamia lancedata (Lamb) Hook.) and the chemical components of the extracted essential oil were analyzed by Gas chromatograph-MS analyses. The results showed that the essential oil could be almost extracted out within 2 hours and the device-3 had the highest extraction efficiency. The major chemical component of the oil was cedrol. The yield of the extracted essential oils from Chinese fir decreased gradually with the increase of the distillation time. The best condition for extraction by means of CO2-SFE is 100 kg·cm?2 in pressure and 40°C in temperature for. Keywords Chinese fir - Essential oil - Cedrol - Supercritical fluid extraction CLC number S781.4 Document code A Foundation item: This paper was support by the Key Foundation Research Project (G1999016001) of China and the Japan International Cooperation AgencyBiography: HUANG Luo-hua (1957-), male, Research associate, Research Institute of Wood Industry, Chinese Academy of forestry, Beijing 100091, P. R. ChinaResponsible editor: Song Funan