Objective: To analyze longitudinal binary data by using generalized linear models. The correlation between repeated measures were considered. The general method for analyzing longitudinal binary data was given. Method...Objective: To analyze longitudinal binary data by using generalized linear models. The correlation between repeated measures were considered. The general method for analyzing longitudinal binary data was given. Methods: Generalized estimating equations (GEE) proposed by Zeger and Liang was used. For sevens covariance structures, one method was given for estimating regression and correlation parameters. Results: Regression and coerelation parameters were estimated simultaneously. A Set of program was finished and an example was illustrated. Conclusion: Longitudinal dsta often occur in medical researches and clinical trials. For solving the problem of correlation between repeated measures, it is necessary to use some special methods to cope with this Kind of data.展开更多
In this paper, we define a new class of biased linear estimators of the vector of unknown parameters in the deficient_rank linear model based on the spectral decomposition expression of the best linear minimun bias es...In this paper, we define a new class of biased linear estimators of the vector of unknown parameters in the deficient_rank linear model based on the spectral decomposition expression of the best linear minimun bias estimator. Some important properties are discussed. By appropriate choices of bias parameters, we construct many interested and useful biased linear estimators, which are the extension of ordinary biased linear estimators in the full_rank linear model to the deficient_rank linear model. At last, we give a numerical example in geodetic adjustment.展开更多
General linear model (GLM) is the most popular method for functional magnetic resource imaging (fMRI) data analysis . However, its theory is imperfect. The key of this model is how to constitute the design-matrix to m...General linear model (GLM) is the most popular method for functional magnetic resource imaging (fMRI) data analysis . However, its theory is imperfect. The key of this model is how to constitute the design-matrix to model the interesting effects better and separate noises better. For the purpose of detecting brain function activation , according to the principle of GLM,a new convolution model is presented by a new dynamic function convolving with design-matrix,which combining with t-test can be used to detect brain active signal. The fMRI imaging result of visual stimulus experiment indicates that brain activities mainly concentrate among v1and v2 areas of visual cortex, and also verified the validity of this technique.展开更多
Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape m...Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape metrics. We examined the relative impacts of landscape predictors on the accuracy of habitat models by constructing distribution models at regional scales incorporating environmental variables (climate, topography, vegetation, and soil types) and secondary species occurrence data, and using them to predict the occurrence of 36 species in 15 forest fragments where we conducted rapid surveys. We then selected six landscape predictors at the landscape scale and ran general linear models of species presence/absence with either a single scale predictor (the probabilities of occurrence of the distribution models or landscape variables) or multiple scale predictors (distribution models + one landscape variable). Our results indicated that distribution models alone had poor predictive abilities but were improved when landscape predictors were added; the species responses were not, however, similar to the multiple scale predictors. Our study thus highlights the importance of considering landscape metrics to generate more accurate habitat suitability models.展开更多
In this article, robust generalized estimating equation for the analysis of partial linear mixed model for longitudinal data is used. The authors approximate the nonparametric function by a regression spline. Under so...In this article, robust generalized estimating equation for the analysis of partial linear mixed model for longitudinal data is used. The authors approximate the nonparametric function by a regression spline. Under some regular conditions, the asymptotic properties of the estimators are obtained. To avoid the computation of high-dimensional integral, a robust Monte Carlo Newton-Raphson algorithm is used. Some simulations are carried out to study the performance of the proposed robust estimators. In addition, the authors also study the robustness and the efficiency of the proposed estimators by simulation. Finally, two real longitudinal data sets are analyzed.展开更多
Spectrum management and resource allocation(RA)problems are challenging and critical in a vast number of research areas such as wireless communications and computer networks.The traditional approaches for solving such...Spectrum management and resource allocation(RA)problems are challenging and critical in a vast number of research areas such as wireless communications and computer networks.The traditional approaches for solving such problems usually consume time and memory,especially for large-size problems.Recently different machine learning approaches have been considered as potential promising techniques for combinatorial optimization problems,especially the generative model of the deep neural networks.In this work,we propose a resource allocation deep autoencoder network,as one of the promising generative models,for enabling spectrum sharing in underlay device-to-device(D2D)communication by solving linear sum assignment problems(LSAPs).Specifically,we investigate the performance of three different architectures for the conditional variational autoencoders(CVAE).The three proposed architecture are the convolutional neural network(CVAECNN)autoencoder,the feed-forward neural network(CVAE-FNN)autoencoder,and the hybrid(H-CVAE)autoencoder.The simulation results show that the proposed approach could be used as a replacement of the conventional RA techniques,such as the Hungarian algorithm,due to its ability to find solutions of LASPs of different sizes with high accuracy and very fast execution time.Moreover,the simulation results reveal that the accuracy of the proposed hybrid autoencoder architecture outperforms the other proposed architectures and the state-of-the-art DNN techniques.展开更多
in this paper, we describe a new method for assessing the degree in which the individual case influence the maximum likelihood estimation of link parameter in generalized linear model. Several influential statistics ...in this paper, we describe a new method for assessing the degree in which the individual case influence the maximum likelihood estimation of link parameter in generalized linear model. Several influential statistics are illustrated with two examples.展开更多
In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we...In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we extend the model-calibration method to obtain estimators of the finite population mean by using complete auxiliary information from stratified sampling survey data. We show that the resulting estimators effectively use auxiliary information at the estimation stage and possess a number of attractive features such as asymptotically design-unbiased irrespective of the working model and approximately model-unbiased under the model. When a linear working-model is used, the resulting estimators reduce to the usual calibration estimator(or GREG).展开更多
Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have dev...Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.展开更多
为倡导绿色出行理念,解决以往研究在处理重复观测数据时容易忽视的潜在相关性和个体异质性问题,针对如何利用智能手机APP提供的多模式出行信息引导小汽车出行者转向停车换乘(Park-and-Ride,P+R)模式进行了探究,同时引入广义线性混合模型...为倡导绿色出行理念,解决以往研究在处理重复观测数据时容易忽视的潜在相关性和个体异质性问题,针对如何利用智能手机APP提供的多模式出行信息引导小汽车出行者转向停车换乘(Park-and-Ride,P+R)模式进行了探究,同时引入广义线性混合模型(Generalized Linear Mixed Model,GLMM)分析了多模式出行信息对小汽车出行者转向P+R意向的影响。首先,基于上海市路网设计意向调查问卷,整合了自驾和P+R两种出行方式的道路拥堵程度、出行时间、停车费用及地铁车厢座位情况等信息,并运用全因子设计法构建了24种不同信息水平组合的假设情景。然后,通过智能手机APP界面示意图向小汽车出行者展示这些多模式出行信息,并收集其转向P+R的意向数据。最后,运用GLMM方法处理同一个体重复决策数据中潜在的相关性和捕捉个体间的异质性。结果显示,GLMM的应用不仅解决了同一个体重复决策间的相关性,还揭示了不同个体对道路拥堵程度和地铁车厢座位情况的差异化关注;智能手机APP整合的多模式出行信息显著提升了小汽车出行者转向P+R的意愿,且这一转变占比达29.2%;高收入、长驾龄以及对P+R政策不了解的出行者转向P+R的意愿较低。研究表明,通过智能手机APP整合自驾和P+R的多模式出行信息能显著增强P+R方式的吸引力,可为提升P+R的普及率提供新思路,有效促进小汽车出行者向绿色出行方式的转变。展开更多
文摘Objective: To analyze longitudinal binary data by using generalized linear models. The correlation between repeated measures were considered. The general method for analyzing longitudinal binary data was given. Methods: Generalized estimating equations (GEE) proposed by Zeger and Liang was used. For sevens covariance structures, one method was given for estimating regression and correlation parameters. Results: Regression and coerelation parameters were estimated simultaneously. A Set of program was finished and an example was illustrated. Conclusion: Longitudinal dsta often occur in medical researches and clinical trials. For solving the problem of correlation between repeated measures, it is necessary to use some special methods to cope with this Kind of data.
文摘In this paper, we define a new class of biased linear estimators of the vector of unknown parameters in the deficient_rank linear model based on the spectral decomposition expression of the best linear minimun bias estimator. Some important properties are discussed. By appropriate choices of bias parameters, we construct many interested and useful biased linear estimators, which are the extension of ordinary biased linear estimators in the full_rank linear model to the deficient_rank linear model. At last, we give a numerical example in geodetic adjustment.
基金Supported by National Natural Science Foundation of China (No.90208003, 30200059), the 973 Project (No. 2003CB716106), Doctor training Fund of MOE, P.R.C., and Fok Ying Tong Education Foundation (No.91041)
文摘General linear model (GLM) is the most popular method for functional magnetic resource imaging (fMRI) data analysis . However, its theory is imperfect. The key of this model is how to constitute the design-matrix to model the interesting effects better and separate noises better. For the purpose of detecting brain function activation , according to the principle of GLM,a new convolution model is presented by a new dynamic function convolving with design-matrix,which combining with t-test can be used to detect brain active signal. The fMRI imaging result of visual stimulus experiment indicates that brain activities mainly concentrate among v1and v2 areas of visual cortex, and also verified the validity of this technique.
基金supported by the Biota Minas Program(Proc.No.APQ 03549-09)FAPEMIG(Proc.No.PCE-00106-12)
文摘Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape metrics. We examined the relative impacts of landscape predictors on the accuracy of habitat models by constructing distribution models at regional scales incorporating environmental variables (climate, topography, vegetation, and soil types) and secondary species occurrence data, and using them to predict the occurrence of 36 species in 15 forest fragments where we conducted rapid surveys. We then selected six landscape predictors at the landscape scale and ran general linear models of species presence/absence with either a single scale predictor (the probabilities of occurrence of the distribution models or landscape variables) or multiple scale predictors (distribution models + one landscape variable). Our results indicated that distribution models alone had poor predictive abilities but were improved when landscape predictors were added; the species responses were not, however, similar to the multiple scale predictors. Our study thus highlights the importance of considering landscape metrics to generate more accurate habitat suitability models.
基金the Natural Science Foundation of China(10371042,10671038)
文摘In this article, robust generalized estimating equation for the analysis of partial linear mixed model for longitudinal data is used. The authors approximate the nonparametric function by a regression spline. Under some regular conditions, the asymptotic properties of the estimators are obtained. To avoid the computation of high-dimensional integral, a robust Monte Carlo Newton-Raphson algorithm is used. Some simulations are carried out to study the performance of the proposed robust estimators. In addition, the authors also study the robustness and the efficiency of the proposed estimators by simulation. Finally, two real longitudinal data sets are analyzed.
基金supported in part by the China NSFC Grant 61872248Guangdong NSF 2017A030312008+1 种基金Fok Ying-Tong Education Foundation for Young Teachers in the Higher Education Institutions of China (Grant No.161064)GDUPS (2015)
文摘Spectrum management and resource allocation(RA)problems are challenging and critical in a vast number of research areas such as wireless communications and computer networks.The traditional approaches for solving such problems usually consume time and memory,especially for large-size problems.Recently different machine learning approaches have been considered as potential promising techniques for combinatorial optimization problems,especially the generative model of the deep neural networks.In this work,we propose a resource allocation deep autoencoder network,as one of the promising generative models,for enabling spectrum sharing in underlay device-to-device(D2D)communication by solving linear sum assignment problems(LSAPs).Specifically,we investigate the performance of three different architectures for the conditional variational autoencoders(CVAE).The three proposed architecture are the convolutional neural network(CVAECNN)autoencoder,the feed-forward neural network(CVAE-FNN)autoencoder,and the hybrid(H-CVAE)autoencoder.The simulation results show that the proposed approach could be used as a replacement of the conventional RA techniques,such as the Hungarian algorithm,due to its ability to find solutions of LASPs of different sizes with high accuracy and very fast execution time.Moreover,the simulation results reveal that the accuracy of the proposed hybrid autoencoder architecture outperforms the other proposed architectures and the state-of-the-art DNN techniques.
文摘in this paper, we describe a new method for assessing the degree in which the individual case influence the maximum likelihood estimation of link parameter in generalized linear model. Several influential statistics are illustrated with two examples.
基金Supported by the National Natural Science Foundation of China(10571093)
文摘In stratified survey sampling, sometimes we have complete auxiliary information. One of the fundamental questions is how to effectively use the complete auxiliary information at the estimation stage. In this paper, we extend the model-calibration method to obtain estimators of the finite population mean by using complete auxiliary information from stratified sampling survey data. We show that the resulting estimators effectively use auxiliary information at the estimation stage and possess a number of attractive features such as asymptotically design-unbiased irrespective of the working model and approximately model-unbiased under the model. When a linear working-model is used, the resulting estimators reduce to the usual calibration estimator(or GREG).
基金This research was funded by the National Natural Science Foundation of China(grant no.32271881).
文摘Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.
文摘为倡导绿色出行理念,解决以往研究在处理重复观测数据时容易忽视的潜在相关性和个体异质性问题,针对如何利用智能手机APP提供的多模式出行信息引导小汽车出行者转向停车换乘(Park-and-Ride,P+R)模式进行了探究,同时引入广义线性混合模型(Generalized Linear Mixed Model,GLMM)分析了多模式出行信息对小汽车出行者转向P+R意向的影响。首先,基于上海市路网设计意向调查问卷,整合了自驾和P+R两种出行方式的道路拥堵程度、出行时间、停车费用及地铁车厢座位情况等信息,并运用全因子设计法构建了24种不同信息水平组合的假设情景。然后,通过智能手机APP界面示意图向小汽车出行者展示这些多模式出行信息,并收集其转向P+R的意向数据。最后,运用GLMM方法处理同一个体重复决策数据中潜在的相关性和捕捉个体间的异质性。结果显示,GLMM的应用不仅解决了同一个体重复决策间的相关性,还揭示了不同个体对道路拥堵程度和地铁车厢座位情况的差异化关注;智能手机APP整合的多模式出行信息显著提升了小汽车出行者转向P+R的意愿,且这一转变占比达29.2%;高收入、长驾龄以及对P+R政策不了解的出行者转向P+R的意愿较低。研究表明,通过智能手机APP整合自驾和P+R的多模式出行信息能显著增强P+R方式的吸引力,可为提升P+R的普及率提供新思路,有效促进小汽车出行者向绿色出行方式的转变。