A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared...A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new user cold-start problem when each of new active users selects only one or two items into the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.展开更多
The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between c...The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between clustering aggregation and the problem of correlation clustering.The best deterministic approximation algorithm was provided for the variation of the correlation of clustering problem,and showed how sampling can be used to scale the algorithms for large datasets.An extensive empirical evaluation was given for the usefulness of the problem and the solutions.The results show that this method achieves more than 50% reduction in the running time without sacrificing the quality of the clustering.展开更多
To improve the productivity of cluster tools in semiconductor fabrications,on the basis of stating scheduling problems,a try and error-based scheduling algorithm was proposed with residency time constraints and an obj...To improve the productivity of cluster tools in semiconductor fabrications,on the basis of stating scheduling problems,a try and error-based scheduling algorithm was proposed with residency time constraints and an objective of minimizing Makespan for the wafer jobs in cluster tools.Firstly,mathematical formulations of scheduling problems were presented by using assumptions and definitions of a scheduling domain.Resource conflicts were analyzed in the built scheduling model,and policies to solve resource conflicts were built.A scheduling algorithm was developed.Finally,the performances of the proposed algorithm were evaluated and compared with those of other methods by simulations.Experiment results indicate that the proposed algorithm is effective and practical in solving the scheduling problem of the cluster tools.展开更多
Similarity measure design on non-overlapped data was carried out and compared with the case of overlapped data.Unconsistant feature of similarity on overlapped data to non-overlapped data was provided by example.By th...Similarity measure design on non-overlapped data was carried out and compared with the case of overlapped data.Unconsistant feature of similarity on overlapped data to non-overlapped data was provided by example.By the artificial data illustration,it was proved that the conventional similarity measure was not proper to calculate the similarity measure of the non-overlapped case.To overcome the unbalance problem,similarity measure on non-overlapped data was obtained by considering neighbor information.Hence,different approaches to design similarity measure were proposed and proved by consideration of neighbor information.With the example of artificial data,similarity measure calculation was carried out.Similarity measure extension to intuitionistic fuzzy sets(IFSs)containing uncertainty named hesitance was also followed.展开更多
基金supporting by grant fund under the Strategic Scholarships for Frontier Research Network for the PhD Program Thai Doctoral degree
文摘A new recommendation method was presented based on memetic algorithm-based clustering. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance is evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new user cold-start problem when each of new active users selects only one or two items into the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.
基金Projects(60873265,60903222) supported by the National Natural Science Foundation of China Project(IRT0661) supported by the Program for Changjiang Scholars and Innovative Research Team in University of China
文摘The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between clustering aggregation and the problem of correlation clustering.The best deterministic approximation algorithm was provided for the variation of the correlation of clustering problem,and showed how sampling can be used to scale the algorithms for large datasets.An extensive empirical evaluation was given for the usefulness of the problem and the solutions.The results show that this method achieves more than 50% reduction in the running time without sacrificing the quality of the clustering.
基金Projects(71071115,60574054) supported by the National Natural Science Foundation of China
文摘To improve the productivity of cluster tools in semiconductor fabrications,on the basis of stating scheduling problems,a try and error-based scheduling algorithm was proposed with residency time constraints and an objective of minimizing Makespan for the wafer jobs in cluster tools.Firstly,mathematical formulations of scheduling problems were presented by using assumptions and definitions of a scheduling domain.Resource conflicts were analyzed in the built scheduling model,and policies to solve resource conflicts were built.A scheduling algorithm was developed.Finally,the performances of the proposed algorithm were evaluated and compared with those of other methods by simulations.Experiment results indicate that the proposed algorithm is effective and practical in solving the scheduling problem of the cluster tools.
文摘Similarity measure design on non-overlapped data was carried out and compared with the case of overlapped data.Unconsistant feature of similarity on overlapped data to non-overlapped data was provided by example.By the artificial data illustration,it was proved that the conventional similarity measure was not proper to calculate the similarity measure of the non-overlapped case.To overcome the unbalance problem,similarity measure on non-overlapped data was obtained by considering neighbor information.Hence,different approaches to design similarity measure were proposed and proved by consideration of neighbor information.With the example of artificial data,similarity measure calculation was carried out.Similarity measure extension to intuitionistic fuzzy sets(IFSs)containing uncertainty named hesitance was also followed.