Journal of Hebei University (Natural Science Edition) ›› 2017, Vol. 37 ›› Issue (3): 309-315.DOI: 10.3969/j.issn.1000-1565.2017.03.014

Previous Articles     Next Articles

Big data modeling of ball mill based on distributed support vector machine on Hadoop platform

GAO Xuewei1,2,FU Zhongguang2,SUN Li1,ZHANG Gang1   

  1. 1.Simulation Center, Shenyang Institute of Engineering, Shenyang 110136, China; 2.Key Laboratory of Condition Monitoring and Control for Power Plant Equipmentof Ministry of Education, North China Electric Power University, Beijing 102206, China
  • Received:2016-04-05 Online:2017-05-25 Published:2017-05-25

Abstract: In the era of big data environment,a large amount of data in thermal power plant is stored in the database and cannot be fully utilized.Because of the complicated process of the double inlet and double outlet mill system,the mathematical model is difficult to build.A method of modeling based on data mining is presented.The actual operation big data which impact the coal mill material is extracted.First, the K-Means clustering is used to delete outliers,and then the principal component analysis(PCA)is used to complete attribute reduction,at last the distributed support vector machine(D_SVM)is used to build a model on the Hadoop platform in MapReduce framework for the parallel computation.The results show that modeling time is greatly reduced due to the use of the method,and the accuracy and applicability of the model are very high. Therefore the model can be used to represent the actual material properties.

Key words: double inlet and double outlet mill, Hadoop platform, D_SVM, K-Means clustering, principal component analysis

CLC Number: