Journal of Hebei University(Natural Science Edition) ›› 2021, Vol. 41 ›› Issue (2): 212-217.DOI: 10.3969/j.issn.1000-1565.2021.02.015

Previous Articles     Next Articles

Design of online aggregation optimization of big data based on MapReduce

LI Jun   

  1. Office of Teaching Affairs, Chengdu Technological University, Chengdu 611730, China
  • Received:2020-06-01 Online:2021-03-25 Published:2021-04-07

Abstract: Aiming at the problems of long execution time, poor execution performance and delayed scheduling performance of big data online aggregation, an optimization program design of big data online aggregation based on MapReduce is proposed. The cluster computing resources of all machines in the cluster can be fully utilized by using the fragment aggregation method, and the heuristic priority method of sub connection is used to optimize the local execution of connection task relation operation of each node, so as to realize the parallel connection of big data online aggregation. The dynamic switch mechanism of big data online aggregation based on hybrid approximate query framework and the dynamic switch mechanism based on progressive approximate estimation are used to reduce the misjudgment rate of hybrid approximate query switching and enhance the execution performance of big data online aggregation. The experimental results show that the online aggregation optimization program designed by this method has less execution time under different data scales, and has significant advantages in basic frequent query performance.

Key words: MapReduce, big data, aggregation optimization, switching mechanism

CLC Number: