河北大学学报(自然科学版) ›› 2023, Vol. 43 ›› Issue (4): 432-441.DOI: 10.3969/j.issn.1000-1565.2023.04.012

• • 上一篇    下一篇

基于目标检测和迁移时间序列的教室人员检测

丛帅1,杨磊2,华征豪3,杨晓晖2   

  • 收稿日期:2022-09-13 出版日期:2023-07-25 发布日期:2023-08-03
  • 通讯作者: 杨磊(1986—)
  • 作者简介:丛帅(1980—),男,河北保定人,河北大学工商学院讲师,主要从事大数据技术、图像处理方面研究.
    E-mail:congshuai@hbu.cn
  • 基金资助:
    国家自然科学基金面上项目(62172139);中国高校产学研创新基金资助项目(2021BCF03002)

Classroom personnel detection based on object detection and time series migration

CONG Shuai1,YANG Lei2,HUA Zhenghao3,YANG Xiaohui2   

  1. 1. Industrial and Commercial College, Hebei University, Baoding 071000, China; 2. Office of Educational Administration, Hebei University, Baoding 071002, China; 3. School of Cyber Security and Computer, Hebei University, Baoding 071000, China
  • Received:2022-09-13 Online:2023-07-25 Published:2023-08-03

摘要: 为了解决教育智能化重构中对于教室人员的高效监督管理和数据分析,结合单阶段目标检测算法的优良特性和卷积神经网络(convolutional neural networks,CNN)良好的特征提取能力,提出了一种基于注意力机制网络和迁移时间序列改进YOLO目标检测算法的教室人员目标检测算法.首先,对源视频流进行逐帧抽取和非畸变的图像放缩,通过生成对抗网络(generative adversarial network,GAN)进行图像超分辨处理;其次,对每帧图像进行多尺度采样和初步目标检测;然后,根据不同尺度得到的候选结果进行非极大抑制(non maximum suppression,NMS)以去除置信度较低的个体;之后,对候选结果进行融合,再使用交并比(intersection over union,IoU)进行重叠度计算以更新数据、去除重合或过于紧密的定位位置,然后将当前帧的检测结果与先前时间区间中的检测结果作为时间序列进行统计学数据迁移融合(time series migration,TSM)获得最后的检测结果.实验结果表明,本文方法不仅有效地提升了教室人员目标检测的准确率,并且可以进行实时检测.

关键词: 目标检测, 多尺度图像特征, 超分辨率, 注意力机制网络, 迁移时间序列

Abstract: In order to solve the problem of efficient supervision and management and data analysis of classroom persons in the intelligent reconstruction of education, this paper combines the good characteristics of single-stage object detection algorithm and the good special extraction ability of convolutional neural networks(CNN), and proposes an improved YOLO object detection algorithm based on attention network and time series migration. Firstly, this paper performs frame-by-frame extraction and non-distortion image deflation on the source video stream, and performs image super-resolution processing- DOI:10.3969/j.issn.1000-1565.2023.04.012基于目标检测和迁移时间序列的教室人员检测丛帅1,杨磊2,华征豪3,杨晓晖2(1.河北大学 工商学院,河北 保定 071000;2.河北大学 教务处,河北 保定 071002;3.河北大学 网络空间安全与计算机学院,河北 保定 071000)摘 要:为了解决教育智能化重构中对于教室人员的高效监督管理和数据分析,结合单阶段目标检测算法的优良特性和卷积神经网络(convolutional neural networks,CNN)良好的特征提取能力,提出了一种基于注意力机制网络和迁移时间序列改进YOLO目标检测算法的教室人员目标检测算法.首先,对源视频流进行逐帧抽取和非畸变的图像放缩,通过生成对抗网络(generative adversarial network,GAN)进行图像超分辨处理;其次,对每帧图像进行多尺度采样和初步目标检测;然后,根据不同尺度得到的候选结果进行非极大抑制(non maximum suppression,NMS)以去除置信度较低的个体;之后,对候选结果进行融合,再使用交并比(intersection over union,IoU)进行重叠度计算以更新数据、去除重合或过于紧密的定位位置,然后将当前帧的检测结果与先前时间区间中的检测结果作为时间序列进行统计学数据迁移融合(time series migration,TSM)获得最后的检测结果.实验结果表明,本文方法不仅有效地提升了教室人员目标检测的准确率,并且可以进行实时检测.关键词:目标检测;多尺度图像特征;超分辨率;注意力机制网络;迁移时间序列中图分类号:TP391.41 文献标志码:A 文章编号:1000-1565(2023)04-0432-10Classroom personnel detection based onobject detection and time series migrationCONG Shuai1,YANG Lei2,HUA Zhenghao3,YANG Xiaohui2(1. Industrial and Commercial College, Hebei University, Baoding 071000, China;2. Office of Educational Administration, Hebei University, Baoding 071002, China;3. School of Cyber Security and Computer, Hebei University, Baoding 071000, China )Abstract: In order to solve the problem of efficient supervision and management and data analysis of classroom persons in the intelligent reconstruction of education, this paper combines the good characteristics of single-stage object detection algorithm and the good special extraction ability of convolutional neural networks(CNN), and proposes an improved YOLO object detection algorithm based on attention network and time series migration. Firstly, this paper performs frame-by-frame extraction and non-distortion image deflation on the source video stream, and performs image super-resolution processing- 收稿日期:2022-09-13 基金项目:国家自然科学基金面上项目(62172139);中国高校产学研创新基金资助项目(2021BCF03002) 第一作者:丛帅(1980—),男,河北保定人,河北大学工商学院讲师,主要从事大数据技术、图像处理方面研究.E-mail:congshuai@hbu.cn 通信作者:杨磊(1986—),男,河北保定人,河北大学助理研究员,主要从事教育技术、智慧教学方面研究.E-mail:403064340@qq.com第4期丛帅等:基于目标检测和迁移时间序列的教室人员检测by Generative Adversarial Network(GAN). Secondly, multi-scale sampling and preliminary object detection are performed for each frame, and then Non Maximum Suppression(NMS)is performed to remove the individuals with low confidence based on the candidate results obtained at different scales. Then, the candidate results are fused, and the overlap is calculated using intersection over union(IoU)to update the data and remove overlapping or too close localization locations. Finally, the detection results of the current frame and those of the previous time interval are used as a time series for statistal data migration to obtain the final detection results. The experimental results show that this method not only effectively improves the accuracy of classroom personnel target detection, but also enables real-time detection.

Key words: object detection, multi-scale image features, super-resolution, attention-network, time series migration

中图分类号: