河北大学学报(自然科学版) ›› 2021, Vol. 41 ›› Issue (6): 734-744.DOI: 10.3969/j.issn.1000-1565.2021.06.014


多生成器生成对抗网络

申瑞彩,翟俊海,侯璎真   

  • 发布日期:2021-12-08
  • 通讯作者: 翟俊海(1964—),男,河北易县人,河北大学教授,博士生导师,主要从事云计算与大数据处理和深度学习方向研究.E-mail:mczjh@hbu.cn
  • 作者简介:申瑞彩(1993—),女,河北邯郸人,河北大学在读硕士研究生,主要从事深度学习研究.E-mail:1943303808@qq.com
  • 基金资助:
    河北省科技计划重点研发计划项目(19210310D);河北省自然科学基金资助项目(F2021201020)

Multi-generator generative adversarial networks

SHEN Ruicai, ZHAI Junhai, HOU Yingzhen   

  1. Hebei Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding 071002, China
  • Published:2021-12-08

摘要: 生成对抗网络(Generative adversarial networks,GAN)广泛应用于各种领域,尤其在图像生成方面.该模型由生成网络与判别网络2部分组成,在无监督的训练方式下,2个网络相互竞争相互提高.然而,GAN在训练时经常出现模式崩溃问题,进而导致模型收敛较慢,生成样本多样性较差.为解决这一问题,在深度卷积神经网络的基础上提出了一种多生成器生成对抗网络模型.该模型包含多个生成网络,每个生成网络均使用残差网络进行搭建,同时在生成网络间引入协作机制,以加快模型获取信息并减少参数量,最后将各生成网络的特征图进行融合得到最终图像输入到判别网络中.GAN在训练过程中还会出现梯度消失、训练不稳定问题.为避免出现这些问题,将Wasserstein距离和梯度惩罚引入模型的损失函数.通过在多个数据集上与多种相关方法进行实验比较,结果表明提出的模型在缓解模式崩溃问题、加快模型收敛速度以及减少参数量上均明显优于其他几种方法.
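摘要中提到的"多个生成网络生成特征图并融合后输入判别网络",可以用下面的最小示意代码说明.注意:其中的线性"生成网络"结构与取平均的融合方式均为示意性假设,并非论文中的实际网络结构.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_generator(latent_dim, feat_shape, rng):
    """返回一个线性"生成网络"示意: 把隐向量映射为一张特征图."""
    w = rng.standard_normal((latent_dim, int(np.prod(feat_shape))))
    def generator(z):
        return (z @ w).reshape(z.shape[0], *feat_shape)
    return generator

latent_dim, feat_shape, n_generators = 16, (8, 8), 3
generators = [make_generator(latent_dim, feat_shape, rng)
              for _ in range(n_generators)]

z = rng.standard_normal((4, latent_dim))   # 一批隐向量
feature_maps = [g(z) for g in generators]  # 各生成网络输出的特征图
fused = np.mean(feature_maps, axis=0)      # 融合(此处假设取平均)后送入判别网络
print(fused.shape)                         # (4, 8, 8)
```

实际模型中,融合方式(拼接、加权求和等)与各生成网络的残差结构应以论文正文为准.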

关键词: 生成对抗网络, 残差网络, 集成学习, 模式崩溃, Wasserstein距离

Abstract: Generative adversarial networks (GAN) are widely used in many fields, especially in image generation. The model consists of two parts, a generative network and a discriminative network; trained in an unsupervised manner, the two networks compete with and improve each other. However, GAN often suffers from mode collapse during training, which leads to slow convergence and poor diversity of the generated samples. To solve this problem, a multi-generator generative adversarial network model based on deep convolutional neural networks is proposed. The model contains multiple generative networks, each built with residual networks; a collaboration mechanism is introduced among the generative networks to speed up information acquisition and reduce the number of parameters, and the feature maps of the generative networks are finally fused into the final image, which is fed to the discriminative network. GAN training also suffers from vanishing gradients and instability. To avoid these problems, the Wasserstein distance and a gradient penalty are introduced into the loss function of the model. Experimental comparisons with several related methods on multiple datasets show that the proposed model clearly outperforms the other methods in alleviating mode collapse, speeding up model convergence, and reducing the number of parameters.

收稿日期: 2021-05-26
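The Wasserstein distance with gradient penalty mentioned in the abstract corresponds to the standard WGAN-GP critic objective, E[f(fake)] − E[f(real)] + λ·E[(‖∇f(x̂)‖₂ − 1)²] evaluated at interpolated samples x̂. The sketch below illustrates this formula with a deliberately simplified linear critic, whose input gradient is constant and thus computable in closed form; the critic, the penalty weight λ = 10, and the data are illustrative assumptions, not the paper's actual network.

```python
import numpy as np

rng = np.random.default_rng(0)

# A linear critic f(x) = x . w, whose gradient w.r.t. its input is simply w;
# this lets the gradient penalty be evaluated in closed form.
w = rng.standard_normal(64)
critic = lambda x: x @ w

real = rng.standard_normal((32, 64))
fake = rng.standard_normal((32, 64))

# Interpolate between real and fake samples, as WGAN-GP prescribes.
eps = rng.uniform(size=(32, 1))
x_hat = eps * real + (1.0 - eps) * fake

# Wasserstein critic loss: E[f(fake)] - E[f(real)].
w_loss = critic(fake).mean() - critic(real).mean()

# Gradient penalty: lambda * E[(||grad_x f(x_hat)||_2 - 1)^2].
# For the linear critic the gradient at every x_hat equals w.
grad_norm = np.linalg.norm(w)
gp = 10.0 * (grad_norm - 1.0) ** 2
critic_loss = w_loss + gp
print(critic_loss >= w_loss)  # True: the penalty term is non-negative
```

In a real implementation the critic is a deep network, so the input gradient at each x̂ must come from automatic differentiation rather than a closed form.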

Key words: generative adversarial networks (GAN), residual networks, ensemble learning, mode collapse, Wasserstein distance

中图分类号: TP181
文献标志码: A
文章编号: 1000-1565(2021)06-0734-11