Computer Science ›› 2019, Vol. 46 ›› Issue (11A): 528-534.

• Interdiscipline & Application • Previous Articles     Next Articles

Parallel Design and Optimization of GRAPES_CUACE On-line Coupled Air Quality Mode

YE Yue-jin1, CHEN De-xun1, HU Jiang-kai2, MA Xin2, ZHANG Xiao-ye3   

  1. (Jiangnan Institute of Computing Technology,Wuxi,Jiangsu 214083,China)1;
    (Numerical Weather Prediction Center of CMA,Beijing 100081,China)2;
    (Chinese Academy of Meteorological Sciences,Beijing 100081,China) 3
  • Online:2019-11-10 Published:2019-11-20

Abstract: This article mainly introduced the research and analysis of the parallel optimization algorithm of the meteorological particulate_meso dust aerosol coupling model under different versions of the x86 architecture.Drawing on the current mainstream parallel optimization design methods at home and abroad,combined with the GRAPES_MESO system’s own program architecture and parallel framework,corresponding parallelization transformation was implemented for different versions of x86 architecture.Using the gprof tool and poke pile timing,the test hotspot module has three main parts:IO,communication and physical process.The main optimization methods for the IO module are:1)continuous reading and writing by discrete reading and writing;2)opening buffer from sparse memory access to continuous memory access;3)asynchronous IO.The following methods are adopted for the communication part:1)the fine-grained communication is changed from fine-grained to coarse-grained;2)the aggregate communication with lower time complexity is adopted.Analysis of optimization results for IO and communication modules show that the time-consuming ratio of IO module optimization decreased from 43.7% to 1.41%.The proportion is greatly reduced,and the optimal performance is improved by 317 times.Therefore,the method described in this paper greatly improves the operating efficiency of the IO module.In addition,the main optimization methods used to optimize the physical process are as follows:1)the multi-layer cyclic calculation process is changed from discrete to continuous;2)the communication mechanism is cyclically shifted;3)the data is reused to reduce computational redundancy;4)the stack variable space is reduced.The computational performance is increased by 22%,which further improves the parallel efficiency of the program and the strong scalability of the model.

Key words: Asynchronous IO, Coarse-grained, Continuous memory access, Aggregate communicatio

CLC Number: 

  • TP302.7
[1]陈德辉,薛纪善,杨学胜,等.GRAPES 新一代全球/区域尺度统一数值预报模式总体设计研究[J].科学通报,2008,53:2396-2407.
[3]GONG S L,ZHANG X Y.CUACE/Dust — An integrated system of observation and modeling systems for operational dust forecasting in Asia[J].Atomospheric Chemisty and Physics,2008,8:2333-2340.
[4]ZHANG X Y.Characterization of soil dust aerosol in China and its transport/distribution during 2001 ACE-Asia.1.Network Observations[J].Journal of Geophysical Research,2003,108(9):4261.
[5]ZHOU C H,GONG S L,ZHANG X Y,et al.Development and evaluation of an operational SDS forecasting system for East Asia:CUACE/Dust[J].Atomospheric Chemisty and Physics,2008,8(4):787-798.
[6]李曼,张载勇,李淑娟,等.CUACE 系统在乌鲁木齐空气质量预报中的效果检验[J].沙漠与绿洲气象,2014,8(5):63-68.
[7]WANG H,GONG S L,ZHANG H L,et al.A new-generation sand and dust storm forecasting system GRAPES_CUACE/Dust:Model development,verification and numerical simulation[J].Sci Bull.
[9]AN X Q,ZHAI S X,JIN M,et al.Tracking influential hazesource areas in North China using an adjoint model,GRAPES-CUACE [J].Geosci.Model Dev.Discuss.,2015,8:7313-7345.
[10]WANG H,SHI G Y,LI W,et al.The impacts of optical properties onradiative forcing due to dust aerosol[J].Adv Atmos Sci ,2006,23:431-441.
[11]WANG H,ZHANG X Y,GONG S L,et al.Radiative feedback ofdust aerosolson the East Asian dust storms[J].J Geophys Res,2010,115:D23214.
[12]WANG H,ZHAO T L,ZHANG X Y,et al.Dust direct radiative affects onthe earth-atmosphere system over East Asia:Early spring cooling and late spring warming [J].Chinese Science Bull,2011,56:1020-1030.
[13]WANG H,SHI G Y,ZHU J,et al.Case study of longwave contribution to dust radiative effects over East Asia [J].Chinese Science Bull,2013,58:3673-3681.
[1] DONG Hui, HAO Xiao-hu and ZHANG Gui-jun. Local Enhancement Differential Evolution Searching Method for Protein Conformational Space [J]. Computer Science, 2015, 42(Z11): 22-26.
[2] HAO Xiao-hu, ZHANG Gui-jun, ZHOU Xiao-gen, CHENG Zheng-hua and ZHANG Qi-peng. Protein Conformational Space Optimization Algorithm Based on Fragment-assembly [J]. Computer Science, 2015, 42(3): 237-240.
Full text



[1] DU Wei, DING Shi-fei. Overview on Multi-agent Reinforcement Learning[J]. Computer Science, 2019, 46(8): 1 -8 .
[2] GAO Li-zheng, ZHOU Gang, LUO Jun-yong, LAN Ming-jing. Survey on Meta-event Extraction[J]. Computer Science, 2019, 46(8): 9 -15 .
[3] CAI Li, LI Ying-zi, JIANG Fang, LIANG Yu. Study on Clustering Mining of Imbalanced Data Fusion Towards Urban Hotspots[J]. Computer Science, 2019, 46(8): 16 -22 .
[4] YANG Zhen, WANG Hong-jun. Important Location Identification of Mobile Users Based on Trajectory Division and Density Clustering Method[J]. Computer Science, 2019, 46(8): 23 -27 .
[5] DENG Cun-bin, YU Hui-qun, FAN Gui-sheng. Integrating Dynamic Collaborative Filtering and Deep Learning for Recommendation[J]. Computer Science, 2019, 46(8): 28 -34 .
[6] ZHONG Feng-yan, WANG Yan, LI Nian-shuang. Node Selection Scheme for Data Repair in Heterogeneous Distributed Storage Systems[J]. Computer Science, 2019, 46(8): 35 -41 .
[7] SUN Guo-dao, ZHOU Zhi-xiu, LI Si, LIU Yi-peng, LIANG Rong-hua. Spatio-Temporal Evolution of Geographical Topics[J]. Computer Science, 2019, 46(8): 42 -49 .
[8] ZHANG Hui-bing, ZHONG Hao, HU Xiao-li. User Reviews Clustering Method Based on Topic Analysis[J]. Computer Science, 2019, 46(8): 50 -55 .
[9] LI Bo-jia, ZHANG Yang-sen, CHEN Ruo-yu. Method for Generating Massive Data with Assignable Distribution[J]. Computer Science, 2019, 46(8): 56 -63 .
[10] LU Xian-guang, DU Xue-hui, WANG Wen-juan. Alert Correlation Algorithm Based on Improved FP Growth[J]. Computer Science, 2019, 46(8): 64 -70 .