Spatial Encoding and Multi-layer Joint Encoding Enhanced Transformer for Image Captioning
FANG Zhong-jun, ZHANG Jing, LI Dong-dong
Computer Science . 2022, (10): 151 -158 .  DOI: 10.11896/jsjkx.210900159