Survey of Cross-media Question Answering and Reasoning Based on Vision and Language
WU A-ming, JIANG Pin, HAN Ya-hong
Computer Science . 2021, (3): 71 -78 .  DOI: 10.11896/jsjkx.201100176