计算机科学 ›› 2018, Vol. 45 ›› Issue (8): 13-16.doi: 10.11896/j.issn.1002-137X.2018.08.003

• 2017 中国多媒体大会 • 上一篇    下一篇

基于视觉的地理定位中PnP算法的精度评估方法

桂逸男, 老松杨, 康来, 白亮   

  1. 国防科技大学系统工程学院 长沙410073
  • 收稿日期:2017-10-24 出版日期:2018-08-29 发布日期:2018-08-29
  • 作者简介:桂逸男(1994-),男,硕士,主要研究领域为计算视觉,E-mail:guiyinan94@gmail.com; 老松杨 男,博士,教授,主要研究领域为多媒体信息系统工程与虚拟现实技术; 康 来 男,博士,讲师,主要研究领域为计算机视觉、模式识别与人工智能; 白 亮 男,博士,副教授,主要研究领域为图像视频处理、多媒体大数据、社交媒体分析,E-mail:xabpz@163.com(通信作者)。
  • 基金资助:
    本文受国家自然科学基金项目(61402487)资助。

Accuracy Assessment Method of PnP Algorithm in Visual Geo-localization

GUI Yi-nan, LAO Song-yang, KANG Lai, BAI Liang   

  1. Institute of Systems Engineering,National University of Defense Technology,Changsha 410073,China
  • Received:2017-10-24 Online:2018-08-29 Published:2018-08-29

摘要: 近年来,基于地理位置服务的需求的飞速增长催生了定位技术的发展。基于视觉的方法利用多幅图像的拍摄参数关系能够恢复较精确的相机位姿,但目前并没有统一的评估方法对其性能进行定量评价。现今主流的相机位姿精度评估方法是与GPS进行比较,由于照片自带的GPS标签存在噪声,且不同坐标系之间的转换存在误差,将照片标签中的GPS作为真实值评估恢复的相机位姿精度不够客观。通过计算得到的位姿来建立参考平面,将PnP算法得到的相机位姿通过相同方法投影至参考平面进行评估,该精度评估方法客观可行。

关键词: PnP算法, 参考平面, 基于视觉的地理定位, 精度评估, 相机位姿

Abstract: In recent years,the rapid growth of demand based on location-based services has led to the development of positioning technology.The vision-based approach utilizes multiple images to restore more accurate camera pose para-meters,but there is no uniform assessment of the performance of its quantitative evaluation.Now the mainstream camerapose assessment method is compared with the GPS data.However,since the photo comes with the GPS tag noise and the conversion between different coordinate systems introduces errors,using GPS tag as ground truth to evaluate the accuracy of the estimated camera pose is not an objective way.In this paper,an objective accuracy evaluation method was proposed.The reference plane was established by the calculated pose.The camera pose obtained by the PnP algorithm was projected onto the reference plane by the same method.

Key words: Accuracy assessment, Camera pose, PnP algorithm, Reference plane, Visual geo-localization

中图分类号: 

  • TP391
[1]FISHLER M A,BOLLES R C.Random sample consensus:a pa-radigm for model fitting with applications to image analysis and automated cartography[J].Communications of the ACM,1981,24(6):381-395.
[2]LI S Q,XU C,XIE M.A Robust O(n) Solution to the Perspective-n-Point Problem [J].IEEE Trans Pattern Anal Mach Intell,2012,34(7):1444-1450.
[3]GUO Y.A Novel Solution to the P4P Problem for an Uncalibrated Camera[J].Journal of Mathematical Imaging and Vision,2012,45(2):189-198.
[4]WU Y,HU Z.PnP problem revisited [J].Journal of Mathematical Imaging and Vision,2006,24(1):131-41.
[5]BUJNAK M,KUKELOVA Z,PAJDLA T.A general solution to the P4P problem for camera with unknown focal length[C]∥IEEE Conference on Computer Vision and Pattern Recognition,2008(CVPR 2008).IEEE,2008:1-8.
[6]HU Z Y,LEI C.A Short Note on P4P Problem[J].Acta Automatica Sinica,2001,27(6):770-776.
[7]HU Z Y,WU F C.A note on the number of solutions of the non-coplanar P4P problem[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(4):550-555.
[8]WU F C,HU Z Y.A study on the P5P problem[J].Journal of Software,2001,12(5):768-775.
[9]HARALICK B M,LEE C N,OTTENBERG K,et al.Reviewand analysis of solutions of the three point perspective pose est-imation problem [J].International Journal of Computer Vision,1994,13(3):331-356.
[10]FIORE P D.Efficient linear solution of exterior orientate-on[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2001,23(2):140-148.
[11]FRIDMAN L,LEE J,REIMER B,et al.‘Owl’ and ‘Lizard’:patterns of head pose and eye pose in driver gaze c-lassification [J].IET Computer Vision,2016,10(4):308-313.
[12]WAASENMÜLLER O,STRICKER D.Comparison of Kinect v1 and v2 Depth Images in Terms of Accuracy and Precision[C]∥Asian Conference on Computer Vision Workshop (ACCV workshop).Springer,2016.
[13]SATTLER T,TORII A,SIVIC J,et al.Are Large-Scale 3DModels Really Necessary for Accurate Visual Localization?[C]∥CVPR 2017-IEEE Conference on Computer Vision and Pattern Recognition.2017.
[14]TRIGGS B,MCLAUCHLAN P F,HARTLEY R I,et al.Bundle-adjustment—a modern synthesis[C]∥International Workshop on Vision Algorithms.Springer Berlin Heidelberg,1999:298-372.
[15]WU C.Towards linear-time incremental structure from motion[C]∥2013 International Conference on 3DTV-Conference.IEEE,2013:127-134.
[16]WU C,AGARWAL S,CURLESS B,et al.Multicore bundle adjustment[C]∥2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2011:3057-3064.
[17]STRECHA C,HANSEN W V,GOOL L V,et al.Thoennessen on Benchmarking Camera Calibration and MultiView Stereo for High Resolution Imagery[C]∥CVPR 2008.2008:1-8.
[1] 陈志强, 韩萌, 李慕航, 武红鑫, 张喜龙.
数据流概念漂移处理方法研究综述
Survey of Concept Drift Handling Methods in Data Streams
计算机科学, 2022, 49(9): 14-32. https://doi.org/10.11896/jsjkx.210700112
[2] 王明, 武文芳, 王大玲, 冯时, 张一飞.
生成链接树:一种高数据真实性的反事实解释生成方法
Generative Link Tree:A Counterfactual Explanation Generation Approach with High Data Fidelity
计算机科学, 2022, 49(9): 33-40. https://doi.org/10.11896/jsjkx.220300158
[3] 张佳, 董守斌.
基于评论方面级用户偏好迁移的跨领域推荐算法
Cross-domain Recommendation Based on Review Aspect-level User Preference Transfer
计算机科学, 2022, 49(9): 41-47. https://doi.org/10.11896/jsjkx.220200131
[4] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[5] 宋杰, 梁美玉, 薛哲, 杜军平, 寇菲菲.
基于无监督集群级的科技论文异质图节点表示学习方法
Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level
计算机科学, 2022, 49(9): 64-69. https://doi.org/10.11896/jsjkx.220500196
[6] 柴慧敏, 张勇, 方敏.
基于特征相似度聚类的空中目标分群方法
Aerial Target Grouping Method Based on Feature Similarity Clustering
计算机科学, 2022, 49(9): 70-75. https://doi.org/10.11896/jsjkx.210800203
[7] 郑文萍, 刘美麟, 杨贵.
一种基于节点稳定性和邻域相似性的社区发现算法
Community Detection Algorithm Based on Node Stability and Neighbor Similarity
计算机科学, 2022, 49(9): 83-91. https://doi.org/10.11896/jsjkx.220400146
[8] 吕晓锋, 赵书良, 高恒达, 武永亮, 张宝奇.
基于异质信息网的短文本特征扩充方法
Short Texts Feautre Enrichment Method Based on Heterogeneous Information Network
计算机科学, 2022, 49(9): 92-100. https://doi.org/10.11896/jsjkx.210700241
[9] 徐天慧, 郭强, 张彩明.
基于全变分比分隔距离的时序数据异常检测
Time Series Data Anomaly Detection Based on Total Variation Ratio Separation Distance
计算机科学, 2022, 49(9): 101-110. https://doi.org/10.11896/jsjkx.210600174
[10] 聂秀山, 潘嘉男, 谭智方, 刘新放, 郭杰, 尹义龙.
基于自然语言的视频片段定位综述
Overview of Natural Language Video Localization
计算机科学, 2022, 49(9): 111-122. https://doi.org/10.11896/jsjkx.220500130
[11] 曹晓雯, 梁美玉, 鲁康康.
基于细粒度语义推理的跨媒体双路对抗哈希学习模型
Fine-grained Semantic Reasoning Based Cross-media Dual-way Adversarial Hashing Learning Model
计算机科学, 2022, 49(9): 123-131. https://doi.org/10.11896/jsjkx.220600011
[12] 周旭, 钱胜胜, 李章明, 方全, 徐常胜.
基于对偶变分多模态注意力网络的不完备社会事件分类方法
Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification
计算机科学, 2022, 49(9): 132-138. https://doi.org/10.11896/jsjkx.220600022
[13] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[14] 曲倩文, 车啸平, 曲晨鑫, 李瑾如.
基于信息感知的虚拟现实用户临场感研究
Study on Information Perception Based User Presence in Virtual Reality
计算机科学, 2022, 49(9): 146-154. https://doi.org/10.11896/jsjkx.220500200
[15] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!