计算机科学 ›› 2019, Vol. 46 ›› Issue (10): 14-18.doi: 10.11896/jsjkx.190100087

• 大数据与数据科学* • 上一篇    下一篇

面向评论文本数据的旭日图可视化

易小群, 李天瑞, 陈超   

  1. (西南交通大学信息科学与技术学院 成都611756)
    (西南交通大学人工智能研究院 成都611756)
    (西南交通大学综合交通大数据国家工程实验室 成都611756)
  • 收稿日期:2019-01-09 修回日期:2019-04-20 出版日期:2019-10-15 发布日期:2019-10-21
  • 通讯作者: 李天瑞(1969-),男,教授,博士生导师,CCF杰出会员,主要研究方向为数据挖掘与知识发现、云计算与大数据、粒计算与粗糙集,E-mail:trli@swjtu.edu.cn。
  • 作者简介:易小群(1993-),女,硕士生,CCF会员,主要研究方向为数据可视化;陈超(1993-),男,硕士生,主要研究方向为数据可视化。
  • 基金资助:
    本文受国家自然科学基金(61573292)资助。

Sunburst Visualization for Comment Text Data

YI Xiao-qun, LI Tian-rui, CHEN Chao   

  1. (School of Information Science and Technology,Southwest Jiaotong University,Chengdu 611756,China)
    (Institute of Artificial Intelligence,Southwest Jiaotong University,Chengdu 611756,China)
    (National Engineering Laboratory of Integrated Transportation Big Data Application Technology,Southwest Jiaotong University,Chengdu 611756,China)
  • Received:2019-01-09 Revised:2019-04-20 Online:2019-10-15 Published:2019-10-21

摘要: 旭日图是一种现代饼图,它超越传统的饼图和环图,不仅能表达数据的占比问题,更能表达清晰的层级和归属关系,以父子层次结构来显示数据的构成情况。使用传统的旭日图对文本数据进行可视化时,不能全面地展示实体关系和情感偏向,而且旭日图层数越多,信息的可读性就越低。针对以上问题,对传统的旭日图进行了改进。首先,设计同级相邻圆弧的交叠,展示文本中实体的关系。然后,将旭日图与柱形图相结合,展示评论文本的感情偏向,柱形图体现为圆弧的涂色宽度,表示对于某方面评论的满意度。最后,对数据进行优化重排,包括:1)基于整体的考虑,将凸出部分放在邻接位置以节省空间;2)对局部的数据优化进行重排,使得最外层的节点尽可能高低错落,以提高稀疏性,便于观察。实验结果表明:改进的旭日图能够更全面、清晰地对评论文本进行可视化,为用户提供更灵活、个性化的可视化展示。

关键词: 交互, 可视化, 情感偏向, 数据重排, 旭日图

Abstract: Sunburst is a kind of modern pie chart.It goes beyond the traditional pie chart and ring chart.It can not only express the proportion of data,but also express the clear hierarchy and attribution relationship,and display the data composition with the hierarchical structure of father and son.When Sunburst is used to visualize text data,it can not fully display the entity relationship and emotional bias.In addition,the more hierarchies of the Sunburst is involved,the lower readability of the information will be.In view of the above problems,this paper proposed the following improvements to the traditional Sunburst.Firstly,the overlapping of the same level adjacent arc is designed to show the relation of the entity in the text.Sencondly,the combination of Sunburst and histogram is put forward to show the emotional bias of the comment text.The color width of the arc in histogram chart expresses the satisfaction of the comment on a certain aspect.Thirdly,the data are rearranged optimally,including that for the overall consideration,the protruding part is placed in the adjacent position to save space,and the local data is optimized and rearranged to make the outermost nodes as high and low as possible,so as to improve the sparsity and facilitate observation.The experimental results show that the improved Sunburst can provide more comprehensive and clear visualization of comment text,and provide more flexible and personalized visualization display for users.

Key words: Data rearrangement, Emotive tendency, Interaction, Sunburst, Visualization

中图分类号: 

  • TP391
[1]REN L,DU Y,MA S,et al.Visual analytics towards big data[J].Journal of Software,2014,25(9):1909-1936.(in Chinese)
任磊,杜一,马帅,等.大数据可视分析综述[J].软件学报,2014,25(9):1909-1936.
[2]JOHNSON B.TreeViz:treemap visualization of hierarchically structured information[C]//Proceedings of the SIGCHI Confe-rence on Human Factors in Computing Systems.ACM,1992:369-370.
[3]STASKO J,ZHANG E.Focus+ context display and navigation techniques for enhancing radial,space-filling hierarchy visualizations[C]//Proceedings of 2000 IEEE Symposium on Information Visualization.IEEE,2000:57-65.
[4]STASKO J,CATRAMBONE R,GUZDIAL M,et al.An evaluation of space-filling information visualizations for depicting hierarchical structures[J].International Journal of Human-ComputerStudies,2000,53(5):663-694.
[5]MÉDOC N,GHONIEM M,NADIF M.Exploratory analysis of text collections through visualization and hybrid biclustering[C]//Proceedings of Joint European Conference on Machine Learning and Knowledge Discovery in Databases.Cham:Sprin-ger,2016:59-62.
[6]ZHU Y,SUN L,GARBARINO A,et al.PathRings:A web-based tool for exploration of ortholog and expression data in biological pathways[J].BMC Bioinformatics,2015,16(1):165.
[7]SENATHIRAJAH Y,WANG J,BORYCKI E M,et al.Mapping the Electronic Health Record:A method to study display fragmentation[C]//Proceedings of the 16th World Congress on Medical and Health Informatics.Studies in Health Technology and Informatics,2017:1138-1142.
[8]CHEN Y,ZHANG X,FENG Y,et al.Sunburst with ordered nodes based on hierarchical clustering:A visual analyzing methodfor associated hierarchical pesticide residue data[J].Journal of Visualization,2015,18(2):237-254.
[9]ZHOU J,LI Z,ZHANG Z,et al.Visual analytics of relations of Multi-Attributes in big infrastructure data[C]//Proceedings of IEEE 2016 Big Data Visual Analytics (BDVA).Sydney:IEEE,2016:1-2.
[10]HUSSEI N D M E D M.A survey on sentiment analysis challenges[J].Journal of King Saud University-Engineering Sciences,2018,30(4):330-338.
[11]CERNEA D,WEBER C,EBERT A,et al.Emotion scents:A method of representing user emotions on gui widgets[C]//Proceedings of Visualization and Data Analysis 2013.International Society for Optics and Photonics,2013.
[12]LEE C,LUI S,SO C.Visualization of time-varying joint deve-lopment of pitch and dynamics for speech emotion recognition[J].Journal of the Acoustical Society of America,2014,135(4):2422-2422.
[13]LIAO Z,HE D,CHEN Z,et al.Exploring the Characteristics of Issue-Related Behaviors in GitHub Using Visualization Techniques[J].IEEE Access,2018,6:24003-24015.
[14]SMITH A,HAWES T,MYERS M.Hiearchie:Visualization for hierarchical topic models[C]//Proceedings of the Workshop on Interactive Language Learning,Visualization,and Interfaces.Baltimore,Maryland,USA:Association for Computational Linguistics,2014:71-78.
[15]LI H,CHEN H Q,DONG S,et al.Visualization of Double-Layer Radiation Ring Matrix for Professional Shunt Data[J].Computer Science,2017,44(S1):455-458.(in Chinese)
李慧,陈红倩,董爽,等.针对专业分流数据的双层放射环矩阵可视化[J].计算机科学,2017,44(S1):455-458.
[16]ALTARAWNEH R,HUMAYOUN S R,Al-JAAFREH A.Towards Optimizing the Sunburst Visualization for Smart Mobile Devices[C]//Proceedings of the 15th IFIP TC.13 International Conference on Human-Computer Interaction-INTERACT 2015.Bamberg:University of Bamberg Press,2015:2015323.
[17]KUZNETSOVA I,LUGMAYR A,HOLZINGER A.Visualization Methods of Hierarchical Biological Data:A Survey and Review[J].International SERIES on Information Systems and Management in Creative eMedia (CreMedia),2018,2017(2):32-39.
[18]SZILAS N,MARANO M,ESTUPINAN S.A tool for interactive visualization of narrative acts[C]//Proceedings of International Conference on Interactive Digital Storytelling.Cham:Springer,2018:176-180.
[1] 聂秀山, 潘嘉男, 谭智方, 刘新放, 郭杰, 尹义龙.
基于自然语言的视频片段定位综述
Overview of Natural Language Video Localization
计算机科学, 2022, 49(9): 111-122. https://doi.org/10.11896/jsjkx.220500130
[2] 杨啸, 王翔坤, 胡浩, 朱敏.
面向设备状态监测的可视化技术综述
Survey on Visualization Technology for Equipment Condition Monitoring
计算机科学, 2022, 49(7): 89-99. https://doi.org/10.11896/jsjkx.210900167
[3] 李荪, 曹峰.
智能语音技术端到端框架模型分析和趋势研究
Analysis and Trend Research of End-to-End Framework Model of Intelligent Speech Technology
计算机科学, 2022, 49(6A): 331-336. https://doi.org/10.11896/jsjkx.210500180
[4] 陈慧嫔, 王琨, 杨恒, 郑智捷.
蓝舌病毒基因组序列多元概率特征可视化分析
Visual Analysis of Multiple Probability Features of Bluetongue Virus Genome Sequence
计算机科学, 2022, 49(6A): 27-31. https://doi.org/10.11896/jsjkx.210300129
[5] 张明新.
面向超大规模社会系统仿真的概念模型
Conceptual Model for Large-scale Social Simulation
计算机科学, 2022, 49(4): 16-24. https://doi.org/10.11896/jsjkx.210900136
[6] 朱敏, 梁朝晖, 姚林, 王翔坤, 曹梦琦.
学术引用信息可视化方法综述
Survey of Visualization Methods on Academic Citation Information
计算机科学, 2022, 49(4): 88-99. https://doi.org/10.11896/jsjkx.210300219
[7] 张继凯, 李琦, 王月明, 吕晓琪.
基于单目RGB图像的三维手势跟踪算法综述
Survey of 3D Gesture Tracking Algorithms Based on Monocular RGB Images
计算机科学, 2022, 49(4): 174-187. https://doi.org/10.11896/jsjkx.210700084
[8] 潘志豪, 曾碧, 廖文雄, 魏鹏飞, 文松.
基于交互注意力图卷积网络的方面情感分类
Interactive Attention Graph Convolutional Networks for Aspect-based Sentiment Classification
计算机科学, 2022, 49(3): 294-300. https://doi.org/10.11896/jsjkx.210100180
[9] 李家振, 纪庆革, 朱泳霖.
分子可视化中的光线追踪棋盘渲染
Ray Tracing Checkerboard Rendering in Molecular Visualization
计算机科学, 2022, 49(2): 134-141. https://doi.org/10.11896/jsjkx.210900126
[10] 李家振, 纪庆革.
动态低采样环境光遮蔽的实时光线追踪分子渲染
Dynamic Low-sampling Ambient Occlusion Real-time Ray Tracing for Molecular Rendering
计算机科学, 2022, 49(1): 175-180. https://doi.org/10.11896/jsjkx.210200042
[11] 余乐章, 夏天宇, 荆一楠, 何震瀛, 王晓阳.
面向大数据分析的智能交互向导系统
Smart Interactive Guide System for Big Data Analytics
计算机科学, 2021, 48(9): 110-117. https://doi.org/10.11896/jsjkx.200900083
[12] 朱晨爽, 程时伟.
虚拟现实环境下基于眼动跟踪的导航需求预测与辅助
Prediction and Assistance of Navigation Demand Based on Eye Tracking in Virtual Reality Environment
计算机科学, 2021, 48(8): 315-321. https://doi.org/10.11896/jsjkx.200500031
[13] 刘亮, 蒲浩洋.
基于LSTM的多维度特征手势实时识别
Real-time LSTM-based Multi-dimensional Features Gesture Recognition
计算机科学, 2021, 48(8): 328-333. https://doi.org/10.11896/jsjkx.210300079
[14] 高岩, 闫秋艳, 夏士雄, 张紫涵.
基于骨骼轨迹聚合模型的课堂交互群体发现
Interactive Group Discovery Based on Skeleton Trajectory Aggregation Model in ClassEnvironment
计算机科学, 2021, 48(8): 334-339. https://doi.org/10.11896/jsjkx.201000036
[15] 骆菁菁, 唐卫贞, 丁继婷.
基于皮尔逊系数的管制仿真训练数据独立化与因子分析下的数据可视化研究
Research of ATC Simulator Training Values Independence Based on Pearson Correlation Coefficient and Study of Data Visualization Based on Factor Analysis
计算机科学, 2021, 48(6A): 623-628. https://doi.org/10.11896/jsjkx.210200021
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!