Computer Science ›› 2013, Vol. 40 ›› Issue (5): 1-7.

    Next Articles

Research on PAM Probability Topic Model

YU Miao-miao,WANG Jun-li,ZHAO Xiao-dong and YUE Xiao-dong   

  • Online:2018-11-16 Published:2018-11-16

Abstract: Recently,topic model is emerging as a new hotspot of research in computer science,and has a wide range of applications in natural language processing,document classification,information retrieval and so on.The paper mainly analyzed the Pachinko Allocation Model and its improved models.The improved hierarchical PAM is effective at disco-vering mixtures of topic hierarchies.Nonparametric PAM has more expressive force for complex structures.It has a nonparametric Bayesian prior based on a variant of the hierarchical Dirichlet process.The theory and applications of PAM and its related topic models were summarized,and finally the future directions were discussed.

Key words: Topic model,PAM,Document generation,Statistical inference

[1] 曹娟,张勇东,李锦涛,等.一种基于密度的自适应最优LDA模型选择方法[J].计算机学报,2008,31(10):1780-1787
[2] Deerwester S C,Dumais S T,Landauer T K,et al.Indexing by Latent Semantic Analysis[J].Journal of the American Society for Information Science,1990,41(6):391-407
[3] Hofmann T.Probabilistic Latent Semantic Indexing[C]∥Proceedings of the 22nd Annual International SIGIR Conference.New York:ACM Press,1999:50-57
[4] Blei D,Ng A,Jordan M.Latent Dirichlet Allocation[J].Journal of Machine Learning Research,2003,3:993-1022
[5] 张小平,周雪忠,黄厚宽,等.一种改进的LDA主题模型[J].北京交通大学学报,2010,34(2):111-114
[6] Wang X,McCallum A.Topics over time:A Non-Markov Conti-nuous-Time model of topical trends[C]∥Proceedings of the Conference on Knowledge Discovery and Data Mining(KDD).Philadelphia,USA,2006:424-433
[7] Blei D,Lafferty J.Dynamic topic models[C]∥Proceedings of the 23rd International Conference on Machine Learning.Pittsburgh,Pennsylvania,USA,2006:113-120
[8] 李文波,孙乐,张大鲲.基于Labeled-LDA模型的文本分类新算法[J].计算机学报,2008,31(4):620-627
[9] 张晨逸,孙建伶,丁轶群.基于MB-LDA模型的微博主题挖掘[J].计算机研究与发展,2011,48(10):1795-1802
[10] Wang W,Barnaghi P M,Bargiela A.Probabilistic Topic Models for Learning Terminology Ontologies[J].IEEE Transaction on Knowledge and Data Engineering,2010,22(7):1028-1040
[11] Zavitsanos E,Paliouras G,Petridis S,et al.Learning subsumption hierarchies of ontology concepts from texts[J].Web Intelligence and Agent Systems,2010,8(1):37-51
[12] Zavitsanos E,Paliouras G,Vouros G A,et al.Discovering Subsumption Hierarchies of Ontology Concepts from Text Corpora[C]∥Proceedings of Web Intelligence.2007:402-408
[13] 石晶,胡明,石鑫,等.基于LDA模型的文本分割[J].计算机学报,2008,31(10):1865-1873
[14] 石晶,范猛,李万龙.基于LDA模型的主题分析[J].自动化学报,2009,35(12):1586-1592
[15] 袁柳,张龙波.基于概率主题模型的标签预测[J].计算机科学,2011,38(7):175-180
[16] Blei D M,Lafferty J D.Correlated Topic Models[C]∥Weiss Y,Scholkopf B,Platt J,eds.Advances in Neural Information Processing Systems 18.Cambridge,MA:MIT Press,2006
[17] Li W,McCallum A.Pachinko Allocation:DAG-structured mixture models of topic correlations[C]∥Proceedings of the International Conference on Machine Learning (ICML).Pittsburgh,Pennsylcania,2006:577-584
[18] 许戈,王厚峰.自然语言处理中主题模型的发展[J].计算机学报,2011,34(8):1423-1436
[19] Griffiths T L,Steyvers M.Finding scientific topics[C]∥Proceedings of the National Academy of Sciences.2004,101:5228-5235
[20] Blei D,Griths T,Jordan M,et al.Hierarchical topic models and the nested Chinese restaurant process[C]∥Neural Information Processing Systems. 2004
[21] Mimino D,Li W,McCallum A.Mixtures of hierarchical topicswith Pachinko Allocation[C]∥Proceedings of the ICML.Corvallis,Oregon,USA,2007:424-433
[22] Wallach H.Topic modeling:Beyond bag of words[C]∥Procee-dings of the 23rd International Conference on Machine Lear-ning.2006
[23] Minka T.Estimating a Dirichlet Distribution[R].Technical report.M.I.T.,2000
[24] Li W,Blei D,McCallum A.Nonparametric Bayes pachinko allocation[C]∥Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence.Menlo Park,CA:AUAI Press,2007
[25] Teh Y,Jordan M,Beal M,et al.Hierarchical Dirichlet Processes[J].Journal of the American Statistical Association,2006,101(476):1566-1581
[26] 周建英,王飞跃,曾大军.分层Dirichlet过程及其应用综述[J].自动化学报,2011,37(4):389-407
[27] Mimno D,Wallach H,MaCallum A.Gibbs sampling for logistic normal topic models with graph-based priors[C]∥Proceedings of the NIPS Workshop on Analyzing Graphs.Whistler,Canada,2008
[28] Nallapati R,Cohen W,Lafferty J.Parallelized variational EM for latent dirichlet allocation:An experimental evaluation of speed and scalability[C]∥Proceedings of the ICDM Workshop on High Performance Data Mining.Omaha,USA,2007:349-354
[29] Asuncion A,Smyth P,Welling M.Asynchronous distributedlearning of topic models[C]∥Proceedings of the NIPS.Vancouver,Canada,2008:81-88
[30] Smola A,Narayanamurthy S.An architecture for parallel topic models[C]∥Proceedings of the VLDB.Singapore,2010
[31] Yao L,Mimno D,MaCallum A.Efficient methods for topic mo-del inference on streaming document collections[C]∥Procee-dings of the KDD.Paris,France,2009:937-946
[32] Sethi I,Seetharaman M M.Multi-Document Summarization u-sing Pachinko Allocation Method[D].Sri Sivasubramania Nadar College of Engineering,Chennai-India 603110
[33] Boulemden A,Tlili Y.Image Indexing and Retrieval withPachinko Allocation Model:Application on Local and Global Features[J].Lecture Notes in Computer Science,2012,7457:140-146
[34] Bakalov A,McCallum A,Wallach H,et al.Topic Models forTaxonomies[C]∥Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries.2012:237-240
[35] Ma Hai-ping,Chen En-hong,Xu Lin-li,et al.Capturing correlations of multiple labels:A generative probabilistic model for multi-label learning[J].Neurocomputing,2012,92:116
[36] Li Y,Wang W,Gao W.Object Recognition Based on Dependent Pachinko Allocation Model[C]∥IEEE ICIP.2007:337-340

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!