Computer Science ›› 2007, Vol. 34 ›› Issue (4): 210-212.
Previous Articles Next Articles
GAO Qiang ,ZHANG Jing-Zhi, GENG Hua, PAN Jin-Gui (State Key Lab. for Novel Software Technology, Nanjing University, Nanjing 210093)
Online:
Published:
Abstract: In a data-rich, multiple-record Web page, the "useful and relevant" information items are usually arranged regularly and compactly, with similar pattern of HTML tags and consistent style of presentation. In other words, the semi-structured Web document of
Key words: Web information extraction, Repeated pattern, Suffix tree
GAO Qiang ,ZHANG Jing-Zhi, GENG Hua, PAN Jin-Gui (State Key Lab. for Novel Software Technology, Nanjing University, Nanjing 210093). [J].Computer Science, 2007, 34(4): 210-212.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: https://www.jsjkx.com/EN/
https://www.jsjkx.com/EN/Y2007/V34/I4/210
Cited