CyberSpy's bookmarks matching tag acceptable
 
 
Narrative text classification for automatic key phrase extraction in web document corpora
doi.acm.org
Automatic key phrase extraction is a useful tool in many text-related applications such as clustering and summarization. State-of-the-art methods are aimed at extracting key phrases from traditional text such as technical papers. Applying these methods to Web documents, which often contain diverse and heterogeneous content, is of particular interest and challenge in the information age. In this work, we investigate the significance of narrative text classification in the task of automatic key phrase extraction in Web document corpora. We benchmark three methods, TFIDF, KEA, and Keyterm, used to extract key phrases from all the plain text and from only the narrative text of Web pages. ANOVA tests are used to analyze the ranking data collected in a user study using quantitative measures of acceptable percentage and quality value. The evaluation shows that key phrases extracted from the narrative text only are significantly better than those obtained from all plain text of Web pages. This demonstrates that narrative text classification is indispensable for effective key phrase extraction in Web document corpora.
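
As a rough illustration of the TFIDF baseline named in the abstract, the sketch below ranks the terms of one document by term frequency times inverse document frequency. It is not the paper's implementation: the function names `tokenize` and `tfidf_top_terms`, the single-word tokenization, and the unsmoothed `log(N / df)` weighting are all assumptions made for this example, and KEA and Keyterm are not shown.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_top_terms(docs, doc_index, k=3):
    """Return the k highest-scoring TF-IDF terms of docs[doc_index].

    Hypothetical helper: scores single words only, not multi-word phrases.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Term frequency within the target document, normalized by its length.
    tf = Counter(tokenized[doc_index])
    total = sum(tf.values())
    scores = {
        term: (count / total) * math.log(n_docs / df[term])
        for term, count in tf.items()
    }

    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]
```

Under these assumptions, a term that occurs in every page of the corpus (for example a word from a shared navigation menu) gets an inverse document frequency of zero and drops to the bottom of the ranking. Running the same function once on all plain text and once on only the narrative text of each page would mirror the comparison the abstract describes.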
