Document Information


                                             

A NOVEL SEMANTIC APPROACH TO DOCUMENT COLLECTIONS 
Author(s): Andrea Addis, Manuela Angioni, Giuliano Armano, Roberto Demontis, Franco Tuveri, Eloisa Vargiu
Paper abstract: Document collections are increasingly required for supervised text categorization tasks. They are typically collections of documents classified by domain engineers. In this paper, we propose a semantic text categorization approach that automatically creates document collections in which documents are classified according to the WordNet Domains taxonomy. Experiments were performed by training a classifier on an automatically created document collection and comparing the results with those obtained by training the same classifier on a document collection classified by domain engineers. Experimental results show that, on average, the performance of the automatic approach is quite similar to that obtained on a document collection classified by hand.
Keywords: Text Categorization, Document Collections, Intelligent Software Systems, Machine Learning.
Type: Journal Paper  
First Page: 73 
Last Page: 85 
Year: 2009  
Editors: Pedro Isaías and Marcin Paprzycki  
ISSN: 1646-3692
Language: English  
Conference Name: IADIS International Journal on Computer Science and Information Systems
Volume: IV, 2
