Aurangzeb , khan and Baharum, Baharudin and Khairullah, khan (2010) An Overview of E-Documents Classification. In: 2009 International Conference on Machine Learning and Computing.
An_Overview_of_E-Documents_Classification.pdf - Published Version
Restricted to Registered users only
Download (121kB)
Abstract
With the increasing availability of electronic documents and the rapid growth of the World Wide
Web, the task of automatic categorization of documents becomes the key method for organizing the
information, knowledge and trend detection. With the growing availability of online resources, and
popularity of fast and rich resources on web, classification of e-documents, news, personal blogs, and
extraction of knowledge and trend from the documents has become an interesting area for research, as the
World Wide Web is the fastest media for news and events collection from world. So the growing
phenomenon of the textual data needs text mining, machine learning and natural language processing
techniques and methodologies to organize and extract pattern and knowledge from the documents. This
overview focused on the existing literature and explored the main techniques and methods for automatic
documents classification i.e. documents representation, classifier construction and knowledge extraction and
also discussed the issues along with the approaches and opportunities.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | T Technology > T Technology (General) |
Departments / MOR / COE: | Departments > Computer Information Sciences |
Depositing User: | Dr Baharum Baharudin |
Date Deposited: | 26 Sep 2011 09:36 |
Last Modified: | 19 Jan 2017 08:24 |
URI: | http://scholars.utp.edu.my/id/eprint/6430 |