Document classification/categorization is a problem in information science. The task is to assign an electronic document to one or more categories, based on its contents.
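As a rough illustration of assigning a document to one or more categories based on its contents, the following is a minimal keyword-counting sketch. The category names, keyword sets, `min_hits` threshold and the helper names `tokenize` and `classify` are all assumptions made up for this example. Practical systems usually learn such associations from labelled training documents rather than from a hand-written vocabulary.

```python
import re
from collections import Counter

# Hypothetical category vocabularies, invented for this illustration.
CATEGORY_KEYWORDS = {
    "finance": {"invoice", "payment", "bank", "tax"},
    "legal": {"contract", "court", "liability", "tax"},
    "sports": {"match", "goal", "team", "league"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text, min_hits=2):
    """Return, sorted by name, every category whose keywords occur
    at least `min_hits` times in the text (possibly none, possibly several)."""
    counts = Counter(tokenize(text))
    scores = {
        category: sum(counts[word] for word in keywords)
        for category, keywords in CATEGORY_KEYWORDS.items()
    }
    return sorted(c for c, score in scores.items() if score >= min_hits)


doc = "The contract covers payment terms, tax liability and the bank invoice."
print(classify(doc))
```

Because a keyword such as "tax" may belong to more than one vocabulary, a single document can land in several categories at once, which matches the "one or more categories" wording above.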
WWW: document classification; clustering weblog data to discover groups of similar access patterns.
Requirements
The main requirements that a clustering algorithm should satisfy are: scalability; ...
These applications range from email classification, auto-response and archive tagging to compliance and legal discovery to document classification and taxonomy management within Enterprise Content Management (ECM) systems. ...
Keyword-Based Association Analysis
Document Classification Analysis
Document Clustering Analysis
Mining the World Wide Web ...
An extension of a taxonomy that includes rules on vocabulary usage for document classification, e.g. "preferred terms", "synonym of", "belongs to", "used for", etc. See the article Taxonomy-Enhanced Knowledge Publications.
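The vocabulary rules named above can be sketched as a small lookup structure. This is only one possible reading, under stated assumptions: "synonym of" maps a non-preferred term to its preferred term, "used for" is treated as the inverse of that mapping, and "belongs to" links a term to its parent category. All entries and the function names `preferred_term`, `used_for` and `categories_for` are hypothetical.

```python
# Hypothetical vocabulary rules, invented for this illustration.
SYNONYM_OF = {"automobile": "car", "motorcar": "car", "lorry": "truck"}
BELONGS_TO = {"car": "vehicle", "truck": "vehicle", "vehicle": "transport"}


def preferred_term(term):
    """Map a term to its preferred form; unknown terms are returned lowercased."""
    term = term.lower()
    return SYNONYM_OF.get(term, term)


def used_for(term):
    """List the non-preferred terms that a preferred term stands in for."""
    return sorted(s for s, p in SYNONYM_OF.items() if p == term)


def categories_for(term):
    """Follow 'belongs to' links upward from the preferred term.

    The membership check on `path` stops the walk if the rules contain a cycle.
    """
    path = []
    current = preferred_term(term)
    while current in BELONGS_TO and BELONGS_TO[current] not in path:
        current = BELONGS_TO[current]
        path.append(current)
    return path


print(preferred_term("Automobile"))
print(used_for("car"))
print(categories_for("lorry"))
```

Normalising document terms to their preferred form before classification means that documents using "automobile" and "car" are counted under the same category.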
See also: Classification, Knowledge, Natural language processing, Machine learning, Artificial intelligence
 