Publish In
International Journal of Advance Computational Engineering and Networking (IJACEN)-IJACEN
Journal Home
Volume Issue
Volume-1,Issue-9  ( Nov, 2013 )
Paper Title
A Text Categorization On Semantic Analysis
Author Name
Durga Bhavani Dasari, Venu Gopala Rao.K
1Jawaharlal Nehru Technological University, Hyderabad, India 2G. Narayanamma Institute of Technology and Science, Hyderabad, India
Computing semantic relatedness of natural language texts requires access to vast amounts of common-sense and domain-specific world knowledge. We propose Explicit Semantic Analysis (ESA), a novel method that represents the meaning of texts in a high-dimensional space of concepts derived from crops. We use machine learning techniques to explicitly represent the meaning of any text as a weighted vector of crops-based concepts. Assessing the relatedness of texts in this space amounts to comparing the corresponding vectors using conventional metrics (e.g., cosine). Compared with the previous state of the art, using ESA results in substantial improvements in correlation of computed relatedness scores with human judgments: from r =0.56 to 0.75 for individual words and from r =0.60 to 0.72 for texts. Importantly, due to the use of natural concepts, the ESA model is easy to explain to human users. The proposed model showed enhanced precision and recall extraction values over other approaches.
  View Paper