Semantic Similarity Approach for Paraphrase Identification and Information Summarization
Abstract
Paraphrase identification is an application of natural language processing and data mining. The World Wide Web has become a powerful platform for storing and retrieving information, which may include text, images, and multimedia files. Extracting useful information from this data requires an efficient algorithm. Paraphrase identification is the task of determining whether two sentences have the same meaning. Here we identify paraphrases through syntactic and semantic analysis of the document, and we implement a hierarchical clustering approach for this purpose.
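As a rough illustration of the idea, the following minimal sketch groups sentences by pairwise similarity using agglomerative (hierarchical) clustering. It is not the method of this paper: the abstract does not specify the similarity measure, so Jaccard overlap of word sets stands in for the syntactic and semantic analysis, and the function names `tokens`, `similarity`, and `cluster`, the single-link merge rule, and the threshold of 0.5 are all assumptions made for the example.

```python
import re
from itertools import combinations


def tokens(sentence):
    """Return the set of lowercase words in a sentence.

    Assumption: a crude lexical stand-in for real syntactic/semantic analysis.
    """
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))


def similarity(a, b):
    """Jaccard overlap of the two sentences' word sets, in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def cluster(sentences, threshold=0.5):
    """Single-link agglomerative clustering over sentence indices.

    Repeatedly merges the two most similar clusters and stops when the
    best available link falls below the (assumed) threshold.
    """
    clusters = [[i] for i in range(len(sentences))]
    while len(clusters) > 1:
        best, pair = -1.0, None
        for x, y in combinations(range(len(clusters)), 2):
            # Single link: similarity of the closest pair of members.
            link = max(
                similarity(sentences[i], sentences[j])
                for i in clusters[x]
                for j in clusters[y]
            )
            if link > best:
                best, pair = link, (x, y)
        if best < threshold:
            break
        x, y = pair
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]
```

Calling `cluster` on a list of sentences returns lists of sentence indices, where sentences that land in the same list are treated as candidate paraphrases. A realistic system would likely replace `similarity` with a measure that accounts for synonyms and sentence structure rather than exact word overlap.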