Combining sentence similarities measures to identify paraphrases. (January 2018)

Record Type:: Journal Article
Title:: Combining sentence similarities measures to identify paraphrases. (January 2018)
Main Title:: Combining sentence similarities measures to identify paraphrases
Authors:: Ferreira, Rafael
Cavalcanti, George D.C.
Freitas, Fred
Lins, Rafael Dueire
Simske, Steven J.
Riss, Marcelo
Abstract:: Highlights: It proposes a new paraphrase identification system based on lexical, syntactic, semantic analysis. It uses different machine learning algorithms to classify the paraphrase. The measure was evaluated using state-of-art dataset: Microsoft Paraphrase Corpus. Abstract: Paraphrase identification consists in the process of verifying if two sentences are semantically equivalent or not. It is applied in many natural language tasks, such as text summarization, information retrieval, text categorization, and machine translation. In general, methods for assessing paraphrase identification perform three steps. First, they represent sentences as vectors using bag of words or syntactic information of the words present the sentence. Next, this representation is used to measure different similarities between two sentences. In the third step, these similarities are given as input to a machine learning algorithm that classifies these two sentences as paraphrase or not. However, two important problems in the area of paraphrase identification are not handled: (i) the meaning problem: two sentences sharing the same meaning, composed of different words; and (ii) the word order problem: the order of the words in the sentences may change the meaning of the text. This paper proposes a paraphrase identification system that represents each pair of sentence as a combination of different similarity measures. These measures extract lexical, syntactic and semantic components of the sentences … (more)
Is Part Of:: Computer speech & language. Volume 47(2018)
Journal:: Computer speech & language
Issue:: Volume 47(2018)
Issue Display:: Volume 47, Issue 2018 (2018)
Year:: 2018
Volume:: 47
Issue:: 2018
Issue Sort Value:: 2018-0047-2018-0000
Page Start:: 59
Page End:: 73
Publication Date:: 2018-01
Subjects:: Sentence similarity -- Paraphrase identification -- Sentence simplification -- Graph-based model
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2017.07.002 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 20832.xml