Çizge Benzerliği Yöntemi ile Doküman Sınıflandırma (Document Classification Using the Graph Similarity Method)
Date
2019
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
Document classification is among the most extensively studied topics today. Text similarity is used in many areas, such as checking whether a quoted passage appears elsewhere and returning fast, accurate results in search engines. A variety of methods are used to find similarities between documents. When several documents are compared, similarity is measured by two basic approaches: word-based and sentence-based. Word-based measurements rely on measures such as Jaccard, Dice, and cosine similarity. In this study, the paragraphs of different documents are split into sentences and represented as graphs, and the documents are classified using Hamming distances computed by XORing the adjacency matrices obtained from these documents. © 2018 IEEE.
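The abstract does not specify how the graphs are built, so the following Python sketch is only an illustration under stated assumptions: nodes are the words of a vocabulary shared by all documents, an undirected edge joins two words that appear next to each other within a sentence, and a document receives the label of the reference document at the smallest Hamming distance. The function names (`sentences`, `adjacency`, `hamming_xor`, `classify`) and the nearest-neighbour rule are hypothetical and are not taken from the paper.

```python
import re


def sentences(text):
    """Split text into sentences on '.', '!' or '?' (a naive assumption)."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def tokens(sentence):
    """Lowercase word tokens of one sentence."""
    return re.findall(r"\w+", sentence.lower())


def build_vocab(texts):
    """Sorted vocabulary shared by all documents, so matrices align."""
    return sorted({w for t in texts for s in sentences(t) for w in tokens(s)})


def adjacency(text, vocab):
    """Binary adjacency matrix over vocab.

    Assumption: an undirected edge joins two words that are adjacent
    within the same sentence.
    """
    index = {word: i for i, word in enumerate(vocab)}
    n = len(vocab)
    matrix = [[0] * n for _ in range(n)]
    for sentence in sentences(text):
        words = tokens(sentence)
        for a, b in zip(words, words[1:]):
            i, j = index[a], index[b]
            matrix[i][j] = matrix[j][i] = 1
    return matrix


def hamming_xor(m1, m2):
    """Hamming distance: number of cells where the XOR of the matrices is 1."""
    return sum(x ^ y for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2))


def classify(query, labeled_docs):
    """Return the label of the reference document closest to the query.

    labeled_docs is a list of (label, text) pairs (hypothetical format).
    """
    vocab = build_vocab([query] + [text for _, text in labeled_docs])
    query_matrix = adjacency(query, vocab)
    best_label, _ = min(
        labeled_docs,
        key=lambda item: hamming_xor(query_matrix, adjacency(item[1], vocab)),
    )
    return best_label


if __name__ == "__main__":
    references = [
        ("animals", "Cats chase mice. Dogs chase cats."),
        ("finance", "Banks lend money. Markets set prices."),
    ]
    print(classify("Dogs chase mice.", references))
```

Because the matrices are symmetric, each differing edge contributes two differing cells to the distance. Building one vocabulary over all compared documents keeps the matrices the same size, which the cell-by-cell XOR requires.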
Keywords
Document Similarity, Graphs, Hamming Distance
WoS Q
N/A
Scopus Q
N/A
Source
2018 International Conference on Artificial Intelligence and Data Processing, IDAP 2018 -- 28 September 2018 through 30 September 2018 -- Malatya -- 144523