ALGORITHM OF EVALUATING THE SIMILARITY OF THE NEWS ARTICLES BASED ON POLYNOMIAL HASHING

Authors

  • M. O. Hranik Vinnytsia National Technical University
  • V. I. Mesiura Vinnytsia National Technical University

Keywords:

news, news comparing, polynomial hashing

Abstract

There has been suggested the algorithm of comparing the similarity of few news articles based on polynomial hashing. The algorithm can be used for clusterization of the news articles.

Author Biographies

M. O. Hranik, Vinnytsia National Technical University

Post-Graduate Student of the Chair of Computer Sciences

V. I. Mesiura, Vinnytsia National Technical University

Cand. Sc. (Eng.), Assistant Professor, Professor of the Chair of Computer Sciences

References

1. Singhal Amit. Modern Information Retrieval: A Brief Overview / Singhal Amit // Bulletin of the IEEE Computer Society Technical Committee on Data Engineering. — 2001. — 24 (4). — P. 35—43.
2. Матеріали курсу Data Mining, що викладався у University of Utah [Електронний ресурс]. — Режим доступу до ма-теріалів : http://www.cs.utah.edu/~jeffp/teaching/cs5955/L4-Jaccard+Shingle.pdf .
3. Karen Spärck Jones. A statistical interpretation of term specificity and its application in retrieval / Karen Spärck Jones // Journal of Documentation. — 2004. — No. 60. — P. 493—502.
4. Lovins Julie Beth. Development of a Stemming Algorithm / Lovins Julie Beth // Mechanical Translation and Computational Linguistics. — 2006. — No. 11. — P. 22—31.
5. All About Stop Words for Text Mining and Information Retrieval [Electronic resource] // Text Mining, Analytics & More. — Access mode: http://www.text-analytics101.com/2014/10/all-about-stop-words-for-text-mining.html .

Downloads

Abstract views: 153

Published

2016-09-05

How to Cite

[1]
M. O. Hranik and V. I. Mesiura, “ALGORITHM OF EVALUATING THE SIMILARITY OF THE NEWS ARTICLES BASED ON POLYNOMIAL HASHING”, Вісник ВПІ, no. 4, pp. 75–79, Sep. 2016.

Issue

Section

Information technologies and computer sciences

Metrics

Downloads

Download data is not yet available.