The effectiveness of Euclidean distance and the Mahalanobis distance in the problems of identification of the text origin

Download article in PDF format

Authors: Shumskaya A. O.

Annotation: The article presents the effectiveness of Euclidean metric and the Mahalanobis distance in identification of the text origin. To calculate the metrics we used text features of original texts and texts generated on the basis of original texts. As a generation method the synonymy and Markovian chain method were used.

Keywords: euclidean distance, mahalanobis distance, text, authorship, automatically generated, identifying, text characteristics

Editorial office address

Executive Secretary of the Editor’s Office

 Editor’s Office: 40 Lenina Prospect, Tomsk, 634050, Russia

  Phone / Fax: + 7 (3822) 701-582

  journal@tusur.ru

 

Viktor N. Maslennikov

Executive Secretary of the Editor’s Office

 Editor’s Office: 40 Lenina Prospect, Tomsk, 634050, Russia

  Phone / Fax: + 7 (3822) 51-21-21 / 51-43-02

  vnmas@tusur.ru

Subscription for updates