• Original research article
  • October 31, 2023
  • Open access

Literary mystifications and the authorial use of numerals

Abstract

This study pertains to stylometry. There are cases when a writer who has achieved fame, for various reasons, begins to create under a different name, attempts to write in a different manner and sometimes achieves success again in a new incarnation. The aim of the study is to test the feasibility of intentionally making significant changes to an author’s literary style. Numerals present in the texts by a particular author are used as a style marker. Examples from English, French and Russian literature demonstrate that the use of numerals is a literary ‘fingerprint’ that manifests in all or most of sufficiently long texts by that author. The obtained results show that, contrary to an author’s attempts to write in a ‘new’ way, the usage of numerals is conservative and allows for the recognition of fictitious authorship. This conclusion is drawn based on the analysis of works by R. Gary and B. Akunin (G. Chkhartishvili), who are known for their literary hoaxes. The analysis of numerals usage is also applied to the issue of authorship regarding Harper Lee’s novel ‘To Kill a Mockingbird’. Conclusions about the similarity/difference of literary styles are made based on hierarchical cluster analysis and are supported by the Pearson chi-squared test. The scientific originality of the paper lies in taking a new approach to the search for a literary ‘fingerprint’ and text attribution.

References

  1. Artinian A. Maupassant Criticism in France, 1880-1940, with an Inquiry into His Present Fame and a Bibliography. N. Y.: Kings Crown Press, 1941.
  2. Benford F. The Law of Anomalous Numbers // Proceedings of the American Philosophical Society. 1938. Vol. 78. No. 4.
  3. Boisen J. Un Picaro métaphysique: Romain Gary et l’art du roman. Odense: Odense University Press, 1996.
  4. Brocardo M. L., Traore I., Woungang I., Obaidat M. S. Authorship Verification Using Deep Belief Network Systems // International Journal of Communication Systems. 2017. Vol. 30. Iss. 12. https://doi.org/10.1002/dac.3259
  5. Burns B. D. Do People Fit to Benford’s Law, or Do They Have a Benford Bias? 2020. https://cognitivesciencesociety.org/cogsci20/papers/0379/index.html
  6. Choiński M., Eder M., Rybicki J. Harper Lee and Other People: A Stylometric Diagnosis // Mississippi Quarterly. 2017/2018. Vol. 70/71. No. 3.
  7. Dugan J. R. Illusion and Reality, A Study of Descriptive Techniques in the Works of Guy de Maupassant. Berlin – Boston: Mouton, 1973.
  8. Hocus Bogus. Romain Gary Writing as Émile Ajar / transl. by D. Bellos. New Haven – L.: Yale University Press, 2010.
  9. Hungerbühler N. Benfords Gesetz über führende Ziffern: wie die Mathematik Steuersündern das Fürchten lehrt. 2007. https://ethz.ch/content/dam/ethz/special-interest/dual/educeth-dam/documents/Unterrichtsmaterialien/mathematik/Benfords%20Gesetz%20über%20führende%20Ziffern%20(Artikel)/benford.pdf
  10. Koppel M., Winter Y. Determining if Two Documents Are Written by the Same Author // Journal of the Association for Information Science and Technology. 2014. Vol. 65. No. 1.
  11. Lloyd C. Guy de Maupassant. L.: Reaktion Books, 2020.
  12. Moisl H. Cluster Analysis for Corpus Linguistics. Berlin – München – Boston: De Gruyter Mouton, 2015.
  13. Poier-Bernhard A. Romain Gary – das brennende Ich: literaturtheoretische Implikationen eines Pseudonymenspiels. Tübingen: Niemeyer, 1996.
  14. Shields C. J. Mockingbird: A Portrait of Harper Lee: From Scout to Go Set a Watchman. 2nd ed. N. Y.: Henry Holt and Co., 2016.
  15. Stamatatos E. A Survey of Modern Authorship Attribution Methods // Journal of the American Society for Information Science and Technology. 2009. Vol. 60. No. 3.
  16. Tempestt N., Kalaivani S., Aneez F., Yiming Y., Yingfei X., Damon W. Surveying Stylometry Techniques and Applications // ACM Computing Surveys. 2017. Vol. 50. No. 6.
  17. Zenkov A. V. A Method of Text Attribution Based on the Statistics of Numerals // Journal of Quantitative Linguistics. 2018. Vol. 25. No. 3.
  18. Zenkov A. V. Stylometry and Numerals Usage: Benford’s Law and Beyond // Stats. 2021. Vol. 4.
  19. Zenkov A., Místecký M. Young Vladimír Vašek? – A Numerals Analysis Contribution to the Bezruč-Hrzánský Identity Issue // Naše řeč. 2022. Vol. 105. No. 3.
  20. Zenkov A. V., Místecký M. The Romantic Clash: Influence of Karel Sabina over Macha’s Cikani from the Perspective of the Numerals Usage Statistics // Glottometrics. 2019. Vol. 46.

Funding

The reported study was funded by the Russian Science Foundation, grant No. 23-28-00750, https://rscf.ru/project/23-28-00750/, the project “Development of a new method of stylometry based on statistics of the use of numerals in authorial texts”.

Author information

Andrei Viacheslavovich Zenkov

PhD

Ural Federal University, Ekaterinburg

About this article

Publication history

  • Received: September 21, 2023.
  • Published: October 31, 2023.

Keywords

  • стилометрия
  • стилеметрия
  • квантитативная лингвистика
  • атрибуция текстов
  • числительные в тексте
  • stylometry
  • quantitative linguistics
  • text attribution
  • numerals in the text

Copyright

© 2023 The Author(s)
© 2023 Gramota Publishing, LLC

User license

Creative Commons Attribution 4.0 International (CC BY 4.0)