eprintid: 60746
rev_number: 2
eprint_status: archive
userid: 1
dir: disk0/00/06/07/46
datestamp: 2026-05-26 15:51:53
lastmod: 2026-06-13 06:39:55
status_changed: 2026-05-26 15:51:53
type: article
metadata_visibility: show
sword_depositor: 1
creators_name: qizi, Sobirova Nazira G‘anijon
title: Automatic Text Normalization in Uzbek: Problems, Tools, and Solutions
ispublished: pub
subjects: Uzbek language, text normalization, natural language processing, artificial intelligence, neural networks, rule-based approach, morphological analysis, BERT, writing systems, linguistic issues
keywords: Uzbek language, text normalization, natural language processing, artificial intelligence, neural networks, rule-based approach, morphological analysis, BERT, writing systems, linguistic issues
note: Imported from MJST Journal (OAI id oai:ojs.pkp.sfu.ca:article/4181)
abstract: In recent years, research in Natural Language Processing (NLP) has increased the demand for automated text analysis across multiple languages, including Uzbek. Uzbek texts are morphologically complex, stylistically diverse, and rich in variant word forms, which poses challenges for automatic analysis. This article focuses on the automatic normalization of Uzbek texts and examines the linguistic and technological issues that arise in the process. Complex morphological structures, words with multiple forms, dialectal variants, differences between the Cyrillic and Latin scripts, and non-standard expressions all complicate normalization. The results of this research contribute to deeper digital processing of the Uzbek language and to improving the quality of systems for machine translation, speech-to-text conversion, and text analysis.
date: 2025-06-19
date_type: published
publisher: Center for Tech and Media Research
official_url: https://mjstjournal.com/index.php/mjst/article/view/4181
id_number: oai:ojs.pkp.sfu.ca:article/4181
full_text_status: public
publication: Multidisciplinary Journal of Science and Technology
citation: qizi, Sobirova Nazira G‘anijon (2025) Automatic Text Normalization in Uzbek: Problems, Tools, and Solutions. Multidisciplinary Journal of Science and Technology.
document_url: https://arxiv.universalpublishings.com/id/eprint/60746/1/9807.pdf