Yusupova, Shaxzoda (2026) A Transformer-Based Framework and Preliminary Baseline Experiment for Uzbek Text Classification in Low-Resource NLP. XXI ASRDA INNOVATSION TEXNOLOGIYALAR, FAN VA TAʼLIM TARAQQIYOTIDAGI DOLZARB MUAMMOLAR nomli respublika ilmiy-amaliy konferensiyasi.
To'liq matn arxivda mavjud emas — maqolaning asl manbasiga havola pastda berilgan.Annotatsiya
Natural Language Processing is an important area of artificial intelligence, but many low-resource languages still lack sufficient datasets and optimized models. This paper presents a framework and preliminary baseline experiment for Uzbek text classification. The study focuses on text preprocessing, feature extraction, model selection, and evaluation. Two baseline models, TF-IDF with Logistic Regression and TF-IDF with Support Vector Machine, are used for comparison. The models are evaluated using accuracy, precision, recall, and F1-score. The proposed framework can support future Uzbek NLP applications in education, media, document classification, and automated text processing.
| Hujjat turi: | Maqola |
|---|---|
| Mualliflar: | Muallif Email Yusupova, Shaxzoda UNSPECIFIED |
| Jurnal / Nashr: | XXI ASRDA INNOVATSION TEXNOLOGIYALAR, FAN VA TAʼLIM TARAQQIYOTIDAGI DOLZARB MUAMMOLAR nomli respublika ilmiy-amaliy konferensiyasi |
| Nashriyot: | "XXI ASRDA INNOVATSION TEXNOLOGIYALAR, FAN VA TAʼLIM TARAQQIYOTIDAGI DOLZARB MUAMMOLAR" nomli respublika ilmiy-amaliy konferensiyasi |
| Sana: | 15 May 2026 |
| DOI / ID: | oai:ojs.pkp.sfu.ca:article/18599 |
| Kalit so'zlar: | Natural Language Processing; Uzbek language; text classification; low-resource language |
| Mavzular: | ?? Natural Language Processing ?? ?? Uzbek language ?? ?? text classification ?? ?? low-resource language ?? |
| Eslatma: | Imported from Universal Publishings (OAI id oai:ojs.pkp.sfu.ca:article/18599) |
| URI: | https://arxiv.universalpublishings.com/id/eprint/71847 |
![[pin missing: title]](/style/images/action_view.png)