Журнал «Современная Наука»

Russian (CIS)English (United Kingdom)
MOSCOW +7(495)-142-86-81

USING GENERATIVE ALGORITHMS TO GENERATE DOCUMENTS

Kasymov Alexey Alekseevich  (Postgraduate student, Voronezh State Technical University, Voronezh, Russia)

Maximov Yuri Maksimovich  (PhD student, Voronezh State Technical University, Voronezh, Russia)

This article provides a brief overview of the latest text classification models with an emphasis on data flow, from raw text to output labels. The differences between earlier methods and later methods based on deep learning are emphasized, both in their functioning and in how they transform input data. To give a better idea of text classification, an overview of the data sets for the language is provided, as well as instructions for synthesizing two new data sets with multiple labels. At the end, we describe an overview of new experimental results and discuss the problems of open research related to language models based on deep learning.

Keywords:text classification; tokenization; topic labeling; news classification; transformer; surface learning; deep learning; multicomponent corpora

 

Read the full article …



Citation link:
Kasymov A. A., Maximov Y. M. USING GENERATIVE ALGORITHMS TO GENERATE DOCUMENTS // Современная наука: актуальные проблемы теории и практики. Серия: Естественные и Технические Науки. -2023. -№09. -С. 70-76 DOI 10.37882/2223-2966.2023.09.09
LEGAL INFORMATION:
Reproduction of materials is permitted only for non-commercial purposes with reference to the original publication. Protected by the laws of the Russian Federation. Any violations of the law are prosecuted.
© ООО "Научные технологии"