Improving data entry quality in enterprise applications with NLP methods: a model proposal based on BERT and deep learning

Yükleniyor...
Küçük Resim

Tarih

2025

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

In digital transformation, which is one of the most important keywords of our time, the completeness and accuracy of the data that users enter into applications directly affects the quality of the process, the accuracy of decision-making systems, and the speed at which data turns into information. Incorrect or incomplete data causes many problems such as prolonged approval processes, decreased trust in data, and negative impact on analysis capabilities. In this study, a data validation system was developed to improve the accuracy of risk management data collected from an ERP application and to minimize data entry errors. In order to prevent users from incorrectly entering or confusing important data such as Potential Risk, Internal Control, Control and Impact of the Risk during data entry, it is aimed to ensure accurate data entry by using NLP methods. Within the scope of the study, training was conducted on historical data and errors in user data entry were detected with various classification methods. Different methods such as BERT, RoBERTa, GPT-2, TFIDF+SVM, Word2Vec+SVM, Embedding GRU and Embedding LSTM were used to prevent these errors. The results show that the BERT model achieves the highest success rate with 94% accuracy. The strong language modelling capabilities of BERT gave it a significant advantage over other methods in detecting errors in data input.

Açıklama

Anahtar Kelimeler

BERT, Classification, Data Validation, NLP, Risk Management

Kaynak

IEEE Access

WoS Q Değeri

Q2

Scopus Q Değeri

Q1

Cilt

13

Sayı

Künye