Enhancing Dongba Pictograph Recognition Using Convolutional Neural Networks and Data Augmentation Techniques

Shihui Li, Lan Thi Nguyen, Wirapong Chansanam*, Natthakan Iam-On, Tossapon Boongoen*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

The recognition of Dongba pictographs presents significant challenges due to the pitfalls in traditional feature extraction methods, classification algorithms’ high complexity, and generalization ability. This study proposes a convolutional neural network (CNN)-based image classification method to enhance the accuracy and efficiency of Dongba pictograph recognition. The research begins with collecting and manually categorizing Dongba pictograph images, followed by these preprocessing steps to improve image quality: normalization, grayscale conversion, filtering, denoising, and binarization. The dataset, comprising 70,000 image samples, is categorized into 18 classes based on shape characteristics and manual annotations. A CNN model is then trained using a dataset that is split into training (with 70% of all the samples), validation (20%), and test (10%) sets. In particular, data augmentation techniques, including rotation, affine transformation, scaling, and translation, are applied to enhance classification accuracy. Experimental results demonstrate that the proposed model achieves a classification accuracy of 99.43% and consistently outperforms other conventional methods, with its performance peaking at 99.84% under optimized training conditions—specifically, with 75 training epochs and a batch size of 512. This study provides a robust and efficient solution for automatically classifying Dongba pictographs, contributing to their digital preservation and scholarly research. By leveraging deep learning techniques, the proposed approach facilitates the rapid and precise identification of Dongba hieroglyphs, supporting the ongoing efforts in cultural heritage preservation and the broader application of artificial intelligence in linguistic studies.

Original languageEnglish
Article number362
Number of pages27
JournalInformation
Volume16
Issue number5
Early online date29 Apr 2025
DOIs
Publication statusPublished - 31 May 2025

Keywords

  • deep learning
  • Dongba pictographs
  • data augmentation
  • convolutional neural network
  • digital humanities
  • image processing
  • image classification

Fingerprint

Dive into the research topics of 'Enhancing Dongba Pictograph Recognition Using Convolutional Neural Networks and Data Augmentation Techniques'. Together they form a unique fingerprint.

Cite this