Abstract
Deep learning such as Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) have been successfully applied to Natural Language Processing (NLP). Previous research shows that RNN and CNN achieved significant results in extracting long and short-term dependencies. However, there is a lack of research on Persian/Dari due to low resources and with many difficulties in preprocessing. In this research paper, we proposed a novel sentiment analysis model using bidirectional GRU and CNN to address the challenges and improve sentiment knowledge in the Persian/Dari language. Firstly, a preprocessor is utilized to enhance data quality by eliminating noise. Secondly, a bidirectional Gated Recurrent Unit is used to capture long-term dependencies. Thirdly, our model(named PersiSentNet) employs CNN with max pooling to extract contextual features and reduce feature dimensionality. Gaussian Noise and Dropout are applied as regularization techniques to avoid the overfitting problem. Finally, the model’s performance is verified on two standard datasets, and the experimental results indicate that our model significantly outperforms the state-of-the-art models, highlighting its robustness and effectiveness in addressing the challenges of Persian/Dari language sentiment analysis.