G. 2073  
Page 1  
Global Research journal of Natural Science  
& Technology (GRJNST)  
Volume: 04 - Issue 2 (2026), 2073  
ISSN P: 2790-7643 ISSN E: 2790-7651  
The Hybrid BERT-LSTM Model for the classification Sindhi Text in NLP  
Received: 27 March 2026. Accepted: 3 April 2026. Published: 27 April 2026  
Nimra Memon  
Lecturer, Dept. Computer Science,  
Govt. Girls Degree College Nawabshah  
Shabana  
Lecturer, Dept. Computer Science,  
Govt. Aisha Girls Degree College Nawabshah  
Waqas Ahmed Memon  
Software developer, at auxiliary  
Shahzad Ayaz  
MS English linguistic Scholar,  
Department of English, QUEST Nawab Shah  
Duaa Noor  
MSCS scholar, Department of computer science,  
DSU, Karachi  
Corresponding Author:Duaa Noor*(duaanoorabbasi@gmail.com)  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
Copyright © 2026 GRJNST. This article is published under an Open Access model. It is made available to the public under the terms of the Creative  
Commons Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use and distribution  
G. 2073  
Page 2  
Abstract: The traditional ML models lack to capture the relationship having deep  
semantic nature, while deep learning model alone cannot work better with temporal  
and contextual embeddings. In this context the need of efficient Hybrid approach  
BERT-LSTM for the improvement of the text classification. This study proposes the  
Hybrid approach BERT-LSTM on the sindhi text data. The text data is collected in  
sindhi language from hugging face. The dataset contains the labeled samples of the  
sindhi language text having their predefined classes. Total 150 sentences are used  
for the sindhi text classification. The model performed robust performance results by  
the all-evaluation matrices, which achieved macro-average of 0.88, 0.88 accuracy  
and 0.86 precision and recall 0.85. the significant use of the macro-average because  
it confirms the consistent model predictive ability across the sentimental data textual  
classes. The BERT embeddings provide sustainable granularity in sindhi text syntax  
might provide the miss classification with is shown in sense of minimal dispersion  
off-diagonal cells. This study provides the critical gap in sentimental analysis for the  
sindhi text data by providing the hybrid approach BERT-LSTM model architecture.  
The multilingual BERT is provided to add for feature extraction and for the  
modeling capability for the sequential capability the Bidirectional BERT is used.  
The semantic nuance and the low of sindhi text structural behavior is effectively  
captured by the Hybrid approach.  
Keywords: BERT, LSTM, NLP, sindhi language and Sentiment analysis  
1.1 Introduction:  
The advent of Large Language Models (LLMs) has marked revolutionary advancements in  
artificial intelligence, providing systems with emergent capabilities in sophisticated natural  
language understanding and generation across a multitude of scientific and commercial  
domains [1]. To transition these models from powerful linguistics tools to indispensable  
knowledge workers, however, their functionality must extend beyond simple factoid retrieval  
toward complex, compositional reasoning. Multi-hope question answering (QA) represents  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 3  
the apex of this challenge. Successful multi-hop QA requires the agent to execute a rigorous  
sequence of cognitive steps, including reading comprehension, logical inference, and the  
accurate integration and synthesis of disparate knowledge units. LLMs, particularly those  
replying on sequential generation methods like Chain-of-Thought (CoT), often mimic the  
fast, intuitive decision-making observed in human “System 1” cognition. This inherent  
reliance on sequential, token-level decision, even when augmented by rudimentary planning,  
renders the models highly susceptible accumulating errors [2]. The structural deficiencies  
manifest as specific failure modes, including poor dynamic knowledge adaptability and  
significant knowledge integration errors, such as generating incorrect relational jumps due to  
vector combination inaccuracies during intermediate reasoning steps.  
Figure 1 Applications of NLP  
Crucially, in the context of multi-hop tasks, error propagation is significantly amplified. A  
minor deviation or inaccuracy introduced early in the reasoning chain, weather during initial  
retrieval, content interpretation, or the first logical jump compounds exponentially,  
resulting in catastrophic failures in the final generated output [3]. This fragility demonstrates  
that the limitations in current LLM multi-hop performance are not solely attributable to  
retrieval shortcomings [4]. Empirical analysis reveals that achieving even perfect retrieval  
accuracy, where all necessary contexts is provided, does not eliminate reasoning errors,  
emphasizing that the central challenge lies in the compositional structure and logical  
verification of the inference process itself [5]. Consequently, any viable solution must  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 4  
introduce architectural changes that enforce a verifiable, logical flow to ensure structural  
soundness, rather than simply optimizing the quantity or quality of retrieved information.  
2. Literature Review:  
The BERT model is used for the classification of the text and focuses generative AI Based  
approach in [6]. The study in [7]The retrieval-Augmented Generation (RAG) paradigm  
successfully addresses the fundamental issue of LLM hallucination by grounded generation  
in external, up-to-date knowledge sources [8]. Traditional RAG systems operate through a  
straightforward, fixed sequence: The query is received, relevant documents are retrieved, and  
the documents augment the prompt for generation. While effective for straightforward  
queries, this static, one-shot retrieval and generation model quickly proves inadequate for the  
demands of multi-step reasoning. For complex, multi-hop queries, the RAG pipeline breaks  
down due to its inability to adapt mid-process. Traditional systems are not designed to handle  
complex requirements like dynamic contextual changes, comparison across multiple datasets,  
or iterative refinement of retrieval based on intermediate results [6]. They rely on a single,  
fixed retrieval path and lack the necessary autonomous intervention to address retrieval issues  
or contextual drift that may arise during a multi-step generation task. The empirical difficulty  
of these tasks is substantiated by standardized benchmarks such as HotpotQA and  
2WikiMultiHop, which are explicitly designed to test the model’s ability to synthesize  
evidence across multiple documents [9]. For instance, questions within the HotpotQA dataset  
are constructed such that their resolution necessitates bridging information found in  
introductory paragraphs of two separate Wikipedia articles, requiring the model to  
demonstrate true relational and synthetic reasoning across textual boundaries [10]. This  
requirement for deep, knowledge extensive composition underscores the necessity for  
architectural innovations that can manage and verify complex reasoning trajectories  
effectively [11].  
Agentic Retrieval-Augmented Generation (Agentic RAG) represents that necessary  
architectural evaluation to overcome these static constraints. Agentic RAG integrates  
autonomous AI agents capable of reasoning, goal-driven behavior, dynamic planning, and  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 5  
tool use into the knowledge workflow [12]. In this advanced structure, The RAG mechanism  
is re-contextualized not as a fixed pipeline stage but as a sophisticated tool dynamically  
managed by a planning agent. This dynamic orchestration, facilitate genuine multi-step  
problem-solving[13]. Agentic systems inherently break down complex tasks into smaller,  
executable sub-tasks, allowing the system to adapt its strategy on the fly. This includes  
dynamically selecting and querying multiple, specialized knowledge sources, calling external  
APIs, or iterating retrieval efforts based on ongoing results, achieving a level of flexibility  
and accuracy far exceeding traditional RAG [14]. The efficacy of this dynamic approach is  
demonstrated in state-of-the-art architectures like recursive evaluation and adaptive planning  
(REAP)[15]. REAP employs a dual-module framework. the sub-task planner (SP) and the  
fact extractor (FE)-lined by an explicit, recursive feedback loop. The SP maintains a global  
perspective, actively guiding the overall reasoning trajectory to avoid the local reasoning  
impasses common in myopic, step by step systems. The SP evaluates the task state based on  
the fulfillment level of facts extracted by the FE, creating a mutually reinforcing cycle that  
enables planning and reasoning capabilities. However, the adoption of the Agentic RAG  
paradigm introduces a corresponding complexity and associated computational overhead  
[16]. The necessity for multi-agent collaboration, dynamic tool-calling, and multiple  
recursive LLM interactions significantly increases resource requirements, latency, and  
coordination complexity compared to traditional RAG pipelines[17]. This elevated cost  
mandates that any subsequent optimization, particularly concerning recursive error  
mitigation, must be highly targeted and efficient. The system cannot afford to invoke high-  
latency verification checks after every intermediate step; intervention must be strategic and  
predicted on a high-risk assessment.  
3. Problem Statement:  
The classification of text in NLP required the powerful models which can exhibits and  
perceives the contextual understanding with meaningful and sequential text dependencies.  
The traditional ML models lack to capture the relationship having deep semantic nature,  
while deep learning model alone can not work better with temporal and contextual  
embeddings. In this context the need of efficient Hybrid approach BERT-LSTM for the  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 6  
improvement of the text classification results in terms of accuracy by adding BERTs  
contextual language with LSTM Sequential modeling approach ability.  
4. Objective:  
To propose and design hybrid approach BERT-LSTM for the classification of sindhi  
language text. This extraction of sequential and contextual features from the textual data to  
improve the text classification on sentimental data analysis in sindhi language.  
5. Methodology:  
a. Data collection:  
The text data is collected in sindhi language from hugging face. The data is prepared and  
organized in well-structured manners in single text file. The dataset contains the labeled  
samples of the sindhi language text having their predefined classes. Total 150 sentences are  
used for the sindhi text classification. The dataset view is given below in figure  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
Figure 2 Methodology and work plan of the study  
G. 2073  
Page 7  
Figure 3 Sindhi Language text dataset view  
b. Data preprocessing:  
The data preprocessing is applied to remove symbols and noise which is unnecessarily added  
during data collection phase. The text is converted in to proper input format which is  
acceptable for the tokenization. The BERT model is used for the text tokenization, with  
further added the padding and truncating for the length fixing. The dataset is further spilted  
into testing, training, and validation sets.  
c. Model design:  
The hybrid model is developed for the sindhi language text classification. The BERT is used  
for the text generation and for the embeddings the input text with additional LSTM model is  
further used for the sequential input from the BERT model for capturing temporal and  
sequential dependencies. A dense layer is added with the activation function named SoftMax  
for the prediction of the final class.  
D. Model training:  
The model training is performed with adam optimizer and the categorical cross-entropy is  
used. The hypermeters techniques are used for the better model training as batch size, epoch  
number, learning rate, and units hidden are used to tuned for model optimization. For the  
reducing overfitting dropout or regularization is used.  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 8  
d. Performance evaluation:  
Performance evaluation is provided on the basis of dataset. three classes are negative, neutral  
sentiment and positive. The model training and testing is performed on the basis of the evaluation  
matrices. The evaluation Matrix precision, recall, accuracy and f1 score below in figure 3. The graph  
is generated to Hybrid model performance evaluation on the sindhi text data.  
Figure 4 BERT+LSTM Performance matrix  
The Hybrid model approach BERT-LSTM architecture is evaluated with the 150 balanced  
dataset of sindhi language sentences. The model performed robust performance results by the  
all-evaluation matrices, which achieved macro-average of 0.88, 0.88 accuracy and 0.86  
precision and recall 0.85. the significant use of the macro-average because it confirms the  
consistent model predictive ability across the sentimental data textual classes. As the classes  
are positive, neutral and negative), whereas the rather than minority classes or majority  
classes. This shows the Hybrid model approach is providing the good results and having  
potential training phases biases, due to the LSTM bidirectional layer, for the sequential input  
data patterns which influences on the data patterns.  
e. Confusion matrix:  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 9  
The performance evaluation is achieved by the finding confusion matrix also in this study to  
check the correct prediction classes. The figure 4 below shows the negative, predictive and  
pos with actual vs predicted confusion matrix values. The hybrid Model BERT+LSTM shows  
the results in confusion matrix graph as give below in figure 4.  
Figure 5 Confusion matrix of Hybrid Model BERT-LSTM Model  
The figure 4 further shows the insight of the models Classification ability with accuracy and  
its patters of errors. The diagonal concentration values ensure the sentiment instances are  
classified correctly with majority of the sentiment. The model’s ability to distinguish the  
classes as neutral and negative with sentiment of the text, is very critical challenge with weak  
resources in NLP. The BERT embeddings provide sustainable granularity in sindhi text  
syntax might provide the miss classification with is shown in sense of minimal dispersion off-  
diagonal cells. The misclassification of instances is primarily prohibited, where sentence’s  
structure is short length, where as sequential context is limited with availability to the layer of  
LSTM.  
5. The Synergistic efficiency of the Hybrid architecture:  
The experiment gains the optimized performance to the synergistic approach between the  
relationships of the BERT and LSTM components:  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 10  
a.  
Contextual extraction of the features (BERT): this is  
achieved by the utilizing the bert-base-multingual-cased, the model capitalized the  
linguistics pretend knowledge, which extracting the contextual information and  
features which aware the sindhi language morphological nature.  
b.  
Sequential pattern Modeling (LSTM):  
The token level representation is captured by the BERT and sequential dependencies are  
captured and model effectively by the bidirectional LSTM model layers. The sentiment  
“trajectory-capturing sentiment across the sentences shift are model to maintain, which  
provide the advantage for the standard linear head of classification.  
c.  
overfitting mitigation:  
The given data size of 150 sentences, which provides the overfitting risk. The regularization  
techniques implementation termed as BERT provides the critical layer freezing and early  
stopping. The BERT model pretrained freezing weights, the trainable parameters are  
restricted to the LSTM and head for the classification. This ensures the structural nuances of  
the model which is prioritized the learning process effectively towards the sentiment bearing  
sequences, having without altering the robust, and pretrained semantic understanding which  
is multilingual BERT backing is provided to work with it.  
6. Conclusion:  
This study provides the critical gap in sentimental analysis for the sindhi text data by  
providing the hybrid approach BERT-LSTM model architecture. The multilingual BERT is  
provided to add for feature extraction and for the modeling capability for the sequential  
capability the Bidirectional BERT is used. The semantic nuance and the low of sindhi text  
structural behavior is effectively captured by the Hybrid approach.  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 11  
References:  
[1]  
M. Alawida, S. Mejri, A. Mehmood, B. Chikhaoui, and O. Isaac Abiodun, “A  
Comprehensive Study of ChatGPT: Advancements, Limitations, and Ethical  
Considerations in Natural Language Processing and Cybersecurity,” Inf., vol. 14, no. 8,  
2023, doi: 10.3390/info14080462.  
[2]  
T. Adão, A. Chojka, D. Pascoal, N. Silva, R. Morais, and E. Peres, “Synthetic Data-  
Driven Methods to Accelerate the Deployment of Deep Learning Models: A Case  
Study on Pest and Disease Detection in Precision Viticulture,” Computers, vol. 14, no.  
8, pp. 125, 2025, doi: 10.3390/computers14080327.  
[3]  
[4]  
[5]  
[6]  
A. Katharria, K. Rajwar, M. Pant, J. D. Velásquez, V. Snášel, and K. Deep,  
“Information Fusion in Smart Agriculture: Machine Learning Applications and Future  
A. Kovari, “A systematic review of AI-powered collaborative learning in higher  
education: Trends and outcomes from the last decade,” Soc. Sci. Humanit. Open, vol.  
11, no. August 2024, p. 101335, 2025, doi: 10.1016/j.ssaho.2025.101335.  
Y. Lei, J. Li, Z. Li, Y. Cao, and H. Shan, “Prompt learning in computer vision: a  
survey,” Front. Inf. Technol. Electron. Eng., vol. 25, no. 1, pp. 4263, 2024, doi:  
10.1631/fitee.2300389.  
E. Mikołajewska, D. Mikołajewski, T. Mikołajczyk, and T. Paczkowski, “Generative  
AI in AI-Based Digital Twins for Fault Diagnosis for Predictive Maintenance in  
Industry  
4.0/5.0,”  
Appl.  
Sci.,  
vol.  
15,  
no.  
6,  
pp.  
122,  
2025,  
doi:  
10.3390/app15063166.  
[7]  
S. Sandiwarno, D. I. Sensuse, H. B. Santoso, D. S. Hidayat, A. S. Nyamawe, and A.  
Yousif, “E-SATNet: Evaluating Student Satisfaction with Lecturer Responses in  
Asynchronous Online Discussions Using Sentiment and Semantic Similarity  
Analysis,” Big Data Cogn. Comput., vol. 9, no. 9, pp. 140, 2025, doi:  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 12  
10.3390/bdcc9090228.  
[8]  
[9]  
K. W. Church, J. Sun, R. Yue, P. Vickers, W. Saba, and R. Chandrasekar, “Emerging  
trends: A gentle introduction to RAG,” Nat. Lang. Eng., vol. 30, no. 4, pp. 870881,  
2024, doi: 10.1017/S1351324924000044.  
S. Arora, V. Kumar, and T. Sabo, “Designing Software With Artificial,” vol. 08, no.  
03, pp. 839846, 2021.  
[10] P. Weber, K. V. Carl, and O. Hinz, Applications of Explainable Artificial Intelligence  
in Financea systematic review of Finance, Information Systems, and Computer  
Science literature, vol. 74, no. 2. Springer International Publishing, 2024. doi:  
10.1007/s11301-023-00320-0.  
[11] I. Zafat, A. Iqbal, M. Khan, N. Ahmad, and M. Ali Alshara, “GenIIoT: Generative  
Models Aided Proactive Fault Management in Industrial Internet of Things,” Inf., vol.  
16, no. 12, pp. 127, 2025, doi: 10.3390/info16121114.  
[12] W. Zita, S. Abou El Faouz, M. Alayedi, and E. E. Elsayed, “A Hybrid Bayesian  
Machine Learning Framework for Simultaneous Job Title Classification and Salary  
Estimation,” Symmetry  
(Basel)., vol.  
17,  
no. 8, pp. 124, 2025, doi:  
10.3390/sym17081261.  
[13] A. C. Review, “Retrieval-Augmented Generation ( RAG ) in Healthcareꢀ:,” pp. 1–29,  
2025.  
[14] S. M. D. A. C. Jayatilake and G. U. Ganegoda, “Involvement of Machine Learning  
Tools in Healthcare Decision Making,” J. Healthc. Eng., vol. 2021, 2021, doi:  
10.1155/2021/6679512.  
[15] F. Cuconasu et al., “The Power of Noise: Redefining Retrieval for RAG Systems,”  
SIGIR 2024 - Proc. 47th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., no. July 2024, pp.  
719729, 2024, doi: 10.1145/3626772.3657834.  
[16] M. Arslan, H. Ghanem, S. Munawar, and C. Cruz, “A Survey on RAG with LLMs,”  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073  
G. 2073  
Page 13  
Procedia  
Comput.  
Sci.,  
vol.  
246,  
no.  
C,  
pp.  
37813790,  
2024,  
doi:  
10.1016/j.procs.2024.09.178.  
[17] C. Krupitzer, “Generative artificial intelligence in the agri-food value chain -  
overview, potential, and research challenges,” Front. Food Sci. Technol., vol. 4, no.  
September, pp. 16, 2024, doi: 10.3389/frfst.2024.1473357.  
GRJNST, Volume: 04 - Issue 2 (2026) / ISSN P: 2790-7643  
Article ID: 2073