Long Short-Term Memory (LSTM) for Sentiment Analysis
1. Why is text preprocessing necessary before training the RNN and LSTM models?
2. What is the purpose of tokenization in the experiment?
3. Why are sequences padded to a fixed maximum length?
4. What role does the Embedding layer play in the models?
5. Why is binary cross-entropy used as the loss function in this experiment?
6. What does a confusion matrix help evaluate in sentiment analysis?
7. Why are ROC and Precision–Recall curves plotted in this experiment?
8. Why does the LSTM model generally perform better than a Simple RNN in this experiment?
9. What does validation accuracy indicate during training?
10. What is the final objective of comparing Simple RNN and LSTM models in this experiment?