Softcom Assignment1

Uploaded by

Yousuf ali Safin


COURSE NO: CSE 4114

Course Title: Pattern Recognition and Machine Learning

A Comprehensive Study to Sentiment Analysis of Bangla

Cricket-Related Social Media Comments Using ML and LSTM
Models
Research Participants

Adibul Haque (ID: 20200204029)
Yousuf Ali (ID: 20200204037)
Miftahul Sheikh (ID: 20200204038)

Slide 02
Research Paper Presentation
Outline
01 Abstract
02 Introduction
03 Literature Review
04 Datasets
05 Pre-processing Techniques
06 Methodology
07 Evaluation Metrics
08 Research Gap
09 Conclusion
10 Contribution of group members
11 References

Slide 03
ABSTRACT

• The rise of social media has sadly increased online bullying, harming victims.

• Machine learning models such as LR, KNN, SVM, Random Forest, and XGBoost can detect toxic comments.

• Deep learning models such as LSTM perform exceptionally well in detecting Bengali toxic comments.

Slide 04
INTRODUCTION

• Bullying in cricket communities affects players and fans, both emotionally and socially.

• With social media growth in Bangladesh, online harassment in cricket communities is rising.

• Anonymous trolling makes it hard to control.

• Machine and Deep Learning models effectively detect abusive cricket comments.

Slide 05
Motivation

• The rise of social media in Bangladesh has increased online discussions about cricket.

• Limited research exists on analyzing Bangla-language sentiments regarding cricket.

• We aim to use advanced models to analyze and understand the sentiments of cricket fans in Bangladesh.

Slide 04
LITERATURE REVIEW
EVALUATION OF NAÏVE BAYES AND SUPPORT VECTOR MACHINES ON BANGLA TEXTUAL MOVIE REVIEWS.
• The paper analyzes Bangla movie reviews for sentiment.
• It uses Naive Bayes (NB) and Support Vector Machines (SVM) for polarity detection.
• SVM, with stemmed unigram features, achieved a precision of 0.86.

A DEEP LEARNING APPROACH TO DETECT ABUSIVE BENGALI TEXT.
• Outperformed ANN (81.10%), LinearSVC (75.70%), Logit (75.20%), MNB (73.90%), and RF (70.50%).
• LSTM achieved an accuracy of 91%, better than the other models.

Slide 06
LITERATURE REVIEW
A STUDY TOWARDS BANGLA FAKE NEWS DETECTION USING MACHINE LEARNING AND DEEP LEARNING.
• The study used 57,000 Bangla news items to identify fake news.
• Bi-LSTM models with GloVe and FastText achieved up to 96% accuracy.
• GRU model accuracy was 77%.

CRICKET SENTIMENT ANALYSIS FROM BANGLA TEXT USING RECURRENT NEURAL NETWORK WITH LONG SHORT TERM MEMORY MODEL.
• Applies RNN with LSTM to a Bangla cricket sentiment dataset.
• The LSTM model achieves an accuracy of 95%.
• Support Vector Machine (SVM) achieved an accuracy of 71.03%.

Slide 07
DATASETS

Size: about 3,000 instances, manually labeled (the class counts below sum to 2,979).

Toxic Categories:
• negative: 2152 (72.24%)
• positive: 566 (19.00%)
• neutral: 261 (8.76%)
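
The class shares above can be checked with a few lines of Python. This is only a sketch that recomputes the percentages from the counts listed on the slide; the variable names and the imbalance ratio are illustrative additions, not part of the original study.

```python
# Class counts as listed on the slide.
counts = {"negative": 2152, "positive": 566, "neutral": 261}

total = sum(counts.values())

# Share of each class as a percentage of all labeled instances.
shares = {label: round(100 * n / total, 2) for label, n in counts.items()}

# Ratio of the largest class to the smallest class (illustrative measure of imbalance).
imbalance_ratio = max(counts.values()) / min(counts.values())

print(total)
print(shares)
print(round(imbalance_ratio, 2))
```

The negative class is roughly eight times the size of the neutral class, which is worth keeping in mind when reading accuracy figures for this dataset.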

Slide 09
PRE-PROCESSING TECHNIQUES

01 Cleaning non-Bengali text
02 Tokenization
03 Remove punctuation
04 Remove emoji and URLs
05 Remove stopwords
06 Lemmatization
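
A minimal sketch of these steps in Python, using only the standard library. Several things here are assumptions and not taken from the slides: the stopword list is a tiny hypothetical sample, `lemmatize` is a placeholder that returns the token unchanged, and the steps are reordered slightly (URLs are removed first, and one character filter handles non-Bengali text, punctuation, and emoji together).

```python
import re

# Hypothetical, tiny stopword list for illustration; a real list would be much larger.
STOPWORDS = {"এবং", "ও", "এই", "যে"}

# Matches http(s) links and bare www links.
URL_RE = re.compile(r"https?://\S+|www\.\S+")

# Matches anything that is neither whitespace nor in the Bengali Unicode block
# (U+0980 to U+09FF). This drops Latin text, ASCII punctuation, and emoji.
NON_BENGALI_RE = re.compile(r"[^\u0980-\u09FF\s]")


def lemmatize(token):
    # Placeholder (assumption): a real Bangla lemmatizer would go here.
    return token


def preprocess(comment):
    text = URL_RE.sub(" ", comment)        # remove URLs
    text = NON_BENGALI_RE.sub(" ", text)   # remove non-Bengali text, punctuation, emoji
    tokens = text.split()                  # whitespace tokenization
    tokens = [t for t in tokens if t not in STOPWORDS]  # remove stopwords
    return [lemmatize(t) for t in tokens]


sample = "বাংলাদেশ আজ দারুণ খেলেছে এবং জিতেছে!!! 🏏 https://example.com well played"
tokens = preprocess(sample)
print(tokens)
```

Whitespace splitting is the simplest possible tokenizer; a dedicated Bangla tokenizer may handle punctuation such as the danda differently.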

Slide 10
METHODOLOGY
Dataset → Preprocessing → Feature Extraction → Models → Result Analysis

Feature Extraction:
• TF-IDF
• Bag of Words

Machine Learning Models:
• LR
• KNN
• SVM
• Random Forest
• XGBoost

Deep Learning Model:
• LSTM
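
To make the two feature extraction options concrete, here is a small standard-library sketch. It assumes the plain textbook definitions (term frequency as count divided by document length, inverse document frequency as ln(N / df)); the slides do not say which variant was used, and libraries often apply smoothed versions. The function names and the English placeholder tokens are illustrative.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Return a sorted vocabulary and one raw count vector per document."""
    vocab = sorted({tok for doc in docs for tok in doc})
    vectors = []
    for doc in docs:
        freq = Counter(doc)
        vectors.append([freq[term] for term in vocab])
    return vocab, vectors


def tf_idf(docs):
    """Return the vocabulary and TF-IDF weights (unsmoothed textbook form)."""
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    weights = [
        [(c / len(doc)) * idf[j] for j, c in enumerate(row)]
        for doc, row in zip(docs, counts)
    ]
    return vocab, weights


# Tokenized toy documents (placeholders standing in for preprocessed comments).
docs = [["good", "match"], ["bad", "match"], ["good", "good", "win"]]
vocab, counts = bag_of_words(docs)
_, weights = tf_idf(docs)
print(vocab)
print(counts)
```

Either matrix can then be passed to the classifiers listed above; TF-IDF down-weights terms that appear in many comments, while Bag of Words keeps raw counts.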

Slide 11
Evaluation Metrics
CLASSIFIER            ACCURACY   PRECISION   RECALL
LOGISTIC REGRESSION   n/a        n/a         n/a
SVM                   0.7214     0.6852      0.7215
RANDOM FOREST         0.7065     0.6754      0.7012
KNN                   0.7114     0.6814      0.7114
XGBOOST               0.7449     0.7350      0.7450
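
The slides do not state how precision and recall were averaged across the three classes. Since the recall values nearly match accuracy, support-weighted averaging is a plausible guess, because weighted recall always equals accuracy in single-label classification. The sketch below computes the three metrics under that assumption; the function name and the toy labels are illustrative.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Return (accuracy, weighted precision, weighted recall)."""
    n = len(y_true)
    support = Counter(y_true)      # true instances per class
    predicted = Counter(y_pred)    # predictions per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    accuracy = sum(correct.values()) / n
    precision = 0.0
    recall = 0.0
    for label, count in support.items():
        weight = count / n
        # Per-class precision is taken as 0 when the class is never predicted.
        class_precision = correct[label] / predicted[label] if predicted[label] else 0.0
        class_recall = correct[label] / count
        precision += weight * class_precision
        recall += weight * class_recall
    return accuracy, precision, recall


# Toy labels for illustration only.
y_true = ["negative", "negative", "negative", "positive", "neutral"]
y_pred = ["negative", "negative", "positive", "positive", "negative"]
accuracy, precision, recall = weighted_metrics(y_true, y_pred)
print(accuracy, precision, recall)
```

On an imbalanced dataset like this one, weighted averages are dominated by the majority class, so per-class or macro-averaged scores would give a clearer view of minority-class performance.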

Slide 12
RESEARCH GAP

Paper [1]:
• Limited research on deep learning for Bangla toxic comment classification.
• Larger datasets and advanced models needed for better accuracy.

Paper [2]:
• The paper notes a gap in advanced deep learning models for Bengali abusive text detection.
• It highlights the need for an effective Bengali spelling correction mechanism to improve accuracy.

Paper [3]:
• Research on Bangla fake news detection is limited compared to English, and the lack of tools like NLTK hampers accuracy.
• Enhancing Bangla fake news classification requires better data preprocessing and advanced models.

Paper [4]:
• Limited application of sentiment analysis on Bangla text hinders insights.
• A lack of structured resources for cricket-related Bangla sentiment analysis restricts research.

Slide 13
CONCLUSION

1 Extensive research of ML techniques.

2 Random Forest and Neural Networks are highly accurate.

3 Feature engineering and preprocessing are crucial.

4 Larger datasets and real-world tests are needed.

5 The study suggests future cybersecurity improvements.

Slide 14
Related Papers

01 Nayan Banik and Md Hasan Hafizur Rahman. Evaluation of naïve bayes and support vector machines on bangla textual movie reviews. In 2018 International Conference on Bangla Speech and Language Processing (ICBSLP), pages 1–6. IEEE, 2018.

02 Estiak Ahmed Emon, Shihab Rahman, Joti Banarjee, Amit Kumar Das, and Tanni Mittra. A deep learning approach to detect abusive bengali text. In 2019 7th International Conference on Smart Computing & Communications (ICSCC), pages 1–5. IEEE, 2019.

03 Elias Hossain, Md Nadim Kaysar, Abu Zahid Md Jalal Uddin Joy, Md Mizanur Rahman, and Wahidur Rahman. A study towards bangla fake news detection using machine learning and deep learning. In Sentimental Analysis and Deep Learning: Proceedings of ICSADL 2021, pages 79–95. Springer, 2022.

04 Md Ferdous Wahid, Md Jahid Hasan, and Md Shahin Alom. Cricket sentiment analysis from bangla text using recurrent neural network with long short term memory model. In 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), pages 1–4. IEEE, 2019.

Slide 15
CONTRIBUTION OF GROUP MEMBERS
WRITING REPORT PAPER   SECTIONS                                                 PREPARING PRESENTATION
Yousuf Ali             Abstract, Introduction, Conclusion                       Adib
Adibul Haque           Literature Review, Datasets, References                  Mifta
Miftahul Sheikh        Pre-processing Techniques, Models, Evaluation Metrics    Yousuf

Slide 16
THANK YOU
