Brief History of NLP
o 1950s: Alan Turing proposed the Turing Test to evaluate a
machine's ability to exhibit intelligent behavior equivalent to,
or indistinguishable from, that of a human.
o 1960s: Development of early NLP systems such as ELIZA, a
computer program by Joseph Weizenbaum that simulated
conversation.
o 1970s-1980s: Introduction of rule-based systems like
SHRDLU and the development of the Chomsky hierarchy in
linguistics.
o 1990s: Statistical approaches began to dominate NLP,
utilizing probabilistic models to handle large corpora of text
data.
o 2000s: The rise of machine learning techniques led to
significant advancements in NLP, such as statistical
machine translation and machine-learned text classification.
o 2010s-Present: Development of powerful deep learning
models like Word2Vec, GloVe, BERT, and GPT, which
have revolutionized NLP tasks such as language translation,
sentiment analysis, and text generation.
Language Challenges in NLP
o Ambiguity: Words and sentences can have multiple
meanings.
o Example: "The farmer went to the bank." (Is "bank"
referring to the side of a river or a financial institution?)
o Context: Understanding the context is crucial for accurate
interpretation.
Example: "He banked the plane" vs. "He went to the
bank."
o Sarcasm and Irony: Detecting sarcasm and irony can be
challenging.
Example: "Oh, great! Another homework
assignment."
o Diverse Syntax and Grammar: Different languages have
different syntax and grammar rules.
Example: English uses Subject-Verb-Object (SVO)
order, "She eats an apple," while Japanese uses
Subject-Object-Verb (SOV) order: "Kanojo wa
ringo o taberu" (literally "she apple eats").
o Idioms and Phrases: Recognizing and interpreting
idiomatic expressions.
Example: "Kick the bucket" meaning "to die."
Applications of NLP
o Machine Translation: Translating text from one language
to another.
Example: Google Translate translating "Hello, world!"
into Spanish as "¡Hola, mundo!"
o Sentiment Analysis: Determining the sentiment (positive,
negative, neutral) of a text.
Example: Analyzing product reviews to determine
customer satisfaction.
o Chatbots: Automated systems that interact with users via
text or speech.
Example: Customer support chatbots like those used
by banks or online retailers.
o Information Retrieval: Extracting relevant information
from large datasets.
Example: Search engines like Google retrieving
relevant web pages based on user queries.
o Speech Recognition: Converting spoken language into text.
Example: Voice assistants like Siri, Alexa, and
Google Assistant.
Classical vs. Statistical vs. Deep Learning-based NLP
Classical NLP:
o Rule-based Approaches: Utilize hand-crafted rules to
process language.
Example: Parsing sentences using grammar rules.
o Manual Feature Engineering: Involves defining specific
linguistic features for analysis.
Example: Identifying parts of speech (POS) using
predefined rules.
Statistical NLP:
o Probabilistic Models: Use statistical methods to model and
predict language patterns.
Example: Hidden Markov Models (HMMs) for POS
tagging.
o Large Amounts of Data: Relies on extensive corpora to
learn patterns.
Example: Using n-grams to predict the next word in a
sentence.
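The n-gram idea above can be sketched in a few lines: count which word follows which in a corpus, then predict the most frequent successor. The tiny corpus below is purely illustrative.

```python
from collections import Counter, defaultdict

# Tiny bigram language model: count word pairs in a toy corpus,
# then predict the most likely next word (corpus is illustrative only).
corpus = "the cat sat on the mat the cat ate the fish".split()

bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def predict_next(word):
    """Return the most frequent word following `word` in the corpus."""
    if word not in bigrams:
        return None
    return bigrams[word].most_common(1)[0][0]

print(predict_next("the"))  # "cat" follows "the" twice, more than any other word
```

Real statistical systems use much larger corpora, higher-order n-grams, and smoothing to handle unseen word sequences.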
Deep Learning-based NLP:
o Neural Networks: Employ deep neural networks to learn
from raw text data.
Example: Recurrent Neural Networks (RNNs) for
sequence prediction.
o End-to-End Learning: Models can learn to perform tasks
directly from data without explicit feature engineering.
Example: Transformers like BERT and GPT for
various NLP tasks.
Basic Concepts in Linguistic Data Structure
Morphology:
o Study of word structure and formation.
o Example: Analyzing the root, prefix, and suffix of words
like "unhappiness" (un- + happy + -ness).
Syntax:
o Rules that govern sentence structure.
o Example: English follows Subject-Verb-Object (SVO)
order: "She (S) loves (V) music (O)."
Semantics:
o Meaning of words and sentences.
o Example: Understanding that "bark" can refer to the sound a
dog makes or the outer covering of a tree.
Pragmatics:
o Contextual use of language.
o Example: Interpreting "Can you pass the salt?" as a request
rather than a question about ability.
Tokenized Text and Pattern Matching
o Tokenization: Splitting text into individual tokens (words or
sentences).
o Example:
Input Text: "Natural Language Processing is
fascinating."
Tokenized Text: ['Natural', 'Language', 'Processing',
'is', 'fascinating', '.']
Explanation: The sentence is divided into individual
words and punctuation marks.
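A minimal tokenizer can be sketched with a regular expression that captures runs of word characters or single punctuation marks; production libraries such as NLTK and spaCy offer far more robust tokenizers.

```python
import re

# Minimal regex tokenizer sketch: splits a sentence into word tokens
# and punctuation tokens.
def tokenize(text):
    return re.findall(r"\w+|[^\w\s]", text)

print(tokenize("Natural Language Processing is fascinating."))
# ['Natural', 'Language', 'Processing', 'is', 'fascinating', '.']
```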
o Pattern Matching: Identifying patterns within tokenized
text using regular expressions.
o Example:
Input Text: "The quick brown fox jumps over the
lazy dog."
Pattern: Words with exactly 4 letters.
Matched Words: ['over', 'lazy']
Explanation: The pattern identifies words that are
exactly four letters long within the sentence.
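The four-letter-word pattern above corresponds to a word-boundary regular expression:

```python
import re

# Find words of exactly four letters using word boundaries (\b) around
# a run of exactly four alphabetic characters.
text = "The quick brown fox jumps over the lazy dog."
four_letter = re.findall(r"\b[a-zA-Z]{4}\b", text)
print(four_letter)  # ['over', 'lazy']
```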
Recognizing Names
o Named Entity Recognition (NER): Identifies proper nouns
and classifies them as people, organizations, etc.
o Example:
Input Text: "Barack Obama was the 44th President of
the United States."
Recognized Entities:
'Barack Obama' as PERSON
'44th' as ORDINAL
'United States' as GPE (Geopolitical Entity)
Explanation: The NER system identifies and
categorizes names and titles within the text.
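Real NER systems (e.g. spaCy or Stanford NER) use statistical or neural models; the sketch below is only a toy gazetteer lookup, illustrating the input/output shape of the task rather than an actual NER algorithm.

```python
# Toy gazetteer-based NER sketch: a dictionary of known entity names
# mapped to their labels. Real NER generalizes to unseen names.
GAZETTEER = {
    "Barack Obama": "PERSON",
    "United States": "GPE",
}

def find_entities(text):
    """Return (entity, label) pairs for gazetteer entries found in the text."""
    return [(name, label) for name, label in GAZETTEER.items() if name in text]

text = "Barack Obama was the 44th President of the United States."
print(find_entities(text))
# [('Barack Obama', 'PERSON'), ('United States', 'GPE')]
```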
Stemming and Lemmatization
Stemming:
o Reduces words to their base form by removing prefixes or
suffixes.
o Example:
Input Words: ['running', 'jumps', 'easily', 'fairly']
Stemmed Words: ['run', 'jump', 'easili', 'fairli']
Explanation: The words are reduced to their root
forms, which may not always be meaningful.
Lemmatization:
o Reduces words to their meaningful base form using
vocabulary and morphological analysis.
o Example:
Input Words: ['running', 'jumps', 'easily', 'fairly']
Lemmatized Words: ['run', 'jump', 'easy', 'fair']
Explanation: The words are reduced to their base or
dictionary forms, ensuring they remain meaningful.
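Suffix stripping is the core idea behind stemming. The sketch below is a toy stemmer, not the Porter algorithm (available via NLTK's PorterStemmer), so its outputs differ slightly from the Porter results shown above; lemmatization additionally consults a vocabulary such as WordNet.

```python
# Toy suffix-stripping stemmer sketch: strips one common suffix and
# undoes consonant doubling (e.g. "runn" -> "run"). Illustrative only.
def simple_stem(word):
    for suffix in ("ing", "ly", "ies", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[: -len(suffix)]
            break
    # Collapse a doubled final consonant left behind by suffix removal.
    if len(word) >= 2 and word[-1] == word[-2] and word[-1] not in "aeioulsz":
        word = word[:-1]
    return word

words = ["running", "jumps", "easily", "fairly"]
print([simple_stem(w) for w in words])  # ['run', 'jump', 'easi', 'fair']
```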
Tagging Parts of Speech
o POS Tagging: Assigns part-of-speech tags to each word in a
sentence.
o Example:
Input Text: "The quick brown fox jumps over the
lazy dog."
POS Tags: [('The', 'DT'), ('quick', 'JJ'), ('brown', 'JJ'),
('fox', 'NN'), ('jumps', 'VBZ'), ('over', 'IN'), ('the', 'DT'),
('lazy', 'JJ'), ('dog', 'NN')]
Explanation: Each word is tagged with its
corresponding part of speech, such as determiner (DT),
adjective (JJ), noun (NN), verb (VBZ), and
preposition (IN).
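Modern POS taggers are statistical (e.g. NLTK's averaged perceptron tagger); the sketch below uses a hand-built lookup table, in the spirit of the classical rule-based approach, to reproduce the example's input/output shape.

```python
# Toy lexicon-based POS tagger sketch: look each token up in a small
# hand-built lexicon, defaulting unknown words to NN (noun).
LEXICON = {
    "the": "DT", "quick": "JJ", "brown": "JJ", "lazy": "JJ",
    "fox": "NN", "dog": "NN", "jumps": "VBZ", "over": "IN",
}

def pos_tag(tokens):
    """Tag each token using the lexicon; case-insensitive lookup."""
    return [(tok, LEXICON.get(tok.lower(), "NN")) for tok in tokens]

tokens = "The quick brown fox jumps over the lazy dog".split()
print(pos_tag(tokens))
# [('The', 'DT'), ('quick', 'JJ'), ('brown', 'JJ'), ('fox', 'NN'),
#  ('jumps', 'VBZ'), ('over', 'IN'), ('the', 'DT'), ('lazy', 'JJ'),
#  ('dog', 'NN')]
```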
Constituent Structure
o Constituent Structure Analysis: Breaks down sentences
into their sub-parts (constituents).
o Example:
Input Text: "The quick brown fox jumped over the
lazy dog."
Constituent Structure:
Sentence (S)
Noun Phrase (NP): "The quick brown
fox"
Determiner (DT): "The"
Adjectives (JJ): "quick", "brown"
Noun (NN): "fox"
Verb Phrase (VP): "jumped over the lazy
dog"
Verb (VBD): "jumped"
Prepositional Phrase (PP): "over
the lazy dog"
Preposition (IN): "over"
Noun Phrase (NP): "the lazy
dog"
Determiner (DT):
"the"
Adjective (JJ): "lazy"
Noun (NN): "dog"
Explanation: The sentence is parsed into a
hierarchical structure, showing the relationships
between words and phrases.
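The hierarchy above can be represented directly as a nested data structure. The sketch below encodes the parse as nested tuples of the form (label, children...), with leaves as plain strings, and walks the tree to recover the sentence; libraries like NLTK provide a richer Tree class for the same purpose.

```python
# Constituency tree sketch as nested tuples: (label, child, child, ...),
# mirroring the bracketed structure in the example above.
tree = ("S",
        ("NP", ("DT", "The"), ("JJ", "quick"), ("JJ", "brown"), ("NN", "fox")),
        ("VP", ("VBD", "jumped"),
               ("PP", ("IN", "over"),
                      ("NP", ("DT", "the"), ("JJ", "lazy"), ("NN", "dog")))))

def leaves(node):
    """Collect the words at the leaves of the tree, left to right."""
    if isinstance(node, str):
        return [node]
    words = []
    for child in node[1:]:  # node[0] is the constituent label
        words.extend(leaves(child))
    return words

print(" ".join(leaves(tree)))  # The quick brown fox jumped over the lazy dog
```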