0% found this document useful (0 votes)

7 views24 pages

Introduction

The document outlines a course on Natural Language Processing (NLP) taught by Sourav Kumar Dandapat, detailing class timings, evaluation plans, and the textbook used. It introduces the concept of NLP, its significance in AI, and various applications such as chatbots and machine translation. Additionally, it covers the components of NLP, including lexical, syntactic, and semantic processing, along with challenges like ambiguity and a brief history of the field.

Uploaded by

Gadhethariya Dinesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views24 pages

Introduction

Uploaded by

Gadhethariya Dinesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Natural Language

Processing (NLP)
Regarding Course
Instructor: Sourav Kumar Dandapat(Sourav@iitp.ac.in)
Teaching Assistant:
◦ Arpan Phukan (arpan_2121cs33@iitp.ac.in)
◦ Sudhir Kumar (sudhir_2221cs14@iitp.ac.in)
◦ Pankaj Kumar Paswan (pankaj_2411ai49@iitp.ac.in)
◦ Suman Hazra (suman_2411ai06@iitp.ac.in)
◦ Tanmay Pawar (tanmay_2411ai07@iitp.ac.in)
◦ Aman Kumar (aman_2411ai53@iitp.ac.in)
Class Timing:
◦ Wednesday (12 pm-12.55 pm)
◦ Thursday (9 am-9.55 am)
◦ Friday (10 am-10.55 am)
Course Page: 10.22.10.100/~sourav/nlp_autumn_2025/
Evaluation Plan:
◦ 30% class evaluation
◦ 30% Mid sem
◦ 40% End sem
Tentative Dates for Quizes
◦ 2 quizzes before midsem (19/08, 16/09) [best of 2]
◦ 2 quizzes/presentation/projects after midsem [ will be announced]
Text Book
◦ Speech and Language Processing (Daniel Jurafsky)
Introduction
ØNLP stands for Natural Language Processing, which is a part of Computer Science, Human
language, and Artificial Intelligence.
ØHuman communicate through some form of language either by text or speech.
ØTo make interactions between computers and humans, computers need to understand natural
languages used by humans.
ØNatural language processing is all about making computers learn, understand, analyse,
manipulate and interpret natural(human) languages.
ØProcessing of Natural Language is required when you want an intelligent system like robot to
perform as per your instructions, when you want to hear decision from a dialogue based clinical
expert system, etc.
ØThe ability of machines to interpret human language is now at the core of many applications
that we use every day - chatbots, Email classification and spam filters, search engines, grammar
checkers, voice assistants, and social language translators.
ØThe input and output of an NLP system can be Speech or Written Text
Why Natural Language Processing (NLP)
Natural Language Processing (NLP) is one of the hottest areas of artificial intelligence (AI) thanks
to applications like
text generators that compose coherent essays,
chatbots that fool people into thinking they’re scientiest,
and text-to-image programs that produce photorealistic images of anything you can describe.
Recent years have brought a revolution in the ability of computers to understand human
languages, programming languages, and even biological and chemical sequences, such as DNA
and protein structures, that resemble language.
The latest AI models are unlocking these areas to analyze the meanings of input text and
generate meaningful, expressive output.
The process of computer analysis of input provided in a human language (natural language), and conversion of
this input into a useful form of representation.
The field of NLP is primarily concerned with getting computers to perform useful and interesting tasks with
human languages.
Some salient points
•Makes human-computer interaction more natural

•Powers translation, search engines, chatbots, and

•Helps analyze massive amounts of text

Real-World Applications
•Chatbots & virtual assistants (e.g., Siri, Alexa)
•Machine translation (e.g., Google Translate)
•Sentiment analysis (e.g., social media monitoring)
•Text summarization
Forms of Natural Language
The input/output of a NLP system can be:
◦ written text
◦ speech
We will mostly concerned with written text (not speech).
To process written text, we need:
◦ lexical, syntactic, semantic knowledge about the language
◦ discourse information, real world knowledge
To process spoken language, we need everything required
to process written text, plus the challenges of speech
recognition and speech synthesis.
Components of NLP
Natural Language Understanding
◦ Mapping the given input in the natural language into a useful representation.
◦ Different level of analysis required:
morphological analysis,
syntactic analysis,
semantic analysis,
discourse analysis, …
Natural Language Generation
◦ Producing output in the natural language from some internal representation.
◦ Different level of synthesis required:
deep planning (what to say),
syntactic generation
Why NL Understanding is hard?
Natural language is extremely rich in form and structure, and very ambiguous.
◦ How to represent meaning

One input can mean many different things. Ambiguity can be at different levels.
◦ Lexical (word level) ambiguity -- different meanings of words
◦ Syntactic ambiguity -- different ways to parse the sentence
◦ Interpreting partial information -- how to interpret pronouns
◦ Contextual information -- context of the sentence may affect the meaning of that sentence.
Many input can mean the same thing.
Interaction among components of the input is not clear.
Knowledge of Language
Phonology – concerns how words are related to the sounds that realize them.

Morphology – concerns how words are constructed from more basic meaning
units called morphemes. A morpheme is the primitive meaning bearing unit of a
language.

Syntax – concerns how words can be put together to form correct sentences
and determines what structural role each word plays in the sentence and what
phrases are subparts of other phrases.

Semantics – concerns what words mean and how these meaning combine in
sentences to form a meaningful sentence. The study of context-independent
meaning.
Pragmatics – concerns how sentences are used in different
situations/contexts and how use affects the interpretation of the
sentence.

Discourse – concerns how the immediately preceding sentences

affect the interpretation of the next sentence. For example, interpreting
pronouns and interpreting the temporal aspects of the information.

World Knowledge – includes general knowledge about the world. What

each language user must know about the other’s beliefs and goals.
The process of text analytics involves three stages as given below:
Lexical processing: In this stage, we do basic text pre-processing and text
cleaning such as tokenization, stemming, lemmatization, correcting spellings, etc.
Syntactic processing: In this step, we extract more meaning from the sentence,
by using its syntax this time. Instead of just blindly looking at the words, we here
look at the syntactic structures, i.e., the grammar of the language to understand
the meaning.

Semantic processing: Lexical and syntactic processing do not suffice when it

comes to building advanced NLP applications such as language translation,
chatbots, etc. After performing lexical and syntactic processing, we will still be
incapable of understanding the meaning of each word. Here, we try and extract
the hidden meaning behind the words which is also the most difficult part for
computers.
Lexical Processing

Lexicon describes the vocabulary that makes up a language.

Lexical analysis deciphers and segments language into units or
lexemes such as paragraphs, sentences, phrases, and words. A few
of the techniques involved in Lexical Processing are:

• Word Frequencies and Stop Words

• Stop words removal
• Bag-of-Words and TF-IDF Representation
• Tokenization
• Stemming
• Lemmatization
Syntactic Processing

It is about analysing the syntax or the grammatical structure

of sentences.
Following are some of the popular techniques performed for
the syntactic processing of textual data:
•POS tagging techniques
•Constituency and Dependency parsing
Let’s start with an example to understand Syntactic Processing and
consider two sentences "Canberra is the capital of Australia." and "Is
Canberra the of Australia capital.”

Both sentences have the same set of words

However, only the first one is syntactically correct and comprehensible.
Lexical processing techniques wouldn't be able to tell this difference.
Therefore, more sophisticated syntactic processing techniques are
required to understand the relationship between individual words in the
sentence.
Semantic Processing

Lexical and syntactic processing doesn't suffice when it comes to

building advanced NLP applications such as language translation,
chatbots, etc.
Semantic processing is about understanding the meaning of a
given piece of text.
It is probably the most challenging area in the field of NLP, partly
because the concept of 'meaning' itself is quite wide, and it is a
genuinely hard problem to make machines understand the text
the same way as we humans do
Such as inferring the intent of a statement, meanings of
ambiguous words, dealing with synonyms, detecting sarcasm and
so on.
Semantic text processing focuses on teaching machines
to process meaning of the text in similar ways. There
are various semantics techniques used such as:
Word Sense Disambiguation: Identifying the intended
meaning of an ambiguous word.
Distributional Semantics: The technique helps to
arrange semantically similar words together as
compared to other words.
Topic modelling: Identifying topics being talked
about in documents.
Ambiguity
I made her duck.
•How many different interpretations does this sentence
have?
•What are the reasons for the ambiguity?
•The categories of knowledge of language can be thought of
as ambiguity resolving components.
•How can each ambiguous piece be resolved?
•Does speech input make the sentence even more
ambiguous?
Some interpretations of : I made her
duck.
1. I cooked duck for her.
2. I cooked duck belonging to her.
3. I created a toy duck which she owns.
4. I caused her to quickly lower her head or body.
5. I used magic and turned her into a duck.
Brief History of NLP
1940s –1950s: Foundations
◦ Development of formal language theory (Chomsky, Backus, Naur, Kleene)
◦ Probabilities and information theory (Shannon)
1957 – 1970s:
◦ Use of formal grammars as basis for natural language processing (Chomsky, Kaplan)
◦ Use of logic and logic based programming (Minsky, Winograd, Colmerauer, Kay)
1970s – 1983:
◦ Probabilistic methods for speech recognition (Jelinek, Mercer)
◦ Discourse modeling (Grosz, Sidner, Hobbs)
1983 – 1993:
◦ Finite state models (morphology) (Kaplan, Kay)
1993 – present:
◦ Strong integration of different techniques, different areas.
Topics to be covered:
Regular Expression
Lexical Analysis
Edit Distance
N-Gram Language Model
Naïve Bayes
Logistic Regression
Vector Semantics
Neural Network
RNN And LSTM
Transformer
Large Language Model
Masked Language Model
Prompting
Machine Translation
Question Answering
Dialogue Management
Part Of Speech Tagging
Constituency Parsing
Dependency Parsing
Named Entity Recognition

Unit V
No ratings yet
Unit V
16 pages
Nayie Bayes Classifier 21 Page
No ratings yet
Nayie Bayes Classifier 21 Page
28 pages
NLP Presentation
No ratings yet
NLP Presentation
19 pages
Lec1 Introduction
No ratings yet
Lec1 Introduction
30 pages
2 Introduction
No ratings yet
2 Introduction
15 pages
Introduction To Natural Language Processing-03-01-2024
No ratings yet
Introduction To Natural Language Processing-03-01-2024
27 pages
INTRONLP
No ratings yet
INTRONLP
30 pages
Introduction To Natural Language Processing
No ratings yet
Introduction To Natural Language Processing
69 pages
Lesson 1 Introduction To Natural Language Processing
No ratings yet
Lesson 1 Introduction To Natural Language Processing
93 pages
NLP Unit1
No ratings yet
NLP Unit1
51 pages
Natural Language Processing
No ratings yet
Natural Language Processing
30 pages
NLP Module 1
No ratings yet
NLP Module 1
124 pages
NLP Lab1
No ratings yet
NLP Lab1
33 pages
NLP Merged
100% (1)
NLP Merged
975 pages
1.1chap NLP - Introduction
No ratings yet
1.1chap NLP - Introduction
34 pages
NLP for AI and Tech Enthusiasts
No ratings yet
NLP for AI and Tech Enthusiasts
30 pages
Natural Language Processin1
No ratings yet
Natural Language Processin1
86 pages
1 Natural Language Processing-Intro
No ratings yet
1 Natural Language Processing-Intro
16 pages
Natural Language Processing (NLP) : Chapter 1: Introduction To NLP
No ratings yet
Natural Language Processing (NLP) : Chapter 1: Introduction To NLP
96 pages
Chapter 1
No ratings yet
Chapter 1
5 pages
NLP Textbook Star Edu
No ratings yet
NLP Textbook Star Edu
103 pages
Chapter 6
No ratings yet
Chapter 6
21 pages
Natural Language Processing Lec 1
No ratings yet
Natural Language Processing Lec 1
23 pages
NLP Module - 1
No ratings yet
NLP Module - 1
16 pages
NLP PPT1
No ratings yet
NLP PPT1
29 pages
Unit 4
No ratings yet
Unit 4
39 pages
1 Introduction
No ratings yet
1 Introduction
13 pages
Module 1
No ratings yet
Module 1
40 pages
NLP Presentation1
No ratings yet
NLP Presentation1
25 pages
3.1 Natural Language Processing
No ratings yet
3.1 Natural Language Processing
5 pages
6CS4 AI Unit-5
No ratings yet
6CS4 AI Unit-5
65 pages
Introduction To Natural Language Processing
No ratings yet
Introduction To Natural Language Processing
45 pages
NLP Presentation
No ratings yet
NLP Presentation
19 pages
Natural Language Processing
No ratings yet
Natural Language Processing
4 pages
NLP Module 1
No ratings yet
NLP Module 1
10 pages
Lecture 1
No ratings yet
Lecture 1
33 pages
519 Assignment
No ratings yet
519 Assignment
26 pages
Lect1 Intro 3jan08
No ratings yet
Lect1 Intro 3jan08
94 pages
NLP PPT
No ratings yet
NLP PPT
41 pages
NLP Course: Theory & Applications
No ratings yet
NLP Course: Theory & Applications
16 pages
1.introduction To Natural Language Processing (NLP)
100% (1)
1.introduction To Natural Language Processing (NLP)
37 pages
Introduction To Natural Language Processing
No ratings yet
Introduction To Natural Language Processing
32 pages
NLP 1
No ratings yet
NLP 1
20 pages
NLP Notes
No ratings yet
NLP Notes
73 pages
NLP Notes2
No ratings yet
NLP Notes2
27 pages
Unit 1 Extra
No ratings yet
Unit 1 Extra
6 pages
NLP & Linguistics for Researchers
No ratings yet
NLP & Linguistics for Researchers
35 pages
Lec 1.1.2
No ratings yet
Lec 1.1.2
44 pages
NLP Lecture
No ratings yet
NLP Lecture
18 pages
Unit-I NLP
No ratings yet
Unit-I NLP
15 pages
Lec1-UNIT5 - MORE SIMPLER
No ratings yet
Lec1-UNIT5 - MORE SIMPLER
28 pages
DLNLP Chapter-1
No ratings yet
DLNLP Chapter-1
38 pages
Natural Language Processing (NLP)
No ratings yet
Natural Language Processing (NLP)
45 pages
Natural Language Processing
No ratings yet
Natural Language Processing
57 pages
NLP Lecture Notes R20
No ratings yet
NLP Lecture Notes R20
56 pages
NLP - Natural Language Processing and APPLICATION
No ratings yet
NLP - Natural Language Processing and APPLICATION
31 pages
Chapter 7 - Communication Perceving and Acting
No ratings yet
Chapter 7 - Communication Perceving and Acting
21 pages
NLP Module1-4
No ratings yet
NLP Module1-4
100 pages
Grammar Practice Level 1 Unit 2.2 - Answer
No ratings yet
Grammar Practice Level 1 Unit 2.2 - Answer
2 pages
CV Vradii Polly
No ratings yet
CV Vradii Polly
2 pages
BICS in Junior High Boarding School
No ratings yet
BICS in Junior High Boarding School
10 pages
Grade 5 English Exam Guide
No ratings yet
Grade 5 English Exam Guide
5 pages
How Was Your Holiday
No ratings yet
How Was Your Holiday
2 pages
Unidad I. Partes y Órganos Del Habla. Inglés PNF Mecánica
No ratings yet
Unidad I. Partes y Órganos Del Habla. Inglés PNF Mecánica
5 pages
Past Simple Irregular Verbs Quiz
No ratings yet
Past Simple Irregular Verbs Quiz
2 pages
Unseen Paper 4 Tips IGCSE 0475
100% (4)
Unseen Paper 4 Tips IGCSE 0475
6 pages
HS8251 Technical English
No ratings yet
HS8251 Technical English
20 pages
Reflexive and Intensive Pronoun PDF
No ratings yet
Reflexive and Intensive Pronoun PDF
22 pages
TCL Homework Help for Students
100% (1)
TCL Homework Help for Students
8 pages
Plus
No ratings yet
Plus
7 pages
DLP For Teaching Grammar
No ratings yet
DLP For Teaching Grammar
7 pages
Year 4 Literacy Homework Adjectives
100% (1)
Year 4 Literacy Homework Adjectives
5 pages
Semantics Antonymy
No ratings yet
Semantics Antonymy
17 pages
Contexts of Co-Constructed Discourse - Interaction, Pragmatics, and Second Language Applications
No ratings yet
Contexts of Co-Constructed Discourse - Interaction, Pragmatics, and Second Language Applications
251 pages
Remedial English Module 6th STD Clean
No ratings yet
Remedial English Module 6th STD Clean
3 pages
Language Its Structure and Use 5th Edition Edward Finegan Instant Download
No ratings yet
Language Its Structure and Use 5th Edition Edward Finegan Instant Download
74 pages
English Test 1
No ratings yet
English Test 1
5 pages
Your Celpip Scores
No ratings yet
Your Celpip Scores
3 pages
10 Tips On How To Improve English - English With Lucy
No ratings yet
10 Tips On How To Improve English - English With Lucy
10 pages
Organs of Speech
100% (1)
Organs of Speech
8 pages
Editing and Proofreading in Translation
No ratings yet
Editing and Proofreading in Translation
6 pages
The Last Flicker
No ratings yet
The Last Flicker
2 pages
English 8 Q3 Week 7
0% (1)
English 8 Q3 Week 7
10 pages
Q3 English 8 Module 6
No ratings yet
Q3 English 8 Module 6
29 pages
English Grammar
100% (2)
English Grammar
3,437 pages
Sentence Race Present
No ratings yet
Sentence Race Present
3 pages
Monologues
No ratings yet
Monologues
5 pages
Child Language Acquisition A Level English Language Skills and Knowledg
No ratings yet
Child Language Acquisition A Level English Language Skills and Knowledg
6 pages

Introduction

Uploaded by

Introduction

Uploaded by

Natural Language

•Powers translation, search engines, chatbots, and

•Helps analyze massive amounts of text

Discourse – concerns how the immediately preceding sentences

World Knowledge – includes general knowledge about the world. What

Semantic processing: Lexical and syntactic processing do not suffice when it

Lexicon describes the vocabulary that makes up a language.

• Word Frequencies and Stop Words

It is about analysing the syntax or the grammatical structure

Both sentences have the same set of words

Lexical and syntactic processing doesn't suffice when it comes to

You might also like