NLP Class X AI

The document provides an overview of Natural Language Processing (NLP), a sub-field of AI focused on enabling computers to understand human languages. It discusses various applications of NLP, including automatic summarization, sentiment analysis, text classification, and virtual assistants like chatbots. Additionally, it addresses challenges in processing natural language and outlines data processing techniques such as text normalization, stemming, lemmatization, and the Bag of Words algorithm.

Uploaded by

swastiksambhu10
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views36 pages

NLP Class X AI

The document provides an overview of Natural Language Processing (NLP), a sub-field of AI focused on enabling computers to understand human languages. It discusses various applications of NLP, including automatic summarization, sentiment analysis, text classification, and virtual assistants like chatbots. Additionally, it addresses challenges in processing natural language and outlines data processing techniques such as text normalization, stemming, lemmatization, and the Bag of Words algorithm.

Uploaded by

swastiksambhu10
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 36

ARTIFICIAL INTELLIGENCE

As Per Latest CBSE Class X Syllabus

Natural Language Processing

• It is the sub-field of AI that is focused on enabling computers to understand and process human languages.

• It is concerned with the interactions between computers and human (natural) languages, in particular how to program computers to process and analyse large amounts of natural language data.
Applications of Natural Language Processing

• Automatic Summarization

• Sentiment Analysis

• Text classification

• Virtual Assistants
Automatic Summarization

• It is the process of shortening a set of data computationally, to create a summary that represents the most relevant information within the original content.

• It comes out as a solution to information overload.

• It is also about understanding the emotional meanings within the information.
Sentiment analysis

• It is about identifying sentiment among several posts, or even within the same post where emotion is not always explicitly expressed.

• Companies use NLP applications such as sentiment analysis to identify opinions and sentiment online, to help them understand what customers think about their products and services.
Text classification

• Text classification makes it possible to assign predefined categories to a document and organize it to help find the information needed.

• For example, an application of text categorization is spam filtering in email.
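As a toy illustration of the spam-filtering example, a keyword-based classifier can assign one of two predefined categories to a message. The keyword list and messages below are invented for demonstration; real filters learn such features from labelled data.

```python
# Hypothetical spam keywords, chosen only for this example.
SPAM_KEYWORDS = {"lottery", "winner", "free", "prize"}

def classify(message):
    """Assign the message to the 'spam' or 'ham' category by keyword match."""
    words = set(message.lower().split())
    return "spam" if words & SPAM_KEYWORDS else "ham"

print(classify("You are the lucky winner of a free prize"))  # spam
print(classify("Meeting moved to 3 pm"))                     # ham
```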
Virtual Assistants

• An application program that understands natural language voice commands and completes tasks for the user.

• Benefits of AI Assistants:
• Improved customer support
• Ease of key data collection
• Personalized user experience

• Examples:
Chatbots, Voice Assistants, AI Avatars, Domain-Specific Virtual Assistants, etc.
Chatbots

• One of the most common applications of Natural Language Processing is a chatbot.

• An AI software that can simulate a real human conversation, with real-time responses to users based on reinforcement learning.

• AI Chatbots either use text messages, voice commands, or both.


Chatbots…
• Ex-
• Mitsuku Bot
https://www.pandorabots.com/mitsuku/
• CleverBot
https://www.cleverbot.com/
• Jabberwacky
http://www.jabberwacky.com/
• Haptik
https://haptik.ai/contact-us
• Rose
http://ec2-54-215-197-164.us-west-1.compute.amazonaws.com/speech.php
• Ochatbot
https://www.ometrics.com/blog/list-of-fun-chatbots/
Chatbots…

• There are 2 types of chatbots: script bots and smart bots.

• Script bots work around a predefined script, while smart bots use AI to learn from their interactions.

Ex- bots deployed in the customer care section of various companies are usually script bots.


Human Language VS Computer Language

• The human brain continuously processes everything it perceives around it, makes sense of it and stores it somewhere.

• When someone whispers, the focus of our brain automatically shifts to that speech (giving it more priority) and starts processing it automatically.

• The computer, on the other hand, understands only the language of numbers.

• Everything that is sent to the machine has to be converted to numbers.

Difficulties in processing natural language by a machine

Arrangement of the words and meaning

• There are structures/characteristics in the human language that might be easy for a human to understand but extremely difficult for a computer to understand.

• Different syntax, same semantics: 2 + 3 = 3 + 2

• Different semantics, same syntax: 2/3 (Python 2.7) ≠ 2/3 (Python 3)
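The 2/3 example can be reproduced inside Python 3 alone, since the // operator keeps the integer-division behaviour that / had for integers in Python 2.7:

```python
# Same syntax, different semantics across Python versions:
# in Python 2.7, 2/3 performed integer division and evaluated to 0.
true_division = 2 / 3    # Python 3: true division
floor_division = 2 // 3  # what 2/3 evaluated to in Python 2.7

print(true_division)   # 0.6666666666666666
print(floor_division)  # 0
```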



Multiple Meanings of a word

=> His face turned red after he found out that he took the wrong bag.

=> His face turns red after consuming the medicine.

• Both the sentences use the same words, yet they might have multiple meanings: in the first, 'red' could signal anger or embarrassment, while in the second it could signal an allergic reaction.

Perfect Syntax, but no Meaning

=> Chickens feed extravagantly while the moon drinks tea.

• This sentence is syntactically perfect, yet it carries no meaning; a machine that checks only grammar cannot detect this.


Data Processing: Text Normalization

• It involves preparing and cleaning text data for machines to be able to analyse it.

• This process puts data in a workable form and highlights features in the text that an algorithm can work with.

• There are several steps through which this can be done, including:

Sentence Segmentation:

In this process the whole corpus is divided into sentences. Each sentence is taken as a different data point, so the whole corpus gets reduced to sentences.
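A minimal regex-based sketch of sentence segmentation, splitting on sentence-ending punctuation followed by whitespace (real NLP libraries also handle abbreviations and other edge cases; the example corpus is invented):

```python
import re

def segment_sentences(corpus):
    """Split a corpus into sentences at ., ! or ? followed by whitespace."""
    return [s for s in re.split(r'(?<=[.!?])\s+', corpus.strip()) if s]

corpus = "Aman is stressed. He went to a therapist! Will he feel better?"
print(segment_sentences(corpus))
# ['Aman is stressed.', 'He went to a therapist!', 'Will he feel better?']
```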
• Tokenisation:

It is the process of breaking down the sentences into smaller units (tokens) to work with.
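A simple tokeniser that treats runs of letters and digits as tokens and discards punctuation (a sketch; the sentence is illustrative):

```python
import re

def tokenise(sentence):
    """Break a sentence into word/number tokens, dropping punctuation."""
    return re.findall(r"[A-Za-z0-9]+", sentence)

print(tokenise("Aman, Anil are stressed!"))
# ['Aman', 'Anil', 'are', 'stressed']
```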
• Removing Stopwords, Special Characters and Numbers:

In this process, common words, special characters and numbers (which do not add any essence to the information) are removed from the text, so that the unique words which offer the most information about the text remain.

Some examples of stopwords are:

a, an, are, for, etc.
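Stopword removal can be sketched with a tiny hand-picked stopword set; real lists (for example NLTK's) are much longer, and this one is only for illustration:

```python
# A tiny illustrative stopword list; real stopword lists are far longer.
STOPWORDS = {"a", "an", "and", "are", "the", "is", "for", "to"}

def remove_stopwords(tokens):
    """Keep only tokens that are not stopwords."""
    return [t for t in tokens if t.lower() not in STOPWORDS]

print(remove_stopwords(["Aman", "and", "Anil", "are", "stressed"]))
# ['Aman', 'Anil', 'stressed']
```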


• Converting text to a common case:

In this process the whole text is converted into a similar case (usually lower case). This ensures that the machine does not treat the same word in different cases as different words.
• Stemming:

Here, the remaining words are reduced to their root words. It is the process in which the affixes of words are removed and the words are converted to their base form.
• Lemmatization:

The process in which a word is converted to its meaningful root form.

Stemming and lemmatization are alternative processes to each other, as the role of both processes is the same: removal of affixes. But the difference between them is that in lemmatization, the word we get after affix removal (also known as the lemma) is a meaningful one.
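The contrast can be sketched with a naive suffix-stripper for stemming and a small hand-written lookup table for lemmatization. Real systems (such as NLTK's PorterStemmer and WordNetLemmatizer) are far more sophisticated; both the suffix list and the mini-dictionary below are assumptions made purely for illustration:

```python
def naive_stem(word):
    """Strip common suffixes blindly; the result need not be a real word."""
    for suffix in ("ies", "ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

# Hypothetical mini-dictionary mapping words to their lemmas.
LEMMAS = {"studies": "study", "caring": "care", "better": "good"}

def lemmatise(word):
    """Return the dictionary lemma if known, else the word unchanged."""
    return LEMMAS.get(word, word)

print(naive_stem("studies"), lemmatise("studies"))
# 'stud' (not a meaningful word) vs 'study' (a meaningful lemma)
```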
Bag of words Algorithm

• A Natural Language Processing model which helps in extracting features out of the text, which is very helpful in machine learning algorithms.

• The occurrences of each word are counted and the vocabulary for the corpus is constructed.

The step-by-step approach to implement bag of words algorithm:

1. Text Normalisation: Collect data and pre-process it.

2. Create Dictionary: Make a list of all the unique words occurring in the
corpus. (Vocabulary).

3. Create document vectors: For each document in the corpus, find out
how many times the word from the unique list of words has occurred.

4. Create document vectors for all the documents.
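The four steps above can be sketched end-to-end. The three one-sentence documents are assumed placeholders for illustration (they are not necessarily the corpus used in these notes):

```python
def bag_of_words(documents):
    """Build a vocabulary and one count vector per document."""
    # Step 1: text normalisation — lower-case and split into tokens.
    tokenised = [doc.lower().split() for doc in documents]
    # Step 2: create the dictionary of unique words (vocabulary).
    vocabulary = sorted({word for doc in tokenised for word in doc})
    # Steps 3-4: count each vocabulary word in every document.
    vectors = [[doc.count(word) for word in vocabulary] for doc in tokenised]
    return vocabulary, vectors

docs = ["aman and anil are stressed",
        "aman went to a therapist",
        "anil went to download a health chatbot"]
vocab, vecs = bag_of_words(docs)
print(vocab)
print(vecs)
```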



Here are three documents having one sentence each. After text normalization, the text
becomes:

Note that no tokens have been removed in the stopwords removal step. It is because we have
very little data and since the frequency of all the words is almost the same, no word can be said
to have lesser value than the other.
List down all the words which occur in all three documents:

In this step,
• The vocabulary is written in the top row.
• Now, for each word in the document, if it matches with the vocabulary, put a 1 under it.
• If the same word appears again, increment the previous value by 1.
• And if the word does not occur in that document, put a 0 under it.
Since in the first document we have the words: aman, and, anil, are, stressed, all these words get a value of 1 and the rest of the words get a 0 value.
This gives us the document vector table for our corpus. But the tokens have still not been converted to numbers. This leads us to the final step of our algorithm: TFIDF.
(Figure: a plot of the occurrence of words versus their value.)
TFIDF stands for Term Frequency and Inverse Document Frequency.
It helps in identifying the value of each word.
Let us understand each term one by one.

Term Frequency:
▪ Term frequency is the frequency of a word in one document.
▪ It can easily be found from the document vector table.

Inverse Document Frequency:
▪ It is the total number of documents divided by the document frequency of the word.
▪ IDF(W) = (total no. of documents) / (document frequency of W)

TFIDF(W) = TF(W) * log( IDF(W) )
After calculating all the values:

Conclusion:
The value of a word is inversely proportional to its document frequency: the more documents a word occurs in, the lower its value.
Ex-
Total number of documents: 10
Number of documents in which ‘and’ occurs: 10
Therefore, IDF(and) = 10/10 = 1
Which means: log(1) = 0.
Hence, the value of ‘and’ becomes 0.
On the other hand,
Number of documents in which ‘pollution’ occurs: 3
IDF(pollution) = 10/3 = 3.3333…
Which means: log(3.3333) ≈ 0.523,
which shows that the word ‘pollution’ has considerable value in the corpus.
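The worked example uses base-10 logarithms; a short sketch reproducing the two calculations:

```python
import math

def tfidf(term_frequency, total_docs, doc_frequency):
    """TFIDF(W) = TF(W) * log10(total documents / document frequency of W)."""
    return term_frequency * math.log10(total_docs / doc_frequency)

# 'and' occurs in all 10 documents: log10(10/10) = 0, so its value is 0.
print(round(tfidf(1, 10, 10), 3))  # 0.0
# 'pollution' occurs in 3 of 10 documents: log10(10/3) ≈ 0.523.
print(round(tfidf(1, 10, 3), 3))   # 0.523
```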
Applications of TFIDF:-

• Document Classification

• Topic Modelling

• Information Retrieval System

• Stop word filtering
