IES College of Technology Bhopal
Department of Computer Science and Engineering
Subject Name: Natural Language Processing (NLP)
AL-504(B) Date of Submission 17.10.2024
Assignment Question
Faculty Name: Dr. Manmohan Singh
Note: Attempt All questions
Unit 1
1. List and explain the challenge of NLP
2. Write and explain the minimum edit distance algorithm
3. Discuss in detail the basic regular expression patterns with example
4. What is Tokenization with the example
5. What is CFG and its rule in brief.
6. what is regular Expression with suitable example
7. What is language modelling. Explain with suitable example
8. write short note on Finite State Automate
Unit -2
1. What is unsmoothed N-grams, and how do they differ from smoothed N-grams?
2. Explain the process of evaluating N-grams. What metrics and methods are used?
3. Describe different smoothing techniques used in N-gram models. Why is smoothing necessary?
4. What is interpolation and backoff techniques in language modeling? How are they implemented?
5. Define word classes. How are they utilized in natural language processing?
6. Explain part-of-speech tagging and its significance in word-level analysis.
7. Compare and contrast rule-based, stochastic, and transformation-based approaches to part-of-
speech tagging.
8. Discuss common issues encountered in part-of-speech tagging. How can these issues be
mitigated?
9. Explain the Hidden Markov Model (HMM) and its application in part-of-speech tagging.
10. Describe the Viterbi algorithm and EM (Expectation-Maximization) training. How are they used in
HMMs?