0% found this document useful (0 votes)

32 views6 pages

Week-8 NLP Lab Program

This document provides a Python program for converting audio files to text and text files to audio using the NLTK package and other libraries. It includes functions for converting MP3 to WAV, performing speech recognition, and generating audio from text, along with installation instructions for required packages and FFmpeg. The program allows users to choose between converting audio to text or text to audio and provides options for saving the results.

Uploaded by

227r1a7349

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views6 pages

Week-8 NLP Lab Program

Uploaded by

227r1a7349

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

EXPERIMENT-8

Natural Language Processing Lab

Write a python program to convert audio file to text and text file to audio

files using NLTK Package.

Requirements

To run this program, you'll need to install the following packages:

pip install nltk SpeechRecognition gtts pydub

import nltk

from nltk.tokenize import word_tokenize, sent_tokenize

import speech_recognition as sr

from gtts import gTTS

import os

# Download NLTK data (only needed once)

nltk.download('punkt')

from pydub import AudioSegment

import os

def mp3_to_wav(mp3_file_path, wav_file_path=None):

"""

Convert MP3 file to WAV format using pydub.

Args:

mp3_file_path (str): Path to the input MP3 file

wav_file_path (str): Path to save the output WAV file (optional)

If not provided, replaces .mp3 with .wav

Returns:

str: Path to the created WAV file

"""
try:

# If output path not specified, create one by replacing extension

if wav_file_path is None:

wav_file_path = os.path.splitext(mp3_file_path)[0] + '.wav'

# Load MP3 file

audio = AudioSegment.from_mp3(mp3_file_path)

# Export as WAV

audio.export(wav_file_path, format="wav")

print(f"Successfully converted {mp3_file_path} to {wav_file_path}")

return wav_file_path

except Exception as e:

print(f"Error converting MP3 to WAV: {e}")

return None

# Example usage

if __name__ == "__main__":

input_mp3 = "input.mp3" # Change to your MP3 file path

output_wav = "output.wav" # Change to desired WAV file path

mp3_to_wav(input_mp3, output_wav)

def audio_to_text(mp3_file_path):

"""

Convert MP3 to WAV, then perform speech recognition and NLP processing

"""

try:

# First convert MP3 to WAV

wav_file = mp3_to_wav(mp3_file_path)
# Then do speech recognition

recognizer = sr.Recognizer()

with sr.AudioFile(wav_file) as source:

audio_data = recognizer.record(source)

text = recognizer.recognize_google(audio_data)

# NLP processing with NLTK

tokens = word_tokenize(text)

print("Recognized text tokens:", tokens)

return text

except Exception as e:

print(f"Error in MP3 to text conversion: {e}")

return None

def text_to_audio(text, output_file="output.mp3", language='en'):

"""

Convert text to speech and save as an audio file using gTTS.

"""

try:

# Tokenize text into sentences for better processing

sentences = sent_tokenize(text)

processed_text = ' '.join(sentences)

tts = gTTS(text=processed_text, lang=language, slow=False)

tts.save(output_file)

print(f"Audio file saved as {output_file}")

return output_file
except Exception as e:

print(f"Error in text-to-speech conversion: {e}")

return None

def text_file_to_audio(text_file_path, output_file="output.mp3", language='en'):

"""

Read text from a file and convert it to speech.

"""

try:

with open(text_file_path, 'r', encoding='utf-8') as file:

text = file.read()

return text_to_audio(text, output_file, language)

except Exception as e:

print(f"Error reading text file: {e}")

return None

def main():

print("Audio and Text Conversion Tool")

print("1. Audio file to Text")

print("2. Text file to Audio")

choice = input("Enter your choice (1 or 2): ")

if choice == '1':

audio_file = input("Enter audio file path (WAV, AIFF, FLAC): ")

text = audio_to_text(audio_file)

if text:

print("\nConverted Text:")

print(text)

# Save to file

save_choice = input("Save to text file? (y/n): ").lower()

if save_choice == 'y':

output_file = input("Enter output text file name (e.g., output.txt): ")

with open(output_file, 'w', encoding='utf-8') as f:

f.write(text)

print(f"Text saved to {output_file}")

elif choice == '2':

text_file = input("Enter text file path: ")

output_audio = input("Enter output audio file name (e.g., output.mp3): ")

result = text_file_to_audio(text_file, output_audio)

if result:

print(f"Successfully created audio file: {result}")

# Option to play the audio

play_choice = input("Play the audio file? (y/n): ").lower()

if play_choice == 'y':

os.system(f"start {result}" if os.name == 'nt' else f"xdg-open {result}")

else:

print("Invalid choice")

if __name__ == "__main__":

main()
Additionally, you'll need FFmpeg installed on your system:

FFmpeg Installation Guide for Windows

1. Download FFmpeg:

o Direct download link: https://www.gyan.dev/ffmpeg/builds/

o Choose: ffmpeg-release-essentials.zip (latest version)

o Alternative official source: https://ffmpeg.org/download.html

2. Install FFmpeg:

o Extract the ZIP file to a permanent location (e.g., C:\ffmpeg)

o Copy the path to the bin folder (e.g., C:\ffmpeg\bin)

3. Add FFmpeg to System PATH:

o Press Win + R, type sysdm.cpl, and press Enter

o Go to "Advanced" tab → "Environment Variables"

o Under "System variables", find and select "Path" → Click "Edit"

o Click "New" and paste your FFmpeg bin path (e.g., C:\ffmpeg\bin)

o Click "OK" on all windows to save

4. Verify Installation:

o Open Command Prompt (Win + R, type cmd)

o Run: ffmpeg -version

o You should see version information if installed correctly

Notes:

1. Audio to Text:

o Uses Google Speech Recognition API (free but requires internet)

o Works best with uncompressed WAV, AIFF, or FLAC files

o For other formats, you might need to convert them first

2. Text to Audio:

o Uses Google Text-to-Speech (gTTS) which requires internet

o Outputs as MP3 by default

o Includes NLTK sentence tokenization for better speech flow

NLP Exp 8
No ratings yet
NLP Exp 8
2 pages
Training Project - Pptyx
No ratings yet
Training Project - Pptyx
11 pages
Pdf2mp3 Py
No ratings yet
Pdf2mp3 Py
4 pages
TSA Lab 2
No ratings yet
TSA Lab 2
3 pages
Speech Recognition
No ratings yet
Speech Recognition
5 pages
Artificial Intelligence Project Report-Ads18a00095y
No ratings yet
Artificial Intelligence Project Report-Ads18a00095y
3 pages
Dhara NLP Practical
No ratings yet
Dhara NLP Practical
67 pages
Python Text To Spesdfssech
No ratings yet
Python Text To Spesdfssech
2 pages
Jarvis
No ratings yet
Jarvis
2 pages
Speech To Text Conversion
No ratings yet
Speech To Text Conversion
7 pages
Speech Recog
No ratings yet
Speech Recog
5 pages
Voice Assistant Report
No ratings yet
Voice Assistant Report
4 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
Department of Computer Science and Engineering) : CGB1121/ EGB1122
No ratings yet
Department of Computer Science and Engineering) : CGB1121/ EGB1122
18 pages
Voice Assistant Suggetion
No ratings yet
Voice Assistant Suggetion
3 pages
2.5 Automatic Speech Recognition
No ratings yet
2.5 Automatic Speech Recognition
8 pages
Voice Identification GLM4 Guide
No ratings yet
Voice Identification GLM4 Guide
2 pages
Priyank Dewashish
No ratings yet
Priyank Dewashish
15 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Pydub
No ratings yet
Pydub
26 pages
Labs 9
No ratings yet
Labs 9
4 pages
Ai
No ratings yet
Ai
2 pages
Python Project1
No ratings yet
Python Project1
8 pages
Voice Assistant Python Script
No ratings yet
Voice Assistant Python Script
6 pages
Application Code Exp2
No ratings yet
Application Code Exp2
4 pages
Exno8 Lab
No ratings yet
Exno8 Lab
4 pages
Assistant
No ratings yet
Assistant
2 pages
Text To Speech Presentation
No ratings yet
Text To Speech Presentation
7 pages
Documentation
No ratings yet
Documentation
5 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
12 pages
Lecture
No ratings yet
Lecture
7 pages
Week 8
No ratings yet
Week 8
3 pages
Python Audio Processing Guide
No ratings yet
Python Audio Processing Guide
4 pages
Speech Recognition System
No ratings yet
Speech Recognition System
16 pages
Chat Bot 1
No ratings yet
Chat Bot 1
7 pages
Sphinx Speech Recognition
No ratings yet
Sphinx Speech Recognition
5 pages
Exercise 8
No ratings yet
Exercise 8
2 pages
Building A Windows Desktop AI Assistant (Python, Voice I - O, 3D Avatar)
No ratings yet
Building A Windows Desktop AI Assistant (Python, Voice I - O, 3D Avatar)
5 pages
Create Audio Effects App in Python
No ratings yet
Create Audio Effects App in Python
5 pages
Text-to-Speech Conversion Guide
No ratings yet
Text-to-Speech Conversion Guide
8 pages
Python Ai
No ratings yet
Python Ai
3 pages
Data Sorting Guideline
No ratings yet
Data Sorting Guideline
2 pages
Jarvis For Windows
No ratings yet
Jarvis For Windows
1 page
PBL 2
No ratings yet
PBL 2
5 pages
Speech to Text Guide Using Python
No ratings yet
Speech to Text Guide Using Python
1 page
Aa Alexa
No ratings yet
Aa Alexa
3 pages
Explanation
No ratings yet
Explanation
4 pages
Python SpeechRecognition Guide
No ratings yet
Python SpeechRecognition Guide
23 pages
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
No ratings yet
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
10 pages
Text Into Speech Python Report
No ratings yet
Text Into Speech Python Report
18 pages
Python Speech Recognition Guide
No ratings yet
Python Speech Recognition Guide
25 pages
Sujal Kumar Sinha - IOT - MATLAB Mini
No ratings yet
Sujal Kumar Sinha - IOT - MATLAB Mini
13 pages
Speech Recognition Techniques - GUVI
No ratings yet
Speech Recognition Techniques - GUVI
4 pages
Speech To Text
No ratings yet
Speech To Text
17 pages
Python GuiaUser
No ratings yet
Python GuiaUser
23 pages
Voice Assistant Using Python 2
No ratings yet
Voice Assistant Using Python 2
20 pages
Suryanarayan 3
No ratings yet
Suryanarayan 3
2 pages
Voice Assistant Report 40 Pages
No ratings yet
Voice Assistant Report 40 Pages
44 pages
Python Virtual Assistant Guide
No ratings yet
Python Virtual Assistant Guide
8 pages
Bfs 569
No ratings yet
Bfs 569
10 pages
Naveen Resume
No ratings yet
Naveen Resume
1 page
Issues in Machine Learning With Conclution
No ratings yet
Issues in Machine Learning With Conclution
8 pages
Iomp
No ratings yet
Iomp
11 pages
Lab 16
No ratings yet
Lab 16
3 pages
Lab 13 For Manual
No ratings yet
Lab 13 For Manual
4 pages
16th Program
No ratings yet
16th Program
7 pages
Program 12
No ratings yet
Program 12
7 pages
Lab Manual 15
No ratings yet
Lab Manual 15
7 pages
Lesson 1: 1.1. Egyptian Nouns
No ratings yet
Lesson 1: 1.1. Egyptian Nouns
11 pages
Scienceofetymolo 00 Skeauoft
No ratings yet
Scienceofetymolo 00 Skeauoft
274 pages
English 2 II Period 2023 12pm - Top Notch Level 1 3rd Edition 2023-07-25 01.14.39 64bf21ff44851
No ratings yet
English 2 II Period 2023 12pm - Top Notch Level 1 3rd Edition 2023-07-25 01.14.39 64bf21ff44851
480 pages
Butuanon Grammar: Possession
0% (1)
Butuanon Grammar: Possession
12 pages
Daily Vocabulary (Day 2)
No ratings yet
Daily Vocabulary (Day 2)
2 pages
Presentaion For Webinar
No ratings yet
Presentaion For Webinar
13 pages
Fugue Analysis: A Comprehensive Guide
No ratings yet
Fugue Analysis: A Comprehensive Guide
2 pages
Pediatric Growth & Development Guide
No ratings yet
Pediatric Growth & Development Guide
19 pages
Exam Preparation Guide: Week 10
No ratings yet
Exam Preparation Guide: Week 10
98 pages
Alster-Sumerian Proverbs
No ratings yet
Alster-Sumerian Proverbs
17 pages
Maths IB (EM) BLM 21-22
No ratings yet
Maths IB (EM) BLM 21-22
111 pages
PDF Pioneer b2 Tests Compress
100% (4)
PDF Pioneer b2 Tests Compress
65 pages
02 - Linux Checklist
No ratings yet
02 - Linux Checklist
6 pages
Vooma Paybill Application Form
No ratings yet
Vooma Paybill Application Form
2 pages
Evaluacion de Ingles 4
No ratings yet
Evaluacion de Ingles 4
2 pages
Effectiveness of One-On-One Tutoring
No ratings yet
Effectiveness of One-On-One Tutoring
7 pages
Interfacecomponent Siemens
No ratings yet
Interfacecomponent Siemens
16 pages
Internship Report Anguraj
No ratings yet
Internship Report Anguraj
35 pages
ProLoan O
No ratings yet
ProLoan O
12 pages
1 D 6
No ratings yet
1 D 6
2 pages
Ebooks File Introductory Statistics For Data Analysis Warren J. Ewens All Chapters
No ratings yet
Ebooks File Introductory Statistics For Data Analysis Warren J. Ewens All Chapters
49 pages
Light Class 7 QUS ANS
No ratings yet
Light Class 7 QUS ANS
3 pages
Azure Load Balancer: Basic Standard
100% (1)
Azure Load Balancer: Basic Standard
20 pages
BEGC - 114 24-25 Assignment - 250310 - 120425
No ratings yet
BEGC - 114 24-25 Assignment - 250310 - 120425
4 pages
M350 Network Configuration Instructions
No ratings yet
M350 Network Configuration Instructions
13 pages
Reclaiming Our Roman Catholic Birthright The Genius and Timeliness of The Traditional Latin Mass Peter Kwasniewski Download
No ratings yet
Reclaiming Our Roman Catholic Birthright The Genius and Timeliness of The Traditional Latin Mass Peter Kwasniewski Download
136 pages
Legal Implications of All-Caps Names
No ratings yet
Legal Implications of All-Caps Names
23 pages
Top Notch Listening
100% (2)
Top Notch Listening
1 page
Occult Symbolism Explained
100% (1)
Occult Symbolism Explained
26 pages
Haiwell PLC Instruction Guide
No ratings yet
Haiwell PLC Instruction Guide
6 pages