KEMBAR78
DSP in Speech Processing | PDF | Speech Recognition | Digital Signal Processing
0% found this document useful (0 votes)
830 views11 pages

DSP in Speech Processing

It is a small PowerPoint presentation on the role played by Digital Signal Processing in speech processing.

Uploaded by

Ketan Garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
830 views11 pages

DSP in Speech Processing

It is a small PowerPoint presentation on the role played by Digital Signal Processing in speech processing.

Uploaded by

Ketan Garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 11

DSP IN SPEECH

PROCESSING
Applications of Digital Signal Processing in
Speech Analysis
APPLICATIONS OF DSP IN
SPEECH PROCESSING

Submitted to: Submitted By:


Mrs. Geetika Dua Mam Ketan Garg
101686006
E.C.E - 2
Title and Content

DSP in DSP in DSP in Other


Speech Speech
Speech Speech Speech
Processing What is speech Recognition Speech Synthesis Speech
Coding Speech Coding Applications
processing? Recognition Synthesis

Why DSP of
speech?

Applications in
real time
What is speech processing?

Speech processing is the study


of speech signals and the
processing methods of these
signals. The signals are usually
processed in
a digital representation, so speech
processing can be regarded as a
special case of digital signal
processing, applied to speech
signal. Aspects of speech
processing includes the
acquisition, manipulation,
storage, transfer and output of
speech signals. The input is
called speech recognition and the
output is called speech synthesis.
Why digital processing of speech?

• digital processing of speech signals (DPSS) enjoys an extensive theoretical and experimental base developed
over the past 75 years .
• much research has been done since 1965 on the use of digital signal processing in speech communication
problems .
• highly advanced implementation technology (VLSI) exists that is well matched to the computational
demands of DPSS .
• Digital signals do not get corrupted by noise etc. You are sending a series of numbers that represent the
signal of interest (i.e. audio, video etc.)
• Digital speech processing is reliable and flexible technique.
• Real time implementations can be possible on inexpensive DSP chips
Applications in real time

• Speech Coding

• Speech Recognition

• Speech Synthesis
Speech Coding

Speech Coding is the process of transforming a speech signal into a representation for efficient transmission and
storage of speech. This is only possible because of vast algorithms of Digital Signal Processing. By the means of
Speech Coding we can save network bandwidth and storage requirements also. Some typical examples which use
Speech Coding are given bellow:

• narrowband and broadband wired telephony

• cellular communications

• Voice over IP (VoIP) to utilize the Internet as a real-time communications medium

• secure voice for privacy and encryption for national security applications

• extremely narrowband communications channels, e.g., battlefield applications using HF radio

• storage of speech for telephone answering machines, IVR systems, pre recorded messages
Speech Recognition

Speech recognition is the inter-disciplinary sub-field of computational linguistics that develops methodologies and
technologies that enables the recognition and translation of spoken language into text by computers. It is also
known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT).

Speech recognition applications include voice user interfaces such as voice dialing (e.g. "Call home"), call routing
(e.g. "I would like to make a collect call"), domotic appliance control, search (e.g. find a podcast where particular
words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents
(e.g. a radiology report), speech-to-text processing (e.g., word processors or emails), and aircraft (usually
termed Direct Voice Input).

Some of the biggest examples of Speech Recognition these days are:

• Google Assistant

• Siri

• Cortana

These technologies use various features of Digital Sinal Processing due to which they can translate our speech into
text and make our work easy like if we want to make call just said ‘Call Mom’.
Speech Synthesis

Synthesis of Speech is the process of generating a speech signal using computational means for effective human
machine interactions

• machine reading of text or email messages

• telematics feedback in automobiles

• talking agents for automatic transactions

• automatic agent in customer care call center

• handheld devices such as foreign language phrasebooks, dictionaries, crossword puzzle helpers

• announcement machines that provide information such as stock quotes, airlines schedules, weather reports, etc.
Overview of TTS system

It is the general representation of TTS system.


DSP chips or Speech Synthesizers accept text as
raw data and then by various computations text is
converted into speech.
This technology is very useful for blind persons
like we can use it in Blind Sticks they can easily
listen what is on the display or can easily listen
the instructions.
Other Speech Applications

• Speaker Verification: for secure access to premises, information, virtual spaces

• Speaker Recognition: for legal and forensic purposes— national security; also for personalized services

• Speech Enhancement: for use in noisy environments, to eliminate echo, to align voices with video segments,
to change voice qualities, to speed-up or slow-down prerecorded speech (e.g., talking books, rapid review of
material, careful scrutinizing of spoken material, etc) potentially to improve intelligibility and naturalness of
speech

• Language Translation: to convert spoken words in one language to another to facilitate natural language
dialogues between people speaking different languages, i.e., tourists, business people

You might also like