Algorithms for applying N-grams on emotional speech recognition and text correction

Abstract

Statistical language model aims to estimate the probability distribution of various linguistic units such as words and sentences. Language models employ statistical estimation techniques using text. The most popular language models are N-grams models. These models are fundamental to a variety of language technologies, such as speech and optical recognition, statistical machine translation, and spelling correction. In the framework of this work, two new algorithms are introduced, for applying N-grams models in emotional speech recognition and sentence correction.This work can be divided into two sections. The first one presents the algorithm for applying N- grams in emotional speech recognition. In spite of the remarkable recent progress in Large Vocabulary Recognition (LVR), it is still far behind the ultimate goal of recognising emotional speech. Read speech and non-read speech in a ‘careful’ style can be recognised with high accuracy using the state-of-the-art speech recognition tech ...
show more

All items in National Archive of Phd theses are protected by copyright.

DOI
10.12681/eadd/16871
Handle URL
http://hdl.handle.net/10442/hedi/16871
ND
16871
Alternative title
Αλγόριθμοι εφαρμογής των N-grams στην αναγνώριση συναισθηματικού λόγου και στην διόρθωση κειμένων
Author
Athanaselis, Theologos (Father's name: Dimitrios)
Date
2007
Degree Grantor
National Technical University of Athens (NTUA)
Committee members
Καραγιάννης Γεώργιος
Κόλλιας Στέφανος
Μαραγκός Πέτρος
Σταφυλοπάτης Ανδρέας
Σελλής Τίμος
Τσανάκας Παναγιώτης
Σαρρής Εμμανουήλ
Discipline
Engineering and TechnologyElectrical Engineering, Electronic Engineering, Information Engineering
Keywords
Emotional speech recognition; Emotionally oriented language model; Dictionary of affect in language; Text correction; Word order errors; Fast search algorithm; Permutations filtering; Confusion matrix
Country
Greece
Language
Greek
Description
164 σ., im.
Usage statistics
VIEWS
Concern the unique Ph.D. Thesis' views for the period 07/2018 - 07/2023.
Source: Google Analytics.
ONLINE READER
Concern the online reader's opening for the period 07/2018 - 07/2023.
Source: Google Analytics.
DOWNLOADS
Concern all downloads of this Ph.D. Thesis' digital file.
Source: National Archive of Ph.D. Theses.
USERS
Concern all registered users of National Archive of Ph.D. Theses who have interacted with this Ph.D. Thesis. Mostly, it concerns downloads.
Source: National Archive of Ph.D. Theses.
Related items (based on users' visits)