تجاوز إلى المحتوى الرئيسي
User Image

د. يوسف بن أحمد العوهلي

عميد عمادة التعاملات الإلكترونية والاتصالات، استاذ مشارك كلية علوم الحاسب والمعلومات

مادة دراسية

Arabic Voice Recognition

 

Abstract

Speech recognition is being the interest of scientists for more than four decades. Arabic speech recognition was an interest of Arabic scientists but they couldn’t reach a good performance until now. This is because they don’t have an existing speech dataset that they can build there systems on it.
In this project, we are going to build a Speech Recognition system using the HTK tool. We are also going to build a speaker-independent speech dataset that can be used in our project to train our system and to validate it. This speech dataset can be helpful in other Speech Recognition researches. We will also run our recognizer interactively so that we can make use of it in information retrieval applications

 

Table of Contents

Part1: Introduction     Part1

1.1 Speech Recognition Variables

1.1.1 Vocabulary Size 

1.1.2 Speaker Dependency 

1.1.3 Continuity of Speech 

1.1.4 Background Noise 

1.2 Problem Statement

1.3 Contributions 

1.4 Software Development Life Cycle Model

 

Part2: Speech DataSet 

(Database)    Part2

2.1 Previous Work 

2.2 Data Collection 

2.3Dataset Statistics 

2.4 Support Tools

Part3: Hidden Marcov Model 

(HMM)   Part3

3.1 Discrete Markov Process 

3.2 Extension of Hidden Marcov Models 

3.3 Elements of HMM   

3.4 The Three Basic Problems of HMM   

Problem 1:

Solution :

Problem 2:

Solution:

Problem 3:

Solution:

3.5 Types of HMM   

3.6 Optimization Criteria 

4. Hidden Marcov Model Toolkit (HTK)

4.1 Simple HTK Example

4.2 Difficulties in HTK

Part4: Modeling and Labeling Techniques      Part4

5.1 Example on Word Level Labeling Using HTK  

6. Speech Recognition Different Labeling Schemes Experiments 

6.1 Labeling on Word Level

6.2 Phonetic-Based Labeling 

6.3 Literal-Based Labeling (Old Version)

6.4 Literal-Based Labeling (New Version)

6.5 Comparison Between The Four Methods 

 

 Part5:Results of Word-Level Labeling    Part5

8. Results of Literal-Based Labeling 

9. Results of phonetic-Based Labeling 

10. Conclusion 

10.1 Future Work 




ملحقات المادة الدراسية