Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs....

52
Statistical analysis of acoustical parameters in the voice of children with juvenile dysphonia Miklós Gábriel Tulics, Ferenc Kazinczi and Klára Vicsi BUTE Laboratory of Speech Acoustics - Department of Telecommunications and Media Informatics 1

Transcript of Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs....

Page 1: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Statistical analysis of acoustical

parameters in the voice of children

with juvenile dysphonia

Miklós Gábriel Tulics, Ferenc Kazinczi and Klára Vicsi

BUTE

Laboratory of Speech Acoustics - Department of Telecommunications and Media Informatics

1

Page 2: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Goal

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia225/08/2016

Page 3: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Goal

To build an automatic tool that can recognize disordered speech, which is generally caused by

the defect of the vocal organs.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia225/08/2016

Page 4: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Goal

To build an automatic tool that can recognize disordered speech, which is generally caused by

the defect of the vocal organs.

This is done by classification methods separating healthy from disordered speech. In this case

the training data should statistically cover the data we want to recognize (testing data).

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia225/08/2016

Page 5: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Goal

To build an automatic tool that can recognize disordered speech, which is generally caused by

the defect of the vocal organs.

This is done by classification methods separating healthy from disordered speech. In this case

the training data should statistically cover the data we want to recognize (testing data).

Most of the research in this field focuses mainly on adult voices, but what about the children's

voices?

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia225/08/2016

Page 6: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Goal

To build an automatic tool that can recognize disordered speech, which is generally caused by

the defect of the vocal organs.

This is done by classification methods separating healthy from disordered speech. In this case

the training data should statistically cover the data we want to recognize (testing data).

Most of the research in this field focuses mainly on adult voices, but what about the children's

voices?

• Main question: Is it necessary to build a completely different system in order to

automatically recognize functional dysphonia (FD) in children's cases or is it possible to

train the system with healthy and pathological voices of adults?

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia225/08/2016

Page 7: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

What is functional dysphonia?

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia325/08/2016

Page 8: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

What is functional dysphonia?

• Dysphonia is the disorder of the articulation as a complex function. It is a pathological condition

showing varied based symptoms due to several etiologic factors and pathogenesis diversity.

• Functional dysphonia (FD) is a multifaceted voice disorder. It refers to a voice problem in the absence

of a physical condition.

• Juvenile dysphonia is when functional dysphonia occurs at an early age.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia325/08/2016

Page 9: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

What is functional dysphonia?

• Dysphonia is the disorder of the articulation as a complex function. It is a pathological condition

showing varied based symptoms due to several etiologic factors and pathogenesis diversity.

• Functional dysphonia (FD) is a multifaceted voice disorder. It refers to a voice problem in the absence

of a physical condition.

• Juvenile dysphonia is when functional dysphonia occurs at an early age.

• Discomfort associated with dysphonia are: pressure on the neck, forced coughing,

shortness of breath.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia325/08/2016

Page 10: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

What is functional dysphonia?

• Dysphonia is the disorder of the articulation as a complex function. It is a pathological condition

showing varied based symptoms due to several etiologic factors and pathogenesis diversity.

• Functional dysphonia (FD) is a multifaceted voice disorder. It refers to a voice problem in the absence

of a physical condition.

• Juvenile dysphonia is when functional dysphonia occurs at an early age.

• Discomfort associated with dysphonia are: pressure on the neck, forced coughing,

shortness of breath.

• The frequency of dysphonia among the 3-10-year-old population can be put between 20-30%. The data

therefore suggest that almost every fourth or fifth child produces a pathological voice. The studies

agree that dysphonia is more often found among boys than girls, the ratio being 70-30%.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia325/08/2016

Page 11: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

In this study

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia425/08/2016

Page 12: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

In this study

• Sustained voice or continuous speech?

Acoustic parameters like Jitter, Shimmer, HNR (Harmonics-to-Noise Ratio) in the automatic classification of results from

the healthy and pathological voices are improved in a big extent using continuous speech

• In the present study the differences and the similarities between adult and children’s voice was

analyzed statistically using continuous speech.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia425/08/2016

Page 13: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

In this study

• Sustained voice or continuous speech?

Acoustic parameters like Jitter, Shimmer, HNR (Harmonics-to-Noise Ratio) in the automatic classification of results from

the healthy and pathological voices are improved in a big extent using continuous speech

• In the present study the differences and the similarities between adult and children’s voice was

analyzed statistically using continuous speech.

• The acoustic parameters were also compared with two sample T-tests in the case of children, between

healthy and pathological group.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia425/08/2016

Page 14: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

In this study

• Sustained voice or continuous speech?

Acoustic parameters like Jitter, Shimmer, HNR (Harmonics-to-Noise Ratio) in the automatic classification of results from

the healthy and pathological voices are improved in a big extent using continuous speech

• In the present study the differences and the similarities between adult and children’s voice was

analyzed statistically using continuous speech.

• The acoustic parameters were also compared with two sample T-tests in the case of children, between

healthy and pathological group.

• Different approaches were carried out: acoustic parameters from (the Hungarian SAMPA) vowels [E]

and [o] were extracted from adult and children’s speech samples and compared by statistical

analyses.

• Differences and similarities of healthy voice samples between the adult and child group was examined.

At the beginning of our research male and female samples were treated together, seeing the difference

we arrived at the conclusion that it is better to treat them separately.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia425/08/2016

Page 15: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Healthy Adults Speech Database

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia525/08/2016

Page 16: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Healthy Adults Speech Database

• In this work a total of 84 recordings were used from healthy people (42 female and 42 male)

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia525/08/2016

Page 17: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Healthy Adults Speech Database

• In this work a total of 84 recordings were used from healthy people (42 female and 42 male)

• Near field microphone (Monacor ECM-100), with Creative Soundblaster Audigy 2 NX: an

outer USB sound card with 44,100Hz sampling rate, at a 16-bit linear coding.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia525/08/2016

Page 18: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Healthy Adults Speech Database

• In this work a total of 84 recordings were used from healthy people (42 female and 42 male)

• Near field microphone (Monacor ECM-100), with Creative Soundblaster Audigy 2 NX: an

outer USB sound card with 44,100Hz sampling rate, at a 16-bit linear coding.

• The duration of the recordings are about one minute each. Every patient had to read out

aloud one of Aesop's Fables, “The North Wind and the Sun”. This folktale is frequently used

in phoniatrics as an illustration of spoken language. It has been translated into several

languages, Hungarian included.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia525/08/2016

Page 19: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 20: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

• 20 healthy and 12 (1 female and 11 male) recordings from children diagnosed

with juvenile dysphonia (furthermore referred as FD children).

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 21: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

• 20 healthy and 12 (1 female and 11 male) recordings from children diagnosed

with juvenile dysphonia (furthermore referred as FD children).

• Conditions: microphone (Monacor ECM-100), Creative Soundblaster Audigy 2

NX outer USB sound card, with 44,100Hz sampling rate and 16-bit linear

coding.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 22: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

• 20 healthy and 12 (1 female and 11 male) recordings from children diagnosed

with juvenile dysphonia (furthermore referred as FD children).

• Conditions: microphone (Monacor ECM-100), Creative Soundblaster Audigy 2

NX outer USB sound card, with 44,100Hz sampling rate and 16-bit linear

coding.

• The duration of the recordings are about 20 seconds each. The children told a

two versed poem.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 23: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

• 20 healthy and 12 (1 female and 11 male) recordings from children diagnosed

with juvenile dysphonia (furthermore referred as FD children).

• Conditions: microphone (Monacor ECM-100), Creative Soundblaster Audigy 2

NX outer USB sound card, with 44,100Hz sampling rate and 16-bit linear

coding.

• The duration of the recordings are about 20 seconds each. The children told a

two versed poem.

• All the children from the database are aged 5 to 6. All the recordings were

made in the presence of the children's parents.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 24: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Participants and methods / Juvenile dysphonia and Healthy Child

Speech Database

• 20 healthy and 12 (1 female and 11 male) recordings from children diagnosed

with juvenile dysphonia (furthermore referred as FD children).

• Conditions: microphone (Monacor ECM-100), Creative Soundblaster Audigy 2

NX outer USB sound card, with 44,100Hz sampling rate and 16-bit linear

coding.

• The duration of the recordings are about 20 seconds each. The children told a

two versed poem.

• All the children from the database are aged 5 to 6. All the recordings were

made in the presence of the children's parents.

• Both databases were annotated and segmented on phoneme level, using the

SAMPA phonetic alphabet.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia625/08/2016

FD child

Healthy child

Page 25: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Pre-processing methods

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia725/08/2016

Page 26: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Pre-processing methods

• Among the 14 Hungarian vowels, [E] and [o] are usually

analysed in case of adults. There are approximately 50 [E]

vowels in the tale that was read.

• In the case of the children, the vowel [o] is the poem’s

most frequent one, with 16 pieces, and there are only 9

pieces of the vowel [E]. The statistical analyses were made

extracting the vowels [E], [o] from each database.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia725/08/2016

Page 27: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Pre-processing methods

• Among the 14 Hungarian vowels, [E] and [o] are usually

analysed in case of adults. There are approximately 50 [E]

vowels in the tale that was read.

• In the case of the children, the vowel [o] is the poem’s

most frequent one, with 16 pieces, and there are only 9

pieces of the vowel [E]. The statistical analyses were made

extracting the vowels [E], [o] from each database.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia725/08/2016

Parameters Short description

Jitter_ddp the average absolute difference between

consecutive differences between

consecutive periods, divided by the average

period

Shimmer_ddp the average absolute difference between

consecutive differences between the

amplitudes of consecutive periods

HNR Harmonics-to-Noise Ratio quantifies noise in

the speech signal, caused mainly due to

incomplete vocal fold closure

MFCC1 the first component of the mel-frequency

cepstral coefficients

Page 28: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Pre-processing methods

• Among the 14 Hungarian vowels, [E] and [o] are usually

analysed in case of adults. There are approximately 50 [E]

vowels in the tale that was read.

• In the case of the children, the vowel [o] is the poem’s

most frequent one, with 16 pieces, and there are only 9

pieces of the vowel [E]. The statistical analyses were made

extracting the vowels [E], [o] from each database.

• For the extraction of the acoustic parameters Praat

software was used. The acoustic parameters were

measured in the middle of the vowels.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia725/08/2016

Parameters Short description

Jitter_ddp the average absolute difference between

consecutive differences between

consecutive periods, divided by the average

period

Shimmer_ddp the average absolute difference between

consecutive differences between the

amplitudes of consecutive periods

HNR Harmonics-to-Noise Ratio quantifies noise in

the speech signal, caused mainly due to

incomplete vocal fold closure

MFCC1 the first component of the mel-frequency

cepstral coefficients

Page 29: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Acoustic parameters

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia825/08/2016

Page 30: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Acoustic parameters

Acoustic Parameters Short description Equation

Jitter_ddp the average absolute difference between consecutive

differences between consecutive periods, divided by

the average period

𝐽𝑖𝑡𝑡𝑒𝑟(𝑑𝑑𝑝) =1

𝑁−2

𝑖=2

𝑁−1(𝑇𝑖+1−𝑇𝑖)−(𝑇𝑖−𝑇𝑖−1)

1

𝑁

𝑖=1

𝑁𝑇𝑖

[%],

Ti is the duration of the i-th interval and N is the number of intervals

Shimmer_ddp the average absolute difference between consecutive

differences between the amplitudes of consecutive

periods

𝑆ℎ𝑖𝑚𝑚𝑒𝑟(𝑑𝑑𝑝)

=

1𝑁 − 2𝑖=2

𝑁−1(𝐴𝑖+1 − 𝐴𝑖) − (𝐴𝑖 − 𝐴𝑖−1)

1𝑁σ𝑖=1𝑁 𝐴𝑖

[%]

HNR Harmonics-to-Noise Ratio quantifies noise in the

speech signal, caused mainly due to incomplete vocal

fold closure

𝐻𝑁𝑅 = 10 ∙ 𝑙𝑔𝐸𝐻

𝐸𝑁[dB]

MFCC1 the first component of the mel-frequency cepstral

coefficients 𝑐𝑘−1 = σ𝑗=1𝑁 𝑃𝑗 cos

𝜋 𝑘−1 𝑗−0,5

𝑁,

N represents the number of spectral values and Pj the power in dB of the jth spectral value (k runs from 1 to N)

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia825/08/2016

Page 31: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Statistical analyses

• SPSS20.0 software was used

• Two sample T-tests were used for statistical significance testing.

• Where F tests showed significant variances of an acoustic parameter within the groups (with

significance level 95% (α = 0.05), Welch’s T-test was used.

• Our assumption is that the distributions are normal, but T tests are relatively robust to

moderate violations of the normality assumption.

• The null hypothesis is that the means are equal.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia925/08/2016

Page 32: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy and FD children

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1025/08/2016

Page 33: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy and FD children

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1025/08/2016

[E] [o]

p - values

Jitter_ddp 0.018** 0.000***

Shimmer_ddp 0.000*** 0.000***

mean_HNR 0.072* 0.003***

MFCC1 0.000*** 0.000***

Page 34: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy and FD children

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1025/08/2016

[E] [o]

p - values

Jitter_ddp 0.018** 0.000***

Shimmer_ddp 0.000*** 0.000***

mean_HNR 0.072* 0.003***

MFCC1 0.000*** 0.000***

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

Page 35: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy and FD children

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1025/08/2016

[E] [o]

p - values

Jitter_ddp 0.018** 0.000***

Shimmer_ddp 0.000*** 0.000***

mean_HNR 0.072* 0.003***

MFCC1 0.000*** 0.000***

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

The poem contains 9 instances of [E] and 16

instances of [o] sounds.

Page 36: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /1

11

Page 37: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /1

11

Vowel

[o] [E]

Child Male Child Male

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 1.095 0.740 1.448 1.533 0.020** 1.414 1.084 1.986 1.791 0.000***

Shimmer_ddp 7.514 3.698 8.654 6.962 0.109 9.669 5.268 12.125 10.070 0.003***

mean_HNR 17.982 4.232 12.872 4.776 0.000*** 13.262 3.914 8.337 4.068 0.000***

MFCC1 245.977 45.554 265.013 54.779 0.000*** 175.224 32.372 208.357 51.887 0.000***

Page 38: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /1

11

Vowel

[o] [E]

Child Male Child Male

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 1.095 0.740 1.448 1.533 0.020** 1.414 1.084 1.986 1.791 0.000***

Shimmer_ddp 7.514 3.698 8.654 6.962 0.109 9.669 5.268 12.125 10.070 0.003***

mean_HNR 17.982 4.232 12.872 4.776 0.000*** 13.262 3.914 8.337 4.068 0.000***

MFCC1 245.977 45.554 265.013 54.779 0.000*** 175.224 32.372 208.357 51.887 0.000***

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

Page 39: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /2

12

Page 40: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /2

12

Vowel

[o] [E]

Child Female Child Female

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 1.095 0.740 0.904 1.026 0.013** 1.414 1.084 1.173 1.155 0.011**

Shimmer_ddp 7.514 3.698 5.813 4.653 0.000*** 9.669 5.268 7.755 5.717 0.000***

mean_HNR 17.982 4.232 17.109 5.223 0.370 13.262 3.914 12.764 4.408 0.128

MFCC1 245.977 45.554 249.780 47.506 0.347 175.224 32.372 177.965 43.429 0.320

Page 41: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /2

12

Vowel

[o] [E]

Child Female Child Female

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 1.095 0.740 0.904 1.026 0.013** 1.414 1.084 1.173 1.155 0.011**

Shimmer_ddp 7.514 3.698 5.813 4.653 0.000*** 9.669 5.268 7.755 5.717 0.000***

mean_HNR 17.982 4.232 17.109 5.223 0.370 13.262 3.914 12.764 4.408 0.128

MFCC1 245.977 45.554 249.780 47.506 0.347 175.224 32.372 177.965 43.429 0.320

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

Page 42: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy adults and children /2

12

Vowel

[o] [E]

Child Female Child Female

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 1.095 0.740 0.904 1.026 0.013** 1.414 1.084 1.173 1.155 0.011**

Shimmer_ddp 7.514 3.698 5.813 4.653 0.000*** 9.669 5.268 7.755 5.717 0.000***

mean_HNR 17.982 4.232 17.109 5.223 0.370 13.262 3.914 12.764 4.408 0.128

MFCC1 245.977 45.554 249.780 47.506 0.347 175.224 32.372 177.965 43.429 0.320

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

Differences exist in the examined acoustical

parameters even between healthy child and

healthy adult groups.

A decision system that inquiries child’s voice

trained with adult voice samples would likely

detract erroneous conclusions

Page 43: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy males and females

13

Page 44: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy males and females

13

Vowel

[o] [E]

Female Male Female Male

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 0.904 1.026 1.448 1.533 0.000*** 1.173 1.155 1.986 1.791 0.000***

Shimmer_ddp 5.813 4.653 8.654 6.962 0.000*** 7.755 5.717 12.125 10.070 0.000***

mean_HNR 17.109 5.223 12.872 4.776 0.000*** 12.764 4.408 8.337 4.068 0.000***

MFCC1 249.780 47.506 265.013 54.779 0.007*** 177.965 43.429 208.357 51.887 0.000***

Page 45: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy males and females

13

Vowel

[o] [E]

Female Male Female Male

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 0.904 1.026 1.448 1.533 0.000*** 1.173 1.155 1.986 1.791 0.000***

Shimmer_ddp 5.813 4.653 8.654 6.962 0.000*** 7.755 5.717 12.125 10.070 0.000***

mean_HNR 17.109 5.223 12.872 4.776 0.000*** 12.764 4.408 8.337 4.068 0.000***

MFCC1 249.780 47.506 265.013 54.779 0.007*** 177.965 43.429 208.357 51.887 0.000***

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

Page 46: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Results / Comparison of healthy males and females

13

Vowel

[o] [E]

Female Male Female Male

Mean Std. Dev. Mean Std. Dev. p-value Mean Std. Dev. Mean Std. Dev. p-value

Jitter_ddp 0.904 1.026 1.448 1.533 0.000*** 1.173 1.155 1.986 1.791 0.000***

Shimmer_ddp 5.813 4.653 8.654 6.962 0.000*** 7.755 5.717 12.125 10.070 0.000***

mean_HNR 17.109 5.223 12.872 4.776 0.000*** 12.764 4.408 8.337 4.068 0.000***

MFCC1 249.780 47.506 265.013 54.779 0.007*** 177.965 43.429 208.357 51.887 0.000***

• * p < 0.1

• ** p < 0.05 -> indicates strong evidence against the null hypothesis, so the null hypothesis is rejected

• *** p < 0.01

It is reasonable to separate

male and female samples

when we have small dataset.

Page 47: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Conclusions

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1425/08/2016

Page 48: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Conclusions

Variations of Jitter and Shimmer values with HNR and MFCC1 are good indicators to

separate healthy and FD voices in case of children as well.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1425/08/2016

Page 49: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Conclusions

Variations of Jitter and Shimmer values with HNR and MFCC1 are good indicators to

separate healthy and FD voices in case of children as well.

Healthy samples of children and adult voices were compared giving the clear conclusion

that differences exist in the examined acoustical parameters even between healthy child and

healthy adult groups.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1425/08/2016

Page 50: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Conclusions

Variations of Jitter and Shimmer values with HNR and MFCC1 are good indicators to

separate healthy and FD voices in case of children as well.

Healthy samples of children and adult voices were compared giving the clear conclusion

that differences exist in the examined acoustical parameters even between healthy child and

healthy adult groups.

It is necessary to carry out the investigations separately on children's voices as well, we

cannot use adult voices to make any conclusions to children's voices.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1425/08/2016

Page 51: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

Conclusions

Variations of Jitter and Shimmer values with HNR and MFCC1 are good indicators to

separate healthy and FD voices in case of children as well.

Healthy samples of children and adult voices were compared giving the clear conclusion

that differences exist in the examined acoustical parameters even between healthy child and

healthy adult groups.

It is necessary to carry out the investigations separately on children's voices as well, we

cannot use adult voices to make any conclusions to children's voices.

In order to build an automatic decision making system that recognizes FD it is advisable to

train the system separately for adult males, adult females and children.

Tulics - Statistical analysis of acoustical parameters in the

voice of children with juvenile dysphonia1425/08/2016

Page 52: Statistical analysis of acoustical parameters in the voice ... · the defect of the vocal organs. This is done by classification methods separating healthy from disordered speech.

[email protected]

Thank you for your attention!