<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sevana Oy Blog &#187; p.564</title>
	<atom:link href="http://blog.sevana.fi/tag/p-564/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.sevana.fi</link>
	<description>The Sevana Product Blog</description>
	<lastBuildDate>Tue, 02 Aug 2011 05:22:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>NIQA Non-Intrusive voice Quality Analyzer (alternative for ITU P.563)</title>
		<link>http://blog.sevana.fi/niqa-non-intrusive-voice-quality-analyzer-alternative-for-itu-p-563/</link>
		<comments>http://blog.sevana.fi/niqa-non-intrusive-voice-quality-analyzer-alternative-for-itu-p-563/#comments</comments>
		<pubDate>Sat, 24 Apr 2010 19:00:17 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Voice and Sound Quality Testing Software]]></category>
		<category><![CDATA[4G]]></category>
		<category><![CDATA[automated]]></category>
		<category><![CDATA[LTE]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[mos]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[non-intrusive]]></category>
		<category><![CDATA[p.563]]></category>
		<category><![CDATA[p.564]]></category>
		<category><![CDATA[Perceptual]]></category>
		<category><![CDATA[PSTN]]></category>
		<category><![CDATA[qos]]></category>
		<category><![CDATA[quality]]></category>
		<category><![CDATA[real-time]]></category>
		<category><![CDATA[testing]]></category>
		<category><![CDATA[voice]]></category>
		<category><![CDATA[voice quality testing QoS MOS method methods pesq]]></category>
		<category><![CDATA[VoIP]]></category>
		<category><![CDATA[vqt]]></category>

		<guid isPermaLink="false">http://wordpress.sevana.fi/?p=122</guid>
		<description><![CDATA[Modern standard methods for evaluating quality of transmitted speech Voice quality is one of the main characteristics of speech transmission systems. When analyzing voice quality one must not only consider audio signal degradation caused by transmission over telecom channels, but also specifics of speaker&#8217;s voice, conditions of listener&#8217;s hearing and variation of these parameters in [...]]]></description>
			<content:encoded><![CDATA[<p lang="en-US"><span style="font-family: Arial;"><span style="font-size: medium;"><strong>Modern standard methods for evaluating quality of transmitted speech</strong></span></span></p>
<p lang="en-US"><span style="font-family: Arial;">Voice quality is one of the main characteristics of speech transmission systems. When analyzing voice quality one must not only consider audio signal degradation caused by transmission over telecom channels, but also specifics of speaker&#8217;s voice, conditions of listener&#8217;s hearing and variation of these parameters in time.</span></p>
<p lang="en-US"><span style="font-family: Arial;">The most known methods for quality evaluation of voice transmission systems were developed by Telecommunication Standardization Sector of International Telecommunications Union (ITU-T) in the middle of 90-s. Results of this work are presented in Recommendation P.800 (P.830) «Methods for subjective determination of transmission quality» [1, 2]. This document describes conditions for voice quality testing, audio contents, scoring and methods to evaluate results. Typically “Methods for subjective determination of transmission quality” are used to obtain mean subjective quality score according to five-digit scale  (Mean Opinion Score &#8211; MOS).</span></p>
<p lang="en-US"><span style="font-family: Arial;">Unfortunately P.800 recommendation tests may lead to ambiguous results. Recommendation is warning about comparing MOS scores received under different conditions and consider such approach incorrect. Besides that preforming tests according to P.800 takes a lot of time and requires a lot of testers involved in the process.</span></p>
<p><span style="font-family: Arial;">In order to move from subjective (MOS) scores to objective ones and to automate the quality measurement</span><span style="color: #ff0000;"><span style="font-family: Arial;">,</span></span><span style="font-family: Arial;"> ITU-T has developed the P.861 recommendation, which is based on low level quantitative measurements [3]. Recommendation P.861 is a follow-up of PSQM method (Perceptual Speech Quality Measurement), developed by KPN Research and devoted to objective analysis of speech codecs performance with a</span><span style="color: #ff0000;"><span style="font-family: Arial;"><em> </em></span></span><span style="font-family: Arial;">low level of degradation. </span></p>
<p lang="en-US"><span style="font-family: Arial;">However, it is impossible to utilize PSQM for evaluation of work of a real communication system because the method does not consider all the important factors influencing human perception. Among these factors are delay, jitter, packet loss as well as signal level clipping. </span></p>
<p><span style="font-family: Arial;">In February 2001 ITU-T has issued another recommendation ITU-T P.862 [4], which describes a more advanced algorithm for voice quality testing  – PESQ (Perceptual Evaluation of Speech Quality). The algorithm includes level and time aligning, human perception and cognitive modeling. Due to these additional operations the approach considers signal amplification/ attenuation in a communication system, time delays and jitter as well as spectrum bands, which are the most significant for human perception.</span><span style="color: #ff0000;"><span style="font-family: Arial;"> </span></span><span style="font-family: Arial;">Based on cognitive modeling PESQ also recalculates objective quality score into MOS values.</span></p>
<p lang="en-US"><span style="font-family: Arial;">A disadvantage of PESQ as well as other similar algorithm is the fact that they are based on comparing of two signals: original and transmitted through a communication system. This approach may create a range of difficulties connected with setting and preforming voice quality testing. One requires to arrange signal recording on both sides of the telecommunication system as well as records transmission to the test system. Besides this real time quality monitoring in such approach appears quite difficult as well.</span></p>
<p lang="en-US"><span style="font-family: Arial;">In order to solve the challenging issues mentioned above ITU-T has developed a new recommendation P.563 [5] introduced in May 2004. This recommendation determines algorithm for evaluating speech quality by listening to communication sessions. The algorithm takes into account single-side distortions, speech trunk parameters, noise and speech naturalness. Developers of P.563 call attention that P.563 does not provide overall quality estimation of speech transmission. Distortions driven by delays, echo, loss of loudness and everything related to two-sided interaction cannot be taken into consideration by this method. </span></p>
<p lang="en-US"><span style="font-family: Arial;">It&#8217;s widely thought that P.563 provides a high level of correlation between automated and expert quality scores. However, simple tests based on ITU-T sound database for codec testing [6]  may raise some doubts about the consistence of the algorithm provided together with its description.</span></p>
<p lang="en-US"><span style="font-family: Arial;">Table.1. Comparison between results of P.563 and expert estimations</span></p>
<table border="1" cellspacing="0" cellpadding="7" width="597">
<col width="218"></col>
<col width="34"></col>
<col width="58"></col>
<col width="229"></col>
<tbody>
<tr>
<td width="218" height="1">MOS Range</td>
<td colspan="2" width="106">
<p lang="en-US">Ava rage Score</p>
</td>
<td width="229">
<p lang="en-US">Average error</p>
</td>
</tr>
<tr>
<td width="218" height="1" valign="TOP"></td>
<td width="34">MOS</td>
<td width="58">P.563</td>
<td width="229" valign="TOP"></td>
</tr>
<tr valign="TOP">
<td width="218" height="1">4 – 5</td>
<td width="34">4,25</td>
<td width="58"><span style="color: #000000;"><span style="font-family: Arial;">2,45</span></span></td>
<td width="229"><span style="color: #000000;"><span style="font-family: Arial;">1,79</span></span></td>
</tr>
<tr valign="TOP">
<td width="218" height="1">3 – 4</td>
<td width="34">3,42</td>
<td width="58"><span style="color: #000000;"><span style="font-family: Arial;">1,70</span></span></td>
<td width="229"><span style="color: #000000;"><span style="font-family: Arial;">1,69</span></span></td>
</tr>
<tr valign="TOP">
<td width="218" height="1">2 – 3</td>
<td width="34">2,56</td>
<td width="58"><span style="color: #000000;"><span style="font-family: Arial;">1,71</span></span></td>
<td width="229"><span style="color: #000000;"><span style="font-family: Arial;">0,97</span></span></td>
</tr>
<tr valign="TOP">
<td width="218" height="1">1 – 2</td>
<td width="34">1,68</td>
<td width="58"><span style="color: #000000;"><span style="font-family: Arial;">1,49</span></span></td>
<td width="229"><span style="color: #000000;"><span style="font-family: Arial;">0,55</span></span></td>
</tr>
</tbody>
</table>
<p lang="en-US"><span style="font-family: Arial;">The problem discovered in the distributed P.563 algorithm implementation required development of an alternative solution. Further down one can find one of possible solutions that is implemented in Sevana  NIQA (Non-Intrusive Quality Analyzer).</span></p>
<p lang="en-US"><span style="font-family: Arial;">NIQA&#8217;s (Non-Intrusive Quality Analyzer) approach is based on a database of trained etalons called associations. Each association corresponds to a group of files that have close expert estimations of sound quality and common set of reasons for sound quality degradation. For each association NIQA calculates and stores a distribution of parameters&#8217; values.</span></p>
<p lang="en-US"><span style="font-family: Arial;">Basic algorithm showing how NIQA obtains sound quality scores is represented on the picture below.</span></p>
<p lang="en-US">﻿</p>
<div id="attachment_123" class="wp-caption aligncenter" style="width: 251px"><a href="http://wordpress.sevana.fi/wp-content/uploads/2010/04/Pic1.png"><img class="size-medium wp-image-123" title="NIQA - non-intrusive voice quality testing software algorithm flowchart" src="http://wordpress.sevana.fi/wp-content/uploads/2010/04/Pic1-241x300.png" alt="NIQA - non-intrusive voice quality testing software algorithm flowchart" width="241" height="300" /></a><p class="wp-caption-text">NIQA - non-intrusive voice quality testing software</p></div>
<p lang="en-US"><span style="font-family: Arial;">When loading sound signal the system excludes all fragments with low energy level (according to threshold). The excluded fragments correspond to “absolute silence” and are considered irrelevant for obtaining sound quality score.</span></p>
<p lang="en-US"><span style="font-family: Arial;">At the next phase the signal is split into frames used in voice activity detection algorithm (VAD). The system calculates energy values for each frame what increases accuracy of VAD. With the help of VAD algorithm the signal divides to active and inactive components that are processed separately. The system builds level histograms for both active and inactive signal components.</span></p>
<p lang="en-US"><span style="font-family: Arial;">By discrete cosine transform (DCT) the system obtains signal spectrum and checks the active components frames for DTMF presence and then excludes the frames that are similar to DTMF from further processing.</span></p>
<p lang="en-US"><span style="font-family: Arial;">Next stage applies the first level of psycho-acoustic model to the signal spectrum. This model checks different types of masking (including pre-masking and post-masking). According to clear peaks of spectrum energy the system splits the signal into tone and noise components.</span></p>
<p lang="en-US"><span style="font-family: Arial;">Second level of psycho-acoustic model performs energy normalization of the signal – energy levels are transformed into loudness levels at 1kHz. Third level of psycho-acoustic model transforms loudness levels into several detectable grades of loudness that allow to ignore sound signal changes, which are not recognized by human ear.</span></p>
<p lang="en-US"><span style="font-family: Arial;">The next step is to split signal spectrum into bands that are critical to human ear perception and calculate parameters both on and out of the bands. Based on the computed signal parameters the system selects most similar associations from the database and performs matching. According to selected associations the system determines how much each of them influence the overall quality and then generates the final voice quality score as a combination of scores for selected associations and according to correspondent weights.</span></p>
<p lang="en-US"><span style="font-family: Arial;"><br />
</span></p>
<p lang="en-US"><span style="font-family: Arial;"><span style="font-size: medium;"><strong>Sevana NIQA Testing and Evaluation</strong></span></span></p>
<p lang="en-US"><span style="font-family: Arial;">Sevana NIQA has been tested utilizing the same ITU-T speech database that is used for conformance testing of P.563 algorithm. In the tests we used a total of 376 English language recordings. All recordings were sorted into 4 groups depending on their MOS scores (represented in the documentation attached to the sound database). For all groups of recordings we determined average expert scores and average NIQA scores (Table 2). In order to illustrate comparison with P.563 we also calculated average errors for P.563 and NIQA scores for the same tests.</span></p>
<p lang="en-US"><span style="font-family: Arial;">Table.2. Comparison of NIQA scores against expert estimations</span></p>
<table border="1" cellspacing="0" cellpadding="7" width="465">
<col width="146"></col>
<col width="34"></col>
<col width="52"></col>
<col width="118"></col>
<col width="43"></col>
<tbody>
<tr>
<td width="146" height="1">MOS Range</td>
<td colspan="2" width="100" valign="TOP">
<p lang="en-US">Average Score</p>
</td>
<td colspan="2" width="175" valign="TOP">
<p lang="en-US">Average Error</p>
</td>
</tr>
<tr>
<td width="146" height="1" valign="TOP"></td>
<td width="34">MOS</td>
<td width="52" valign="TOP">
<p lang="en-US">NIQA</p>
</td>
<td width="118" valign="TOP">
<p lang="en-US">NIQA</p>
</td>
<td width="43" valign="TOP">P.563</td>
</tr>
<tr valign="TOP">
<td width="146" height="1">4 – 5</td>
<td width="34">4,25</td>
<td width="52">
<p lang="en-US">3,44</p>
</td>
<td width="118">
<p lang="en-US">0,83</p>
</td>
<td width="43"><span style="color: #000000;"><span style="font-family: Arial;">1,79</span></span></td>
</tr>
<tr valign="TOP">
<td width="146" height="1">3 – 4</td>
<td width="34">3,42</td>
<td width="52">
<p lang="en-US">3,06</p>
</td>
<td width="118">
<p lang="en-US">0,51</p>
</td>
<td width="43"><span style="color: #000000;"><span style="font-family: Arial;">1,69</span></span></td>
</tr>
<tr valign="TOP">
<td width="146" height="1">2 – 3</td>
<td width="34">2,56</td>
<td width="52">
<p lang="en-US">2,61</p>
</td>
<td width="118">
<p lang="en-US">0,43</p>
</td>
<td width="43"><span style="color: #000000;"><span style="font-family: Arial;">0,97</span></span></td>
</tr>
<tr valign="TOP">
<td width="146" height="1">1 – 2</td>
<td width="34">1,68</td>
<td width="52">
<p lang="en-US">2,36</p>
</td>
<td width="118">
<p lang="en-US">0,68</p>
</td>
<td width="43"><span style="color: #000000;"><span style="font-family: Arial;">0,55</span></span></td>
</tr>
</tbody>
</table>
<p lang="en-US"><span style="font-family: Arial;">The results clearly show that NIQA allows receiving much higher accuracy between generated quality scores and expert estimations than P.563. NIQA scores are less precise only for records with very low MOS scores (in the range from 1 to 2). In all other cases NIQA provides 2-3 times higher quality scores precision compared to MOS values. </span></p>
<p lang="en-US">
<p lang="en-US"><span style="font-family: Arial;"><span style="font-size: medium;"><strong>References</strong></span></span></p>
<dl>
<dt>
<p lang="en-US"><span style="font-family: Arial;">1. Methods for subjective determination of 	transmission quality // ITU-T Recommendation P.800 / 	http://www.itu.int/rec/T-REC-P.800/en</span></p>
</dt>
<dt>
<p lang="en-US"><span style="font-family: Arial;">2. Subjective performance assessment of 	telephone-band and wideband digital codecs // ITU-T Recommendation 	P.830 / http://www.itu.int/rec/T-REC-P.830/en</span></p>
</dt>
<dt>
<p lang="en-US"><span style="font-family: Arial;">3. Objective quality measurement of 	telephone-band (300-3400 Hz) speech codecs // ITU-T Recommendation 	P.861 / http://www.itu.int/rec/T-REC-P.861/en</span></p>
</dt>
<dt>
<p lang="en-US"><span style="font-family: Arial;">4. Perceptual evaluation of speech quality 	(PESQ): An objective method for end-to-end speech quality assessment 	of narrow-band telephone networks and speech codecs // ITU-T 	Recommendation P.862 / http://www.itu.int/rec/T-REC-P.862/en</span></p>
</dt>
<dt>
<p lang="en-US"><span style="font-family: Arial;">5. Single-ended method for objective speech 	quality assessment in narrow-band telephony applications // ITU-T 	Recommendation P.563 / 	http://www.itu.int/rec/T-REC-P.563-200405-I/en</span></p>
</dt>
</dl>
<p lang="en-US"><span style="font-family: Arial;">6. ITU-T coded-speech database // Supplement 23 to ITU-T P-series Recommendations / http://www.itu.int/rec/T-REC-P.Sup23-199802-I/en</span></p>
<p lang="en-US"><span style="font-family: Arial;">Copied from: </span><a title="Sevana NIQA - non-intrusive voice quality testing software" href="http://www.sevana.fi/non-intrusive-voice-quality-testing-software.php" target="_self">http://www.sevana.fi/non-intrusive-voice-quality-testing-software.php</a></p>
<p><img src="file:///F:/Pic1.PNG" alt="" /></p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://blog.sevana.fi/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share"/></a> </p>]]></content:encoded>
			<wfw:commentRss>http://blog.sevana.fi/niqa-non-intrusive-voice-quality-analyzer-alternative-for-itu-p-563/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Be the first to evaluate new Sevana non-intrusive voice quality testing software!</title>
		<link>http://blog.sevana.fi/be-the-first-to-evaluate-new-sevana-non-intrusive-voice-quality-testing-software/</link>
		<comments>http://blog.sevana.fi/be-the-first-to-evaluate-new-sevana-non-intrusive-voice-quality-testing-software/#comments</comments>
		<pubDate>Tue, 09 Mar 2010 12:38:46 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Voice and Sound Quality Testing Software]]></category>
		<category><![CDATA[4G]]></category>
		<category><![CDATA[automated]]></category>
		<category><![CDATA[CDMA]]></category>
		<category><![CDATA[communications]]></category>
		<category><![CDATA[GSM]]></category>
		<category><![CDATA[handset]]></category>
		<category><![CDATA[LTE]]></category>
		<category><![CDATA[Mean Opinion Score]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[mos]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[p.562]]></category>
		<category><![CDATA[p.563]]></category>
		<category><![CDATA[p.564]]></category>
		<category><![CDATA[Perceptual]]></category>
		<category><![CDATA[PSTN]]></category>
		<category><![CDATA[quality]]></category>
		<category><![CDATA[testing]]></category>
		<category><![CDATA[voice]]></category>
		<category><![CDATA[VoIP]]></category>
		<category><![CDATA[vqt]]></category>

		<guid isPermaLink="false">http://wordpress.sevana.fi/?p=119</guid>
		<description><![CDATA[Non-intrusive measurement perform at network nodes and may be a part of network routers/switches or other network equipment as well as standalone devices and handsets. Non-intrusive voice qualiy testing does not require a reference signal and can work with real communications data in real time. Non-intrusive techniques provide a possibility to test and monitor a [...]]]></description>
			<content:encoded><![CDATA[<p>Non-intrusive measurement perform at network nodes and may be a part of network routers/switches or other network equipment as well as standalone devices and handsets.</p>
<p>Non-intrusive voice qualiy testing does not require a reference signal and can work with real communications data in real time.</p>
<p>Non-intrusive techniques provide a possibility to test and monitor a greater amount of communications and therefore obtain a quite reliable information about the networks (VoIP, PSTN, GSM, CDMA, LTE) quality.</p>
<p>Today Sevana announces availability of new software for voice quality testing for customers&#8217; and partners&#8217; evaluation. Be among the first to test the new approach to voice quality testing and monitoring!</p>
<p><a href="mailto:sales@sevana.fi?Subject=Non-intrusive%20voice%20quality%20testing%20software%20inquiry" target="_blank">This software is delivered only upon request.</a></p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://blog.sevana.fi/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share"/></a> </p>]]></content:encoded>
			<wfw:commentRss>http://blog.sevana.fi/be-the-first-to-evaluate-new-sevana-non-intrusive-voice-quality-testing-software/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

