Title of Invention | NOISE SUPPRESSION |
---|---|
Abstract | A network noise suppressor includes means (113) for partially decoding a CELP coded bit-stream. Means (116) determine a noise suppressing filter <i>H(z)</i> from the decoded parameters. Means (118, 120) use this filter to determine modified LP and gain parameters. Means (122) overwrite corresponding parameters in the coded bit-stream with the modified parameters. |
Full Text | FORM 2 THE PATENTS ACT 1970 [39 OF 1970] COMPLETE SPECIFICATION [See Section 10; rule 13] "NOISE SUPPRESSION" TELEFONAKTIEBOLAGET LM ERICSSON (PUBL), A Swedish company, of S-126 25 Stockholm, Sweden, The following specification particularly describes and ascertain the nature of the invention and the manner in which it is to be performed :- original 788/mumnp/2003 24 MAY 2007 TECHNICAL FIELD The present invention relates to noise suppression in telephony systems, and in particular to network-based noise suppression. BACKGROUND Noise suppression is used to suppress any background acoustic sound su¬perimposed on the desired speech signal, while preserving the characteris¬tics of the speech. In most applications, the noise suppressor is implemented as a pre-processor to the speech encoder. The noise suppressor may also be implemented as an integral part of the speech encoder. There also exist implementations of noise suppression algorithms that are installed in the networks. The rationale for using these network-based im¬plementations is that a noise reduction can be achieved also when the ter¬minals do not contain any noise suppression. These algorithms operate on the PCM (Pulse Code Modulated) coded signal and are independent of the bit-rate of the speech-encoding algorithm. However, in a telephony system using low speech coding bit-rate (such as digital cellular systems), network based noise suppression can not be achieved without introducing a tandem encoding of the speech. For most current systems this is not a severe re¬striction, since the transmission in the core network usually is based on PCM coded speech, which means that the tandem coding already exists. However, for tandem free or transcoder free operation, a decoding and sub¬sequent encoding of the speech has to be performed within the noise-suppressing device itself, thus breaking the otherwise tandem free operation. A drawback of this method is that tandem coding introduces a degradation of the speech, especially for speech encoded at low bit-rates. 2 SUMMARY An object of the present invention is a noise reduction in an encoded speech signal formed by LP (Linear Predictive) coding, especially low bit-rate CELP (Code Excited Linear Predictive) encoded speech, without introducing any tandem encoding. This object is achieved in accordance with the attached claims. Briefly, the present invention is based on modifying the parameters con¬taining the spectral and gain information in the coded bit-stream while leaving the excitation signals unchanged. This gives noise suppression with improved speech quality for systems with transcoder free operation. BRIEF DESCRIPTION OF THE DRAWINGS The invention, together with further objects and advantages thereof, may best be understood by making reference to the following description taken together with the accompanying drawings, in which: Fig. 1 is a block diagram of a typical conventional communication sys¬tem including a network noise suppressor; Fig. 2 is a block diagram of another typical conventional communication system including a network noise suppressor; Fig. 3 is a simplified block diagram of the CELP synthesis model; Fig. 4 is a diagram illustrating the power transfer function of an LP synthesis filter; Fig. 5 is a diagram illustrating the power transfer function of a noise-suppressing filter; Fig. 6 is a diagram comparing the power transfer function of tile origi¬nal synthesis filter to the true and approximate noise suppressed filters; Fig. 7 is a block diagram of a communication system including a net¬work noise suppressor in accordance with the present invention; 3 Fig. 8 is a flow chart illustrating an exemplary embodiment of a noise suppression method in accordance with the present invention; Fig. 9 is a series of diagrams illustrating the modification of die noise suppressing filter; and Fig. 10 is a block diagram of an exemplary embodiment of a network noise suppressor in accordance with the present invention. DETAILED DESCRIPTION In the following description elements performing the same or similar functions have been provided with the same reference designations. Fig. 1 is a block diagram of a typical conventional communication system in¬cluding a network noise suppressor. A transmitting terminal 10 encodes speech and transmits the coded speech signal to a base station 12, where it is decoded into a PCM signal. The PCM signal is passed through a noise sup¬pressor 14 in the core network, and the modified PCM signal is passed to a second base station 16, in which it is encoded and transmitted to a receiving terminal 18, where it is decoded into a speech signal. Fig. 2 is a block diagram of another typical conventional communication sys¬tem including a network noise suppressor. This embodiment differs from the embodiment of fig. 1 in that the coded speech signal is also used in the core, network, thereby increasing the capacity of the network, since the coded sig¬nal requires a lower bit-rate than a conventional PCM signal. However, the noise suppression algorithm used performs the suppression on the PCM sig¬nal. For this reason the network noise suppressor in addition to the actual noise suppressor unit 14 also includes a decoder 13 for decoding the received coded speech signal into a PCM signal and an encoder 15 for encoding the modified PCM signal. This feature is called tandem encoding. A drawback of tandem encoding is that at low speech coding bit-rates the encoding-decoding-encoding process leads to a degradation in speech quality. The reason for this is that the decoded signal, on which the noise suppression algorithm is ap- 4 plied, may not accurately represent the original speech signal due to the low coding bit-rate. A second encoding of this signal (after noise suppression) may therefore lead to poor representation of the original speech signal. The present invention solves this problem by avoiding the second encoding step of the conventional systems. Instead of modifying the samples of a de¬coded PCM signal, the present invention performs noise suppression directly in the speech coded bit-stream by modifying certain speech parameters, as will be described in more detail below. The present invention will now be explained with reference to CELP coding. However, it is to be understood that the same principles may be used for any type of linear predictive coding Fig. 3 is a simplified block diagram of the CELP synthesis model. Vectors from a fixed codebook 20 and an adaptive codebook 22 are amplified by gains gc and gP, respectively, and added in an adder 24 to form an excitation signal u(n). This signal is forwarded to an LP synthesis filter 26 described by a filter 1/A(z), which produces a speech signal s(n). This can be described by the equation The parameters of the filter A(z) and the parameters defining excitation sig¬nal u(n) are derived from the bit-stream produced by the speech-encoder. A noise suppression algorithm can be described as a linear filter operating on the speech signal produced by the speech decoder, i.e. 5 where the (time-varying) filter H(z) is designed so as to suppress the noise while retaining the basic characteristics of the speech, see e.g. [1] for more details on the derivation of the filter H(z). Now, applying the knowledge of how the speech decoder produces the de¬coded speech, a noise-suppressed signal can be achieved at the output of the speech decoder as The basic idea of the invention is to approximate the filter H(z)/A(z) with an AR (Auto Regressive) filter A{z) of the same order as A(z} and a gain factor a. Thus, the noise-suppressed signal at the output of the speech decoder can be approximated as Hence, by replacing the parameters in the coded bit-stream describing the filter A(z) and the gain of the excitation signal with new parameters de¬scribing A(z) and a gain reduced by a, the noise suppression can be per¬formed without introducing any complete decoding and subsequent coding of the speech. Fig. 4 is a diagram illustrating the power transfer function of an LP synthesis filter. It is characterized by peaks at certain frequencies interconnected by valleys. Fig. 5 is a diagram illustrating the power transfer function of a noise-suppressing filter. It is noted that it has peaks at approximately the same frequencies as the spectrum in Fig. 4. The effect of applying this filter to the 6 spectrum in Fig. 4 is to sharpen the peaks and to lower the valleys, as illus¬trated by Fig. 6, which is a diagram comparing the power transfer function of the original synthesis filter to the true and approximate noise suppressed filters. Fig. 7 is a block diagram of a communication system including a network noise suppressor in accordance with the present invention. As can be seen from Fig. 7, the encoder between noise suppressor unit 114 and base station 16 has been etiminated. According to the invention, noise suppression is per¬formed directly on the parameters of the coded bit-stream, which makes the encoder unnecessary. Furthermore, decoder 113 may perform either a com¬plete or a partial decoding, depending on the algorithm used, as will be de¬scribed in further detail below. In both cases the decoding is only used to de¬termine the necessary modification of parameters in the coded bit-stream. As an example of how the modification of the bit stream is performed, the application of the present invention to the 12.2 kbit/s mode of the Adaptive Multi-Rate (AMR) speech encoder for the GSM and UMTS systems [2] will now be described with reference to Fig. 8. However, the present invention is not limited to this speech codec, but can easily be extended to any speech codec for which a parametric spectrum and a coded innovation sequence are part of the coded parameters. As seen from Fig. 3, the parameters to be modified in order to achieve the noise reduction are the parameters describ¬ing the LP synthesis filter A(z) and the gain of the fixed codebook gc. The codewords representing the fixed and adaptive codebook vectors do not have to be altered and neither does the adaptive codebook gain gp (in this mode). The procedure can be summarized by the following steps, which are illus¬trated in Fig. 8. SI. The first step is to transform the quantized LSP (Line Spectral Pair) representing filter A{z) to the corresponding filter coefficients{ai-}, as described in [2], section 5.2.4. 7 In order to determine the noise suppressing filter H(z) a measure of the power spectral density of the coded speech signal is required. Using the determined filter coefficients this can be found as where is obtained from the fixed codebook gain and adaptive codebook gain in accordance with Another possibility is to completely decode the speech signal and- to use the fast Fourier transform to obtain Determine the noise suppressing filter H(z) as where is the saved power spectral density from an earlier "pure noise" frame and are constants. Modify the filter defined by H(k) as described in [1]. This gives the de¬sired H(z). The reason for the modification is that noise suppressing filters designed in the frequency domain are real-valued, which, leads 8 to a time domain representation in which the peak of the filter is split between the beginning and end of the filter (this is equivalent to a filter that is symmetric around lag 0, i.e. a non-causal filter). This makes the filter unsuitable for circular block convolution, since such a filter will generate temporal aliasing. The performed modification is outlined in Fig. 9. It essentially involves transforming H(k) to the time domain, circularly shifting he transformed filter to make it causal and linear phase, applying a window (to avoid time domain aliasing) to the shifted filter to extract the most significant taps, circularly shifting the win¬dowed filter to remove the initial delay, and (optionally) transforming the linear phase filter to a minimum phase filter. An alternative modi¬fication method is described in [3]. 35. Approximate the IIR (Infinite Impulse Response) filter defined as H{z}/A[z) by a FIR (Finite Impulse Response) filter G{z) of length L. The coefficients of G(z) may be found as the first L coefficients of the im¬pulse response g(k) of H(z)/A(z) or by performing the polynomial divi¬sion H{z)/A(z) and identifying the coefficients for the z_1 ... TL terms. 36. Obtain A{z) from the auto correlation function of G(z) using the Levinson-Durbin algorithm, see [2] section 5.2.2. 7. Transform the coefficients that define into modified LSP pa- rameters as described in [2], section 5.2.3 8. Quantize and code modified LSP parameters as described in [2], sec¬tion 5.2.5 and replace the AR parameter code in the bit-stream. 9 S9. The fixed codebook gain modification a is defined by square root of the prediction error power, which is calculated in the same way as ELD in [2] section 5.2.2. , S10. For the gain of the excitation signal the procedure in section 6.1 of [2] is used. The fixed codebook gain is given by where the factor y(n) is the gain correction factor transmitted by the encoder. The factor g" is given by where E is a constant energy, E is the energy of the codeword, and where R(n) are past gain correction factors in a scaled logarithmic domain. Using the expressions above it is found that 10 The noise suppression algorithm modifies the gain by the factor a. Thus, the gain in the decoder should equal a times the gain in the encoder, i.e. Hence, the transmitted gain correction factor should be replaced by where Eenc(n) and Edec (n) are the predicted energies based on the gain factors transmitted by the encoder and the gain factors modified by the noise suppression algorithm. S11. Find the index of the codeword closest to ynew{n) and overwrite the original fixed codebook gain correction index in the coded bit-stream. In the described example the fixed and adaptive codebook gains are coded independently. In some coding modes with lower bit-rate they are vector quantized. In such a case the adaptive codebook gain will also be modified by the noise suppression. However, the excitation vectors are still un¬changed. Fig. 10 is a block diagram of an exemplary embodiment of a network noise suppressor in accordance with the present invention. The received coded bit-stream is (partially) decoded in block 113. Block 116 determines the noise suppressing filter H(z) from the decoded parameters. Block 118 calculates A(z) and oc. Block 120 determines the new linear predictive and gain pa¬rameters. Block 122 modifies the corresponding parameters in the coded bit stream. Typically the functions performed in the network noise suppressor are realized by one or several micro processors or micro/ signal processor combi¬nations. However, the same functions may also be realized by application spe¬cific integrated circuits (ASIC). 11 It will be understood by those skilled in the art that various modincations and changes may be made to the present invention without departure from the scope thereof, which is defined by the appended claims. REFERENCES [1] WO 01/18960 Al [2J "AMR speech codec; Transcoding functions", 3G TS 26.090 v3.1.0, 3GPP, France, 1999. [3] H. Gustafsson et al., "Spectral subtraction using correct convolution and a spectrum dependent exponential averaging method", Research Report 15/98, Department of Signal Processing, University of Karlskrona/Ronneby, Sweden, 1998 12. WE CLAIM 1. A noise suppression method including the steps of representing a noisy signal by a bit stream formed by signal encoding based on a linear predictive synthesis filter; detennining a noise suppressing Slter; determining a modified synthesis Slier approximately representing the cascade of said syaithesis filter and said noise suppressing filter- and replacing predetermined coding parameters representing said synthe¬sis filter with corresponding coding parameters representing said modified synthesis filter directly in the encoded bit stream. 2. The method of claim 1, including the atep of replacing at least one code-book gain, 3. the method of claim 2, including the step of replacing the fixed codebook gam. 4. The method of claim 1, including the step of replacing line spectral pair parameters and a fixed codebook gain correction factor. 5. The method of claim I, wherein predetermined parameters are kept ion-changed, 6. The metod of claim 5, wherein fixed codebook vectors axe kept un¬changed. 7. A noise suppression system including means for representing ft noisgr signal by a bit stream formed by signal encoding based on a linear predictive synthesis filter; means for determining a noise suppressing filter; 13 means for determining a modified synthesis filter approximately repre¬senting the cascade of said synthesis filter and said noise suppressing filter; and means for replacing predetermined coding parameters representing said synthesis filter with corresponding coding parameters representing said modified synthesis filter directly In the encoded bit stream. 8, The system of claim 7, including means for modifying at least one code-book gain. 9. The system of claims 8, inctiiding means for modifying the fixed codebook gam. 10. The system of claim 7, including means for modifying line spectral pair parameters and a taped codebook gain correction factor. 11. A network noise suppressor including means for receiving a hit stream representing a noisy signal, said fait stream toeing formed by signal encoding based on a linear predictive syuttie-Sis liter; means lor determining a noise suppressing filter; means for detemining a modified synthesis liter approximately repre¬senting the cascade of said synthesis filter and said noise suppressing filter; and means for replacing predetermined coding parameters representing said synthesis niter with corresponding coding parameters representing said modified synthesis filter directly in the encoded bit stream. 12. The network noise suppressor of claim 11, including means for modifying at least one codebook gain. 14 13. The network noise suppressor of claim 12, including means for modifying the fixed codebook gain. 14. The network noise suppressor of claim 11, including means for modifying line spectral pair parameters and a fixed codebook gain correction factor. Dated this the 26th day of August 2003. Abhishek Sen OfS. Majumdar & Co. Applicant"s Agent |
---|
788-mumnp-2003-abstract(24-05-2007).doc
788-mumnp-2003-abstract(24-05-2007).pdf
788-mumnp-2003-claims(granted)-(24-05-2007).doc
788-mumnp-2003-claims(granted)-(24-05-2007).pdf
788-mumnp-2003-correspondence 1(20-12-2007).pdf
788-mumnp-2003-correspondence 2(13-03-2006).pdf
788-mumnp-2003-correspondence(ipo)-(04-12-2007).pdf
788-mumnp-2003-drawing(24-05-2007).pdf
788-mumnp-2003-form 13(14-06-2006).pdf
788-mumnp-2003-form 18(13-03-2006).pdf
788-mumnp-2003-form 1a(26-08-2003).pdf
788-mumnp-2003-form 2(granted)-(24-05-2007).doc
788-mumnp-2003-form 2(granted)-(24-05-2007).pdf
788-mumnp-2003-form 3(16-09-2007).pdf
788-mumnp-2003-form 3(26-08-2003).pdf
788-mumnp-2003-form 5(26-08-2003).pdf
788-mumnp-2003-general power of authority(10-06-2006).pdf
788-mumnp-2003-petition undre rule 137(20-12-2007).pdf
788-mumnp-2003-petition undre rule 138(20-12-2007).pdf
788-mumnp-2003-power of authority(26-08-2003).pdf
Patent Number | 214179 | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Indian Patent Application Number | 788/MUMNP/2003 | ||||||||||||
PG Journal Number | 13/2008 | ||||||||||||
Publication Date | 28-Mar-2008 | ||||||||||||
Grant Date | 05-Feb-2008 | ||||||||||||
Date of Filing | 26-Aug-2003 | ||||||||||||
Name of Patentee | TELEFONAKTIEBOLAGET LM ERICSSON (PUBL) | ||||||||||||
Applicant Address | S-126 25 STOCKHOLM, SWEDEN | ||||||||||||
Inventors:
|
|||||||||||||
PCT International Classification Number | G10L 21/02 | ||||||||||||
PCT International Application Number | PCT/SE02/00534 | ||||||||||||
PCT International Filing date | 2002-03-20 | ||||||||||||
PCT Conventions:
|