A method for encoding a high frequency signal includes determining a signal type of a high frequency signal of a current frame, smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded when the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal, and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
2.
Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
A method for encoding a high frequency signal includes determining a signal type of a high frequency signal of a current frame, smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded when the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal, and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
3.
Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
A method for encoding a high frequency signal includes determining a signal type of a high frequency signal of a current frame, smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded when the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal, and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
4.
Coding/decoding method, apparatus, and system for audio signal
Embodiments of the present application provide a coding/decoding method, apparatus, and system. According to the coding method, de-emphasis processing is performed on a full band signal by using a de-emphasis parameter determined according to a characteristic factor of an input audio signal, and then the full band signal is coded and sent to a decoder, so that the decoder performs corresponding de-emphasis decoding processing on the full band signal according to the characteristic factor of the input audio signal and restores the input audio signal. This resolves a prior-art problem that an audio signal restored by a decoder is apt to have signal distortion, and implements adaptive de-emphasis processing on the full band signal according to the characteristic factor of the audio signal to enhance coding performance, so that the input audio signal restored by the decoder has relatively high fidelity and is closer to an original signal.
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/08 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters
6.
Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
A method for encoding a high frequency signal includes determining a signal type of a high frequency signal of a current frame, smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded when the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal, and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
7.
Recovering high frequency band signal of a lost frame in media bitstream according to gain gradient
A method for recovering lost frame in a media bitstream, where when a frame loss event occurs, a decoder obtains a synthesized high frequency band signal of a current lost frame, and recovery information related to the current lost frame, determines a global gain gradient of the current lost frame, and determines a global gain of the current lost frame according to the global gain gradient and a global gain of each frame in previous M frames of the current lost frame. A high frequency band signal of the current lost frame is obtained by adjusting the synthesized high frequency band signal of the current lost frame according to the global gain and a subframe gain of the current lost frame. Hence, the method enables natural and smooth transitions of the high frequency band signal between the frames, and attenuates noises in the high frequency band signal.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/08 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters
9.
Frequency envelope vector quantization method and apparatus
th portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 19/038 - Vector quantisation, e.g. TwinVQ audio
G10L 19/06 - Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
10.
Coding/decoding method, apparatus, and system for audio signal
Embodiments of the present application provide a coding/decoding method, apparatus, and system. According to the coding method, de-emphasis processing is performed on a full band signal by using a de-emphasis parameter determined according to a characteristic factor of an input audio signal, and then the full band signal is coded and sent to a decoder, so that the decoder performs corresponding de-emphasis decoding processing on the full band signal according to the characteristic factor of the input audio signal and restores the input audio signal. This resolves a prior-art problem that an audio signal restored by a decoder is apt to have signal distortion, and implements adaptive de-emphasis processing on the full band signal according to the characteristic factor of the audio signal to enhance coding performance, so that the input audio signal restored by the decoder has relatively high fidelity and is closer to an original signal.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
A method and a device for encoding a high frequency signal, and a method and a device for decoding a high frequency signal are provided, which relate to encoding and decoding technology. The method for encoding a high frequency signal includes: determining a signal type of a high frequency signal of a current frame; smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded, if the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal; and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
12.
Coding/decoding method, apparatus, and system for audio signal
Embodiments of the present invention provide a coding/decoding method, apparatus, and system. According to the coding method, de-emphasis processing is performed on a full band signal by using a de-emphasis parameter determined according to a characteristic factor of an input audio signal, and then the full band signal is coded and sent to a decoder, so that the decoder performs corresponding de-emphasis decoding processing on the full band signal according to the characteristic factor of the input audio signal and restores the input audio signal. This resolves a prior-art problem that an audio signal restored by a decoder is apt to have signal distortion, and implements adaptive de-emphasis processing on the full band signal according to the characteristic factor of the audio signal to enhance coding performance, so that the input audio signal restored by the decoder has relatively high fidelity and is closer to an original signal.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/12 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/08 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters
14.
Recovering high frequency band signal of a lost frame in media bitstream according to gain gradient
A method for recovering lost frame in a media bitstream is provided. When a frame loss event occurs, a decoder obtains a synthesized high frequency band signal of a current lost frame, and recovery information related to the current lost frame. The decoder determines a global gain gradient of the current lost frame, and further determines a global gain of the current lost frame according to the global gain gradient and a global gain of each frame in previous M frames of the current lost frame. A high frequency band signal of the current lost frame is obtained by adjusting the synthesized high frequency band signal of the current lost frame according to the global gain and a subframe gain of the current lost frame. The process enables natural and smooth transitions of the high frequency band signal between the frames, and attenuates noises in the high frequency band signal.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
th portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 19/038 - Vector quantisation, e.g. TwinVQ audio
G10L 19/06 - Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
16.
Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
A method and a device for encoding a high frequency signal, and a method and a device for decoding a high frequency signal are provided, which relate to encoding and decoding technology. The method for encoding a high frequency signal includes: determining a signal type of a high frequency signal of a current frame; smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded, if the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal; and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
G10L 19/032 - Quantisation or dequantisation of spectral components
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
G10L 19/025 - Detection of transients or attacks for time/frequency resolution switching
17.
Method for predicting bandwidth extension frequency band signal, and decoding device
A method for predicting a bandwidth extension frequency band signal includes demultiplexing a received bitstream to obtain a frequency domain signal; determining whether a highest frequency bin, to which a bit is allocated, of the frequency domain signal is less than a preset start frequency bin of a bandwidth extension frequency band; predicting an excitation signal of the bandwidth extension frequency band according to the determination; and predicting the bandwidth extension frequency band signal according to the predicted excitation signal of the bandwidth extension frequency band and a frequency envelope of the bandwidth extension frequency band.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
G10L 19/08 - Determination or coding of the excitation functionDetermination or coding of the long-term prediction parameters
18.
Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
A method and a device for encoding a high frequency signal, and a method and a device for decoding a high frequency signal are provided, which relate to encoding and decoding technology. The method for encoding a high frequency signal includes: determining a signal class of a high frequency signal of a current frame; smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded, if the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal; and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal class information of the high frequency signal of the current frame.
A method and a device for encoding a high frequency signal, and a method and a device for decoding a high frequency signal are provided, which relate to encoding and decoding technology. The method for encoding a high frequency signal includes: determining a signal type of a high frequency signal of a current frame; smoothing and scaling time envelopes of the high frequency signal of the current frame and obtaining time envelopes of the high frequency signal of the current frame that require to be encoded, if the high frequency signal of the current frame is a non-transient signal and a high frequency signal of the previous frame is a transient signal; and quantizing and encoding the time envelopes of the high frequency signal of the current frame that require to be encoded, and frequency information and signal type information of the high frequency signal of the current frame.
A transient signal encoding method and device, decoding method and device, and processing system, where the transient signal encoding method includes: obtaining a reference sub-frame where a maximal time envelope having a maximal amplitude value is located from time envelopes of all sub-frames of an input transient signal; adjusting an amplitude value of the time envelope of each sub-frame before the reference sub-frame in such a way that a first difference is greater than a preset first threshold, in which the first difference is a difference between the amplitude value of the time envelope of each sub-frame before the reference sub-frame and the amplitude value of the maximal time envelope; and writing the adjusted time envelope into bitstream.