jmvalin
This is a follow-up on the first LPCNet demo. In this new demo, we turn LPCNet into a very low-bitrate neural speech codec (see submitted paper) that's actually usable on current hardware and even on phones. It's the first time a neural vocoder is able to run in real-time using just one CPU core on a phone (as opposed to a high-end GPU). The resulting bitrate — just 1.6 kb/s — is about 10 times less than what wideband codecs typically use. The quality is much better than existing very low bitrate vocoders and comparable to that of more traditional codecs using a higher bitrate.
Re: Can't get the same result
Date: 2020-02-25 08:02 am (UTC)PESQ.exe +16000 +wb ref.pcm ref_dec.pcm
PESQ.exe +16000 +wb ref.pcm lpcnq.pcm
I don't know if it is because of my compile, because a lot of warnings are generated, such as:
In file included from src/lpcnet_dec.c:38:0:
src/pitch.h:37:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#include "modes.h"
^
src/pitch.h:37:1: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_dec.c: In function 'decode_packet':
src/lpcnet_dec.c:96:3: warning: ISO C90 forbids mixed declarations and code [-Wpedantic]
unpacker bits;
^
src/lpcnet_dec.c:108:3: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//fprintf(stdout, "%d %d %d %d %d %d %d %d %d\n", c0_id, main_pitch, modulation, corr_id, vq_end[0], vq_end[1], vq_end[2], vq_mid, interp_id);
^
src/lpcnet_dec.c:108:3: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_dec.c:135:3: warning: ISO C90 forbids mixed declarations and code [-Wpedantic]
float sign = 1;
^
CC src/lpcnet_enc.lo
In file included from src/lpcnet_enc.c:38:0:
src/pitch.h:37:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#include "modes.h"
^
src/pitch.h:37:1: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_enc.c:46:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#define NB_FEATURES (2*NB_BANDS+3+LPC_ORDER)
PS:
The voice bandwidth of the "MELP 2.4kbps" downloaded from the website is 4kHz, but the MELP 2.4kbps program I downloaded from the network generates a voice bandwidth of 8kHz.
Re: Can't get the same result
Date: 2020-02-25 06:50 pm (UTC)Re: Can't get the same result
Date: 2020-02-26 01:47 am (UTC)Re: Can't get the same result
Date: 2020-02-26 08:10 am (UTC)Re: Can't get the same result
Date: 2020-02-26 08:47 am (UTC)