jmvalin

This is a follow-up on the first LPCNet demo. In this new demo, we turn LPCNet into a very low-bitrate neural speech codec (see submitted paper) that's actually usable on current hardware and even on phones. It's the first time a neural vocoder is able to run in real-time using just one CPU core on a phone (as opposed to a high-end GPU). The resulting bitrate — just 1.6 kb/s — is about 10 times less than what wideband codecs typically use. The quality is much better than existing very low bitrate vocoders and comparable to that of more traditional codecs using a higher bitrate.


Re: Can't get the same result

Date: 2020-02-25 08:02 am (UTC)
From: (Anonymous)
Thanks for your reply. It's not just a difference in the values: the two results sound quite different. I measured PESQ-WB, and the file from the website scores 1.777 while my own decoded output scores 1.319. My own decoded output has a lot of audible glitches.

PESQ.exe +16000 +wb ref.pcm ref_dec.pcm
PESQ.exe +16000 +wb ref.pcm lpcnq.pcm

I don't know whether my build is the cause, because compiling produces a lot of warnings, such as:

In file included from src/lpcnet_dec.c:38:0:
src/pitch.h:37:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#include "modes.h"
^
src/pitch.h:37:1: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_dec.c: In function 'decode_packet':
src/lpcnet_dec.c:96:3: warning: ISO C90 forbids mixed declarations and code [-Wpedantic]
unpacker bits;
^
src/lpcnet_dec.c:108:3: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//fprintf(stdout, "%d %d %d %d %d %d %d %d %d\n", c0_id, main_pitch, modulation, corr_id, vq_end[0], vq_end[1], vq_end[2], vq_mid, interp_id);
^
src/lpcnet_dec.c:108:3: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_dec.c:135:3: warning: ISO C90 forbids mixed declarations and code [-Wpedantic]
float sign = 1;
^
CC src/lpcnet_enc.lo
In file included from src/lpcnet_enc.c:38:0:
src/pitch.h:37:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#include "modes.h"
^
src/pitch.h:37:1: warning: (this will be reported only once per input file) [enabled by default]
src/lpcnet_enc.c:46:1: warning: C++ style comments are not allowed in ISO C90 [enabled by default]
//#define NB_FEATURES (2*NB_BANDS+3+LPC_ORDER)

PS:
The "MELP 2.4kbps" sample downloaded from the website has a voice bandwidth of 4kHz, but the MELP 2.4kbps program I downloaded from the internet produces output with a voice bandwidth of 8kHz.

Re: Can't get the same result

Date: 2020-02-26 01:47 am (UTC)
From: (Anonymous)
The computer runs Windows 10, and the virtual machine installed on it runs Ubuntu 14.

Re: Can't get the same result

Date: 2020-02-26 08:10 am (UTC)
From: (Anonymous)
The input to the system is single-channel speech with a sampling rate of 16kHz. Can single-channel audio with a sampling rate of 8kHz also be used for testing?

Re: Can't get the same result

Date: 2020-02-26 08:47 am (UTC)
From: (Anonymous)
I found the cause: my input was 2-channel audio. Thank you for your patience.
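For anyone who runs into the same problem, here is a minimal sketch of downmixing a 2-channel file to single-channel before encoding. It assumes the input is headerless, interleaved, 16-bit little-endian PCM (left, right, left, right, ...), like the .pcm files in the PESQ commands above. The function names stereo_to_mono and convert_file and the file names are hypothetical and are not part of LPCNet.

```python
import array
import sys


def stereo_to_mono(pcm_bytes: bytes) -> bytes:
    """Average the left and right samples of interleaved 16-bit stereo PCM.

    Assumption: the data is raw little-endian 16-bit PCM with no header.
    """
    samples = array.array("h")
    samples.frombytes(pcm_bytes)  # raises ValueError on an odd byte count
    if sys.byteorder == "big":
        samples.byteswap()  # convert little-endian file data to native order
    if len(samples) % 2 != 0:
        raise ValueError("stereo input must hold an even number of samples")

    # One output sample per left/right pair; the average of two 16-bit
    # values always fits in 16 bits, so no clipping is needed.
    mono = array.array(
        "h",
        ((samples[i] + samples[i + 1]) // 2 for i in range(0, len(samples), 2)),
    )
    if sys.byteorder == "big":
        mono.byteswap()  # back to little-endian for writing
    return mono.tobytes()


def convert_file(src_path: str, dst_path: str) -> None:
    """Read a raw stereo PCM file and write the mono downmix."""
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(stereo_to_mono(data))
```

Calling convert_file("stereo.pcm", "mono.pcm") would then give a single-channel file. The sampling rate is unchanged, so the result is only suitable if the stereo file was already sampled at 16kHz.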
