Picture for Zhisheng Wang

Zhisheng Wang

Center of Ultra-precision Optoelectronic Instrument engineering, Harbin Institute of Technology, Key Lab of Ultra-precision Intelligent Instrumentation, Harbin Institute of Technology

Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Add code
Oct 21, 2024
Figure 1 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding
Figure 2 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding
Figure 3 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding
Figure 4 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding
Viaarxiv icon

Geometric Artifact Correction for Symmetric Multi-Linear Trajectory CT: Theory, Method, and Generalization

Add code
Aug 27, 2024
Viaarxiv icon

Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models

Add code
Jul 18, 2024
Viaarxiv icon

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Add code
Mar 26, 2024
Viaarxiv icon

3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands

Add code
Jan 02, 2024
Viaarxiv icon

Monocular 3D Hand Mesh Recovery via Dual Noise Estimation

Add code
Dec 26, 2023
Viaarxiv icon

OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios

Add code
Sep 25, 2023
Figure 1 for OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Figure 2 for OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Figure 3 for OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Figure 4 for OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios
Viaarxiv icon

ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment

Add code
Sep 11, 2023
Viaarxiv icon

LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model

Add code
Aug 23, 2023
Viaarxiv icon

Analytical reconstructions of multiple source-translation computed tomography with extended field of views: a research study

Add code
May 31, 2023
Viaarxiv icon