Picture for Kai Shen

Kai Shen

FullStack Bench: Evaluating LLMs as Full Stack Coders

Add code
Dec 03, 2024
Viaarxiv icon

Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference

Add code
Jul 06, 2024
Figure 1 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 2 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 3 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Figure 4 for Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference
Viaarxiv icon

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

Add code
Jun 11, 2024
Viaarxiv icon

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Add code
Apr 06, 2024
Viaarxiv icon

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Add code
Mar 05, 2024
Viaarxiv icon

PromptTTS 2: Describing and Generating Voices with Text Prompt

Add code
Sep 05, 2023
Viaarxiv icon

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers

Add code
May 04, 2023
Viaarxiv icon

Deep Learning Predicts Prevalent and Incident Parkinson's Disease From UK Biobank Fundus Imaging

Add code
Feb 17, 2023
Viaarxiv icon

A Study on ReLU and Softmax in Transformer

Add code
Feb 13, 2023
Viaarxiv icon

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction

Add code
Nov 23, 2022
Viaarxiv icon