Picture for Wei-Cheng Tseng

Wei-Cheng Tseng

Cosmos World Foundation Model Platform for Physical AI

Add code
Jan 07, 2025
Figure 1 for Cosmos World Foundation Model Platform for Physical AI
Figure 2 for Cosmos World Foundation Model Platform for Physical AI
Figure 3 for Cosmos World Foundation Model Platform for Physical AI
Figure 4 for Cosmos World Foundation Model Platform for Physical AI
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Gaussian Splatting Visual MPC for Granular Media Manipulation

Add code
Oct 13, 2024
Figure 1 for Gaussian Splatting Visual MPC for Granular Media Manipulation
Figure 2 for Gaussian Splatting Visual MPC for Granular Media Manipulation
Figure 3 for Gaussian Splatting Visual MPC for Granular Media Manipulation
Figure 4 for Gaussian Splatting Visual MPC for Granular Media Manipulation
Viaarxiv icon

SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks

Add code
Aug 23, 2024
Figure 1 for SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Figure 2 for SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Figure 3 for SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Figure 4 for SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Viaarxiv icon

A Large-Scale Evaluation of Speech Foundation Models

Add code
Apr 15, 2024
Viaarxiv icon

VMCML: Video and Music Matching via Cross-Modality Lifting

Add code
Mar 22, 2023
Viaarxiv icon

SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

Add code
Mar 01, 2023
Viaarxiv icon

Ensemble knowledge distillation of self-supervised speech models

Add code
Feb 24, 2023
Viaarxiv icon

DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores

Add code
Apr 07, 2022
Figure 1 for DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Figure 2 for DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Figure 3 for DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Figure 4 for DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Viaarxiv icon

An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks

Add code
Mar 31, 2022
Figure 1 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 2 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 3 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 4 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Viaarxiv icon