Picture for Yifan Yang

Yifan Yang

MageBench: Bridging Large Multimodal Models to Agents

Add code
Dec 05, 2024
Viaarxiv icon

A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario

Add code
Dec 01, 2024
Figure 1 for A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
Figure 2 for A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
Figure 3 for A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
Figure 4 for A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
Viaarxiv icon

BIGCity: A Universal Spatiotemporal Model for Unified Trajectory and Traffic State Data Analysis

Add code
Dec 01, 2024
Viaarxiv icon

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Add code
Nov 26, 2024
Viaarxiv icon

Navigating Spatial Inequities in Freight Truck Crash Severity via Counterfactual Inference in Los Angeles

Add code
Nov 26, 2024
Viaarxiv icon

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation

Add code
Nov 26, 2024
Figure 1 for LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
Figure 2 for LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
Figure 3 for LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
Figure 4 for LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation
Viaarxiv icon

REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents

Add code
Nov 20, 2024
Viaarxiv icon

Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine

Add code
Nov 20, 2024
Figure 1 for Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine
Figure 2 for Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine
Figure 3 for Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine
Figure 4 for Ensuring Safety and Trust: Analyzing the Risks of Large Language Models in Medicine
Viaarxiv icon

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Add code
Nov 07, 2024
Figure 1 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 2 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 3 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 4 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Viaarxiv icon

VecCity: A Taxonomy-guided Library for Map Entity Representation Learning

Add code
Oct 31, 2024
Figure 1 for VecCity: A Taxonomy-guided Library for Map Entity Representation Learning
Figure 2 for VecCity: A Taxonomy-guided Library for Map Entity Representation Learning
Figure 3 for VecCity: A Taxonomy-guided Library for Map Entity Representation Learning
Figure 4 for VecCity: A Taxonomy-guided Library for Map Entity Representation Learning
Viaarxiv icon