Yao Hu

Alibaba Group

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

Feb 06, 2025

scBIT: Integrating Single-cell Transcriptomic Data into fMRI-based Prediction for Alzheimer's Disease Diagnosis

Feb 04, 2025

FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration

Jan 24, 2025

DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors

Jan 15, 2025

Single Trajectory Distillation for Accelerating Image and Video Style Transfer

Dec 25, 2024

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Dec 12, 2024

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Dec 02, 2024

ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval

Nov 24, 2024

GPRec: Bi-level User Modeling for Deep Recommenders

Oct 28, 2024

MoDification: Mixture of Depths Made Easy

Oct 18, 2024