Picture for Weihua Luo

Weihua Luo

AI Business, Alibaba Group

Evaluating Image Caption via Cycle-consistent Text-to-Image Generation

Add code
Jan 08, 2025
Figure 1 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 2 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 3 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Figure 4 for Evaluating Image Caption via Cycle-consistent Text-to-Image Generation
Viaarxiv icon

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Add code
Jan 06, 2025
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Viaarxiv icon

PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm

Add code
Dec 05, 2024
Figure 1 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 2 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 3 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 4 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Viaarxiv icon

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Add code
Dec 05, 2024
Viaarxiv icon

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Add code
Nov 21, 2024
Viaarxiv icon

AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results

Add code
Oct 05, 2024
Figure 1 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 2 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 3 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Figure 4 for AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results
Viaarxiv icon

TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings

Add code
Sep 15, 2024
Figure 1 for TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
Figure 2 for TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
Figure 3 for TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
Figure 4 for TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
Viaarxiv icon

MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing

Add code
Aug 21, 2024
Figure 1 for MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Figure 2 for MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Figure 3 for MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Figure 4 for MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing
Viaarxiv icon

Building Decision Making Models Through Language Model Regime

Add code
Aug 12, 2024
Viaarxiv icon