Picture for Wenjie Wang

Wenjie Wang

SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation

Add code
Dec 08, 2024
Viaarxiv icon

STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training

Add code
Nov 29, 2024
Viaarxiv icon

Self-Calibrated Listwise Reranking with Large Language Models

Add code
Nov 07, 2024
Figure 1 for Self-Calibrated Listwise Reranking with Large Language Models
Figure 2 for Self-Calibrated Listwise Reranking with Large Language Models
Figure 3 for Self-Calibrated Listwise Reranking with Large Language Models
Figure 4 for Self-Calibrated Listwise Reranking with Large Language Models
Viaarxiv icon

Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation

Add code
Oct 30, 2024
Figure 1 for Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation
Figure 2 for Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation
Figure 3 for Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation
Figure 4 for Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation
Viaarxiv icon

Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Add code
Oct 30, 2024
Figure 1 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning
Figure 2 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning
Figure 3 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning
Figure 4 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning
Viaarxiv icon

MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Add code
Oct 25, 2024
Figure 1 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 2 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 3 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 4 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Viaarxiv icon

Large Language Models Empowered Personalized Web Agents

Add code
Oct 22, 2024
Figure 1 for Large Language Models Empowered Personalized Web Agents
Figure 2 for Large Language Models Empowered Personalized Web Agents
Figure 3 for Large Language Models Empowered Personalized Web Agents
Figure 4 for Large Language Models Empowered Personalized Web Agents
Viaarxiv icon

Personalized Image Generation with Large Multimodal Models

Add code
Oct 18, 2024
Viaarxiv icon

FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection

Add code
Oct 15, 2024
Figure 1 for FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection
Figure 2 for FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection
Figure 3 for FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection
Figure 4 for FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection
Viaarxiv icon

Efficient Inference for Large Language Model-based Generative Recommendation

Add code
Oct 07, 2024
Figure 1 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 2 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 3 for Efficient Inference for Large Language Model-based Generative Recommendation
Figure 4 for Efficient Inference for Large Language Model-based Generative Recommendation
Viaarxiv icon