Picture for Jian Xu

Jian Xu

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Add code
Dec 24, 2024
Viaarxiv icon

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

Add code
Dec 14, 2024
Viaarxiv icon

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning

Add code
Nov 20, 2024
Viaarxiv icon

PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting

Add code
Nov 03, 2024
Viaarxiv icon

UFLUX v2.0: A Process-Informed Machine Learning Framework for Efficient and Explainable Modelling of Terrestrial Carbon Uptake

Add code
Oct 04, 2024
Viaarxiv icon

DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion

Add code
Sep 18, 2024
Viaarxiv icon

pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning

Add code
Sep 09, 2024
Figure 1 for pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
Figure 2 for pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
Figure 3 for pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
Figure 4 for pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning
Viaarxiv icon

Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information

Add code
Sep 02, 2024
Viaarxiv icon

Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction

Add code
Sep 02, 2024
Figure 1 for Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction
Figure 2 for Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction
Figure 3 for Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction
Figure 4 for Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction
Viaarxiv icon

Multi-view Hand Reconstruction with a Point-Embedded Transformer

Add code
Aug 20, 2024
Viaarxiv icon