Picture for Yong Li

Yong Li

Tsinghua University

Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Add code
Mar 14, 2025
Viaarxiv icon

Decoupled Doubly Contrastive Learning for Cross Domain Facial Action Unit Detection

Add code
Mar 12, 2025
Viaarxiv icon

Beyond Overfitting: Doubly Adaptive Dropout for Generalizable AU Detection

Add code
Mar 12, 2025
Viaarxiv icon

Causal Discovery and Inference towards Urban Elements and Associated Factors

Add code
Mar 09, 2025
Viaarxiv icon

EPR-GAIL: An EPR-Enhanced Hierarchical Imitation Learning Framework to Simulate Complex User Consumption Behaviors

Add code
Mar 09, 2025
Viaarxiv icon

Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities

Add code
Mar 09, 2025
Viaarxiv icon

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Add code
Mar 08, 2025
Viaarxiv icon

A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval

Add code
Mar 07, 2025
Figure 1 for A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval
Figure 2 for A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval
Figure 3 for A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval
Figure 4 for A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval
Viaarxiv icon

AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms

Add code
Feb 26, 2025
Viaarxiv icon

Planning with Linear Temporal Logic Specifications: Handling Quantifiable and Unquantifiable Uncertainty

Add code
Feb 26, 2025
Viaarxiv icon