Guangda Huzhang

M$^2$: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval

Feb 28, 2026

Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

Feb 14, 2026

Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization

May 24, 2025

Establishing Reliability Metrics for Reward Models in Large Language Models

Apr 21, 2025

Recurrent Temporal Revision Graph Networks

Sep 26, 2023

Clustered Embedding Learning for Recommender Systems

Feb 10, 2023

Exploit Customer Life-time Value with Memoryless Experiments

Jan 17, 2022

A General Traffic Shaping Protocol in E-Commerce

Dec 30, 2021

Re-ranking With Constraints on Diversified Exposures for Homepage Recommender System

Dec 12, 2021

Learning-To-Ensemble by Contextual Rank Aggregation in E-Commerce

Aug 10, 2021