Picture for Cheston Tan

Cheston Tan

FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation

Add code
Feb 02, 2025
Viaarxiv icon

FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF

Add code
Dec 20, 2024
Figure 1 for FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF
Figure 2 for FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF
Figure 3 for FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF
Figure 4 for FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF
Viaarxiv icon

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios

Add code
Nov 20, 2024
Viaarxiv icon

Evaluating the Generation of Spatial Relations in Text and Image Generative Models

Add code
Nov 12, 2024
Figure 1 for Evaluating the Generation of Spatial Relations in Text and Image Generative Models
Figure 2 for Evaluating the Generation of Spatial Relations in Text and Image Generative Models
Figure 3 for Evaluating the Generation of Spatial Relations in Text and Image Generative Models
Figure 4 for Evaluating the Generation of Spatial Relations in Text and Image Generative Models
Viaarxiv icon

STLM Engineering Report: Dropout

Add code
Sep 09, 2024
Viaarxiv icon

LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments

Add code
Aug 28, 2024
Figure 1 for LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
Figure 2 for LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
Figure 3 for LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
Figure 4 for LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
Viaarxiv icon

Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis

Add code
Aug 27, 2024
Viaarxiv icon

Social Learning through Interactions with Other Agents: A Survey

Add code
Jul 31, 2024
Viaarxiv icon

RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing

Add code
Jul 01, 2024
Viaarxiv icon

Super Tiny Language Models

Add code
May 23, 2024
Viaarxiv icon