Hongming Zhang

Shammie

$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior

Jan 01, 2025

Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models

Dec 21, 2024

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Oct 25, 2024

ParallelSpec: Parallel Drafter for Efficient Speculative Decoding

Oct 08, 2024

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Oct 07, 2024

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Oct 03, 2024

LEOPARD: A Vision Language Model For Text-Rich Multi-Image Tasks

Oct 02, 2024

Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots

Sep 16, 2024

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Sep 12, 2024

GeoHard: Towards Measuring Class-wise Hardness through Modelling Class Semantics

Jul 17, 2024