Jun Wang

IBM T. J. Watson Research Center

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Nov 12, 2024

Direct Preference Optimization Using Sparse Feature-Level Constraints

Nov 12, 2024

Proprioceptive and Exteroceptive Information Perception in a Fabric Soft Robotic Arm via Physical Reservoir Computing with minimal training data

Nov 11, 2024

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Nov 05, 2024

ALISE: Accelerating Large Language Model Serving with Speculative Scheduling

Oct 31, 2024

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Oct 28, 2024

Lightweight Neural App Control

Oct 23, 2024

Elucidating the design space of language models for image generation

Oct 21, 2024

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

Oct 19, 2024

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents

Oct 18, 2024