Picture for Bin Wang

Bin Wang

and Other Contributors

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Add code
Oct 30, 2025
Viaarxiv icon

Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

Add code
Oct 27, 2025
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

ROSE: Remove Objects with Side Effects in Videos

Add code
Aug 26, 2025
Viaarxiv icon

Attention2Probability: Attention-Driven Terminology Probability Estimation for Robust Speech-to-Text System

Add code
Aug 26, 2025
Viaarxiv icon

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Add code
Aug 25, 2025
Viaarxiv icon

A robust and compliant robotic assembly control strategy for batch precision assembly task with uncertain fit types and fit amounts

Add code
Aug 17, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

Derivative-Free Optimization-Empowered Wireless Channel Reconfiguration for 6G

Add code
Jul 03, 2025
Viaarxiv icon