Picture for Bin Chen

Bin Chen

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Add code
Mar 02, 2026
Viaarxiv icon

DeAR: Fine-Grained VLM Adaptation by Decomposing Attention Head Roles

Add code
Mar 01, 2026
Viaarxiv icon

Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

Add code
Feb 28, 2026
Viaarxiv icon

SIGMA: A Semantic-Grounded Instruction-Driven Generative Multi-Task Recommender at AliExpress

Add code
Feb 26, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

Detecting Brick Kiln Infrastructure at Scale: Graph, Foundation, and Remote Sensing Models for Satellite Imagery Data

Add code
Feb 12, 2026
Viaarxiv icon

Seeing Through the Chain: Mitigate Hallucination in Multimodal Reasoning Models via CoT Compression and Contrastive Preference Optimization

Add code
Feb 03, 2026
Viaarxiv icon

Towards Distillation-Resistant Large Language Models: An Information-Theoretic Perspective

Add code
Feb 03, 2026
Viaarxiv icon

Tail-Aware Post-Training Quantization for 3D Geometry Models

Add code
Feb 02, 2026
Viaarxiv icon

How Implicit Bias Accumulates and Propagates in LLM Long-term Memory

Add code
Feb 02, 2026
Viaarxiv icon