Linghe Kong

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing

Sep 26, 2025
Via arXiv

Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning

Jun 11, 2025
Via arXiv

Segment Concealed Objects with Incomplete Supervision

Jun 10, 2025
Via arXiv

B2LoRa: Boosting LoRa Transmission for Satellite-IoT Systems with Blind Coherent Combining

May 30, 2025
Via arXiv

ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration

May 30, 2025
Via arXiv

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

May 28, 2025
Via arXiv

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

May 28, 2025
Via arXiv

DVD-Quant: Data-free Video Diffusion Transformers Quantization

May 24, 2025
Via arXiv

Low-bit Model Quantization for Deep Neural Networks: A Survey

May 08, 2025
Via arXiv

QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation

Mar 09, 2025
Via arXiv