SAM Audio Judge: A Unified Multimodal Framework for Perceptual Evaluation of Audio Separation

Jan 27, 2026
Via arXiv

UniRec: Unified Multimodal Encoding for LLM-Based Recommendations

Jan 27, 2026
Via arXiv

Tactile Memory with Soft Robot: Robust Object Insertion via Masked Encoding and Soft Wrist

Jan 27, 2026
Via arXiv

A Hybrid Supervised-LLM Pipeline for Actionable Suggestion Mining in Unstructured Customer Reviews

Jan 27, 2026
Via arXiv

Propagating Similarity, Mitigating Uncertainty: Similarity Propagation-enhanced Uncertainty for Multimodal Recommendation

Jan 27, 2026
Via arXiv

FBSDiff++: Improved Frequency Band Substitution of Diffusion Features for Efficient and Highly Controllable Text-Driven Image-to-Image Translation

Jan 27, 2026
Via arXiv

MaDiS: Taming Masked Diffusion Language Models for Sign Language Generation

Jan 27, 2026
Via arXiv

Cortex-Grounded Diffusion Models for Brain Image Generation

Jan 27, 2026
Via arXiv

Pixel-Grounded Retrieval for Knowledgeable Large Multimodal Models

Jan 27, 2026
Via arXiv

TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment

Jan 27, 2026
Via arXiv