Kanchana Ranasinghe

IVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based Guidance

Jan 22, 2026

Future Optical Flow Prediction Improves Robot Control & Video Generation

Jan 15, 2026

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

Dec 19, 2025

Pixel Motion Diffusion is What We Need for Robot Control

Sep 26, 2025

Pixel Motion as Universal Representation for Robot Control

May 12, 2025

Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation

Jan 08, 2025

LatentCRF: Continuous CRF for Efficient Latent Diffusion

Dec 24, 2024

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Jun 28, 2024

Too Many Frames, not all Useful: Efficient Strategies for Long-Form Video QA

Jun 17, 2024

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

Apr 11, 2024