Chengjian Feng

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Dec 23, 2025
Via arXiv

X-SAM: From Segment Anything to Any Segmentation

Aug 06, 2025
Via arXiv

RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case

Aug 06, 2025
Via arXiv

DisTime: Distribution-based Time Representation for Video Large Language Models

May 30, 2025
Via arXiv

AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline

Apr 01, 2025
Via arXiv

DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data

Mar 25, 2025
Via arXiv

RoboMM: All-in-One Multimodal Large Model for Robotic Manipulation

Dec 10, 2024
Via arXiv

DriveMM: All-in-One Large Multimodal Model for Autonomous Driving

Dec 10, 2024
Via arXiv

RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Dec 04, 2024
Via arXiv

AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models

Aug 01, 2024
Via arXiv