Tianren Ma

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Jun 17, 2024
Via arXiv

Artemis: Towards Referential Understanding in Complex Videos

Jun 01, 2024
Via arXiv

ChatterBox: Multi-round Multimodal Referring and Grounding

Jan 24, 2024
Via arXiv