Abstract: In recent years, 3D parametric animal models have been developed to aid in estimating 3D shape and pose from images and video. While considerable progress has been made for humans, the task remains more challenging for animals due to limited annotated data. To address this, we introduce the first method that uses synthetic data generation and disentanglement to learn to regress 3D shape and pose. Focusing on horses, we use text-based texture generation and a synthetic data pipeline to create varied shapes, poses, and appearances, learning disentangled spaces. Our method, Dessie, surpasses existing 3D horse reconstruction methods and generalizes to other large animals such as zebras, cows, and deer. See the project website at: \url{https://celiali.github.io/Dessie/}.
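A minimal sketch of the kind of disentangled regressor the abstract describes (not Dessie's actual architecture): a shared image encoder feeds separate shape, pose, and appearance branches, and the synthetic data provides direct supervision for the shape and pose heads. All layer sizes and parameter dimensions below are illustrative assumptions.

```python
import torch
import torch.nn as nn

class DisentangledRegressor(nn.Module):
    """Toy regressor with separate shape / pose / appearance latents."""
    def __init__(self, n_shape=39, n_pose=36 * 3, latent=256):
        super().__init__()
        # Shared image encoder (a toy CNN here; a pretrained backbone in practice).
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # Separate heads map the shared feature into disentangled latents.
        self.to_shape = nn.Linear(64, latent)
        self.to_pose = nn.Linear(64, latent)
        self.to_appearance = nn.Linear(64, latent)
        # Regressors from each latent to parametric-model coefficients.
        self.shape_head = nn.Linear(latent, n_shape)  # shape coefficients
        self.pose_head = nn.Linear(latent, n_pose)    # per-joint rotations

    def forward(self, img):
        f = self.encoder(img)
        z_shape, z_pose, z_app = self.to_shape(f), self.to_pose(f), self.to_appearance(f)
        return self.shape_head(z_shape), self.pose_head(z_pose), z_app

# With synthetic images the ground-truth shape and pose are known, so the
# heads can be trained with plain regression losses.
model = DisentangledRegressor()
beta, theta, z_app = model(torch.randn(2, 3, 256, 256))
print(beta.shape, theta.shape, z_app.shape)
```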
Abstract: In the monocular setting, predicting the 3D pose and shape of animals typically relies solely on visual information, which is highly under-constrained. In this work, we explore using audio to enhance 3D shape and motion recovery of horses from monocular video. We test our approach on two datasets: an indoor treadmill dataset for 3D evaluation and an outdoor dataset capturing diverse horse movements, the latter being a contribution of this study. Our results show that incorporating sound with visual data leads to more accurate and robust motion regression. This study is the first to investigate the role of audio in 3D animal motion recovery.
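One way the audio-visual fusion described above could be realized is sketched below (this is an illustrative stand-in, not the paper's model): per-frame visual and audio features are concatenated, passed through a temporal model, and regressed to per-frame pose parameters. Feature dimensions are assumptions.

```python
import torch
import torch.nn as nn

class AudioVisualMotionRegressor(nn.Module):
    """Fuses per-frame visual and audio features before temporal pose regression."""
    def __init__(self, d_visual=512, d_audio=128, d_hidden=256, n_pose=36 * 3):
        super().__init__()
        self.fuse = nn.Linear(d_visual + d_audio, d_hidden)
        self.temporal = nn.GRU(d_hidden, d_hidden, batch_first=True)
        self.pose_head = nn.Linear(d_hidden, n_pose)  # per-frame pose parameters

    def forward(self, visual_feats, audio_feats):
        # visual_feats: (B, T, d_visual), audio_feats: (B, T, d_audio)
        fused = torch.relu(self.fuse(torch.cat([visual_feats, audio_feats], dim=-1)))
        h, _ = self.temporal(fused)
        return self.pose_head(h)  # (B, T, n_pose)

model = AudioVisualMotionRegressor()
pose = model(torch.randn(1, 30, 512), torch.randn(1, 30, 128))
print(pose.shape)  # torch.Size([1, 30, 108])
```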
Abstract: Autonomous driving has advanced remarkably in recent years, evolving into a tangible reality. However, human-centric large-scale adoption hinges on meeting a variety of multifaceted requirements. To ensure that the autonomous system meets the user's intent, it is essential to accurately discern and interpret user commands, especially in complex or emergency situations. To this end, we propose leveraging the reasoning capabilities of Large Language Models (LLMs) to infer system requirements from in-cabin users' commands. Through a series of experiments covering different LLM models and prompt designs, we explore the few-shot multivariate binary classification accuracy of system requirements from natural language textual commands. We confirm the general ability of LLMs to understand and reason about prompts, but underline that their effectiveness is conditioned on the quality of both the LLM model and the design of appropriate sequential prompts. Code and models are publicly available at \url{https://github.com/KTH-RPL/DriveCmd_LLM}.
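The sketch below shows one possible shape of such a few-shot prompt for multivariate binary classification of system requirements from a user command. The requirement list, example command, and the query_llm callable are placeholders for illustration, not the released DriveCmd_LLM code or prompts.

```python
# Hypothetical requirement set: each command is labeled yes/no for every entry.
REQUIREMENTS = ["stop the vehicle", "change route", "adjust speed", "contact assistance"]

# Few-shot prefix: one labeled example command (illustrative).
FEW_SHOT = """You label in-cabin commands. For each requirement answer yes or no.
Command: "Pull over right now, I feel sick."
stop the vehicle: yes
change route: no
adjust speed: no
contact assistance: no
"""

def classify(command, query_llm):
    """query_llm: callable str -> str backed by any LLM. Returns a dict of booleans."""
    prompt = FEW_SHOT + f'Command: "{command}"\n' + "".join(f"{r}:\n" for r in REQUIREMENTS)
    reply = query_llm(prompt)
    labels = {}
    for line in reply.splitlines():
        for r in REQUIREMENTS:
            if line.lower().startswith(r):
                labels[r] = "yes" in line.lower().split(":", 1)[-1]
    return labels

# Example with a trivial stand-in "LLM" that returns a fixed reply.
fake_reply = "stop the vehicle: no\nchange route: yes\nadjust speed: no\ncontact assistance: no"
print(classify("Take the highway instead, we are late.", lambda p: fake_reply))
```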
Abstract: In this paper we present our preliminary work on model-based behavioral analysis of horse motion. Our approach is based on the SMAL model, a 3D articulated statistical model of animal shape. We define a novel SMAL model for horses, hSMAL, based on a new template, skeleton, and shape space learned from $37$ horse toys. We test the accuracy of the hSMAL model in reconstructing a horse from 3D mocap data and images. We apply the hSMAL model to the problem of lameness detection from video, where we fit the model to images to recover 3D pose and train an ST-GCN network on the pose data. A comparison with the same network trained on mocap points illustrates the benefit of our approach.
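A simplified stand-in for the classification step described above (not the actual ST-GCN implementation): pose sequences recovered by fitting the hSMAL model are arranged as (batch, channels, time, joints) tensors and fed to a small spatio-temporal network that predicts lame vs. sound. Joint count and layer sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class PoseSequenceClassifier(nn.Module):
    """Toy spatio-temporal classifier over fitted pose sequences."""
    def __init__(self, n_joints=36, in_channels=3, n_classes=2):
        super().__init__()
        # 2D convolutions over the (time, joint) grid stand in for the
        # graph-based aggregation an ST-GCN performs over the skeleton.
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=(9, 1), padding=(4, 0)), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=(9, 1), padding=(4, 0)), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, n_classes),
        )

    def forward(self, x):  # x: (B, C, T, V) = (batch, xyz, frames, joints)
        return self.net(x)

model = PoseSequenceClassifier()
logits = model(torch.randn(4, 3, 120, 36))  # 4 clips, 120 frames, 36 joints
print(logits.shape)                         # torch.Size([4, 2])
```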