Alexander Hauptmann

Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios

Oct 22, 2024

Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony

Aug 18, 2024

SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV Battery Supply Chain Disruptions

Aug 09, 2024

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Jul 18, 2024

Multimodal Reranking for Knowledge-Intensive Visual Question Answering

Jul 17, 2024

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning

Jun 17, 2024

Learning Visual-Semantic Subspace Representations for Propositional Reasoning

May 25, 2024

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Apr 02, 2024

Hyperbolic vs Euclidean Embeddings in Few-Shot Learning: Two Sides of the Same Coin

Sep 18, 2023

STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition

Mar 31, 2023