Alexander Hauptmann

Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios

Oct 22, 2024

Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony

Aug 18, 2024

SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV Battery Supply Chain Disruptions

Aug 09, 2024

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Jul 18, 2024

Multimodal Reranking for Knowledge-Intensive Visual Question Answering

Jul 17, 2024

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning

Jun 17, 2024

Learning Visual-Semantic Subspace Representations for Propositional Reasoning

May 25, 2024

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Apr 02, 2024

Hyperbolic vs Euclidean Embeddings in Few-Shot Learning: Two Sides of the Same Coin

Sep 18, 2023

STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition

Mar 31, 2023