Martin Renqiang Min

Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation

Dec 19, 2024

Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection

Nov 17, 2024

Variational methods for Learning Multilevel Genetic Algorithms using the Kantorovich Monad

Nov 14, 2024

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

Oct 11, 2024

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Oct 10, 2024

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Sep 22, 2024

Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation

Mar 19, 2024

Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

Mar 05, 2024

Exploring Compositional Visual Generation with Latent Classifier Guidance

Apr 25, 2023

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Mar 24, 2023