Picture for Martin Renqiang Min

Martin Renqiang Min

Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection

Add code
Nov 17, 2024
Viaarxiv icon

Variational methods for Learning Multilevel Genetic Algorithms using the Kantorovich Monad

Add code
Nov 14, 2024
Viaarxiv icon

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

Add code
Oct 11, 2024
Viaarxiv icon

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Add code
Oct 10, 2024
Viaarxiv icon

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Add code
Sep 22, 2024
Figure 1 for Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Figure 2 for Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Figure 3 for Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Figure 4 for Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Viaarxiv icon

Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation

Add code
Mar 19, 2024
Viaarxiv icon

Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

Add code
Mar 05, 2024
Viaarxiv icon

Exploring Compositional Visual Generation with Latent Classifier Guidance

Add code
Apr 25, 2023
Figure 1 for Exploring Compositional Visual Generation with Latent Classifier Guidance
Figure 2 for Exploring Compositional Visual Generation with Latent Classifier Guidance
Figure 3 for Exploring Compositional Visual Generation with Latent Classifier Guidance
Figure 4 for Exploring Compositional Visual Generation with Latent Classifier Guidance
Viaarxiv icon

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Add code
Mar 24, 2023
Figure 1 for Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Figure 2 for Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Figure 3 for Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Figure 4 for Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Viaarxiv icon

T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precesion Immunotherapy

Add code
Mar 02, 2023
Viaarxiv icon