Martin Renqiang Min

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

Oct 11, 2024

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Oct 10, 2024

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

Sep 22, 2024

Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation

Mar 19, 2024

Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

Mar 05, 2024

Exploring Compositional Visual Generation with Latent Classifier Guidance

Apr 25, 2023

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

Mar 24, 2023

T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precision Immunotherapy

Mar 02, 2023

Attribute-Centric Compositional Text-to-Image Generation

Jan 04, 2023

StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis

Mar 29, 2022