Picture for Jun Yu

Jun Yu

Lehigh University

Optimizing Preference Alignment with Differentiable NDCG Ranking

Add code
Oct 17, 2024
Viaarxiv icon

Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering

Add code
Oct 12, 2024
Figure 1 for Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering
Figure 2 for Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering
Figure 3 for Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering
Figure 4 for Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering
Viaarxiv icon

Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering

Add code
Oct 12, 2024
Viaarxiv icon

Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading

Add code
Oct 08, 2024
Figure 1 for Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Figure 2 for Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Figure 3 for Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Figure 4 for Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Viaarxiv icon

Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy

Add code
Oct 08, 2024
Figure 1 for Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy
Figure 2 for Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy
Figure 3 for Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy
Figure 4 for Learning the Generalizable Manipulation Skills on Soft-body Tasks via Guided Self-attention Behavior Cloning Policy
Viaarxiv icon

A General Framework for Producing Interpretable Semantic Text Embeddings

Add code
Oct 04, 2024
Figure 1 for A General Framework for Producing Interpretable Semantic Text Embeddings
Figure 2 for A General Framework for Producing Interpretable Semantic Text Embeddings
Figure 3 for A General Framework for Producing Interpretable Semantic Text Embeddings
Figure 4 for A General Framework for Producing Interpretable Semantic Text Embeddings
Viaarxiv icon

DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books

Add code
Sep 08, 2024
Figure 1 for DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books
Figure 2 for DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books
Figure 3 for DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books
Figure 4 for DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books
Viaarxiv icon

LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement

Add code
Aug 29, 2024
Figure 1 for LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Figure 2 for LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Figure 3 for LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Figure 4 for LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement
Viaarxiv icon

MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability

Add code
Jul 28, 2024
Figure 1 for MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Figure 2 for MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Figure 3 for MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Figure 4 for MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Viaarxiv icon

Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction

Add code
Jun 25, 2024
Figure 1 for Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction
Figure 2 for Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction
Figure 3 for Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction
Figure 4 for Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction
Viaarxiv icon