Picture for Jiaqi Wang

Jiaqi Wang

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Add code
Oct 23, 2024
Figure 1 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 2 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 3 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 4 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Viaarxiv icon

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Add code
Oct 22, 2024
Viaarxiv icon

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Add code
Oct 21, 2024
Viaarxiv icon

BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation

Add code
Oct 19, 2024
Viaarxiv icon

Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Add code
Oct 09, 2024
Viaarxiv icon

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Add code
Oct 09, 2024
Viaarxiv icon

BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way

Add code
Oct 08, 2024
Figure 1 for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
Figure 2 for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
Figure 3 for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
Figure 4 for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way
Viaarxiv icon

Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought

Add code
Oct 08, 2024
Figure 1 for Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought
Figure 2 for Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought
Figure 3 for Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought
Figure 4 for Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought
Viaarxiv icon

BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models

Add code
Oct 04, 2024
Figure 1 for BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
Figure 2 for BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
Figure 3 for BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
Figure 4 for BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
Viaarxiv icon