Picture for Hailin Hu

Hailin Hu

and Other Contributors

Towards Lossless Ultimate Vision Token Compression for VLMs

Add code
Dec 09, 2025
Viaarxiv icon

Positional Preservation Embedding for Multimodal Large Language Models

Add code
Oct 27, 2025
Viaarxiv icon

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

Add code
May 29, 2025
Figure 1 for Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Figure 2 for Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Figure 3 for Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Figure 4 for Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Viaarxiv icon

Single Domain Generalization for Few-Shot Counting via Universal Representation Matching

Add code
May 22, 2025
Figure 1 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 2 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 3 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 4 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Viaarxiv icon

EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution

Add code
May 08, 2025
Figure 1 for EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
Figure 2 for EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
Figure 3 for EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
Figure 4 for EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution
Viaarxiv icon

Transferable text data distillation by trajectory matching

Add code
Apr 14, 2025
Figure 1 for Transferable text data distillation by trajectory matching
Figure 2 for Transferable text data distillation by trajectory matching
Figure 3 for Transferable text data distillation by trajectory matching
Figure 4 for Transferable text data distillation by trajectory matching
Viaarxiv icon

Saliency-driven Dynamic Token Pruning for Large Language Models

Add code
Apr 09, 2025
Figure 1 for Saliency-driven Dynamic Token Pruning for Large Language Models
Figure 2 for Saliency-driven Dynamic Token Pruning for Large Language Models
Figure 3 for Saliency-driven Dynamic Token Pruning for Large Language Models
Figure 4 for Saliency-driven Dynamic Token Pruning for Large Language Models
Viaarxiv icon

GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video

Add code
Jan 20, 2025
Figure 1 for GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video
Figure 2 for GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video
Figure 3 for GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video
Figure 4 for GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video
Viaarxiv icon

Omni-Dimensional Frequency Learner for General Time Series Analysis

Add code
Jul 15, 2024
Figure 1 for Omni-Dimensional Frequency Learner for General Time Series Analysis
Figure 2 for Omni-Dimensional Frequency Learner for General Time Series Analysis
Figure 3 for Omni-Dimensional Frequency Learner for General Time Series Analysis
Figure 4 for Omni-Dimensional Frequency Learner for General Time Series Analysis
Viaarxiv icon

GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization

Add code
Jun 24, 2024
Figure 1 for GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Figure 2 for GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Figure 3 for GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Figure 4 for GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Viaarxiv icon