Picture for Zhangyang Wang

Zhangyang Wang

Texas A&M University

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Add code
Jan 12, 2025
Viaarxiv icon

VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment

Add code
Jan 03, 2025
Viaarxiv icon

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Add code
Jan 01, 2025
Figure 1 for Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Figure 2 for Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Figure 3 for Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Figure 4 for Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Viaarxiv icon

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Add code
Dec 31, 2024
Figure 1 for Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Figure 2 for Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Figure 3 for Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Figure 4 for Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Viaarxiv icon

Enhancing Item Tokenization for Generative Recommendation through Self-Improvement

Add code
Dec 22, 2024
Viaarxiv icon

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

Add code
Dec 19, 2024
Figure 1 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 2 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 3 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 4 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Viaarxiv icon

APOLLO: SGD-like Memory, AdamW-level Performance

Add code
Dec 09, 2024
Viaarxiv icon

On How Iterative Magnitude Pruning Discovers Local Receptive Fields in Fully Connected Neural Networks

Add code
Dec 09, 2024
Viaarxiv icon

A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs

Add code
Dec 05, 2024
Figure 1 for A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Figure 2 for A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Figure 3 for A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Figure 4 for A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Viaarxiv icon

Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Add code
Nov 17, 2024
Viaarxiv icon