Picture for Zhenghao Chen

Zhenghao Chen

School of Electrical and Information Engineering, The University of Sydney, Australia

Valley2: Exploring Multimodal Models with Scalable Vision-Language Design

Add code
Jan 13, 2025
Figure 1 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 2 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 3 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 4 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Viaarxiv icon

An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks

Add code
Jan 08, 2025
Figure 1 for An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks
Figure 2 for An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks
Figure 3 for An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks
Figure 4 for An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks
Viaarxiv icon

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models

Add code
Nov 27, 2024
Viaarxiv icon

HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression

Add code
Nov 27, 2024
Viaarxiv icon

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

WEATHER-5K: A Large-scale Global Station Weather Dataset Towards Comprehensive Time-series Forecasting Benchmark

Add code
Jun 20, 2024
Viaarxiv icon

Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor

Add code
Jun 02, 2024
Figure 1 for Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Figure 2 for Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Figure 3 for Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Figure 4 for Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Viaarxiv icon

PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis

Add code
May 24, 2024
Viaarxiv icon

CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational Transformer

Add code
May 08, 2024
Viaarxiv icon

Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression

Add code
May 07, 2024
Viaarxiv icon