Picture for Sai Qian Zhang

Sai Qian Zhang

FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality

Add code
Dec 12, 2024
Figure 1 for FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality
Figure 2 for FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality
Figure 3 for FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality
Figure 4 for FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality
Viaarxiv icon

Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction

Add code
Dec 11, 2024
Viaarxiv icon

DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation

Add code
Dec 03, 2024
Viaarxiv icon

GazeGen: Gaze-Driven User Interaction for Visual Content Generation

Add code
Nov 07, 2024
Viaarxiv icon

PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context

Add code
Oct 23, 2024
Viaarxiv icon

Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices

Add code
Oct 10, 2024
Figure 1 for Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices
Figure 2 for Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices
Figure 3 for Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices
Figure 4 for Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices
Viaarxiv icon

DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing

Add code
Sep 12, 2024
Viaarxiv icon

T3M: Text Guided 3D Human Motion Synthesis from Speech

Add code
Aug 23, 2024
Figure 1 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 2 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 3 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 4 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Viaarxiv icon

Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design

Add code
Jul 23, 2024
Figure 1 for Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design
Figure 2 for Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design
Figure 3 for Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design
Figure 4 for Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design
Viaarxiv icon

HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization

Add code
May 31, 2024
Figure 1 for HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
Figure 2 for HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
Figure 3 for HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
Figure 4 for HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization
Viaarxiv icon