Lei Li

Carnegie Mellon University

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Dec 30, 2024

Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection

Dec 22, 2024

GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models

Dec 17, 2024

MeshArt: Generating Articulated Meshes with Structure-guided Transformers

Dec 16, 2024

Position-aware Guided Point Cloud Completion with CLIP Model

Dec 11, 2024

A Practical Examination of AI-Generated Text Detectors for Large Language Models

Dec 06, 2024

Chain-of-Thought in Large Language Models: Decoding, Projection, and Activation

Dec 05, 2024

3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting

Dec 02, 2024

SoK: Watermarking for AI-Generated Content

Nov 27, 2024

Graph Canvas for Controllable 3D Scene Generation

Nov 27, 2024