Picture for Jiawei Wang

Jiawei Wang

Tarsier: Recipes for Training and Evaluating Large Video Description Models

Add code
Jun 30, 2024
Viaarxiv icon

DLAFormer: An End-to-End Transformer For Document Layout Analysis

Add code
May 20, 2024
Viaarxiv icon

AMCEN: An Attention Masking-based Contrastive Event Network for Two-stage Temporal Knowledge Graph Reasoning

Add code
May 16, 2024
Viaarxiv icon

EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

Add code
Mar 17, 2024
Figure 1 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 2 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 3 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Figure 4 for EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech
Viaarxiv icon

Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation

Add code
Feb 22, 2024
Viaarxiv icon

Boximator: Generating Rich and Controllable Motions for Video Synthesis

Add code
Feb 02, 2024
Figure 1 for Boximator: Generating Rich and Controllable Motions for Video Synthesis
Figure 2 for Boximator: Generating Rich and Controllable Motions for Video Synthesis
Figure 3 for Boximator: Generating Rich and Controllable Motions for Video Synthesis
Figure 4 for Boximator: Generating Rich and Controllable Motions for Video Synthesis
Viaarxiv icon

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis

Add code
Jan 22, 2024
Viaarxiv icon

UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents

Add code
Jan 17, 2024
Viaarxiv icon

Dynamic Relation Transformer for Contextual Text Block Detection

Add code
Jan 17, 2024
Viaarxiv icon

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Add code
Dec 25, 2023
Viaarxiv icon