Human Parsing


Human parsing is the process of identifying, segmenting, and categorizing different parts of a human body in an image or video such as head, shoulders, knees, and toes.

EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding

Add code
May 14, 2026
Viaarxiv icon

3D Primitives are a Spatial Language for VLMs

Add code
May 12, 2026
Viaarxiv icon

Chronicles-OCR: A Cross-Temporal Perception Benchmark for the Evolutionary Trajectory of Chinese Characters

Add code
May 12, 2026
Viaarxiv icon

Towards Robust Sequential Decomposition for Complex Image Editing

Add code
May 10, 2026
Viaarxiv icon

ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection

Add code
May 06, 2026
Viaarxiv icon

Library learning with e-graphs on jazz harmony

Add code
May 06, 2026
Viaarxiv icon

Parser agreement and disagreement in L2 Korean UD: Implications for human-in-the-loop annotation

Add code
May 07, 2026
Viaarxiv icon

NeuroAgent: LLM Agents for Multimodal Neuroimaging Analysis and Research

Add code
May 07, 2026
Viaarxiv icon

An ERP Study of Recursive Possessive Parsing in ASD Children and Its Cognitive Neuro Mechanisms

Add code
May 05, 2026
Viaarxiv icon

Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

Add code
May 07, 2026
Viaarxiv icon