Human Parsing


Human parsing is the process of identifying, segmenting, and categorizing different parts of a human body in an image or video such as head, shoulders, knees, and toes.

From Watch to Imagine: Steering Long-horizon Manipulation via Human Demonstration and Future Envisionment

Add code
Sep 26, 2025
Viaarxiv icon

CoBel-World: Harnessing LLM Reasoning to Build a Collaborative Belief World for Optimizing Embodied Multi-Agent Collaboration

Add code
Sep 26, 2025
Viaarxiv icon

Parse Graph-Based Visual-Language Interaction for Human Pose Estimation

Add code
Sep 09, 2025
Viaarxiv icon

Automated Evidence Extraction and Scoring for Corporate Climate Policy Engagement: A Multilingual RAG Approach

Add code
Sep 10, 2025
Viaarxiv icon

Automating the Deep Space Network Data Systems; A Case Study in Adaptive Anomaly Detection through Agentic AI

Add code
Aug 28, 2025
Viaarxiv icon

MahaParaphrase: A Marathi Paraphrase Detection Corpus and BERT-based Models

Add code
Aug 24, 2025
Viaarxiv icon

Reliable Weak-to-Strong Monitoring of LLM Agents

Add code
Aug 26, 2025
Viaarxiv icon

Semantic Anchoring in Agentic Memory: Leveraging Linguistic Structures for Persistent Conversational Context

Add code
Aug 18, 2025
Viaarxiv icon

$NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything

Add code
Aug 06, 2025
Viaarxiv icon

Incremental Language Understanding for Online Motion Planning of Robot Manipulators

Add code
Aug 08, 2025
Viaarxiv icon