Picture for Baotian Hu

Baotian Hu

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Add code
Nov 16, 2025
Viaarxiv icon

A Question Answering Dataset for Temporal-Sensitive Retrieval-Augmented Generation

Add code
Aug 17, 2025
Viaarxiv icon

On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey

Add code
Jul 28, 2025
Viaarxiv icon

KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model

Add code
Jun 26, 2025
Figure 1 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 2 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 3 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 4 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Viaarxiv icon

AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Add code
Jun 12, 2025
Viaarxiv icon

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Add code
Jun 11, 2025
Viaarxiv icon

Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Add code
Jun 11, 2025
Viaarxiv icon

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Add code
Jun 05, 2025
Viaarxiv icon

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Add code
May 25, 2025
Figure 1 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 2 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 3 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Figure 4 for VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
Viaarxiv icon

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Add code
May 08, 2025
Figure 1 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 2 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 3 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 4 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Viaarxiv icon