Picture for Xiang Zhang

Xiang Zhang

Victor

Cross-Modal Consistency in Multimodal Large Language Models

Add code
Nov 14, 2024
Viaarxiv icon

DexH2R: Task-oriented Dexterous Manipulation from Human to Robots

Add code
Nov 07, 2024
Viaarxiv icon

Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors

Add code
Nov 06, 2024
Viaarxiv icon

Counting Ability of Large Language Models and Impact of Tokenization

Add code
Oct 25, 2024
Viaarxiv icon

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

Add code
Oct 25, 2024
Viaarxiv icon

Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation

Add code
Oct 22, 2024
Figure 1 for Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
Figure 2 for Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
Figure 3 for Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
Figure 4 for Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
Viaarxiv icon

Supervised Chain of Thought

Add code
Oct 18, 2024
Viaarxiv icon

Trojan Prompt Attacks on Graph Neural Networks

Add code
Oct 17, 2024
Viaarxiv icon

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

Add code
Oct 17, 2024
Figure 1 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 2 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 3 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Figure 4 for Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Viaarxiv icon

NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon