Picture for Zhijie Wang

Zhijie Wang

Towards Understanding Retrieval Accuracy and Prompt Quality in RAG Systems

Add code
Nov 29, 2024
Viaarxiv icon

LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation

Add code
Oct 07, 2024
Viaarxiv icon

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Add code
Mar 06, 2024
Viaarxiv icon

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

Add code
Jul 16, 2023
Figure 1 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 2 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 3 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 4 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Viaarxiv icon

Benchmarking Robustness of AI-enabled Multi-sensor Fusion Systems: Challenges and Opportunities

Add code
Jun 06, 2023
Viaarxiv icon

Is Model Attention Aligned with Human Attention? An Empirical Study on Large Language Models for Code Generation

Add code
Jun 02, 2023
Viaarxiv icon

Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching

Add code
May 29, 2023
Figure 1 for Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching
Figure 2 for Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching
Figure 3 for Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching
Figure 4 for Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching
Viaarxiv icon

DeepLens: Interactive Out-of-distribution Data Detection in NLP Models

Add code
Mar 02, 2023
Viaarxiv icon

DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction

Add code
Mar 02, 2023
Viaarxiv icon

An Exploratory Study of AI System Risk Assessment from the Lens of Data Distribution and Uncertainty

Add code
Dec 13, 2022
Viaarxiv icon