Picture for Weiwei Cheng

Weiwei Cheng

Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation

Add code
Sep 18, 2024
Viaarxiv icon

What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain

Add code
Aug 13, 2024
Viaarxiv icon

Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?

Add code
Sep 17, 2021
Figure 1 for Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
Figure 2 for Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
Figure 3 for Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
Figure 4 for Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
Viaarxiv icon

Evaluating for Diversity in Question Generation over Text

Add code
Aug 17, 2020
Figure 1 for Evaluating for Diversity in Question Generation over Text
Figure 2 for Evaluating for Diversity in Question Generation over Text
Figure 3 for Evaluating for Diversity in Question Generation over Text
Figure 4 for Evaluating for Diversity in Question Generation over Text
Viaarxiv icon

On the Bayes-optimality of F-measure maximizers

Add code
Mar 06, 2015
Figure 1 for On the Bayes-optimality of F-measure maximizers
Figure 2 for On the Bayes-optimality of F-measure maximizers
Figure 3 for On the Bayes-optimality of F-measure maximizers
Figure 4 for On the Bayes-optimality of F-measure maximizers
Viaarxiv icon

Label Ranking with Abstention: Predicting Partial Orders by Thresholding Probability Distributions (Extended Abstract)

Add code
Dec 02, 2011
Figure 1 for Label Ranking with Abstention: Predicting Partial Orders by Thresholding Probability Distributions (Extended Abstract)
Viaarxiv icon