Jindong Wang

General Scales Unlock AI Evaluation with Explanatory and Predictive Power

Mar 09, 2025

Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks

Feb 10, 2025

MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison

Feb 07, 2025

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Feb 05, 2025

On Fairness of Unified Multimodal Large Language Model for Image Generation

Feb 05, 2025

Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values

Jan 13, 2025

CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries

Jan 02, 2025

Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension

Dec 22, 2024

Outcome-Refining Process Supervision for Code Generation

Dec 19, 2024

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Dec 14, 2024