Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xinghai Hu

Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Feb 13, 2025

Mark Beliaev, Victor Yang, Madhura Raju, Jiachen Sun, Xinghai Hu

Figure 1 for Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Figure 2 for Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Figure 3 for Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Figure 4 for Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering

Abstract:In this study, we tackle industry challenges in video content classification by exploring and optimizing GPT-based models for zero-shot classification across seven critical categories of video quality. We contribute a novel approach to improving GPT's performance through prompt optimization and policy refinement, demonstrating that simplifying complex policies significantly reduces false negatives. Additionally, we introduce a new decomposition-aggregation-based prompt engineering technique, which outperforms traditional single-prompt methods. These experiments, conducted on real industry problems, show that thoughtful prompt design can substantially enhance GPT's performance without additional finetuning, offering an effective and scalable solution for improving video classification systems across various domains in industry.

Via

Access Paper or Ask Questions

Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Jun 07, 2024

Yulong Dong, Kun Jin, Xinghai Hu, Yang Liu

Figure 1 for Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Figure 2 for Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Figure 3 for Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Figure 4 for Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Abstract:In large-scale recommendation systems, the vast array of items makes it infeasible to obtain accurate user preferences for each product, resulting in a common issue of missing labels. Typically, only items previously recommended to users have associated ground truth data. Although there is extensive research on fairness concerning fully observed user-item interactions, the challenge of fairness in scenarios with missing labels remains underexplored. Previous methods often treat these samples missing labels as negative, which can significantly deviate from the ground truth fairness metrics. Our study addresses this gap by proposing a novel method employing a small randomized traffic to estimate fairness metrics accurately. We present theoretical bounds for the estimation error of our fairness metric and support our findings with empirical evidence on real data. Our numerical experiments on synthetic and TikTok's real-world data validate our theory and show the efficiency and effectiveness of our novel methods. To the best of our knowledge, we are the first to emphasize the necessity of random traffic in dataset collection for recommendation fairness, the first to publish a fairness-related dataset from TikTok and to provide reliable estimates of fairness metrics in the context of large-scale recommendation systems with missing labels.

Via

Access Paper or Ask Questions