Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pranav Venkit

Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Feb 16, 2024

Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson

Figure 1 for Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Figure 2 for Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Figure 3 for Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Figure 4 for Automated Detection and Analysis of Data Practices Using A Real-World Corpus

Abstract:Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them. In this paper, we propose an automated approach to identify and visualize data practices within privacy policies at different levels of detail. Leveraging crowd-sourced annotations from the ToS;DR platform, we experiment with various methods to match policy excerpts with predefined data practice descriptions. We further conduct a case study to evaluate our approach on a real-world policy, demonstrating its effectiveness in simplifying complex policies. Experiments show that our approach accurately matches data practice descriptions with policy excerpts, facilitating the presentation of simplified privacy information to users.

Via

Access Paper or Ask Questions

A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

Mar 14, 2021

Pranav Venkit, Zeba Karishma, Chi-Yang Hsu, Rahul Katiki, Kenneth Huang, Shomir Wilson, Patrick Dudas

Figure 1 for A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

Figure 2 for A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

Figure 3 for A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

Figure 4 for A `Sourceful' Twist: Emoji Prediction Based on Sentiment, Hashtags and Application Source

Abstract:We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text. Emoji suggestions already exist in many cross-platform applications but an emoji is predicted solely based a few prominent words instead of understanding the subject and substance of the text. Through this paper, we showcase the importance of using Twitter features to help the model understand the sentiment involved and hence to predict the most suitable emoji for the text. Hashtags and Application Sources like Android, etc. are two features which we found to be important yet underused in emoji prediction and Twitter sentiment analysis on the whole. To approach this shortcoming and to further understand emoji behavioral patterns, we propose a more balanced dataset by crawling additional Twitter data, including timestamp, hashtags, and application source acting as additional attributes to the tweet. Our data analysis and neural network model performance evaluations depict that using hashtags and application sources as features allows to encode different information and is effective in emoji prediction.

Via

Access Paper or Ask Questions