Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohamad Rida Rammal

Reframing Data Value for Large Language Models Through the Lens of Plausability

Aug 30, 2024

Mohamad Rida Rammal, Ruida Zhou, Suhas Diggavi

Figure 1 for Reframing Data Value for Large Language Models Through the Lens of Plausability

Figure 2 for Reframing Data Value for Large Language Models Through the Lens of Plausability

Figure 3 for Reframing Data Value for Large Language Models Through the Lens of Plausability

Figure 4 for Reframing Data Value for Large Language Models Through the Lens of Plausability

Abstract:Data valuation seeks to answer the important question, "How much is this data worth?" Existing data valuation methods have largely focused on discriminative models, primarily examining data value through the lens of its utility in training. However, with the push for ever-larger language models, relying on valuation methods that require training becomes increasingly expensive and dependent on specific techniques. We propose an alternative perspective on the data value problem for language models, centering around the plausibility of the data. We posit that data holds lesser value if it can be plausibly generated by the model itself. Starting from some intuitive criteria that align with our notions of valuable data, we develop a novel value function that is computationally tractable and derived from first principles with provable properties. We conduct a theoretical analysis of our value function and evaluate it across multiple scenarios and datasets.

Via

Access Paper or Ask Questions

On Leave-One-Out Conditional Mutual Information For Generalization

Jul 01, 2022

Mohamad Rida Rammal, Alessandro Achille, Aditya Golatkar, Suhas Diggavi, Stefano Soatto

Figure 1 for On Leave-One-Out Conditional Mutual Information For Generalization

Figure 2 for On Leave-One-Out Conditional Mutual Information For Generalization

Figure 3 for On Leave-One-Out Conditional Mutual Information For Generalization

Figure 4 for On Leave-One-Out Conditional Mutual Information For Generalization

Abstract:We derive information theoretic generalization bounds for supervised learning algorithms based on a new measure of leave-one-out conditional mutual information (loo-CMI). Contrary to other CMI bounds, which are black-box bounds that do not exploit the structure of the problem and may be hard to evaluate in practice, our loo-CMI bounds can be computed easily and can be interpreted in connection to other notions such as classical leave-one-out cross-validation, stability of the optimization algorithm, and the geometry of the loss-landscape. It applies both to the output of training algorithms as well as their predictions. We empirically validate the quality of the bound by evaluating its predicted generalization gap in scenarios for deep learning. In particular, our bounds are non-vacuous on large-scale image-classification tasks.

Via

Access Paper or Ask Questions

Coded Estimation: Design of Backscatter Array Codes for 3D Orientation Estimation

Dec 01, 2021

Mohamad Rida Rammal, Suhas Diggavi, Ashutosh Sabharwal

Figure 1 for Coded Estimation: Design of Backscatter Array Codes for 3D Orientation Estimation

Figure 2 for Coded Estimation: Design of Backscatter Array Codes for 3D Orientation Estimation

Figure 3 for Coded Estimation: Design of Backscatter Array Codes for 3D Orientation Estimation

Figure 4 for Coded Estimation: Design of Backscatter Array Codes for 3D Orientation Estimation

Abstract:We consider the problem of estimating the orientation of a 3D object with the assistance of configurable backscatter tags. We explore the idea of designing tag response codes to improve the accuracy of orientation estimation. To minimize the difference between the true and estimated orientation, we propose two code design criteria. We also derive a lower bound on the worst-case error using Le Cam's method and provide simulation results for multiple scenarios including line-of-sight only and multipath, comparing the theoretical bounds to those achieved by the designs.

Via

Access Paper or Ask Questions