Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Felix B Mueller

Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Sep 18, 2024

Felix B Mueller, Julian Tanke, Juergen Gall

Figure 1 for Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Figure 2 for Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Figure 3 for Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Figure 4 for Massively Multi-Person 3D Human Motion Forecasting with Scene Context

Abstract:Forecasting long-term 3D human motion is challenging: the stochasticity of human behavior makes it hard to generate realistic human motion from the input sequence alone. Information on the scene environment and the motion of nearby people can greatly aid the generation process. We propose a scene-aware social transformer model (SAST) to forecast long-term (10s) human motion motion. Unlike previous models, our approach can model interactions between both widely varying numbers of people and objects in a scene. We combine a temporal convolutional encoder-decoder architecture with a Transformer-based bottleneck that allows us to efficiently combine motion and scene information. We model the conditional motion distribution using denoising diffusion models. We benchmark our approach on the Humans in Kitchens dataset, which contains 1 to 16 persons and 29 to 50 objects that are visible simultaneously. Our model outperforms other approaches in terms of realism and diversity on different metrics and in a user study. Code is available at https://github.com/felixbmuller/SAST.

* 14 pages, 6 figures

Via

Access Paper or Ask Questions

LLMs and Memorization: On Quality and Specificity of Copyright Compliance

May 28, 2024

Felix B Mueller, Rebekka Görge, Anna K Bernzen, Janna C Pirk, Maximilian Poretschkin

Figure 1 for LLMs and Memorization: On Quality and Specificity of Copyright Compliance

Figure 2 for LLMs and Memorization: On Quality and Specificity of Copyright Compliance

Figure 3 for LLMs and Memorization: On Quality and Specificity of Copyright Compliance

Figure 4 for LLMs and Memorization: On Quality and Specificity of Copyright Compliance

Abstract:Memorization in large language models (LLMs) is a growing concern. LLMs have been shown to easily reproduce parts of their training data, including copyrighted work. This is an important problem to solve, as it may violate existing copyright laws as well as the European AI Act. In this work, we propose a systematic analysis to quantify the extent of potential copyright infringements in LLMs using European law as an example. Unlike previous work, we evaluate instruction-finetuned models in a realistic end-user scenario. Our analysis builds on a proposed threshold of 160 characters, which we borrow from the German Copyright Service Provider Act and a fuzzy text matching algorithm to identify potentially copyright-infringing textual reproductions. The specificity of countermeasures against copyright infringement is analyzed by comparing model behavior on copyrighted and public domain data. We investigate what behaviors models show instead of producing protected text (such as refusal or hallucination) and provide a first legal assessment of these behaviors. We find that there are huge differences in copyright compliance, specificity, and appropriate refusal among popular LLMs. Alpaca, GPT 4, GPT 3.5, and Luminous perform best in our comparison, with OpenGPT-X, Alpaca, and Luminous producing a particularly low absolute number of potential copyright violations. Code will be published soon.

* 10 pages, 3 figures

Via

Access Paper or Ask Questions