Elahe Rahimtoroghi

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Gemini Team, Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth (+930 more)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks, notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of Gemini models in cross-modal reasoning and language understanding will enable a wide variety of use cases and we discuss our approach toward deploying them responsibly to users.

What Happens To BERT Embeddings During Fine-tuning?

Apr 29, 2020

Amil Merchant, Elahe Rahimtoroghi, Ellie Pavlick, Ian Tenney

Abstract: While there has been much recent work studying how linguistic information is encoded in pre-trained sentence representations, comparatively little is understood about how these models change when adapted to solve downstream tasks. Using a suite of analysis techniques (probing classifiers, Representational Similarity Analysis, and model ablations), we investigate how fine-tuning affects the representations of the BERT model. We find that while fine-tuning necessarily makes significant changes, it does not lead to catastrophic forgetting of linguistic phenomena. We instead find that fine-tuning primarily affects the top layers of BERT, but with noteworthy variation across tasks. In particular, dependency parsing reconfigures most of the model, whereas SQuAD and MNLI appear to involve much shallower processing. Finally, we also find that fine-tuning has a weaker effect on representations of out-of-domain sentences, suggesting room for improvement in model generalization.

* 9 pages (not including references), 5 figures

Unsupervised Induction of Contingent Event Pairs from Film Scenes

Aug 30, 2017

Zhichao Hu, Elahe Rahimtoroghi, Larissa Munishkina, Reid Swanson, Marilyn A. Walker

Abstract: Human engagement in narrative is partially driven by reasoning about discourse relations between narrative events, and the expectations about what is likely to happen next that result from such reasoning. Researchers in NLP have tackled modeling such expectations from a range of perspectives, including treating it as the inference of the contingent discourse relation, or as a type of common-sense causal reasoning. Our approach is to model likelihood between events by drawing on several of these lines of previous work. We implement and evaluate different unsupervised methods for learning event pairs that are likely to be contingent on one another. We refine event pairs that we learn from a corpus of film scene descriptions utilizing web search counts, and evaluate our results by collecting human judgments of contingency. Our results indicate that the use of web search counts increases the average accuracy of our best method to 85.64% over a baseline of 50%, as compared to an average accuracy of 75.15% without web search.

* In Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP 2013)

Inference of Fine-Grained Event Causality from Blogs and Films

Aug 30, 2017

Zhichao Hu, Elahe Rahimtoroghi, Marilyn A Walker

Abstract: Human understanding of narrative is mainly driven by reasoning about causal relations between events, and thus recognizing them is a key capability for computational models of language understanding. Computational work in this area has approached this via two different routes: by focusing on acquiring a knowledge base of common causal relations between events, or by attempting to understand a particular story or macro-event, along with its storyline. In this position paper, we focus on the knowledge acquisition approach and claim that newswire is a relatively poor source for learning fine-grained causal relations between everyday events. We describe experiments using an unsupervised method to learn causal relations between events in the narrative genres of first-person narratives and film scene descriptions. We show that our method learns fine-grained causal relations, judged by humans as likely to be causal over 80% of the time. We also demonstrate that the learned event pairs do not exist in publicly available event-pair datasets extracted from newswire.

* Events and Stories in the News Workshop, ACL 2017

Learning Fine-Grained Knowledge about Contingent Relations between Everyday Events

Aug 30, 2017

Elahe Rahimtoroghi, Ernesto Hernandez, Marilyn A Walker

Abstract: Much of the user-generated content on social media is provided by ordinary people telling stories about their daily lives. We develop and test a novel method for learning fine-grained common-sense knowledge from these stories about contingent (causal and conditional) relationships between everyday events. This type of knowledge is useful for text and story understanding, information extraction, question answering, and text summarization. We test and compare different methods for learning contingency relations, and compare what is learned from topic-sorted story collections vs. general-domain stories. Our experiments show that using topic-specific datasets enables learning finer-grained knowledge about events and results in significant improvement over the baselines. An evaluation on Amazon Mechanical Turk shows 82% of the relations between events that we learn from topic-sorted stories are judged as contingent.

* 17th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2016)

Modelling Protagonist Goals and Desires in First-Person Narrative

Aug 29, 2017

Elahe Rahimtoroghi, Jiaqi Wu, Ruimin Wang, Pranav Anand, Marilyn A Walker

Abstract: Many genres of natural language text are narratively structured, a testament to our predilection for organizing our experiences as narratives. There is broad consensus that understanding a narrative requires identifying and tracking the goals and desires of the characters and their narrative outcomes. However, to date, there has been limited work on computational models for this problem. We introduce a new dataset, DesireDB, which includes gold-standard labels for identifying statements of desire, textual evidence for desire fulfillment, and annotations for whether the stated desire is fulfilled given the evidence in the narrative context. We report experiments on tracking desire fulfillment using different methods, and show that an LSTM Skip-Thought model achieves an F-measure of 0.7 on our corpus.

* 10 pages, 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2017)
