Arkabandhu Chowdhury

Jack

The Llama 3 Herd of Models

Jul 31, 2024

Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan(+521 more)

Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Jun 01, 2023

Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman(+3 more)

Abstract: Modern hierarchical vision transformers have added several vision-specific components in the pursuit of supervised classification performance. While these components lead to effective accuracies and attractive FLOP counts, the added complexity actually makes these transformers slower than their vanilla ViT counterparts. In this paper, we argue that this additional bulk is unnecessary. By pretraining with a strong visual pretext task (MAE), we can strip out all the bells-and-whistles from a state-of-the-art multi-stage vision transformer without losing accuracy. In the process, we create Hiera, an extremely simple hierarchical vision transformer that is more accurate than previous models while being significantly faster both at inference and during training. We evaluate Hiera on a variety of tasks for image and video recognition. Our code and models are available at https://github.com/facebookresearch/hiera.

* ICML 2023 Oral version. Code+Models: https://github.com/facebookresearch/hiera

Few-shot Image Classification: Just Use a Library of Pre-trained Feature Extractors and a Simple Classifier

Jan 03, 2021

Arkabandhu Chowdhury, Mingchao Jiang, Chris Jermaine

Abstract: Recent papers have suggested that transfer learning can outperform sophisticated meta-learning methods for few-shot image classification. We take this hypothesis to its logical conclusion, and suggest the use of an ensemble of high-quality, pre-trained feature extractors for few-shot image classification. We show experimentally that a library of pre-trained feature extractors combined with a simple feed-forward network learned with an L2-regularizer can be an excellent option for solving cross-domain few-shot image classification. Our experimental results suggest that this simpler sample-efficient approach far outperforms several well-established meta-learning algorithms on a variety of few-shot tasks.

* 17 pages including appendix and references. 2 figures in the main paper, 1 figure in appendix. Under submission at a conference
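
The idea in the abstract above (features from a library of extractors feeding a simple L2-regularized classifier) can be illustrated with a minimal sketch. Everything named here is an assumption for illustration: `extractor_a` and `extractor_b` are toy stand-ins for pre-trained networks, and a plain logistic regression stands in for the paper's feed-forward network. It is not the authors' implementation.

```python
import math

# Hypothetical stand-ins for pre-trained feature extractors (not from the paper).
def extractor_a(x):
    return [x[0], x[1]]

def extractor_b(x):
    return [x[0] * x[1], x[0] - x[1]]

LIBRARY = [extractor_a, extractor_b]

def library_features(x):
    """Concatenate the outputs of every extractor in the library."""
    feats = []
    for extract in LIBRARY:
        feats.extend(extract(x))
    return feats

def train_l2_logistic(samples, labels, l2=0.01, lr=0.1, epochs=500):
    """Binary logistic regression fit by gradient descent with an L2 penalty."""
    feats = [library_features(x) for x in samples]
    n = len(samples)
    w = [0.0] * len(feats[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = [l2 * wi for wi in w]  # gradient of (l2 / 2) * ||w||^2
        grad_b = 0.0
        for f, y in zip(feats, labels):
            z = sum(wi * fi for wi, fi in zip(w, f)) + b
            p = 1.0 / (1.0 + math.exp(-z))
            for i, fi in enumerate(f):
                grad_w[i] += (p - y) * fi / n
            grad_b += (p - y) / n
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Return 1 if the classifier scores x as class 1, else 0."""
    f = library_features(x)
    z = sum(wi * fi for wi, fi in zip(w, f)) + b
    return 1 if z > 0 else 0

# A tiny "few-shot" support set: two labeled examples per class.
support = [(1.0, 1.0), (1.2, 0.8), (-1.0, -1.0), (-0.9, -1.1)]
support_labels = [1, 1, 0, 0]
w, b = train_l2_logistic(support, support_labels)
```

Calling `predict(w, b, query)` then classifies a new point using the concatenated library features. The extractors stay frozen; only the small classifier on top is trained on the few labeled examples.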

Meta-Meta-Classification for One-Shot Learning

May 12, 2020

Arkabandhu Chowdhury, Dipak Chaudhari, Swarat Chaudhuri, Chris Jermaine

Abstract: We present a new approach, called meta-meta-classification, to learning in small-data settings. In this approach, one uses a large set of learning problems to design an ensemble of learners, where each learner has high bias and low variance and is skilled at solving a specific type of learning problem. The meta-meta classifier learns how to examine a given learning problem and combine the various learners to solve the problem. The meta-meta-learning approach is especially suited to solving few-shot learning tasks, as it is easier to learn to classify a new learning problem with little data than it is to apply a learning algorithm to a small data set. We evaluate the approach on a one-shot, one-class-versus-all classification task and show that it is able to outperform traditional meta-learning as well as ensembling approaches.

* 8 pages without references, 2 figures

Multi-modal dialog for browsing large visual catalogs using exploration-exploitation paradigm in a joint embedding space

Jan 29, 2019

Indrani Bhattacharya, Arkabandhu Chowdhury, Vikas Raykar

Abstract: We present a multi-modal dialog system to assist online shoppers in visually browsing through large catalogs. Visual browsing is different from visual search in that it allows the user to explore the wide range of products in a catalog, beyond the exact search matches. We focus on a slightly asymmetric version of the complete multi-modal dialog where the system can understand both text and image queries but responds only in images. We formulate our problem of "showing $k$ best images to a user", based on the dialog context so far, as sampling from a Gaussian Mixture Model in a high-dimensional joint multi-modal embedding space that embeds both the text and the image queries. Our system remembers the context of the dialog and uses an exploration-exploitation paradigm to assist in visual browsing. We train and evaluate the system on a multi-modal dialog dataset that we generate from large catalog data. Our experiments are promising and show that the agent is capable of learning and can display relevant results with an average cosine similarity of 0.85 to the ground truth. Our preliminary human evaluation also corroborates the fact that such a multi-modal dialog system for visual browsing is well-received and is capable of engaging human users.

* 10 pages including references, 8 figures. First two authors are equal contributors
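
The "showing $k$ best images" formulation in the abstract above can be illustrated with a minimal sketch: sample points from a Gaussian mixture placed over the dialog context, then return the nearest catalog items by cosine similarity. The function names, the isotropic components, the recency weighting, and the toy two-dimensional embeddings are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def sample_gmm(means, weights, sigma, rng):
    """Draw one point from an isotropic Gaussian mixture."""
    mean = rng.choices(means, weights=weights, k=1)[0]
    return [rng.gauss(m, sigma) for m in mean]

def show_k_images(catalog, context, k, sigma=0.3, seed=0):
    """Pick up to k distinct catalog items nearest to GMM samples.

    context holds one embedding per dialog turn; each is a mixture component.
    Assumptions: later turns get larger weights (exploitation), and sigma
    controls how far samples wander from the context (exploration).
    """
    rng = random.Random(seed)
    weights = [i + 1 for i in range(len(context))]
    chosen = []
    for _ in range(min(k, len(catalog))):
        point = sample_gmm(context, weights, sigma, rng)
        ranked = sorted(catalog, key=lambda name: cosine(catalog[name], point),
                        reverse=True)
        # Take the best-matching item that has not been shown yet.
        chosen.append(next(name for name in ranked if name not in chosen))
    return chosen

# Toy catalog of hypothetical item embeddings in the joint space.
catalog = {
    "red_dress": [1.0, 0.1],
    "blue_dress": [0.9, 0.4],
    "sneaker": [-0.2, 1.0],
    "boot": [-0.5, 0.8],
}
context = [[1.0, 0.2]]  # embedding of the single query seen so far
picks = show_k_images(catalog, context, k=2)
```

A larger `sigma` spreads the samples further from the context embeddings, so the returned items become more varied; a smaller one keeps them close to what the user has already asked for.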

An Evolutionary Approach to Drug-Design Using a Novel Neighbourhood Based Genetic Algorithm

May 03, 2012

Arnab Ghosh, Avishek Ghosh, Arkabandhu Chowdhury, Amit Konar

Abstract: The present work provides a new approach to evolve ligand structures which represent possible drugs to be docked to the active site of the target protein. The structure is represented as a tree where each non-empty node represents a functional group. It is assumed that the active site configuration of the target protein is known, along with the positions of the essential residues. In this paper, the interaction energy of the ligands with the protein target is minimized. Moreover, the size of the tree is difficult to obtain and it will be different for different active sites. To overcome this difficulty, a variable tree size configuration is used for designing ligands. The optimization is done using a novel Neighbourhood Based Genetic Algorithm (NBGA) which uses dynamic neighbourhood topology. To get variable tree size, a variable-length version of the above algorithm is devised. To judge the merit of the algorithm, it is initially applied to the well-known Travelling Salesman Problem (TSP).

* 10 pages, 13 figures (Communicated to journal)

Multi-robot Cooperative Box-pushing problem using Multi-objective Particle Swarm Optimization Technique

May 03, 2012

Arnab Ghosh, Avishek Ghosh, Arkabandhu Chowdhury, Amit Konar, R. Janarthanan

Abstract: The present work provides a new approach to solve the well-known multi-robot co-operative box pushing problem as a multi-objective optimization problem using a modified Multi-objective Particle Swarm Optimization. The method proposed here allows both turning and translation of the box during its shift to a desired goal position. We have employed a local planning scheme to determine the magnitude of the forces applied by the two mobile robots perpendicularly at specific locations on the box to align and translate it in each distinct step of motion of the box, for minimization of both time and energy. Finally, the results are compared with the results obtained by solving the same problem using Non-dominated Sorting Genetic Algorithm-II (NSGA-II). The proposed scheme is found to give better results compared to NSGA-II.

* 6 Pages, 4 Figures (Accepted at MNCAPPS 2012)

An Evolutionary Approach to Drug-Design Using Quantam Binary Particle Swarm Optimization Algorithm

May 03, 2012

Avishek Ghosh, Arnab Ghosh, Arkabandhu Chowdhury, Jubin Hazra

Abstract: The present work provides a new approach to evolve ligand structures which represent possible drugs to be docked to the active site of the target protein. The structure is represented as a tree where each non-empty node represents a functional group. It is assumed that the active site configuration of the target protein is known, along with the positions of the essential residues. In this paper, the interaction energy of the ligands with the protein target is minimized. Moreover, the size of the tree is difficult to obtain and it will be different for different active sites. To overcome this difficulty, a variable tree size configuration is used for designing ligands. The optimization is done using a quantum discrete PSO. The results using fixed-length and variable-length configurations are compared.

* 4 pages, 6 figures (Published in IEEE SCEECS 2012). arXiv admin note: substantial text overlap with arXiv:1205.6412
