Picture for Nishanth Dikkala

Nishanth Dikkala

How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis

Add code
Nov 07, 2024
Viaarxiv icon

Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles

Add code
Sep 16, 2024
Figure 1 for Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Figure 2 for Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Figure 3 for Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Figure 4 for Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Viaarxiv icon

Learning Neural Networks with Sparse Activations

Add code
Jun 26, 2024
Viaarxiv icon

ReMI: A Dataset for Reasoning with Multiple Images

Add code
Jun 13, 2024
Figure 1 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 2 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 3 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 4 for ReMI: A Dataset for Reasoning with Multiple Images
Viaarxiv icon

The Power of External Memory in Increasing Predictive Model Capacity

Add code
Jan 31, 2023
Figure 1 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 2 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 3 for The Power of External Memory in Increasing Predictive Model Capacity
Figure 4 for The Power of External Memory in Increasing Predictive Model Capacity
Viaarxiv icon

Alternating Updates for Efficient Transformers

Add code
Jan 30, 2023
Figure 1 for Alternating Updates for Efficient Transformers
Figure 2 for Alternating Updates for Efficient Transformers
Figure 3 for Alternating Updates for Efficient Transformers
Figure 4 for Alternating Updates for Efficient Transformers
Viaarxiv icon

A Theoretical View on Sparsely Activated Networks

Add code
Aug 08, 2022
Figure 1 for A Theoretical View on Sparsely Activated Networks
Figure 2 for A Theoretical View on Sparsely Activated Networks
Figure 3 for A Theoretical View on Sparsely Activated Networks
Figure 4 for A Theoretical View on Sparsely Activated Networks
Viaarxiv icon

Do More Negative Samples Necessarily Hurt in Contrastive Learning?

Add code
May 03, 2022
Figure 1 for Do More Negative Samples Necessarily Hurt in Contrastive Learning?
Figure 2 for Do More Negative Samples Necessarily Hurt in Contrastive Learning?
Figure 3 for Do More Negative Samples Necessarily Hurt in Contrastive Learning?
Figure 4 for Do More Negative Samples Necessarily Hurt in Contrastive Learning?
Viaarxiv icon

Statistical Estimation from Dependent Data

Add code
Jul 20, 2021
Figure 1 for Statistical Estimation from Dependent Data
Figure 2 for Statistical Estimation from Dependent Data
Figure 3 for Statistical Estimation from Dependent Data
Viaarxiv icon

For Manifold Learning, Deep Neural Networks can be Locality Sensitive Hash Functions

Add code
Mar 11, 2021
Figure 1 for For Manifold Learning, Deep Neural Networks can be Locality Sensitive Hash Functions
Figure 2 for For Manifold Learning, Deep Neural Networks can be Locality Sensitive Hash Functions
Figure 3 for For Manifold Learning, Deep Neural Networks can be Locality Sensitive Hash Functions
Figure 4 for For Manifold Learning, Deep Neural Networks can be Locality Sensitive Hash Functions
Viaarxiv icon