Adriana Meza Soria

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Aug 23, 2024

Scaling Granite Code Models to 128K Context

Jul 18, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

Jun 27, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

May 07, 2024

API Pack: A Massive Multilingual Dataset for API Call Generation

Feb 16, 2024