Picture for Boris Ginsburg

Boris Ginsburg

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling

Add code
Apr 11, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

Add code
Apr 05, 2025
Viaarxiv icon

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Add code
Apr 02, 2025
Viaarxiv icon

L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution

Add code
Mar 28, 2025
Viaarxiv icon

Training and Inference Efficiency of Encoder-Decoder Speech Models

Add code
Mar 07, 2025
Viaarxiv icon

Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning

Add code
Feb 19, 2025
Viaarxiv icon

Star Attention: Efficient LLM Inference over Long Sequences

Add code
Nov 26, 2024
Viaarxiv icon

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Add code
Nov 08, 2024
Viaarxiv icon

Anticipating Future with Large Language Model for Simultaneous Machine Translation

Add code
Oct 29, 2024
Figure 1 for Anticipating Future with Large Language Model for Simultaneous Machine Translation
Figure 2 for Anticipating Future with Large Language Model for Simultaneous Machine Translation
Figure 3 for Anticipating Future with Large Language Model for Simultaneous Machine Translation
Figure 4 for Anticipating Future with Large Language Model for Simultaneous Machine Translation
Viaarxiv icon