Picture for Hongyin Tang

Hongyin Tang

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

Efficient Context Scaling with LongCat ZigZag Attention

Add code
Dec 30, 2025
Viaarxiv icon

IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method

Add code
Sep 26, 2025
Figure 1 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 2 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 3 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 4 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Viaarxiv icon

NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables

Add code
Apr 09, 2025
Viaarxiv icon

Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern

Add code
Dec 06, 2024
Figure 1 for Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern
Figure 2 for Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern
Figure 3 for Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern
Figure 4 for Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern
Viaarxiv icon

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Add code
Jun 11, 2023
Figure 1 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 2 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 3 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 4 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Viaarxiv icon

Multi-task Transformer with Relation-attention and Type-attention for Named Entity Recognition

Add code
Mar 20, 2023
Viaarxiv icon

Inflected Forms Are Redundant in Question Generation Models

Add code
Jan 01, 2023
Viaarxiv icon

CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations

Add code
Aug 23, 2022
Figure 1 for CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Figure 2 for CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Figure 3 for CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Figure 4 for CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Viaarxiv icon

VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction

Add code
Dec 08, 2021
Figure 1 for VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Figure 2 for VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Figure 3 for VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Figure 4 for VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Viaarxiv icon