Picture for Fangkai Yang

Fangkai Yang

NVIDIA Corporation

Large Action Models: From Inception to Implementation

Add code
Dec 13, 2024
Viaarxiv icon

Self-Evolved Reward Learning for LLMs

Add code
Nov 01, 2024
Figure 1 for Self-Evolved Reward Learning for LLMs
Figure 2 for Self-Evolved Reward Learning for LLMs
Figure 3 for Self-Evolved Reward Learning for LLMs
Figure 4 for Self-Evolved Reward Learning for LLMs
Viaarxiv icon

Token-level Proximal Policy Optimization for Query Generation

Add code
Nov 01, 2024
Viaarxiv icon

AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure

Add code
Sep 26, 2024
Figure 1 for AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure
Figure 2 for AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure
Figure 3 for AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure
Figure 4 for AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure
Viaarxiv icon

Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents

Add code
Sep 25, 2024
Viaarxiv icon

EfficientRAG: Efficient Retriever for Multi-Hop Question Answering

Add code
Aug 08, 2024
Viaarxiv icon

The Vision of Autonomic Computing: Can LLMs Make It a Reality?

Add code
Jul 19, 2024
Figure 1 for The Vision of Autonomic Computing: Can LLMs Make It a Reality?
Figure 2 for The Vision of Autonomic Computing: Can LLMs Make It a Reality?
Figure 3 for The Vision of Autonomic Computing: Can LLMs Make It a Reality?
Figure 4 for The Vision of Autonomic Computing: Can LLMs Make It a Reality?
Viaarxiv icon

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Add code
Jun 27, 2024
Figure 1 for AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Figure 2 for AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Figure 3 for AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Figure 4 for AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Viaarxiv icon

Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation

Add code
Jun 19, 2024
Figure 1 for Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Figure 2 for Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Figure 3 for Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Figure 4 for Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Viaarxiv icon

An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing

Add code
Jun 03, 2024
Figure 1 for An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Figure 2 for An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Figure 3 for An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Figure 4 for An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Viaarxiv icon