Picture for Yunxin Liu

Yunxin Liu

Institute for AI Industry Research, Shanghai AI Laboratory, Shanghai, China

V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM

Add code
Nov 01, 2024
Viaarxiv icon

Generalized Robot Learning Framework

Add code
Sep 18, 2024
Figure 1 for Generalized Robot Learning Framework
Figure 2 for Generalized Robot Learning Framework
Figure 3 for Generalized Robot Learning Framework
Figure 4 for Generalized Robot Learning Framework
Viaarxiv icon

A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage

Add code
Sep 06, 2024
Figure 1 for A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
Figure 2 for A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
Figure 3 for A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
Figure 4 for A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage
Viaarxiv icon

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Add code
Jul 16, 2024
Viaarxiv icon

LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design

Add code
May 28, 2024
Viaarxiv icon

A Survey of Resource-efficient LLM and Multimodal Foundation Models

Add code
Jan 16, 2024
Figure 1 for A Survey of Resource-efficient LLM and Multimodal Foundation Models
Figure 2 for A Survey of Resource-efficient LLM and Multimodal Foundation Models
Figure 3 for A Survey of Resource-efficient LLM and Multimodal Foundation Models
Figure 4 for A Survey of Resource-efficient LLM and Multimodal Foundation Models
Viaarxiv icon

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

Add code
Jan 10, 2024
Figure 1 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 2 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 3 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Figure 4 for Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security
Viaarxiv icon

BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

Add code
Dec 25, 2023
Figure 1 for BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Figure 2 for BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Figure 3 for BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Figure 4 for BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge
Viaarxiv icon

FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

Add code
Oct 25, 2023
Viaarxiv icon

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations

Add code
Sep 16, 2023
Figure 1 for Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations
Figure 2 for Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations
Figure 3 for Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations
Figure 4 for Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations
Viaarxiv icon