Ke Wang

Xidian University

EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

Feb 18, 2025, via arXiv

EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking

Feb 18, 2025, via arXiv

SLVR: Securely Leveraging Client Validation for Robust Federated Learning

Feb 12, 2025, via arXiv

BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos

Jan 30, 2025, via arXiv

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Jan 03, 2025, via arXiv

Algorithmic Strategies for Sustainable Reuse of Neural Network Accelerators with Permanent Faults

Dec 17, 2024, via arXiv

LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information

Dec 11, 2024, via arXiv

SAM Decoding: Speculative Decoding via Suffix Automaton

Nov 16, 2024, via arXiv

Wireless Resource Allocation with Collaborative Distributed and Centralized DRL under Control Channel Attacks

Nov 16, 2024, via arXiv

LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging

Oct 22, 2024, via arXiv