Picture for Zhiqi Shen

Zhiqi Shen

Following the TRAIL: Predicting and Explaining Tomorrow's Hits with a Fine-Tuned LLM

Add code
Feb 04, 2026
Viaarxiv icon

Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking

Add code
Jan 06, 2026
Viaarxiv icon

MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Add code
Dec 17, 2025
Viaarxiv icon

EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks

Add code
Nov 16, 2025
Viaarxiv icon

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

Add code
Sep 16, 2025
Figure 1 for Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Figure 2 for Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Figure 3 for Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Figure 4 for Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Viaarxiv icon

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Add code
Aug 20, 2025
Figure 1 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 2 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 3 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Figure 4 for MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
Viaarxiv icon

Semantic Item Graph Enhancement for Multimodal Recommendation

Add code
Aug 08, 2025
Viaarxiv icon

Does Multimodality Improve Recommender Systems as Expected? A Critical Analysis and Future Directions

Add code
Aug 07, 2025
Figure 1 for Does Multimodality Improve Recommender Systems as Expected? A Critical Analysis and Future Directions
Figure 2 for Does Multimodality Improve Recommender Systems as Expected? A Critical Analysis and Future Directions
Figure 3 for Does Multimodality Improve Recommender Systems as Expected? A Critical Analysis and Future Directions
Figure 4 for Does Multimodality Improve Recommender Systems as Expected? A Critical Analysis and Future Directions
Viaarxiv icon

Response Uncertainty and Probe Modeling: Two Sides of the Same Coin in LLM Interpretability?

Add code
May 24, 2025
Viaarxiv icon

Knowledge Retrieval in LLM Gaming: A Shift from Entity-Centric to Goal-Oriented Graphs

Add code
May 24, 2025
Viaarxiv icon