Lianhui Qin

Shammie

Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors

Aug 15, 2024
Via arXiv

WeatherQA: Can Multimodal Language Models Reason about Severe Weather?

Jun 17, 2024
Via arXiv

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

Jun 09, 2024
Via arXiv

Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

Apr 04, 2024
Via arXiv

LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction

Mar 19, 2024
Via arXiv

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Feb 13, 2024
Via arXiv

From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models

Nov 25, 2023
Via arXiv

MacGyver: Are Large Language Models Creative Problem Solvers?

Nov 16, 2023
Via arXiv

Structured Chemistry Reasoning with Large Language Models

Nov 16, 2023
Via arXiv

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

May 24, 2023
Via arXiv