Picture for Ziyi Yang

Ziyi Yang

Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

Add code
Jan 19, 2025
Viaarxiv icon

Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model

Add code
Jan 06, 2025
Viaarxiv icon

Frequency-aware Event Cloud Network

Add code
Dec 30, 2024
Figure 1 for Frequency-aware Event Cloud Network
Figure 2 for Frequency-aware Event Cloud Network
Figure 3 for Frequency-aware Event Cloud Network
Figure 4 for Frequency-aware Event Cloud Network
Viaarxiv icon

Deformable Radial Kernel Splatting

Add code
Dec 16, 2024
Figure 1 for Deformable Radial Kernel Splatting
Figure 2 for Deformable Radial Kernel Splatting
Figure 3 for Deformable Radial Kernel Splatting
Figure 4 for Deformable Radial Kernel Splatting
Viaarxiv icon

Weighted-Reward Preference Optimization for Implicit Model Fusion

Add code
Dec 04, 2024
Figure 1 for Weighted-Reward Preference Optimization for Implicit Model Fusion
Figure 2 for Weighted-Reward Preference Optimization for Implicit Model Fusion
Figure 3 for Weighted-Reward Preference Optimization for Implicit Model Fusion
Figure 4 for Weighted-Reward Preference Optimization for Implicit Model Fusion
Viaarxiv icon

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Add code
Nov 26, 2024
Figure 1 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 2 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 3 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 4 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Viaarxiv icon

OASIS: Open Agents Social Interaction Simulations on One Million Agents

Add code
Nov 21, 2024
Figure 1 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 2 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 3 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 4 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Viaarxiv icon

Towards Realistic Example-based Modeling via 3D Gaussian Stitching

Add code
Aug 28, 2024
Viaarxiv icon

See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses

Add code
Aug 16, 2024
Figure 1 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 2 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 3 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Figure 4 for See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Viaarxiv icon

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Add code
Jul 18, 2024
Figure 1 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 2 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 3 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 4 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Viaarxiv icon