
Christopher Pal

AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery

Apr 10, 2025
Via arXiv

Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments

Mar 28, 2025
Via arXiv

StarFlow: Generating Structured Workflow Outputs From Sketch Images

Mar 27, 2025
Via arXiv

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Mar 19, 2025
Via arXiv

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Feb 03, 2025
Via arXiv

LLMs for Literature Review: Are we there yet?

Dec 15, 2024
Via arXiv

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Dec 05, 2024
Via arXiv

ParetoFlow: Guided Flows in Multi-Objective Optimization

Dec 04, 2024
Via arXiv

IntentGPT: Few-shot Intent Discovery with Large Language Models

Nov 16, 2024
Via arXiv

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Nov 04, 2024
Via arXiv