Picture for Soujanya Poria

Soujanya Poria

PROEMO: Prompt-Driven Text-to-Speech Synthesis Based on Emotion and Intensity Control

Add code
Jan 10, 2025
Viaarxiv icon

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Add code
Dec 30, 2024
Viaarxiv icon

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Add code
Dec 24, 2024
Viaarxiv icon

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Add code
Dec 17, 2024
Viaarxiv icon

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Add code
Nov 09, 2024
Viaarxiv icon

Two are better than one: Context window extension with multi-grained self-injection

Add code
Oct 25, 2024
Figure 1 for Two are better than one: Context window extension with multi-grained self-injection
Figure 2 for Two are better than one: Context window extension with multi-grained self-injection
Figure 3 for Two are better than one: Context window extension with multi-grained self-injection
Figure 4 for Two are better than one: Context window extension with multi-grained self-injection
Viaarxiv icon

Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning

Add code
Oct 16, 2024
Figure 1 for Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Figure 2 for Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Figure 3 for Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Figure 4 for Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Viaarxiv icon

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Add code
Oct 09, 2024
Viaarxiv icon

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Add code
Oct 07, 2024
Viaarxiv icon

Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models

Add code
Sep 22, 2024
Viaarxiv icon