Picture for Yi Lu

Yi Lu

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Add code
Dec 12, 2024
Viaarxiv icon

VisualLens: Personalization through Visual History

Add code
Nov 25, 2024
Figure 1 for VisualLens: Personalization through Visual History
Figure 2 for VisualLens: Personalization through Visual History
Figure 3 for VisualLens: Personalization through Visual History
Figure 4 for VisualLens: Personalization through Visual History
Viaarxiv icon

APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments

Add code
Nov 13, 2024
Figure 1 for APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments
Figure 2 for APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments
Figure 3 for APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments
Figure 4 for APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments
Viaarxiv icon

Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0

Add code
Sep 18, 2024
Figure 1 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 2 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 3 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Figure 4 for Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0
Viaarxiv icon

A Controlled Study on Long Context Extension and Generalization in LLMs

Add code
Sep 18, 2024
Figure 1 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 2 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 3 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 4 for A Controlled Study on Long Context Extension and Generalization in LLMs
Viaarxiv icon

DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech

Add code
Sep 18, 2024
Figure 1 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 2 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 3 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Figure 4 for DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech
Viaarxiv icon

Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express

Add code
Aug 26, 2024
Figure 1 for Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express
Figure 2 for Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express
Figure 3 for Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express
Figure 4 for Smart Multi-Modal Search: Contextual Sparse and Dense Embedding Integration in Adobe Express
Viaarxiv icon

Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?

Add code
Aug 20, 2024
Figure 1 for Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
Figure 2 for Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
Figure 3 for Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
Figure 4 for Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
Viaarxiv icon

A Noval Feature via Color Quantisation for Fake Audio Detection

Add code
Aug 20, 2024
Viaarxiv icon

EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech

Add code
Aug 20, 2024
Figure 1 for EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
Figure 2 for EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
Figure 3 for EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
Figure 4 for EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
Viaarxiv icon