5 112 34

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper 1 day ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

upvoted a paper 3 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

upvoted a paper 9 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 6 days ago • 53

upvoted a paper 3 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published 7 days ago • 35

upvoted a paper 9 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 10 days ago • 134

upvoted a paper 10 days ago

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

Paper • 2605.15824 • Published 20 days ago • 64

upvoted a paper 13 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 16 days ago • 131

upvoted a paper 15 days ago

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Paper • 2605.18678 • Published 17 days ago • 78

upvoted a paper about 1 month ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published Apr 27 • 74

liked a dataset about 1 month ago

nvidia/Nemotron-Personas-Korea

Viewer • Updated Apr 23 • 1M • 31.7k • 483

upvoted 4 papers about 1 month ago

Context Unrolling in Omni Models

Paper • 2604.21921 • Published Apr 23 • 13

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 243

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Paper • 2604.18168 • Published Apr 20 • 96

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 59

liked a dataset about 2 months ago

walledai/AdvBench

Viewer • Updated Jul 4, 2024 • 520 • 11.9k • 101

upvoted 5 papers about 2 months ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

upvoted a paper 2 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

upvoted a paper 3 months ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13

Ha-Yeong Choi

AI & ML interests

Recent Activity

Organizations

Ha0's activity