15 4 31

OpenSound PRO

OpenSound

AI & ML interests

🎧 We build open-source models for audio, speech, and music.

Recent Activity

liked a Space 25 days ago

OpenSound/EzAudio-ControlNet

liked a dataset about 1 month ago

LEMAS-Project/LEMAS-Dataset-train

liked a model 3 months ago

facebook/sam-audio-judge

View all activity

Organizations

spaces 6

FlexSED

🎧

an open-vocabulary sound event detection model

112

SoloSpeech

🎯

State-of-the-art target speech extractor

100

CapSpeech TTS

🧢

Stylized TTS – design voice, accent, and emotion your way

EzAudio ControlNet

🟣

Generate audio from text and reference audio

275

EzAudio

🟣

Generate or edit realistic audio from text prompts

SoloAudio

🎯

Separate sounds from audio mixtures using text prompts

models 5

datasets 20

OpenSound/CapSpeech-AgentDB-Audio

Updated Aug 5, 2025 • 9

OpenSound/CapSpeech-AgentDB

Viewer • Updated Jul 28, 2025 • 10.1k • 30

OpenSound/CapSpeech-SEDB-test

Viewer • Updated Jul 28, 2025 • 496 • 5

OpenSound/CapSpeech-SEDB

Viewer • Updated Jul 28, 2025 • 500 • 19

OpenSound/CapTTS-SFT

Viewer • Updated Jul 28, 2025 • 387k • 187 • 2

OpenSound/CapSpeech-PT

Viewer • Updated Jul 28, 2025 • 11.4M • 35 • 2

OpenSound/CapSpeech-PT-SEDB-HQ

Viewer • Updated Jun 4, 2025 • 198k • 9

OpenSound/CapSpeech-PT-SEDB-Audio

Viewer • Updated Jun 4, 2025 • 1.03M • 6

OpenSound/CapSpeech-SEDB-Audio

Viewer • Updated Jun 4, 2025 • 500 • 6

OpenSound/CapSpeech-CommonVoice

Viewer • Updated Jun 4, 2025 • 31.5k • 5 • 1

View 20 datasets