Categorized into development, validation, and evaluation sets for training and testing machine learning models. 📥 How to Download
The full development set is approximately 6.5 GB .
💡 If you were looking for the 7-Zip software tool instead of a dataset, ensure you only download it from the official site 7-zip.org to avoid malware variants hosted on lookalike domains. Download 736 740 zip
Reference the original paper: Drossos, K., Lipping, S., & Virtanen, T. (2020). "Clotho: an Audio Captioning Dataset." Proc. IEEE ICASSP, pp. 736-740 .
Thousands of sound samples ranging from 15 to 30 seconds. Reference the original paper: Drossos, K
Explain that the goal is "Automated Audio Captioning" (AAC)—predicting a textual description from an audio signal.
The request to "Download 736 740 zip" most likely refers to downloading the , a prominent audio captioning collection often cited in research papers by its specific page range, 736–740 . 🎧 The Clotho Dataset IEEE ICASSP, pp
Clotho is an audio dataset used for intermodal translation (audio-to-text) tasks. It is widely utilized in the (Detection and Classification of Acoustic Scenes and Events) challenges. 📂 Key Data Components