To facilitate comparable research on urban sound source classification, we are also releasing a second version of this dataset, UrbanSound8K, with 8732 excerpts, each limited to 4 seconds (also with source labels) and pre-sorted into 10 stratified folds. In addition to the source ID, both datasets also include a (subjective) salience label for each source occurrence: foreground / background.
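For anyone planning experiments with the predefined folds, here is a minimal sketch of how results could be kept comparable: hold out one fold at a time and train on the remaining nine, rather than reshuffling the excerpts. The record layout (the "file", "fold" and "label" keys), the 1 to 10 fold numbering and the helper name leave_one_fold_out are assumptions made for illustration and are not taken from the dataset documentation.

```python
from collections import defaultdict


def leave_one_fold_out(records, n_folds=10):
    """Yield (test_fold, train, test) splits from predefined fold labels.

    Assumes each record is a dict with an integer "fold" key in 1..n_folds
    (a hypothetical layout, used here only for illustration).
    """
    by_fold = defaultdict(list)
    for rec in records:
        by_fold[rec["fold"]].append(rec)

    for test_fold in range(1, n_folds + 1):
        # The held-out fold is the test set; every other fold is training data.
        test = list(by_fold.get(test_fold, []))
        train = [
            rec
            for fold, items in by_fold.items()
            if fold != test_fold
            for rec in items
        ]
        yield test_fold, train, test


# Toy stand-in for the real metadata: 40 fake excerpts spread over 10 folds.
records = [
    {"file": f"clip_{i}.wav", "fold": i % 10 + 1, "label": i % 3}
    for i in range(40)
]

splits = list(leave_one_fold_out(records))
for test_fold, train, test in splits:
    print(f"fold {test_fold}: {len(train)} train / {len(test)} test")
```

Reporting the average score over all 10 held-out folds, instead of over a custom random split, is what would make numbers from different groups directly comparable.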
The datasets are released for research purposes under a Creative Commons Attribution Noncommercial License, and are available online at the dataset companion website:
This companion website also contains further information about each dataset, including the Urban Sound Taxonomy from which the 10 sound classes in this dataset were selected.
The datasets and taxonomy will be presented at the ACM Multimedia 2014 conference in Orlando in a couple of weeks. For those interested, please see our paper:
J. Salamon, C. Jacoby and J. P. Bello, "A Dataset and Taxonomy for Urban Sound Research", in Proc. 22nd ACM International Conference on Multimedia, Orlando USA, Nov. 2014.
For those attending ISMIR 2014 next week, I will also be there if you would like to discuss the datasets and taxonomy.
I hope you find the datasets useful for your work and look forward to seeing some of you at ISMIR and ACM-MM in the coming weeks!