Wijngaard, G., Formisano, E., Esposito, M., & Dumontier, M. (2025). Audio-Language Datasets of Scenes and Events: A Survey. IEEE Access, 13, 20328-20360. https://doi.org/10.1109/ACCESS.2025.3534621
Wijngaard, G., Formisano, E., Giordano, B. L., & Dumontier, M. (2023). ACES: Evaluating Automated Audio Captioning Models on the Semantics of Sounds. In 31st European Signal Processing Conference, EUSIPCO 2023 - Proceedings (pp. 770-774). IEEE. https://doi.org/10.23919/EUSIPCO58844.2023.10289793