Exploiting temporal feature integration for generalized sound recognition