|
[1]
|
Sigtia, S., Benetos, E. and Dixon, S. (2015) An End-to-End Neural Network for Polyphonic Piano Music Transcription. IEEE/ACM Transactions on Audio Speech & Language Processing, 24, 927-939. [Google Scholar] [CrossRef]
|
|
[2]
|
Kelz, R., Dorfer, M., Korzeniowski, F., et al. (2016) On the Potential of Simple Framewise Approaches to Piano Transcription. Proceedings of the 17th International Society for Music Information Retrieval Conference, New York City, 475-481.
|
|
[3]
|
Su, L. (2017) Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription. 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Kuala Lumpur, 12-15 December 2017, 884-891. [Google Scholar] [CrossRef]
|
|
[4]
|
Su, L. (2018) Vocal Melody Extraction Using Patch-Based CNN. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, 15-20 April 2018, 371-375. [Google Scholar] [CrossRef]
|
|
[5]
|
Hawthorne, C., Stasyuk, A., Roberts, A., et al. (2018) Enabling Factorized Piano Music Modeling and Generation with the Maestro Dataset.
|
|
[6]
|
Jansson, A., Humphrey, E., Montecchio, N., et al. (2017) Singing Voice Separation with Deep U-Net Convolutional Networks. Proceedings of the 18th ISMIR Conference, Suzhou, 23-27 October 2017, 745-751.
|
|
[7]
|
Chen, L.C., Papandreou, G., Kokkinos, I., et al. (2018) DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Transactions on Pattern Analysis & Machine Intelligence, 40, 834-848. [Google Scholar] [CrossRef]
|
|
[8]
|
He, K., Gkioxari, G., Dollar, P. and Girshick, R. (2017) Mask R-CNN. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef]
|