|
[1]
|
Yu, F. and Koltun, V. (2016) Multi-Scale Context Aggregation by Dilated Convolutions. International Conference on Learning Representations (ICLR), San Juan, 2-4 May 2016, 1-13.
|
|
[2]
|
Peng, C., Zhang, X., Yu, G., Luo, G. and Sun, J. (2017). Large Kernel Matters—Improve Semantic Segmentation by Global Convolutional Network. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 4353-4361.[CrossRef]
|
|
[3]
|
Zhao, H., Shi, J., Qi, X., Wang, X. and Jia, J. (2017). Pyramid Scene Parsing Network. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 2881-2890.[CrossRef]
|
|
[4]
|
Wang, X., Girshick, R., Gupta, A. and He, K. (2018). Non-local Neural Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 7794-7803.[CrossRef]
|
|
[5]
|
Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A. and Zagoruyko, S. (2020) End-to-End Object Detection with Transformers. Computer Vision—ECCV 2020, Springer International Publishing, Cham, 213-229. [Google Scholar] [CrossRef]
|
|
[6]
|
Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., Lu, L., Yuille, A.L. and Zhou, Y. (2021) Transunet: Transformers Make Strong Encoders for Medical Image Segmentation. arXiv preprint arXiv:2102.04306
|