|
[1]
|
Bewley, A., Ge, Z., Ott, L., et al. (2016) Simple Online and Real-Time Tracking. 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, 25-28 September 2016, 3464-3468.
|
|
[2]
|
Wojke, N., Bewley, A. and Paulus, D. (2017) Simple Online and Realtime Tracking with a Deep Association Metric. 2017 IEEE International Conference on Image Processing (ICIP), Beijing, 17-20 September 2017, 3645-3649.
|
|
[3]
|
Zhang, Y., Sun, P., Jiang, Y., et al. (2022) ByteTrack: Multi-Object Tracking by Associating Every Detection Box. European Conference on Computer Vision, Tel Aviv, 23-27 October 2022, 1-21.
|
|
[4]
|
Sun, P., Cao, J., Jiang, Y., et al. (2020) TransTrack: Multiple Object Tracking with Transformer.
|
|
[5]
|
Meinhardt, T., Kirillov, A., Leal-Taixe, L., et al. (2022) TrackFormer: Multi-Object Tracking with Transformers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, 18-24 June 2022, 8844-8854.
|
|
[6]
|
Vaquero, L., Brea, V.M. and Mucientes, M. (2023) Real-Time Siamese Multiple Object Tracker with Enhanced Proposals. Pattern Recognition, 135, Article ID: 109141.
|
|
[7]
|
Cai, J., Xu, M., Li, W., et al. (2022) MeMOT: Multi-Object Tracking with Memory. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, 18-24 June 2022, 8090-8100.
|
|
[8]
|
Bertinetto, L., Valmadre, J., Henriques, J.F., et al. (2016) Fully-Convolutional Siamese Networks for Object Tracking. Computer Vision ECCV 2016 Workshops, Amsterdam, 8-10 and 15-16 October 2016.
|
|
[9]
|
Bhat, G., Danelljan, M., Gool, L.V., et al. (2019) Learning Discriminative Model Prediction for Tracking. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, 27 October-2 November 2019, 6182-6191.
|
|
[10]
|
Yan, B., Peng, H., Fu, J., et al. (2021) Learning Spatio-Temporal Transformer for Visual Tracking. Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, 10-17 October 2021, 10448-10457.
|
|
[11]
|
Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
|
|
[12]
|
Chen, B., Li, P., Bai, L., et al. (2022) Backbone Is All Your Need: A Simplified Architecture for Visual Object Tracking. European Conference on Computer Vision, Tel Aviv, 23-27 October 2022, 375-392.
|
|
[13]
|
Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. (2020) An Image Is Worth 16×16 Words: Transformers for Image Recognition at Scale.
|
|
[14]
|
Xu, H., Zhang, J., Cai, J., et al. (2022) GMFlow: Learning Optical Flow via Global Matching. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, 18-24 June 2022, 8121-8130.
|
|
[15]
|
Redmon, J., Divvala, S., Girshick, R., et al. (2016) You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 779-788.
|
|
[16]
|
Carion, N., Massa, F., Synnaeve, G., et al. (2020) End-to-End Object Detection with Transformers. European Conference on Computer Vision, Glasgow, 23-28 August 2020, 213-229.
|
|
[17]
|
Liu, W., Anguelov, D., Erhan, D., et al. (2016) SSD: Single Shot Multibox Detector. Computer Vision – ECCV 2016: 14th European Conference, Amsterdam, 11-14 October 2016, 21-37.
|
|
[18]
|
Lin, T.Y., Goyal, P., Girshick, R., et al. (2017) Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2980-2988.
|
|
[19]
|
Girshick, R., Donahue, J., Darrell, T., et al. (2014) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 580-587.
|
|
[20]
|
Girshick, R. (2015) Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision, Santiago, 7-13 December 2015, 1440-1448.
|