|
[1]
|
刘波. 视频摘要研究综述[J]. 南京信息工程大学, 2020, 12(3): 274-278.
|
|
[2]
|
Amiri, A. and Fathy, M. (2010) Hier-archical Keyframe-Based Video Summarization Using QR-Decomposition and Modified-Means Clustering. EURASIP Journal on Advances in Signal Processing, 2010, Article ID: 892124. [Google Scholar] [CrossRef]
|
|
[3]
|
Guimaraes, S.J.F. and Gomes, W.A. (2010) Static Video Summarization Method Based on Hierarchical Clustering. In: Ibero-American Congress Conference on Progress in Pattern Recognition, Springer-Verlag, Berlin, 46-54. [Google Scholar] [CrossRef]
|
|
[4]
|
Frey, B.J. and Dueck, D. (2007) Clustering by Passing Mes-sages between Data Points. Science, 315, 972-976. [Google Scholar] [CrossRef] [PubMed]
|
|
[5]
|
de Avila, S.E.F. and Lopes, A.P.B. (2011) VSUMM: A Mechanism Designed to Produce Static Video Summaries and a Novel Evaluation Method. Pattern Recognition Letters, 32, 56-68. [Google Scholar] [CrossRef]
|
|
[6]
|
Mundur, P., Rao, Y. and Yesha, Y. (2006) Keyframe-Based Video Summarization Using Delaunay Clustering. International Journal on Digital Libraries, 6, 219-232. [Google Scholar] [CrossRef]
|
|
[7]
|
Khosla, A., Hamid, R., Lin, C.J., et al. (2013) Large-Scale Video Summarization Using Web-Image Priors. 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, 23-28 June 2013, 2698-2705. [Google Scholar] [CrossRef]
|
|
[8]
|
Panda, R. (2017) Weakly Supervised Summarization of Web Videos. 2017 IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 3677-3686. [Google Scholar] [CrossRef]
|
|
[9]
|
Potapov, D., Douze, M., Harchaoui, Z., et al. (2014) Catego-ry-Specific Video Summarization. European Conference on Computer Vision, Zurich, 6-12 September 2014, 540-555. [Google Scholar] [CrossRef]
|
|
[10]
|
Zhang, K., Chao, W.L., Sha, F., et al. (2016) Summary Trans-fer: Exemplar-Based Subset Selection for Video Summarization. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 1059-1067. [Google Scholar] [CrossRef]
|
|
[11]
|
Gygli, M., Song, Y. and Cao, L. (2016) Video2GIF: Automatic Generation of Animated Gifs from Video. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 1001-1009. [Google Scholar] [CrossRef]
|
|
[12]
|
Sun, M., Farhadi, A. and Seitz, S. (2014) Ranking Domain-Specific Highlights by Analyzing Edited Videos. In: European Conference on Computer Vision, Springer, Berlin, 787-802. [Google Scholar] [CrossRef]
|
|
[13]
|
Zhao, B., Li, X.L. and Lu, X.Q. (2017) Hierarchical Recurrent Neural Network for Video Summarization. In: The 2017 ACM on Multimedia Conference, ACM, New York, 863-871. [Google Scholar] [CrossRef]
|
|
[14]
|
Zhang, K., Chao, W.L., Sha, F., et al. (2016) Video Summarization with Long Short-Term Memory. In: European Conference on Computer Vision, Springer, Berlin, 766-782. [Google Scholar] [CrossRef]
|
|
[15]
|
冀中, 江俊杰. 基于解码器注意力机制的视频摘要[J]. 天津大学学报(自然科学与工程技术版), 2018, 51(10): 31-38.
|
|
[16]
|
Mahasseni, B., Lam, M. and Todorovic, S. (2017) Unsupervised Video Summarization with Adversarial LSTM Networks. IEEE Conference on Computer Vision and Pat-tern Recognition, Honolulu, 21-26 July 2017, 2982-2991. [Google Scholar] [CrossRef]
|
|
[17]
|
Yang, H., Wang, B.Y., Lin, S., et al. (2015) Unsupervised Extraction of Video Highlights via Robust Recurrent Auto-Encoders. Proceedings of the IEEE International Conference on Com-puter Vision, Santiago, 7-13 December 2015, 4633-4641. [Google Scholar] [CrossRef]
|
|
[18]
|
Sutskever, I., Vinyals, O. and Le, Q.V. (2014) Sequence to Sequence Learning with Neural Networks. Advances in Neural Infor-mation Processing Systems, 32, 3452-3462.
|
|
[19]
|
Szegedy, C., Liu, W., Jia, Y., et al. (2014) Going Deeper with Con-volutions. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, 23-28 June 2014, 1-9.
https://ieeexplore.ieee.org/document/7298594 [Google Scholar] [CrossRef]
|
|
[20]
|
Cho, K., Merrieenboer, B., Gulcehre, C., et al. (2014) Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation. Conference on Em-pirical Methods in Natural Language Processing (EMNLP 2014), Doha, 25-29 October 2014, 1724-1734. [Google Scholar] [CrossRef]
|
|
[21]
|
Bahdanau, D., Cho, K. and Bengio, Y. (2015) Neural Machine Transla-tion by Jointly Learning to Align and Translate. International Conference on Learning Representation, San Diego, 7-9 May 2015, 1334-1349.
|
|
[22]
|
Song, Y., Vallmitjana, J., Stent, A., et al. (2015) TVSum: Summarizing Web Videos Using Titles. 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, 7-12 June 2015, 5179-5187.
|
|
[23]
|
Gygli, M., Grabner, H., Riemenschneider, H., et al. (2014) Creating Summaries from User Videos. In: European Conference on Computer Vision, Springer, Cham, 505-520. [Google Scholar] [CrossRef]
|