|
[1]
|
Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C.L., et al. (2015) VQA: Visual Question Answering. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, 7-13 December 2015, 2425-2433. [Google Scholar] [CrossRef]
|
|
[2]
|
Lu, J., Yang, J., Batra, D., et al. (2016) Hierarchical Question-Image Co-Attention for Visual Question Answering. Advances in Neural Information Processing Systems, 29.
|
|
[3]
|
Chen, Z., Chen, J., Geng, Y., Pan, J.Z., Yuan, Z. and Chen, H. (2021) Zero-Shot Visual Question Answering Using Knowledge Graph. In: Hotho, A., et al., Eds., Lecture Notes in Computer Science, Springer International Publishing, 146-162. [Google Scholar] [CrossRef]
|
|
[4]
|
Zhang, X., Wu, C., Zhao, Z., et al. (2023) PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering. arXiv:2305.10415.
|
|
[5]
|
Abacha, A.B., Shivade, C., Hasan, S.A., et al. (2019) VQA-Med: Overview of the Medical Visual Question Answering Task at Image CLEF 2019. CEUR 2019 Working Notes, Lugano, 9-12 September 2019, 9-12.
|
|
[6]
|
Jin, D., Pan, E., Oufattole, N., Weng, W., Fang, H. and Szolovits, P. (2021) What Disease Does This Patient Have? A Large-Scale Open Domain Question Answering Dataset from Medical Exams. Applied Sciences, 11, Article 6421. [Google Scholar] [CrossRef]
|
|
[7]
|
Dao, S.D., Zhao, E., Phung, D., et al. (2021) Multi-Label Image Classification with Contrastive Learning. arXiv:2107.11626.
|
|
[8]
|
Sahoo, S. and Maiti, J. (2025) Variance-Adjusted Cosine Distance as Similarity Metric. arXiv:2502.02233.
|
|
[9]
|
Xian, Y., Lampert, C.H., Schiele, B. and Akata, Z. (2019) Zero-Shot Learning—A Comprehensive Evaluation of the Good, the Bad and the Ugly. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41, 2251-2265. [Google Scholar] [CrossRef] [PubMed]
|
|
[10]
|
Liu, H. and Singh, P. (2004) ConceptNet—A Practical Commonsense Reasoning Tool-Kit. BT Technology Journal, 22, 211-226. [Google Scholar] [CrossRef]
|
|
[11]
|
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., et al. (2015) Dbpedia—A Large-Scale, Multilingual Knowledge Base Extracted from Wikipedia. Semantic Web, 6, 167-195. [Google Scholar] [CrossRef]
|
|
[12]
|
Yang, Z., He, X., Gao, J., Deng, L. and Smola, A. (2016) Stacked Attention Networks for Image Question Answering. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 27-30 June 2016, 21-29. [Google Scholar] [CrossRef]
|
|
[13]
|
Kim, J.H., Jun, J. and Zhang, B.T. (2018) Bilinear Attention Networks. arXiv:1805.07932.
|
|
[14]
|
Snell, J., Swersky, K. and Zemel, R.S. (2017) Prototypical Networks for Few-Shot Learning. Advances in Neural Information Processing Systems, 30.
|
|
[15]
|
Zhu, L., She, Q., Chen, Q., Meng, X., Geng, M., Jin, L., et al. (2023) Background-Aware Classification Activation Map for Weakly Supervised Object Localization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 14175-14191. [Google Scholar] [CrossRef] [PubMed]
|