|
[1]
|
张丽英, 张永兴, 席云, 等. 基于人工智能的农机自动驾驶系统设计与优化[J]. 中国农机装备, 2025(12): 7-9.
|
|
[2]
|
韩永刚. 基于人工智能算法的图像识别技术分析[J]. 通讯世界, 2025, 32(11): 152-154.
|
|
[3]
|
雷郑波, 涂凯, 张永乐, 等. 一类分布鲁棒指数追踪模型及算法[J/OL]. 运筹学学报(中英文), 1-21[2025-12-28].
|
|
[4]
|
刘平献, 张明明, 王鹏, 等. 基于大模型的便民热线工单智能知识推荐系统的算法优化与性能评估[J]. 数字技术与应用, 2025, 43(3): 16-18.
|
|
[5]
|
孟彬, 杨帆. 基于深度强化学习的数据中心资源调度算法研究[J]. 软件, 2025, 46(11): 1-3.
|
|
[6]
|
阮春珠, 林旭怡, 张燕. 人工智能辅助学习系统技术架构优化与标准化性能评估[J]. 大众标准化, 2025(22): 164-166.
|
|
[7]
|
Grill, J.-B., et al. (2020) Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning. Proceedings of the 34th International Conference on Neural Information Processing Systems, Vancouver, 6-12 December 2020, 21271-21284.
|
|
[8]
|
Chen, X. and He, K. (2021) Exploring Simple Siamese Representation Learning. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 15745-15753. [Google Scholar] [CrossRef]
|
|
[9]
|
Caron, M., Touvron, H., Misra, I., Jegou, H., Mairal, J., Bojanowski, P., et al. (2021) Emerging Properties in Self-Supervised Vision Transformers. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 9630-9640. [Google Scholar] [CrossRef]
|
|
[10]
|
Li, J., et al. (2023) Uniform Masking: Enabling MAE Pre-Training for Pyramid-Based Vision Transformers. IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, 1-6 October 2023, 1190-1199.
|
|
[11]
|
Chen, X., Ding, M., Wang, X., et al. (2024). Context Autoencoder for Self-Supervised Representation Learning. International Journal of Computer Vision, 132, 208-223.[CrossRef]
|
|
[12]
|
Li, J., et al. (2022) BLIP: Bootstrapping Language-Image Pre-Training for Unified Vision-Language Understanding and Generation. Proceedings of the 39th International Conference on Machine Learning, Baltimore, 17-23 July 2022, 12888-12900.
|
|
[13]
|
Wang, P., et al. (2022) OFA: Unifying Architectures, Tasks, and Modalities via a Simple Sequence-to-Sequence Framework. Proceedings of the 39th International Conference on Machine Learning, Baltimore, 17-23 July 2022, 23318-23340.
|
|
[14]
|
Alayrac, J.-B., et al. (2022) Flamingo: A Visual Language Model for Few-Shot Learning. NeurIPS 2022, New Orleans, 28 November-9 December 2022, 23716-23736.
|
|
[15]
|
Driess, D., et al. (2023) PaLM-E: An Embodied Multimodal Language Model. International Conference on Machine Learning, ICML 2023, Honolulu, 23-29 July 2023, 8469-8488.
|
|
[16]
|
Zhu, D., et al. (2023) MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models.
|
|
[17]
|
Luo, Z., et al. (2024) Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Models. IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, 16-22 June 2024, 42444-42457.
|
|
[18]
|
Team Gemini (2025) Gemini 1. 5: Unlocking Multimodal Understanding Across Millions of Tokens of Context.
|
|
[19]
|
OpenAI (2023) GPT-4V(ision) System Card & Benchmark Results.
|
|
[20]
|
Liu, H., et al. (2023) Visual Instruction Tuning (LLaVA). Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, 10-16 December 2023, 34892-34916.
|