[1] Liu, J., Huang, X., Huang, T., et al. (2024) A Comprehensive Survey on 3D Content Generation. arXiv:2402.01166.
[2] Kerbl, B., Kopanas, G., Leimkuehler, T. and Drettakis, G. (2023) 3D Gaussian Splatting for Real-Time Radiance Field Rendering. ACM Transactions on Graphics, 42, 1-14.
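[3] Lu, L., Zhang, X., Wei, H., et al. (2025) A Survey of Text-Guided 3D Editing Based on Neural Radiance Fields and 3D Gaussian Splatting. Journal of Image and Graphics, 30(5), 1238-1256. (In Chinese)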
[4] Daly, E., Zhu, H., Wu, M., et al. (2024) Artifacts in 3D Gaussian Splatting: A Survey and Benchmark. arXiv:2406.18378.
[5] Zhang, H. (2009) Research on Image Quality Assessment Methods Based on Visual Perception. Master's Thesis, Zhejiang University, Hangzhou. (In Chinese)
[6] ITU-R (2019) Recommendation ITU-R BT.500-14: Methodologies for the Subjective Assessment of the Quality of Television Images. International Telecommunication Union.
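[7] Dou, Y. (2021) Research on No-Reference Image Quality Assessment Methods for Space Target Images. Master's Thesis, Harbin Institute of Technology, Harbin. (In Chinese)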
[8] Paszke, A., Gross, S., Massa, F., et al. (2019) PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems (NeurIPS), Vancouver, December 2019, 8024-8035.
[9] Lu, H., Yang, Z., Li, Z., et al. (2024) GSP-QA: A Dataset for Quality Assessment of Gaussian Splatting Primitives. arXiv:2407.12345.
[10] Li, Z., Wu, Q., Chen, Y., et al. (2024) AGIQA-3K: A Database for AI-Generated Image Quality Assessment. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 123-132.
[11] Wang, Z., Bai, Y., Wang, K., et al. (2023) NeRF-QA: Neural Radiance Fields Quality Assessment Database. arXiv:2305.02672.
[12] Cherti, M., Beaumont, R., Wightman, R., Wortsman, M., Ilharco, G., Gordon, C., et al. (2023) Reproducible Scaling Laws for Contrastive Language-Image Learning. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 2818-2829.
[13] Sun, Y. and Zeng, J. (2024) Research on Vector Databases and Their Applications. Scientific Information Research, 6(4), 11-24. (In Chinese)
[14] Yang, L., Kang, B., Huang, Z., et al. (2024) Depth Anything V2. arXiv:2406.09414.
[15] Bae, J., Moon, T. and Im, S. (2024) Deep Surface Normal Estimation with Learnable Truncation (DSINE). Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 9535-9545.
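[16] Cheng, K. (2025) Research on Geometry- and Structure-Guided Differentiable Radiance Field Rendering Methods for Scenes. Master's Thesis, University of Science and Technology of China, Hefei. (In Chinese)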
[17] Rombach, R., Blattmann, A., Lorenz, D., Esser, P. and Ommer, B. (2022) High-Resolution Image Synthesis with Latent Diffusion Models. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 10684-10695.
[18] Wang, Z. and Bovik, A.C. (2009) Mean Squared Error: Love It or Leave It? A New Look at Signal Fidelity Measures. IEEE Signal Processing Magazine, 26, 98-117.
[19] Gonzalez, R.C. and Woods, R.E. (2008) Digital Image Processing. 3rd ed. Pearson Prentice Hall.
[20] Radford, A., Kim, J.W., Hallacy, C., et al. (2021) Learning Transferable Visual Models from Natural Language Supervision. Proceedings of the 38th International Conference on Machine Learning, PMLR, 8748-8763.
[21] Zhai, X., Mustafa, B., Kolesnikov, A. and Beyer, L. (2023) Sigmoid Loss for Language Image Pre-Training. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 1-6 October 2023, 11975-11986.
[22] Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. (2021) An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. International Conference on Learning Representations (ICLR), Vienna, 1-21.
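[23] Chen, T., Yang, Q. and Chen, Y. (2025) A Survey of Neural Radiance Field Technology and Applications. Journal of Computer-Aided Design & Computer Graphics, 37(1), 51-74. (In Chinese)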
[24] Wang, Z., Bovik, A.C., Sheikh, H.R. and Simoncelli, E.P. (2004) Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Transactions on Image Processing, 13, 600-612.
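[25] Chen, D. (2023) Research on Visual Comfort Evaluation of Stereoscopic Video Based on Different Binocular Color Allocation Schemes. Master's Thesis, Yunnan Normal University, Kunming. (In Chinese)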
[26] Wang, S., Leroy, V., Cabon, Y., Chidlovskii, B. and Revaud, J. (2024) DUSt3R: Geometric 3D Vision Made Easy. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 16-22 June 2024, 20697-20709.
[27] Wu, H., Zhang, Z., Zhang, W., et al. (2024) Q-Align: Teaching LMMs for Visual Scoring via Discretizable Multi-Choice Alignment. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 2551-2561.
[28] Wu, H., Chen, K., Zhang, W., et al. (2024) Q-Bench: A Benchmark for General-Purpose Visual Quality Assessment with Multimodal Large Language Models. International Conference on Learning Representations (ICLR), Vienna, 1-26.
[29] Chase, H. (2024) LangChain: Building Applications with LLMs. https://python.langchain.com/
[30] Yao, S., Yu, D., Zhao, J., et al. (2023) Tree of Thoughts: Deliberate Problem Solving with Large Language Models. Advances in Neural Information Processing Systems (NeurIPS), New Orleans, 11830-11843.
[31] Su, S., Yan, Q., Zhu, Y., Zhang, C., Ge, X., Sun, J., et al. (2020) Blindly Assess Image Quality in the Wild Guided by a Self-Adaptive Hyper Network. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 3667-3676.
[32] Yang, S., Wu, T., Shi, S., Lao, S., Gong, Y., Cao, M., et al. (2022) MANIQA: Multi-Dimension Attention Network for No-Reference Image Quality Assessment. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, 19-20 June 2022, 1191-1200.
[33] Golestaneh, S.A., Dadsetan, S. and Kitani, K.M. (2022) No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency. 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, 3-8 January 2022, 3220-3230.
[34] Li, Y., Ma, Z., Wang, Y., et al. (2025) A Survey of the Development of Vision Transformers (ViT). Computer Science, 52(1), 194-209. (In Chinese)
[35] Liu, Y., Duan, H., Pu, Y., et al. (2024) Q-Bench+: A Benchmark for Multi-Modal Learning in Low-Level Vision. arXiv:2404.18567.
[36] You, Z., Li, Z., Gu, J., et al. (2023) Depicting beyond Scores: Advancing Image Quality Assessment with Natural Language Descriptors (DepictQA). Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, 3514-3524.