[1] Ling, H., Kreis, K., Li, D., et al. (2021) EditGAN: High-Precision Semantic Image Editing. Advances in Neural Information Processing Systems, 34, 16331-16345.
[2] Shi, Y., Yang, X., Wan, Y. and Shen, X. (2022) SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 11244-11254.
[3] Alaluf, Y., Patashnik, O., Wu, Z., et al. (2023) Third Time’s the Charm? Image and Video Editing with StyleGAN3. In: Karlinsky, L., Michaeli, T. and Nishino, K., Eds., Computer Vision – ECCV 2022 Workshops, Springer, 204-220.
[4] Yang, B., Gu, S., Zhang, B., Zhang, T., Chen, X., Sun, X., et al. (2023) Paint by Example: Exemplar-Based Image Editing with Diffusion Models. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 18381-18391.
[5] Nichol, A., Dhariwal, P., Ramesh, A., et al. (2021) GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models. arXiv: 2112.10741.
[6] Kawar, B., Zada, S., Lang, O., Tov, O., Chang, H., Dekel, T., et al. (2023) Imagic: Text-Based Real Image Editing with Diffusion Models. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 6007-6017.
[7] Radford, A., Kim, J.W., Hallacy, C., et al. (2021) Learning Transferable Visual Models from Natural Language Supervision. arXiv: 2103.00020.
[8] Schuhmann, C., Beaumont, R., Vencu, R., et al. (2022) LAION-5B: An Open Large-Scale Dataset for Training Next Generation Image-Text Models. Advances in Neural Information Processing Systems, 35, 25278-25294.
[9] Lin, T.Y., Maire, M., Belongie, S., et al. (2014) Microsoft COCO: Common Objects in Context. In: Fleet, D., Pajdla, T., Schiele, B. and Tuytelaars, T., Eds., Computer Vision – ECCV 2014, Springer, 740-755.
[10] Nam, S., Kim, Y. and Kim, S.J. (2018) Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language. arXiv: 1810.11919.
[11] Tao, M., Tang, H., Wu, F., Jing, X., Bao, B. and Xu, C. (2022) DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 16494-16504.
[12] Karras, T., Laine, S. and Aila, T. (2019) A Style-Based Generator Architecture for Generative Adversarial Networks. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 15-20 June 2019, 4396-4405.
[13] Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J. and Aila, T. (2020) Analyzing and Improving the Image Quality of StyleGAN. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 8107-8116.
[14] Patashnik, O., Wu, Z., Shechtman, E., Cohen-Or, D. and Lischinski, D. (2021) StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, 10-17 October 2021, 2065-2074.
[15] Ho, J., Jain, A. and Abbeel, P. (2020) Denoising Diffusion Probabilistic Models. Advances in Neural Information Processing Systems, 33, 6840-6851.
[16] Nichol, A.Q. and Dhariwal, P. (2021) Improved Denoising Diffusion Probabilistic Models. arXiv: 2102.09672.
[17] Lugmayr, A., Danelljan, M., Romero, A., Yu, F., Timofte, R. and Van Gool, L. (2022) RePaint: Inpainting Using Denoising Diffusion Probabilistic Models. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 11451-11461.
[18] Dhariwal, P. and Nichol, A. (2021) Diffusion Models Beat GANs on Image Synthesis. Advances in Neural Information Processing Systems, 34, 8780-8794.
[19] Couairon, G., Verbeek, J., Schwenk, H., et al. (2022) DiffEdit: Diffusion-Based Semantic Image Editing with Mask Guidance. arXiv: 2210.11427.
[20] Mao, W., Han, B. and Wang, Z. (2023) SketchFFusion: Sketch-Guided Image Editing with Diffusion Model. 2023 IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, 8-11 October 2023, 790-794.
[21] Meng, C., Song, Y., Song, J., et al. (2021) SDEdit: Image Synthesis and Editing with Stochastic Differential Equations. arXiv: 2108.01073.
[22] Gal, R., Alaluf, Y., Atzmon, Y., et al. (2022) An Image Is Worth One Word: Personalizing Text-to-Image Generation Using Textual Inversion. arXiv: 2208.01618.
[23] Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M. and Aberman, K. (2023) DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 22500-22510.
[24] Liu, X., Park, D.H., Azadi, S., et al. (2021) More Control for Free! Image Synthesis with Semantic Diffusion Guidance. arXiv: 2112.05744.
[25] Kim, G., Kwon, T. and Ye, J.C. (2022) DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 2416-2425.
[26] Hertz, A., Mokady, R., Tenenbaum, J., et al. (2022) Prompt-to-Prompt Image Editing with Cross Attention Control. arXiv: 2208.01626.
[27] Zhang, Z., Han, L., Ghosh, A., Metaxas, D. and Ren, J. (2023) SINE: Single Image Editing with Text-to-Image Diffusion Models. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 17-24 June 2023, 6027-6037.
[28] Li, J., Li, D., Xiong, C., et al. (2022) BLIP: Bootstrapping Language-Image Pre-Training for Unified Vision-Language Understanding and Generation. International Conference on Machine Learning, PMLR, 12888-12900.
[29] Rombach, R., Blattmann, A., Lorenz, D., Esser, P. and Ommer, B. (2022) High-Resolution Image Synthesis with Latent Diffusion Models. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 10674-10685.
[30] Wang, Z., Simoncelli, E.P. and Bovik, A.C. (2003) Multiscale Structural Similarity for Image Quality Assessment. The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, Pacific Grove, 9-12 November 2003, 1398-1402.
[31] Zhang, R., Isola, P., Efros, A.A., Shechtman, E. and Wang, O. (2018) The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 586-595.