Education, Science, Technology, Innovation and Life
Open Access
Sign In

A Multimodal Diffusion-based Interior Design AI with ControlNet

Download as PDF

DOI: 10.23977/jaip.2024.070124 | Downloads: 13 | Views: 86

Author(s)

Ziyi Qin 1

Affiliation(s)

1 Wuxi Jiangsu Tianyi High School, Wuxi, China

Corresponding Author

Ziyi Qin

ABSTRACT

This study introduces an AI-driven method for generating interior design visualizations using ControlNet, which effectively interprets blueprints and sketches to produce high-quality visualizations. By focusing on improving PSNR and SSIM metrics, the approach ensures structural accuracy and aesthetic appeal, demonstrating ControlNet's capability to capture detailed design information. The results showcase the model's potential to assist designers in creating refined and visually coherent interior design concepts, highlighting the benefits of AI integration in the design process.

KEYWORDS

ControlNet, Diffusion Model, Interior Design

CITE THIS PAPER

Ziyi Qin, A Multimodal Diffusion-based Interior Design AI with ControlNet. Journal of Artificial Intelligence Practice (2024) Vol. 7: 162-165. DOI: http://dx.doi.org/10.23977/jaip.2024.070124.

REFERENCES

[1] Huang, H., Amor, B.B., Lin, X., Zhu, F., Fang, Y. G-VAE, a Geometric Convolutional VAE for ProteinStructure Generation, arXiv.org, https://arxiv.org/abs/2106.11920, 2021.
[2] Salehin, I., Islam, M.S., Saha, P., Noman, S.M., Tuni, A., Hasan, Md.M, Baten, M.A., AutoML: A Systematic Review on Automated Machine Learning with Neural Architecture Search, Journal of Information and Intelligence, 2023.
[3] Wang, Z., et al., Toward Learning Joint Inference Tasks for IASS-MTS Using Dual Attention Memory with Stochastic Generative Imputation. IEEE Transactions on Neural Networks and Learning Systems, 2023. 
[4] Yin, W., Yin, H., Kragic, D., Björkman, M., Graph-based normalizing flow for human motion generation and reconstruction, arXiv.org, https://arxiv.org/abs/2104.03020, 2021.
[5] Lvmin Zhang and Anyi Rao and Maneesh Agrawala. Adding Conditional Control to Text-to-Image Diffusion Models. International Conference on Computer Vision (ICCV), 2023.
[6] Dang, Q., et al., Towards an automatic interior design system using GAN. 170-172.10.1145/3478905.3478939. 2021.
[7] Karras, T., Laine, S., Aila, T., A Style-Based generator architecture for generative adversarial networks, arXiv.org, https://arxiv.org/abs/1812.04948, 2018.
[8] R. Zhou, H. Zhou, H. Gao, M. Tomizuka, J. Li, and Z. Xu, “Grouptron: Dynamic multi-scale graph convolutional networks for group-aware dense crowd trajectory forecasting,” in 2022 International Conference on Robotics and Automation (ICRA), 2022, pp. 805–811. 
[9] Liu, H., Simonyan, K., Yang, Y., DARTS: Differentiable Architecture Search, arXiv.org, https://arxiv.org/ abs/ 1806. 09055, 2018.
[10] Zoph, B., Le, Q., V., Neural Architecture Search with Reinforcement Learning, arXiv.org, https://arxiv.org/ abs/1611.01578, 2016.
[11] Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., Wierstra, D., Continuous control with deep reinforcement learning, arXiv.org, https://arxiv.org/abs/1509.02971, 2015.
[12] Stampa, G., Arias, M., Sanchez-Charles, D., Muntes-Mulero, V., Cabellos, A., A Deep-Reinforcement learning approach for Software-Defined Networking routing optimization, arXiv.org, https://arxiv.org/abs/1709.07080, 2017.
[13] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017, June 12). Attention is all you need. arXiv.org. https://arxiv.org/abs/1706.03762

Downloads: 6041
Visits: 182241

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.