Abstract

Recent advances in AI offer promising opportunities for creative design, particularly through the generation of inspirational images. While prior research has explored the general benefits and limitations of text-to-image tools, there is significant potential to overcome these constraints by investigating agile, multimodal prompting that enables more project-appropriate human-AI interaction. We present the development of a system that supports both text-based and sketch-based image generation, serving as a research artefact for studying creativity support through multimodal Generative AI. The system enables dynamic dialogue interaction and visualizes the respective contributions of the human designer and the AI. This paper focuses on the development of this AI system as a research artefact for future research through design, exploring how multimodal prompting can influence the design process.

Details

Title
Multimodal generative AI for conceptual design: enabling text-based and sketch-based human-AI conversations
Author
Guo, Chenjun¹; Goucher-Lambert, Kosa¹

¹ University of California, Berkeley, USA
Pages
2501-2510
Section
Article
Publication year
2025
Publication date
Aug 2025
Publisher
Cambridge University Press
e-ISSN
2732-527X
Source type
Conference Paper
Language of publication
English
ProQuest document ID
3243761777
Copyright
© The Author(s) 2025. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.