Research on Text-to-Image Generation Method Based on GAN

Yinuo Liu

2024

Abstract

The main task of text-to-image is to generate images that are true and clear according to the text content. As one of the representative methods, generative adversarial network (GAN) occupies an important position in the implementation of text images. According to the different requirements of text image generation, the GAN network generated based on text images is divided into three major functions: improving content authenticity, enhancing semantic correlation, and promoting content diversity. In response to the above needs, this article analyzes the authenticity of the content from the perspective of improving the quality, fine particle size enhancement, contextual enhancement, and dynamic adjustment of the content of the stack structure, analyzed the authenticity of the content from the stack structure. The perspective of extraction, semantic layout, and cycle consistency analyzes enhanced semantic correlation function and analyzes the diversity of content diversity from the perspective of training mechanisms and text processing. The thesis focuses on predecessors' representative methods' basic process and design ideas. The predecessor method is used to compare and analyze the predecessor methods through the existing data set. Forecasting and prospects will help researchers to further promote this field.

Download


Paper Citation


in Harvard Style

Liu Y. (2024). Research on Text-to-Image Generation Method Based on GAN. In Proceedings of the 2nd International Conference on Data Analysis and Machine Learning - Volume 1: DAML; ISBN 978-989-758-754-2, SciTePress, pages 124-130. DOI: 10.5220/0013510500004619


in Bibtex Style

@conference{daml24,
author={Yinuo Liu},
title={Research on Text-to-Image Generation Method Based on GAN},
booktitle={Proceedings of the 2nd International Conference on Data Analysis and Machine Learning - Volume 1: DAML},
year={2024},
pages={124-130},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013510500004619},
isbn={978-989-758-754-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 2nd International Conference on Data Analysis and Machine Learning - Volume 1: DAML
TI - Research on Text-to-Image Generation Method Based on GAN
SN - 978-989-758-754-2
AU - Liu Y.
PY - 2024
SP - 124
EP - 130
DO - 10.5220/0013510500004619
PB - SciTePress