Publication Date
1-1-2023
Document Type
Contribution to a Book
Publication Title
Deep Learning Research Applications for Natural Language Processing
Editor
L. Ashok Kumar, Dhanaraj Karthika Renuka, S. Geetha
DOI
10.4018/978-1-6684-6001-6.ch007
First Page
113
Last Page
131
Abstract
A picture is worth a thousand words goes the well-known adage. Generating images from text understandably has many uses. In this chapter, the authors explore a state-of-the-art generative deep learning method to produce synthetic images and a new better way for evaluating the same. The approach focuses on synthesizing high-resolution images with multiple objects present in an image, given the textual description of the images. The existing literature uses object pathway GAN (OP-GAN) to automatically generate images from text. The work described in this chapter attempts to improvise the discriminator network from the original implementation using OP-GAN. This eventually helps the generator network's learning rate adjustment based on the discriminator output. Finally, the trained model is evaluated using semantic object accuracy (SOA), the same metric that is used to evaluate the baseline implementation, which is better than the metrics used previously in the literature.
Department
Applied Data Science
Recommended Citation
Vishnu S. Pendyala and VigneshKumar Thangarajan. "Reconnoitering Generative Deep Learning Through Image Generation From Text" Deep Learning Research Applications for Natural Language Processing (2023): 113-131. https://doi.org/10.4018/978-1-6684-6001-6.ch007
Comments
This is the Version of Record, and has been used with the permission of IGI Global, under their Fair Use Policy.