Publication Date
Fall 2022
Degree Type
Master's Project
Degree Name
Master of Science (MS)
Department
Computer Science
First Advisor
William Andreopoulos
Second Advisor
Robert Chun
Third Advisor
Fabio Di Troia
Keywords
Recommender System, Machine Learning, Next Word Prediction, Natural Language Processing, AutoComplete
Abstract
With the increasing number of options or choices in terms of entities like products, movies, songs, etc. which are now available to users, they try to save time by looking for an application or system that provides automatic recommendations. Recommender systems are automated computing processes that leverage concepts of Machine Learning, Data Mining and Artificial Intelligence towards generating product recommendations based on a user’s preferences. These systems have given a significant boost to businesses across multiple segments as a result of reduced human intervention. One similar aspect of this is content writing. It would save users a lot of time if we were to come up with a way to auto-recommend words or phrases and not have to explicitly type them out. This project aims to develop a Natural Language Processing based pipeline using Machine Learning that provides next word prediction, that is, given a set of words, it will provide a list of most probable follow-up words. Moreover, we aim to form an entire sentence using intermediate next-word suggestions. The project also compares different techniques deployed to achieve the same and showcases the results on the MS COCO image-captions dataset. The results show that N-gram based probabilistic is accurately able to form an entire sentence using intermediate next-word suggestions and thus the approach performs the best on the above dataset for next word prediction.
Recommended Citation
Patel, Meet, "Caption And Image Based Next-Word Auto-Completion" (2022). Master's Projects. 1191.
DOI: https://doi.org/10.31979/etd.zetp-evd2
https://scholarworks.sjsu.edu/etd_projects/1191