Publication Date

Fall 2024

Degree Type

Master's Project

Degree Name

Master of Science in Computer Science (MSCS)

Department

Computer Science

First Advisor

William Andreopoulos

Second Advisor

Katerina Potika

Third Advisor

Navrati Saxena

Keywords

Retrieval-Augmented Generation (RAG), AI Companions, Multimodal RAG, Large Language Models (LLMs), Generative AI, Vector Databases, Graph RAG, Prompt Engineering, Embedding Techniques, Knowledge Graphs, Contextual Retrieval, CLIP

Abstract

The rapid advancement in generative AI and large language models have forever revolutionized how we synthesize data. This project explores and experiments with the potential of a multimodal Retrieval-Augmented Generation (RAG) framework for processing text, tabular and image data. Starting with prompt engineering techniques, we address their limitations in dynamic and domain-specific real world applications by building a multimodal RAG pipeline and evaluating it against human-generated ground truth. The project culminates in BrightMind.ai, a full-stack educational platform featuring novel personalized AI companions for context-aware and adaptive response generation. Its innovative capabilities extend to music, video, and code generation, setting it apart from existing AI assistants. BrightMind.ai sets a benchmark for interactive, multimodal AI applications and opens pathways for enhanced user engagement with generative AI technologies.

Available for download on Saturday, December 20, 2025

Share

COinS