Publication Date

Fall 2025

Degree Type

Thesis

Degree Name

Master of Science (MS)

Department

Computer Engineering

Advisor

Carlos Rojas; Bernardo Flores; Mahima Suresh

Abstract

The study of genomic data is important for interpreting their function. Hi-C (high-throughput chromatin conformation capture) data gives researchers a way to analyze the 3-D structure of a genome. This helps researchers understand how distal interactions activate genes. While Hi-C data offers important insights into the mechanisms underlying diseases such as cancer, its limited resolution constrains a comprehensive view of interactions. This study investigates the potential of generative models to enhance the resolution of Hi-C maps, with a emphasis on leveraging the close relationship between text- and image-based embedding models. This work focuses on modifying existing architectures to improve results. Embedding the transformer architecture into Akita yielded notable performance gains. Another set of experiments focused on using a modified version of the Muse architecture and this showed promise. These findings suggest that generative models hold strong potential for working effectively on Hi-C data, and this work serves as an initial step toward advancing research in this direction.

Available for download on Thursday, February 11, 2027

Share

COinS