In a world flooded with information, document clustering is an important tool that can help categorize and extract insight from text collections. It works by grouping similar documents, while simultaneously discriminating between groups. In this article, we provide a brief overview of the principal techniques used to cluster documents, and introduce a series of novel deep-learning based methods recently designed for the document clustering task. In our overview, we point the reader to salient works that can provide a deeper understanding of the topics discussed.
David Anastasiu and Andrea Tagarelli. "Document Clustering" Wiley StatsRef: Statistics Reference Online (2017): 1-11. doi:10.1002/9781118445112.stat07973