Library Guides: Digital Scholarship Research Guide: Text Analysis

What is text analysis?

Text analysis, also known as text mining or distant reading, and closely related to computational linguistics, is the process of using software to extract meaningful information from a body of text by identifying entities, patterns, and relationships. Text mining can often help address research questions about large bodies of text that are impossible (or extraordinarily difficult) to answer through conventional human reading, known as close reading, alone. Text mining tools are meant to complement rather than replace traditional human-driven literary analysis. As with any research method, a content expert’s judgment is needed to assess how meaningful the results of a text mining process are and to interpret those results responsibly.
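
As a rough illustration of what "identifying patterns" can mean in practice, the short Python sketch below counts the most frequent words in a passage. It is only a minimal example, not a method used by any of the projects in this guide: the function name top_terms, the small stopword list, and the sample sentence are all made up for illustration, and real projects typically use far larger corpora and more sophisticated techniques.

```python
import re
from collections import Counter

# Hypothetical helper for illustration; the stopword list is a tiny, assumed sample.
def top_terms(text, n=3, stopwords=frozenset({"the", "a", "of", "and", "in", "to"})):
    """Return the n most frequent non-stopword tokens in text."""
    # Lowercase the text and pull out runs of letters (and apostrophes) as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    # Count every token that is not in the stopword list.
    counts = Counter(t for t in tokens if t not in stopwords)
    return counts.most_common(n)

# Invented sample passage, not drawn from any real corpus.
sample = "The whale and the sea. The whale dives in the sea, and the ship follows the whale."
print(top_terms(sample, n=2))  # [('whale', 3), ('sea', 2)]
```

Even a simple count like this needs interpretation: whether "whale" appearing three times is meaningful depends on the text and the research question, which is where the content expert comes in.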

Example projects

Robots Reading Vogue
Uses techniques such as topic modeling to explore trends in Vogue magazine over time.
Mining the Dispatch
Analysis of the text of the Richmond (Virginia) Daily Dispatch, a newspaper from the Civil War era.
The Pulter Project
Allows comparisons of different versions of the text of Hester Pulter's poetry.
Quantifying Kissinger
Analyzes and visualizes the text of Henry Kissinger's memos and phone calls.
Dante Lab
Web-based workspace for analysis of Dante's Divine Comedy.
Corpus Thomisticum
Comprehensive collection of the works of Thomas Aquinas.