HathiTrust Digital Library

Text and Data Mining

The HathiTrust Research Center (HTRC) enables computational analysis of works in the HathiTrust Digital Library (HTDL) to facilitate qualified research and educational uses of the collection. The Research Center creates and maintains a suite of tools and services for text-based, data-driven research, such as HTRC Algorithms and Data Capsule. It also conducts research on large-scale data analysis, helping scholars make full use of the content of the HathiTrust Digital Library.

Learn more at https://analytics.hathitrust.org/

What can I do in HathiTrust?

How can HathiTrust help me with my research and assignments?

  • Discover items digitized from libraries around the world. It’s like visiting dozens of college and university libraries to do research, all without leaving your desk.
  • Looking for primary sources? Find texts you need for research papers or projects in HathiTrust’s digital library of 17+ million books, journals, and publications.
    • 51% of the collection is in English, and hundreds of languages are represented, including large amounts of material in German, French, Chinese, Russian, and Spanish.
    • Texts include items in Language and Literature, Social Sciences, Sciences, the Arts, and more.
    • Uncover facts from the past in 1.3 million U.S. Federal Documents, including the U.S. Congressional Serial Set and publications from the Bureau of Indian Affairs, the U.S. Environmental Protection Agency, the U.S. Civil Rights Commission, and many other federal agencies.
  • Search every word in every book and item in the whole collection using keywords or phrases. Use these searches to discover items on desired topics or themes, identify items for interlibrary loan requests, or add items to your research bibliography.
  • Create a bibliography with a few clicks: View or download citations for any item in the collection in MLA or APA format.