
Text & Data Mining

A brief guide to tools and resources (including datasets) for getting started with computational approaches to textual analysis. This guide also includes instructions for getting started with Constellate, a hosted text analysis platform.

What is Constellate?

Constellate is a text and data analytics service from ITHAKA/JSTOR and Portico. It provides a platform for learning and performing text analysis, building datasets, and sharing curricular materials for course-integrated text mining projects.

Your Claremont Colleges network credentials provide you with full access to the platform and tutorials.

What content is in Constellate?

Constellate provides rights-cleared content from several data providers including:

How does Constellate work?

Figure: Screenshot of the Constellate homepage with the "Log In" link circled in red.

To start using Constellate:

  • Navigate to the Constellate homepage.
  • Click the "Log In" link in the top right-hand corner.
  • Create an account using your Claremont Colleges email address.

Constellate provides you with a hosted JupyterLab environment where you can create and share Jupyter notebooks. As of July 2024, Python is the primary supported language; R support is available as a beta release.
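As a rough illustration of the kind of Python you might run in a first notebook cell, the sketch below counts the most frequent words in a short passage using only the standard library. It does not use any Constellate-specific API; the function name `word_frequencies` and the sample sentence are hypothetical and exist only for this example.

```python
import re
from collections import Counter


def word_frequencies(text, top_n=5):
    """Return the top_n most common lowercase word tokens in text.

    Hypothetical helper for illustration; not part of Constellate.
    """
    # Lowercase the text and keep runs of letters/apostrophes as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    # Counter tallies each token; most_common sorts by descending count.
    return Counter(tokens).most_common(top_n)


# Made-up sample passage, standing in for a document from a real dataset.
sample = "Text mining turns text into data. Data about text can reveal patterns."
print(word_frequencies(sample, top_n=2))  # [('text', 3), ('data', 2)]
```

In a real project you would replace the sample string with text loaded from a dataset, and you would likely remove common stop words before counting.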

Learn to use Constellate