New York Times Annotated Corpus, 1987-2007

Contains the text of over 1.8 million news articles (excluding wire service articles) written and published by the New York Times between January 1, 1987 and June 19, 2007. Includes metadata tags for people, organizations, locations, and topics; summaries for over 650,000 articles; and Java tools for parsing the documents from XML.

License Information

Users: HBS Only
Vendor: New York Times/Linguistic Data Consortium
Start Date: 2017

Coverage

Start Date: 1987, End Date: 2007
