Harvard Business School

Baker Library | Bloomberg Center

 

WRDS

Introduction to CRSP Stock Price Data

This document is intended to provide background information, descriptions, definitions, and sources useful to those who want to use CRSP stock price data for financial research.

CRSP refers to the Center for Research in Security Prices, located at the University of Chicago. The CRSP database includes monthly and daily price quotations for common stocks, excluding preferred stock and including ADRs, traded on the New York and American stock exchanges and NASDAQ.

The terms of the CRSP database license maintain that the data may be used only for academic research. Commercial use and republication of the data are prohibited without permission. The following citation should be given when CRSP data are used in academic research:

Source: CRSP, Center for Research in Security Prices. Graduate School of Business, The University of Chicago (2004). Used with permission. All rights reserved. www.crsp.uchicago.edu

CRSP data are perhaps most conveniently accessed through the WRDS system at the Wharton School of University of Pennsylvania. This brief document can give only a cursory overview of the contents of the CRSP database. Comprehensive documentation is available through WRDS, or directly from CRSP.

This site also provides presentations and tutorials prepared by CRSP. Additional help with the use of CRSP data is also available through Baker Library Research Services.

Data availability

CRSP monthly stock price data cover the period from December 1925 through the present.* Daily data begin in July 1962. However, while NYSE data are available throughout the entire period, AMEX data start in 1962, and NASDAQ data begin in 1972. The following table summarizes the availability of data by date and exchange:

Period

Exchange

Start Date

Monthly

NYSE

December 1925

 

AMEX

July 1962

 

NASDAQ

December 1972

Daily

NYSE

July 1962

 

AMEX

July 1962

 

NASDAQ

December 1972


* CRSP stock price data as licensed by Harvard Business School are updated on a quarterly basis. Updates are generally within six weeks of the end of the latest calendar quarter. When accessing the data in WRDS, take care to select "Quarterly Updates..." to access the most recent data.

Data items

The principal data items available in CRSP include individual stock prices (high, low, and close; raw and adjusted); trading volume; dividends and other distributions; holding period returns; and market indices, including both CRSP proprietary indices and the S&P 500. The following table summarizes the principal data items in CRSP. Brief definitions are also provided later in this document.

Price

 

Adjustment factors

 

Individual returns

Low

 

Price

 

Ex-dividend

High

 

Share

 

Inc-dividends

Close

 

 

 

 

 

Distributions

 

Market indices and returns

Raw, Unadjusted

 

Cash dividends

 

Value-weighted

Split-adjusted

 

Stock dividends

 

Equal-weighted

 

 

 

 

Ex-dividend

Volume

 

Delistings

 

Inc-dividends

Unadjusted

 

Delisting returns

 

S&P 500

Adjusted

 

 

 

 

Identifiers

CRSP also includes identifying data for each company and stock covered by the database. These identifiers include the proprietary CRSP data items PERMCO and PERMNO, as well as other, more commonly encountered concepts:

CRSP Identifiers

 

Other

PERMCO

 

CUSIP

    Company ID

 

    8-digit

PERMNO

 

SIC

    Stock issue ID

 

Ticker

Definitions

The following definitions of the most critical CRSP variables may aid in the effective use of CRSP data.

  • PERMNO
    Every stock issue covered within CRSP is assigned a unique number, its PERMNO. While a company may change its name, ticker, exchange, or CUSIP from time to time, the PERMNO of a stock within CRSP remains unchanged. Consequently, the PERMNO is the principal identifier of a stock in the CRSP database, and provides a reliable way of tracking a stock over time. As discussed immediately below, a PERMNO identifies an individual stock. For the most part, companies will have only one issue of common stock on the market at any time, so there is usually a unique link between PERMNO and company. However, a company with multiple common stock issues on the market will be associated with multiple PERMNOs.

  • PERMCO
    If a company has issued multiple tranches of publicly traded common stock, CRSP will track all of them individually with different PERMNOs. The CRSP PERMCO is a number that uniquely identifies each company within the CRSP database. This makes it possible to associate different stock issues from the same company. Most companies have only a single common stock issue outstanding at any time, so there is usually a one-to-one correspondence between PERMNO and PERMCO. As of December 2003, for example, CRSP was covering three different series of common stock (A, E, and H) related to General Motors. These three stocks have different PERMNOs within CRSP, but the common PERMCO indicates that the stocks belong to the same company.

  • Split factor
    When a company splits its stock, prices and numbers of shares are not directly comparable over time. For example, on May 27, 1999, IBM split 2-for-1. The number of shares outstanding doubled from the previous day, from 920 million to 1.84 billion, while the price per share was cut roughly in half, from $236.25 to $116.00. Looking only at the raw, unadjusted, prices would give us a very inaccurate measure of the single-day change in price. We have to apply an adjustment factor, dividing the earlier price by two or multiplying the later price by two, to have comparable prices. By the same token, any share-based measurements - trading volume, for example - would have to be adjusted if we want to compare before- and after-split values

  • Price
    Usually, the CRSP price is the closing price, the price of the last reported trade on any given day. However, if no trade is recorded on a given trading day, the reported price will be the negative of the average of the bid and asked prices for that day. The price reported in CRSP monthly files is the price of the last trading day of the month.

    When comparing prices at different times, care must be taken to apply the split factor appropriately, as described above, to assure that the prices are truly comparable.

  • Price - Ask or High, Bid or Low
    Daily: The high price reported by CRSP is the highest trading price each day or, if no trade is reported, the closing ask price. Similarly, the low price is the lowest trading price of the day or, if no trade is reported, the closing bid price. CRSP also reports the high and low prices for each trading period.

    Monthly: The reported monthly high and low prices are the highest and lowest daily closing prices, rather than the highest and lowest daily high and low prices. Again, if no trade is recorded, the negative of the average of the daily bid and ask prices are reported instead.

  • Return
    The return measures the percentage change in value over a period of time, and must take into account the effects of splits and other capital actions. Further, depending on the research objective, one might include or exclude dividends and other distributions in the calculation of stock returns.

    In the following examples rt represents the holding period return from period t-1 through period t, pt represents the closing price at period t, ft represents the split factor, and dt represents the per share value of dividends or other distributions.

    • Excluding dividends

    • Including dividends

  • Calculating returns
    Splits and dividends are relatively infrequent events, so the return most days is simply the relative or percentage change in price from the end of one day to the end of the next:



    For example, the price of IBM on the NYSE at the end of May 12, 1999, was $225.50, while the price at the end of May 13, 1999 was $246. So the return between May 12, 1999 and May 13, 1999 was:



    On the other hand, IBM paid a dividend of $0.24 per share the previous week, so the return from May 5, 1999, through May 6, 1999, was:



    Finally, IBM split 2-for-1 later in the month, so the return from May 26, 1999, through May 27, 1999, was, incorporating a split factor:



  • Compound returns
    A compound return accumulates returns over a number of successive periods. It is calculated as:



    That is, add one to each individual period return; multiply all these elements; and subtract one from the final product.

    When cumulating returns, it is important to remember to add one to the individual period returns before multiplying the successive returns. Remember also to subtract one at the end of the process to convert the total cumulative value (1+r) to a total cumulative return, or total change in value.

    The following table illustrates the calculation of the compound return for IBM over the first week of May 1999. The stock accumulated nearly 4 percent in value over that five-day trading period.

    PERMNO

    Date

    Return

    Compound Return

    12490

    19990503

    0.014640

    0.014640

    12490

    19990504

    -0.001178

    0.013445

    12490

    19990505

    0.000590

    0.014042

    12490

    19990506

    -0.012422

    0.001446

    12490

    19990507

    0.038232

    0.039733


  • Market index returns
    CRSP calculates returns for equal-weighted and value-weighted portfolios of all available stocks each trading period, based on individual stock returns. Each of these is calculated in two ways, including dividends and excluding dividends.

    • Equal-weighted: The CRSP equal-weighted market return is a simple average of the returns of all traded stocks.

    • Value-weighted: The value-weighted market return is a weighted average of all stock returns, with the weights given by the market value of the stock issue (price times shares outstanding) at the end of the previous trading period.


Baker Library | Bloomberg Center, Harvard Business School, Soldiers Field, Boston, MA 02163, 617-495-6040