DESCRIPTION OF "MARKET DATA" FILES CREATED BY SEVERIN BORENSTEIN

 

The "Market Data" files replace the "Market Share" data files that I previously released.  Unlike the Market Share files, the Market Data files include interline one-way tickets.  These were rare in the 1990s when I created the Market Share files, but have become quite common with regional airlines code-sharing with majors.  Also, the Market Data include all carriers, not just the 30 largest, and include all routes.  As a result, these are larger than the Market Share datasets.  The structure is somewhat different as well.

 

The Market Data files are available as Stata datasets.  They are created from the domestic airline ticket data from the DOT's Databank 1A/1B (DB1A), a 10% sample of all tickets collected by US carriers. 

 

  -- Tickets with an international segment are excluded. 

  -- Tickets with a First-class or Business-Class segment are excluded. 

  -- Tickets must be one-way or round-trip; open-jaw, circle trips,

     etc are excluded.

  -- A ticket must have no more than 2 coupons for a one-way trip, no more

     than 4 coupons (and no more than 2 coupons each way) for a rountd-trip

     ticket.

  -- Tickets with fare less than $20  or fares above $9998 excluded.

  -- Tickets with fares more than 5 times USDOT's Standard Industry Fare Level for that trip distance in that quarter are excluded.

  -- Records are for one-way trips, so round-trip tickets are split into two one-way observations

  -- The Market Data dataset reports operating carriers, but drops information on ticketing carriers

 

The Stata dataset has one record per route/carrier-set/dir-cop where

-- route is a pair of airports without regard to direction (ordered alphabetically)

-- carrier-set is one carrier and a blank if the trip is one-coupon.  If the trip is two-coupon, carrier-set is the pair of airlines that provided the service on the two legs (possibly the same carrier) with carrier codes listed in alphabetical order.  Information about the order of flights is not retained.

-- dir-cop is a distinction between one-coupon and two-coupon tickets.  On two-coupon tickets, the location of the change-of-plane is not retained, though the average total routing distance for all c-o-p tickets that have been collapsed into a single record is reported.

 

The dataset includes the following variables

 

yr -- year

qtr -- quarter

ap1 -- 3-letter alpha code of the first airport (by alphabetical ordering)

ap2 -- 3-letter alpha code of the second airpor (by alphabetical ordering) - blank if one-coupon ticket

cr1 -- 2-letter alpha code of the first operating carrier (by alphabetical ordering)

cr2 -- 2-letter alpha code of the second operating carrier (by alphabetical ordering) - blank if one-coupon ticket

pax -- number of passengers reported in record

nsdst -- non-stop distance from airport ap1 to airport ap2

avdst -- average total routing distance of passengers in this record - equal to nsdst if one-coupon ticket

avprc -- average one-way equivalent price paid by passengers reported in this record

cop -- 0 if one-coupon ticket, 1 if two-coupon ticket

 

 

The Market Data dataset is available for 1979 to 2009, though there are data reliability issues for the first few years, and particularly for 1980q4 when EA and DL significantly under-reported.