The Value of Residential Land and Structures during the Great Housing Boom and Bust

Nicolai V. Kuminoff and Jaren C. Pope

Abstract

This study examines how the value of residential land and structures evolved during the great housing boom and bust, using data on more than a million residential properties that were sold in 10 metropolitan areas between 1998 and 2009. We use a hedonic estimator to disentangle the market value of land and structures at a local (Census tract) level. Our estimates reveal substantial heterogeneity in the evolution of the market value of land and structures within metropolitan areas. Surprisingly, lowervalue land at the urban fringes of metropolitan areas was the most volatile during the boom-bust. (JEL R14, R21)

I. Introduction

Housing is a major source of wealth in the United States. The Federal Reserve’s Flow of Funds Report documents that the asset value of owner-occupied housing for the entire United States was approximately $23 trillion in 2006—more than the capitalized value of the NYSE, Amex, and Nasdaq exchanges combined. A house’s value can be decomposed into two components: the value of the land on which the house is built, and the value of the structures that comprise the house itself. Decomposing property value into the value of land and structures is important for several reasons. First, some cities and counties tax land and structures at different rates (Chapman, Johnston, and Tyrrell 2009; Banzhaf and Lavery 2010; Cho, Lambert, and Roberts 2010). Successful implementation of a split-rate tax requires accurate estimates for each component of value. Second, structures depreciate differently from land. Documenting this difference is necessary for calculating tax code allowances for depreciation and for insurance companies to reimburse homeowners for damaged structures. Third, understanding how the value of land has evolved relative to the value of structures may help households, banks, and local governments to manage risk within their financial portfolios. Finally, tracking the evolution of land and structural values within and across metropolitan areas may provide insights into the forces that drive boom-bust cycles in real estate.

There are three primary techniques for decomposing property values into the value of land and structures. The “teardown” approach derives land value from the sales prices (plus demolition costs) of properties that were purchased with the intention to tear down the existing structures (e.g., Rosenthal and Helsley 1994; Dye and McMillen 2007). The “replacement cost” approach infers land value from the difference between property values and the depreciated costs of replacing the structures on the property (e.g., Davis and Heathcoat 2007; Davis and Palumbo 2008). Finally, the “hedonic” approach estimates land value by regressing the sales prices of properties on the characteristics of the land and structures. In this case, the land value is commonly defined as the marginal implicit price per square foot of a lot (e.g., Clapp 1980; Glaeser, Gyourko, and Saks 2005). Bell, Bowman, and German (2009) provide an overview of the three methods, describing their strengths and weaknesses. The difficulty with the teardown approach is the sparseness of the data. There are too few teardowns to apply the methodology at a high level of spatial resolution in large geographic areas. Bell, Bowman, and German (2009, 177) describe a more nuanced set of trade-offs between the replacement cost and hedonic approaches but conclude that “the contribution [hedonic] principle of value seems more consistent with the notion of market value.”1 Despite this conceptual advantage, the replacement cost approach is more commonly used. This is partly due to its apparent simplicity and relatively low data requirements. However, it also reflects concerns that omitted variables may confound hedonic estimates of land value.2 If the houses located in neighborhoods with higher land values also tend to have nicer physical characteristics that are not readily observed by the analyst (e.g., granite countertops, hardwood floors) then the value of land will be confounded with the value of unobserved physical characteristics.

The objective of this paper is twofold: (1) to refine the hedonic approach to estimating land value in a way that mitigates omitted variable bias, and (2) to use our refined methodology to analyze how the market value of land and structures evolved within and across major metropolitan areas during the 2000s. Our data describe the sale prices, physical attributes, and geographic locations of more than a million houses that were sold in 12 metropolitan areas between 1998 and 2009: Boston, Charlotte, Cincinnati, Detroit, Los Angeles, Miami, Oakland, Philadelphia, Pittsburg, San Francisco, San Jose, and Tampa. According to the Standard & Poor’s Case-Shiller repeat sales index, residential property values in major metropolitan areas more than doubled between 1998 and 2006 and then declined by approximately 40% between 2006 and the end of 2009 (Figure 1).3 Our data span this remarkable boom-bust cycle.

Figure 1

Standard and Poor’s Case-Shiller National Housing Price Index

The location of each house conveys access to a specific bundle of local public goods and also defines the commuting opportunities that would be faced by a working household. These localized amenities may be in limited supply due to zoning regulations and other forms of development restrictions (Glaeser, Gyourko, and Saks 2005). As a result, it is important to recognize that land values may vary across neighborhoods within a metropolitan area. Equally important is the need to recognize that the market values of land and structures may evolve differently over time. The relative price of land may increase over time as developable land becomes scarcer. Likewise, changes in credit constraints or wealth may alter the relative demands for the public and private attributes of housing in ways that differ across time and space.

To characterize the spatiotemporal variation in the value of land and structures, we estimate housing price functions for each metropolitan area, while allowing the shape of the price function to change from year to year. Our estimator extends the conventional hedonic approach in two ways. First, we use fixed effects for U.S. Census tracts to capture spatial variation in localized amenities that contribute to land value through a parcel’s location, rather than its size. Second, we add interactions between the fixed effects and square footage of living space to capture spatial variation in latent attributes of structures. We then generate estimates for annual average values of land and structures at the level of an individual Census tract. Our spatially explicit estimates are typically an order of magnitude larger than estimates based on the conventional hedonic approach.

When we aggregate our tract-level estimates up to the level of a metropolitan area, they are generally consistent with the metropolitan area averages reported prior to the boom by past studies using the replacement cost method (e.g., Davis and Palumbo 2008). The same is true after the bust. However, the two sets of estimates diverge during the boom-bust period. Our estimates for land values do not rise as fast during the boom or fall as quickly during the bust. Our estimates imply that the market value of structures exceeded their replacement cost during the height of the boom. The differences can be large—up to 100% for San Francisco. One potential explanation is that local housing markets are less than perfectly competitive. With a small share of houses on the market at any one time, the unique bundle of amenities provided by a desirable neighborhood may allow home sellers to command a markup on the structural characteristics of their houses, as Taylor and Smith (2000) first observed. Indeed, we find that neighborhoods with higher preboom land values (presumably the higher-amenity neighborhoods) had larger markups on structures during the boom. Over time, we would expect these markups to stimulate new construction, following the general logic of Tobin’s q-theory.

Consistent with Davis and Palumbo’s (2008) analysis of the variation between metropolitan areas, we find that land values are more volatile in metropolitan areas where the supply of housing is less elastic. Interestingly, we find the opposite pattern within metropolitan areas. Neighborhoods at the urban fringe, where we would expect the supply of housing to be most elastic, were the neighborhoods that experienced the most volatility in housing prices and land values during the boom and bust. This general pattern can be seen in the Case-Shiller index. Figure 24 displays indices for the lowest, middle, and highest tier of houses (ranked by 2010 value) for Miami, San Francisco, Boston, and Atlanta. Within each metropolitan area, the lowest-value houses were the most volatile and the highest-value houses were the least volatile.5 We find that the higher-value houses tend to be located closer to the city, where the supply of land is least elastic, and the lower-value houses tend to be located at the outskirts of the surrounding suburbs, where most of the new housing is built. This suggests that factors other than supply elasticity of housing are playing an important role in the evolution of land and structural values.

Figure 2

Hetergogeneity in the Evolution of Housing Prices across and within Metropolitan Areas

Overall, this research makes three contributions to the literature on land valuation. First, we use high-resolution spatial fixed effects to refine the conventional hedonic approach to decomposing housing prices into the implicit value of land and structures. Second, we use a consistent estimation strategy to provide new estimates for how land values evolved within and across several metropolitan areas during the remarkable boom-bust cycle of the 2000s. Finally, while prior studies have reported that land values vary across space and time, we document two novel features of this variation that deserve more attention in future research: (1) the least valuable land at the urban fringes of metropolitan areas was the most volatile during the recent boombust cycle; and (2) the market value of structures exceeded construction costs during the boom, with the largest markups occurring in the most affluent neighborhoods.

II. The Market Value of land and Structures in a Metropolitan Area

We begin from a standard description of residential sorting. Heterogeneous households are assumed to choose from a stock of houses with different lot sizes and structural characteristics (e.g., bedrooms, bathrooms, square feet). Their collective location choices will in turn influence the supply of neighborhood amenities (e.g., public school quality, commute time to the city center, preservation of open space) through a combination of voting, social interactions, and feedback effects.6 Formally, an individual household’s utility maximization problem is

Embedded Image[1]

In period t, household i selects one of j= 1,...,Jk houses located in one of k=1,...,K neighborhoods. Their utility depends on the lot size of the parcel (l ), the structural characteristics of the house (x ), the amenities provided by the neighborhood ( g), and the income the household has left over to spend on the numeraire good ( b ) after paying the annualized after-tax price of housing (Pjkt). The household’s idiosyncratic preferences are represented by α it.

Sellers in this market may include a mix of developers and individuals selling their houses. There is no need to be more specific about the supply side of the market. Under a pair of weak restrictions on consumer preferences, any market outcome consistent with utility maximizing behavior can be described by a hedonic price function.

Assumption 1.

a. Ui(gkt,ljk,xjkt,b,αit) is strictly increasing in b for all b∈(0,yit).

Let ≥ i represent household i’s preference ordering over all potential location choices that satisfy the household’s budget constraint. ≥ i is invariant to i’s actual location choice.

The first condition is self-explanatory. The second condition simply limits the scope for any one household to influence prices or the supply of neighborhood amenities. For example, suppose household i has exceptionally bright children. If i were to move from its current house in school district R to a new house in school district S, then school quality may increase marginally in S due to peer effects, and decrease marginally in R. These adjustments may be followed by changes in housing prices. Condition b implies that these changes must be sufficiently small to leave i’s preference ordering over the two houses unchanged.7 The need for this restriction becomes apparent in the proof of Theorem 1, which is simply a variation on results derived by Bajari and Benkard (2005).8

Theorem 1. Suppose that Assumption 1 holds for every household. Then for any two houses, j,k and r,s, it must be true that Pjkt= Prst if gkt = gst, ljk = lrs, and xjkt = xrst.

Proof. Suppose i chooses j,k given Pjkt>Prst. Then Ui(gkt,ljk,xjkt,yit-Pjkt; αit) < Ui(gst,lrs, xrst,yit — Prst; αit) because utility is strictly increasing in the numeraire. This preference ordering is invariant to whether i locates in j, kor r, s. Therefore, j, k cannot be a utility-maximizing location for i in period t, which is a contradiction. QED

Theorem 1 states that property values will be functionally related to neighborhood amenities, lot sizes, and structural housing characteristics during a single period. Relative to the majority of the empirical literature that maintains the assumptions of Rosen’s (1974) hedonic model as a basis for measuring the willingness to pay for urban amenities, Theorem 1 is notable for what it does not assume. We do not require the market to be perfectly competitive. Nor do we require households to be free to choose continuous quantities of every housing characteristic in every neighborhood. Some characteristics may be approximately continuous; others may be discrete. For example, households may be free to choose square footage continuously over some range, whereas public school quality may change discretely as one crosses the border between two adjacent school attendance zones (Black 1999).

Relaxing continuity and perfect competition means that we lose the ability to interpret equilibrium marginal implicit prices for characteristics as measures that identify consumers’ marginal willingness to pay for those characteristics or firms’ marginal costs (Feenstra 1995; Taylor and Smith 2000; Kuminoff, Smith, and Timmins 2010). If discreteness in the choice set prevents buyers from setting their marginal rates of substitution between characteristics equal to the corresponding implicit price ratios, then the market clearing prices for individual characteristics may understate or overstate individual marginal willingness to pay. Likewise, if sellers can exert some degree of market power, then implicit prices for specific characteristics may not equal their marginal costs.

The reason for relaxing continuity and perfect competition in our conceptual model is that in reality some neighborhoods are constrained by geographic features such as water bodies, steep terrain, and public land that can produce discreteness in the supply of amenities, whereas other neighborhoods use zoning regulations to explicitly constrain further urban development. If a constrained neighborhood provides access to a unique bundle of amenities, then the amenity bundle may convey market power to property owners in the neighborhood (e.g., see Taylor and Smith 2000). We return to this point in Section VI, where we consider local market power as a potential explanation for some of our results.

Our specification for the hedonic price function, Pjkt= P(gkt,ljk,xjkt), describes a spatial landscape at a single point in time where prices, amenities, and location choices are all defined such that no household would prefer to move, given its income and preferences. This is a single-period snapshot of market outcomes; it may or may not be a long-run steady state. Current period incomes and preferences may reflect temporary macroeconomic factors. Credit may be unusually easy (or difficult) to obtain relative to a long-run equilibrium. The average household may be unusually optimistic (or pessimistic) about the future asset value of housing. Budget constraints may reflect other temporary macroeconomic shocks. As all of these factors change over time, households may adjust their behavior in ways that alter the shape of the price function and generate boom-bust cycles.

During a boom-bust cycle, the evolution of the price function can be decomposed into changes in the market value of land and structures. To illustrate this, we first define the market value of a property at a single point in time as its current annualized price.

Definition 1. Pjkt ≡ P(gkt,ljk,xjkt) is the market value of property j,k in period t.

The value of the underlying land is then defined by the thought experiment where we remove all of the structural characteristics from the property.

Definition 2. LVjktP(gkt,ljk,0) is the land value of property j,k in period t.

LVjkt measures what a vacant (but otherwise identical) parcel to j would sell for in the same neighborhood.9 This definition of land value captures the spatial trade-off between commuting costs and accessibility to the city center (Alonso 1964; Muth 1969; Mills 1967), as well as the value of local public goods and urban amenities conveyed by the neighborhood (Tiebout 1956).10

Finally, subtracting land value from total market value yields the value associated with a property’s structural characteristics, xjkt.

Definition 3. SVjkt = Pjkt- LVjkt is the structural value of property j,k in period t.

While it is conceptually straightforward to decompose property value into the value of land and structures, empirical implementation presents several challenges.

III. Estimating the Market Value of Land and Structures

Background

If life were more like a laboratory experiment, there would be no need to estimate land values. Sales of vacant parcels would be randomly distributed throughout metropolitan areas, and we would simply measure their transaction prices. The problem, of course, is that vacant land sales typically occur at the fringes of urban areas. We rarely observe such transactions occurring in built-up neighborhoods. In an established neighborhood, the closest substitute for a vacant land sale is likely to be a “teardown.”

When an existing structure is purchased with a plan to demolish it and build new housing, the value of the underlying land should equal the sale price of the developed parcel less demolition costs. Rosenthal and Helsley (1994) were the first to apply this idea to infer land values from teardown properties in Vancouver, British Columbia. In subsequent work, Dye and McMillen (2007) and McMillen (2008) refined the econometrics to control for the non-random selection of which parcels are torn down and provided new evidence on land values in Chicago. While teardowns can support a convincing quasi- experimental approach to measuring land value, the active markets are too few and too thin to apply the method broadly across the United States or at a high level of spatial resolution throughout a single metropolitan area.

Since the lack of data makes it difficult to measure the market value of land directly, analysts have sought to estimate it indirectly from hedonic regressions or replacement cost equations. Both strategies begin by rearranging the decomposition in Definition 3,

Embedded Image[2]

Given data on the structural characteristics of houses and their transaction prices, equation [2] can be used to estimate land values. In the replacement cost framework, two maintained assumptions are sufficient to guarantee the estimator will be consistent. First, the market for housing is assumed to be sufficiently competitive that the market value of a structure will equal the cost of rebuilding that structure in its current condition: replacement costjktRCt(xjkt) =SVjkt. Second, the replacement cost function is assumed to be known. Under these assumptions, one can obtain a consistent estimate for land value as the residual obtained by subtracting replacement cost from the price of housing,

Embedded Image[3]

Davis and Heathcoat (2007) applied this logic at the national level to develop the first macroeconomic index of residential land value in the United States. Davis and Palumbo (2008) refined their methodology to control for variation in property values and construction costs across major metropolitan areas. They developed a database describing the value of land in 46 major metropolitan areas between 1984 and present.11

During boom-bust cycles, the replacement cost framework tends to attribute most of the changes in property values to speculation on land. This follows from the mechanics of [2]-[3]. If residential construction costs are relatively stable during a period when property values are rising rapidly, then observed changes in property values will be interpreted as changes in land value. This was exactly what happened during the recent boom. The replacement cost model indicates that the ratio of land value to property value on the West Coast increased from 61% in 1998 to 74% in 2004, for example (Davis and Palumbo 2008). We have no doubt that the market value of land did increase during the boom. However, the replacement cost estimates for the magnitude of the change may be too high if housing markets are less than perfectly competitive or if zoning restrictions and permitting requirements drive a wedge between construction costs and effective replacement costs in the short run.

The hedonic approach to estimating land values avoids the need to specify replacement costs or assume that markets are perfectly competitive. Instead, the key maintained assumption is a parametric specification for the relationship between the sale price of a house and its characteristics. Equation [4] presents a linear example reflecting the spatiotemporal structure ofpasthedonic landvalue estimators.

Embedded Image[4]

In this case, provides an estimate of the implicit marginal price of land, and · j provides an estimate for the property’s land value. Efforts to estimate δ from data on individual housing sales date back at least to Clapp’s (1980) study of land values in Chicago.12 Over the years, the methodology has been refined to allow more flexible parametric specifications for the hedonic price surface (Cheshire and Sheppard 1995) and extended to compare estimates across 21 metropolitan areas (Glaeser, Gyourko, and Saks 2005).

There are two key challenges to developing credible hedonic estimates for land values. The first challenge—omitted variable bias— is widely recognized. For example, one might expect that houses built on larger lots will also tend to be built using higher-quality materials. Because data on building materials are typically unavailable, their effect on sale prices will be confounded with the value of land (McMillen 2008). Another concern is that an estimate for the depreciation of structures (from the coefficient on age) may be confounded with unobserved neighborhood amenities, because all of the houses in a subdivision tend to be built at about the same time (Davis and Palumbo 2008). More generally, there is always likely to be some degree of spatial correlation between observed parcel characteristics and unobserved neighborhood amenities that will ultimately bias the estimator (Kuminoff, Parmeter, and Pope 2010).

The second challenge is to choose a specification for the price function that is sufficiently flexible to capture the key features of spatial and temporal variation in land values. Past studies have focused on allowing the per unit price of land ( δ ) to vary flexibly within a metropolitan area (e.g., see Cheshire and Sheppard 1995). While this is an important dimension of heterogeneity, we hypothesize that it is equally important to distinguish between the variable (i.e., quantity-based) and fixed (i.e., access-based) components of land value, while recognizing that marginal implicit prices for both may change over time due to changing market conditions.13

Access matters. This is a central theme of public, urban, and environmental economics. Commuters value access to the central business district (CBD). Homeowners value access to local public goods and amenities that contribute to their quality of life. These values are fundamental to the models of urban spatial structure and neighborhood formation that build on the work of Tiebout (1956), Alonso (1964), Mills (1967), and Muth (1969). Within a neighborhood, the value of access will be approximately fixed, independent of parcel size. As one moves to a different neighborhood with higher crime rates, lower-quality schools, and/ or a longer commute to the CBD, the value of access may drop sharply. To identify spatial variation in access value separately from spatial variation in the per unit price of land, the analyst must observe several housing transactions within each neighborhood during an interval over which land values are relatively stable.14 To identify temporal variation in each of these components, the analyst must observe a large number of observations in each period. Our econometric model is specially designed to accomplish these tasks using data on the universe of housing sales within a metropolitan area together with controls for omitted variables.

Refining the Hedonic Approach to Estimation

Our approach to estimating land values relies on micro data that are sufficiently rich to allow us to estimate annual price functions for metropolitan areas, while simultaneously using spatial fixed effects to capture the market value of latent attributes of land and structures. In the case of land, the issue is that no existing database provides comprehensive coverage of spatial variation in access-based amenities below the level of a county. This is important because amenities often vary significantly within a county. To measure this variation we use spatial fixed effects for neighborhoods, which we define to be Census tracts.15 Within a tract, access to amenities will be approximately fixed. Children will be assigned to public schools in the same school district, their parents will face the same commuting opportunities, and there will be little or no variation in crime rates or air quality. Thus, we would expect tract fixed effects to absorb the composite value of access to these and other neighborhood amenities.

In the case of structures, micro-level data are typically limited to the attributes recorded by the county assessor. Some houses have hardwood floors, granite countertops, skylights, solar panels, and spas. Unfortunately, these improvements are rarely noted in the county records. If the quality of building materials varies systematically across neighborhoods in ways we do not observe, then their average effect on property values may be confounded with our estimates for the fixed component of neighborhood land value. To mitigate this potential source of confounding, we add a set of interactions between the fixed effects for neighborhoods and the square footage of the house. The resulting terms are intended to capture systematic variation across neighborhoods in the average value of a square foot of structural improvements.

We adopt a semilog form for the estimation, regressing the log of transaction prices for all of the single-family residential properties sold in a metropolitan area during year t on their lot sizes, their structural characteristics, and two sets of fixed effects,

Embedded Image[5]

The first two terms after the equality correspond to the property’s land value. ξ kt denotes the neighborhood fixed effects. They will measure the component of land value that is constant across all the houses sold within tract k during year t , regardless of lot size. The neighborhood amenities that enter ξ kt may also interact with the size of the lot to influence the variable component of land value. For example, the marginal value of yard size may be larger in quieter neighborhoods with lower crime rates. Therefore, we allow the coefficient on lot size, δ kt, to vary over neighborhoods as well.

The third and fourth terms after the equality correspond to the value of structural improvements. x jktβ t measures the component of property value that can be explained by the housing characteristics that are observed. While we allow the implicit prices of characteristics to change over time, we restrict them to be fixed within a metropolitan area during the course of a year. γ ktsqftjkt measures systematic variation in the average value of a square foot of living space that varies across neighborhoods due to unobserved structural improvements.

Finally, we interpret the error term ε jkt as the composite of three effects. It will reflect (1) unobserved idiosyncratic structural improvements that differ from the tract average, (2) idiosyncratic access to amenities within a neighborhood,16 and (3) misspecification in the shape of the price function. To mitigate the first two effects, we aggregate our microlevel estimates for the value of land and structures to report averages for Census tracts, counties, and metropolitan areas. This also allows us to compare our results to estimates from the prior literature. While our resulting point estimates surely contain some error, and their standard errors may be affected by spatial autocorrelation, we expect the magnitude of the bias in our point estimates to be smaller than in previous hedonic studies because of the ways in which our model enhances spatial and temporal resolution and controls for omitted variables.17 To evaluate the impact of these refinements, we use the fact that our model nests the conventional hedonic specification as a special case. Equation [5] reduces to [4] if we omit spatial fixed effects (γkt = (γ kt = ξkt =0) and restrict the implicit price per acre of land to be constant within a metropolitan area (δkt= δ).

IV. Data and Summary Statistics

Our analysis is based on more than one million observations on the sales of single-family residential properties across the United States. We purchased the data from Data- Quick. This widely used commercial vendor of real estate data assembled the data from assessor’s offices in individual towns and counties. The data include the transaction price of each house, the sale date, and a consistent set of structural characteristics, including square feet of living area, number of bathrooms, number of bedrooms, year built, and lot size. Using these characteristics, we performed some standard cleaning of the data, removing outlying observations, removing houses built prior to 1900, and removing houses built on lots larger than 5 acres.

The data also include the physical address of each house, which we translated into latitude and longitude coordinates using GIS street maps and a geocoding routine. The latlong coordinates were then used to assign each house to its corresponding census tract. The tract-level assignment provides the needed spatial resolution to analyze trends in land values within and across metropolitan areas during the boom-bust cycle. Furthermore, it allows us to use spatial fixed effects to control for the average effect of latent variables within each tract.

While we conducted the econometric analysis for 10 metropolitan areas, we focus on four of them in greater detail in order to illustrate our main results: Miami, San Francisco, Boston, and Charlotte.18 We selected these four because all have complete data between 1998 and 2008, they provide geographic variation on populous areas in the United States, they provide variation in the supply elasticity of land, and they differ in the intensity of their boom-bust cycles. Figure 2 illustrates the differences in the sizes of their booms and busts using the Case-Shiller Home Price Index. Each panel also reports Saiz’s (2010) estimates for the supply elasticity of housing.19

Table 1 provides summary statistics for the housing transactions that we observe in Miami, San Francisco, Boston, and Charlotte. The first two rows of each panel illustrate that the average sale price rose in all four areas between 1998 and 2006. The size of the increase was most striking in Miami ($162,000 to $410,000) and San Francisco ($343,000 to $809,000), where prices more than doubled in nominal terms. These increases do not reflect any obvious changes in the composition of houses on the market. The structural characteristics of the average sale property are essentially constant over the study period. In each area, the median transaction was a single-family house with three bedrooms, two baths, and between 1,600 and 1,900 square feet of living area. Naturally, Charlotte and Miami have newer housing stocks than San Francisco and Boston. Lot sizes also tend to be larger in Charlotte and Boston than in Miami and San Francisco, reflecting variation in the balance between sales from the cities and suburbs.20

Table 1

Summary Statistics for Miami, San Francisco, Boston, and Charlotte

V. Results

Comparison to Preboom Estimates from the Existing Literature (1998-1999)

We begin by comparing our estimates for land values to previous figures generated by the conventional hedonic estimator by Glaeser, Gyourko, and Saks (2005) (henceforth GGS) and the replacement cost estimator developed by Davis and Palumbo (2008) (henceforth DP). Neither study had the benefit of our spatially delineated micro data on actual housing sales. Instead, they combined data from the American Housing Survey with other sources to generate estimates for average land values within several metropolitan areas. Fortunately, some of their estimates overlap with the spatial dimensions of our data prior to the onset of the housing market boom, providing an opportunity for comparison. The purpose of the comparison is to investigate how our refinements to the hedonic land value estimator influence the accuracy of our results. We would expect DP’s replacement cost calculations to generate reasonable estimates for land values prior to the boom. The relative stability of the market in the mid to late 1990s would have allowed developers considerable time to meet demand, mitigating any wedge between construction costs and effective replacement costs. Thus, we view DP’s estimates as the most reliable baseline for comparison.

The task of estimating land values is a relatively small component of the overall analysis by GGS. Their main objective is to test the hypothesis that land use regulations impose an effective tax that explains the rise in housing prices in major metropolitan areas. To illustrate their point and to compare housing prices to construction costs, GGS estimate the “freemarket cost of land” using a conventional hedonic model (similar to equation [4] above) for 21 metropolitan areas based on data from the 1998 and 1999 installments of the American Housing Survey.21 We have the requisite information to develop comparable estimates for 10 of their 21 metropolitan areas. Conveniently, DP also report estimates for all 10 areas.

To provide the best possible comparison, we focus on the subset of our data that overlaps with the information used by GGS. Specifically, we limit our data to the year that matches the year in which each metropolitan area was covered by the AHS (either 1998 or 1999). Then we subdivide metropolitan areas to match the disaggregate definitions used in the AHS. This means subdividing the San Francisco consolidated metropolitan statistical area into the San Francisco, Oakland, and San Jose primary metropolitan areas, for example. While our micro data still differ from the AHS in terms of the number of observations and the richness of information on structural characteristics, their spatial and temporal dimensions are the same.

The first column in Table 2 simply reproduces the estimates of land value (on a per acre basis) from Table 4 of GGS. In Column 2, we report the results from our attempt to come as close as possible to replicating their estimating equation, given the differences between the variables in our data and the AHS micro data.22 A quick comparison between Columns 1 and 2 confirms that the two sets of estimates are quite similar (with Tampa as the exception). Overall, the estimates line up with our general intuition for which metropolitan areas ought to have more expensive land. San Francisco, San Jose, Oakland, and Los Angeles have the highest measures of land value, whereas Detroit and Tampa have the lowest. However, all of the estimates seem implausibly low for the late 1990s. Could you really buy an acre of land in San Francisco for under $200,000 or in Boston for under $30,000? A likely explanation is that the conventional hedonic estimator does not fully capture the fixed component of land value associated with access to the local public goods and amenities (ξ kt).

Table 2

Comparing Traditional Hedonic Estimates with Estimates Generated by the Replacement-Cost Method and Our New Hedonic Method

Table 4

EEousing Units. Property Values, and Land Values in the San Francisco Bay Area. 1998-2009

Column 3 reports the corresponding replacement cost estimates for land value from DP. They used information published by R.S. Means Company (2004) to develop metropolitan-level estimates for replacement cost. Their measures for housing prices were developed by combining data on price levels in each metropolitan area during AHS survey years with time-series data on the percentage change in housing prices from Freddie Mac’s Conventional Mortgage Housing Price Index (CMHPI). The rank order in Column 3 is similar to the first two columns, but the replacement cost estimates are typically an order of magnitude larger! While there are some slight variations in the datasets used to develop the estimates in Columns 1 and 3, none seem capable of generating order of magnitude differences.23 It seems more likely that the differences are due to estimation procedures. In particular, the access-based component of land value associated with local public goods would be included in the replacement cost estimates and excluded from the estimates generated by conventional hedonic regressions.

Column 4 reports the estimates from our refinement to the hedonic estimator, using the specification in equation [5].24 Generalizing the conventional hedonic model to allow for access-based amenities and latent housing characteristics increases our estimates by an order of magnitude (moving from Column 2 to Column 4). The resulting estimates align much more closely with the estimates from DP’s replacement-cost model.

Finally, it is important to reiterate that the similarity between our estimates and DP’s occurs during a two-year period prior to the boom. As we track the two sets of estimates over the course of the boom-bust cycle, we see some interesting differences.

The Evolution of Average Land Values during the Boom-Bust Cycle (1998-2009)

We estimated equation [5] for each (metropolitan area, year) combination from 1998 and 2009. Table 3 summarizes results for the four metropolitan areas where we have a complete set of data: Miami, San Francisco, Boston, and Charlotte. It reports our measures for the evolution of land values and the share of property value attributed to land (“land share”), alongside the replacement cost estimates from DP.25 The hedonic measures were generated by averaging our parcel-specific estimates for land values and improved values over all of the housing transactions in each metropolitan area. There are some obvious differences between the two sets of estimates at the market’s peak.

Table 3

Evolution of Land Values and Land Shares between 1998 and 2009

Figure 326 illustrates the differences graphically. Focusing on the first column in the figure, it is clear that land values estimated by both methods rise and fall during the boom and bust. Prior to the boom, the two sets of estimates are similar. The same is true following the bust. However, the peak amplitude is much larger in the replacement cost estimates.

Figure 3

Evolution of Total Land Value (left) and Structural Value (right) for Four Metropolitan Areas

The second column of Figure 3 illustrates how estimates for the value of structures evolved over the same period. The hedonic model suggests that the market value of structures rose and fell in tandem with the market value of land during the boom-bust cycle. The replacement cost measures rose steadily, following a similar trend in every metropolitan area. Once again, the differences between the hedonic estimates and the replacement cost measures are largest at the height of the boom.

Does the difference between the two sets of estimates reveal something interesting about the behavior of housing markets during the boom-bust cycle? Or does it merely reflect differences in the underlying data? While our comparisons were made along a consistent set of spatial and temporal dimensions, the underlying micro data are not the same. DP’s replacement cost estimates are based on integrating the AHS data with Freddie Mac’s CMHPI, whereas our hedonic estimates come from assessor data. In principle, the assessor data describe the universe of housing transactions, whereas the CMHPI is limited to transactions where: (1) the transaction was a repeated sale, and (2) the buyer took out a conventional mortgage that was purchased or insured by Freddie Mac or Fannie May.

Figure 427 compares the evolution of average property values in the two datasets. The assessor data suggest slightly smaller increases in property values during the boom. One possible explanation is that less expensive transactions were more likely to be associated with unconventional mortgages. Another explanation is that new houses built over this period tended to be located near the urban fringe, where land values (and property values) were lower. In any case, the differences between the two measures of average property value in Figure 4 are dwarfed by the differences in estimated land values in Figure 3. Thus, the differences between the hedonic and replacement cost estimates for land value appear to be tied to methodology, not the underlying data.

Figure 4

Comparing Our Assessor Data to Freddie Mac’s Conventional Mortgage Home Price Index

Data differences aside, the main economic implication of our comparison between the hedonic and replacement cost estimates is that, during the boom, the market value of structures may have exceeded their replacement costs. To further investigate this possibility, we examine the spatial variation in the evolution of land values within each metropolitan area.

Within-Metropolitan Heterogeneity in the Evolution of Land Values (1998-2009)

Figure 528 illustrates the spatial heterogeneity in land values across counties in the greater San Francisco and Boston metropolitan areas.29 The left-most maps display the land value of the average residential property sold in 1998, the change in average land value during the boom (1998-2006), and the change in average land value during the bust (2006-2009). In the San Francisco metropolitan area the counties with the highest land values in 1998 are San Francisco and San Mateo, followed by Marin and Santa Clara. These same counties experienced the largest increases in land value during the boom and the smallest decreases during the bust. Looking at the leftmost maps in Figure 5 for the Boston metropolitan area reveals a similar pattern.

The right-most maps in Figure 5 focus on the ratio of land value to total property value. The three maps display the average land share in 1998 and the subsequent changes during the boom and bust periods. Focusing first on the greater San Francisco metropolitan area, we see that areas with higher land shares in 1998 (e.g., San Francisco and San Mateo) tended to see drops in land share during the boom and increases during the bust. Again, a similar pattern emerges in the Boston metropolitan area.

Figure 5

Within Metropolitan Land Value Heterogeneity (left) and Heterogeneity in the Evolution of Land Shares (right), San Francisco Bay Area and Boston Area

Overall, the spatial heterogeneity in the evolution of land values within the Boston and San Francisco areas seems somewhat counterintuitive. The counties that experienced the least volatility in land values during the boom-bust cycle are the same counties that we would expect to have the most inelastic supply of housing. This pattern is the opposite of what the prior literature has observed about the variation in land values between metropolitan areas (e.g., Davis and Palumbo 2008). Figure 2 provides an example of the stylized fact that housing prices (and land values) tend to be more volatile in metropolitan areas where the supply of land is relatively inelastic.30 Why would the volatility of land values be so different within a metropolitan area?

To further investigate the relationship between land value and housing supply, Table 4 summarizes trends in land values and permits issued for the construction of new housing units in the San Francisco Bay Area. Column 1 reports the baseline number of owner-oc-cupied housing units by county from the 2000 Census, and Column 2 reports the number of new permits for construction of single-family residential housing units. The counties are ranked by Column 3, which reports the ratio of Column 2 to Column 1. The ratios are smallest for San Francisco and its adjacent coastal counties (Marin and San Mateo). The same is true if we look at the ratio of all new permits to all housing units in Column 4. This ratio is much higher for San Francisco because it includes permits to build apartment units. It also includes all housing units in the denominator, regardless of occupancy status. In the absence of county-level estimates for the supply elasticity of housing, Columns 34 provide a crude proxy for the responsiveness of housing supply during the boom.

Comparing the ratios in Columns 3-4 with the values of land and property in Columns 5-12 highlights five interesting trends.31 First, at the start of the boom period, property values and land values were higher in counties where the supply of housing was less responsive. This is true whether we look at the median self-reported property values in Column 5, the mean of actual transaction prices in Column 6, or our estimates for mean land values in Column 7. Second, land tends to represent a smaller share of total property value in counties where the housing supply is more responsive (comparing Columns 6 and 7). Third, while the counties with the least responsive housing supply experienced the largest nominal increases in land values during the boom (Column 8), these increases were relatively small in percentage terms (Column 9). Fourth, during the bust, the counties with the least responsive housing supply experienced the smallest decreases inland values in both nominal and percentage terms (Columns 10-11). Finally, and perhaps most strikingly, the counties with the least responsive housing supply had large net gains in land value between 1998 and 2009, whereas the fastest growing counties (Contra Costa, Napa, and Solano) lost most of the land value that had accumulated during the boom. Overall, these trends support our initial hypothesis that the Bay Area counties with the most volatile property values and land values during the boom-bust cycle also had the most elastic housing supplies.

Finally, we report an intriguing pattern in our estimates for the ratio of land value to total property value. We further disaggregate our results to the level of a Census tract and regress the change in the land value share of each tract between 1998 and 2006 on its baseline land value share in 1998. We and that census tracts with high initial land shares in 1998 tended to see smaller increases in Iand shares during the boom. Table 5 summarizes the results, by metropolitan area. For example, the coefficient for the San Francisco metropolitan area indicates that a 1 percentage point increase in a census tract’s 1998 land value share was associated with a 0.574 percentage point decrease in the size of the change in the tract’s land share between 1998 and 2006. The net effect is an increase (decrease ) in the land share for tracts with initial land shares below (above) 62.5%.32 This negative correlation holds for all of the metropolitan areas and ranges from -0.393 in Pittsburg to -0.754 in San Jose. Furthermore, the 7?-squared values between 0.24 and 0.59 suggest that the initial land share in 1998 explains much of the variation in land shares during the boom.

Table 5

CoiTelation between Increase in Land Share (1998-2006) and Baseline Land Share (1998)

One explanation for this pattern is that areas with high land shares in 1998 (presumably high-amenity areas ) saw both land values and structural values increase during the boom, but structural values went up relatively more due to markups arising from spatial market power associated with the inelastic supply of access to amenities. Another explanation is that areas with low land shares in 1998 (presumably low-amenity areas) saw large increases in land values relative to Structural values because (1) the relatively elastic supply of land in low-amenity areas kept the implicit price of Structures pinned to construction costs, and/or (2) the general relaxation of credit constraints during the boom had the largest impact on demand in these areas.

Without imposing additional structure on the data, we cannot disentangle the relative importance of these explanations. We discuss them briefly in the hope of motivating future research.

VI. Discussion

Conventional wisdom suggests that variation in the volatility of housing prices across metropolitan areas is primarily due to heterogeneity in the supply elasticity of land. In areas with physical and legislated constraints to urban development, the market price of housing will be relatively sensitive to demand shocks fueled by speculation and relaxation of credit constraints. These demand shocks will be translated into higher land values. Our results do not contradict this hypothesis. However, the land supply hypothesis is not sufficient to explain the variation we observe in the amplitude of the boom and bust within metropolitan areas.

Within the metropolitan areas that we studied, housing prices were relatively volatile in neighborhoods at the urban fringe, where the supply of land for housing is relatively elastic. Decomposing this price volatility into the market value of land and structures revealed two other interesting trends. First, we saw that the average market value of land and structures tended to rise and fall in tandem. Since construction costs rose steadily during 19982009, our results suggest the presence of a wedge between construction costs and the market value of structures. Second, we saw that the size of the wedge tended to be larger in neighborhoods with higher land shares prior to the boom.

One possible explanation for the relatively high volatility at the urban fringe is that the relaxation of credit constraints was particularly important for lower-income households, allowing them to purchase houses at the fringe of the suburbs. Recent work by Landvoigt, Piazzesi, and Schneider (2011) supports this hypothesis.33 Using a calibrated assignment model, they suggest that cheaper credit for low-income households effectively drove up housing prices for the “low end of the market” in San Diego during the boom. Likewise, Mian and Sufi (201 1) use micro data on homeowners to document that during the boom there was positive correlation between borrowing and house price appreciation, and Mian and Sufi (2009) use zip-code level data to document that during the boom there was negative correlation between income growth and subprime mortgage credit. However, these correlations do not offer a specific explanation for the wedge we observe between construction costs and the market value of structures. In the remainder of this section, we discuss two market forces that may help to explain the wedge: imperfect competition and q-theory. Both present interesting opportunities for further research.

Imperfect Competition

If markets were perfectly competitive with no barriers to entry, we would expect the market value of structures to be pinned to construction costs. Our results indicate this is not the case. One explanation is that barriers to entry convey some degree of market power to homeowners in exclusive neighborhoods. The construction industry may be close to perfectly competitive. However, builders cannot simply build more houses in established neighborhoods. Furthermore, development restrictions and zoning regulations often limit the ability of homeowners to expand their houses. With a small proportion of houses on the market at any one time, the unique bundle of amenities provided by a desirable neighborhood may allow home sellers to charge a markup on the structural characteristics of their houses. If there are only a few large houses on the market in the best school district, for example, the implicit market value of square footage may be bid far above construction costs due, in part, to the demand for access to high-quality schools. This hypothesis is consistent with our observation that the neighborhoods experiencing the largest increase in market values of structural characteristics were the neighborhoods with the largest preboom land values (presumably the highest-amenity neighborhoods).

To illustrate the comparative statics of the market power hypothesis, we use Kuminoff and Jarrah’s (2010) iterative bidding algorithm (IBA) to simulate hedonic equilibria with heterogeneous households and houses. The IBA uses a numerical procedure to solve for an assignment of people to houses and a vector of prices that jointly support a hedonic equilibrium, given an initial stock of housing and a set of draws from the joint distribution of income and preferences.

We use the IBA to simulate market outcomes in a stylized metropolitan area containing two built-up neighborhoods, A and B. Each neighborhood is defined to have 100 lots of identical size (li = 7000 sq ft). In neighborhood A, half of the lots contain “small” houses, uniformly drawn from Xi ∼ [1000,2000] sqft. The remaining lots contain “large” houses, uniformly drawn from Xi ∼ [2000,3000] sqft. In neighborhood B, the large and small houses are drawn from the same uniform distributions, but only 20% of the houses are large. The only other difference between the two neighborhoods is that B has more desirable amenities: gB = 75> 50 = gA. Utility is specified as a Cobb- Douglas function of housing and neighborhood attributes:

Embedded Image

Finally, the joint distribution of preferences is drawn from a gamma distribution, and income is drawn from the same empirical distribution used by Kuminoff and Jarrah (2010).34

In the resulting equilibrium, large houses and small houses are both more expensive in neighborhood B, because it provides access to higher-quality amenities. Our main point is that the large houses in B also command larger price premiums because they are in limited supply. To isolate the price premium, we begin by calculating the difference between the average price of large and small houses in each neighborhood. For example, (pj I jA and 2000 < Xj < 3000) - (pj I jA and 1000 < Xj < 2000) measures the difference between the average prices of large and small houses in neighborhood A. Differencing removes the market value of land. This follows because all houses in A have identical lots and they provide access to same amenity. The same is true for all houses in B. Therefore, the percentage markup on square footage in neighborhood B can be defined as

Embedded Image[6]

Figure 635 graphs the relationship between the markup and the difference in amenities provided by the two neighborhoods. In the baseline equilibrium (i.e., gB- gA=75- 50=25) there is a 63% premium on the market value of structures in the high-amenity neighborhood. As (gA- gB)r0, spatial market power diminishes and the equilibrium markup approaches zero.

Figure 6

Markup on $/sqft of Large Houses in the High- Amenity Neighborhood

It is important to reiterate that our hedonic framework in this paper is consistent with the possibility of market power. While Rosen’s (1974) welfare interpretation of the hedonic gradient relies on the maintained assumption of perfect competition, we did not maintain that assumption in order to prove that market outcomes can be described by a hedonic price function in Theorem 1. As Feenstra (1995) demonstrated, introducing imperfect competition into a hedonic equilibrium simply changes the interpretation of the price function coefficients. They describe the implicit market prices of product characteristics, which reflect marginal costs plus markups or discounts.

Taylor and Smith (2000) provided the first hedonic evidence of market power in the market for beach rental properties in North Carolina. In particular, they found access to the beach gave owners the ability to charge markups on structural features of the house that were difficult to modify, such as the number ofbedrooms. In our model, markups can enter through variation in the tract-specific coefficients on square feet of living space. It would be interesting to investigate the extent to which this variation can be explained by spatial variation in the quality of local public goods and urban amenities, perhaps using a higher-resolution version of the quality-of-life indices that have been constructed at the county level (e.g., Blomquist, Berger, and Hoehn 1988). One could also consider generalizing our model to allow for more spatial variation in the implicit prices (and markups) for other structural characteristics.

q-Theory

A second explanation for the wedge between construction costs and the market value of structures is that new houses take time to build. In a long-run equilibrium we would expect the market value of an additional unit of each structural characteristic to equal its marginal construction cost. In the short run, however, a positive demand shock may lead market values to exceed construction costs. As the ratio of market value to construction cost increases, so does the incentive for new development. This is the basic idea behind Tobin’s “q-theory” of capital investment. Figure 736 illustrates his logic in the context of our hedonic simulation. It graphs the difference ratio that enters the markup formula in [6] against the share of “large” houses in the high-amenity neighborhood. Holding the amenity differential fixed, we simulate the transition path to a long-run equilibrium by incrementally “remodeling” the small houses in the high-amenity neighborhood as large houses. Each time we remodel an additional 10% of houses, we solve for a new set of equilibrium prices and location choices. While the total number of houses is constant throughout this exercise, there is an increase in the total square footage of living space. As living space increases, the market-clearing difference ratio moves closer to 1, consistent with Tobin’s description of the transition to a long-run equilibrium.37

Figure 7

Q-Ratio and the Share of Large Houses in High- Amenity Neighborhood

While Tobin (1978) noted the potential for his model to explain the evolution of housing prices, there have been few applications. Part of the difficulty is that the logic of q-theory applies to structures, not to land, since we usually think of the supply of land as being fixed.38 Thus, to evaluate the testable implications of q-theory we need credible estimates for the market value of structures.39 Given credible estimates for the value of structures and construction costs, the interpretation of qratios is still complicated by several factors, including the presence of taxes, search costs, lender fees, and the aggregation over individual housing characteristics, each of which would be expected to have a separate marginal q.40 These factors may lead to q-ratios that differ from 1 in a long-run equilibrium.

As a first pass at assessing the predictions of q-theory in the housing market, we approximate q by dividing our annual estimates for the market value of structures by the replacement cost estimates for the cost of rebuilding the average home in Miami, San Francisco, Boston, and Charlotte. The results appear to be broadly consistent with the implications of q-theory.41 Figure 842 illustrates that our preboom and postbust estimates for structural value would imply q-ratios close to 1 in some areas. During the boom we generally see qratios increase and then decrease, consistent with lagged construction. In Miami, for example, our estimate for the q-ratio increases from 0.92 in 1998 to 1.5 in 2007, and then decreases back to approximately 0.92 by 2009. The decrease in the q-ratio is likely due to decreased demand as well as increased supply. Boston and Charlotte show similar patterns, although their q-ratio is more elevated at the beginning of the time frame. San Francisco however begins with a q-ratio of 1.68, peaks at over 3 in 2005, and then drops back to 1.69 in 2009. Modeling the dynamics of housing supply and demand as they relate to q-theory would be an interesting direction for future research.

Figure 8

Q-Ratios for Miami, San Francisco, Boston, and Charlotte

VII. Conclusion

The boom-bust cycle in real estate during the 2000s was staggering in its size and impact on the world economy. It is extremely important to characterize the dimensions of this volatility and to understand why they arise. Hedonic pricing models provide a useful conceptual framework for this task, but recent studies have questioned their ability to deal with unobserved attributes of houses and neighborhoods. We answered this question by refining the standard hedonic pricing model to address omitted variables and by using our refined estimator to characterize how the market values of land and structures evolved within and between several major metropolitan areas between 1998 and 2009.

Concerns about omitted variables led us to make three refinements to the conventional hedonic strategy for inferring the market value of residential land. We used spatial fixed effects to recognize that land values may be highly localized due to the spatial distribution of nonmarket amenities; we controlled for unobserved housing attributes by allowing the implicit per unit prices of structures to vary from neighborhood to neighborhood; and we exploited data on the universe of housing sales to recognize that shocks to wealth, credit, tastes, and amenities may cause the shape of the housing price function to change from year to year. These refinements increased our estimates by an order of magnitude. Our estimates for the average market values of land in major metropolitan areas were very similar to the preboom estimates from Davis and Palumbo’s (2008) replacement cost model, but an order of magnitude larger than estimates from Glaeser, Gyourko, and Saks’s (2005) conventional hedonic model.

Consistent with prior studies of the boombust period, we found that land values were generally more volatile in metropolitan areas where the supply of developable land was less elastic. Focusing on the spatiotemporal variation within metropolitan areas yielded two novel findings. The least valuable land at the urban fringe was the most volatile, and the market value of structures exceeded construction costs during the boom, with the largest markups occurring in the most affluent neighborhoods. Interestingly, the volatility in the market value of structures was far greater than what has been assumed by prior replacement cost studies that pin the value of structures to construction costs. Finally, we suggested two potential explanations for the wedge between construction costs and the market value of structures: imperfect competition and q-theory. Formal tests of these hypotheses await future research.

Finally, our results also have some implications for the related literature on land value taxation. Over the years there has been considerable interest in the possible efficiency gains from replacing the property tax with a tax on the market value of land or a split tax with separate rates on land and structures (e.g., Banzhaf and Lavery 2010; Cho, Lambert, and Roberts 2010). One of the stylized facts about land value taxation is that, if implemented, it would lead to more variable revenue streams than the current property tax because land values are more susceptible to speculation (Bourassa 2009). At a practical level, part of the challenge with implementing a tax on land is determining its market value. Our findings have three implications for this literature. First, the replacement cost approach may overstate the value of land during a boom-bust cycle. Second, the bias may not be neutral. Our results suggest it would be largest in the highest-amenity neighborhoods. To the extent that homeowners in these neighborhoods collect markups on structures, they would have a disincentive to invest in structural improvements if they were effectively taxed on these improvements by a replacement cost scheme for determining land value. Finally, our estimates suggest that moving from a property tax to a land tax may actually help to stabilize revenue streams for some municipalities.

Acknowledgments

We thank two anonymous referees, Dan Bromley, Leah Brooks, Morris Davis, Paul Ellickson, Gerald Korngold, Byron Lutz, David C. Lincoln, Elizabeth Plummer, V. Kerry Smith, David Wildasin, Kent Zhao, and seminar participants at the David C. Lincoln Fellowship Symposium and the Annual Conference of the National Tax Association for valuable comments and suggestions on this research. We gratefully acknowledge research support by the Lincoln Institute of Land Policy.

Appendix

Table A1.

Housing Units. Property Values, and Land Vales in the Boston Area. 1998-2008

Footnotes

  • The authors are, respectively, assistant professor, Department of Economics, Arizona State University, Tempe; and assistant professor, Department of Economics, Brigham Young University, Provo, Utah.

  • 1 In the literature on land valuation, the hedonic approach is also referred to as the “contribution” approach.

  • 2 Indeed, some prior hedonic estimates for land value seem strikingly low. For example, the estimates by Glaeser, Gyourko, and Saks (2005) suggest that the value of a full acre of land in Boston in 1998 was less than $30,000 and in San Francisco it was less than $200,000.

  • 3 This figure shows that housing prices more than doubled from 1998 to 2006 but then declined substantially from 2006 to 2009 for 20 metropolitan areas included in the index. The data for this figure comes from the Standard and Poor’s Case-Shiller U.S. National Values Home Price Index. For documentation, see http://www.standardandpoors.com.

  • 4 This figure shows the substantial heterogeneity in price changes both across and within metropolitan areas. The three lines show the evolution of prices for a metropolitan area’s bottom “tier,” middle “tier,” and top “tier” of the price distribution. Breakpoints are defined by metropolitan area as of August 2010. The data for this figure also comes from the Standard & Poor’s Case-Shiller Home Price Index. *Supply elasticities are based on Saiz (2010).

  • 5 One can find the same pattern in the other 16 major metropolitan areas tracked by the Case-Shiller index.

  • 6 The new empirical literature on Tiebout sorting stresses the need to recognize that neighborhood amenities are typically endogenous to the collective location choices made by the households in a metropolitan area (Kuminoff, Smith, and Timmins 2010). For example, urban development may provide opportunities for dining and nightlife, while increasing traffic congestion and degrading air and water quality. Homeowners may be asked to vote on assessments to fund open space preservation and public schools. Academic performance among students in those schools may depend on the distribution of income and education among parents in the school district. While we do not model these mechanisms, our framework is consistent with their presence.

  • 7 Theorem 1 can also be proven under an alternative assumption that households ignore their own contributions to the supply of neighborhood amenities.

  • 8 Our theorem recognizes that neighborhood amenities may be determined endogenously through a Tiebout sorting process. In contrast, Bajari and Benkard (2005) characterize markets where product attributes (other than price) are determined exogenously. They also model unobserved product attributes and restrict utility to be Lipschitz continuous in order to guarantee Lipschitz continuity of the price function. While it is straightforward to add these elements to our model, they are unnecessary to guarantee the existence of a price function.

  • 9 This assumes the undeveloped parcel is also zoned for residential development.

  • 10 Cheshire and Sheppard (1995) distinguish between these two components of land value. While we could certainly do the same, it is not essential to our analysis.

  • 11 While the published version of Davis and Palumbo’s paper presents estimates for 1984 to 2004, the Lincoln Institute of Land Policy maintains a web page where their estimates are updated as new data become available: http://www.lincolninst.edu/subcenters/land-values/metro-arealand-prices.asp.

  • 12 In earlier work, Jackson (1979) used aggregate census tract data to estimate a coarse approximation to a hedonic price surface in Milwaukee. In principle, his results could also be used to develop an approximation to the value of land.

  • 13 In general, the shape of the equilibrium hedonic price function will vary with changes in tastes, wealth, regulations, and spatially delineated amenities. See Kuminoff, Smith, and Timmins (2010 and Kuminoff, Parmeter, and Pope (2010) for details.

  • 14 Abbott and Klaiber (2011) make a similar point in the context of identifying what occupants are willing to pay for a particular amenity.

  • 15 The U.S. Census Bureau defines Census tracts to be “as homogeneous as possible with respect to population characteristics, economic status, and living conditions.” See “Chapter 10: Census Tracts and Block Numbering Areas, U.S. Census Bureau,” Geographic Areas Reference Manual, which can be found at http://www.census.gov/geo/www/garm.html.

  • 16 For example, all of the houses in a Census tract may be located near public open space, but the handful of lots that are adjacent to the public lands may sell at an additional premium.

  • 17 A referee notes that we could have used an explicit spatial model to attempt to mitigate the bias from spatial correlation between housing attributes and omitted variables. Examples of explicit spatial models include the spatial error, spatial lag, general spatial, and spatial Durbin models. We chose not to use these models for three reasons. First, there is prior evidence that spatial fixed effects outperform explicit spatial models in terms of recovering accurate estimates for the implicit prices of housing attributes (Kuminoff, Parmeter, and Pope 2010). Second, our fixed effects model produces estimates that are very similar to Davis and Palumbo’s (2008) replacement cost calculations prior to the boom, as we discuss in Section V. We interpret this evidence as validating our model, since we expect the replacement cost methodology to provide good estimates for land values during periods of market stability. The third reason is that our sample sizes and covariate matrices are too large to implement the maximum likelihood routines for estimating flexible spatial autocorrelation models. As computing power continues to improve, an interesting avenue for future research would be to develop new models that combine spatial fixed effects with strategies for addressing more localized forms of spatial autocorrelation within neighborhoods.

  • 18 The other eight are Cincinnati, Detroit, Los Angeles, Oakland, Philadelphia, Pittsburg, San Jose, and Tampa.

  • 19 His estimates are generated using information on geographic constraints, regulatory constraints, and predetermined population levels in each metropolitan area.

  • 20 A related literature on gentrification investigates how the demographic composition of neighborhoods within a metropolitan area changes as people migrate from the suburbs to the cities, and vice versa. See McKinnish, Walsh, and White (2010) for an interesting example of this line of research, as well as citations to the broader literature.

  • 21 The results (reported in their Table 4) support their hypothesis that the areas that we would expect to be more highly regulated have larger differences between construction costs and housing prices.

  • 22 Our results are generated using a simple linear model estimated according to equation [4], using the combination of control variables that comes as close as possible to the specification from GGS. Complete results will be provided upon request.

  • 23 For example, the CMHPI uses a slightly different definition for metropolitan areas than the AHS. Also, since DP do not report average lot size, we use the average lot size in our data to convert the DP estimates to a $ per acre measure. So, there are certainly some differences in the estimates that are caused by differences in spatial-temporal components of the underlying datasets, but we think that it is highly unlikely that these drive the large differences we document between GGS and DP.

  • 24 We dropped tracts with fewer than 15 observations per year to better ensure accurate estimates of tract-specific land values. This drop (approximately 33.5% of our observations) was necessary given our Census tract fixed effects identification strategy.

  • 25 The replacement cost results are the Davis and Palumbo (2008) estimates that have been updated and provided at http://www.lincolninst.edu/subcenters/land-values/. Also note that we were unable to obtain assessor data for the year 2009 in Boston. Therefore all results reported for Boston are for the 1998-2008 time frame.

  • 26 Column 1 shows the evolution of total land value, and Column 2 shows the evolution of total structural value by Primary Metropolitan Statistical Area. Both columns show estimates derived using the hedonic method and the replacement cost method.

  • 27 Figures are produced using our assessor data and data from Freddie Mac’s CMHPI as documented by Davis and Palumbo (2008).

  • 28 These figures were produced using our hedonic approach to estimating land value. Black represents greatest absolute change, positive or negative.

  • 29 Charlotte and Miami have only one or two counties with available assessor data, so their maps are less interesting.

  • 30 This is despite the fact that many have been concerned about gentrification in urban areas, which is likely to dampen the volatility.

  • 31 All five trends are also present in the Boston area. For brevity, we provide a table with results in the Appendix.

  • 32 62.5% is the value for the 1998 land share that would correspond to a prediction of no change in the land share over the boom. It is calculated by dividing the regression intercept by the slope coefficient (= 0.359/0.574). The regression predicts decreases for larger baseline land shares and increases for smaller baseline land shares.

  • 33 Also see their paper for a useful summary of the recent literature on the role of credit in the housing market boom and bust.

  • 34 Matlab code to reproduce the simulation results is available from the authors upon request.

  • 35 Markups are based on the hedonic simulation described in Section VI. The percentage markups on the price of square footage are calculated using equation [6]. In the low-amenity neighborhood, 50% of houses are large and 50% are small. In the high-amenity neighborhood, 20% of houses are large and 80% are small.

  • 36 Q-ratios are based on the hedonic simulation described in Section VI. Ratios are calculated using the difference ratio in the markup formula [6]. The share of large houses in the low-amenity neighborhood are held constant at 50%. The level of the amenity in the low- (high-) amenity community is held constant at 50 (75).

  • 37 Since the difference ratios are based on average prices, rather than marginal prices, we would not necessarily expect them to be exactly equal to 1 in a long-run equilibrium.

  • 38 A rare exception is residential communities that are built on swamps that have been drained or wetlands that have been filled.

  • 39 Jud and Winkler (2003) apply q-theory to housing values without decomposing them into the value of land and structures.

  • 40 For a discussion of aggregation issues in a setting with multiple capital goods and citations to the broader literature on the empirical measurement of q-ratios, see Wildasin (1984).

  • 41 We are grateful to David Wildasin for first bringing this to our attention.

  • 42 The q-ratios illustrated above are the ratio between the average market value of structures within the metropolitan area and the structural costs of the average house within the metropolitan area as given by the updated estimates of Davis and Palumbo (2008) that can be found at http://www.lincolninst.edu/subcenters/land-values/metropolitan-area-land-prices.asp.

References