An Econometric Analysis of Land Development with Endogenous Zoning

Van Butsic, David J. Lewis and Lindsay Ludwig

Abstract

Zoning is a widely used tool to manage residential growth. Estimating the effect of zoning on development, however, is difficult because zoning can be endogenous in models of land conversion. We compare three econometric methods that account for selection bias in a model of land conversion: a jointly estimated probit-logit model, propensity score matching, and regression discontinuity. Our results suggest that not accounting for selection bias leads to erroneous estimates. After correcting for selection bias we find that zoning has no effect on a landowner’s decision to subdivide in a rural Wisconsin county. (JEL Q24, R14)

I. Introduction

One of the most widely discussed land management issues of recent years is urban sprawl—noncontiguous development on previously undeveloped agricultural and forested landscapes. Urban sprawl is criticized largely on the grounds that development consumes an excessive amount of land that would otherwise have provided market and nonmarket benefits associated with open space. A corollary of excess sprawl is the loss of farmland, since exurban growth often occurs in areas that are primarily agricultural. Local zoning ordinances remain probably the most widespread land use control influencing sprawl. In general, the effects of zoning on land development may vary across regions and are not well understood. Some argue that defining specific zones on the landscape for different types of development and open space can be viewed as a desirable feature of so-called “smart growth” policies (Danielson, Lang, and Fulton 1999). In contrast, others argue that minimum lot zoning requirements can exacerbate sprawl by forcing consumption of larger lot sizes than the market would dictate in the absence of zoning (Fischel 2000). Empirical evidence regarding the effects of zoning on land development and sprawl is limited (McConnell, Walls, and Kopits 2006) and requires an understanding of how individual landowners make decisions in response to local market conditions and zoning constraints.

Accounting for zoning policies in empirical land use models requires that researchers address the nonrandom application of zoning across a landscape. Including zoning in a model of land development can induce a form of selection bias in econometric estimation for at least two reasons. First, zoning policies may simply “follow the market” if local governments systematically consider the land market in the application of zoning and variance decisions (Wallace 1988). In particular, if a price differential exists between zones, then local governments will be pressured to expand the high-price zone, or to simply grant variances—an often less costly alternative than rewriting the ordinance. To the extent that a researcher does not observe all factors that influence a parcel’s development value, there is the strong possibility that the same unobservable factors that influence development will also influence zoning decisions, presenting a selection bias estimation problem commonly known as “selection on unobservables” (Cameron and Trivedi 2005, Ch. 25).

Second, zoning can induce selection bias in a land development model because parcels that are placed in a certain zone might have different distributions of the underlying covariates than parcels placed in an alternative zone. For example, parcels closer to busy roads may be less likely to be zoned restrictively due to the influence of road access on development potential. As such, zoning rules may be applied to a nonrandom sample (only the parcels with the unique attribute are zoned), and even if one can observe all characteristics that influence development decisions, parametric econometric methods can produce biased estimates due to differences in the distributions of the underlying covariates (Heckman et al. 1996). In this case it is difficult to separate the effect of zoning from the effect of the observed characteristic (e.g., proximity to busy roads), even though parcels are selected for specific zoning rules on “observable” characteristics.

The purpose of this paper is to conduct a parcel-level econometric analysis of the ability of local zoning (exclusive agriculture zoning [EAZ]) and statewide tax incentives (Wisconsin’s Farmland Preservation Program [FPP]) to influence land use conversion in an exurban region outside of Madison, Wisconsin. Using a unique spatial panel dataset derived from five parcel-level cross-sectional landscape observations between the years 1972 and 2005, we estimate the effect of EAZ and FPP on the likelihood of land development, using multiple econometric techniques that correct for different forms of selection bias. While corrections for selection bias have been commonly applied to estimating the effects of zoning on property values in linear hedonic models (Wallace 1988; McMillen and McDonald 1991, 2002), we extend the application of these techniques to nonlinear models of the discrete decision of whether to subdivide and develop land—the key decision in analyses of urban sprawl.

We acknowledge the two types of selection bias discussed above and compare three econometric approaches to estimate the effects of endogenous land use policy on land development. First, we jointly estimate a parcel’s selection into exclusive agricultural zoning (the zoning decision) with the decision to develop (subdivide) the parcel. The model specifies that these decisions are influenced by common observable characteristics (e.g., parcel size, distance from roads, etc.) and, importantly, common unobservable characteristics. The decisions are estimated within a joint discrete-choice framework that embeds correlated unobservables across the decisions (Greene 2006). Econometric estimation is performed with maximum simulated likelihood and allows for an empirical test of selection bias and unobserved heterogeneity with respect to inclusion of a parcel in an exclusive agricultural zone. Although the analysis extends the “selection on unobservables” approach for endogenous treatment effects to nonlinear models (e.g., Cameron and Trivedi 2005, Ch. 25), identification relies on potentially strong functional form assumptions.

Second, we perform propensity score matching to estimate the effects of zoning on the land use decisions of those parcels that are “treated” with exclusive agricultural zoning (the average treatment effect on the treated). Matching methods exploit heterogeneity in the zoning status across parcels and provide potentially unbiased estimates of the treatment effect even if the zoning board selects parcels into exclusive agricultural zoning in a nonrandom fashion. In contrast to the joint discrete-choice estimation exercise, matching methods assume that selection into zoning is based only on characteristics observable to the researcher (e.g., prime farmland, distance from service districts, etc.). Relative to joint estimation, the strength of matching is that it imposes minimal functional form restrictions in estimation, although the estimates will be biased if there are unobservable characteristics that influence both the zoning and the development decisions.

Finally, we exploit a discontinuity in the application of FPP to examine the effects of income tax credits on the landowner’s decision to subdivide. Wisconsin’s FPP provides income tax credits to landowners who maintain the agricultural status of EAZ parcels of at least 35 acres. Parcels less than 35 acres that are zoned EAZ are still subject to subdivision restrictions but are not eligible for income tax credits. Thus, the discontinuity in eligibility for FPP at 35 acres allows for estimation of both semiparametric and fully parametric discrete-choice models over a sample where the application of FPP is quasirandom.

Our estimation results yield the following basic conclusions. First, under the assumption that zoning is exogenous, exclusive agricultural zoning significantly reduces the probability of subdivision. In our application, this result leads to the erroneous conclusion that agricultural zoning significantly alters development patterns. Second, joint estimation of the zoning and development decisions provides strong evidence that the two decisions are influenced by correlated unobserved heterogeneity, contradicting the assumption of exogeneity in the first model. Joint estimation (waiving the assumption of exogeneity) indicates that zoning has no effect on the probability of subdivision. Third, results derived from matching methods largely confirm insights drawn from joint estimation: zoning has no effect on the probability of subdivision for the parcels that receive the treatment. Fourth, the discontinuity analysis shows that eligibility for Wisconsin’s FPP has at most a weak effect on the probability of subdivision.

II. Empirical Analyses of Zoning

Previous economic analyses of zoning focused on the property price effects of various zoning restrictions. Relevant for our application, Henneberry and Barrows (1990) provide evidence that EAZ increases farmland values in Wisconsin. However, the results from Henneberry and Barrows are contingent on an assumption that EAZ is exogenous in a model of land values. Wallace (1988) provides a widely cited hedonic analysis of the effects of zoning on land values in King County, Washington, concluding that zoning tends to “follow the market”—areas of high development value are more likely to be zoned to allow development. A series of papers by McMillen and McDonald (1989, 1991, 2002) provide further evidence on the effects of zoning on land values, concluding that zoning authorities systematically consider the local land market when selecting parcels for particular zoning rules (McMillen and McDonald 1989). A consistent estimation strategy in the hedonic literature on endogenous zoning is a two-stage estimation approach similar to Heckman’s (1979) seminal two-stage sample selection model. In the first stage, the zoning decision is typically modeled as a discretechoice decision process. In the second stage, results from the first stage are then used to correct for the endogeneity of zoning in a variant of a linear hedonic model of land values. The “selection on unobservables” approach used in this paper is motivated by the early hedonic research on two-stage models of zoning and land values, with the difference arising that our “second-stage” model is a nonlinear model of the binary decision to develop land.

The recent economics literature on land use change has focused on parcel-scale discretechoice models of the land development decision. A variety of econometric approaches has been used in prior work, including probit models of the binary development decision (Bockstael 1996; Carricón-Flores and Irwin 2004); conditional logit models of decisions involving agriculture, forest, and development (Newburn and Berck 2006; Lewis and Plantinga 2007); duration models of the time to conversion (Irwin and Bockstael 2002; Towe, Nickerson, and Bockstael. 2008); and jointly estimated probit-Poisson models of the decision to develop and the decision of how many new lots to create (Lewis, Provencher, and Butsic 2009; Lewis 2010). In contrast to the hedonic literature cited above, most of the econometric land use change literature treats zoning as exogenous in estimation (Irwin and Bockstael 2004; Newburn and Berck 2006; Towe, Nickerson, and Bockstael 2008) or ignores zoning altogether (Lubowski, Plantinga, and Stavins 2006; Lewis and Plantinga 2007). While some analyses argue that zoning rules are exogenous in their application due to a natural experiment in policy design (McConnell, Walls, and Kopits 2006; Towe, Nickerson, and Bockstael. 2008; Lewis, Provencher, and Butsic 2009), other analyses note the possibility that zoning is endogenous but do not attempt to address the problem, often because zoning is not a central feature of the analysis.

Despite their common grounding in land values, it is evident that the discrete-choice land use change literature has diverged substantially from much of the hedonic literature when it comes to handling potential selection bias associated with zoning.1 One reason for the divergence is the fundamental difficulty associated with modeling selection bias in linear versus nonlinear models. While linear models of land values can use widely understood variants of Heckman’s (1979) two-step empirical sample selection methodology, such methods are, in general, not appropriate for the type of nonlinear models used in the land use change literature (Greene 2006). However, recent advances in modeling selection problems with nonlinear models (Greene 2006; Lewis, Provencher, and Butsic 2009), combined with widely used quasi-experimental techniques such as matching methods and regression discontinuity, provide an opportunity to reconsider how selection problems associated with zoning can be handled in discrete-choice models of land use change.

III. Study Area, Relevant Land Use Policies, and Data

The study area for this analysis, Columbia County, Wisconsin, is a fast-growing county located just north of the Madison metropolitan area. While still considered rural in many areas, Columbia County has experienced significant growth in rural-urban fringe development from nearby Madison (McFarlane and Rice 2007). Conflicts have arisen in Columbia County due to farm odors, slow machinery on roads, and the operation of machinery at late hours (Columbia County Planning and Zoning Department 2007).

Agricultural Zoning and Farmland Preservation

In 1969, Columbia County began active attempts to slow the conversion of agricultural lands. EAZ was established in the county in 1973 in an attempt to limit rural subdivisions, and parcels zoned EAZ can subdivide under three conditions. First, EAZ parcels can create one new residence per 35 acres, as long as the residence is related to farm work. Second, landowners can ask the town board to rezone their property to allow residential development. Third, landowners can request a variance from EAZ rules to develop their land. All three conditions appear to have been widely used since EAZ was originally established.

In 1977, the FPP was established by the state of Wisconsin to complement EAZ and preserve Wisconsin farmland through a system of tax credits and land use restrictions. Owners of farmland can qualify for the tax credit if they sign a farmland preservation agreement restricting development of land for a specific amount of time, or if their farmland is zoned for exclusive agricultural use (State of Wisconsin 2007). Farmland owners who qualify for the tax credit may claim a sizable tax break each year; currently the maximum an owner can claim is $4,200 a year, while the average payment in Columbia County is $641 per year. Generally, the tax credit increases as property taxes increase and household income decreases (State of Wisconsin 2007). Given that data on whether land owners enroll in FPP is unavailable, our analysis assesses the impact of eligibility for this program.

A convenient feature of zoning in Columbia County is that areas not zoned EAZ have a uniform minimum lot size: 20,000 sq ft (15,000 sq ft for panels prior to 1991). In our setting, we propose that minimum lot size is the most restrictive facet of zoning, as lot size restrictions will likely have a larger influence than other facets (such as minimum set backs or height restrictions) on the ability of a landowner to subdivide. Therefore, in this setting, the regulatory landscape can generally be described by two zones: EAZ and non-EAZ. This allows for estimation of zoning as a binary treatment variable.

Spatial-Temporal Data and Development Trends

We obtained spatial data on development decisions and parcel attributes over a number of years for two townships in Columbia County: Lodi and West Point, neighboring townships located in the southwest corner of the county bordering Lake Wisconsin and the

Wisconsin River (Figure 1). The parcel-level data was generated by the Center for Land Use Education at the University of Wisconsin-Stevens Point. Property boundaries were reconstructed over the study area for five points in time: 1972, 1983, 1991, 2000, and 2005. Using 2005 digital parcel data, historic property boundaries were recreated through a process of “reverse parcelization” that selects and merges parcels using historic tax records and plat maps (see McFarlane [2008] for a complete description of the data construction). Zoning data is constructed from the Columbia County Planning Department.

Figure 1

Lodi and West Point Townships in Columbia County, Wisconsin

The full dataset has 21,798 individual parcel observations. Parcels that could not legally subdivide were dropped from this dataset; these include public lands and parcels too small to subdivide due to zoning restrictions. Additionally, all parcels adjacent to Lake Wisconsin were dropped from the analysis because waterfront parcels are arguably part of a different land market than nonwaterfront property.2 The final dataset used for estimation contains 5,764 observations. A host of variables is thought to influence the decision to enroll a parcel in EAZ and the decision to subdivide. The variables used in the econometric analysis, and summary statistics for the variables, are presented in Table 1.3

Table 1.

Description of Variables and Summary Statistics by Policy

More than 30 years after EAZ and the FPP were established, Lodi and West Point townships are still experiencing a loss of agricultural lands. Out of 1,186 developable parcels in our dataset in 1972, 328 (28%), subdivided by 2005. There are 539 parcels zoned EAZ that are eligible for FPP in 1972, and 132 (24%) of these parcels subdivided by 2005. There are 386 parcels in EAZ that are too small to qualify for FPP, and 77 (20%) of these subdivided by 2005. For the non-EAZ parcels 92 of the 228 (40%) parcels less than 35 acres subdivided by 2005, while 27 of the 33 (81%) parcels larger than 35 acres subdivided over our study period. Thus, summary statistics indicate that parcels zoned EAZ and those eligible for FPP payments are less likely to subdivide. However, summary statistics also indicate that development certainly happened on parcels with various combinations of EAZ and FPP, indicating the widespread application of rezoning and variances in this region (see Ludwig [2008] for further information).

The data from Lodi and West Point townships admittedly represents a small geographic area compared to land use change models that use data from full counties (Lewis, Provencher, and Butsic 2009), multiple counties (Bockstael 1996; Lewis and Plantinga 2007), or the nation (Lubowski, Plantinga, and Stavins 2008). In land use change models, a small geographic sample raises two concerns. First, if the small geographic location results in a small sample size, this can lead to Type 1 errors. The panel nature of our data increases the sample size to 5,764 observations, large enough to assure statistical precision. Additionally, when we use econometric techniques that do not exploit the panel nature of the data (resulting in smaller sample sizes), our results remain relatively stable compared to models that use the full sample. Second, the transferability of these results to other settings may be hindered by the specialized sample. However, the townships examined here share multiple characteristics typical of exurban townships: proximity to urban areas, mixed agriculture and large-lot subdivision, and zoning boards comprised of local landowners.

Expanding our sample geographically is prohibitive for two reasons. First, historical reconstruction of parcel-level land use change is labor intensive and expensive. Second, expanding the geographic area would hinder our identification strategy. The fact that zoning is binary in our sample (EAZ or non-EAZ) allows us to use econometric techniques appropriate for evaluating binary treatments. Using data from additional municipalities would introduce other land use policies, negating our ability to use these techniques. Overall then, while the sample comes from a small geographic area, the number of observations is large enough to ensure statistical precision, the townships are typical of exurban development, and the townships provide a unique mechanism for evaluating the effects of land use policy.

IV. Estimating The Effects of Eaz on Development

The landowner’s decision problem is cast as a problem of whether to subdivide and develop his land at time t. Much of the land use literature is motivated by Capozza and Helsley’s (1989) deterministic optimal stopping problem, whereby development takes place once development rents (assumed to be increasing over time) equal the rents from agriculture (assumed to be constant over time). We cast the decision problem in terms of the reduced form net land value of subdividing at time t, where Snt = 1 if parcel n subdivides in time t, and Snt = 0 otherwise. Formally, the land value of subdivision is LVnt, and subdivision occurs when

Embedded Image [1]

where wnt is a set of observable parcel characteristics, EAZnt is a binary indicator of the zoning status of parcel n, and μn and vnt denote parcel-specific characteristics observed by the parcel owner but not by the analyst. We model μ n as an iid standard normal random effect to reflect the panel structure of our data—repeated parcel-level decisions are observed over time.

The zoning agency’s decision problem is cast as a problem of whether to impose exclusive agricultural zoning status on parcel n (EAZnt=I) or not (EAZnt = 0). As is typical in local governments throughout the United States, the landowner of parcel n can lobby the local government regarding the zoning decision. The net value to the zoning agency of imposing EAZ status on parcel n is defined as VZnt, and EAZnt = 1 when

Embedded Image [2]

where xnt is a set of parcel characteristics observable to the researcher and the zoning agency, and εnt is a set of parcel characteristics observable to the zoning agency but not the researcher. In this setting, some of the same observable characteristics that influence zoning can also influence the net value of subdividing (xntwnt), and, importantly, some of the same unobservable characteristics that influence zoning can be correlated with unobservable characteristics that influence the net value of subdividing. Such correlation implies that EAZnt is an endogenous variable when attempting to estimate the parameters in [1].

Full Information Maximum Likelihood (FIML) Estimation: Selection on Unobservables

One approach to obtaining a consistent estimate of the effects of EAZnt on development is to jointly estimate [1] and [2] with correlated unobservables across equations. Such a strategy can be implemented with a fully parametric approach, and we adopt such a frame-work in this section. In particular, we make the following assumption:

Embedded Image [3]

By further assuming that v nt is logistically distributed, we follow Greene (2006) and model the two equations as a joint probit-logit model. In particular, by writing V(wnt, EAZnt) as a linear function of parameters, the probability that farm n subdivides in time t, conditional on wnt, μn, and EAZnt, can be written as

Embedded Image [4]

Further, by writing G(xnt) as a linear function, Greene (2006) shows that the probability of the observed EAZ behavior on farm n in time t, conditional on χnt and μn, can be written as

Embedded Image [5]

where the term 2EAZnt – 1 is a computational and notational convenience that exploits the symmetry of the normal distribution. Conditional on wnt, xnt, and μn, the joint probability of the observed behavior on parcel n is

Embedded Image [6]

The unconditional probability of the observed behavior is generally stated as

Embedded Image [7]

Equation [7] can be solved with maximum simulated likelihood by taking R draws from the normal distribution of μ n. The log likelihood function to be maximized over N parcels is

Embedded Image [8]

This function is maximized by choice of the parameter vector (β,λ,α,σ,ρ), and accounts for correlated unobservables across the decisions to zone and subdivide, and the panel structure of the data by modeling random parcel effects. The correlation coefficient ρ deserves special attention. In this model, p corrects and tests for unobserved selection bias between the decisions to zone and the decision to subdivide. The sign of p indicates the direction of correlation between the joint decisions, while its magnitude and standard error measure its significance. A negative statistically significant ρ indicates that parcels that are more likely to be zoned EAZ are less likely to subdivide.

Propensity Score Matching: Selection on Observables

An alternative to the FIML model is the use of propensity score matching. In this setting, EAZ is still modeled as endogenous to the decision to subdivide, but we assume that we can observe all important inputs to the decision to zone a parcel EAZ and the decision to subdivide. Additionally, we assume that the same characteristics that influence the decision to zone a parcel EAZ also influence the decision to subdivide. Matching works by comparing outcomes on parcels that were zoned EAZ and those that were not zoned EAZ but are similar in observed baseline covariates. The goal of matching is to make the covariate distributions of EAZ and non-EAZ parcels similar. In this way matching mimics a random sample. Following the notation used earlier, but with unscripted letters equaling population averages, the average treatment effect for the treated (ATT) is defined as

Embedded Image [9]

The key is to find a proxy for the unobservable counter factual E[S(EAZ = 0) IEAZ = 1). Under the assumption of common support and unconfoundedness (Caliendo and Kopeinig 2008),

Embedded Image [10]

where C is a vector of characteristics that affect both the selection into EAZ and the likelihood of subdivision, and the subscript on S denotes the outcome (1 = subdivision; 0 = no subdivision). Matching on C implies controlling for a high dimensional vector. Thus we follow the insights of Rosenbaum and Rubin (1983a) and use the propensity score defined as P(C) = Pr(EAZ = 11 C), which is the probability that a parcel is zoned EAZ given its set of covariates C. We can rewrite the estimate of ATT as

Embedded Image [11]

In order to implement propensity score matching we must specify the zoning selection equation, which assigns a propensity score to each observation. The selection equation should only include variables that affect the participation decision (zoned EAZ or not) and the subdivision outcome (Heckman, Ichimura, and Todd 1998; Dehejia and Wahba 1999). In our case, we use a probit specification similar to the “first stage” of the FIML model.

Formally, to derive equation [11], two conditions need to hold (Becker and Ichino 2002). First, the pretreatment variables must be balanced given the propensity score:

Embedded Image [12]

Second, the assignment to the treatment must be unconfounded given the propensity score:

Embedded Image [13]

If equation [12] is satisfied, the distribution of the underlying covariates is the same regardless of treatment. That is, the treatment is randomly assigned. Therefore, treated and untreated parcels will be observationally identical on average. To validate these two requirements, we implement the propensity score matching algorithm derived by Becker and Ichino (2002), which assures that the propensity scores used for comparison are balanced in the underlying covariates.

A variety of matching estimators exist that have different trade-offs between variance and bias. The central questions when choosing a matching estimator are what constitutes a match and should one match with or without replacement? There is little theory to guide the choice of matching estimators—matching without replacement yields the most precise estimates—but only in relatively large datasets. We follow Caliendo and Kopening (2008) and test multiple matching estimators. We utilize radius matching, kernel matching, and nearest neighbor matching without replacement to estimate the average treatment effect on the treated (ATT) of EAZ on parcels not eligible for FPP. Finally, we check for “hidden bias” that may occur if there is unobserved heterogeneity in our dataset using Rosenbaum bounds (Becker and Caliendo 2007).

We model each panel as an individual experiment where the treatment is applied at the beginning of each panel and the outcome is the state of the parcel at the beginning of the following panel. In total, we estimate 12 equations (three matching estimators by four panels) to estimate the effect of EAZ on the likelihood of a parcel to subdivide. The effect of EAZ on development is identified separately from the effect of FPP by limiting our sample to those parcels less than 35 acres in size, and thus not eligible for FPP.

Regression Discontinuity (RD): Effects of Eligibility for the FPP

Turning our attention to estimating the effect of FPP eligibility we return once again to the selection of a parcel into EAZ, equation [2]. In our setting there is a sharp discontinuity where parcels that receive the treatment in equation [2] are eligible for FPP only if they are larger than 35 acres. Thus we are faced with a second policy assignment:

Embedded Image [14]

where FPPn represents the eligibility of an individual parcel for FPP, EAZn is the state of zoning, and acres is the size of the parcel. As acres is likely correlated with the decision to subdivide, the assignment mechanism is clearly not random, and a comparison of outcomes between treated and nontreated parcels is likely to be biased. If, however, parcels close to 35 acres are similar in the baseline covariates, the policy design has some desirable experimental properties for parcels in the neighborhood of 35 acres.

Using the sharp regression discontinuity framework from Imbens and Lemieux (2008), we can estimate the average causal effect of eligibility for FPP by looking at the discontinuity in the conditional expectations of the outcome:

Embedded Image [15]

The average causal effect of eligibility for FPP at the discontinuity of 35 acres is

Embedded Image [16]

By assuming that the conditional regression functions describing the subdivision decision are continuous in acres at the discontinuity (Imbens and Lemieux 2008), we can rewrite the estimate of the treatment effect for being eligible for FPP as

Embedded Image [17]

which is the difference of two regression functions at a point. Intuitively, by comparing parcels that are near the discontinuity that receive and do not receive the treatment, we can identify the average treatment effect for parcels with values of acres at the point of discontinuity (Lee and Lemieux 2009).

We estimate this effect in two ways. First, we use a semiparametric procedure developed by Nichols (2007), which uses local linear regression to estimate the average treatment effect for the treated around the point of the discontinuity. Second, we specify probit regressions with the discontinuity entering the estimation equation as a dummy variable (Imbens and Lemieux 2008). We specify these regressions over a number of distances away from the discontinuity. In both cases we present graphical evidence of the discontinuity. Finally, Lee and Lemiex (2009) show that in RD, panel datasets can be effectively analyzed as a single cross section. Thus, we estimate the probit models with clustered errors, but no random effects.

Summary of the Models

The four models estimate different treatment effects and are based on different underlying functional form and selection bias assumptions. The FIML models from Section IV estimate the average treatment effect of both EAZ and FPP eligibility across all parcels. The matching estimator estimates the average treatment effect for those parcels treated with EAZ, but not eligible for FPP. And the RD method estimates the effect of FPP eligibility on parcels that are treated with EAZ. The FIML models are based on explicit assumptions regarding the underlying distributions of the unobservables, while the matching and RD estimators have much weaker functional form assumptions.

To demonstrate the importance of accounting for endogenous land use policy in models of land use conversion, we also estimate a binary logit model of the subdivision decision to quantify the effects of EAZ and FPP under the assumption that both policies are exogenously applied. In contrast, the FIML model assumes that parcels are selected into zoning based on observable and unobservable factors that may also influence the development decision. Matching and RD estimators assume that zoning selection is based only on observable components, where identification is based off either the balancing of the propensity score, or manipulation of the sample, respectively. Table 2 presents a summary of the underlying assumptions concerning the endogeneity of EAZ and FPP eligibility in the analysis.

Table 2.

Comparison of Econometric Methods

Finally, we note that the decision to subdivide may be different than the decision to develop. For instance, inherited farmland may be split between relatives, but the use of the land may remain agricultural. For policy purposes the change in ownership may be irrelevant unless land use changes in some way. To address this, we ran all the models on the same data but where subdivisions were counted only if a new structure was built by the year 2005 (the last year of our data). The results of these models mirror the results presented in the next section.

V. Results

Regression Techniques

Estimated parameters for the FIML model and the independent probit and logit models of the zoning and subdivision decisions are presented in Table 3 for the period 1972–2005.4 We hypothesize that whether a parcel is zoned EAZ is a function of its size, land use, and location. The results of the first-stage FIML probit regression bear this out: the size, land use, and location of the parcel all significantly influence the likelihood it is zoned EAZ. Of particular interest for this analysis is the estimate of ρ, the coefficient of correlation between the unobservables across the subdivision and zoning decisions, in the FIML estimator. Our estimate of ρ is - 0.74, indicating that parcels with unobservables that make them more likely to subdivide have unobservables that make them less likely to be zoned exclusive agriculture. The estimate of ρ is significantly different from zero at the 5% level and provides evidence that estimates of EAZ in the binary subdivision model suffer from selection bias.

Table 3.

FIML, Probit, and Logit Results for Data from 1972 to 2005

The policy-relevant variables in the logit model and the logit component of the jointly estimated FIML model, EAZ and FPP eligibility, are best interpreted through discrete change effects rather than parameter estimates. The discrete change effects of EAZ and FPP eligibility from the binary logit model are both negative and significantly different from zero (Figures 2 and 3), indicating that under the assumption that EAZ is exogenously imposed, parcels zoned EAZ are less likely to subdivide. However, when the assumption of exogeneity is relaxed in the FIML model, the results change substantially. The discrete change effects of EAZ and FPP eligibility estimated with the FIML model are not significantly different from zero at any reasonable confidence level,5 indicating that we cannot reject the null hypothesis that the zoning policies have no effect on the probability of subdivision when we allow correlated unobservables across the zoning and subdivision decisions.

Figure 2

Discrete Change Effects of EAZ Estimates (Bands Indicate Confidence Intervals)

Figure 3

Discrete Change Effects of FPP Eligibility Estimates (Bands Indicate Confidence Intervals)

Propensity Score Matching

The specification of the propensity score follows closely to the probit selection equation estimated using the regression techniques, with the addition of some higher-order terms to assure proper balance between the covariates. Specifications of the selection equation vary slightly from panel to panel to assure that the balancing algorithm of Becker and Ichino (2002) is met for each specification.6 Table 4 presents the results of the selection equation for 2001–2005, where EAZ is the dependent variable and a probit specification is used. Overall, the size of the parcel, distance to services, distance to Lodi, distance to water, and land use significantly affect the likelihood of a parcel being in EAZ; the other panels mirror this result.

Table 4.

Results from EAZ Selection Equation for Panel Data from 2001 to 2005

There is some variation between panels and between estimators in the magnitude and standard error of EAZ’s ATT. In all cases the ATT is negative, although for the panels 1972-1983 and 1983–1991 results are not significantly different from zero (Table 5). For the panel 1991–2000 the nearest-neighbor algorithm estimates a statistically significant — 7 percentage point change in the likelihood of subdivision; for 2000–2005 this estimate is statistically significant and — 4 percentage points. All other estimates for 1991–2000 and 2000-2005 are not significantly different from zero.

Table 5.

Propensity Score Matching Results: The Effect of EAZ on Parcels Zoned EAZ but Not Eligible for FPP

When significant effects of EAZ were detected, we tested the sensitivity of these results to “hidden bias” using the basic formulation from Rosenbaum (1983b). We use the Mantel-Haenszel (Mantel and Haenszel 1959) test statistic to measure how strongly an unobserved variable would have to influence the selection process to undermine the implications of the matching analysis. The effect of an unobserved variable on the selection into EAZ, γ, is simulated over various values, where larger γ values simulate higher levels of hidden bias. For each value of γ, the Mantel-Haenszel statistic is calculated. As γ increases we can detect the point at which the implications of the matching estimator are no longer valid—the point at which the Mantel-Haenszel statistic becomes statistically insignificant. For the 1991–2000 panel, we find the matching estimates are sensitive to unobserved bias that would increase the odds of being selected into EAZ by 40%. That is, the existence of an unobserved variable that would increase the odds of being zoned EAZ by 40% makes our estimates of the treatment effect null. The 2000–2005 estimates are sensitive to bias that would increase the odds of being selected into EAZ by 20%.7

Regression Discontinuity

Graphical analysis plays an important role in RD, and we present three graphs here (Figure 4). First, we note that there are many observations near the discontinuity of 35 acres. In our setting, 35% of all parcels in the dataset are in EAZ and are between 25 and 45 acres in size, and 50% of all parcels in EAZ fall within this range (top graph, parcels in each 5 acre bin). Figure 4 also presents the mean probability of subdivision for 5 acre bins along with the number of observations. Of particular note is the drop in the mean probability of subdivision between 25–35 acres and 35–45 acres. Also note that the number of observations between 25 and 35 acres (n = 259) is much smaller than between 35 and 45 acres (n = 1,727), which may increase the standard errors of our estimate. Finally we fit a kernel density function to this data and include a break at the discontinuity.8 We note a large discontinuity at 35 acres, indicating that FPP eligibility may have an effect on the propensity to subdivide.

Figure 4

Regression Discontinuity Summary Graphs: (top) Number of Parcels in Each 5 acre Bin. (middle) Mean Probability of Subdivision within Each Bin and Number of Observations. (bottom) Kernel Estimation of Mean Probability of Subdivision on Each Side of the Discontinuity (Bandwidth = 3.37)

A semiparametric methodology developed by Nichols (2007) is used to estimate the effect of FPP eligibility on the likelihood of a parcel to subdivide. In this method, local linear regressions are run on each side of the discontinuity to estimate the local Wald statistic, which can be interpreted as the percentage point change induced by FPP eligibility in the area around the discontinuity.9 The local linear regressions rely simply on the running variable (acres in this case) and the outcome variable—whether or not a subdivision happens—along with specifying the discontinuity. Estimates may be sensitive to bandwidth choice, which dictates how far observations are used from the discontinuity. McCrary (2007) suggests that visual inspection of the local linear regressions around the discontinuity is the most effective way to select a bandwidth. We do this and find an optimal bandwidth around 3. To check the sensitivity of our estimates we estimate the effect of FPP over multiple bandwidths.

An alternative RD method involves running a probit model over the sample data around the discontinuity (Greenstone and Gallagher 2008). In this case, the effect of FPP eligibility can be estimated with a dummy variable.10 Other variables that we assume affect the likelihood of subdivision are also included in the probit model, such as acres, land use, and location of the parcel. The running variable—acres—enters the model linearly. Choosing which parcels are “near” the discontinuity (Imbens and Lemieux 2008) is admittedly at the discretion of the researcher; therefore, we use multiple breakpoints to check for sensitivity in our analysis.11 While there was some sensitivity in regards to standard errors, the main findings are consistent over the range of estimates. We present the full results of one probit model (all years, acres between 25 and 45) in Table 6.12

Table 6.

Full Estimation Results from Probit Discontinuity Model for Parcels between 25 and 45 acres and Zoned EAZ

The RD results all find negative effects of FPP eligibility on the probability of subdivision, but only the semiparametric design produces results that are statistically different from zero (Table 7). In general, these results suggest that the effect of FPP eligibility on the propensity of parcels that are zoned EAZ to subdivide may be negative around the discontinuity. Combined, the two RD methods provide some evidence in favor of an effect of FPP on subdivision, although the bulk of evidence indicates that this effect is weak.

Table 7.

Estimated Regression Discontinuity Results

Discrete Change Effects of EAZ and FPP Eligibility

It is useful to scale the results such that they are easily comparable. Discrete change effects in this setting can be interpreted as the percentage point change in the probability of subdivision for the given treatment (either EAZ or FPP). Some care is still needed when interpreting the discrete change effects, since the actual treatment effects vary between estimators. Overall, the majority of the estimates in Figure 2 suggest that we fail to reject a null hypothesis that EAZ has no effect on the propensity of landowners to subdivide. The binary logit models that assume no selection bias have discrete change effects around — 5 percentage points. Given that correlated unobservables are found in the jointly estimated model, and the propensity score estimates (nearest-neighbor matching) are sensitive to unobserved “hidden bias,” it is likely that these results are erroneous. The FIML estimates and the majority of the propensity score estimates find no effect of EAZ on the propensity to subdivide. The bulk of the evidence suggests that EAZ likely has no effect on the likelihood of a parcel to subdivide.

The story for FPP eligibility is less clear (Figure 3). Both the binary logit model (no assumed selection bias) and the semiparametric RD model from 1983 produce statistically significant effects of FPP eligibility. As mentioned earlier, the binary logit model is likely affected by selection bias. The semiparametric RD models, however, do offer some evidence that FPP eligibility may affect the likelihood ofa parcel to subdivide. In contrast, the FIML model and the probit discontinuity model find no evidence that FPP eligibility affects the likelihood of a parcel to subdivide. We conclude, therefore, that FPP eligibility likely has a weak effect (if any effect at all) on the likelihood that a landowner subdivides.

VI. Discussion

We present multiple methods to estimate the effect of endogenous land use policy on the likelihood of rural landowners to subdivide. This exercise leads to two main results. First, we cannot reject a null hypothesis that Columbia County’s EAZ program has no effect on development decisions, while Wisconsin’s FPP of tax credits has at most a weak effect on the development decisions of rural landowners in our study area. Second, we find evidence that including zoning as an exogenous explanatory variable in land development models can lead to selection bias, resulting in erroneous inference regarding the effects of land use policies on development decisions.

Our results show that consistent estimates of the effects of land use policy require the researcher to seriously consider the potential for selection bias in land conversion models. While the hedonic literature on zoning has long accounted for endogenous policy application, less attention has been paid to this issue in the land conversion literature. In our setting, three very different econometric methods—FIML estimation, propensity score matching, and regression discontinuity— prove useful at addressing the endogeneity of zoning. Even though the propensity score matching estimator cannot account for unobserved selection bias, the examination of Rosenbaum bounds allows us to evaluate whether these estimates are sensitive to the presence of unobserved selection bias. The regression discontinuity analysis, in general, produces estimates of FPP eligibility that are consistent with results that correct for unobserved selection bias. Our favored estimates are from the FIML model of the jointly estimated zoning-subdivision decision, although we recognize the critique that this method relies extensively on functional form assumptions for identification. Nevertheless, joint estimation provides a plausible identification strategy and generates estimates that can be used in spatial landscape simulations where econometric estimates are linked with a geographic information system to examine how multiple individual decisions influence larger landscapes (Lewis and Plantinga 2007). Future research in land use conversion models would be well served by focusing more attention on methods to properly model selection bias arising from the nonrandom application of land use policy.

As a policy-relevant finding, we cannot reject the null hypothesis that EAZ has no effect on landowner development decisions, while FPP eligibility has at most a weak effect on these decisions. The fact that EAZ does not influence subdivision decisions hints that, in this application, zoning may simply “follow the market.” That is, restrictive zoning, such as EAZ, is likely to be applied to parcels that are unlikely to subdivide whether they are zoned or not. This result is consistent with previous work done using hedonic analysis that finds that areas of high development potential are often zoned to allow development (Wallace 1988; McMillen and McDonald 1989). The result that FPP at most weakly influences the landowner’s decision to subdivide is not surprising given the small benefit to the landowner from FPP (on average $641 per year per farm), compared with the much larger gains possible from subdividing (upward of $7,000 per acre if left in agricultural use and possibly much higher in residential use) (Anderson and Weinhold 2008). This result indicates that, at least in the region of the state we analyze, the money Wisconsin spends on FPP annually has little effect on farmland preservation.

Our use of an admittedly small region— two townships in one exurban county near Madison, Wisconsin—leads to both strengths and weaknesses of our analysis. A clear strength of the small region is the reduction of zoning policy into a binary variable (exclusive agricultural zoning or not) amenable to contemporary treatment evaluation techniques. Analyzing significantly larger regions would provide far less policy clarity, given the fact that zoning rules typically exhibit significant variation across municipalities. However, while the small region of analysis provides empirical clarity, such clarity comes at the expense of generalizability of the results to other regions. Nevertheless, a primary purpose of our analysis is to demonstrate and examine multiple empirical methods to account for the endogeneity of zoning in land conversion models. To the extent that zoning rules in other exurban regions are set by democratically elected boards comprised of local residents and landowners—as occurs in our study region—then the methodology and empirical issue of endogenous zoning will likely be relevant issues for many other researchers.

The evidence presented here suggests that zoning does not alter land development. Corollaries of this result are troubling for other land conservation programs where landowners can influence whether or not they receive a conservation “treatment.” For example, the purchase of development rights (PDR) by governments and nonprofits is a popular way to preserve farmland in perpetuity and is often credited with preserving open space. However, it is easy to imagine a situation analogous to our findings concerning EAZ: those landowners who are least likely to subdivide in the absence of a conservation program (those who wish to continue farming) may be the most likely to sell their development rights. If this is the case, the amount of land “preserved” through PDR programs may be overstated, at least in the short run, due to the fact that some of the farmland likely would not be developed even in the absence of the PDR payment. An analogous situation exists for conservation easements and nature reserves (Andam et al. 2008). More research investigating whether PDR programs and other conservation policies simply “follow the market” may be a valuable line of inquiry that would help policy makers better decide which lands to preserve and how to best go about preserving them.

Acknowledgments

We thank Bill Provencher, Dana Bauer, David Newburn, Andrew Plantinga, an anonymous reviewer, and seminar participants at the AERE sessions of the 2009 AAEA Annual Meeting and the 2009 Heartland Environmental and Resource Economics Workshop for helpful comments; Anna Haines and Dan McFarlane for spatial data assistance; and Randy Thompson for assistance with zoning data. We gratefully acknowledge support for this research by USDA McIntire-Stennis (#WIS01229). The GIS data construction was supported by the National Research Initiative of CSREES, USDA Grant #2005–35401–15924.

Footnotes

  • The authors are, respectively, Humboldt Research Fellow, Humboldt University, Berlin; assistant professor, Department of Economics, University of Puget Sound, Tacoma, Washington; and associate, Industrial Economics, Inc., Cambridge, Massachusetts.

  • 1 One exception in the land use change literature is the analysis of Bento, Towe, and Geoghegan (2007); they use matching methods to estimate the effects of development moratoria on land use change in a selection on observables analysis.

  • 2 Eliminating waterfront parcels reduces the dataset by 315 observations. All econometric models were run with the full dataset and with the restricted dataset; the results are not sensitive to the exclusion of waterfront parcels. Results from the full dataset are available upon request.

  • 3 In addition to the variables used in the estimation, many other geographic variables—distance to Madison, a township dummy, distance to public open space, alternative road measures, among others—were created but found to not influence to the likelihood of zoning or subdivision and were thus left out of the final estimated equations.

  • 4 Since EAZ went into effect in 1973 and FPP in 1977, we also estimate the models with just the 1983-2005 period. Parameter estimates using just the 1983-2005 period are available from the authors upon request, and the relevant estimates are not affected by which time period is used.

  • 5 Standard errors for the discrete change effects for the FIML model are estimated using the Krinsky-Robb method.

  • 6 Results of the selection equations are available upon request from the authors.

  • 7 Full Mantel-Haenszel statistics are available from the authors upon request.

  • 8 We employ the method suggested by Lee and Lemieux (2009) to choose a bandwidth of 0.75 acres.

  • 9 The use of a linear probability model when faced with discrete data is less than ideal. To check the robustness of using this model, we compare the discrete change effects of a probit model with those from a linear model. The marginal effects are nearly identical between the two models, hinting that in this case the use of the linear probability model on a discrete dependent variable is not problematic.

  • 10 We also run the regression with an interaction term FPP*acres, as suggested by Lee and Lemieux (2009); the results do not change qualitatively.

  • 11 A probit model with panel-robust standard errors was run over the range of data from 34–36 acres, 33–37 acres, 32–38 acres, and 31–39 acres along with 30–40 acres and 25–45 acres.

  • 12 As an additional robustness check, we test for a discontinuity at 25 acres. The results of the local linear and probit models are both null.

References