Gravity Models and Empirical Trade

The gravity model of international trade states that the volume of trade between two countries is proportional to their economic mass and inversely related to a measure of their relative trade frictions. Perhaps because of its intuitive appeal, the gravity model has been the workhorse model of international trade for more than 50 years. While the initial empirical work using the gravity model lacked sound theoretical underpinnings, subsequent theoretical developments have shown how a gravity-like specification can be derived from many models with varying assumptions about preferences, technology, and market structure. Along with the strengthening of the theoretical roots of the gravity model, the way in which it is estimated has also evolved significantly since the start of the new millennium. Depending on the exact characteristics of the regression, different estimation methods should be used to estimate the gravity model.

Keywords

international trade
bilateral trade
the gravity equation
structural gravity
trade costs
new trade theory
heterogeneous firms

Subjects

International Economics

The Workhorse of International Trade

For more than 50 years, the gravity model has been the workhorse model of empirical international trade. Originally the model was presented as a simple analogy between Newton’s Universal Law of Gravitation and factors that would influence bilateral trade flows. The flow of trade between two countries was posited to be proportional to the economic size of the trading partners and inversely related to their distance from each other. As formulated, the gravity equation of international trade could be rewritten as a log-linear empirical specification that could be easily estimated. A large number of studies showed that the empirical findings were consistent with the naïve gravity model. In particular, the estimated elasticities of bilateral trade with respect to importer and exporter GDP were close to unity, and the elasticity of trade with respect to bilateral distance was negative. Moreover, the empirical specification was able to account for a reasonable amount of the observed variation in trade.

Even though the model was an empirical success, the gravity equation lacked a sound theoretical background. Beginning in the late 1970s, several authors showed that a gravity-like specification would emerge from a variety of standard assumptions regarding preferences, technology, market structure, and trade. At the same time, empirical trade economists became more concerned about the estimation strategy; in particular, that estimation using ordinary least squares might lead to biased coefficient estimates. The purpose of the review is to trace the history of the gravity equation and provide context for the evolution of the gravity equation of international trade. The review also describes the current state of the field and highlights areas for future research. 1

Since much of the work on the gravity equation has been designed to identify factors that may reduce or enhance bilateral trade, the paper starts by using a naïve gravity specification to show how geography, history, culture, and government policies appear to influence trade flows by looking at the cross-section of data for 145 countries in 2014. It goes on to provide an overview of theoretical models and empirical specifications from 1970 through 2001. The subsequent section works through four of the standard models of international trade and shows how each model leads to a similar empirical specification: the structural gravity model. This section also briefly covers how these models can be extended to include tariffs, intermediate goods, and multiple sectors; it concludes by reviewing recent theoretical models that lead to a gravity-like empirical specification.

The final section of the article reviews the state of the empirical specifications. It starts with a discussion of the conditions under which the log-linear gravity model estimated by ordinary least squares will yield consistent estimates of the coefficients of interest. In most cases, however, these conditions are not satisfied, and an alternative estimator is needed. Santos Silva and Tenreyro (2006) showed that the Poisson Pseudo Maximum Likelihood Estimator has desirable properties that make it attractive for empirical gravity work. This estimator is contrasted with the Gamma Pseudo Maximum Likelihood Estimator and Nonlinear Least Squares, and different specification tests are discussed that may assist in choosing among them. Another issue that can arise in estimating the gravity model is the endogeneity of the control variables, the typical solutions for which are also briefly discussed. The section concludes with a discussion of the path for future work.

Gravity: A First Look at the Data

The early empirical gravity models of international trade were rooted in a simple and intuitive analogy to Newton’s Law of Universal Gravitation. According to Newton’s law, the force of attraction between two bodies is proportional to the product of their masses and inversely proportional to the square of the distance between them. These early gravity models of trade postulated a similar relationship between the bilateral trade flows between two countries, their economic sizes, and a measure of trade frictions. The lack of theoretical underpinnings for this relationship is the reason why it is referred to as the naïve gravity model. Mathematically, it can be expressed as

$$X_{ij} = G\, Y_i^{\beta_1}\, Y_j^{\beta_2}\, \mathit{dist}_{ij}^{\beta_3}\, \varepsilon_{ij} \tag{1}$$

where $X_{ij}$ is bilateral trade between exporting country $i$ and importing country $j$, $Y_i$ ($Y_j$) is the gross domestic product of country $i$ ($j$), and $\mathit{dist}_{ij}$ is the bilateral distance between countries $i$ and $j$. The error term $\varepsilon_{ij}$ is typically assumed to be log-normally distributed. Given the multiplicative structure and the assumption on the error term, Equation 1 can be estimated by taking natural logarithms, which leads to the log-linear specification

$$\ln X_{ij} = \ln(G) + \beta_1 \ln(Y_i) + \beta_2 \ln(Y_j) + \beta_3 \ln(\mathit{dist}_{ij}) + \ln(\varepsilon_{ij}) \tag{2}$$
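As a rough illustration of how Equation 2 is taken to data, the following sketch generates synthetic bilateral observations from Equation 1 and recovers the elasticities by ordinary least squares. All numbers (sample size, parameter values, error variance) are hypothetical choices made for the example and are not estimates from the literature.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000  # hypothetical number of country pairs

# Synthetic regressors in logs (illustrative values, not real data)
ln_Yi = rng.normal(10.0, 1.5, n)
ln_Yj = rng.normal(10.0, 1.5, n)
ln_dist = rng.normal(8.0, 0.8, n)

# Assumed "true" parameters of Equation 1 for this illustration
ln_G, b1, b2, b3 = -5.0, 1.0, 1.0, -1.2
ln_eps = rng.normal(0.0, 0.5, n)  # log of a log-normal multiplicative error

# Equation 2: the log-linear naive gravity specification
ln_X = ln_G + b1 * ln_Yi + b2 * ln_Yj + b3 * ln_dist + ln_eps

# OLS: regress ln X on a constant, ln Y_i, ln Y_j and ln dist
Z = np.column_stack([np.ones(n), ln_Yi, ln_Yj, ln_dist])
beta_hat, *_ = np.linalg.lstsq(Z, ln_X, rcond=None)

print("income and distance elasticities:", beta_hat[1:])
```

Because the simulated error is log-normal and independent of the regressors, OLS behaves well here; that assumption is exactly what later estimation work calls into question.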

The core relationship of gravity models can be easily illustrated using the overall patterns in trade data. In a world without trade frictions, a simple gravity relationship is given by $X_{ij} = Y_i Y_j / Y^W$ where, as before, $Y_i$ ($Y_j$) is the GDP of country $i$ ($j$) and $Y^W$ is world income. The frictionless gravity equation can be rearranged to relate country $j$'s expenditure share on goods produced in country $i$ ($X_{ij}/Y_j$) to the latter’s share of world production ($Y_i/Y^W$). 2 Using trade data from 2014 for 125 countries, Figure 1 plots the relationship between expenditure shares and production shares on a logarithmic scale. The high correlation between these variables shown in this graph is consistent with, and part of the appeal of, the empirical structure of the gravity equation. 3 At the same time, Figure 1 also shows that more than 90% of all expenditure shares remain below the 45-degree line. That these import shares fall consistently below the production shares indicates that the world is far from frictionless.

Figure 1. Expenditure and production shares.

Modifying the frictionless gravity equation gives a crude measure of trade costs. To start, we rewrite the bilateral trade flows as

$$X_{ij} = \frac{Y_i Y_j}{Y^W}\, \phi_{ij}^{-\epsilon}$$

where $\phi_{ij}$ is the bilateral cost of trading and the strictly positive $\epsilon$ is the elasticity of bilateral trade flows with respect to these trade costs. Much of the empirical gravity literature has been devoted to identifying and quantifying the factors that influence trade costs. They can be classified as costs induced by geography (natural trade costs), costs associated with historical and cultural linkages, and costs induced by policies (sometimes referred to as “unnatural trade costs”). Researchers interested in international trade and economic geography have emphasized the role of natural trade costs (often referred to as second nature geography) and how these natural trade costs are associated with the respective location of the economic agents. An obvious empirical measure of such costs is the distance between countries. Limao and Venables (2001) and Hummels (2007) investigated the empirical relationship between distance and observed CIF-FOB trade costs, that is, all the costs associated with shipping the goods and insuring them against damage during transport. These authors found a positive correlation between distance and trade costs. Indeed, one of the most robust findings in the empirical gravity literature is the negative relationship between distance and bilateral trade, or its equivalent: the positive relationship between the natural logarithm of distance and trade costs. A rough measure of trade costs is obtained by rewriting the naïve gravity equation as

$$\phi_{ij}^{\epsilon} = \left( \frac{X_{ij}\, Y^W}{Y_i Y_j} \right)^{-1}$$

Figure 2 plots the relationship between this measure of trade costs and distance, depicting their clear positive relationship. The fitted line indicates that trade falls by 1.4% for every 1% increase in distance.
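The crude trade cost measure and its fitted relationship with distance can be sketched as follows. The four-country GDP vector and distance matrix are made-up inputs, and trade flows are constructed so that the implied distance elasticity equals the 1.4 reported for the fitted line; the helper name `crude_trade_cost` is introduced here purely for illustration.

```python
import numpy as np

def crude_trade_cost(X, Y):
    """Crude trade cost measure phi_ij^eps = Y_i * Y_j / (Y^W * X_ij)."""
    Y = np.asarray(Y, dtype=float)
    X = np.asarray(X, dtype=float)
    return np.outer(Y, Y) / (Y.sum() * X)

# Hypothetical GDPs and bilateral distances (own distance set to 1)
Y = np.array([4.0, 3.0, 2.0, 1.0])
dist = np.array([[1, 2, 4, 8],
                 [2, 1, 2, 4],
                 [4, 2, 1, 2],
                 [8, 4, 2, 1]], dtype=float)

# Trade generated with an assumed distance elasticity of -1.4
X = np.outer(Y, Y) / Y.sum() * dist ** (-1.4)

cost = crude_trade_cost(X, Y)

# Fit ln(cost) on ln(distance) using international pairs only
off_diag = ~np.eye(len(Y), dtype=bool)
slope, intercept = np.polyfit(np.log(dist[off_diag]), np.log(cost[off_diag]), 1)
print("estimated elasticity of trade costs with respect to distance:", slope)
```

With frictionless trade, $X_{ij} = Y_i Y_j / Y^W$, the measure equals one for every pair, which is a quick sanity check on the construction.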

Figure 2. Trade costs and distance.

Other geographical factors are also posited to influence trade, including whether the countries share a border. It is frequently argued that contiguous countries have lower trade costs because their common border lowers both pecuniary and non-pecuniary costs of trade. Figure 3 depicts the relationship between trade costs and distance, where contiguous country-pairs are depicted with the plus symbol (+). On average, the points that correspond to the bilateral pairs that share a border lie below the least-squares line representing the relationship between trade costs and distance, indicating that contiguous countries face lower trade costs. 4

Figure 3. Trade costs and distance for contiguous (+) and non-contiguous (o) countries.

In addition to geography, cultural and historical factors are likely to influence trade costs. For example, Figure 4 depicts the relationship between trade costs and distance, this time with bilateral pairs that have ever had a colonial relationship indicated with a diamond symbol. Since these country pairs fall on average below the fitted line, the figure again suggests that trade costs are lower for bilateral pairs that share a colonial history. 5 Potential explanations for this may be that the colonial history implies more familiarity with each other or more similar institutions. Alternatively, differences in resources that increase trade between the two countries may have been a factor in the formation of colonial relationships in the first place.

Figure 4. Trade costs and distance for countries with (♦) and without (●) a colonial relationship.

Finally, some trade costs are likely attributable to government policies. For example, higher tariffs, economic sanctions, and other forms of regulations likely raise the cost of trading and hence reduce trade. Free trade agreements, on the other hand, are typically designed to lower trade costs and boost trade. Figure 5a depicts the relationship between trade costs and distance where countries that have a free trade agreement are depicted with a gold cross.

However, it is not immediately evident that conditional on distance, countries that have free trade agreements face lower trade costs. A potential explanation may be that other factors associated with trade costs also need to be included. Alternatively, the formation of the trade agreements could be in response to other factors such as high trade costs. Figure 5b depicts the separate fitted lines for the relationship between trade costs and distance with and without trade agreements. These fitted lines indicate that over shorter distances trade costs are lower, on average, for bilateral pairs that have a trade agreement. However, as the distance between the countries increases, free trade agreements appear to have a smaller impact on trade costs. 6

Figure 5. Distance and trade costs for countries with (x) and without (●) a free trade agreement. (a). combined fitted line. (b). separate fitted lines.

While these figures are only suggestive of the relationships between bilateral trade and geography, history, and government policies, the theories described in subsequent sections provide guidance on how other confounding factors can be controlled for when specifying an empirical gravity model.

Early Theoretical Developments and Empirical Applications

The gravity framework initially was appealing to researchers because the log-linear model was a simple and intuitive empirical way to assess the relationship between bilateral trade flows, production, income, and variables that could conceivably be viewed as factors that distort bilateral trade. When applied to trade data, the coefficient estimates were typically economically and statistically significant, and the simple gravity specification seemed to account for a large share of the variation of bilateral trade flows. Even though the gravity equation was considered an empirical success, it was often criticized for lacking sound theoretical foundations. Many of the early attempts to provide a theoretical foundation for the gravity model showed that bilateral trade was a function of incomes but did not provide an explicit rationale for the inclusion of distance and other trade costs. For example, Leamer and Stern (1970) presented a probabilistic model of bilateral trade flows. In their model, it was assumed that each transaction was of the same size ($\gamma$) and that the likelihood that an exporter in $i$ would meet and trade with an importer in $j$ would depend on the trade capacity of each of the two countries relative to total trade. If the trade capacity of the exporting (importing) country $i$ ($j$) is given by $F_i$ ($F_j$), then the probability of trade between an exporter and an importer is given by $p_{ij} = \frac{F_i}{F^W} \frac{F_j}{F^W}$, where $F^W$ represents total world trade. If there are $N$ transactions of size $\gamma$, then total world trade is given by $F^W = N\gamma$ and the volume of trade between $i$ and $j$ is given by

$$X_{ij} = p_{ij} N \gamma = \frac{F_i F_j}{F^W}.$$

Letting gross domestic product proxy for trade capacity results in the frictionless gravity equation. Leamer and Stern (1970) then asserted that it is plausible to assume that the likelihood of trade between two countries would depend on their proximity to each other so that bilateral trade would be given by

$$X_{ij} = p_{ij} N \gamma = \frac{F_i F_j}{F^W}\, g(\mathit{dist}_{ij})$$

where $g'(\mathit{dist}_{ij}) < 0$.

Anderson (1979) also derived a simple, frictionless gravity equation. In a world without trade costs, and where preferences are homothetic and defined over a distinct basket of goods produced by each country, Anderson showed that the volume of trade between country $i$ and country $j$ is given by $X_{ij} = \theta_i Y_j$, where $\theta_i$ represents the representative agent’s preferences for the good produced in country $i$. Goods market clearing (i.e., total goods supplied equal total goods demanded) implies that

$$Y_i = \sum_j X_{ij} = \sum_j \theta_i Y_j \quad \text{or} \quad \theta_i = \frac{Y_i}{Y^W}.$$

Substituting for $\theta_i$ yields the frictionless gravity equation

$$X_{ij} = \frac{Y_i Y_j}{Y^W}.$$

Even this simple formulation without trade costs provides a few simple, testable hypotheses regarding bilateral trade flows. Helpman (1987) and Baier and Bergstrand (2001) showed that this model predicts that bilateral trade increases as the difference in the economic size of the two countries decreases and as their total economic size increases. To see this, define $s_i = \frac{Y_i}{Y_i + Y_j}$ and $s_{ij} = \frac{Y_i + Y_j}{Y^W}$ so that the frictionless gravity equation can be expressed as

$$\frac{X_{ij}}{Y^W} = s_i\, s_j\, s_{ij}^2.$$

This equation is linear when log transformed and can be estimated as

$$\ln\left(\frac{X_{ij}}{Y^W}\right) = \beta_0 + \beta_1 \ln(s_i s_j) + \beta_2 \ln(s_{ij}) + e_{ij}.$$

One would expect the coefficient $\beta_1$ to be close to unity and the coefficient $\beta_2$ to be close to two. Estimating this model using bilateral data from 2014 for 145 countries gives the following parameter estimates

$$\ln\left(\frac{X_{ij}}{Y^W}\right) = \underset{(0.012)}{12.73} + \underset{(0.018)}{1.20}\, \ln(s_i s_j) + \underset{(0.017)}{2.43}\, \ln(s_{ij}) + e_{ij}$$

where the standard errors are in parentheses, $R^2 = 0.531$, and $N = 17{,}538$. The coefficient estimates on both $\ln(s_i s_j)$ and $\ln(s_{ij})$ differ from their hypothesized values at the 95% confidence level; however, the estimates are roughly consistent with the simple, frictionless gravity model.
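The size-share decomposition behind this regression can be checked numerically. The sketch below uses a hypothetical four-country GDP vector, builds frictionless trade flows, and confirms that trade relative to world income equals the product of the two within-pair shares and the squared pair share.

```python
import numpy as np

# Hypothetical GDPs (illustrative values only)
Y = np.array([5.0, 3.0, 1.5, 0.5])
YW = Y.sum()

# Frictionless gravity: X_ij = Y_i * Y_j / Y^W
X = np.outer(Y, Y) / YW

pair_sum = Y[:, None] + Y[None, :]   # Y_i + Y_j
s_i = Y[:, None] / pair_sum          # share of i within the pair
s_j = Y[None, :] / pair_sum          # share of j within the pair
s_ij = pair_sum / YW                 # pair's share of world income

lhs = X / YW
rhs = s_i * s_j * s_ij ** 2
print("identity holds:", np.allclose(lhs, rhs))

# s_i * s_j peaks at 0.25 when the two countries are the same size,
# which is why more similar partners trade more, all else equal
print("max of s_i * s_j:", (s_i * s_j).max())
```

Since $s_i + s_j = 1$, the product $s_i s_j$ is largest when the two economies are of equal size, which is the similarity prediction described above.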

In addition to the model with costless trade, Anderson (1979) presented several models where bilateral trade is influenced by trade costs. The most widely cited of these models is the Armington model. In this model, the representative agent’s preferences are defined over goods, where each good is uniquely produced by one country. These preferences are characterized by a constant elasticity of substitution (CES) utility function given by

$$U_j = \left( \sum_{i=1}^{N} \beta_i^{\frac{1}{\sigma}}\, c_{ij}^{\frac{\sigma-1}{\sigma}} \right)^{\frac{\sigma}{\sigma-1}}.$$

The representative agent maximizes her utility subject to a budget constraint given by $w_j = \sum_{i=1}^{N} p_{ij} c_{ij}$, where $w_j$ is the wage rate of the representative agent in country $j$ and $c_{ij}$ and $p_{ij}$ are, respectively, the consumption and the landed price of the good produced in $i$ and consumed in $j$ (i.e., the price paid by consumers in $j$). Anderson assumed that trade was subject to iceberg trading costs such that if one unit of the good is shipped from country $i$ to country $j$, only $1/t_{ij}$ of the good arrives in country $j$ ($t_{ij} > 1$ for $i \neq j$). As markets are assumed to be competitive, the landed price is simply equal to the factory gate price, $p_i$, scaled up by the iceberg trading costs so that $p_{ij} = p_i t_{ij}$.

The Armington assumptions imply that country j’s total expenditures on goods produced in i are given by

$$X_{ij} = \beta_i \left( \frac{p_i t_{ij}}{P_j} \right)^{1-\sigma} Y_j,$$

where $Y_j$ is aggregate income ($Y_j = w_j L_j$) and $P_j = \left( \sum_{i=1}^{N} \beta_i (p_i t_{ij})^{1-\sigma} \right)^{\frac{1}{1-\sigma}}$. By summing over all destinations, market clearing implies

$$\beta_i = \frac{Y_i}{Y^W} \left[ \sum_{k=1}^{N} \left( \frac{p_i t_{ik}}{P_k} \right)^{1-\sigma} \frac{Y_k}{Y^W} \right]^{-1}. \tag{3}$$

If the quantity of each country’s good is defined so that its price is equal to unity, the following expression for bilateral trade can be obtained by substituting for $\beta_i$

$$X_{ij} = \frac{Y_i Y_j}{Y^W}\, t_{ij}^{1-\sigma} \left[ \left( \sum_{l=1}^{N} \beta_l\, t_{lj}^{1-\sigma} \right) \sum_{k=1}^{N} \left( \frac{t_{ik}}{P_k} \right)^{1-\sigma} \frac{Y_k}{Y^W} \right]^{-1}. \tag{4}$$

In order to estimate Equation 4, many authors assumed that the term in brackets exhibited little variation across bilateral trading partners so that it could safely be ignored. Additionally, it was assumed that trade costs could be modeled as $t_{ij}^{1-\sigma} = \exp\left( z_{ij}^N \beta^N + z_{ij}^H \beta^H + z_{ij}^P \beta^P + e_{ij} \right)$, where $z_{ij}^N$ is a vector of variables capturing natural trading costs, $z_{ij}^H$ is a vector of variables associated with historical and cultural factors, $z_{ij}^P$ is a vector of trade costs associated with policy, and $e_{ij}$ is a normally distributed error term. Taking the natural logarithm of Equation 4 yields

$$\ln(X_{ij}) = \beta_0 + \beta_1 \ln(Y_i) + \beta_2 \ln(Y_j) + z_{ij}^N \beta^N + z_{ij}^H \beta^H + z_{ij}^P \beta^P + e_{ij}. \tag{5}$$

Rather than ignoring the term in brackets, some authors approximated the term by computing a “remoteness” index (see, e.g., Wei [1996], Helliwell [1997], and Harrigan [2003]). While the coefficient estimates on the remoteness variables had the expected signs and were statistically significant, the variables were simple reduced-form representations that typically were a GDP-weighted measure of distance that did not incorporate all aspects of the trade cost vector.
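To make the idea of a remoteness index concrete, the sketch below computes one simple variant: the GDP-share-weighted average distance from each country to its trading partners. The exact formula differs across the cited papers, so this particular weighting, the function name `remoteness`, and the three-country inputs are all assumptions made for illustration.

```python
import numpy as np

def remoteness(dist, Y):
    """GDP-share-weighted distance to partners: sum over j != i of dist_ij * Y_j / Y^W.

    One illustrative variant only; published indices use different weightings.
    """
    Y = np.asarray(Y, dtype=float)
    shares = Y / Y.sum()
    D = np.array(dist, dtype=float)   # copy so the caller's matrix is untouched
    np.fill_diagonal(D, 0.0)          # exclude a country's distance to itself
    return D @ shares

# Hypothetical three-country example
Y = [2.0, 1.0, 1.0]
dist = [[0.0, 1.0, 2.0],
        [1.0, 0.0, 1.0],
        [2.0, 1.0, 0.0]]

print(remoteness(dist, Y))
```

Country 3 is the most remote in this example because it is farthest from the largest economy. As the text notes, an index of this kind uses only distance and so leaves out the other components of the trade cost vector.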

Bergstrand (1985) built on the Anderson framework by including a nested CES demand structure that allowed for the elasticity of substitution among imported goods to differ from the elasticity of substitution between domestically produced goods and imported goods. Unlike Anderson (1979), Bergstrand (1985) assumed that there are costs associated with distributing the products to each potential market and this cost could be modeled with a constant elasticity of transformation (CET) function. Assuming that incomes and prices were exogenous, the CET-technology yielded a set of export supply equations that can be linked to the CES-system of demand equations yielding the following gravity equation

$$X_{ij} = Y_i^{\frac{\sigma-1}{\gamma+\sigma}}\, Y_j^{\frac{\gamma+1}{\gamma+\sigma}}\, t_{ij}^{-\frac{\sigma(\gamma+1)}{\gamma+\sigma}}\, (1+\tau_{ij})^{-\frac{\sigma(\gamma+1)}{\gamma+\sigma}}\, P_i^{-\frac{(\sigma-1)(\gamma-\nu)}{\gamma+\sigma}}\, P_j^{\frac{(\gamma+1)(\sigma-\nu)}{\gamma+\sigma}}\, \bar{P}_i^{\frac{(\sigma-1)(1+\nu)}{\gamma+\sigma}}\, \bar{P}_j^{-\frac{\gamma+1}{(\gamma+\sigma)(\sigma+\nu)}}$$

where $\nu$ is the elasticity of substitution between domestically produced goods and importables. One of the main contributions of the Bergstrand framework was to show that in addition to incomes and trade costs, bilateral trade flows depended on importer and exporter price indices. Furthermore, with data on gross tariffs, this framework allows the researcher to identify the elasticity of substitution among importables ($\sigma$) and the elasticity of transformation parameter ($\gamma$). While the Bergstrand model featured price indices that were related to trade and trade costs, it was not clear at the time how to account for these indices.

One weakness of the Armington models of Anderson (1979) and Bergstrand (1985) was that product differentiation was determined arbitrarily depending on the country of origin. The new trade models of Krugman (1979), Krugman (1980) and Helpman and Krugman (1989), which were developed to account for intra-industry trade, provided a richer supply-side model of production. The key characteristics of these models are that the market structure is monopolistically competitive, consumers have a love of variety, firms within a country all have the same production technology, and the production technology for each firm exhibits increasing returns to scale.

Specifically, consumer preferences are defined by a utility function with constant elasticity of substitution over the varieties of a good

$$U_j = \left( \sum_{\omega} c_j(\omega)^{\frac{\sigma-1}{\sigma}} \right)^{\frac{\sigma}{\sigma-1}}$$

where $\omega$ represents a distinct variety. These preferences are typically referred to as Dixit-Stiglitz preferences. Given that firms are homogeneous within a country and that preferences are symmetric over all varieties, the utility function of the representative agent can be expressed as

$$U_j = \left( \sum_{i=1}^{N} N_i\, c_{ij}^{\frac{\sigma-1}{\sigma}} \right)^{\frac{\sigma}{\sigma-1}} \tag{6}$$

where $N_i$ is the number of varieties produced in country $i$. The representative agent maximizes her utility subject to a budget constraint, which is given by $w_j = \sum_{i=1}^{N} N_i p_{ij} c_{ij}$. The demand of consumers in country $j$ for goods produced in country $i$ is given by

$$X_{ij} = N_i\, p_{ij}\, c_{ij} = N_i \left( \frac{p_{ij}}{P_j} \right)^{1-\sigma} Y_j.$$

In this model, firms face a fixed cost of production and constant marginal cost. With labor as the only input, production of a representative firm in country $i$ is given by $q_i = A_i (l_i - f_i)$, where $q_i$ is the output produced by the firm in country $i$, $A_i$ is the technology available to firms in country $i$, $l_i$ is the labor employed by the representative firm in country $i$, and $f_i$ is the fixed production cost for the representative firm in country $i$. Given the production technology, the wage bill for the firm is given by $w_i l_i = w_i \left( \frac{q_i}{A_i} + f_i \right)$. As in the Armington model, trade involves iceberg trade costs so that total production of a variety produced in country $i$ and shipped to all markets is constrained by

$$q_i = \sum_{k=1}^{N} t_{ik} \left( \frac{p_{ik}}{P_k} \right)^{-\sigma} \frac{Y_k}{P_k}.$$

Profit maximization implies

$$\Pi_i = \max_{\{p_{ik}\}} \left\{ \sum_{k=1}^{N} \left[ \left( \frac{p_{ik}}{P_k} \right)^{1-\sigma} Y_k - \frac{w_i t_{ik}}{A_i} \left( \frac{p_{ik}}{P_k} \right)^{-\sigma} \frac{Y_k}{P_k} \right] - w_i f_i \right\}.$$

The first-order conditions can be arranged to show that prices, inclusive of iceberg trade costs, are markups over marginal costs

$$p_{ik} = \frac{w_i t_{ik}}{A_i \rho}, \quad \text{where } \rho = \frac{\sigma-1}{\sigma}.$$

Free entry implies zero profits, so that total output of each firm in country $i$ is given by $q_i = A_i f_i (\sigma - 1)$. Labor market clearing implies that $L_i = N_i \left( \frac{q_i}{A_i} + f_i \right) = N_i \sigma f_i$, or $N_i = \frac{L_i}{\sigma f_i}$, so that aggregate production in market $i$ is given by $Y_i = p_i Q_i = p_i N_i q_i = \frac{w_i}{A_i \rho} \cdot \frac{A_i L_i (\sigma - 1)}{\sigma} = w_i L_i$.
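The free-entry and labor-market-clearing steps can be verified with a few lines of arithmetic. The parameter values below are arbitrary illustrative choices; the factory gate price is the markup rule evaluated without trade costs.

```python
# Hypothetical parameter values for a single country
sigma = 4.0   # elasticity of substitution
A = 2.0       # technology
f = 0.5       # fixed cost, in units of labor
L = 120.0     # labor endowment
w = 1.0       # wage rate

rho = (sigma - 1.0) / sigma
p = w / (A * rho)                # factory gate price: markup over marginal cost
q = A * f * (sigma - 1.0)        # firm output implied by zero profits
labor_per_firm = q / A + f       # variable plus fixed labor, equals sigma * f
N = L / labor_per_firm           # number of varieties from labor market clearing
Y = p * N * q                    # aggregate production
profit = p * q - w * labor_per_firm

print("varieties:", N)
print("output value:", Y, "wage bill:", w * L)
print("profit per firm:", profit)
```

With these numbers each firm employs two units of labor, 60 varieties are produced, and the value of output equals the wage bill of 120, as the derivation requires.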

A conditional equilibrium gravity equation is obtained by substituting $p_{ij} = p_i t_{ij}$ in the demand equation and substituting $N_i = \frac{Y_i}{p_i q_i}$ so that the demand equation can be expressed as

$$X_{ij} = q_i^{-1}\, p_i^{-\sigma} \left( \frac{t_{ij}}{P_j} \right)^{1-\sigma} Y_i Y_j.$$

Assuming that technology is the same across countries, this equation can be estimated in log form as

$$\ln(X_{ij}) = \beta_0 + \beta_1 \ln(Y_i) + \beta_2 \ln(Y_j) + z_{ij}^N \beta^N + z_{ij}^H \beta^H + z_{ij}^P \beta^P - \sigma \ln(p_i) + (\sigma - 1) \ln(P_j) + \epsilon_{ij}$$

where the trade costs were modeled as

$$(1-\sigma) \ln(t_{ij}) = z_{ij}^N \beta^N + z_{ij}^H \beta^H + z_{ij}^P \beta^P + \epsilon_{ij}.$$

This conditional equilibrium gravity equation includes the price of goods produced in i and the price index for the commodities consumed in country j . Baier and Bergstrand (2001) used GDP deflators as empirical proxies for the price terms. However, as Feenstra (2004) pointed out, the GDP deflators do not reflect the international prices implied by the theory, so it should come as no surprise when the price terms were often statistically insignificant.

In order to obtain unbiased coefficient estimates, the empirical specification needs to account for the price terms ($p_i$ and $P_j$) or a more theoretically consistent measure of the remoteness terms. Since these variables are functions of the trade costs, they are likely correlated with the other right-hand side variables. As a result, failing to account for them correctly will lead to biased coefficient estimates. Anderson and van Wincoop (2003) were the first to provide guidance on estimating the gravity equation and accounting for the general equilibrium price terms in a theoretically consistent way. They showed that the price terms in the gravity equation were implicit functions of the trade costs, incomes, and expenditures, confirming what Linnemann (1966) had suggested many years earlier when he stated that prices were an equilibrium outcome of the trade costs and (assumed to be) exogenous incomes. However, unlike Linnemann, who had suggested that the price terms could be ignored, Anderson and van Wincoop stressed the importance of controlling for these price terms in order to obtain consistent estimates. In their application, Anderson and van Wincoop addressed what had been termed the “border puzzle.” This referred to the finding of McCallum (1995) that, controlling for distance and economic size, trade between Canadian provinces was 22 times higher than trade between Canadian provinces and U.S. states. However, once the theoretically consistent price terms are accounted for and the comparative statics outlined in Anderson and van Wincoop are used, the impact of the border is dramatically reduced. Since the publication of the Anderson and van Wincoop paper, nearly all empirical specifications have attempted to account for the price terms.

Structural Gravity

The remainder of this section explains how once the market structure and market clearing are taken into account, the Armington model, the monopolistically competitive trade model, the Eaton and Kortum (2002) Ricardian model, and Melitz’s (2003) heterogeneous firms model can all be written in a similar form. The so-called structural gravity equation takes the following shape

$$X_{ij} = G \left( \frac{t_{ij}}{\Pi_i P_j} \right)^{-\epsilon} Y_i E_j$$

where, as before, $G$ is a constant term, $t_{ij}$ are trade costs between countries $i$ and $j$, $Y_i$ is production in country $i$, $E_j$ is aggregate expenditure by country $j$, and $\epsilon$ is the trade elasticity. As will be explained in more detail, the exporter and importer price indexes ($\Pi_i$ and $P_j$) aggregate the trade costs over all trading partners.

Structural Gravity in the Armington Model

The Armington model assumes that preferences can be represented by a CES utility function where each country’s good enters into the utility function symmetrically. 7 More formally, the utility function of the representative agent in country j is given by

$$U_j = \left( \sum_{i=1}^{N} c_{ij}^{\frac{\sigma-1}{\sigma}} \right)^{\frac{\sigma}{\sigma-1}}. \tag{7}$$

The agents maximize their utility subject to a budget constraint, which states that expenditures equal income plus the trade deficit: $e_j = w_j + d_j \geq \sum_{i=1}^{N} p_{ij} c_{ij}$. The underlying assumption is that the trade deficit is attributable to macroeconomic factors that are not influenced by current trade or trade policies. Maximizing Equation 7 subject to the budget constraint and aggregating over consumers in country $j$ yields the following expression for country $j$'s consumption of country $i$'s good

$$C_{ij} = \left( \frac{p_{ij}}{P_j} \right)^{-\sigma} \left( \frac{E_j}{P_j} \right)$$

where the price index is given by $P_j = \left[ \sum_{i=1}^{N} p_{ij}^{1-\sigma} \right]^{\frac{1}{1-\sigma}}$, $C_{ij} = c_{ij} L_j$ is aggregate consumption of goods produced in country $i$ and consumed in $j$, and total expenditures in country $j$ are given by $E_j = e_j L_j$.

Assuming that markets are perfectly competitive, the landed price of a good will be equal to the factory gate price scaled up by the iceberg trade costs; that is, $p_{ij} = p_i t_{ij}$. The value of bilateral exports from $i$ to $j$ is given by

$$X_{ij} = \left( \frac{p_i t_{ij}}{P_j} \right)^{1-\sigma} E_j$$

and the price index can be expressed as $P_j = \left[ \sum_{i=1}^{N} (p_i t_{ij})^{1-\sigma} \right]^{\frac{1}{1-\sigma}}$.

Market clearing implies $Y_i = \sum_{j=1}^{N} X_{ij}$, so that

$$Y_i = \sum_{j=1}^{N} \left( \frac{p_i t_{ij}}{P_j} \right)^{1-\sigma} E_j, \qquad Y_i = p_i^{1-\sigma}\, \Pi_i^{1-\sigma} \tag{8}$$

where $\Pi_i = \left[ \sum_{j=1}^{N} \left( \frac{t_{ij}}{P_j} \right)^{1-\sigma} E_j \right]^{\frac{1}{1-\sigma}}$.

Substituting $p_i^{1-\sigma}$ from Equation 8 into the price index and the trade flow equation yields the following set of equations

$$X_{ij} = \left( \frac{t_{ij}}{\Pi_i P_j} \right)^{1-\sigma} Y_i E_j, \tag{9}$$

$$\Pi_i = \left[ \sum_{j=1}^{N} \left( \frac{t_{ij}}{P_j} \right)^{1-\sigma} E_j \right]^{\frac{1}{1-\sigma}}, \tag{10}$$

$$P_j = \left[ \sum_{i=1}^{N} \left( \frac{t_{ij}}{\Pi_i} \right)^{1-\sigma} Y_i \right]^{\frac{1}{1-\sigma}}. \tag{11}$$
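Equations 9 through 11 define the multilateral resistance terms only implicitly, so in practice they are solved numerically. The following is a minimal sketch, assuming balanced world trade (total production equals total expenditure) and a simple fixed-point iteration; the function name, the normalization $P_1 = 1$, and the four-country inputs are illustrative assumptions rather than the procedure used in any particular paper.

```python
import numpy as np

def solve_multilateral_resistance(t, Y, E, sigma, tol=1e-12, max_iter=10_000):
    """Iterate on Equations 10 and 11; return (Pi, P, X) with P[0] normalized to 1."""
    t = np.asarray(t, dtype=float)
    Y = np.asarray(Y, dtype=float)
    E = np.asarray(E, dtype=float)
    T = t ** (1.0 - sigma)              # t_ij^(1 - sigma)
    b = np.ones(len(E))                 # b_j = P_j^(1 - sigma)
    for _ in range(max_iter):
        a = T @ (E / b)                 # a_i = Pi_i^(1 - sigma), Equation 10
        b_new = T.T @ (Y / a)           # Equation 11
        b_new = b_new / b_new[0]        # the system pins down levels only up to a scalar
        converged = np.max(np.abs(b_new - b)) < tol
        b = b_new
        if converged:
            break
    a = T @ (E / b)                     # make Pi consistent with the final P
    Pi = a ** (1.0 / (1.0 - sigma))
    P = b ** (1.0 / (1.0 - sigma))
    X = T * np.outer(Y / a, E / b)      # Equation 9
    return Pi, P, X

# Hypothetical inputs: symmetric trade costs, balanced world totals
t = np.array([[1.0, 1.5, 2.0, 2.5],
              [1.5, 1.0, 1.3, 1.8],
              [2.0, 1.3, 1.0, 1.4],
              [2.5, 1.8, 1.4, 1.0]])
Y = np.array([10.0, 6.0, 3.0, 1.0])    # production
E = np.array([9.0, 6.0, 4.0, 1.0])     # expenditure
sigma = 5.0

Pi, P, X = solve_multilateral_resistance(t, Y, E, sigma)
print("outward multilateral resistance:", Pi)
print("inward multilateral resistance:", P)
```

The resulting trade matrix reproduces each country's production as its row sum and each country's expenditure as its column sum. When all trade costs equal one, the solver returns the frictionless flows $Y_i E_j / Y^W$.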

These price terms are referred to in Anderson and van Wincoop as the multilateral resistance terms. The outward multilateral resistance term, $\Pi_i$, is a weighted aggregate of all trade costs faced by the exporters in country $i$, while the inward multilateral resistance term, $P_j$, is a weighted aggregate of all trade costs faced by the importers in country $j$. Therefore, what matters for the volume of trade is the vector of bilateral trade costs relative to the inward and outward multilateral resistance terms. Finally, from an empirical standpoint, the trade elasticity in the Armington model is constant and is determined by the elasticity of substitution across goods: $\epsilon = \sigma - 1$.

In order to close the model, labor is assumed to be the only input in the production of good $i$, and the production function for the good produced in country $i$ ($i = 1, \ldots, N$) is assumed to exhibit constant returns to scale. If technology in country $i$ is given by $A_i$, then the factory gate price is given by $p_i = W_i / A_i$. Market clearing implies $W_i L_i = \sum_{j=1}^{N} X_{ij} = p_i^{1-\sigma}\, \Pi_i^{1-\sigma}$. After substituting for $p_i$, the market clearing condition can be rearranged to solve for the wage rate; that is,

$$W_i = L_i^{-\frac{1}{\sigma}} \left( \frac{A_i}{\Pi_i} \right)^{\frac{\sigma-1}{\sigma}}. \tag{12}$$

As one would expect, an increase in technology in country $i$ ($A_i$) increases the wage rate in country $i$. In addition, having better access to markets reduces the outward multilateral resistance term ($\Pi_i$) and pushes up the wage rate, similar to a technological improvement. Finally, in the Armington model, increasing the population (the number of workers) lowers the wage because this increases the supply of country $i$'s good and lowers its price, which ends up being reflected back on factor prices.
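The intermediate steps behind the wage equation can be sketched as follows, starting from the market clearing condition and the pricing rule $p_i = W_i / A_i$ stated above. The final line restates the three comparative statics as elasticities; this is a worked derivation under the assumptions of the text, not an additional result.

```latex
\begin{aligned}
W_i L_i &= p_i^{1-\sigma}\,\Pi_i^{1-\sigma}
         = \left(\frac{W_i}{A_i}\right)^{1-\sigma}\Pi_i^{1-\sigma} \\[4pt]
W_i^{\sigma} &= L_i^{-1}\left(\frac{A_i}{\Pi_i}\right)^{\sigma-1} \\[4pt]
W_i &= L_i^{-\frac{1}{\sigma}}\left(\frac{A_i}{\Pi_i}\right)^{\frac{\sigma-1}{\sigma}} \\[4pt]
\frac{\partial \ln W_i}{\partial \ln A_i} &= \frac{\sigma-1}{\sigma} > 0, \qquad
\frac{\partial \ln W_i}{\partial \ln \Pi_i} = -\frac{\sigma-1}{\sigma} < 0, \qquad
\frac{\partial \ln W_i}{\partial \ln L_i} = -\frac{1}{\sigma} < 0
\end{aligned}
```

The signs hold for $\sigma > 1$, the case in which goods are substitutes and higher trade costs reduce trade.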

Structural Gravity and Monopolistic Competition with Homogeneous Firms

As in the previous section, agents have symmetric preferences over varieties of goods. Since each firm has identical technology within a country, the value of trade from country i to country j is given by

$$X_{ij} = N_i \left( \frac{p_i t_{ij}}{P_j} \right)^{1-\sigma} E_j$$

where $P_j = \left[ \sum_{i=1}^{N} N_i\, p_{ij}^{1-\sigma} \right]^{\frac{1}{1-\sigma}}$ is the CES price index for the consumer in country $j$ and $N_i$ is the number of varieties (goods) produced in country $i$. As in the Armington model, the landed price in country $j$ is equal to the factory gate price scaled by the iceberg trading costs ($p_{ij} = p_i t_{ij}$) and market clearing implies $Y_i = \sum_{j=1}^{N} X_{ij}$ or

$$Y_i = N_i\, p_i^{1-\sigma} \left[ \sum_{j=1}^{N} \left( \frac{t_{ij}}{P_j} \right)^{1-\sigma} E_j \right].$$

Substituting for $N_i\, p_i^{1-\sigma}$ in the CES price index and in the bilateral trade flow equation yields the following system of equations

$$X_{ij} = \left( \frac{t_{ij}}{\Pi_i P_j} \right)^{1-\sigma} Y_i E_j, \tag{13}$$

$$\Pi_i = \left[ \sum_{j=1}^{N} \left( \frac{t_{ij}}{P_j} \right)^{1-\sigma} E_j \right]^{\frac{1}{1-\sigma}}, \tag{14}$$

$$P_j = \left[ \sum_{i=1}^{N} \left( \frac{t_{ij}}{\Pi_i} \right)^{1-\sigma} Y_i \right]^{\frac{1}{1-\sigma}}. \tag{15}$$

As in the Armington model, the trade elasticity is determined by the elasticity of substitution across (varieties) of goods ( σ ) and is constant.
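Because Equations (14) and (15) define the multilateral resistance terms only implicitly, it can help to see how they might be solved numerically. The sketch below is one possible approach, not the method used in the literature cited here: it alternates between the two equations until they stop changing. The function name, the normalization $P_1 = 1$ (the system pins the terms down only up to a common scale), and the three-country inputs are illustrative assumptions. Total output is assumed to equal total expenditure.

```python
import numpy as np


def solve_multilateral_resistance(t, Y, E, sigma, tol=1e-12, max_iter=10_000):
    """Solve equations (14)-(15) by alternating updates, then build (13).

    t     : (N, N) iceberg trade costs, t[i, j] for exporter i, importer j
    Y, E  : length-N output and expenditure, with Y.sum() == E.sum()
    sigma : elasticity of substitution (sigma > 1)
    Returns (Pi, P, X) with the normalization P[0] = 1 (an assumption).
    """
    t = np.asarray(t, dtype=float)
    Y = np.asarray(Y, dtype=float)
    E = np.asarray(E, dtype=float)
    T = t ** (1.0 - sigma)            # t_ij^(1 - sigma)

    b = np.ones_like(E)               # b_j = P_j^(1 - sigma)
    for _ in range(max_iter):
        a = T @ (E / b)               # a_i = Pi_i^(1 - sigma), equation (14)
        b_new = T.T @ (Y / a)         # equation (15)
        converged = np.max(np.abs(b_new / b - 1.0)) < tol
        b = b_new
        if converged:
            break
    a = T @ (E / b)                   # make (14) hold exactly at the final b

    # The system is invariant to (a, b) -> (a * s, b / s); impose P_0 = 1.
    scale = b[0]
    a, b = a * scale, b / scale

    Pi = a ** (1.0 / (1.0 - sigma))
    P = b ** (1.0 / (1.0 - sigma))
    X = T * np.outer(Y / a, E / b)    # equation (13)
    return Pi, P, X


# Hypothetical three-country example with balanced trade (E = Y).
t_example = np.array([[1.0, 1.5, 2.0],
                      [1.5, 1.0, 1.3],
                      [2.0, 1.3, 1.0]])
Y_example = np.array([100.0, 200.0, 300.0])
Pi_ex, P_ex, X_ex = solve_multilateral_resistance(t_example, Y_example, Y_example, sigma=5.0)
```

At the solution, each row of the trade matrix sums to the exporter's output and each column to the importer's expenditure, which is a convenient check on any implementation.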

Factor prices are pinned down by substituting these into the market clearing condition for goods and labor: $Y_i = W_i L_i$ and $N_i = \frac{A_i L_i}{\sigma f_i}$. Rearranging gives us

$$W_i = B_{MC} \left(\frac{A_i}{\Pi_i}\right)^{\frac{\sigma}{\sigma-1}}, \tag{16}$$

where B M C = ( f i − 1 σ ( σ − 1 ( σ − 1 ) σ − 1 σ ) . Equations (13) to (16) yield a system of equations for trade flows, the multilateral resistance terms, and factor prices. As in the Armington model, factor prices are influenced by technology and the outward multilateral resistance term. However, unlike the Armington model, the wage rate does not depend on the supply of labor. Instead, an increase in the amount of labor in country $i$ leads to an increase in the number of varieties in country $i$. Since consumers have preferences over varieties, the increase in demand perfectly offsets the increased production and the wage rate does not change.

The supply sides of both the Armington and the monopolistically competitive trade models are rather simplistic. The Armington model assumes that each country produces a single good and that the producers in the country do not face direct competition. Within a country, production exhibits constant returns to scale, so that pricing and market demand follow directly. In the monopolistically competitive trade model, households' love of variety and increasing returns in production are essential for pinning down the number of varieties and the size of the firm. However, in most cases, it is assumed that each firm has access to the same technology so that production, sales, and exports are the same for all firms. The models developed by Eaton and Kortum (2002) and Melitz (2003) provide richer supply-side models of international trade. The Eaton-Kortum model is a Ricardian model with perfect competition where agents have preferences over varieties, and consumers buy from the low-cost producer. The Melitz model builds on the models developed by Krugman (1979, 1980) and Helpman and Krugman (1989) by modeling heterogeneous firms that differ in terms of their productivity.

Structural Gravity in the Multi-Country Ricardian Model

Eaton and Kortum (2002) extended the classic Dornbusch, Fischer, and Samuelson (1977) Ricardian model with a continuum of goods to a multi-country setting. In this setting, goods within a category are homogeneous, and Ricardian differences in technology imply that trade is driven by comparative advantage. It will be shown that the structural gravity model also emerges in this framework. Unlike in the previous models, the trade elasticity is determined by the shape parameter of the Fréchet distribution, which governs the dispersion of productivity across firms in different countries.

As in the monopolistically competitive model, aggregate demand by country j for variety ω is given by

$$C_j(\omega) = \left(\frac{p_j(\omega)}{P_j}\right)^{-\sigma} \left(\frac{E_j}{P_j}\right)$$

Eaton and Kortum assume that the technical efficiency of a firm in country $i$ is determined by a random draw from a Fréchet distribution. The CDF of this distribution is given by $F_i(z) = \exp(-T_i z^{-\theta})$, where $T_i$ is a country-specific parameter reflecting the productivity distribution in $i$ ($T_i > 0$), and $\theta$, common to all countries, represents the dispersion of technology. Eaton and Kortum assume that the productivity draws are independent. Furthermore, if labor is the only input into the production process and production exhibits constant returns to scale, the factory gate price for variety $\omega$ produced in country $i$ is given by $p_{ii}(\omega) = W_i / z_i$, where $z_i$ is the technology of the firm producing the good. 8

Given that the productivity draws are independent, it can be shown that the probability that the lowest price at which a good can be delivered to country $j$, across all source countries, is less than or equal to $p$ is given by

$$G_j(p) = 1 - \exp(-\Phi_j p^{\theta})$$

where $\Phi_j = \sum_{k=1}^{N} T_k (W_k t_{kj})^{-\theta}$. The probability that country $i$'s good sells for the lowest price in market $j$ is given by

$$\frac{T_i (W_i t_{ij})^{-\theta}}{\Phi_j}.$$

Given this probability, bilateral trade from i to j can be expressed as

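$$X_{ij} = \left(\frac{T_i (W_i t_{ij})^{-\theta}}{\Phi_j}\right) E_j.$$

Market clearing implies that

$$Y_i = W_i L_i = T_i W_i^{-\theta} \left[\sum_{j=1}^{N} \frac{t_{ij}^{-\theta} E_j}{\Phi_j}\right].$$

Eaton and Kortum show that the price index in country $j$ is given by $P_j = \gamma \Phi_j^{-\frac{1}{\theta}}$. Substituting the price index into the market clearing condition and into the trade flow equation yields

$$X_{ij} = \left(\frac{t_{ij}}{\Pi_i P_j}\right)^{-\theta} Y_i E_j, \tag{17}$$

$$\Pi_i = \left[\sum_{j=1}^{N} \left(\frac{t_{ij}}{P_j}\right)^{-\theta} E_j\right]^{-\frac{1}{\theta}}, \tag{18}$$

$$P_j = \left[\sum_{i=1}^{N} \left(\frac{t_{ij}}{\Pi_i}\right)^{-\theta} Y_i\right]^{-\frac{1}{\theta}}. \tag{19}$$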

Goods market clearing implies that the wage rate is given by

$$W_i = B_{EK} \left(\frac{\bar{z}_i}{\Pi_i}\right)^{\frac{\theta}{\theta+1}} \tag{20}$$

where $\bar{z}_i = \left(e^{0.577}\, T_i\right)^{\frac{1}{\theta}}$ is the geometric mean of $z_i$.
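The probability that country $i$ is the lowest-cost supplier to market $j$, $T_i (W_i t_{ij})^{-\theta} / \Phi_j$, can be checked by simulation. The sketch below computes the analytical shares and compares them with the frequency with which each country wins in a large number of independent Fréchet draws. The function names, parameter values, and the three-country setup are hypothetical and chosen only for illustration.

```python
import numpy as np


def ek_trade_shares(T, W, t, theta):
    """Analytical shares T_i (W_i t_ij)^(-theta) / Phi_j and the vector Phi_j."""
    numer = T[:, None] * (W[:, None] * t) ** (-theta)   # indexed [i, j]
    Phi = numer.sum(axis=0)                              # Phi_j sums over sources
    return numer / Phi, Phi


def simulate_lowest_cost_shares(T, W, t, theta, dest, n_goods=200_000, seed=0):
    """Fraction of goods for which each source is cheapest in market `dest`."""
    rng = np.random.default_rng(seed)
    # If u ~ Exp(1), then z = (T / u)^(1 / theta) has CDF exp(-T z^(-theta)).
    u = rng.exponential(size=(n_goods, T.size))
    z = (T / u) ** (1.0 / theta)
    delivered_price = W * t[:, dest] / z                 # price of each source in dest
    winners = delivered_price.argmin(axis=1)
    return np.bincount(winners, minlength=T.size) / n_goods


# Hypothetical parameter values.
T_ex = np.array([1.0, 2.0, 1.5])
W_ex = np.array([1.0, 1.2, 0.9])
t_ex = np.array([[1.0, 1.4, 1.8],
                 [1.4, 1.0, 1.2],
                 [1.8, 1.2, 1.0]])
shares_ex, Phi_ex = ek_trade_shares(T_ex, W_ex, t_ex, theta=4.0)
simulated_ex = simulate_lowest_cost_shares(T_ex, W_ex, t_ex, theta=4.0, dest=0)
```

Multiplying each column of the share matrix by the importer's expenditure $E_j$ gives the bilateral trade flows in the expression for $X_{ij}$ above.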

Structural Gravity with Heterogeneous Firms

In the standard monopolistically competitive trade model, all firms are assumed to be identical, and all firms export. However, firm-level data show that firms differ in terms of a host of characteristics, including size and productivity, and that the latter is highly correlated with trade participation. By allowing for differences in firm-level productivity and a fixed cost of exporting, Melitz (2003) developed a model that can account for several of these features of the data. Chaney (2008) and Redding (2011) show that when firm-level productivity follows a Pareto distribution, the response of bilateral trade flows to changes in trade costs decomposes cleanly into the extensive and intensive margins of trade. This section shows how the Melitz model works when productivity is Pareto distributed. This yields a structural gravity equation that is similar to the gravity equations derived in previous sections. As in the Eaton and Kortum model, the trade elasticity with respect to variable trade costs is governed by a distributional parameter, here the shape parameter of the Pareto distribution.

As in the previous section, consumers' preferences are characterized by the Dixit-Stiglitz CES utility function defined over varieties. Aggregate demand in country $j$ for variety $\omega$ is given by

$$C_j(\omega) = \left(\frac{p_j(\omega)}{P_j}\right)^{-\sigma} \left(\frac{E_j}{P_j}\right).$$

For a producer of the variety ω in country i , profits from selling in market j are given by

$$\pi_{ij}(\omega) = \max\left\{0,\; \left(\frac{p_j(\omega)}{P_j}\right)^{1-\sigma} E_j - \left(\frac{W_i t_{ij}}{A_i \varphi}\right) \left(\frac{p_j(\omega)}{P_j}\right)^{-\sigma} \left(\frac{E_j}{P_j}\right) - W_i f_{ij}^{X}\right\}$$

where $A_i$ is the aggregate technology in country $i$, $\varphi$ is the firm-specific productivity, and $f_{ij}^{X}$ is the fixed cost of exporting from market $i$ into market $j$. Profit maximization implies that the price of the varieties shipped from country $i$ to country $j$ will depend on the firm's productivity draw ($\varphi$); that is, the price of goods shipped from country $i$ to country $j$ is given by

$$p_{ij}(\varphi) = \frac{W_i t_{ij}}{\rho A_i \varphi}$$

where $\rho = \frac{\sigma-1}{\sigma}$. The profits earned by a firm in country $i$ with productivity $\varphi$ that sells in country $j$ are given by

$$\pi_{ij}(\varphi) = \max\left\{0,\; \left(\frac{W_i t_{ij}}{\rho A_i \varphi P_j}\right)^{1-\sigma} \left(\frac{E_j}{\sigma}\right) - W_i f_{ij}^{X}\right\}.$$

Melitz defined the cut-off productivity $\varphi_{ij}^{*}$ such that the firm's profits are exactly zero: $\pi_{ij}(\varphi_{ij}^{*}) = 0$.

As in Chaney (2008), Redding (2011), and Melitz and Redding (2014), productivity is assumed to follow a Pareto distribution where the cumulative distribution function is given by $G(\varphi) = 1 - \left(\frac{\bar{\varphi}}{\varphi}\right)^{\kappa}$ and is defined on the support $[\bar{\varphi}, \infty)$. Given the productivity distribution and the definition of the zero-profit cut-off productivity, the expected profits for firms in country $i$ selling in country $j$ are given by

$$E\{\pi_{ij} \mid \varphi \ge \varphi_{ij}^{*}\} = \frac{\sigma-1}{\kappa-\sigma+1}\, W_i f_{ij}^{X}$$

and expected profits in market $j$ by all active firms in country $i$ are given by

$$E\{\pi_{ij} \mid \varphi \ge \varphi_{ii}^{*}\} = \left(\frac{\bar{\varphi}}{\varphi_{ii}^{*}}\right)^{\kappa} \frac{\sigma-1}{\kappa-\sigma+1}\, W_i f_{ij}^{X}.$$

Aggregating across all markets delivers an expression for expected profits

$$E(\pi_i) = \frac{\sigma-1}{\kappa-\sigma+1} \sum_j \left(\frac{\bar{\varphi}}{\varphi_{ij}^{*}}\right)^{\kappa} W_i f_{ij}^{X}.$$
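The factor $\frac{\sigma-1}{\kappa-\sigma+1}$ that appears in the expected profit expressions above can be derived in a few lines. The following sketch uses the zero-profit condition to express operating profits relative to the fixed cost, and it makes explicit the assumption $\kappa > \sigma - 1$, which is needed for the integral to converge.

```latex
\begin{align*}
\pi_{ij}(\varphi)
  &= W_i f_{ij}^{X}\left[\left(\frac{\varphi}{\varphi_{ij}^{*}}\right)^{\sigma-1} - 1\right],
  \qquad \varphi \ge \varphi_{ij}^{*} \\
E\left[\left(\frac{\varphi}{\varphi_{ij}^{*}}\right)^{\sigma-1}
  \,\middle|\, \varphi \ge \varphi_{ij}^{*}\right]
  &= \int_{\varphi_{ij}^{*}}^{\infty}
     \left(\frac{\varphi}{\varphi_{ij}^{*}}\right)^{\sigma-1}
     \kappa\,(\varphi_{ij}^{*})^{\kappa}\,\varphi^{-\kappa-1}\,d\varphi
   = \frac{\kappa}{\kappa-\sigma+1} \\
E\{\pi_{ij} \mid \varphi \ge \varphi_{ij}^{*}\}
  &= W_i f_{ij}^{X}\left[\frac{\kappa}{\kappa-\sigma+1} - 1\right]
   = \frac{\sigma-1}{\kappa-\sigma+1}\,W_i f_{ij}^{X}
\end{align*}
```

The first line holds because operating profits are proportional to $\varphi^{\sigma-1}$ and equal $W_i f_{ij}^{X}$ exactly at the cut-off; the second uses the Pareto density truncated at $\varphi_{ij}^{*}$.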

Free entry implies that the expected profits, conditional on a productivity draw greater than or equal to $\varphi_{ii}^{*}$, are equal to the fixed costs of entry ($f_i^{E}$) in terms of domestic labor units; that is,

$$\left(\frac{\bar{\varphi}}{\varphi_{ii}^{*}}\right)^{\kappa} E(\pi_i) = W_i f_i^{E}. \tag{21}$$

As in the Krugman model, the labor market clearing condition pins down the mass of firms in each country

$$M_i = \frac{\sigma-1}{\kappa\sigma} \left(\frac{L_i}{f_i^{E}}\right) \left(\frac{\bar{\varphi}}{\varphi_{ii}^{*}}\right)^{\kappa}.$$

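Given the mass of firms, exports from country $i$ to country $j$ are given by

$$X_{ij} = M_i\, \frac{1-G(\varphi_{ij}^{*})}{1-G(\varphi_{ii}^{*})} \int_{\varphi_{ij}^{*}}^{\infty} \left(\frac{W_i t_{ij}}{\rho A_i P_j}\right)^{1-\sigma} E_j\, \kappa\, (\varphi_{ij}^{*})^{\kappa}\, \varphi^{-(\kappa-\sigma+2)}\, d\varphi$$

where $G(\varphi_{ik}^{*}) = 1 - \left(\frac{\bar{\varphi}}{\varphi_{ik}^{*}}\right)^{\kappa}$ $\forall k$. Substituting the zero-profit condition $\sigma W_i f_{ij}^{X} (\varphi_{ij}^{*})^{1-\sigma} = \left(\frac{W_i t_{ij}}{\rho A_i P_j}\right)^{1-\sigma} E_j$ yields

$$X_{ij} = \left(\frac{(\sigma-1) L_i}{\kappa\sigma f_i^{E}} \left(\frac{\varphi_{ij}^{*}}{\bar{\varphi}}\right)^{-\kappa}\right) \left(W_i f_{ij}^{X}\, \frac{\kappa\sigma}{\kappa-\sigma+1}\right).$$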

The first term in brackets is the mass of firms in country i that actively export to country j , which following Redding (2011) can be viewed as the extensive margin. A change in variable trade costs or fixed trade costs will impact the extensive margin through its impact on the cut-off productivity, φ i j * . The second term in brackets, in turn, shows that the intensive margin is a function of the fixed cost of exporting, f i j X .

In order to express the bilateral trade equation in the form of structural gravity, the cut-off productivity of Equation 21 can be substituted into the bilateral trade equation to obtain

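$$X_{ij} = B_m \left(E_j^{\frac{\kappa}{\sigma-1}}\right) (Y_i)\, W_i^{-\frac{\kappa\sigma}{\sigma-1}} A_i^{\kappa} \left(\frac{t_{ij}}{P_j}\right)^{-\kappa} \left(f_{ij}^{X}\right)^{-\left(\frac{\kappa}{\sigma-1}-1\right)},$$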

where B m = φ ¯ ( σ − 1 ) κ + 1 σ κ σ σ − 1 κ − σ + 1 . Market clearing implies

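$$Y_i = B_m Y_i W_i^{-\frac{\kappa\sigma}{\sigma-1}} A_i^{\kappa} \sum_j \left(\frac{t_{ij}}{P_j}\right)^{-\kappa} \left(f_{ij}^{X}\right)^{-\frac{\kappa}{\sigma-1}} E_j^{\frac{\kappa}{\sigma-1}}.$$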

Defining Π i as

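$$\Pi_i = \left[\sum_j \left(\frac{t_{ij}}{P_j}\right)^{-\kappa} \left(f_{ij}^{X}\right)^{-\frac{\kappa}{\sigma-1}} E_j^{\frac{\kappa}{\sigma-1}}\right]^{-\frac{1}{\kappa}}, \qquad \bar{P}_j = E_j^{\frac{\kappa-\sigma+1}{\sigma-1}} P_j,$$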

bilateral trade can be expressed as

$$X_{ij} = \left(\frac{t_{ij}}{\Pi_i \bar{P}_j}\right)^{-\kappa} \left(f_{ij}^{X}\right)^{-\left(\frac{\kappa}{\sigma-1}-1\right)} Y_i E_j.$$

Extensions

In the models reviewed in the previous section, all trade costs were modeled as iceberg trade costs. However, some trade costs create rents, and how these rents are (re-)distributed can impact trade flows. For example, if there are ad valorem tariffs (i.e., tariffs on the value and not the quantity of the good) and those tariffs are distributed as lump-sum payments to households, a structural gravity equation emerges with tariffs included as part of the trade cost vector. Moreover, the inclusion of tariffs may allow the researcher to identify key parameters of the model. In both the Armington model and the model with monopolistic competition, the structural gravity equation is given by

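$$X_{ij} = \tau_{ij}^{-\sigma} \left(\frac{t_{ij}}{\Pi_i P_j}\right)^{1-\sigma} Y_i E_j,$$

$$\Pi_i = \left[\sum_{j=1}^{N} \tau_{ij}^{-\sigma} \left(\frac{t_{ij}}{P_j}\right)^{1-\sigma} E_j\right]^{\frac{1}{1-\sigma}},$$

$$P_j = \left[\sum_{i=1}^{N} \tau_{ij}^{-\sigma} \left(\frac{t_{ij}}{\Pi_i}\right)^{1-\sigma} Y_i\right]^{\frac{1}{1-\sigma}},$$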

where τ i j is the gross tariff rate. When data on ad valorem tariff rates is available, this can then be used to identify the elasticity of substitution across varieties.

Another case in which the structural gravity model emerges is when the production function is modified to include intermediate goods. Eaton and Kortum (2002) and Redding and Venables (2004) provide theoretical models that include intermediate goods used in the production process, as described in Fujita, Krugman, and Venables (1999). They show that when production technology is represented by a Cobb-Douglas production function using labor and a CES aggregate of intermediate goods, a structural gravity equation emerges. The main difference between these models and, for example, the Armington model is that factor prices will be influenced by the inward multilateral resistance terms, $P_i$. Intuitively, better access to foreign intermediates tends to raise the returns to the domestic factors of production. In addition, a sectoral gravity equation emerges when there are many sectors, and the demand for the varieties produced in different sectors is weakly separable in the production function and/or in the utility function. Anderson and Yotov (2016) estimated a sectoral structural gravity equation, and they highlight how trade costs vary across sectors. Redding and Weinstein (2019) used a nested CES demand system to show how a log-linear gravity equation can be estimated and aggregated, and how it is possible to decompose the overall effects of different trade costs into different components reflecting the sectoral gravity equation estimates.

Hallak (2006), Hallak (2010), and Baldwin and Harrigan (2011) allowed for varieties to differ by quality. In these models, the demand for high-quality products increases with the consumer's income. On the supply side, high-income countries also tend to produce high-quality goods. This is either because they produce high-quality goods to satisfy local market demand, or because their firms are, on average, more productive and can therefore produce higher-quality goods more efficiently. As a result, countries with similar per capita GDPs are expected to trade more (see Linder, 1961). Hallak (2010) used a sectoral gravity-like equation to show that bilateral pairs that have similar per capita GDPs should trade more.

In all of the models discussed so far, preferences are characterized by a CES utility function. Novy (2013) derived a gravity equation where demand is characterized by a translog demand system. Unlike in the earlier models, Novy (2013) showed that the bilateral elasticity of trade with respect to trade costs is not constant. Behrens and Murata (2012) and Arkolakis, Costinot, Donaldson, and Rodríguez-Clare (2015) showed that the structural gravity model can be obtained when agents have constant absolute risk aversion utility functions. In this case, there is a choke price above which demand is zero, so that not all firms export. Nevertheless, because firm productivity follows an unbounded Pareto distribution in the ACDRC framework, there will always be positive bilateral trade, and a gravity-like equation can be obtained. 9

Empirical Gravity

From Tinbergen's early application until the beginning of this century, trade economists working on the gravity model have focused mainly on either the theoretical foundations of the model or on expanding the list of covariates used to identify other natural, historical, cultural, and policy-related variables that affect bilateral trade. During this time, nearly every empirical application estimated the gravity model in log-linear form using ordinary least squares (OLS). 10 The influential work by Santos Silva and Tenreyro (2006) called into question the use of the log-linear specification. Santos Silva and Tenreyro (hereafter SST) argued that the error term was likely heteroskedastic, and that its variance was likely a function of the right-hand side control variables. If this were the case, the coefficient estimates from the log-linear model would be inconsistent. SST proposed using a Poisson Pseudo Maximum Likelihood Estimator. This section discusses the properties of the different estimators, starting with the assumptions required for OLS to yield consistent estimates. It subsequently covers the estimation of the gravity model using the Poisson Pseudo Maximum Likelihood Estimator (PPML), the Gamma Pseudo Maximum Likelihood Estimator (GPML), and Nonlinear Least Squares (NLS). 11 This section closes by addressing the potential endogeneity of the policy variables and discussing how this has been addressed.

The Log-Linear Model

Assuming that the expected value of trade is given by the structural gravity equation as derived in the previous section, observed bilateral trade is given by

$$X_{ijt} = \left(\frac{t_{ijt}}{\Pi_{it} P_{jt}}\right)^{-\epsilon} E_{jt} Y_{it} + u_{ijt} \tag{22}$$

In addition to the error term $u_{ijt}$, time subscripts have been added to the gravity model to allow for the estimation to include several years of bilateral trade data. In some instances, it will be more convenient to substitute for the trade cost vector in the functional form and express Equation 22 as

$$X_{ijt} = \exp\left\{Z_{ijt}^{N}\beta^{N} + Z_{ijt}^{H}\beta^{H} + Z_{ijt}^{P}\beta^{P} + (\epsilon \ln \Pi_{it} + \ln Y_{it}) + (\epsilon \ln P_{jt} + \ln E_{jt})\right\} + u_{ijt}$$

Most early applications of the gravity model estimated Equation (22) without accounting for the multilateral resistance terms. To illustrate, the first column of Table 1 presents the coefficients of the log-linear model estimated using pooled OLS for five-year intervals from 1974 to 2014. 12 Included in these regressions are time dummies that capture the yearly variations in worldwide trade. The results are consistent with many papers estimating the log-linear gravity model. Specifically, the coefficients on GDP are (relatively) close to unity, the absolute value of the distance elasticity is close to unity, and the coefficients on the common language, contiguity, colony, and FTA indicator variables are all positive.

Addressing Multilateral Resistance

There are a number of reasons why the coefficient estimates from this specification may be inconsistent. The most obvious, given the theoretical discussion in the previous section, is that this specification does not include controls for the multilateral resistance terms. 13 Anderson and van Wincoop used an iterative nonlinear least squares estimator that computes and incorporates the multilateral resistance terms. However, a more straightforward way to account for the multilateral resistance terms that avoids custom programming is to include importer and exporter fixed effects. The inclusion of these fixed effects means that the trade elasticity with respect to GDPs can no longer be identified directly. When extended to a panel setting, Baldwin and Taglioni (2006) and Baier and Bergstrand (2007) emphasized that the theoretically consistent fixed effects should be specified as exporter-year and importer-year fixed effects.

The structural gravity equation can now be rewritten as

$$X_{ijt} = X_{ij}^{SG} + u_{ijt} \tag{23}$$

where $X_{ij}^{SG} = \exp(Z_{ijt}\beta + \delta^{X} D_{it} + \delta^{M} D_{jt})$, $Z_{ijt}$ is a $k$-dimensional vector capturing bilateral trade costs, and $D_{it}$ ($D_{jt}$) are exporter-year (importer-year) dummy indicators. One issue that arises in estimating Equation (23) is that as the number of countries and years in the panel rises, the estimation of the coefficients on the dummy indicators becomes increasingly difficult and time consuming. This challenge led to Baier and Bergstrand's (2009) linearized version of the multilateral resistance terms, which greatly reduced the number of parameters and allowed for the inclusion of importer- and exporter-specific effects. However, these technical issues are less of a concern now that most statistical packages have custom programs that allow the researcher to estimate models using high-dimensional fixed effects.
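To make the role of the exporter-year and importer-year dummies concrete, the sketch below simulates a small log-linear panel and recovers the distance elasticity by OLS with both sets of dummies. Everything here is hypothetical: the function names, the data-generating process, the sample size, and the true elasticity of -1 are illustrative assumptions, and a real application would typically rely on a dedicated high-dimensional fixed effects routine rather than an explicit dummy matrix.

```python
import numpy as np


def one_hot(codes, n_levels):
    """Dense dummy matrix with one column per level."""
    out = np.zeros((codes.size, n_levels))
    out[np.arange(codes.size), codes] = 1.0
    return out


def simulate_and_estimate(n_countries=15, n_years=3, beta_dist=-1.0, seed=0):
    """Simulate ln X_ijt and estimate the distance elasticity with
    exporter-year and importer-year fixed effects (log-linear OLS)."""
    rng = np.random.default_rng(seed)
    i, j, t = np.meshgrid(np.arange(n_countries), np.arange(n_countries),
                          np.arange(n_years), indexing="ij")
    keep = i != j                                  # drop internal trade
    i, j, t = i[keep], j[keep], t[keep]

    raw = rng.uniform(5.0, 9.0, size=(n_countries, n_countries))
    ln_dist = ((raw + raw.T) / 2.0)[i, j]          # symmetric log distance

    exp_fe = rng.normal(size=(n_countries, n_years))   # stands in for exporter terms
    imp_fe = rng.normal(size=(n_countries, n_years))   # stands in for importer terms
    ln_x = (beta_dist * ln_dist + exp_fe[i, t] + imp_fe[j, t]
            + rng.normal(scale=0.1, size=i.size))

    D_it = one_hot(i * n_years + t, n_countries * n_years)
    D_jt = one_hot(j * n_years + t, n_countries * n_years)
    design = np.column_stack([ln_dist, D_it, D_jt])

    # The two dummy sets are collinear with each other, so lstsq returns a
    # minimum-norm solution; the distance coefficient is still identified.
    coef, *_ = np.linalg.lstsq(design, ln_x, rcond=None)
    return coef[0]


beta_hat = simulate_and_estimate()
```

Note that GDP terms do not appear in the design matrix: any variable that varies only at the exporter-year or importer-year level is absorbed by the dummies, which is why the trade elasticity with respect to GDP can no longer be identified directly.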

Table 1. Comparison of Different Estimators

Note: Standard errors in parentheses.

Heteroskedasticity and the Structural Gravity Model

For many years, nearly all empirical papers estimated a log-linear gravity model using ordinary least squares, which in many instances may have led to biased coefficient estimates. If the error term is heteroskedastic and the variance of the error term is correlated with the right-hand side variables, the estimates are likely to be biased. Estimating Equation (23) in logs will yield consistent estimates under the following conditions:

$$u_{ijt} = \exp(Z_{ijt}\beta + \delta^{X} D_{it} + \delta^{M} D_{jt})\, \eta_{ij}, \qquad E(\eta_{ij} \mid Z_{ij}, D_i, D_j) = 0, \qquad Var(\eta_{ij} \mid Z_{ij}, D_i, D_j) = \sigma_{\eta}^{2}.$$

Furthermore, in the presence of missing trade data, the OLS coefficient estimates will only be consistent when the data are completely missing at random, or the missing observations are functions of the right-hand side controls but independent of the error terms. There are statistical tests that can be performed to check whether the zero trade flows are economically determined as opposed to missing at random. Perhaps the simplest of these tests is to estimate Equation 22 while including an indicator variable that shows if the bilateral pair has positive trade flows in the subsequent period. If the coefficient on this variable is statistically significant, the zero trade flows are likely economically determined.

Alternatively, Helpman, Melitz, and Rubinstein (2008) (hereafter HMR) employ a Heckman-like correction to account for firm heterogeneity and zero trade flows. They develop a model where firm productivity is drawn from a truncated Pareto distribution. They then show how to account for firm heterogeneity empirically and how to employ a two-step Heckman correction for selection into trade. As is typical with Heckman corrections, the researcher needs to find a variable that influences the extensive margin of trade without impacting the intensive margin of trade. HMR used data from the World Bank's “Doing Business” report for a core set of countries and used religion as the identifying variable for a broader group of countries. However, Santos Silva and Tenreyro (2015) showed that the HMR specification is only valid under relatively strong distributional assumptions and that standard statistical tests reject these assumptions.

Gamma and Poisson Estimators

While controlling for the multilateral resistance terms helps to account for the correlation between the trade costs and the error term, there are other reasons to suspect the coefficient estimates may be inconsistent. SST showed that the log-linear specification leads to inconsistent estimates if the error term is heteroskedastic and the variance depends on the right-hand side control variables. To see how the heteroskedasticity is likely to depend on the right-hand side variables, the gravity equation is rewritten as

$$X_{ijt} = \exp(Z_{ijt}\beta + \delta^{X} D_{it} + \delta^{M} D_{jt})\, \nu_{ijt}$$

where $\nu_{ijt}$ is log-normally distributed with $E(\nu_{ijt} \mid Z_{ij}, D_i, D_j) = 1$ and the variance of $\ln \nu_{ijt}$, denoted $\sigma_{\nu}^{2}$, is a function of the $Z_{ij}$'s. Then the expected value is $E[\ln(\nu_{ijt}) \mid Z_{ij}] = -\frac{1}{2}\sigma_{\nu}^{2}$, so that the conditional expectation estimated by the log-linear model is

$$E[\ln(X_{ijt}) \mid Z_{ij}, D_i, D_j] = Z_{ijt}\beta + \delta^{X} D_{it} + \delta^{M} D_{jt} - \frac{1}{2}\sigma_{\nu}^{2}.$$

Because $\sigma_{\nu}^{2}$ varies with the $Z_{ij}$'s, the last term is correlated with the regressors, and the OLS estimates of $\beta$ absorb part of it.

When heteroskedasticity is present, and the conditional mean function is exponential, SST showed that the PPML estimator provides consistent estimates. 14 However, there are other Pseudo Maximum Likelihood Estimators that will also lead to consistent estimates of the parameters of interest. The first-order conditions for this class of models include

$$\sum_{i}^{N} \left\{X_{ijt} - X_{ij}^{SG}\right\} \left(\frac{X_{ij}^{SG}\, Z_{ij}}{V(X_{ij}^{SG})}\right) = 0.$$

In the case of the PPML estimator, the variance is proportional to the mean, so the weighting term reduces to $Z_{ij}$ and the first-order conditions include

$$\sum_{i}^{N} \left\{X_{ijt} - X_{ij}^{SG}\right\} Z_{ij} = 0.$$

For the Gamma Pseudo Maximum Likelihood estimator (GPML), where the variance is proportional to the square of the mean, the first-order conditions would include

$$\sum_{i}^{N} \left\{\frac{X_{ijt} - X_{ij}^{SG}}{X_{ij}^{SG}}\right\} Z_{ij} = 0.$$

The bracketed term is the proportional deviation of actual trade from predicted trade. As Head and Mayer (2014) pointed out, this term may be roughly equal to the log difference between actual trade and predicted trade, in which case the coefficient estimates may be similar to those obtained using OLS.

Nonlinear Least Squares

The final specification discussed in this section is the nonlinear least squares (NLS). For NLS, the variance is independent of the conditional mean so that the first-order conditions include

$$\sum_{i}^{N} \left\{X_{ijt} - X_{ij}^{SG}\right\} X_{ij}^{SG}\, Z_{ij} = 0.$$

As long as the conditional mean is correctly specified, and the sample size is sufficiently large, the coefficient estimates should be similar across these specifications.
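The first-order conditions above differ only in how the residual is weighted, which suggests a single routine indexed by the assumed variance function. The sketch below is one way to solve them, using Fisher scoring with an exponential conditional mean and $V(\mu) \propto \mu^{p}$: $p = 1$ gives the PPML conditions and $p = 2$ the GPML conditions. The function name, the starting values taken from log-linear OLS on positive flows, and the simulated data are assumptions made for illustration; the routine is exercised here only for $p = 1$ and $p = 2$, and it omits standard errors and high-dimensional fixed effects.

```python
import numpy as np


def pml_exponential_mean(x, Z, variance_power, max_iter=200, tol=1e-10):
    """Pseudo-ML estimates for E[x | Z] = exp(Z @ beta) with V(mu) ~ mu**p.

    variance_power = 1 solves sum (x - mu) Z = 0        (PPML)
    variance_power = 2 solves sum ((x - mu) / mu) Z = 0 (GPML)
    Z must already contain a constant column.
    """
    x = np.asarray(x, dtype=float)
    Z = np.asarray(Z, dtype=float)

    # Starting values: log-linear OLS on strictly positive observations.
    pos = x > 0
    beta, *_ = np.linalg.lstsq(Z[pos], np.log(x[pos]), rcond=None)

    for _ in range(max_iter):
        mu = np.exp(Z @ beta)
        w = mu ** (2.0 - variance_power)          # Fisher-scoring weights
        score = Z.T @ (w * (x - mu) / mu)         # the first-order conditions
        info = Z.T @ (w[:, None] * Z)             # expected information
        step = np.linalg.solve(info, score)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Simulated data with a multiplicative, mean-one log-normal error.
rng = np.random.default_rng(1)
n_obs = 5000
Z_sim = np.column_stack([np.ones(n_obs),
                         rng.normal(size=n_obs),
                         rng.integers(0, 2, size=n_obs)])
beta_true = np.array([1.0, -0.8, 0.5])
nu = np.exp(0.3 * rng.normal(size=n_obs) - 0.5 * 0.3 ** 2)   # E[nu] = 1
x_sim = np.exp(Z_sim @ beta_true) * nu

beta_ppml = pml_exponential_mean(x_sim, Z_sim, variance_power=1)
beta_gpml = pml_exponential_mean(x_sim, Z_sim, variance_power=2)
```

With a correctly specified conditional mean, the two sets of estimates should be close to each other and to the true coefficients, in line with the statement above; they differ in efficiency, not in what they estimate.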

Columns 3 to 5 of Table 1 report the results for PPML, GPML, and NLS. As expected, the GPML estimates are similar to those of the OLS-FE model. The absolute value of the distance elasticity is lower for the Poisson model than it is for the OLS-FE and GPML models; this is quite common and was pointed out by SST. For the NLS model, the coefficient estimates on language and colony are notably different from those of the other specifications.

Model Selection and Heteroskedasticity

In order to assess these models, a number of standard tests for functional form and for the presence of heteroskedasticity can be implemented. To test for the latter, SST used the Ramsey RESET test, although this test may also be thought of as a test for functional form. The idea of the Ramsey RESET test is straightforward. After estimating the model, save the predicted values and re-run the model with the same controls along with the square of the predicted value and other higher-order terms. If these additional regressors are not statistically significant, the functional form is likely correctly specified and heteroskedasticity is not a problem. Another commonly used test is the MaMu (or Park) test for heteroskedasticity. For this test, you again save the fitted values from the original specification, create $\hat{V}_{ijt} = (X_{ijt} - \hat{X}_{ijt})^{2}$, and subsequently estimate the following model

$$\hat{V}_{ijt} = \lambda_0 (\hat{X}_{ijt})^{\lambda_1}$$

using the same estimator that generated the predicted values. A statistical test on the value of $\lambda_1$ can help discriminate among the models. If the coefficient estimate is close to one (two), the PPML (GPML) estimator is more efficient, and if the coefficient is close to zero, then NLS may be appropriate. In many cases, the coefficient estimate is somewhere between one and two. Head and Mayer (2014) ran a simulation exercise in which the variance is proportional to the mean, making PPML the most efficient estimator. They found the coefficient estimate on $\lambda_1$ to be close to 1.60. When $\hat{\lambda}_1$ was significantly below two, the MaMu (Park) test was a near-perfect predictor for the model specification. 15
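The variance regression described above can be sketched in a few lines. The version below departs from the text in one respect, which should be kept in mind: instead of re-using the estimator that produced the fitted values, it takes logs and estimates $\ln \hat{V}_{ijt} = \ln \lambda_0 + \lambda_1 \ln \hat{X}_{ijt}$ by OLS, which is the simpler Park-style variant. The function name and the simulated data are illustrative assumptions, and the true conditional mean is used in place of fitted values to keep the example short.

```python
import numpy as np


def park_slope(x, x_hat):
    """Slope lambda_1 from an OLS regression of ln (x - x_hat)^2 on ln x_hat."""
    x = np.asarray(x, dtype=float)
    x_hat = np.asarray(x_hat, dtype=float)
    v_hat = (x - x_hat) ** 2
    keep = (v_hat > 0) & (x_hat > 0)              # logs must be defined
    design = np.column_stack([np.ones(keep.sum()), np.log(x_hat[keep])])
    coef, *_ = np.linalg.lstsq(design, np.log(v_hat[keep]), rcond=None)
    return coef[1]


rng = np.random.default_rng(2)
mu_sim = np.exp(rng.uniform(3.0, 7.0, size=5000))

# Variance proportional to the squared mean (multiplicative log-normal error).
x_sq = mu_sim * np.exp(0.5 * rng.normal(size=mu_sim.size) - 0.125)
slope_sq = park_slope(x_sq, mu_sim)

# Variance proportional to the mean (Poisson counts).
x_pois = rng.poisson(mu_sim)
slope_pois = park_slope(x_pois, mu_sim)
```

In this simulation the first slope should be near two and the second near one, matching the cases in which GPML and PPML, respectively, are the more efficient choice.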

Given the advances in computing power and improvement in estimating techniques, best practices for reporting empirical results would include estimating the model using OLS, PPML, GPML, and potentially NLS. 16 As Head and Mayer suggest, if all the coefficient estimates are similar, then there is little reason for concern. If the coefficient estimates are economically different, then the Ramsey Reset test and the MaMu (Park) test can provide additional insights into the correct empirical specification.

Endogenous Trade Policy

In many cases, the researcher may be concerned that the right-hand side controls are not exogenous. This is most likely to arise when policy variables are included in the specification. Clearly, tariffs and trade agreements are the results of negotiations between bilateral pairs and are hence unlikely to be randomly distributed across bilateral pairs even after controlling for other right-hand side variables. By running a series of cross-sectional gravity equations over time, Baier and Bergstrand (2007) showed that the estimated coefficients on free trade agreements are less stable compared to standard gravity controls. 17 Table 2 presents the results for the gravity equation for five-year intervals from 1979 to 2014. In order to account for the endogeneity, one must find instruments that are correlated with the tariffs or trade agreements but uncorrelated with the error term in the trade flow equation. An alternative is to assume that there are bilateral-specific effects that evolve slowly over time to the point where the researcher can assume that they are constant. One can then estimate the model using bilateral fixed effects. Baier and Bergstrand found that when controlling for time-varying multilateral resistance using standard panel fixed effects, the coefficient on trade agreements was positive and significant and was robust to changes in the specification.

Table 2 presents the results at five-year intervals from the OLS specification from 1979 to 2014. The coefficient on trade agreements is negative and significant for several years, after which it becomes positive and significant. The coefficient on trade agreements ranges from −0.689 to 0.590. If the policy variables are correlated with the error term, consistent estimates can be obtained by using standard instrumental variable techniques. Egger et al. (2011) and Magee (2003) are two notable examples that use IV estimation. Rather than taking the IV approach, Baier and Bergstrand (2007) assumed that the policy variables are correlated with an unobserved component that is fixed or sufficiently slow moving over time. If this assumption holds and all of the other conditions needed for consistent estimation of the log-linear gravity model are met, then consistent estimates can be obtained by using fixed effects or by first differencing the data.
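The logic of the fixed effects approach can be illustrated with a small simulation. In the sketch below, pairs with a high unobserved, time-invariant bilateral effect are the ones that sign an agreement, so pooled OLS confounds the agreement effect with the pair effect, while the within (pair and year demeaned) estimator removes it. The selection rule, the function name, the true effect of 0.5, and all other numbers are hypothetical choices made only to show the mechanism, and multilateral resistance terms are left out for brevity.

```python
import numpy as np


def pooled_and_within_fta(n_pairs=400, n_years=5, gamma=0.5, seed=0):
    """Return (pooled OLS, two-way within) estimates of the agreement effect."""
    rng = np.random.default_rng(seed)
    alpha = rng.normal(size=n_pairs)               # unobserved pair effect
    adopts = alpha > 0                             # assumed selection into agreements
    years = np.arange(n_years)
    fta = (adopts[:, None] & (years[None, :] >= 2)).astype(float)

    year_fe = rng.normal(size=n_years)
    ln_x = (gamma * fta + alpha[:, None] + year_fe[None, :]
            + rng.normal(scale=0.1, size=(n_pairs, n_years)))

    # Pooled OLS with year dummies only (rows ordered pair by pair).
    year_dummies = np.tile(np.eye(n_years), (n_pairs, 1))
    design = np.column_stack([fta.ravel(), year_dummies])
    coef, *_ = np.linalg.lstsq(design, ln_x.ravel(), rcond=None)
    pooled = coef[0]

    # Two-way within transformation for a balanced panel.
    def demean(a):
        return (a - a.mean(axis=1, keepdims=True)
                - a.mean(axis=0, keepdims=True) + a.mean())

    y_w, f_w = demean(ln_x), demean(fta)
    within = (f_w * y_w).sum() / (f_w ** 2).sum()
    return pooled, within


pooled_hat, within_hat = pooled_and_within_fta()
```

The direction of the pooled bias depends entirely on the assumed selection rule; reversing it (pairs with low bilateral effects sign agreements) would push the pooled coefficient downward instead.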

Table 2. Stability of the Coefficients

Note: Robust standard errors in parentheses.

Baier and Bergstrand (2007) also included lags and leads to capture the dynamic aspects of trade agreements. The lagged values of the trade agreement variables detect the phase-in effects, while the leads detect feedback effects (i.e., where large bilateral trade flows lead to new trade agreements). Anderson and Yotov (2016) obtained qualitatively similar findings using a PPML estimator with bilateral fixed effects. Table 3 presents the results using a standard fixed effects estimation and the fixed-effect PPML estimator. In both specifications, there is evidence of economically and statistically significant lagged effects of trade agreements and little evidence of feedback effects. The fixed-effect PPML estimates are also smaller than those from the standard log-linear fixed effect specification.

Table 3. Lagging and Leading Trade Agreements

Note: Robust standard errors in parentheses.

The Current State and Future of Gravity

Over time, improvements to the data and theoretical innovations have resolved several of the empirical puzzles that trade economists identified when employing the gravity equation. McCallum’s border puzzle is one such issue and was addressed by Anderson and van Wincoop (2003). Another puzzle that has been widely discussed is the distance puzzle. Several studies have shown that the absolute value of the elasticity of trade with respect to distance has increased over time (see, e.g., Disdier & Head, 2008). Using data that includes gross production and intra-country trade, Yotov (2012) showed that the effect of distance on trade has declined over time when one measures the impact of distance on international relative to intranational trade. Caron, Fally, and Markusen (2014) argued that incorporating a gravity framework into a model with multiple sectors and non-homothetic preferences addresses several puzzles in international trade.

More recently, the gravity framework has been used to assist in quantifying the general equilibrium impacts of trade policies and to assess the welfare implications. A typical assumption in most empirical specifications is that incomes and prices are exogenous, and this may be an appropriate assumption when the unit of observation is bilateral trade. Given the theoretical developments of the gravity model, it is relatively easy to embed measured trade costs into general equilibrium models and observe how changes in trade costs will impact prices and incomes. An important contribution that set the stage for the use of the gravity equation in evaluating trade policies was the paper of Arkolakis, Costinot, and Rodríguez-Clare (2012). They showed that for a wide class of models, the welfare implication depends on the share of expenditures on domestically produced goods and the elasticity of trade with respect to (variable) trade costs. Another significant contribution that led to the gravity model's use in evaluating trade policy was the small-scale model developed by Alvarez and Lucas (2007). Alvarez and Lucas showed how the Eaton and Kortum model could be calibrated to simulate changes in trade policy. Caliendo and Parro (2015) quantified the impact of the reduction in tariffs as a result of the North American Free Trade Agreement (NAFTA). Caliendo et al. (2017) used a quantitative gravity model to evaluate the impact of 20 years of tariff changes through the GATT/WTO and trade agreements. Felbermayr, Gröschl, and Steininger (2018) used a quantitative trade model to evaluate the impact of Brexit.

Since the advent of new trade theory, there has been an interest in linking trade, firm location, and economic geography. Early theoretical and empirical examples are Fujita et al. (1999) and Redding and Venables (2004). These papers focused on market access and supplier access. As more and better data have become available, these models have used the gravity framework to address how market access and supplier access have affected different areas (see, e.g., Donaldson & Hornbeck, 2016; Donaldson, 2018; Allen & Arkolakis, 2014; Ahlfeldt, Redding, Sturm, & Wolf, 2015).

Several areas remain to be addressed in future work. As pointed out by Lai and Trefler (2002), the gravity equation does an excellent job of explaining cross-sectional variation in trade flows but does not perform as well in explaining the growth of trade. The reason is straightforward: for much of the post–World War II period, trade has increased faster than income. For the gravity model to explain this growth, trade costs must have changed in a way that increased trade. In most specifications, however, the bilateral control variables on the right-hand side are constant over time and thus cannot explain the growth in trade. A related area of research is the development of dynamic models of international trade, both at the aggregate level and incorporating firm dynamics. Anderson, Larch, and Yotov (2015) used an Armington framework with capital accumulation to develop a dynamic general equilibrium gravity model. Sampson (2016) and Perla, Tonetti, and Waugh (2015) provided dynamic models of trade and growth with heterogeneous firms. Finally, future research should seek a better understanding of trade costs derived from first principles. Most theoretical developments have concerned firm production and individual preferences. In almost all cases, trade costs are simply assumed to be iceberg trade costs with a log-linear functional form. Chaney (2018) provided a model that helps to explain the role of distance in the gravity equation.
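
The point about trade growth can be made concrete with a minimal sketch. It assumes a naive gravity equation of the form X_ij = (Y_i Y_j / Y_world) · t_ij^(−θ), which ignores multilateral resistance, and uses hypothetical incomes, iceberg trade costs, and a hypothetical value of θ. With trade costs held fixed, proportional income growth leaves the ratio of trade to income unchanged; only a fall in trade costs raises it.

```python
def gravity_flows(incomes, trade_costs, theta):
    """Naive gravity flows X_ij = (Y_i * Y_j / Y_world) * t_ij ** (-theta), i != j."""
    world_income = sum(incomes)
    n = len(incomes)
    return [
        [
            0.0 if i == j
            else incomes[i] * incomes[j] / world_income * trade_costs[i][j] ** (-theta)
            for j in range(n)
        ]
        for i in range(n)
    ]


def trade_to_income(incomes, trade_costs, theta):
    """World international trade divided by world income."""
    flows = gravity_flows(incomes, trade_costs, theta)
    return sum(sum(row) for row in flows) / sum(incomes)


# Hypothetical three-country world (all numbers are illustrative assumptions).
theta = 4.0
incomes_t0 = [100.0, 50.0, 25.0]
costs = [
    [1.0, 1.5, 2.0],
    [1.5, 1.0, 1.8],
    [2.0, 1.8, 1.0],
]

# Case 1: every income doubles, bilateral trade costs stay constant.
incomes_t1 = [2.0 * y for y in incomes_t0]
ratio_t0 = trade_to_income(incomes_t0, costs, theta)
ratio_t1 = trade_to_income(incomes_t1, costs, theta)

# Case 2: incomes unchanged, international trade costs fall by 10 percent.
lower_costs = [
    [c if i == j else 0.9 * c for j, c in enumerate(row)]
    for i, row in enumerate(costs)
]
ratio_lower_costs = trade_to_income(incomes_t0, lower_costs, theta)

print(f"trade/income, baseline:         {ratio_t0:.4f}")
print(f"trade/income, incomes doubled:  {ratio_t1:.4f}")
print(f"trade/income, costs 10% lower:  {ratio_lower_costs:.4f}")
```

Because the assumed flows are homogeneous of degree one in incomes, the first two ratios coincide. A specification whose bilateral controls do not vary over time therefore cannot account for trade growing faster than income.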

References

Ahlfeldt, G. M., Redding, S. J., Sturm, D. M., & Wolf, N. (2015). The economics of density: Evidence from the Berlin Wall. Econometrica, 83(6), 2127–2189.

Allen, T., & Arkolakis, C. (2014). Trade and the topography of the spatial economy. Quarterly Journal of Economics, 129(3), 1085–1140.

Allen, T., & Arkolakis, C. (2015). Elements of advanced international trade. Graduate Trade Notes.

Alvarez, F., & Lucas, R. E., Jr. (2007). General equilibrium analysis of the Eaton-Kortum model of international trade. Journal of Monetary Economics, 54(6), 1726–1768.

Anderson, J. E. (1979). A theoretical foundation for the gravity equation. American Economic Review, 69(1), 106–116.

Anderson, J. E., & van Wincoop, E. (2003). Gravity with gravitas: A solution to the border puzzle. American Economic Review, 93(1), 170–192.

Anderson, J. E., & Yotov, Y. V. (2016). Terms of trade and global efficiency effects of free trade agreements, 1990–2002. Journal of International Economics, 99, 279–298.

Anderson, J., Larch, M., & Yotov, Y. (2015). Growth and trade with frictions: A structural estimation framework. National Bureau of Economic Research, Working paper series No. 21377.

Arkolakis, C., Costinot, A., Donaldson, D., & Rodríguez-Clare, A. (2015). The elusive pro-competitive effects of trade. National Bureau of Economic Research, Working paper series No. 21370.

Arkolakis, C., Costinot, A., & Rodríguez-Clare, A. (2012). New trade models, same old gains? American Economic Review, 102(1), 94–130.

Baier, S. L., & Bergstrand, J. H. (2001). The growth of world trade: Tariffs, transport costs, and income similarity. Journal of International Economics, 53(1), 1–27.

Baier, S. L., & Bergstrand, J. H. (2007). Do free trade agreements actually increase members’ international trade? Journal of International Economics, 71(1), 72–95.

Baldwin, R. E., & Harrigan, J. (2011). Zeros, quality, and space: Trade theory and trade evidence. American Economic Journal: Microeconomics, 3(2), 60–88.

Baldwin, R., & Taglioni, D. (2006). Gravity for dummies and dummies for gravity equations. National Bureau of Economic Research, Working paper series No. 12516.

Behrens, K., & Murata, Y. (2012). Globalization and individual gains from trade. Journal of Monetary Economics, 59(8), 703–720.

Bergstrand, J. H. (1985). The gravity equation in international trade: Some microeconomic foundations and empirical evidence. Review of Economics and Statistics, 67(3), 474–481.

Bergstrand, J. H., & Egger, P. (2011). Gravity equations and economic frictions in the world economy. In D. Bernhofen, R. Falvey, D. Greenaway, & U. Kreickemeier (Eds.), Palgrave handbook of international trade. London: Palgrave-Macmillan.

Bertoletti, P., Etro, F., & Simonovska, I. (2018). International trade with indirect additivity. American Economic Journal: Microeconomics, 10(2), 1–57.

Caliendo, L., & Parro, F. (2015). Estimates of the trade and welfare effects of NAFTA. Review of Economic Studies, 82(1), 1–44.

Caron, J., Fally, T., & Markusen, J. R. (2014). International trade puzzles: A solution linking production and preferences. Quarterly Journal of Economics, 129(3), 1501–1552.

Chaney, T. (2008). Distorted gravity: The intensive and extensive margins of international trade. American Economic Review, 98(4), 1707–1721.

Chaney, T. (2018). The gravity equation in international trade: An explanation. Journal of Political Economy, 126(1), 150–177.

Cheng, I.-H., & Wall, H. J. (2005). Controlling for heterogeneity in gravity models of trade and integration. Federal Reserve Bank of St. Louis Review, 87(1), 49–63.

Deardorff, A. V. (1998). Determinants of bilateral trade: Does gravity work in a neoclassical world? In J. Frankel (Ed.), The regionalization of the world economy (pp. 7–32). Chicago: University of Chicago Press.

De Benedictis, L., & Taglioni, D. (2010). The gravity model in international trade. In L. De Benedictis & L. Salvatici (Eds.), The trade impact of European Union preferential policies. Berlin: Springer-Verlag.

Disdier, A.-C., & Head, K. (2008). The puzzling persistence of the distance effect on bilateral trade. Review of Economics and Statistics, 90(1), 37–48.

Donaldson, D. (2018). Railroads of the Raj: Estimating the impact of transportation infrastructure. American Economic Review, 108(4–5), 899–934.

Donaldson, D., & Hornbeck, R. (2016). Railroads and American economic growth: A ‘market access’ approach. Quarterly Journal of Economics, 131(2), 799–858.

Dornbusch, R., Fischer, S., & Samuelson, P. (1977). Comparative advantage, trade, and payments in a Ricardian model with a continuum of goods. American Economic Review, 67(5), 823–839.

Eaton, J., & Kortum, S. (2002). Technology, geography and trade. Econometrica, 70(5), 1741–1779.

Eaton, J., Kortum, S. S., & Sotelo, S. (2012). International trade: Linking micro and macro. National Bureau of Economic Research, Working paper series No. 17864.

Eaton, J., & Tamura, A. (1995). Bilateralism and regionalism in Japanese and U.S. trade and direct foreign investment patterns. National Bureau of Economic Research, Working paper series No. 4758.

Egger, P., Larch, M., Staub, K. E., & Winkelmann, R. (2011). The trade effects of endogenous preferential trade agreements. American Economic Journal: Economic Policy, 3(3), 113–143.

Fally, T. (2015). Structural gravity and fixed effects. Journal of International Economics, 97(1), 76–85.

Feenstra, R. C. (2004). Advanced international trade: Theory and evidence. Princeton, NJ: Princeton University Press.

Felbermayr, G., Gröschl, J., & Steininger, M. (2018). Brexit through the lens of new quantitative trade theory. In Annual Conference on Global Economic Analysis at Purdue University.

Frankel, J. A. (1997). Regional trading blocs. Washington, DC: Institute for International Economics.

Fujita, M., Krugman, P. R., & Venables, A. J. (1999). The spatial economy: Cities, regions, and international trade. Cambridge, MA: MIT Press.

Hallak, J. C. (2006). Product quality and the direction of trade. Journal of International Economics, 68(1), 238–265.

Hallak, J. C. (2010). A product-quality view of the Linder Hypothesis. Review of Economics and Statistics, 92(3), 453–466.

Harrigan, J. (2003). Specialization and the volume of trade: Do the data obey the laws? In K. E. Choi & J. Harrigan (Eds.), Handbook of international trade (1st ed., pp. 85–118). Oxford, U.K.: Blackwell.

Head, K., & Mayer, T. (2014). Gravity equations: Workhorse, toolkit, and cookbook. In G. Gopinath, E. Helpman, & K. Rogoff (Eds.), Handbook of international economics (Vol. 4, pp. 131–195). North Holland: Elsevier.

Head, K., Mayer, T., & Ries, J. (2010). The erosion of colonial trade linkages after independence. Journal of International Economics, 81, 1–14.

Helliwell, J. F. (1997). National borders, trade and migration. Pacific Economic Review, 2(3), 165–185.

Helpman, E. (1987). Imperfect competition and international trade: Evidence from fourteen industrial countries. Journal of the Japanese and International Economies, 1(1), 62–81.

Helpman, E., & Krugman, P. R. (1989). Trade policy and market structure. Cambridge, MA: MIT Press.

Helpman, E., Melitz, M., & Rubinstein, Y. (2008). Trading partners and trading volumes. Quarterly Journal of Economics, 123(2), 441–487.

Hummels, D. (2007). Transportation costs and international trade in the second era of globalization. Journal of Economic Perspectives, 21(3), 131–154.

Krugman, P. R. (1979). Increasing returns, monopolistic competition, and international trade. Journal of International Economics, 9(4), 469–479.

Krugman, P. R. (1980). Scale economies, product differentiation, and the pattern of trade. American Economic Review, 70(5), 950–959.

Lai, H., & Trefler, D. (2002). The gains from trade with monopolistic competition: Specification, estimation and mis-specification. National Bureau of Economic Research, Working paper series No. 9169.

Leamer, E. E., & Stern, R. M. (1970). Quantitative international economics (1st ed.). Boston: Allyn and Bacon.

Limao, N., & Venables, A. J. (2001). Infrastructure, geographical disadvantage, transport costs, and trade. World Bank Economic Review, 15(3), 451–479.

Linder, S. B. (1961). An essay on trade and transformation. Stockholm, Sweden: Almqvist & Wicksells.

Linnemann, H. (1966). An econometric study of international trade flows. Amsterdam, The Netherlands: North-Holland.

Magee, C. S. (2003). Endogenous preferential trade agreements: An empirical analysis. The B.E. Journal of Economic Analysis & Policy, 2(1), 1–19.

McCallum, J. (1995). National borders matter. American Economic Review, 85(3), 615–623.

Melitz, M. J. (2003). The impact of trade on intra-industry reallocations and aggregate industry productivity. Econometrica, 71(6), 1695–1725.

Novy, D. (2013). International trade without CES: Estimating translog gravity. Journal of International Economics, 89(2), 271–282.

Perla, J., Tonetti, C., & Waugh, M. (2015). Equilibrium technology diffusion, trade, and growth. National Bureau of Economic Research, Working paper series No. 20881.

Piermartini, R., & Yotov, Y. (2016). Estimating trade policy effects with structural gravity. School of Economics Working Paper Series. Philadelphia, PA: LeBow College of Business, Drexel University.

Redding, S. J. (2011). Theories of heterogeneous firms and trade. Annual Review of Economics, 3(1), 77–105.

Redding, S., & Venables, A. J. (2004). Economic geography and international inequality. Journal of International Economics, 62(1), 53–82.

Redding, S., & Weinstein, D. (2019). Aggregation and the gravity equation. American Economic Review Papers and Proceedings, 109, 450–455.

Sampson, T. (2016). Dynamic selection: An idea flows theory of entry, trade, and growth. Quarterly Journal of Economics, 131(1), 315–380.

Silva, J. M. C. S., & Tenreyro, S. (2006). The log of gravity. Review of Economics and Statistics, 88(4), 641–658.

Wei, S.-J. (1996). Intra-national versus international trade: How stubborn are nations in global integration? National Bureau of Economic Research, Working paper series No. 5531.

Yotov, Y. V. (2012). A simple solution to the distance puzzle in international trade. Economics Letters, 117(3), 794–798.

Notes

1. This review covers much of the evolution of the gravity equation. Like any review, the choice of topics covered is selective. The interested reader may also want to review other surveys: in particular, Head and Mayer (2014), Allen and Arkolakis (2015), Bergstrand and Egger (2011), De Benedictis and Taglioni (2010), Baier, Kerr, and Yotov (2018), and Piermartini and Yotov (2016).
2. The frictionless gravity model is derived in the next section. Leamer and Stern (1970), Anderson (1979), and Deardorff (1998) are some of the earliest theoretical contributions to derive a frictionless gravity model.
3. The correlation between the natural logarithm of expenditure shares and production shares is 0.67.
4. Conditioning on the pair being contiguous, the elasticity of trade costs with respect to distance falls to 1.12.
5. Head, Mayer, and Ries (2010) highlighted this relationship.
6. Baier et al. (2018) showed this more rigorously and developed a model of the interactions between trade agreements and other trade costs.
7. This section ignores the specific taste parameter for a country’s good ( β ), allowing one to relate factor prices to a country’s production technology.
8. It is assumed that there are no internal trade costs so that the landed price in country i is equal to the factory gate price.
9. More recently, Bertoletti, Etro, and Simonovska (2018) derived a gravity-like equation when agents have indirectly additive preferences.
10. Notable exceptions were Eaton and Tamura (1995) and Frankel (1997).
11. PPML, GPML, and NLS are in the same class of estimators commonly referred to as the generalized linear models (GLM).
12. For the reasons outlined in Cheng and Wall (2005) and Baier and Bergstrand (2007), five-year intervals are used.
13. As discussed earlier, these terms were typically approximated using “remoteness” variables that may be poor proxies for the multilateral resistance.
14. Another appealing feature of the PPML model is that the coefficient estimates on the exporter-year and importer-year fixed effects are the theoretically consistent multilateral resistance terms derived in the previous section (see Fally, 2015).
15. Egger, Larch, Staub, and Winkelmann (2011) examined the small sample properties of the different GLM estimators.
16. Eaton, Kortum, and Sotelo (2012) used a Multinomial Pseudo Maximum Likelihood estimator to estimate a gravity model using export shares. Head and Mayer (2014) found that this model performs well in the presence of zeros and when the variance of the error term is proportional to the mean. However, it performs less well when the variance is proportional to the square of the mean.
17. Baier and Bergstrand (2007) attributed this instability to the endogeneity of the trade agreements.
