Acessibilidade / Reportar erro

The provision of education and its impacts on college premium in Brazil

Abstracts

This paper tests the existence of causal relationship between the evolution of college-educated labor supply and the performance of the college premium in Brazil. Late college graduation causes sampling composition problem which may bias the impact of labor supply on the college premium. I estimate the impact of the relative supply of college-educated labor on wage, with and without controlling for the composition bias. In both cases, I find a relatively low elasticity of substitution between school groups. However, the estimate of the partial elasticity of substitution between age groups is crucially affected by the chosen estimation procedure.

college premium; pseudo panel; labor supply; elasticity of substitution


Esse artigo testa a existência de relação causal entre a evolução da oferta de trabalhadores com qualificação universitária e a performance do diferencial salarial entre trabalhadores com nível universitário e trabalhadores com nível secundário, no Brasil. Graduação tardia causa problema de composição amostral que potencialmente viesa o impacto da oferta de trabalho sobre o premio salarial. Eu estimo o impacto da oferta relativa no diferencial de salarios, com e sem controlar os efeitos da graduação tardia. Em ambos os casos, eu encontro coeficientes de elasticidade de substituição baixo entre trabalhadores qualificados e não qualificados. Contudo, o procedimento adotado afeta decisivamente conclusões sobre a elasticidade de substituição entre diferentes grupos etários.


The provision of education and its impacts on college premium in Brazil* * I thank the valuable comments of John Karl Scholz, Rodolfo Manuelli, Cristina Terra, Hugo Boff, Francisco Ferreira, Gustavo Gonzaga, Naercio Menezes, Reynaldo Fernandes, as well as other participants of seminars at the University of Wisconsin - Madison, FGV-RJ, IBMEC-SP, USP, LACEA and IPEA. The usual disclaims also apply.

Sergio Guimarães FerreiraI

IBNDES/UCAM. E-mail: sergiogu@bndes.gov.br

ABSTRACT

Summary: 1. Introduction; 2. Theoretical framework; 3. The college wage premium and the relative supplies of college workers by age; 4. The effect of cohort-specific supplies on the college wage premium of male workers; 5. Screening the college-high school ratio between age and cohort effects; 6. Conclusion.

This paper tests the existence of causal relationship between the evolution of college-educated labor supply and the performance of the college premium in Brazil. Late college graduation causes sampling composition problem which may bias the impact of labor supply on the college premium. I estimate the impact of the relative supply of college-educated labor on wage, with and without controlling for the composition bias. In both cases, I find a relatively low elasticity of substitution between school groups. However, the estimate of the partial elasticity of substitution between age groups is crucially affected by the chosen estimation procedure.

Keywords: college premium; pseudo panel; labor supply; elasticity of substitution.

JEL codes: J31; C23.

RESUMO

Esse artigo testa a existência de relação causal entre a evolução da oferta de trabalhadores com qualificação universitária e a performance do diferencial salarial entre trabalhadores com nível universitário e trabalhadores com nível secundário, no Brasil. Graduação tardia causa problema de composição amostral que potencialmente viesa o impacto da oferta de trabalho sobre o premio salarial. Eu estimo o impacto da oferta relativa no diferencial de salarios, com e sem controlar os efeitos da graduação tardia. Em ambos os casos, eu encontro coeficientes de elasticidade de substituição baixo entre trabalhadores qualificados e não qualificados. Contudo, o procedimento adotado afeta decisivamente conclusões sobre a elasticidade de substituição entre diferentes grupos etários.

1 Introduction

Education attainment of the labor force has mildly increased in the last two decades in Brazil. I am interested in testing if the increase in the number of workers with a college degree has somewhat led to a downward pressure in the college wage premium. The premium is defined as the wage difference between workers with a college degree and workers with a high school degree.

There is recent work that examines the effects of supply changes on the returns to education in the US, Canada and UK. The seminal work, by Katz and Murphy (1992) looks at movements in the college wage premium in the US from 1963 to 1987 and concludes that ''it appears to be strongly related to fluctuations in the rate of growth of the supply of college graduates''. Their model assumes perfect substitutability among workers in different age groups as long as they have the same level of education. Such an assumption allows them to construct an aggregate education index as a linear combination between the ''amount of education'' supplied by workers of different ages.

Card and Lemieux (2001) extend Katz and Murphy's model to allow imperfect substitution between workers of different ages. By measuring the impact of changes in the supply of education (of different age groups) on the college wage premium, they are able to estimate the elasticity of substitution between different age groups with the same education level. Once they estimate this parameter, they are able to calculate aggregate indexes for the total supply of high school and college workers in the labor force that take into account the imperfect substitution among age groups. Their estimates for the US suggest that the elasticity of substitution between different age groups is large but finite at 4.4, while the elasticity of substitution between college and high school labor types is in the range of 1.1 to 1.6. Results are similar for both the UK and Canada.

In Brazil, the college-high school wage gap for full-time male workers stays fairly flat from 1976 to 1984 at 2.01, then jumps to 2.11 in 1984, and finally jumps once more by 7% in 1988. After 1988, however, it has been fluctuating around 2.20, as shown in figure 1. During the same period, the number of workers with college education in the labor force has being increasing relative to the number of high school workers (figure A.1). This paper examines how changes in the supply of school-related labor has affected such pattern observed for the college premium.


The theoretical model has a production function that uses only labor as input. However, labor can be of two kinds: high school type and college type, combined under a CES technology. It is straightforward to show that the college-high school wage gap for a given age-group depends on both the aggregate relative supply of college labor in period t, and on the age-group specific relative supply of college labor.

The econometric method consists on constructing a pseudo panel of cohort groups. In the first step, I estimate the partial elasticity of substitution among age groups within each educational category in order to calculate aggregates for the two labor types. The second step consists on the estimation of the elasticity of substitution between the two labor types, college and high school types.

One important issue is how to correctly identify changes in demand for skills from changes in the supply of skills. Card and Lemieux (2001) and Katz and Murphy (1992), among others, assume that the effect of skill-biased technical change on college wage premium can be summarized by a time trend. This assumption may be questionable for the Brazilian economy, since some authors argue in favor of the presence of a structural break on the impact of demand on the relative price of skills, in the 1990s (e.g. Fernandes and Menezes (2001)).1 1 There are other studies looking at the impact of labor supply in the school-related wage differential. Fernandes and Menezes (2001) assume three alternative values for the impact of the supply of skills on the college premium, and estimate the impact of demand shocks in relative wages, finding a structural break in the 1990s - which they attribute to trade liberalization. I allow for this possibility by adding cohort dummies which can partly capture non linear variations over time in the relative demand for skills. In this paper, I get very small (statistically not significant) impact of age-specific supply on age-specific school premium when not controlling for life cycle variations in school supply. Due to a substantial presence of late graduation in the Brazilian data, there is a clear positive relationship between age and relative supply of college-type labor. Life cycle changes in the supply of schooling are shown to be positively correlated to life cycle changes in wage differentials.

Once I decompose changes in the supply of schooling between life cycle and cohort components, I find that cohort-specific relative supply is negatively correlated to schooling premium: a 10% increase in college-educated supply for a given age group will lead to a 2.2% fall on college premium for that particular age group.

The paper has six additional sections. In section 2, the theoretical model is presented. Section 3 presents the empirical procedure. Section 4 presents the selected sample, and explains how I constructed the panel of wage gaps and school-related labor indexes. In addition, it presents how the constructed series have evolved over time. Section 5 presents the result of the estimations, when I do not control for the effects of late graduation in the supply of college-type workers. In section 6, I treat the data for potential presence of life cycle bias and estimate again the partial elasticity of substitution. Section 7 concludes the paper.

2 Theoretical Framework

Aggregate output depends on two CES sub-aggregates of high school and college labor:

and

where -¥ < h < 1 is a function of the partial elasticity of substitution sA between different age groups j with the same level of education (h = 1 - 1/sA). Each age group has specific relative efficiency parameters, ajt and bjt, which vary over time. In other words, those parameters may suffer influence of cohort-specific productivity shocks (e. g. variation in school quality).2 2 In addition, such a jt and b jt may capture structural breaks in the demand for skills potentially driven, for example, by trade liberalization.

In the limiting case of perfect substitutability across age groups, h is equal to 1 and total high-school (or college) labor input is a weighted sum of the quantity of labor supplied by each age group. I assume that the aggregate production function is also CES:

where -¥ < r < 1 is a function of the elasticity of substitution sE between the two education groups (r = 1-1/sE). In this setting, the marginal product of labor for a given age-education group depends on both the group's own supply of labor and the aggregate supply of labor in its education category. Efficient utilization of different skill groups requires that relative wages are equated to relative marginal products:3 3 . Similarly, the marginal product of college workers in age group j is .

Equations (4) and (5) imply that the ratio of the wage rate of college workers in age group to the wage of high-school workers in the same age group satisfies to the following equation:

Hence, if the relative employment ratios are exogenous, equation (7) leads to a model for the observed college-high school wage gap rjt of workers in age group j in year t:

where ejt reflects sampling variation in the measured gap and/or any other sources of variations in age-specific wage premiums.4 4 If enrollment decisions are somehow influenced by expected future changes in returns to education, the weighted least square estimates of the elasticity of substitution will have a positive bias. In other words, the WLS procedure will overestimate the true elasticity of substitution parameters.

According to equation (8a), the college-high school gap for a given age group depends on both the aggregate relative supply of college labor (Ct /Ht) in period t, and on the age-group specific relative supply of college labor (Cjt /Hjt). When there is perfect substitution across age groups with the same level of education, the college premium will depend only on the aggregate relative supply of college workers, on the relative technology shock qct /qht, and on the age-cohort relative efficiency parameter bjt /ajt.

For purposes of estimation, it is convenient to rearrange equation (8a) in an alternative form:

3 Econometric Method

The primary purpose of this paper is to estimate the effect of the aggregate relative supply of college labor on the college-high school wage gap. A problem arises in the attempt to estimate Equation (8a) because aggregate supplies of the two types of labor (Ct and Ht) depend on the elasticity of substitution across age groups, according to equations (1) and (2). Following Card and Lemieux (2001), a two-step estimation provides a method for identifying both sA and sE. In the first step, sA is estimated from a regression of age-group specific college wage gaps on age group-specific relative supplies of college educated labor, cohort effects (which absorb the relative productivity effect, ln(bjt /ajt)), and time effects (which absorb the combined relative technology shock and any effect of aggregate relative supply):

where bt-j are cohort dummies and dt are time dummies. Given an estimate of 1/sA, the relative efficiency parameters ajt and bjt are easily computed by noting that equations (4) and (5) can be transformed into:

The left hand side of these equations can be computed directly using the first step estimate of 1/sA. The right hand side can be estimated by a set of time dummies (first term) and cohort dummies (relative efficiency parameters). Given estimates of ajt's, bjt's and of h, it is easy to construct estimates of the aggregate supply of college and high school labor in each year (Ct and Ht).

With these estimates in hand, and some assumptions about the time series path of the relative productivity term, log(qct /qht), equation (8b) can be estimated directly. I follow the previous mentioned literature and assume that log(qct /qht) can be represented as a linear trend.

4 The College Wage Premium and the Relative Supplies of College Workers by Age

In this section, I present a descriptive overview of trends in the college wage premium for different age groups in Brazil. I also summarize data on the relative supplies of college-educated workers by age group. The data are drawn from the Brazilian Nationwide Household Sample (PNAD), for the years 1976 to 1998.5 5 Data is not available for the following years: 1980, 1991 and 1994.

The Evolution of the College Wage Premium for Male Workers6 6 I focus exclusively on the evolution of the college wage gap for men. I believe that this focus is appropriate, given inter-cohort changes in female labor supply that have presumably affected the age profiles of earnings for women in different education groups over the past twenty years. If women in younger cohorts accumulate more actual experience per year of potential experience than older cohorts, this will increase the measured college-high school wage even if the true college premium is fixed. Secular changes in the age profile of the college-high school wage gap may, therefore, be contaminated by these composition effects. Another reason for excluding women from the calculation of the college wage gap is that the sample female hourly labor earnings are very imprecise due to substantial variations in the number of hours worked.

Figure 1 shows the evolution of the overall college premium for male workers since 1977. This premium is measured by the coefficient on a regression of log wage on a college dummy and several control variables, for every year in the sample.7 7 The control variables are: a second degree polynomial for age, five regional dummies and a dummy for self employment. Table A.1 shows the cohorts classified by birth year. Table A2 in the Appendix shows the results of the regression for each year. My estimates of the college wage premiums are based on the total labor earnings (income from all jobs) of men aged 24 to 58, living in urban areas of Brazil, who are either employed or self - employers and are working at least 40 hours in the week of reference. In addition, extremely low and extremely high hourly wage rates are dropped from the sample.8 8 Wages are deflated by the GPI-FGV index, and converted to values of 1996. In 1996 values, the upper bound from 1983 to 1998 is R$ 719.00 per hour, or R$ 28,760 a week (assuming a 40 hour week). This represents respectively the percentile: 98.43%, in 1998; 98.63%, in 1997; 98.25%, in 1996; 98.77%, in 1995; 99% in 1993; 99% in 1992; 99.25% in 1990; 99.25% in 1989; 99.11% in 1988; 99.49% in 1987; 99.44% in 1986; 99.54% in 1985; 100% in 1984 and 1983. The other upper bounds, and respective percentiles in the unrestricted distribution are, in hourly terms: R$ 190.00 (99.7%), in 1982; R$ 177.00 (99.7%), in 1981; R$ 154.00 (99.78%), 1979; R$ 262.00 (99.82%) in 1978; R$ 393.00 (99.81%) in 1977; R$ 971.00 (99.71%), in 1976. The aim of such upper boundaries was to eliminate unrealistic earnings reports. Such misreport is easily identified, since there is enormous discontinuity above the cutoff points, with the next value being as much as 100,000 larger than the chosen cut-off point. Such misreported data occurred in more than 1.75% of the sample in recent years, with a much smaller fraction for the data from the 1970s. The sample is additionally restricted by the elimination of workers earning less than R$ 6 monetary units per week (25% of the official minimum wage). The remaining sample size is 217,206 observations.The college wage gap has been widening, but not at a constant rate over time. After increasing substantially during the 1980's and reaching 0.82 in 1990, the college-high school differences of the log hourly wage has been oscillating around 0.78 during the 1990's.9 9 In 1998, a worker in the college group earned on average 2.3 times more than a worker in the high school group. The college premium increased by 15% from 1977 to 1998, but only 1.2% from 1990 to 1998.

Table 1 presents my estimates of the ''college wage premiums'' for 5-year age groups, taken at 5-year intervals over my sample period. The estimates are based on differences in mean log average hourly wages between full time workers with a complete or an incomplete college degree (i. e., at least 12 years in school) and those with a complete or an incomplete high school degree (i. e., at least 9 and at most 11 years in school).10 10 I can only identify a complete high school or college degree for years after 1992. It is possible that some of the changes in wage can be a result of composition within the education categories, but I do not have how to control for it. The wage gaps are estimated in separate regression models for each cohort in each ''year''.11 11 The ''year'' is a pool of three subsequent years. The only exception is the ''year'' 1991-93, because there was no survey in 1991. Each model includes a dummy for college degree (defined as above), and the following control variables: a linear age term, an indicator of self-employment, five regional dummies, and time dummies.12 12 The standard error in parenthesis is taken as weights when estimating the model developed in section 2.

Comparisons down a column of the table show the changing college premium for a specific age group. Older workers observe a substantial increase in the college premium over time, especially until 1990. On the contrary, the wage gap is almost the same for younger workers in 1977 and in 1998. One can say that the increase in school premium during the 1980s was pushed by senior workers (more than 34 years old).

Relative Supplies of College-educated Labor

I turn next to an overview of my estimates of the relative supplies of college-educated labor by age group and year. I estimate relative supplies from a broad sample that includes men and women. My estimates of relative supplies of different education groups are based on data for men and women aged 24 to 58, who had worked as an employee or were self-employer at least one hour in the week of reference. The sample size is 1,803,168 observations. To account for differences in the effective supply of labor by different groups, I count the number of hours worked in the week by each worker and weight these hours by the average wage (over all periods) of his (or her) education group.

Figure 2 shows that there is a clear positive trend on the proportion of workers with more than eleven years in school in the labor force, although this proportion is no bigger than 17% at the end of the sample period.


I define the amount of ''high school labor'' of age group j in year t (Hjt) as the total weekly hours13 13 I attribute 20 hours worked for individuals who worked less than 40 hours and 40 hours for those working at least 40 hours. worked by high school graduates or dropouts in that age range, plus the total hours of elementary school graduates or dropouts (weighted by their wage relative to the high school group).14 14 The first group is called ''incomplete high school'' and the second group is the ''elementary school''. Both groups are considered in the aggregate labor supply of ''high school'' workers, with their respective labor efficiency factors. The relative wage of the workers with complete or incomplete elementary school with respect to the wage of ''high school workers'' is obtained through a regression (for the whole sample period) of the log hourly real wage on a dummy for ''high school workers'', regional dummies, a dummy for self employment and a squared polynomial of age. The labor efficiency factor of the ''elementary school'' group is 0.52. Card and Lemieux (2001) use only high school graduates or dropouts in their measure of ''high school labor type''. In addition, they are able to identify high school dropouts and give respective labor efficiency weights for such group of workers, while I have to consider them with the same weight as those workers who has completed high school. The amount of ''college labor'' of age group j(Cjt) is defined as the total number of hours worked by workers with complete or incomplete college degree.15 15 Card and Lemieux (2001) adopt a similar procedure. The difference is that they can identify college dropouts and then consider the respective labor efficiency factors for that group. In addition, they consider part of the college dropout workers in the high school group, while I have to consider as college-educated workers.

In table 2, the ratio of college labor to high school labor increases for each age group, but the growth is significantly bigger for the older age groups. The college-high school labor ratio hardly changes for individuals in the younger age group, while it almost triples for some of the older age groups (e.g. individuals aged 44 to 48).

The combined evidence of the recent evolution of labor supply and school premium in Brazil shows, first, that the older age groups are both the ones which present the higher growth in college premium and the larger increase in college-type labor supply. Second, it shows that both the aggregate college-educated labor supply and the college premium increases simultaneously. From these facts, one can certainly argue that demand factors are driving the evolution of college premium. However, this does not rules out the influence of the supply side on the premium once we correctly control for the effects of demand shifts.

Moreover, the evidence on labor supply points out the presence of substantial late graduation, that is, a strong positive correlation between age and the relative supply of college-educated workers. In the next two sections, I estimate the impact of labor supply on college premium, and my estimates will crucially depend on whether using age-specific or cohort-specific relative supply of college-educated workers.

5 The Effect of Cohort-Specific Supplies on the College Wage Premium of Male Workers16 16 For all the estimations in this paper, I assume that men and women are perfectly substitutable in the labor force, with identical productivity parameters for a given age and cohort group. This assumption allows me to pool the male and female labor supply and test the impact of it on the college premium of male workers.

I now turn to the estimation of the effects of the relative supply of college educated workers on the college-high school wage gap. Table 3 presents three estimations of the partial elasticity of substitution.17 17 In column (1), the specification includes time and cohort dummies. In the second specification, the time dummies are replaced by a linear time trend. In the third specification, I use a time trend, age dummies and cohort dummies as controls. In all three cases, the impact of the age-specific relative supply of college-educated workers on the college wage gap is not statistically different from zero. This seems to show that the partial elasticity of substitution across age groups is very high.18 18 One has to be careful about the ability to identifying the age, cohort and time coefficients. Since Age = Time - Birth Year, the identification of these effects cannot be done without additional assumptions. In specification (1), cohort dummies capture both age effects and cohort-specific fixed effects of the wage gap. In specification (2), the cohort dummies will capture transitory time effects as well (for example, business cycle effects on the college premium), since the time dummies are replaced by a time trend. In specification (3), the life cycle shape of labor earnings is assumed to be fixed over time (captured by the age dummy coefficients), and cohort dummies capture changes in such shape as well as cohort-specific fixed effects and transitory time effects. Because specification (3) has the most complete set of controls, I assume 1/sA is -.11 for the purpose of estimating the cohort-specific efficiency parameters ajt and bjt.19 19 In the appendix A, table A.3 shows the estimated coefficients for all age and cohort dummies in the stage one.

Based on estimates of 1/sA, ajt and bjt, I get estimates of the aggregate relative supply index.20 20 In the appendix A, figure A.1 shows the relative aggregate college-labor supply. Table 4 presents estimates of the second stage models (based on equation (8b)) that include both age-group specific relative supplies of college labor, and the aggregate relative supply index. The relative technology shock variable (ln(qct /qht) is assumed to follow a linear trend. All the specifications include cohort and age dummies, as well as a time trend.21 21 Controlling for age dummies does not change significantly the coefficient on the aggregate index, but does change the age-specific supply coefficient, which goes from -.007 to -.063.

In column (1), I use the estimated aggregate supply of college and high school educated workers as the measure of labor supply. The estimation of equation (8b) shows that the increase in the aggregate supply of college workers substantially depressed the college premium in the last twenty years in Brazil. A 10% increase in the relative supply of college-educated workers drives down the college-high school wage differential by 5.2%, in the absence of non-neutral technology changes. This is compatible to an elasticity of substitution between college and high school workers of 1.93.22 22 Does the estimate of the elasticity of substitution between school groups change if I assume infinite partial elasticity between age groups? In columns (2), I adopt the simple aggregation of college and high school workers across age, assuming perfect substitution across age-groups, as done in Katz and Murphy (1992). In this case, the estimate of the elasticity of substitution between school groups decreases slightly to 1.67.

The time trend coefficient is 0.08 and it is significant at 5%. This means that, absent the age/cohort productivity factor and the changes in school supply, the college wage gap would have increased on average by 8% in each 5-year period.23 23 I should mention that the impact of shifts in the demand are not only captured by the deterministic time trend but by the cohort and age dummies as well. This might be an effect of skill-biased technology changes, or it may an effect of trade liberalization. Alternatively, this may be an effect of accumulation of physical capital, if capital is complementary to skilled labor and substitute to unskilled labor.

A 10% increase in the age-specific relative supply of college-educated workers decreases the college premium by only 0.6%, for that particular age-group.24 24 Controlling for age dummies changes the coefficient on the aggregate index from -.44 to -.52, but the age-specific supply coefficient goes from -.007 to -.063. This implies a very large partial elasticity of substitution of 15.9. Most of the individuals in the sample are low skill workers, performing basic tasks. One might believe that the learning-by-doing content of such jobs are limited, and job-experience should not contribute significantly for labor productivity. In that case, one should expect high substitutability between experienced workers and apprentices.

However, it remains a suspicion that life cycle variations may be creating some noise and biasing the results of OLS regression. Suppose that one believes that age variations in Cjt/Hjt are larger for cohorts which faced perspectives of larger schooling premium. For example, individuals invest in college education once they predict higher schooling premium. Hence cohorts entering in the labor force during an economic boom would show larger than average growth in Cjt/Hjt over the life cycle. In this case, the life cycle component of Cjt/Hjt would be endogenous while the cohort component of Cjt/Hjt would be a result of educational policy25 25 The assumption of an educational policy exogenous to productivity shocks is standard in such literature and I will keep it here. In the next section, I filter Cjt/Hjt from the age component in order to get some cohort-specific schooling supply.

6 Screening the College-High School Ratio between Age and Cohort Effects

I can decompose the relative supply of college-educated workers into two components: an age-specific component () and a cohort-specific component (lt-j). This specific decomposition assumes that there is an age profile that is common across different cohorts.26 26 Appendix B shows all details of the procedure. The log of the relative supply of college-educated workers will become:

The regression of ln(Cjt/Hjt) in a series of age and cohort dummies in order to find and shows that all dummies are statistically significant.27 27 Figure B.1 show plots of . The age profile of the relative supply of college-educated workers has a concave shape, having a very positive slope for initial ages. Figure B.2 shows plots of . The cohort profile shows an interesting S-shape. The relative ratio of college-educated labor doubles from cohort born in 1934-1938 and 1949-1951, and stays stable around 0.23 after then. Once I calculate , I can use it to estimate the partial elasticity of substitution between age groups, instead of using ln(Cjt/Hjt). Table 5 presents the estimation of equation (9), modified by the presence of :

This two-stage procedure leads to much stronger effect of age-specific labor supply on age-specific college premium, since it eliminates the downward bias caused by the positive correlation between fj and life cycle changes in schooling premium. An increase of 10% in the relative supply of ''college labor'' for a specific age group leads to a 2.27% fall in the college premium for that particular group, which is equivalent to a partial elasticity of substitution of 4.41 - much smaller than the one found in section 5.

I assume 1/sA = -.227, and use this value to estimate ln(bjt) and ln(ajt), respectively the age/cohort productivity factor of high-school-educated and college-educated and workers, in order to calculate the aggregate supply of college-educated and high-school-educated workers, with the respective cohort efficiency parameters.

Table 6 shows the result of the estimation of Equation (8b), analog to table 4. Using the cohort-specific measure of relative supply of college-educated workers, instead of the age/time relative supply, I find that a 10% increase in cohort supply of the college/high school ratio leads to a 2.3% decrease in cohort-specific college/high school wage gap - when controlling for age and cohort effects.28 28 The estimate does not change substantially in the specification without age dummies. The coefficient is statistically significant, which seems to confirm my suspicions that changes in the supply of college-educated workers across age (related to late graduation) seems to be causing the insignificant coefficients found in section 5.

The coefficient of the aggregate supply index is equal to -0.64, meaning that a 10% increase in the aggregate relative supply of college-educated workers leads to a 6.4% fall in the college/high school wage gap, which is equivalent to an estimate of 1.56 for the elasticity of substitution between college and high school workers. This is slightly smaller than the one found in section 5, and confirms that ''college labor'' is not highly substitutable for ''high school labor''.

Also, the presence of a positive (significantly different from zero) long run trend to higher college/high school wage gap is inferred from the linear trend coefficient of .10. The estimate is close to the one found in table 5, when the directly observed age-specific supply is used. Hence, the evidence that changes in demand for skills (caused either by trade liberalization or high skill-biased technology shocks) is robust to the specification and the variables representing the age specific supply of college workers.

7 Conclusion

In this paper, I estimate the impact on the college wage of the evolution in the relative supply of college graduate workers in the Brazilian labor force. I control for the effect of potential endogeneity in the schooling choice associated to substantial late graduation in the data. I show that life cycle variations in relative supply of college-type labor are positively correlated to life cycle variations in the college premium, while cohort variations in relative supply are negatively correlated to changes in college premium. I argue that the positive correlation may be an effect of endogeneity in the decision of human capital accumulation. Based on such possibility, I filter the changes in supply to allow only cohort-specific supply of education. My results indicate that the partial elasticity of substitution across age groups (within a given school group) is relatively low, around 4.4.

Moreover, I find that ''college-labor'' and ''high school labor'' are quite imperfect substitutes. The elasticity of substitution between college-educated and high school-educated labor is smaller than 2. Aggregate changes in education endowments may be contributing to reduce the huge income inequality in Brazil. This might be happening through the negative impact of a higher contingent in the labor force of college-educated workers on the college wage premium. During the 1990s, it has been a substantial increase in the number of college students in Brazil. Our results indicate that as they graduate and go to the labor force this should cause a substantial decrease in the college premium.

This paper was received in Apr. 2003 and approved in Feb. 2004.


Variations of the supply ratio across different ages for a given cohort does not seem to depress wage gaps. At the same time, it seems to be big enough to dominate the variations across different cohorts for a given age. In this appendix, I replace the age-specific relative supply ratio by an estimated supply ratio that is fixed for a given cohort, and re-estimate the elasticity of substitution coefficients. The main conclusion is that, absent the age profile of the college-high school ratio, cohort variations are negatively correlated to the wage gap.

I can decompose the relative supply of college-educated workers into two components: an age-specific component (fj) and a cohort-specific component (lt-j). This specific decomposition assumes that there is an age profile that is common across different cohorts. The log of the relative supply of college-educated workers will become:

Under this decomposition, lt-j is the projection of the relative supply on the cohort dummies:

Similarly, fj is the projection of the relative supply on the age dummies:

In table B.1, I regress ln(Cjt/Hjt) in a series of age and cohort dummies in order to find fj and lt-j. All dummies are statistically significant. Figure 5 show plots of fj. The age profile of the relative supply of college-educated workers has a concave shape, having a very positive slope for initial ages, as shown in Figure B.1. Figure B.2 shows plots of lt-j. The cohort profile shows an interesting S-shape. Generations born between 1941 and 1951 experiences the highest increase in the ratio between college and high school educated workers. Nonetheless, lt-j reaches the maximum value for the cohort born in 1969-73.



  • Card, D. & Lemieux, T. (2001). Can falling supply explain the rising return to college for younger men? a cohort-based analysis. Quarterly Journal of Economics, 116(2):705-746.
  • Fernandes, R. & Menezes, N. (2001). Escolaridade e demanda relativa por trabalho: Uma avaliação para o Brasil nas décadas de 80 e 90. Universidade de São Paulo.
  • Katz, L. & Murphy, K. (1992). Changes in relative wages, 1963-1987: Supply and demand factors. Quarterly Journal of Economics, 107.
  • Meguir, C. & Whitehouse, E. (1996). The evolution of wages in the United Kingdom: Evidence from micro data. Journal of Labor Economics, 14(1).
  • Menezes, N., Fernandes, R., & Piccheti, P. (1998). The distribution of male wages in Brazil: Some stylized facts. Universidade de São Paulo. mimeo.
  • Moreira, M. & Najberg, S. (1999). O impacto da abertura comercial sobre o emprego: 1990-1997. In Giambiagi, F. & Mesquita, M., editor, A Economia Brasileira Nos Anos 90. BNDES, Rio de Janeiro.

Appendix

Appendix B:  Filtering the College-High School Ratio

  • *
    I thank the valuable comments of John Karl Scholz, Rodolfo Manuelli, Cristina Terra, Hugo Boff, Francisco Ferreira, Gustavo Gonzaga, Naercio Menezes, Reynaldo Fernandes, as well as other participants of seminars at the University of Wisconsin - Madison, FGV-RJ, IBMEC-SP, USP, LACEA and IPEA. The usual disclaims also apply.
  • 1
    There are other studies looking at the impact of labor supply in the school-related wage differential. Fernandes and Menezes (2001) assume three alternative values for the impact of the supply of skills on the college premium, and estimate the impact of demand shocks in relative wages, finding a structural break in the 1990s - which they attribute to trade liberalization.
  • 2
    In addition, such a
    jt and b
    jt may capture structural breaks in the demand for skills potentially driven, for example, by trade liberalization.
  • 3
    . Similarly, the marginal product of college workers in age group
    j is
    .
  • 4
    If enrollment decisions are somehow influenced by expected future changes in returns to education, the weighted least square estimates of the elasticity of substitution will have a positive bias. In other words, the WLS procedure will overestimate the true elasticity of substitution parameters.
  • 5
    Data is not available for the following years: 1980, 1991 and 1994.
  • 6
    I focus exclusively on the evolution of the college wage gap for men. I believe that this focus is appropriate, given inter-cohort changes in female labor supply that have presumably affected the age profiles of earnings for women in different education groups over the past twenty years. If women in younger cohorts accumulate more actual experience per year of potential experience than older cohorts, this will increase the measured college-high school wage even if the true college premium is fixed. Secular changes in the age profile of the college-high school wage gap may, therefore, be contaminated by these composition effects. Another reason for excluding women from the calculation of the college wage gap is that the sample female hourly labor earnings are very imprecise due to substantial variations in the number of hours worked.
  • 7
    The control variables are: a second degree polynomial for age, five regional dummies and a dummy for self employment.
    Table A.1 shows the cohorts classified by birth year.
    Table A2 in the
    AppendixAppendix shows the results of the regression for each year.
  • 8
    Wages are deflated by the GPI-FGV index, and converted to values of 1996. In 1996 values, the upper bound from 1983 to 1998 is R$ 719.00 per hour, or R$ 28,760 a week (assuming a 40 hour week). This represents respectively the percentile: 98.43%, in 1998; 98.63%, in 1997; 98.25%, in 1996; 98.77%, in 1995; 99% in 1993; 99% in 1992; 99.25% in 1990; 99.25% in 1989; 99.11% in 1988; 99.49% in 1987; 99.44% in 1986; 99.54% in 1985; 100% in 1984 and 1983. The other upper bounds, and respective percentiles in the unrestricted distribution are, in hourly terms: R$ 190.00 (99.7%), in 1982; R$ 177.00 (99.7%), in 1981; R$ 154.00 (99.78%), 1979; R$ 262.00 (99.82%) in 1978; R$ 393.00 (99.81%) in 1977; R$ 971.00 (99.71%), in 1976. The aim of such upper boundaries was to eliminate unrealistic earnings reports. Such misreport is easily identified, since there is enormous discontinuity above the cutoff points, with the next value being as much as 100,000 larger than the chosen cut-off point. Such misreported data occurred in more than 1.75% of the sample in recent years, with a much smaller fraction for the data from the 1970s. The sample is additionally restricted by the elimination of workers earning less than R$ 6 monetary units per week (25% of the official minimum wage).
  • 9
    In 1998, a worker in the college group earned on average 2.3 times more than a worker in the high school group.
  • 10
    I can only identify a complete high school or college degree for years after 1992. It is possible that some of the changes in wage can be a result of composition within the education categories, but I do not have how to control for it.
  • 11
    The ''year'' is a pool of three subsequent years. The only exception is the ''year'' 1991-93, because there was no survey in 1991.
  • 12
    The standard error in parenthesis is taken as weights when estimating the model developed in section 2.
  • 13
    I attribute 20 hours worked for individuals who worked less than 40 hours and 40 hours for those working at least 40 hours.
  • 14
    The first group is called ''incomplete high school'' and the second group is the ''elementary school''. Both groups are considered in the aggregate labor supply of ''high school'' workers, with their respective labor efficiency factors. The relative wage of the workers with complete or incomplete elementary school with respect to the wage of ''high school workers'' is obtained through a regression (for the whole sample period) of the log hourly real wage on a dummy for ''high school workers'', regional dummies, a dummy for self employment and a squared polynomial of age. The labor efficiency factor of the ''elementary school'' group is 0.52. Card and Lemieux (2001) use only high school graduates or dropouts in their measure of ''high school labor type''. In addition, they are able to identify high school dropouts and give respective labor efficiency weights for such group of workers, while I have to consider them with the same weight as those workers who has completed high school.
  • 15
    Card and Lemieux (2001) adopt a similar procedure. The difference is that they can identify college dropouts and then consider the respective labor efficiency factors for that group. In addition, they consider part of the college dropout workers in the high school group, while I have to consider as college-educated workers.
  • 16
    For all the estimations in this paper, I assume that men and women are perfectly substitutable in the labor force, with identical productivity parameters for a given age and cohort group. This assumption allows me to pool the male and female labor supply and test the impact of it on the college premium of male workers.
  • 17
    In column (1), the specification includes time and cohort dummies. In the second specification, the time dummies are replaced by a linear time trend. In the third specification, I use a time trend, age dummies and cohort dummies as controls.
  • 18
    One has to be careful about the ability to identifying the age, cohort and time coefficients. Since Age = Time - Birth Year, the identification of these effects cannot be done without additional assumptions. In specification (1), cohort dummies capture both age effects and cohort-specific fixed effects of the wage gap. In specification (2), the cohort dummies will capture transitory time effects as well (for example, business cycle effects on the college premium), since the time dummies are replaced by a time trend. In specification (3), the life cycle shape of labor earnings is assumed to be fixed over time (captured by the age dummy coefficients), and cohort dummies capture changes in such shape as well as cohort-specific fixed effects and transitory time effects.
  • 19
    In the
    appendix AAppendix,
    table A.3 shows the estimated coefficients for all age and cohort dummies in the stage one.
  • 20
    In the
    appendix AAppendix,
    figure A.1 shows the relative aggregate college-labor supply.
  • 21
    Controlling for age dummies does not change significantly the coefficient on the aggregate index, but does change the age-specific supply coefficient, which goes from -.007 to -.063.
  • 22
    Does the estimate of the elasticity of substitution between school groups change if I assume infinite partial elasticity between age groups? In columns (2), I adopt the simple aggregation of college and high school workers across age, assuming perfect substitution across age-groups, as done in Katz and Murphy (1992). In this case, the estimate of the elasticity of substitution between school groups decreases slightly to 1.67.
  • 23
    I should mention that the impact of shifts in the demand are not only captured by the deterministic time trend but by the cohort and age dummies as well.
  • 24
    Controlling for age dummies changes the coefficient on the aggregate index from -.44 to -.52, but the age-specific supply coefficient goes from -.007 to -.063.
  • 25
    The assumption of an educational policy exogenous to productivity shocks is standard in such literature and I will keep it here.
  • 26
    Appendix B Appendix B: Filtering the College-High School Ratio shows all details of the procedure.
  • 27
    Figure B.1 show plots of
    . The age profile of the relative supply of college-educated workers has a concave shape, having a very positive slope for initial ages.
    Figure B.2 shows plots of
    . The cohort profile shows an interesting S-shape. The relative ratio of college-educated labor doubles from cohort born in 1934-1938 and 1949-1951, and stays stable around 0.23 after then.
  • 28
    The estimate does not change substantially in the specification without age dummies.
  • Publication Dates

    • Publication in this collection
      26 Aug 2004
    • Date of issue
      June 2004

    History

    • Accepted
      Feb 2004
    • Received
      Apr 2003
    Fundação Getúlio Vargas Praia de Botafogo, 190 11º andar, 22253-900 Rio de Janeiro RJ Brazil, Tel.: +55 21 3799-5831 , Fax: +55 21 2553-8821 - Rio de Janeiro - RJ - Brazil
    E-mail: rbe@fgv.br