ArticlePDF Available

Family size and schooling in Sub-Saharan Africa: Testing the quantity-quality trade-off a

Authors:

Abstract and Figures

Many family planning programs are based on the idea that small families lead to improved development outcomes, such as more schooling for children. Because of endogeneity issues, this idea is however difficult to verify. A handful of studies have made use of twin birth to deal with the endogeneity of family size. We do so for Sub-Saharan African countries. In a compilation of 86 survey rounds from 34 countries, we exploit the birth of twins to study the effect of a quasi-exogenous increase in family size on the schooling of children at the first, second and third birth order. Our findings do not support the generally assumed negative effect of family size on schooling.
Content may be subject to copyright.
1 | P a g e
Family size and schooling in Sub-Saharan Africa:
Testing the quantity-quality trade-off a
Sahawal Alidou1
Marijke Verpoorten2
This is a pre-print of the article “ Family size and schooling in Sub-Saharan Africa: Testing the
quantity-quality trade-off”. Journal: Journal of Population Economics
DOI: 10.1007/s00148-019-00730-z
Abstract
Many family planning programs are based on the idea that small families lead to improved
development outcomes, such as more schooling for children. Because of endogeneity issues, this
idea is however difficult to verify. A handful of studies have made use of twin birth to deal with
the endogeneity of family size. We do so for Sub-Saharan African countries. In a compilation of
86 survey rounds from 34 countries, we exploit the birth of twins to study the effect of a quasi-
exogenous increase in family size on the schooling of children at the first, second and third birth
order. Our findings do not support the generally assumed negative effect of family size on
schooling.
Keywords : Family size, schooling, quantity-quality trade-off, Sub-Saharan Africa
a We received much appreciated comments from Elena Briones Alonso, Philip H. Ross, Pieter Serneels, Nik Stoop and
participants at seminars, conferences and workshops in Leuven (LICOS-KULeuven seminar), Antwerp (Doctoral
Day), and Oxford (2017 CSAE conference), as well as from the Editor and two anonymous referees of this journal.
1 IOB – Institute of Development Policy and Management (University of Antwerp, Prinsstraat 13, 2000 Antwerp),
LICOS – Centre for Institutions and Economic Performance (KU Leuven), sahawal.alidou@uantwerp.be
2 Corresponding author. IOB–Institute of Development Policy and Management (University of Antwerp, Prinsstraat
13, 2000 Antwerp), marijke.verpoorten@uantwerp.be, Tel. 003232655297, Fax 003232655741
2 | P a g e
1. Introduction
Countries worldwide have devoted much effort and resources to family planning programs
(Bongaarts, 2009). Most of these programs have been voluntary, but some have left little choice to
parents, such as China’s one-child program and India’s sterilization camps. A major assumption
underlying these programs is that “a small family is a happy family” 3, or that a reduction in family
size enables families to raise investments in human capital per child, leading in its turn to a stronger
economy (Bongaarts, 2009). Intuitively the assumed causality between small family sizes and high
schooling attainments makes sense: dividing scarce resources among less children, leaves each child
with more resources.
This intuition found support in well-known social science research. Judith Blake (1989), studying
U.S. families, famously concluded that children from one- and two-child families are better
educated and more successful than children in larger families because their parents have more time
and money to invest in them. This ‘resource dilution model’ was backed up by economic theory,
in particular in a pioneering paper written by Gary Becker (1960), in which the quantity and quality
of children are modelled as substitutes from the parents’ point of view.
However, there also exist theories that support a positive causal effect of family size on children’s
schooling. These theories break with Blake’s and Becker’s assumptions that children only imply a
cost to parents, and that more children imply higher costs. As such, the quantity-quality trade-off
needs no longer hold when allowing for part-time child work (Mueller, 1984a; Marteleto and de
Souza, 2013), or when older children work to provide for the younger ones (the so-called chain
arrangement), or when there are economies of scale in raising children, with children sharing
clothes, text books, transport to school, or knowledge and skills (Guo and Van Wey, 1999;
Rosenzweig and Zhang, 2009; Steelman et al., 2002; Qian, 2009). Economies of scale can also be
3 This text featured on a 2 Rupee Coin issued by India in 1993, to promote family planning.
3 | P a g e
present in household chores, such that the time each child spends on chores reduces with the
number of siblings, thus freeing up time for school.
Despite the diversity of theoretical predictions, it is hard to move away from the idea of a negative
causal relation. An important reason for its stickiness lies in the strong negative correlation between
family size and children’s schooling, and the difficulty to empirically distinguish this correlation
from the actual causal effect of family size on schooling. To make this distinction, one needs to
purge the correlation of confounding factors. Most importantly, parents’ characteristics determine
preferences both for the number of children and their years of schooling. For instance, mothers
who enjoyed more years of schooling generally prefer smaller families and at the same time are
likely to give more importance to their children’s schooling. Other confounding factors include
wealth, social norms regarding fertility and child labor, labor market opportunities for adults and
children, and the availability and quality of old-age security schemes and education policies
(Rosenzweig and Wolpin, 1980; Angrist et al., 2010; Black et al., 2010). To the extent that these
confounding factors are not perfectly observed and controlled for, the estimated relation between
family size and children’s schooling is plagued by endogeneity issues.
In this paper, we aim to remedy this endogeneity problem in a sample of children from 208,729
households across 34 Sub-Saharan African countries. In 3,844 of these households twins were
born, causing a quasi-exogenous increase in household size. Provided controls for certain mother
characteristics (Smits and Monden, 2011; Bhalotra and Clarke, 2016)4, and for the health condition
of twins (Rosenzweig and Zhang, 2009)5, twin birth can be used as a plausibly exogenous
instrumental variable to isolate the causal effect of family size on the educational outcomes of
4 According to Smits and Monden (2011) twinning is associated with maternal age, maternal height, smoking, oral
contraceptive use, race and ethnicity.
5 Rosenzweig and Zhang (2009) point out that twins have lower average birth weight than singletons, and perhaps
worse health or cognitive achievement later on, and this may threaten the exclusion restriction if parents therefore
allocate resources away from twins, towards older singleton-birth children.
4 | P a g e
children born prior to the twin birth (Angrist et al., 2010; Conley et al, 2012). The same does not
apply for children born after the twins, because their birth can be the result of parental choice.
Concretely, in our IV approach, we look at the outcomes of first-, first- and second-, and first-,
second- and third- born children respectively in families of two or more, three or more, and four
or more children, using the birth of twins at the second, third birth and fourth order as the
instrumental variable.
Our empirical investigation adds to the body of literature that has tried to empirically unearth the
quantity-quality trade-off by relying on a number of techniques, such as tracing children’s
intellectual abilities in a longitudinal analysis of families (Guo and Van Wey, 1999), exploiting the
gradual roll-out and subsequent relaxation of family planning programs (Liu, 2014; Qian, 2009), a
randomized controlled trial in family planning (Sinha, 2005; Joshi and Schultz, 2007), and
instrumental variable approaches based on reported miscarriages (Maralani, 2008), siblings’ sex-
composition (Angrist et al., 2010; Conley and Glauber, 2006; Black et al., 2010; Fitzsimons and
Malde, 2014; Lee, 2008), and twin birth (Rosenzweig and Wolpin, 1980; Rosenzweig and Zhang,
2009; Angrist et al., 2010; Black et al., 2005, 2010; Marteleto and de Souza, 2012, 2013; Mogstad
and Wiswall, 2016; Bhalotra and Clarke, 2016). These exercises in causal identification have not
uniformly yielded negative estimates of the effect of sibship on schooling. Instead, the effect turns
out to vary over time, across regions and subpopulations, across birth order, and across the exact
outcome of interest studied (e.g. private schooling, school enrolment, educational attainment, or
IQ)6.
6 In the first study using twins as an instrument, Rosenzweig and Wolpin (1980) find that schooling levels of Indian
children decrease with exogenous increases in fertility. Consecutive studies using twins find mixed results. Caceres
(2006), relying on US Census data, finds a negative effect of family size on the probability of enrollment in private
schooling; Black et al. (2010) find that family size negatively affects the IQ of younger cohorts in Norway; and Li et al.
(2008), relying on Chinese twins for identification, find a negative effect of sibship on schooling attainments. However,
family size is found to have no effect on children’s educational attainment in Norway (Black et al., 2005) and in Israel
(Angrist et al. 2010). Furthermore, Marteleto and de Souza (2012), studying the effect of family size on adolescents’
schooling in Brazil, find a positive effect in periods and regions in the earlier stages of socioeconomic development;
5 | P a g e
None of these exercises in causal identification has however looked specifically at Sub-Saharan
African countries.7 Our paper fills this gap. There are several reasons why SSA provides an
interesting setting for such analysis. First, in SSA, the majority of households face tight budget
constraints, schooling is barely compulsory, and children’s participation in the labor market and in
time-consuming household chores is socially still largely accepted (Bass, 2004). Combined, these
features make it very likely that a household’s decision to invest in children’s formal education
involves important trade-offs. Second, in most African cultures, family members are bound to act
for the benefit of the collective, be it the nuclear family or the extended family, the clan or ethnic
group (Lloyd and Blanc, 1996). Regarding the decision to invest in schooling, this implies that the
benefits of schooling are expected to be shared giving for instance way to the ‘chain arrangement’
in which earlier-born children are sent to school and use their wage earnings to invest in their
younger siblings later on, rendering a quantity-quality trade-off superfluous (Baland et al., 2016;
Mueller, 1984b). Third, in many of the least developed regions in SSA, the quality of schooling may
be low (e.g. Chaudhury et al., 2006), or labor market opportunities may be lacking (e.g. Garcia and
Fares, 2008), both of which depress the returns to education. Hence, additional schooling may not
be a way to invest in children’s ‘quality’. Finally, SSA still is the region with the highest fertility and
lowest educational enrolments and attainments8, increasing the relevance of research on these
issues.
Our use of DHS data comes with both pros and cons. Among the pros, we count the availability
of demographic and health data of mothers, allowing us to explicitly control for factors such as
but this effects disappears for recent periods “when the opportunities for child farm work have declined, education
has expanded, and fertility has declined to below-replacement levels” (p. 1473).
7 Bhalotra and Clarke (2016) tested the quantity-quality trade-off in a large sample, with data from 72 countries among
which many Sub-Saharan African countries, but they did not specifically focus on SSA.
8 Data on 46 SSA countries reveal an average Total Fertility Rate (TFR) of 4.97 in 2014 (World Bank, 2016) and a
Gross School Enrolment rate (GSE) of 98.96 in 2013 (UNESCO, 2016). The 2014 TFR is markedly lower than in
1985 (6.62) but still much higher than the 2014 TFR in South-Asia (2.56) and in Latin-America (2.11). Similarly, the
2013 GSE has greatly improved from its level in 1985 (79.24), but still compares unfavorably with the 2013 GSE in
Southern Asia (109.06) Latin-America and the Caribbean (109.03).
6 | P a g e
ethnicity, mothers’ height and health that are likely to affect twinning. In doing so, we further purge
this instrument of potential sources of endogeneity. Second, the detailed information on children’s
health allows us to verify that parents do not allocate resources away from twins – who may suffer
from poorer health at birth - toward older singleton-birth children, thus further providing
confidence in our instrument. Third, the DHS data allows us to distinguish between three distinct
proximate causes that underlie differences in educational attainment: school enrolment, school
starting age, and dropout. Among the cons, we face the constraint that, beyond the fourth birth
order, there are insufficient observations on twin births in the DHS to provide enough power in
the first stage. Our analysis therefore only focuses on the effect of family size on outcomes of
children of the first, second and third birth order, and its findings cannot readily be generalized to
siblings at higher birth orders (Booth and Kee, 2009; Qian, 2009; Mogstad and Wiswall, 2016).
Another point of attention is that the DHS focuses on mothers of childbearing age, 15-49, such
that the observed number of children may be below the eventual number of children and the
reported level of schooling may be below the eventual schooling attainment. Consequently, the
relation that we observe between sibship and schooling captures a snapshot in time of a process in
motion, not its final stage. Furthermore, the DHS provides no systematic information on health
for children above five years old, nor information on test scores or other proxies for the quality of
schooling. We therefore limit our analysis to the quantity of schooling, measured by the number
of years of schooling. Finally, as will be explained further on, we need to address the complexity
of family structure in SSA, which includes polygamy and a non-negligible number of children living
outside the household, often with extended family.
In the next section we explain the empirical strategy. Then, we describe our data and present the
results. In our results section, we find overall no evidence of a quantity-quality trade-off. In
particular, we cannot reject the null hypothesis of no relation between family size and schooling in
7 | P a g e
the subsamples of families with two or more, or four or more children, while we find a significant
positive effect of family size on schooling in the sample of families with three or more children.
2. Empirical strategy
In our empirical analysis, we use an age-standardized z-score9 for the years of schooling as our
main outcome variable. We consider 3 sub-samples: firstborn from families with at least 2 children
(2+Sample), first- and second- born from families with at least 3 children (3+Sample) and first-,
second- and third- born from families with at least 4 children (4+Sample). We first examine the
relation between schooling and family size using ordinary least squares (OLS). Then, we
respectively use twinning at 2nd, 3rd and 4th birth to instrument the number of children in the
‘2+Sample’, ‘3+Sample’ and ‘4+Sample’. Focusing on the schooling outcomes of n-1 children born
prior to twins at nth birth, avoids selection problems that arise because “families who choose to have
another child after a twin birth may differ from families who choose to have another child after a
singleton birth” (Black et al. 2005). To ensure the validity of our instrument, we duly control for a
battery of mother-level characteristics that may affect twinning and at the same time correlate with
children’s schooling.
Concretely, the OLS specification takes the following form:
 = +    ℎ+  +  +  + +  . 1
where h indicates household, m mother, f father and i the individual child.  equals the
child’s z-score.   ℎ is a count variable that equals the total number of sons and
daughters of the household head, residing in the household; Xhmfi is a vector of child-level
9 Since the variation in completed grades across country and by child age is so large, it is unlikely to be completely
captured by country and age fixed effects. We therefore use age-standardized z-score for which the reference group
comprises of children in the same country and birth cohort. In the robustness section, we show that our results remain
stable when using completed grades as outcome variable.
8 | P a g e
characteristics that includes an indicator variable for sex, indicator variables for each age in the
range 6 to 18, the child’s birth-year and its month of birth; Xhm is the set of mother-level
characteristics including her years of schooling, age, age squared, height, religion, ethnicity10, the
total number of her children who have died and whether a child of her has died before its first
birthday11; Xhf is the set of father-level characteristics comprising his age and years of schooling; Xh
includes household’s residence area (urban/rural) and wealth quintile12. To account for within-
household correlation of the residuals, we cluster all error terms  at the DHS cluster level,
which equate villages in rural areas and city blocks in urban areas.13
The second stage of the IV specification is captured by the following equation:
 = +    ℎ
+  +  +  + +  . 2
in which family size is instrumented in the first stage equation:
  ℎ =+  +  +  +  + +  . 3
In the n+Sample, the indicator variable  equals 1 if the nth birth is a multiple birth and 0
otherwise14. The X-vectors include control variables as previously defined.  , and  are
error terms.
In our results section, we scrutinize the exclusion restriction of our instrument, relying on insights
of Altonji et al. (2005), Conley et al. (2012) and Bhalotra and Clarke (2016). In a series of nine
10 Mother’s ethnicity is country specific and generated as follows: (country code x 1000) + ethnic group code. The
insertion of the complete set of ethnicity fixed effects (as dummy variables) makes country fixed effects superfluous.
In some DHS rounds, ethnicity is omitted (e.g. Rwanda, Burundi). These rounds are omitted from our baseline results,
but included in a robustness check that uses region of residence of the household as a proxy for ethnicity.
11 Infant deaths proxy among others the reproductive health of the mother.
12 For each round, DHS separates all interviewed households into five wealth quintiles based on their wealth index.
The wealth index is a composite measure of a household's cumulative living standard. It is calculated using principal
components analysis on easy-to-collect data on a household’s ownership of selected assets, such as televisions and
bicycles; materials used for housing construction; and types of water access and sanitation facilities (Standard Recode
Manual for DHS 6, 2013).
13 The results are robust to clustering at the ethnicity or region level. The number of countries is insufficient to get a
robust covariance matrix when clustering at the country level.
14 For instance, in the 3+Sample, the indicator variable  equals 1 if the third birth is a multiple birth and 0
otherwise.
9 | P a g e
robustness checks (cf. infra), we modify this empirical framework in several ways, using alternative
samples, changing the decision unit (from household head to wife/wives), and modifying the
definition of our key variables.
In all cases, we report heteroscedasticity-robust statistics and the usual post-estimation tests
(underidentification test, weak identification test and overidentification test).
3. Data
In our empirical analysis, we rely on all DHS surveys implemented in SSA countries in the period
1990-2014, for which we could construct the main variables.15 In our baseline approach, we
consider 59 survey rounds for which information on the ethnicity of the mother is available
(Appendix 1 gives an overview of these survey rounds by country and year) and restrict the sample
to children whose siblings of schooling age (6-18) all reside within the household.16 This gives a
dataset of 456,068 siblings of schooling age (6-18), to which we will refer as ‘Sample I’.
We focus on the educational attainments of 64,339 firstborn (2+Sample); 99,875 first- and second-
born (3+Sample) and 101,848 first-, second- and third- born children (4+Sample) in the age group
6 to 18, who live with their parents. The lower-limit of 6 is the age at which many children in SSA
start primary schooling. The upper-limit of 18 is the age of secondary schooling completion,
provided a swift grade progression. We do not extend the upper-limit beyond 18 because post-
secondary education in SSA still faces important supply side constraints, and because a considerable
proportion of children above 18 live outside the household such that their schooling attainment
goes unrecorded in the DHS surveys.
15 In the period 1990-2014, 86 DHS survey rounds were administered in 34 SSA countries. For the following surveys,
we could not construct and merge the key variables because of issues with the unique identifiers of surveyed individuals:
1991 and 1998 in Cameroon, 1996-1997 in Chad, 1998-1999 in Cote d’Ivoire, 2001 in Mali and 1992 in Niger.
16 A household is defined as one person or a group of persons who usually live and eat together. This is not the same
as a family. A family includes people who are related, but a household includes any people who live together, whether
or not they are related (DHS Interviewer’s manual, October 2012).
10 | P a g e
To capture the educational attainment of these children, we look at their completed years of
schooling at the time of the survey and then construct age-standardized education z-score with
children of the same country and birth cohort as reference group. Our explanatory variable of
interest is the number of children in a household. In our baseline specification, we define this
variable as the total number of the household head’s sons and daughters residing in the household.
Our instrumental variables are the birth of twins at second, third and fourth birth order. In the
DHS birth records, there is a specific variable indicating whether a child is part of a twinship or
not. To determine birth order, we consider all children of the household head including those who
do not have their mother in the household.
The summary statistics of the principal variables in our analysis can be consulted in Table 1. The
summary statistics of the other variables can be consulted in Table A1 of the online Supplementary
Appendix.
4. Baseline Results
4.1 Baseline OLS and IV estimation
The estimation of the OLS model (Eq.1) yields a negative and significant relation between family
size and children’s schooling. As shown in Table 2, the estimated coefficient is rather small in
magnitude: one additional child is associated with a reduction of about 0.023 units of the z-score
which corresponds to 0.057 years of schooling for a child of 10 years of age.17 As this result does
not isolate a causal link between our variables of interest, we turn to our IV estimations.
The estimates of the first stage (Eq.3) are shown in the second panel of Table 3. Unsurprisingly
they indicate that twinning increases average family size. The effect of twinning ranges from an
17 The coefficients are multiplied by the standard deviation of completed years of education in Sample I (see Table 1)
and interpreted with reference to the average age of children in Sample I.
11 | P a g e
additional family size of 0.356 (in 4+Sample) to 0.509 (in 3+Sample). The coefficients are precisely
estimated, significant at 1%, and similar to the range of coefficients found in previous research.18
In all cases, the twin instrument has reassuring first stage post-estimation statistics, with the Cragg-
Donald Wald F-statistic well above 100.
In contrast to the OLS estimates that are uniformly negative in all 3 sub-samples, the second-stage
IV estimates (first panel of Table 3) indicate either no impact (in the 2+Sample and 4+Sample) or
a positive and significant effect (at 5%) of family size on education z-scores (in the 3+Sample). In
the 3+Sample, a one-unit increase in predicted family size increases the z-score by 0.097 units on
average which is the equivalent of 0.240 years of schooling for a child of 10 years of age. We
demonstrate the robustness of these IV results in Section 5, and tentatively explore plausible
explanations in Section 6.
4.2. On instrument validity
A concern when implementing the IV estimation is the violation of the exclusion restriction. The
exclusion restriction may be threatened because of the presence of confounding factors, that affect
both twinning and children’s schooling. In particular, besides a mother’s age, ethnicity and height,
also less easily measurable characteristics such as her general health condition may affect the
probability of twinning (Smits and Monden, 2011). In theory, a mother’s health behaviour, in
particular smoking and multiple-birth-enhancing fertility treatments could pose another threat to
our instrument’s validity, but in practice this threat is neutralized in SSA due to its prohibitively
high (social and monetary) cost (Inhorn, 2003).
Table 4 explores the determinants of twinning in our sample. The first column shows the results
of a regression of twinning on the mother-level characteristics included in our baseline model: her
18 For instance Bhalotra and Clarke (2016), Marteleto and de Souza (2012), Angrist et al. (2010) and Black et al. (2005).
12 | P a g e
years of schooling, age, age squared, height, ethnicity, religion, the total number of her children
who have died and a dummy variable capturing whether these children died before their first
birthday. Out of these 8 characteristics, only mother’s religion does not significantly affect the
probability of twinning at third birth. However, the overall explanatory power of the regression is
very low (R2 of 0.009). The second column adds four additional mother-level regressors - mothers’
BMI, access to prenatal health care, access to a doctor and access to a nurse.19 All four additional
regressors turn up significant while mother’s education becomes non-significant.
Based on subsamples of individuals for which all additional four control variables are available,
Table 5 shows how the estimated coefficient of interest changes when adding the additional four
control variables to our baseline controls. Using the baseline specification, the estimates are 0.003
in the 2+Sample (Panel A, column I), 0.137** in the 3+Sample (Panel B, column I) and -0.043 in
the 4+Sample (Panel C, column I). When adding the four additional mother-level controls, our
coefficient of interest only slightly decreases in all three panels, to -0.005, 0.126** and -0.061,
respectively. Thus, without the additional controls, our estimates are (slightly) biased upward,
which is expected if the included mother characteristics positively correlate with both twinning and
children’s schooling. In the spirit of Altonji et al. (2005), we however argue that, if observed
additional controls change the value of our estimated coefficients only to such a small extent, then
it is unlikely that there exist unobserved controls that would turn our results upside down.20
To further safeguard our results, we follow Conley et al. (2012) and Bhalotra and Clarke (2016) in
deriving bounds for our coefficient of interest using the ‘Plausexog’ command in Stata. To do so,
we first acquire insight into the direct effect of twinning on children’s schooling (γ) by simply
19 To measure access to prenatal care, we use the percentage of births (occurring within 5 years prior to the survey)
with prenatal care in the DHS cluster. Access to a doctor or nurse are proxied by the DHS cluster-level percentage of
births with prenatal care given by a doctor or nurse.
20 We do not formally verify this intuition, since the formalized extension by Oster (2017) only applies to OLS
estimates.
13 | P a g e
comparing education z-scores of children from twin-mothers to those of children from non-twin
mothers, controlling for the set variables in our baseline specification (see Table A2 in Appendix).
We then take the upper value of the estimated 95% confidence interval which is 0.005.21 The
standard deviation of γ (sd=0.007) is obtained from a 100 replications bootstrap estimation of γ.
Table 6 shows our bounds’ estimates of the family size effect on children’s schooling using Conley’s
Union of Confidence Interval (UCI) and Local To Zero (LTZ) approaches. The results point to a
rejection of a quantity-quality trade-off in all three samples, and a confirmation of the positive
effect of family size on children schooling in the 3+Sample.
Another potential violation of the exclusion restriction stems from parental behavior towards twins
due to twins’ lower average birth weight and the potential consequences for their health or
cognitive achievements later on (Rosenzweig and Zhang, 2009). If the future earning potential of
twins is thought to be lower, parents may divert resources from twins to singletons. Rosenzweig
and Zhang (2009) suggest to include twins in the analytical sample and to include birth weight in
the regression in order to control for this potential bias. This could indeed be a straightforward
solution, were it not that the DHS only includes birth weight for the under-five year olds. As a
second best, we verify whether this ‘diversion of resources’ hypothesis is supported in the sample
of under-five year olds.
Using information on 70,902 under-five years olds in our 59 DHS waves, we regress birth weight
(in log terms) on a twin indicator and find that twins indeed have a significantly lower average birth
weight than singletons (see column I of Table A3 in the Supplementary Appendix). 22 When looking
at the weight (column II) and body mass index percentile (column III) of under-five year olds at
21 Note that our prior value on γ is more conservative than the 0.004 used in Bhalotra and Clarke (2016). Our results
hold even with γ set at 0.008.
22 We control for several characteristics likely to affect birth weight such as a mother’s age, education, body mass, age
at first birth and ethnicity; and the child’s sex, preceding birth interval and year of birth; and the household’s residence
and wealth quintile. All error terms are clustered at the DHS cluster level.
14 | P a g e
the time of the survey, we still find that twins have significantly lower weight and body mass index (BMI)
than singletons.23 However, the estimated coefficient is smaller, indicating that the gap has become
smaller over time. In fact, when looking at the BMI, twins and singletons belong to the same decile
on average.24 The closing of the gap suggests that parents do not divert resources away from twins.
Furthermore, controlling for the entire set of age dummies, wealth quintiles, region of residence,
child sex, parental education and ethnicity fixed-effect, we find that twins enjoy as much education
as singletons (Column IV of Table A3 in the Supplementary Appendix).
4.3. Heterogeneity
As mentioned in the introduction, the studies that have set out to identify the causal relationship
between family size and schooling have produced mixed results, suggesting that the relation is
context dependent. Hence, we explore heterogeneity of our result, by running separate regressions
across subsamples with respect to gender, poverty status and region.
To explore regional variation in the estimated relation, we contrast West and Central Africa with
East and Southern Africa. This division is inspired by the regional clustering of TFR, which is on
average relatively high in West and Central Africa (5.09), and lower in East and Southern Africa
(4.45). To compare across poor and non-poor, we define households in the 1st and the 2nd asset
ownership quintiles as poor and those in the 4th and the 5th quintiles as non-poor. We discard the
3rd quintile to achieve a sharper contrast between poor and non-poor. The results are lined up in
Table 7. In our discussion below, we highlight the various significant coefficient estimates.
23 We regress BMI and BMI percentile on the twin indicator, controlling for the factors mentioned in the previous
footnote, as well as for birth weight, child age, number of months of breastfeeding, number of under-5 year olds in
the households and whether or not the mother lives with her husband.
24 The difference in BMI is measured in terms of percentiles. A child is considered underweighted if its BMI is below
the 10th percentile of the World Health Organization’s reference BMI distribution (see Cole et al, 2007). According to
this definition, 13.75% of twins are underweighted compared to only 10% of singletons. But, as shown in Table 3, on
average twins and singletons belong to the same decile.
15 | P a g e
When only looking at the schooling of boys, we find a sizeable and significantly positive coefficient
estimate in the 3+Sample (0.122**). For girls, the estimated coefficient is found to be negative and
(slightly) significant (-0.183*) in the 4+Sample. In the subsample of poor households, the estimated
coefficient is insignificant across the board. For the subsample of non-poor, we observe a
significantly positive estimate in the 3+Sample (0.139***). In the regional subsamples, the
estimated family size coefficient appears to be only slightly significant (and positive) in the
3+Sample in West and Central Africa (0.108*). It remains non-significant elsewhere.
When comparing the effect of family size on schooling across countries with persistently high
fertility25 and countries with either a low fertility or a downward fertility trend26, we find a non-
significant effect in low-fertility countries in all 2+, 3+ and 4+Samples, but a positive and slightly
significant (of 0.128*) effect in the 3+Sample of high-fertility countries.
Finally, we explore whether the relation between family size and schooling has changed over time.
To do so, we focus on the year 2000, which marked the adoption of the Education For All initiative
by 189 countries.27 We compare the effect of family size on schooling across children born prior
to 2000 and children born from 2000 onwards. In none of the samples, we find a differential effect
of family size over time, but our results indicate that the association of parents’ education, gender
and residence area with children’s educational outcomes has weakened considerably over time,
suggesting a democratization of schooling (see Table 8).
25 Burkina Faso, Burundi, Congo DR, Guinea, Malawi, Mali, Niger, Nigeria, Chad (see Lesthaeghe, 2014, p. 2)
26 Ghana, Lesotho, Liberia, Madagascar, Namibia, Rwanda, Senegal, South Africa, Swaziland, Uganda (see Lesthaeghe,
2014, p. 2)
27 The initiative aimed at a global commitment to provide quality basic education for all children, youth and adults, and
was first launched in 1990 (http://www.worldbank.org/en/topic/education/brief/education-for-all). Admittedly,
children born in 2000 reached schooling age only in 2006. On the other hand, the initiative’s adoption itself is unlikely
to have had an immediate impact on the ground. In any case, any cut-off year would be somewhat arbitrary. Choosing
slightly different cut-off years (1998 and 1999 which give almost balanced samples across cohorts) yields similar results.
16 | P a g e
In sum, in our various subsample analyses, the zero-result is confirmed for the 2+ and the
4+Sample (with the notable exception of the subsample of girls in the 4+Sample28), while the
positive result for the 3+Sample is shown to be mainly driven by children from non-poor families,
living in high-fertility countries. We will come back to this latter result in Section 6. Now we first
discuss a series of robustness checks.
5. Robustness checks
We check the robustness of our results in ten different ways. Table 9 gives a line-up of the estimated
coefficients on family size for each check in the 2+, 3+ and 4+Samples. The full results are reported
in Tables A4-A13 of the Supplementary Appendix, for the total sample, as well as for the poor and
non-poor subsamples.
In the first robustness check, we follow Angrist et al. (2010) in allowing for heterogeneity across
subsamples in the predictive power of twin birth in the first stage. We do so by including interaction
terms in our first stage regressions between twin birth and a set of indicator variables (rural, mother
is Muslim, rural West & Central Africa), thus sequentially adding the following regressors: 
 ; ∗ ℎ_ and ∗  & . The rationale for
including these regressors is that fertility is higher in rural areas (compared to urban areas), in
Muslim families, and in West and Central Africa (compared to East and Southern Africa), and the
twin instrument tends to perform less well in larger families (Angrist et al., 2010).29 When including
28 Family size might have a different impact on girls’ schooling (no impact or negative impact) because of time spent
in households chores by older daughters (Mueller, 1984; Tiefenthaler,1997) which might negatively affect their
educational achievements.
29 Also following Angrist et al. (2010), we try out a second instrument, i.e. sibship sex-composition. This alternative
instrument relies on the idea that parents prefer to have both boys and girls rather than only children of the same sex.
Hence, in the latter case, they may be more likely to have an additional. child. However, in our case, only twinning
performs well in the first stage. For instance, where the F-statistic surpasses 100 for twinning, it barely reaches 20 for
the sex composition instrument (results not reported, but available on request). We therefore focus on twinning in our
analysis.
17 | P a g e
all three interaction terms, the zero-result remains in the 2+ and 4+Samples, while we still find a
positive and significant in the 3+Sample.
In the second robustness check, we use an unrestricted specification with the ‘partially missing
instruments’ method as described in Mogstad and Wiswall (2016, p.174:175). The partially missing
instruments are constructed based on a polynomial function of mother age, mother education,
father age and father education, controlling for child characteristics, households characteristics,
mother ethnicity and her health conditions. This change in specification does not alter our results:
family size remains insignificant in the 2+ and 4+Samples and significantly positive in the
3+Sample, in particular when captured by the indicator variable ‘more than 3 children’ that
apprehends the marginal effect of moving from 3 to 4 children.
Third, instead of restricting the sample to those children that are part of households where all
children reside within the household, we expand the sample to include also children that live in
households where one or more school-aged siblings reside outside the household. This approach
yields a 2+Sample of 68,259 children ; a 3+Sample of 112,285 children and a 4+Sample of 119,544
children. We find that the coefficient on family size is still positive and slightly significant in the
3+Sample, positive and non-significant in the 2+Sample, but turns negative and slightly significant
in the 4+Sample.
Fourth, recognizing the complexity of SSA households, we change the decision unit. Among the
525,646 children in our Sample I, we count 165,418 living in polygamous households. While our
baseline approach considers the household head as the unit of decision making, in this robustness
check we assume decisions to be taken at the level of each mother. In this decentralized approach,
the number of children is defined for each of the household head’s wives as her total number of
sons and daughters living in the household. Birth order is also defined at the level of the mothers.
Doing so, we find that family size loses its significance in the 3+Sample, and remains so in the 2+
and 4+Samples.
18 | P a g e
In checks five, six and seven, we use alternative definitions of our key variables. Instead of defining
the number of children as the sons and daughters of the household head, we define them as the
total number of births given by the household head’s wives (unless the household head is female).
To reduce measurement error in our schooling variable (for instance parents reporting years in
kindergarten as schooling), education z-score are based on censored years of education.30 And,
estimates using completed grades (years of schooling) rather than educational z-scores are provided
in robustness check seven. Our findings remain similar in all three cases: no significant effect in
the 2+ and 4+Samples and a positive and significant effect in the 3+Sample.
In robustness checks eight and nine, we use region of residence of the household and country-by-
urban/rural fixed effects instead of mother’s ethnicity fixed-effects to take into account DHS
rounds in which ethnicity is not included (e.g. Rwanda, Burundi).31 In the former case, the estimated
coefficient loses significance in the 3+Sample. Apart from that, the results remain qualitatively
similar: a positive and significant coefficient in the 3+Sample and non-significant ones in the 2+
and the 4+Samples.
Finally, we use an alternative definition to discriminate between poor and non-poor families,
defining the poverty line as the average value of the wealth index in each DHS round. This last
approach confirms the positive effect observed only in non-poor families of the 3+Sample and the
absence of effect both in poor and non-poor families across 2+ and 4+Samples.
Overall, our results remain fairly robust in all three subsamples. The zero-result in the 2+Sample
remains so across all ten robustness checks while it becomes significantly negative in the 4+Sample
in only one case (inclusion of households in which some school-aged children reside outside the
30 We censor the completed years of education to the child’s age minus 6, assuming that schooling can start only from
6 years onwards since children are admitted to primary school at 6 in most of the countries (from
http://data.worldbank.org/indicator). For instance, if a 12-year old in our sample is reported to have completed 9
years of schooling, we set years of schooling to 6.
31 This yields a more balanced sample between West & Central Africa (57.80%) and East & Southern Africa (42.20%).
19 | P a g e
household). In the 3+Sample, the positive coefficient is no longer significant in only two cases
(decentralized approach and region of residence of the household instead of mother’s ethnicity
fixed effect). When running the robustness checks on the non-poor sample, the estimated
coefficient on family size in the 3+Sample remains positive and significant across the board.32
Taken together, these results bolster the case against a quantity-quality trade off in SSA, when
quality is measured as educational attainment. At the same time, however, we note the
heterogeneity of coefficient estimates, not only across gender, asset wealth and region, but also
across the 2+, 3+ and 4+Samples. Whether or not this heterogeneity is a statistical artefact needs
to be determined by future work.
6. Mechanisms
To further guide future work we fully exploit the DHS data, in order to provide some cues for the
possible mechanisms underlying our results and their heterogeneity
First, we use the DHS data to distinguish between three proximate causes that underlie differences
in educational attainment: school enrolment, school starting age and dropout. We explore these
proximate causes in all three analytical samples, by estimating (Eq. 2) and (Eq.3) with school
enrolment, school starting age and dropout as dependent variables. Table 10 lines up the
coefficients of interest. The full results are given in Tables A14-A20 of the Supplementary
Appendix.
When using the total sample and the subsample of poor households, we find a zero-effect of family
size on enrolment, dropout and school starting age in the 2+, 3+ and 4+ Sample, with one
exception (in the total 3+Sample, the dropout of second born is slightly reduced). For the
32 In the case of the 4+Sample, we obtain a negative and slightly significant (at 10%) coefficient in the subsample of
non-poor households when using censored education z-scores (Panel C of Table A9) and when using the household’s
region of residence instead of mother’s ethnicity (Panel C of Table A11).
20 | P a g e
subsample of non-poor households, we find various significant coefficients. In the 2+Sample,
firstborn’s school starting age is reduced (-0.499*) with an exogenous increase in family size. In the
3+Sample, it are second born that seem to start school earlier on. A closer examination of this
effect reveals that it is largest and significant when the second born is relatively close in age with
the first born (3 years apart or less33) (see column III Table 10). In the 4+Sample, we find that
family size reduces the probability of enrolment of the second born (-0.072*). For the third born,
results show a reduction in the probability of dropout (-0.076*)34.
Overall, this tentative exploration of the proximate causes suggests that, in response to an
exogenous increase in family size at birth order n, relatively small and wealthy households tend to
send the n-1th child earlier to school, a finding that is not replicated in poor households. This could
indicate that, when financially possible, some households opt to speed the schooling of earlier born
children upon twin birth. Whether this finding can be broadly replicated, and whether it is explained
by an attempt on the part of parents to maximize economies of scale35, or simply to relieve the
caregiver so s/he can focus on younger siblings, is a question left for future research. The non-
linearity of economies of scale (see e.g. Holmes and Tiefenthaler, 1997, Tiefenthaler, 1997) together
with the negative effect of reduced care time on children cognitive abilities (Lehmann et al, 2018)
may account for the heterogeneity across the 2+, 3+ and 4+ samples.
Next to economies of scale, the introduction of the article mentioned three other mechanisms that
could explain the absence of a quantity-quality trade-off, or even a positive effect of family size on
schooling: child labor, the chain arrangement, and support from the extended family. While we do
33 This result remains significant but is lower when considering an age difference of ‘≤4 years’, but it loses significance
when the age difference is 2 years or less, both in the 3+ and 2+Samples.
34 Enrolment, school starting age and dropout of first born in the 4+Sample are not affected by family size (results
not reported).
35 When sharing books, tutoring or transport to school, the per child cost of schooling declines when two or more
children can be sent to school simultaneously (Qian, 2009).
21 | P a g e
not have the data to explore the likelihood of the latter two mechanism, we tentatively discuss (and
dismiss) the role of child labor.
Child labor, both at home and in the labor market,36 still is common in many SSA countries, but
the group of children that are working is increasingly made up of children who combine part-time
employment and schooling (Guarcello et al., 2015). The combination of work and schooling may
allow for child labor to contribute to schooling rather than crowd it out, by providing resources
for schooling fees, their own as well as those of their siblings. Should child labor explain the positive
effect of family size in the 3+Sample, we would however expect the effect to be larger in poor
households and lower in non-poor ones, given that the latter rely less on resources provided by
children. Instead, we find the reverse. To test more formally for the child labor mechanism, we
exploit the available child labor information in a subset of the DHS surveys.37 If child labor
contributes to schooling and explains the positive effect (or zero-effect) of family size, the
estimated coefficient on family size would be reduced after controlling for child labor in our model.
Results in Table A21 of the Supplementary Appendix show that, rather than attenuating the
positive effect of family size, the inclusion of child labor (both own labor of the labor of its siblings)
slightly reinforces the positive effect in 3+Sample while it leaves the estimated coefficients in the
2+ and the 4+Samples almost unaltered.
7. Conclusion
The aim of this study was to test the quantity-quality trade-off in SSA, focusing on children’s
schooling. To do so, we investigated how an increase in family size affects schooling using twin
36 In our analysis child labor is broadly defined as the sum of time spent in household chores, number of working
hours for a family member (including him/her self) or for someone outside the household. In an alternative definition,
we exclude household chores; our findings remain robust to this change.
37 In 21 surveys of 15 countries, information on children’s time allocation is provided (the rounds and countries are
listed in Appendix 1). The average total hours worked by a child, including time spent on household chores, is 9.78
per week and is significantly larger in poor families in a sample mean t-test: 11.73 hours compared to 7.59 in richer
ones.
22 | P a g e
birth as an instrumental variable to deal with endogeneity issues. Overall, we find no significant
effect of family size on children’s schooling, thus casting doubt on the generally assumed negative
relation between family size and schooling.
In the subsample of first and second born from relatively rich households with three or more
children, we find a positive effect of family size on schooling and this effect survives various
robustness checks. To exclude that this result is a statistical artifact, its replication in other samples
is required.
In a tentative exploration of the underlying mechanisms, it emerges that upon a fertility shock at
the nth birth, relatively small and rich families tend to send their n-1th born to school relatively early
on. Doing so may optimize economies of scale in schooling and/or relieve the caregiver, although
both the replicability of the finding and its explanation need further study.
Exploring heterogeneity of our results, we find that the effect of family size does not vary
substantially across time. This in sharp contrast to the role of other factors - parental education,
gender, residence area (urban/rural) that significantly decreased over time, in line with the
‘democratization’ of education. Only in the 3+Sample we find a regional difference: family size
positively impacts children’s schooling in countries with persistently high fertility while we find no
effect in countries with low or declining fertility.
Our research suffers from a number of limitations. First, the positive impact of family size on
schooling of first and second born children in the 3+Sample cannot readily be generalized to higher
parity children, which is clear from our results in the 4+Sample. Second, the DHS provides only a
snapshot in time of children whose mothers are of childbearing age (15-49). The number of
children observed as well as their schooling attainment reflect therefore only an intermediate
situation, not the final one. This leaves open the possibility that, in the longer run, the positive
effect of family size on schooling in non-poor households with 3 or more children (driven by early
23 | P a g e
enrolment of the second born) fades away. Third, the available data are not well suited to
distinguish between competing underlying mechanisms to the differential effect of family size
across samples. The short-term horizon does not allow for explicitly testing of the chain
arrangement. And, lacking more detailed data on household consumption and transfers, we cannot
thoroughly test for the economies of scale and extended family mechanisms. Finally, the number
of years of schooling is only one way in which parents can invest in their children. Important
omitted dimensions include the quality of schooling, and health care.
These gaps should be filled by future research, relying on other types of data such as pooled census
data in which families and their split-offs are traced over time, and micro-economic surveys that
provide detailed information on household members’ consumption of schooling inputs and their
time allocation, as well as surveys that include more information on health and the quality of
schooling of a children in various age cohorts. A more open line of questioning, in qualitative
research, could also reveal the reasoning underlying parent’s decision making.
24 | P a g e
Compliance with Ethical Standards:
No conflict exists. The authors declare that they have no conflict of interest.
25 | P a g e
References
Altonji JG, Elder TE, Taber CR (2005) Selection on Observed and Unobserved Variables:
Assessing the Effectiveness of Catholic Schools. Journal of Political Economy, 113: 151–184.
Angrist JD, J-S Pischke (2009 Mostly Harmless Econometrics: An Empiricist’s Companion, 1
edition. ed. Princeton, NJ: Princeton University Press.
Angrist J, Lavy V, Schlosser A (2010) Multiple experiments for the causal link between the quantity
and quality of children. Journal of Labor Economics, 28(4), pp.773-824.
Baland JM, Bonjean I, Guirkinger C, Ziparo R (2016) The economic consequences of mutual help
in extended families. Journal of Development Economics, 123, pp. 38-56
Bass LE (2004) Child labor in sub-Saharan Africa. Lynne Rienner Publishers.
Becker GS (1960) An Economic Analysis of Fertility. In: National Bureau of Economic Research
(ed.). Demographic and Economic Change in Developed Countries. Princeton: Princeton
University Press: 209-231.
Bhalotra S R, Clarke D (2016) The twin instrument. IZA Discussion Paper No. 10405.
Black SE, Devereux PJ,. Salvanes KG (2005) The more the merrier? The effect of family
composition on children’s education. Quarterly Journal of Economics, 120, pp. 669–700.
Black SE, Devereux PJ, Salvanes KG (2010) Small family, smart family? Family size and the IQ
scores of young men. Journal of Human Resources, 45(1), pp.33-58.
Blake J (1989) Family Size and Achievement. Berkeley: Univ. Calif. Press
Bongaarts J (2009) Human population growth and the demographic transition. Philosophical
Transactions of the Royal Society of London B: Biological Sciences, 364(1532), pp.2985-2990.
26 | P a g e
Booth AL, Kee HJ (2009) Birth order matters: the effect of family size and birth order on
educational attainment. J Popul Econ 22:367–397
Cáceres-Delpiano J (2006) The impacts of family size on investment in child quality. Journal of
Human Resources, 41(4), 738-754.
Chaudhury N, Hammer J, Kremer M, Muralidharan K, Rogers FH (2006) "Missing in Action:
Teacher and Health Worker Absence in Developing Countries." Journal of Economic
Perspectives, 20 (1): 91-116.
Cole T J, Flegal K M, Nicholls D, Jackson AA (2007) Body mass index cut offs to define thinness
in children and adolescents: international survey. BMJ: British Medical Journal, 335(7612), 194.
http://doi.org/10.1136/bmj.39238.399444.55
Conley D, Glauber R (2006) Parental educational investment and children’s academic risk
estimates of the impact of sibship size and birth order from exogenous variation in fertility. Journal
of Human Resources, 41(4), pp.722-737.
Conley, T. G., Hansen, C. B., Rossi, P. E. (2012) Plausibly exogenous. Review of Economics and
Statistics, 94(1), 260-272.
Deaton A, Paxson C (1998) Economies of scale, household size, and the demand for food. Journal
of Political Economy, 106(5), pp.897-930.
Fitzsimons E, Malde B (2014) Empirically probing the quantity–quality model. J Popul Econ 27:
33-68.
Foster A, Roy N (1997) The dynamics of education and fertility: Evidence from a family planning
experiment. Unpublished manuscript.
27 | P a g e
Garcia M, Fares J (2008) Youth in Africa's labor market (English). Directions in development ;
human development. Washington, DC: World Bank. Gibbs, B., Workman, J., and Downey D
(2016) The (Conditional) Resource Dilution Model:State- and Community-Level Modifications.
Demography (2016) 53:723–748
Guarcello L, Lyon S, Valdivia C (2015) Evolution of the relationship between child labour and
education since 2000 Evidence from 19 developing countries. Working Paper March 2015.
Understanding Children’s Work (UCW) Programme.
Guo G, Van Wey L (1999) Sibship size and intellectual development: is the relationship causal?
American Sociological Review 64:169–87.
Holmes, J., & Tiefenthaler, J. (1997). Cheaper by the dozen? The marginal time costs of children
in the Philippines. Population Research and Policy Review, 16(6), 561-578.
Inhorn MC (2003) Global infertility and the globalization of new reproductive technologies:
illustrations from Egypt. Social science & medicine, 56(9), pp.1837-1851.
Joshi S, Schultz TP (2007) Family planning as an investment in development: evaluation of a
program's consequences in Matlab, Bangladesh. Yale University Economic Growth Center
Discussion Paper, (951).
Lee J (2008) Sibling size and investment in children’s education: an Asian instrument. J Popul Econ
21: 855–875
Lehmann, J. Y. K., Nuevo-Chiquero, A., & Vidal-Fernandez, M. (2018). The early origins of birth
order differences in children’s outcomes and parental behavior. Journal of Human Resources, 53(1),
123-156.
28 | P a g e
Lesthaeghe R (2014) The fertility transition in sub-Saharan Africa into the 21st century. Ann Arbor,
MI: University of Michigan.
Li H, Zhang J, Zhu Y (2008) The quantity-quality trade-off of children in a developing country:
Identification using Chinese twins. Demography, 45(1), pp. 223-243.
Liu H (2014) The quality–quantity trade-off: evidence from the relaxation of China’s one-child
policy. J Popul Econ 27: 565-602.
Lloyd CB, Blanc AK (1996) Children's schooling in sub-Saharan Africa: The role of fathers,
mothers, and others. Population and development review, 22(2), pp.265-298.
Maralani V (2008) The changing relationship between family size and educational attainment over
the course of socioeconomic development: Evidence from Indonesia. Demography, 45(3), pp.693-
717.
Marteleto LJ, de Souza LR (2012) The changing impact of family size on adolescents’ schooling:
Assessing the exogenous variation in fertility using twins in Brazil. Demography, 49(4), pp.1453-
1477.
Marteleto LJ, de Souza LR (2013) The implications of family size for adolescents' education and
work in Brazil: Gender and birth order differences. Social forces, 92(1), pp.275-302.
Mogstad M, Wiswall M (2016) Testing the quantity–quality model of fertility: Estimation using
unrestricted family size models. Quantitative Economics, 7, pp. 157–192.
Mueller E (1984a) The value and allocation of time in rural Botswana. Journal of Development
Economics, 15(1), pp.329-360.
29 | P a g e
Mueller E (1984b) Income, aspirations, and fertility in rural areas of less developed countries. In
W. A. Schutjer & C. Shannon Stokes (Eds.), Rural development and human fertility (pp. 121–150).
New York: Macmillan.
Narayan D, Pritchett L. (1999) Cents and Sociability: Household Income and Social Capital in
Rural Tanzania, Economic Development and Cultural Change, 47(4): 871-897.
Oster E (2017) Unobservable selection and coefficient stability: Theory and validation. Journal of
Business Economics and Statistics. Forthcoming.
Qian N(2009) Quantity-quality and the one child policy: The only-child disadvantage in school
enrollment in rural China (No. w14973). National Bureau of Economic Research.
Rosenzweig MR, Wolpin KI (1980) Testing the quantity-quality fertility model: The use of twins
as a natural experiment. Econometrica: journal of the Econometric Society, pp.227-240.
Rosenzweig MR, Zhang J (2009) Do population control policies induce more human capital
investment? Twins, birth weight and China's “one-child” policy. The Review of Economic
Studies, 76(3), pp.1149-1174.
Sinha N (2005) Fertility, child work, and schooling consequences of family planning programs:
Evidence from an experiment in rural Bangladesh. Economic Development and Cultural
Change, 54(1), pp.97-128.
Smits J, Monden C (2011) Twinning across the developing world. PLoS One, 6(9), p.e25239.
Steelman LC, Powell B, Werum R, Carter S (2002) Reconsidering the effects of sibling
configuration: Recent Advances and Challenges. Annual Review of Sociology 28:243–69.
30 | P a g e
Tiefenthaler, J. (1997). Fertility and family time allocation in the Philippines. Population and
Development Review, 377-397.
UIS Stat. UNESCO. 2016. http://data.uis.unesco.org
World Bank (2016) World Development Indicators.
http://databank.worldbank.org/data/reports.aspx?source=world-development-indicators
Accessed 6 February 2016.
31 | P a g e
Tables
Table 1: Sample means and proportions of main variables
Variables SAMPLE I 2+SAMPLE 3+SAMPLE 4+SAMPLE
Number of children 5.30 3.86 4.39 5.14
(2.54) (1.53) (1.46) ( 1.43)
Child completed years of schooling 2.05 2.47 2.29 2.31
(2.47) (2.67) (2.52) (2.55)
Child Education Z-score 0.00 0.156 0.104 0.086
(1.00) (1.04) (1.02) (1.02)
Child age 10.09 10.38 10.19 10.44
(3.04) (3.11) (2.94) (2.96)
Child sex
Male 51.76 51.03 51.36 51.68
Female 48.24 48.97 48.64 48.32
Month of birth 6.05 6.05 6.05 6.06
(3.33) (3.36) (3.34) (3.33)
Mother age 35.39 30.17 30.91 32.05
(6.31) (4.91) (4.74) (4.66)
Mother education (single years) 2.65 3.61 3.20 2.80
(3.84) (4.42) (4.18) (3.90)
Mother religion
Mother is muslim 43.49 38.53 40.33 42.22
Mother is not muslim 56.51 61.47 59.67 (57.78)
Father age 44.73 38.18 39.23 40.79
(9.43) (7.76) (7.79) (7.88)
Father education (single years) 4.16 5.22 4.83 4.44
(4.85) (5.20) (5.08) (4.93)
Residence
Urban 26.39 31.98 29.32 26.93
Rural 73.61 68.02 70.68 73.07
Wealth quintile
Poorest 25.00 22.10 23.62 25.36
Poor 21.21 19.58 20.24 21.05
Middle 19.77 18.76 19.38 19.56
Rich 18.03 18.60 18.39 18.09
Richest 16.00 20.97 18.37 15.94
Twin birth
Twin at 2nd birth 1.10 1.37 1.24 1.13
Twin at 3rd birth 1.53 1.60 1.61 1.71
Twin at 4th birth 1.64 1.88 1.82 1.93
Twin at 5th birth 1.89 2.28 2.16 2.12
N (number of observations)
456,068 64,339 99,875 101,848
Source : Authors, based on data from 59 DHS rounds. The twin indicators are expressed in percentages of the total number of households,
while the other percentages relate to the total number of children. Sample I includes all children of schooling age (6-18). ‘N+Sample’ is composed
of lower birth order children from families with N or more children, whose siblings of schooling age (6-18) all reside within the household.
32 | P a g e
Table 2: OLS estimates
DEPENDENT VARIABLE : EDUCATION Z-SCORE
2+SAMPLE 3+SAMPLE 4+SAMPLE
Number of children -0.023*** -0.024*** -0.023***
(0.003) (0.003) (0.003)
Clusters 20,339 20,117 18,307
Observations 64,339 99,875 101,848
R-squared 0.398 0.375 0.363
*** p<0.01, ** p<0.05, * p<0.1
Note.— 2+Sample includes first born from families with two or more children; 3+Sample includes first and second-born from families
with three or more children and 4+Samples comprises first, second and third-born from families with four or more children. Control variables
include child-level characteristics (sex, age, birth-year and month of birth), mother-level characteristics (mother’s education, mother’s age,
mother-age squared, mother height, mother ethnicity, her religion, the number of her children who have died and an indicator variables taking
one if any of the mother’s children died before their first birthday), father-level characteristics (his age and education) and household-level
characteristics (residence area and wealth quintile). Standard errors are clustered at the DHS-Cluster level and reported in parentheses.
33 | P a g e
Table 3: IV estimates of the effect of family size on children schooling
2+SAMPLE 3+SAMPLE 4+SAMPLE
2ND STAGE
DEPENDENT VARIABLE : EDUCATION Z-SCORE
Number of children 0.019 0.097** -0.060
(0.055) (0.047) (0.064)
Clusters 20,339 20,117 18,307
Observations 64,339 99,875 101,848
R-squared 0.339 0.297 0.304
1ST STAGE
DEPENDENT VARIABLE : NUMBER OF CHILDREN
Twin at 2nd birth 0.503***
(0.042)
Twin at 3rd birth
0.509***
(0.040)
Twin at 4th birth
0.356***
(0.038)
F-statistic (excluded instrument) 142.21 160.71 85.41
Under identification test p-value 0.000 0.000 0.000
Weak identification test:
Cragg-Donald Wald F 143.41 259.91 136.89
(Stock-Yogo critical values) (16.38) (16.38) (16.38)
Ho: equation is weakly identified
*** p<0.01, ** p<0.05, * p<0.1
Note.— 2+Sample includes first born from families with two or more children; 3+Sample includes first and second-born from families
with three or more children and 4+Samples comprises first, second and third-born from families with four or more children. Control variables
include child-level characteristics (sex, age, birth-year and month of birth), mother-level characteristics (mother’s education, mother’s age,
mother-age squared, mother height, mother ethnicity, her religion, the number of her children who have died and an indicator variables taking
one if any of the mother’s children died before their first birthday), father-level characteristics (his age and education) and household-level
characteristics (residence area and wealth quintile). Standard errors are clustered at the DHS-cluster level and reported in parentheses.
34 | P a g e
Table 4: Determinants of twinning
DEPENDENT VARIABLE: PROBABILITY OF TWINNING
(I) (II)
With H+E With H+E+AC
Mother age 0.223*** 0.208***
(0.062) (0.062)
Mother age squared -0.002** -0.002*
(0.001) (0.001)
Mother completed years of education 0.037** 0.012
(0.015) (0.015)
Mother height 0.005*** 0.005***
(0.001) (0.001)
Number of dead children 0.478*** 0.497***
(0.056) (0.056)
Infant death 0.453*** 0.452***
(0.127) (0.127)
Mother is Muslim -0.136 -0.153
(0.145) (0.145)
Mother BMI - 0.000*
(0.000)
Access to prenatal care - 0.450*
(0.234)
Access to a doctor - 0.613*
(0.326)
Access to a nurse - 0.454**
(0.199)
Clusters 121,597 121,597
Observations 250,837 250,837
R-squared 0.009 0.009
*** p<0.01, ** p<0.05, * p<0.1
Note.—This table shows linear probability estimates of twinning. The sample includes all children in Sample I. H+E stands for Mother’s
health characteristics and her ethnicity for which we control for in our baseline specification: her years of education, age, age squared, height,
ethnicity, religion, number of her children who have died, whether children died before their first birthday. AC stands for four additional
controls: mother’ BMI, access to prenatal health care, access to a doctor and access to a nurse. We additionally control for mother ethnicity
and the year in which the birth occurred. Robust standard errors are clustered at the mother level and reported in parentheses.
35 | P a g e
Table 5: Stability of the estimated coefficient of interest and post-estimation statistics when expanding the set of mother-level control variables
DEPENDENT VARIABLE : EDUCATION Z-SCORE
(I) (II)
With H+E With H+E +AC
PANEL A: 2+SAMPLE
Number of children 0.003 -0.005
(0.063) (0.063)
Clusters 16,935 16,935
Observations 42,789 42,789
R squared 0.345 0.337
Cragg-Donald Wald F 102.09 103.92
(Stock-Yogo critical values) (16.38) (16.38)
PANEL B: 3+SAMPLE
Number of children 0.137** 0.126**
(0.056) (0.054)
Clusters 16,940 16,940
Observations 66,654 66,654
R squared 0.273 0.287
Cragg-Donald Wald F 178.99 182.76
(Stock-Yogo critical values) (16.38) (16.38)
PANEL C: 4+SAMPLE
Number of children -0.043 -0.061
(0.073) (0.071)
Clusters 14,996 14,996
Observations 68,144 68,144
R squared 0.304 0.309
Cragg-Donald Wald F 93.97 97.56
(Stock-Yogo critical values) (16.38) (16.38)
*** p<0.01, ** p<0.05, * p<0.1
Note : H+E stands for Mother’s health characteristics and her ethnicity for which we control in our baseline specification: her years of
education, age, age squared, height, ethnicity, religion, number of her children who have died and whether children died before their first
birthday. AC stands for five additional controls: mothers’ BMI, access to prenatal health care, access to a doctor and access to a nurse. In
Col. I we control H+E in Col. (II) and H+E and AC in Col. (III). Robust standard errors are clustered at the DHS-cluster level and
reported in parentheses. 2+Sample, 3+Sample and 4+Sample are here restricted to subsamples for which the four additional control
variables (AC) are available.
36 | P a g e
Table 6: Bounds’ estimates of family size effect on children schooling using Conley’s UCI and LTZ approaches
Union of confidence interval (UCI) Local to zero (LTZ)
Lower bound Upper bound Lower bound Coefficient Upper bound
2+SAMPLE -0.108 0.127 -0.109 0.020 0.149
3+SAMPLE -0.002 0.176 0.017 0.119** 0.221
4+SAMPLE -0.196 0.048 -0.162 -0.033 0.095
Note : The bound estimates are derived using the ‘Plausexog’ command in Stata and are based on the prior that being from a twin family
has a direct effect γ = 0.005 on educational outcomes (which for UCI bounds, is more conservative compared to the 0.004 used in Bhalotra
and Clarke, 2016 for developing countries). The UCI bounds are derived based on gamma-min =0.000 and gamma-max =0.010 while
the LTZ bounds are derived based on γ = 0.005 with a sd of 0.007 (the sd results from 100 replications bootstrap estimations and we
perform test of normal distribution of γ; see details in Table A3 in Appendix). Since the LTZ does not allow for factor variables, we exclude
mother’s ethnicity from the equation and all other variables enter as continuous variables. When applied to the baseline specification, this does
not qualitatively change our results.
37 | P a g e
Table 7: Subsample analyses with respect to gender, asset wealth and region
DEPENDENT VARIABLE: EDUCATION Z-SCORE
Only Boys
Only Girls
Only children
from poor
families
Only children
from non-poor
families
Only children living
in West & Central
Africa
Only children
living in East
& Southern
Africa
High fertility
Low Fertility
PANEL A: 2+SAMPLE
Number of children
0.042
(0.077)
-0.023
(0.079)
-0.094
(0.072)
0.020
(0.095)
0.050
(0.076)
-0.059
(0.066)
0.026
(0.103)
0.244
(0.225)
Clusters
15,963
15,825
11,080
11,287
13,549
6,790
7,901 2,924
Observations
32,831
31,508
26,814 25,458
44,276
20,063 30,767 6,960
R-squared
0.325
0.354
0.150 0.385
0.326
0.387 0.360 0.255
Cragg-Donald Wald F 72.25 69.98 57.60
60.97
71.19
109.35
39.38
15.07
PANEL B: 3+SAMPLE
Number of children
0.122**
(0.060)
0.064
(0.065)
0.060
(0.085)
0.139**
(0.061)
0.108*
(0.060)
0.075
(0.072)
0.128*
(0.067)
0.058
(0.140)
Clusters
17,404
17,240
11,286
11,020
13,554
6,563
7,934 10,597
Observations
51,297
48,578
43,802 36,717
69,185
30,690 48,672 2,847
R-squared
0.282
0.315
0.150 0.333
0.286
0.347 0.306 0.326
Cragg-Donald Wald F 135.96 120.22 59.51
175.07
136.81
175.96
110.79
28.75
PANEL C: 4+SAMPLE
Number of children
0.031
(0.072)
-0.183*
(0.102)
-0.076
(0.108)
-0.132
(0.087)
-0.066
(0.084)
-0.046
(0.088)
-0.041
(0.101)
0.107
(0.283)
Clusters
15,994
15,900
10,314
9,372
12,518
5,789
7,546 2,039
Observations
52,637
49,211
47,269 34,662
71,406
30,442 50,298 3,886
R-squared
0.298
0.273
0.146 0.337
0.306
0.331 0.338 0.290
Cragg-Donald Wald F 95.68 120.22 33.06
83.93
67.14
124.04
45.44
28.75
*** p<0.01, ** p<0.05, * p<0.1
Note.— The control variables are as specified in the note of Table 3. Robust standard errors are clustered at the DHS-cluster level and reported in parentheses. Low and High fertility countries are as defined
in Lesthaeghe (2014). Stock-Yogo critical value is 16.38 in all columns.
38 | P a g e
Table 8: Effect of family size across older and younger cohorts
DEPENDENT VARIABLE: EDUCATION Z-SCORE
2+SAMPLE
3+SAMPLE
4+SAMPLE
Number of children
-0.067
(0.074)
0.079
(0.070)
-0.138
(0.091)
Number of children * young cohort
0.
22
2
(0.170)
0.030
(0.120)
0.152
(0.149)
Female
-
0.0
18
***
(0.008)
-
0.029***
(0.007)
-
0.040***
(0.007)
Female * young cohort
0.0
4
2***
(0.014)
0.045***
(0.011)
0.028***
(0.011)
Mother’s completed years of education
0.036***
(0.004)
0.040***
(0.003)
0.032***
(0.004)
Mother’s completed years of
education
* young cohort
-
0.003
(0.006)
-
0.007*
(0.004)
-
0.003
(0.006)
Father’s completed years of education
0.03
2
***
(0.001)
0.034***
(0.001)
0.033***
(0.002)
Father’s completed years of
education* young cohort
-
0.00
4*
(0.002)
-
0.005*
(0.002)
-
0.002
(0.003)
Rural
-
0.1
64
***
(0.021)
-
0.206***
(0.017)
-
0.175***
(0.021)
Rural * young cohort
0.0
81
***
(0.025)
0.091***
(0.019)
0.076***
(0.020)
Wealth quintile
0.116
***
(0.007)
0.122***
(0.005)
0.124***
(0.005)
Wealth quintile * young cohort
0.048
**
(0.020)
0.023**
(0.012)
0.026***
(0.010)
Clusters
20,339
20,117
18,307
Observations
64.339
99,875
101,848
R-squared
0.
3
2
8
0.301
0.292
Cragg-Donald Wald F
31.40
58.73
33.02
(Stock-Yogo critical values)
(7.03)
(7.03)
(7.03)
*** p<0.01, ** p<0.05, * p<0.1
Note.— This table shows the effect of the number of children and other relevant variables on children’s schooling for children belonging to a
young (born after 2000) and an older cohort (born before 2000). Control variables include child-level characteristics (age, birth-year and
month of birth), mother-level characteristics (mother’s age, mother-age squared, mother height, mother ethnicity, her religion, the number of
her children who have died and an indicator variables taking one if any of the mother’s children died before their first birthday) and father’s
age. Robust standard errors are clustered at the DHS-cluster level and reported in parentheses.
39 | P a g e
Table 9: Summary of robustness checks
Robustness
check
Table
Description
Coefficient on number of children
2+SAMPLE
3+SAMPLE
4+SAMPLE
1
Table A4
Heterogeneity of twin at 3
rd
birth
Interaction with Rural
0.019
(0.056)
0.098**
(0.047)
-
0.061
(0.064)
Interaction with Mother is Muslim
0.008
(0.054)
0.092**
(0.046)
-
0.047
(0.063)
Interaction with Rural West&Central
Africa
0.018
(0.054)
0.094**
(0.047)
-
0.064
(0.062)
2
Table A5
Partially missing instruments with
non-linear specification
Number of children>2
-
0.050
(0.264)
Number
of children>3
0.156
(0.122)
0.275**
(0.127)
Number of children>4
-
0.083
(0.140)
-
0.104
(0.115)
-
0.110
(0.120)
Number of children>5
0.021
(0.166)
0.023
(0.129)
0.043
(0.116)
3
Table A6
Expand sample incl. households
in which some school-aged
children reside outside the
household
0.029
(0.056)
0.091*
(0.048)
-0.114*
(0.070)
4
Table A7
Decentralized approach (mother
as decision unit)
0.001
(0.066)
0.059
(0.042)
0.003
(0.044)
5
Table A8
Alternative definition of the
number of children (as the total
number of births given by the
household head’s wives)
0.018
(0.051)
0.092*
(0.045)
-0.055
(0.059)
6
Table A9
Education Z-score calculated
with censored completed years of
education (to the child’s age
minus 6 at most)
0.050
(0.045)
0.113**
(0.041)
-0.054
(0.057)
7
Table A10
Completed grade (years of
schooling) as outcome variable
0.075
(0.095)
0.144*
(0.079)
-
0.031
(0.111)
8
Table A11
Region of residence of the
household instead of mother’s
ethnicity FE
0.030
(0.048)
0.063
(0.041)
-0.080
(0.054)
9
Table A12
Country
-
by
-
urban/rural instead
of mother’s ethnicity FE
0.040
(0.048)
0.097**
(0.044)
-
0.0
5
4
(0.061)
10
Table A13
Alternative definition to
discriminate between poor and
non-poor families
Poor
-
0.030
(0.071)
0.034
(0.073)
-
0.072
(0.088)
Non
-
Poor
0.042
(0.087)
0.138**
(0.059)
-
0.099
(0.093)
Note.— Detailed tables of robustness checks are provided in the Supplementary Appendix A4-A13.
40 | P a g e
Table 10: Exploring the proximate cause underlying the positive effect of family size on schooling
(I)
Enrolment
(II)
Starting age
(III)
Starting age when
difference in age
between 1st and 2nd
born ≤3years
(
I
V)
Drop out
PANEL A: 2+SAMPLE
Effect on 1
st
born
(All)
-
0.009
(0.023)
-
0.172
(0.149)
-
0.214
(0.245)
-
0.024
(0.026)
Effect on 1st born
(Poor)
0.000
(0.039)
0.134
(0.169)
0.192
(0.182)
-0.058
(0.057)
Effect on 1st born
(Non-poor)
-0.028
(0.023)
-0.499*
(0.260)
-0.965*
(0.528)
0.008
(0.039)
PANEL B : 3+SAMPLE
Effect on
1
st
born
(All)
0.001
(0.022)
-
0.196
(0.139)
-
0.048
(0.181)
-
0.014
(0.030)
Effect on 1st born
(Poor)
-0.019
(0.047)
-0.039
(0.404)
0.550
(0.534)
0.065
(0.092)
Effect on 1
st
born
(Non-poor)
0.030
(0.023)
-
0.128
(0.154)
-
0.179
(0.210)
-
0.017
(
(0.030)
Effect on 2nd born
(All)
0.037
(0.024)
-0.018
(0.180)
-0.222
(0.208)
0.063*
(0.035)
Effect on 2nd born
(Poor)
-0.010
(0.050)
0.524
(0.405)
0.691
(0.490)
0.062
(0.081)
Effect on 2
nd
born
(Non-poor)
0.023
(0.027)
-
0.285
(0.180)
-
0.623**
(0.275)
0.060
(0.040)
PANEL C : 4+SAMPLE
Effect on 2
nd
born
(All)
-
0.010
(0.050)
0.524
(0.405)
0.691
(0.490)
0.062
(0.081)
Effect on 2nd born
(Poor)
0.034
(0.063)
-0.462
(0.291)
-0.248
(0.326)
-0.008
(0.085)
Effect on 2nd born
(Non-poor)
-0.072*
(0.042)
0.342
(0.441)
0.469
(0.435)
-0.048
(0.047)
Effect on 3rd born
(All)
-0.004
(0.035)
-0.204
(0.234)
-0.109
(0.177)
-0.047
(0.040)
Effect on 3
rd
born
(Poor)
-0.027
(0.068)
0.240
(0.276)
-0.147
(0.213)
0.016
(0.087)
Effect on 3
rd
born
(Non-poor)
0.020
(0.046)
-0.557
(0.429)
-0.159
(0.380)
-0.076*
(0.040)
*** p<0.01, ** p<0.05, * p<0.1
Note.— The control variables are as specified in the note of Table 3. Robust standard errors are clustered at the DHS-cluster level and reported in
parentheses. In Cols (II) to (V) we consider only children enrolled in school at the time of the survey. In Cols (IV) and (V), the school starting age is
generated based on censored years of education (see robustness check 5). Following Angrist & Pischke (2009: 102-107), we use manual 2SLS as a
linear probability model in Cols (I) and (VI), although ‘enrolment’ and ‘drop out’ are binary variables. In Panel C, the age difference in Cols (III) and
(V) is between the 2nd and the 3rd born. Full results are reported in the Supplementary Appendix Tables A14-A20.
41 | P a g e
Appendix 1: Cross-country summary statistics based on Sample I
Country DHS round Exploitable data on child
labor
Benin
1996 No
2001 No
2006 No
2012 No
Burkina
1993 No
2003 No
2010 Yes
Cameroon
2004 No
2011 Yes
Central African Rep.
1995 No
Chad
2004 Yes
Congo
2012 Yes
Congo Dem.
2007 Yes
2014 Yes
Cote d'Ivoire
2012 Yes
Ethiopia
2000 No
2005 No
2011 Yes
Gabon
2000 No
2012 Yes
Gambia
2013 No
Ghana
1993 No
1998 No
2003 No
2008 No
2014 No
Guinea
1999 No
2005 No
2012 Yes
Kenya
1993 No
1998 No
2003 No
2014 No
Liberia
2013 No
Malawi
2000 Yes
2004 Yes
2010 No
Mali
1996 No
2006 Yes
2013 Yes
Mozambique
1997 No
42 | P a g e
2011 No
Namibia
1992 No
2000 No
Niger
1998 No
2006 Yes
Nigeria
2008 No
2013 No
Senegal
1993 No
2005 Yes
2011 Yes
2013 Yes
2014 No
Sierra Leone
2008 Yes
2013 Yes
Togo
1998 No
2014 Yes
Uganda
2011 No
Zimbabwe
1994 No
All Together 59
Source.— Authors, based on DHS data
... While several studies find a negative causal impact of fertility on education, thereby providing support for a Q-Q trade-off (e.g., Bougma et al., 2015in Burkina Faso, Li et al., 2008Rosenzweig & Zhang, 2009in China, Ponczek & Souza, 2012in Brazil, Li et al., 2017 in 17 Asian and Latin American countries), some others find no evidence of a trade-off (e.g., Alidou & Verpoorten, 2019 in 34 countries in Sub Saharan Africa). Interestingly, some studies even find a positive effect of fertility on educational outcomes (e.g., Alidou & Verpoorten, 2019 for the sample of families with three or more children in Sub-Saharan Africa; McCarthy & Pearlman, 2022 in a sub-district in rural Bangladesh; Qian, 2009 for the sample of first-born children in China; Zhong, 2017 for the sample of rural areas in China). ...
... While several studies find a negative causal impact of fertility on education, thereby providing support for a Q-Q trade-off (e.g., Bougma et al., 2015in Burkina Faso, Li et al., 2008Rosenzweig & Zhang, 2009in China, Ponczek & Souza, 2012in Brazil, Li et al., 2017 in 17 Asian and Latin American countries), some others find no evidence of a trade-off (e.g., Alidou & Verpoorten, 2019 in 34 countries in Sub Saharan Africa). Interestingly, some studies even find a positive effect of fertility on educational outcomes (e.g., Alidou & Verpoorten, 2019 for the sample of families with three or more children in Sub-Saharan Africa; McCarthy & Pearlman, 2022 in a sub-district in rural Bangladesh; Qian, 2009 for the sample of first-born children in China; Zhong, 2017 for the sample of rural areas in China). Such positive effects have been explained by potential economies of scale in raising children (Qian, 2009) or spillovers within the family (McCarthy & Pearlman, 2022). ...
... As a robustness check, we also perform the B-O decomposition using age and state (location) specific -standardized z-score of total years of schooling as the dependent variable (following Alidou & Verpoorten, 2019;Bhalotra & Clarke, 2016), as this is expected to reduce the possibility of our findings being driven by time-location fixed-effects. The estimates which are presented in appendix Table A14 show that our findings remain unchanged. ...
Article
Full-text available
The theory of Quantity–Quality (Q–Q) trade-off suggests that given the resource constraints in a household, an increase in family size would result in lower investments in the human capital development of children. Following this theory, we investigate the role of fertility in explaining the educational gap between Muslims and Hindus in India. A historically large difference in the total fertility rates (TFR) between them, which is as high as 24% in 2015–2016, may have contributed to the existing gap in education. Using decomposition techniques, we find that family size accounts for about 10% of the gap in years of schooling between high-caste Hindus and Muslims. Examining the likelihood of completion of different levels of education, we find that the contribution of family size increases with the level of education, rising to 16% for secondary education. Upon further investigation, we find that the unmet need for family planning is higher for Muslim women than for Hindu women. Thus, appropriate supply side measures addressing these unmet needs may help to reduce the fertility gap, with the potential to reduce the education gap in due course. Additionally, with a comparatively higher desired fertility of Muslims on average, public investments in good-quality schooling, safer and cheaper transportation to schools, and general awareness initiatives about pecuniary and non-pecuniary benefits of schooling may reduce the Q–Q trade-off. This may weaken the association between fertility and education, which could go a long way in reducing the educational disadvantage of Muslim children.
... Moreover, we contribute to the current empirical evidence focusing on SSA, which is scarce and only analyzes the QQ trade-off from a fertility outcomes perspective. Some recent studies in SSA find evidence for the QQ trade-off [Temel (2013), Bougma et al. (2015), Vogl (2016), Ito and Tanaka (2017)] whereas others point to a lack of evidence [Eloundou-Enyegue and Giroux (2012), Kravdal et al. (2013), Alidou and Verpoorten (2019)]. ...
... Some recent studies in SSA find that men report higher fertility, son, and human capital preference than women [Sahn et al. (2010), Norling (2018, Ibrahim and Arulogun (2019)]. To our knowledge, there is only one study on child-gendered preferences in the QQ trade-off in SSA in particular [Alidou and Verpoorten (2019)]. This study estimates a causal relation, considers only gender preferences in education, and uses historical data. ...
... Ethiopia is the 12th most populous country in the world and the 2nd in SSA, coupled with low school completion rates (completion rate of 54% in primary, 29% in secondary, and 8% in tertiary education) and a large gender gap in school enrolment rates (expected years of schooling of 8.3 years for girls and 9.1 years for boys) [The World Bank (2022)]. The majority of households face budget constraints, child labor is still common and the provision of high-quality public education is low, implying that the opportunity cost of education is high [Alidou and Verpoorten (2019)]. With a current total fertility rate 2 of 4.1 births per woman, Ethiopia is believed to have reached the end of the second stage of the demographic transition and to enter the third or late transitional stage. ...
Article
Full-text available
Fertility decline in human history is a complex enigma. Different triggers have been proposed, among others the increased demand for human capital resulting in parents making a quantity–quality (QQ) trade-off. This is the first study that examines the existence of a QQ trade-off and the possible gender bias by analyzing fertility intentions rather than fertility outcomes. We rely on the unified growth theory to understand the QQ trade-off conceptually and a discrete choice experiment conducted among 426 respondents in Ethiopia to analyze fertility intentions empirically. We confirm the existence of a QQ trade-off only when the number of children is less than six and find that intentions are gendered in two ways: (i) boys are preferred over girls, and (ii) men are willing to trade-off more education in return for more children. Results imply that a focus on both stimulating intentions for education, especially girls' education, and on family size intentions is important to accelerate the demographic transition.
... Within African countries, cross-family variation displays positive, negative, and null relationships between sibship size and educational attainment, in large part reflecting contextual variation in differential fertility (Buchmann and Hannum 2001;Eloundou-Enyegue and Williams 2006;Vogl 2016). Furthermore, twin births predict larger family sizes but are unrelated to children's schooling (Alidou and Verpoorten 2019), suggesting the absence of a family size effect. However, the family-level effect of an unexpected child may not be relevant to the population-level forces driving fertility change (Galor 2012), and the null result may reflect confounding (Bhalotra and Clarke 2019). ...
... Olayemi (2012) concludes that large family size has a negative impact on house food security. Alidou and Verpoorten (2019) also pooling all countries in Sub Saharan African and testing the quantity and quality trade-off on schooling saw that their finding did not second the inverse effect of family size on schooling with estimates between the ranges of 0.023 and 0.057 z score. But then studies on family size and child health has been quite dormant. ...
... Briole et al., 2020;Alidou and Verpoorten, 2019;Fernihough, 2017;Mogstad and Wiswall, 2016;Bhalotra and Clarke, 2019). For mental health, studies to date showed either no effect of the number of siblings on children's mental health(Baranowska-Rataj et al., 2016) or a positive correlation(Grinde and Tambs, 2016) between the family size and the children's mental health. ...
Thesis
Full-text available
Inequalities in health are a prevalent feature of societies. And as societies, we condemn inequalities that are rooted in immutable circumstances such as gender, race, and parental background. Consequently, policy makers are interested in measuring and understanding the causes of health inequalities rooted in circumstances. However, identifying causal estimates of these relationships is very ambitious for reasons such as the presence of confounders or measurement error in the data. This thesis contributes to this ambitious endeavour by addressing these challenges in four chapters. In the first Chapter, I use 25 years of rich health information to describe three features of intergenerational health mobility in Germany. First, we describe the joint permanent health distribution of the parents and their children. A ten percentile increase in parental permanent health is associated with a 2.3 percentile increase in their child’s health. Second, a percentile point increase in permanent health ranks is associated with a 0.8% to 1.4% increase in permanent income for, both, children, and parents, respectively. Non-linearities in the association between permanent health and income create incentives to escape the bottom of the permanent health distribution. Third, upward mobility in permanent health varies with parental socio-economic status. In the second Chapter, we estimate the effect of maternal schooling on children’s mental health in adulthood. Using the Socio-Economic Panel and the mental health measure based on the SF-12 questionnaire, we exploit a compulsory schooling law reform to identify the causal effect of maternal schooling on children’s mental health. While the theoretical considerations are not clear, we do not find that the mother’s schooling has an effect on the mental health of the children. However, we find a positive effect on children’s physical health operating mainly through physical functioning. In addition, albeit with the absence of a reduced-form effect on mental health, we find evidence that the number of friends moderates the relationship between maternal schooling and their children’s mental health. In the third Chapter, against a background of increasing violence against non-natives, we estimate the effect of hate crime on refugees’ mental health in Germany. For this purpose, we combine two datasets: administrative records on xenophobic crime against refugee shelters by the Federal Criminal Office and the IAB-BAMF-SOEP Survey of Refugees. We apply a regression discontinuity design in time to estimate the effect of interest. Our results indicate that hate crime has a substantial negative effect on several mental health indicators, including the Mental Component Summary score and the Patient Health Questionnaire-4 score. The effects are stronger for refugees with closer geographic proximity to the focal hate crime and refugees with low country-specific human capital. While the estimated effect is only transitory, we argue that negative mental health shocks during the critical period after arrival have important long-term consequences. In the last Chapter of this thesis, we investigate how the economic consequences of the pandemic and the government-mandated measures to contain its spread affect the self-employed – particularly women– in Germany. For our analysis, we use representative, real-time survey data in which respondents were asked about their situation during the COVID-19 pandemic. Our findings indicate that among the self-employed, who generally face a higher likelihood of income losses due to COVID-19 than employees, women are 35% more likely to experience income losses than their male counterparts. We do not find a comparable gender gap among employees. Our results further suggest that the gender gap among the self-employed is largely explained by the fact that women disproportionately work in industries that are more severely affected by the COVID-19 pandemic. Our analysis of potential mechanisms reveals that women are significantly more likely to be impacted by government-imposed restrictions, e.g., the regulation of opening hours. We conclude that future policy measures intending to mitigate the consequences of such shocks should account for this considerable variation in economic hardship.
Article
Full-text available
To design family planning policies in Sub-Saharan Africa that are based on reproductive empowerment instead of reproductive control, knowledge on fertility preferences and trade-offs between the number of children and child-raising quality is key. This study is the first to empirically investigate fertility decisions and the quantity–quality trade-off theory from a preferences perspective. For this purpose, we use a novel discrete choice experimental method, including three quality and one quantity aspect of child-raising, and three socio-economic drivers of fertility. The choice experiment was executed with 1000 men and women in rural Senegal and rural Uganda. We show that rural households prefer to have many children, but women and non-poor respondents demonstrate a lower preference for many children than men and poor respondents. We find that the quantity–quality trade-off is a two-sided story. On the one hand, for most of the quality attributes, we confirm the existence of a trade-off. On the other hand, quantity and quality are complementary when all children in the household can attain a lower secondary school diploma. Our results imply that improving living conditions and access to infrastructure in rural areas may empower people to make decisions leading to a lower fertility. Aligning family planning programs with reproductive justice requires a shift in focus from reproductive control, based on mere expansion of contraceptive use, to reproductive empowerment, based on preference-based family planning programs. Improving rural livelihoods in general, and specific targeting of family planning programs to people with low-fertility preferences could be a first step.
Preprint
Full-text available
To attenuate the adverse effects of high population growth in low-income countries and to achieve the Sustainable Development Goals, knowledge on rural fertility preferences and the existence of a quantity-quality trade-off between the number of children and child-raising quality is key. To tackle this, we implement a choice experiment in Senegal and Uganda. We include three quality and one quantity aspect of child-raising, and three socio-economic drivers of fertility, resulting in a comprehensive assessment. We show that rural households prefer to have many children, but women and non-poor respondents demonstrate a lower preference for many children than men and poor respondents. We find that the quantity-quality trade-off is a two-sided story. On the one hand, for most of the quality attributes, we confirm the existence of a trade-off. On the other hand, quantity and quality are complementary when all children in the household can attain a lower secondary school diploma. Our results imply that broadening the currently narrow focus on contraceptive uptake in family planning programs, and more specific targeting of such programs to people with low fertility preferences, could improve their effectiveness. JEL codes J10, J13, J19
Article
The question of whether large families have a subsequent negative impact on child health, education, and welfare is of pressing concern for development and public health policy. We tackle this question by empirically exploring whether parents face a trade-off between increasing the size of their family and investing in their children. Using data from a rural sub-district in Bangladesh (the 1996 Matlab Health and Socioeconomic Survey), we estimate the effect of siblings on school attendance, literacy, and numeracy. We use an instrumental variables approach, instrumenting first for children’s sibship size their mothers’ menarche age, and also instrumenting for children’s sibship size with the household’s treatment status (in the Matlab family planning experiment). Although we find no effect of siblings on school attendance, we find that additional siblings increase the likelihood that children are literate and numerate. These effects are greater for girls and younger siblings, consistent with positive numeracy and literacy spillovers from older siblings to younger ones. The results provide counter-evidence to the quality-quantity trade-off theory and demonstrate that sibling education spillovers may dominate any reduced investment on the part of parents.
Preprint
The theory of Quantity-Quality (Q-Q) trade-off suggests that given the resource constraints in a household, an increase in family size would result in lower investments in human capital development of children. Following this theory, we investigate the role of fertility in explaining the educational gap between Muslims and Hindus in India. A historically large difference in the total fertility rates (TFR) between them, which is as high as 24% in 2015-16, may have contributed to the existing gap in education. Using decomposition techniques, we find that family size accounts for about 11% of the gap in years of schooling between high caste Hindus and Muslims. While examining the likelihood of completion of different levels of education, the contribution of family size increases with the level of education, rising to 15% for secondary education. Our estimates are robust to varied fixed effects and time-trends, which indicate that we do not have the supporting evidence to qualify the access to and quality of schools as potential reasons for the gap. While analysing the policy implications, our data reveals that the unmet needs for family planning of the Muslim women have been higher than Hindu women for both spacing (6.96% versus 5.26-5.46%) and limiting (9.58% versus 7.20-7.52%). Additionally, Hindu women have (72-73%) better access to modern methods of contraception than to Muslim women (61.3%). Hence, appropriate supply side measures addressing these needs should go a long way in reducing the fertility gap, with potential to reduce the education gap in due course.
Preprint
Full-text available
Studies investigating how the number of children in a family affects the parents or the children face problems because the variable of interest is endogenous in the model. The currently accepted solution to this problem is to use instrumental variables (IVs), for example, based on twin births. In this paper, I review and add to the critique of IVs based on twin births and show that that there are so many issues—major and minor—with these IVs that results based on them are not reliable or interpretable. I also review other IVs used in the literature, for example IVs based on the sexes of the firstborn children, and conclude that there are, as of yet, no credible IVs for the number of children. We need to disregard results from studies applying these IVs, reevaluate the current state of knowledge, and develop new, more credible methods.
Article
Full-text available
We examine the relationship between child quantity and quality. Motivated by the theoretical ambiguity regarding the sign of the marginal effects of additional siblings on children's outcomes, our empirical model allows for an unrestricted relationship between family size and child outcomes. We find that the conclusion in Black, Devereux, and Salvanes (2005) of no family size effect does not hold after relaxing their linear specification in family size. We find nonzero effects of family size in ordinary least squares estimation with controls for confounding characteristics like birth order and in instrumental variables estimation instrumenting family size with twin births. Estimation using a unrestricted specification for the quality-quantity relationship yields substantial family size effects. This finding suggests that social policies that provide incentives for fertility should account for spillover effects on existing children.
Article
We document birth order differences in cognitive and noncognitive outcomes and maternal behavior from birth to adolescence using the National Longitudinal Survey of Youth 1979 (NLSY79). As early as age one, later-born children score lower on cognitive tests than their siblings, and the gap increases until school entry and remains statistically significant thereafter. Variations in parental behavior, such as cognitive stimulation by mothers, can explain a large portion of the birth order differences in cognitive abilities before school entry. Our findings suggest that broad shifts in parental behavior are plausible explanations for the observed birth order differences in education and labor market outcomes. © 2018 by the Board of Regents of the University of Wisconsin System.
Article
A common approach to evaluating robustness to omitted variable bias is to observe coefficient movements after inclusion of controls. This is informative only if selection on observables is informative about selection on unobservables. Although this link is known in theory (i.e. Altonji, Elder and Taber (2005 —, —, and —, “Selection on Observed and Unobserved Variables: Assessing the Effectiveness of Catholic Schools,” Journal of Political Economy, 2005, 113 (1), 151–184.)), very few empirical papers approach this formally. I develop an extension of the theory which connects bias explicitly to coefficient stability. I show that it is necessary to take into account coefficient and R-squared movements. I develop a formal bounding argument. I show two validation exercises and discuss application to the economics literature.
Article
In the absence of well-developed markets for credit and insurance, extended families play a major role as a traditional system of mutual help. However these arrangements have important consequences on economic choices. In this paper, we use first hand data from Western Cameroon to explore this question. We find that the large majority of transfers follow a given pattern whereby elder siblings support their younger siblings in the early stages of their lives who in turn reciprocate by supporting their elder siblings when they have children. We interpret this pattern as a generalised system of reciprocal credit within the extended family. We propose a simple overlapping generation model to investigate its welfare properties. We then explore the implications of this pattern on labour market outcomes and find evidence of large disincentive effects. This pattern of transfers also implies that younger siblings are more educated but have fewer and less educated children.
Article
One of the most consistent patterns in the social sciences is the relationship between sibship size and educational outcomes: those with fewer siblings outperform those with many. The resource dilution (RD) model emphasizes the increasing division of parental resources within the nuclear family as the number of children grows, yet it fails to account for instances when the relationship between sibship size and education is often weak or even positive. To reconcile, we introduce a conditional resource dilution (CRD) model to acknowledge that nonparental investments might aid in children's development and condition the effect of siblings. We revisit the General Social Surveys (1972-2010) and find support for a CRD approach: the relationship between sibship size and educational attainment has declined during the first half of the twentieth century, and this relationship varies across religious groups. Findings suggest that state and community resources can offset the impact of resource dilution-a more sociological interpretation of sibship size patterns than that of the traditional RD model.