ArticlePDF Available

The genome-wide structure of the Jewish people

Authors:
  • Institute of Genomics, University of Tartu

Abstract and Figures

Contemporary Jews comprise an aggregate of ethno-religious communities whose worldwide members identify with each other through various shared religious, historical and cultural traditions. Historical evidence suggests common origins in the Middle East, followed by migrations leading to the establishment of communities of Jews in Europe, Africa and Asia, in what is termed the Jewish Diaspora. This complex demographic history imposes special challenges in attempting to address the genetic structure of the Jewish people. Although many genetic studies have shed light on Jewish origins and on diseases prevalent among Jewish communities, including studies focusing on uniparentally and biparentally inherited markers, genome-wide patterns of variation across the vast geographic span of Jewish Diaspora communities and their respective neighbours have yet to be addressed. Here we use high-density bead arrays to genotype individuals from 14 Jewish Diaspora communities and compare these patterns of genome-wide diversity with those from 69 Old World non-Jewish populations, of which 25 have not previously been reported. These samples were carefully chosen to provide comprehensive comparisons between Jewish and non-Jewish populations in the Diaspora, as well as with non-Jewish populations from the Middle East and north Africa. Principal component and structure-like analyses identify previously unrecognized genetic substructure within the Middle East. Most Jewish samples form a remarkably tight subcluster that overlies Druze and Cypriot samples but not samples from other Levantine populations or paired Diaspora host populations. In contrast, Ethiopian Jews (Beta Israel) and Indian Jews (Bene Israel and Cochini) cluster with neighbouring autochthonous populations in Ethiopia and western India, respectively, despite a clear paternal link between the Bene Israel and the Levant. These results cast light on the variegated genetic architecture of the Middle East, and trace the origins of most Jewish Diaspora communities to the Levant.
Content may be subject to copyright.
LETTERS
The genome-wide structure of the Jewish people
Doron M. Behar
1,2
*, Bayazit Yunusbayev
2,3
*, Mait Metspalu
2
*, Ene Metspalu
2
, Saharon Rosset
4
,Ju
¨ri Parik
2
,
Siiri Rootsi
2
, Gyaneshwer Chaubey
2
, Ildus Kutuev
2,3
, Guennady Yudkovsky
1,5
, Elza K. Khusnutdinova
3
,
Oleg Balanovsky
6
, Ornella Semino
7
, Luisa Pereira
8,9
, David Comas
10
, David Gurwitz
11
, Batsheva Bonne-Tamir
11
,
Tudor Parfitt
12
, Michael F. Hammer
13
, Karl Skorecki
1,5
& Richard Villems
2
Contemporary Jews comprise an aggregate of ethno-religious
communities whose worldwide members identify with each other
through various shared religious, historical and cultural tradi-
tions
1,2
. Historical evidence suggests common origins in the Middle
East, followed by migrations leading to the establishment of com-
munities of Jews in Europe, Africa and Asia, in what is termed the
Jewish Diaspora
3–5
. This complex demographic history imposes
special challenges in attempting to address the genetic structure
of the Jewish people
6
. Although many genetic studies have shed
light on Jewish origins and on diseases prevalent among Jewish
communities, including studies focusing on uniparentally and
biparentally inherited markers
7–16
, genome-wide patterns of
variation across the vast geographic span of Jewish Diaspora com-
munities and their respective neighbours have yet to be addressed.
Here we use high-density bead arrays to genotype individuals from
14 Jewish Diaspora communities and compare these patterns of
genome-wide diversity with those from 69 Old World non-Jewish
populations, of which 25 have not previously been reported.
These samples were carefully chosen to provide comprehensive
comparisons between Jewish and non-Jewish populations in the
Diaspora, as well as with non-Jewish populations from the Middle
East and north Africa. Principal component and structure-like
analyses identify previously unrecognized genetic substructure
within the Middle East. Most Jewish samples form a remarkably
tight subcluster that overlies Druze and Cypriot samples but not
samples from other Levantine populations or paired Diaspora host
populations. In contrast, Ethiopian Jews (Beta Israel) and Indian
Jews (Bene Israel and Cochini) cluster with neighbouring auto-
chthonous populations in Ethiopia and western India, respec-
tively, despite a clear paternal link between the Bene Israel and
the Levant. These results cast light on the variegated genetic archi-
tecture of the Middle East, and trace the origins of most Jewish
Diaspora communities to the Levant.
Recently, the capacity to obtain whole-genome genotypes with the
use of array technology has provided a robust tool forelucidating fine-
scale population structure and aspects of demographic history
17–23
.
This approach, initially used to account for population stratification
in genome-wide association studies, identified genome-wide patterns
of variation that distinguished between Ashkenazi Jews and non-
Jews of European descent
7,11,12,14–16
. Similarly, a large-scale survey of
autosomal microsatellites found that samples from four Jewish
communities clustered close to each other and intermediate between
non-Jewish Middle Eastern and European populations
10
.
Illumina 610K and 660K bead arrays were used to genotype 121
samples from 14 Jewish communities. The results were compared
with 1,166 individuals from 69 non-Jewish populations (Supplemen-
tary Note 1 and Supplementary Table 1), with particular attention to
neighbouring or ‘host’ populations in corresponding geographic
regions. These results were also integrated with analyses of genotype
data from about 8,000 Y chromosomes and 14,000 mitochondrial
DNA (mtDNA) samples (Supplementary Note 6 and Supplemen-
tary Tables 4 and 5). Several questions were then addressed: What
are the locations of the various Jewish communities in a global genetic
variation context? What are the features of the Middle Eastern (Sup-
plementary Fig. 1) population genetic substructure? What are the
genetic distances between contemporary Jewish communities, their
Diaspora neighbours and Middle Easternpopulations? Can the genetic
origin of Jews be pinpointed within the Middle East?
The EIGENSOFT package
24
was used to identify the principal
components (PCs) of autosomal variation in our Old World sample
set (Fig. 1 and Supplementary Fig. 2a). This analysis places the
studied samples along two well-established geographic axes of global
genetic variation
18,19,22
: PC1 (sub-Saharan Africa versus the rest of the
Old World) and PC2 (east versus west Eurasia). Focusing on the
Middle Eastern populations in the PC1–PC2 plot (Fig. 1b) reveals
more geographically refined groupings. Populations of the Caucasus,
flanked by Cypriots, form an almost uninterrupted rim that separates
the bulk of Europeans from Middle Eastern populations. Bedouins,
Jordanians, Palestinians and Saudi Arabians are located in close
proximity to each other, which is consistent with a common origin
in the Arabian Peninsula
25
, whereas the Egyptian, Moroccan,
Mozabite Berber, and Yemenite samples are located closer to sub-
Saharan populations (Fig. 1a and Supplementary Fig. 2a).
Most Jewish samples, other than those from Ethiopia and India,
overlie non-Jewish samplesfrom the Levant (Fig. 1b). The tight cluster
comprising the Ashkenazi, Caucasus (Azerbaijani and Georgian),
Middle Eastern (Iranian and Iraqi), north African (Moroccan) and
Sephardi (Bulgarian and Turkish) Jewish communities, as well as
Samaritans, strongly overlaps Israeli Druze and is centrally located
on the principal component analysis (PCA) plot when compared with
Middle Eastern, European Mediterranean, Anatolian and Caucasus
non-Jewish populations (Fig. 1). This Jewish cluster consists of
*These authors contributed equally to this work.
1
Molecular Medicine Laboratory, Rambam Health Care Campus, Haifa 31096, Israel.
2
Estonian Biocentre and Department of Evolutionary Biology, University of Tartu, Tartu 51010,
Estonia.
3
Institute of Biochemistry and Genetics, Ufa Research Center, Russian Academy of Sciences, Ufa 450054, Russia.
4
Department of Statistics and Operations Research, School
of Mathematical Sciences, Tel Aviv University, Tel Aviv 69978, Israel.
5
Rappaport Faculty of Medicine and Research Institute, Technion
Israel Institute of Technology, Haifa 31096,
Israel.
6
Research Centre for Medical Genetics, Russian Academy of Medical Sciences, Moscow 115478, Russia.
7
Dipartimento di Genetica e Microbiologia, Universita
`di Pavia, Pavia
27100, Italy.
8
Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), Porto 4200-465, Portugal.
9
Faculdade de Medicina, Universidade do Porto, Porto
4200-319, Portugal.
10
Institute of Evolutionary Biology (CSIC-UPF), CEXS-UPF-PRBB and CIBER de Epidemiologı
´
a y Salud Pu
´blica, Barcelona 08003, Spain.
11
Department of Human
Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
12
Department of the Languages and Cultures of the Near and Middle
East, Faculty of Languages and Cultures, School of Oriental and African Studies (SOAS) , University of London, London WC1H 0XG, UK.
13
ARL Division of Biotechnology, University of
Arizona, Tucson, Arizona 85721, USA.
doi:10.1038/nature09103
1
Macmillan Publishers Limited. All rights reserved
©2010
samples from most Jewish communities studied here, which together
cover more than 90% of the current world Jewish population
5
; this is
consistent with an ancestral Levantine contribution to much of con-
temporary Jewry. A compact cluster of Yemenite Jews, which is also
located within an assemblage of Levantine samples, overlaps primarily
with Bedouins but also with Saudi individuals (Fig. 1b). In contrast,
Ethiopian and Indian Jews are located close to those from neighbour-
ing host populations (Fig. 1c,d). Ethiopian Jews clustered with
Semitic-speaking rather than Cushitic-speaking Ethiopians. See Sup-
plementary Note 2 for a discussion of the assignment of samples repre-
senting the Belmonte and Uzbek (Bukharan) Jewish communities.
To glean further details of Levantine genetic structure, we repeated
PCA on a restricted set of samples from west Eurasia (Fig. 2, Sup-
plementary Fig. 3 and Supplementary Note 2) and by inspect-
ing lower-ranked PCs in the Old World context (Supplementary
Fig. 2b, c; PC1 versus PC3 and PC4). These analyses reveal three
−0.02 00.02
0.04 0.06
−0.08 −0.06 −0.04 −0.02 0.00 0.02
Eigenvector 2
Eigenvector 1
00
0.0
2
ector 2
Ru
Fr
Fr
FB
FB Orc
Orc
Fr
Fr
FB
FB
Orc
Orc
Fr Fr
Fr
FB
FB
Orc
Orc
Fr
Fr
Fr
Fr
Fr
Fr
Fr
Fr
Fr
FB
Tus
Orc
Fr Fr
Fr
Fr
FB
Tus
Orc
Fr
Fr
FB Fr
Fr
FB
FB
FB
Tus
Orc
Tus
Orc
FB
FB
Tus
FB
FB
Tus
Orc
Ru
FB
Tus
Orc
Ru
Ru
R
Fr
Fr
FB
FB
FB
Orc
Orc
Ru
Ru
Fr
Fr
FB Orc
FB
Chu
Bel
Bel
Bel
BelBel
Bel
Bel
Bel
Bel
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
Hng
HngHng
Hng
Hng
Hng
Lit
Lit
Lit
Lit
Lit
Lit
Lit
Lit Lit
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Spa
Rmn Rmn
Rmn
Rmn
Rmn
Rmn
Rmn
Rmn
Rmn
RmnRmn
Rmn Rmn
Rmn
Europeans
Levantine
non-Jewish populations
non-Jewish populations
non-Jewish populations
Bedouins
Palestinians
Saudis
Jordanians
a
Sp
Sp
Spa
Spa
pa
S
Spa
p
Sp
Sp
pa
Sp
Sp
S
S
pa
Sp
SpSp
S
S
Druze
Cypriots
Samaritans
T
T
Hng
Hng
ng
ng
Rmn
ng
ng
Rm
HH
Hn
Hn
ngngngng
n
n
n
n
Armenians
Georgians
Turks
Iranians
Lezgins
Adygei
u
R
R
L
ez
g
in
s
Ady
g
ei
Ady
Ady
Ady
Ady
Ady
Ady
Ady
Ady Ady
Ady Ady
Ady
Ady
Ady
Ady
Ady
Pal
Pal Pal
Pal
Pal
Pal Pal
Pal
PalPal Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
Pal
alP
Pal Pal
GoGo
Go
Go
Go Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Go
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Arm
Tur
Tur
Tur
Tur
Tur
Tur
Tur
Tur
Tur
Tur
TurTu r
Tur
Tur
Tur
Tur
Tur
TurTur
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Irn
Yem
Yem
Lzg
Lzg
Lzg
Lzg
LzgLzg
Lzg Lzg
Lzg Lzg
Lzg
Lzg
Lzg
Lzg
Lzg
Lzg
Lzg
Lzg
Drz Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz
Drz Drz
Drz
Drz
Drz
Drz
Drz
LebLeb
Leb
Leb
Leb
Leb
Cyr
Cyr
Cyr
CyrCyr
Cyr
Cyr
Cyr
Cyr
Cyr Cyr
Cyr
Jor
Jor
Jor
Jor
Jor
Jor
Jor
Jor
Jor
Jor
Jor Jor
Jor
Jor
Jor
Jor
Jor
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Syr
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sdi
Sm
Sm
Sm
InJ
InJInJ
InJ
IqJ IqJ
IqJIqJ
IqJIqJ
IqJ
IqJ IqJ
IqJ
IqJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ
SJ SJ SbJ AJ
AJ
AJ
AJ
AJ
AJAJ
AJ
AJ
AJ
AJ
AJ AJ
AJ
AJ AJ
AJ AJ
AJ
AJ
AJ UJ
UJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
YJ
MJ MJ
MJ
MJ
MJ
MJ
MJ
MJ MJ
MJ
MJ
MJ
MJ
MJ
MJ
MJ
AzJ
AzJ
AzJ
AzJ
AzJ
AzJ
AzJ
AzJ
GJ
GJ
GJ
GJ
InJ, Iranian Jews
IqJ, Iraqi Jews
SJ, Sephardi Jews
AJ, Ashkenazi Jews
UJ, Uzbekistani Jews
MJ, Moroccan Jews
AzJ, Azerbaijani Jews
GJ, Georgian Jews
YJ, Yemenite Jews
Leb, Lebanese
Syr, Syrians
Yem, Yemenites
0
E
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
EJ
Jor
Mor
Mor
EtO
EtO
EtO
EtO
EtO
EtO
EtT
EtT
EtO
EtA
EtA
EtA
EtA
EtTEtT
EtT
EtA
EtA
EJ, Ethiopian Jews
Jor, Jordanians
Mor, Moroccans
EtT, Tigray Ethiopians
EtO, Oromo Ethiopians
EtA, Amhara Ethiopians
0
0
2
Blo
Sin
Sin
Ptn
Bur
Bur
Sin
Ptn
Ptn
Bur
Bur
Sin
Sin
Ptn
Bur Bur
Blo
SinSin
Ptn Bur
Bur
Sin
Ptn
Bur
Bur
Sin
Sin
Ptn
Bur
Bur
Sin Sin
Ptn Ptn
Bur
Sin
Sin
Ptn
Bur Bur
Sin
Sin
Bur
Bur
Ptn
Bur
Bur
Sin
Sin
Ptn
Ptn
Bur
Bur
Bur
Bur
Bur
Bur
Ind Ind Ind
Ind
Ind
Ind
Ind
IcJ
IcJ
IcJ
IcJIbJ
IbJ IbJ
IbJ
Ptn, Pathan
Blo, Balochi
Bur, Burusho
Sin, Sindhi
Ind, India (southern)
IcJ, Cochini Jews
IbJ, Mumbai Jews
Sub-Saharan Africa
North and east Africa
Middle East
Europe
Jewish communities
South Asia
East Asia
a
b
c
d
SbJ, Belmonte Jew
Figure 1
|
PCA of high-density array data. a, Scatter plot of Old World
individuals, showing the first two principal components. Each ring
corresponds to one individual and the colour indicates the region of origin
(for the full figure see Supplementary Fig. 2). bd, A series of magnifications
showing samples from Europe and the Middle East (b), Ethiopia (c) and
south Asia (d). Each letter code (Supplementary Table 1) corresponds to one
individual, and the colour indicates the geographic region of origin. In b,a
polygon surrounding all of the individual samples belonging to a group
designation highlights several population groups.
LETTERS NATURE
2
Macmillan Publishers Limited. All rights reserved
©2010
distinct Near Eastern Jewish subclusters: the first group is located
between Middle Eastern and European populations and consists of
Ashkenazi, Moroccan and Sephardi Jews. The second group, com-
prising the Middle Eastern and Caucasus Jewish communities, is
positioned within the large conglomerate of non-Jewish populations
of the region. The third group contains only a tight cluster of
Yemenite Jews.
After elucidation of these groupings by PCA, we turned to
structure-like analysis
26
with the algorithm ADMIXTURE
27
to assign
individuals proportionally to hypothetical ancestral populations
(Supplementary Note 3). Initially, all Jewish samples were analysed
jointly with 25 novel reference populations (Supplementary Note 1)
in combination with the Human Genome Diversity Panel
18
samples
representing Africa, the Middle East, Europe, and central, south and
east Asia (Fig. 3 and Supplementary Fig. 4). This analysis significantly
refines and reinforces the previously proposed partitioning of Old
World population samples into continental groupings
18,19
(Sup-
plementary Fig. 4 and Supplementary Note 4). We note that mem-
bership of a sample in a component that is predominant in, but not
restricted to, a specific geographic region is not sufficient to infer its
genetic origins. Membership in several genetic components can imply
either a shared genetic ancestry or a recent admixture of sampled
individuals
18,28
. An illustrative example at K58 (Fig. 3 and Sup-
plementary Note 3) is the pattern of membership of Ashkenazi,
Caucasus (Azerbaijani and Georgian), Middle Eastern (Iranian and
Iraqi), north African (Moroccan), Sephardi (Bulgarian and Turkish)
and Yemenite Jewish communities in the light-green and light-
blue genetic components, which is similar to that observed for
Middle Eastern non-Jewish populations, suggesting a shared regional
origin of these Jewish communities. This inference is consistent with
historical records describing the dispersion of the people of ancient
Israel throughout the Old World
1–4
. Our conclusion favouring
common ancestry over recent admixture is further supported by the
fact that our sample contains individuals that are known not to be
admixed in the most recent one or two generations. It is also evident
that among the Ashkenazi, Moroccan and Sephardi Jewish com-
munities the dark-blue component dominating European populations
is more substantial than the corresponding proportion of this com-
ponent amongthe Middle Eastern Jewish communities (Fig. 3).For the
Indian and Ethiopian Jewish communities the dark-green and light-
brown genetic components are consistent with corresponding mem-
bership of theirrespective host populations (Fig. 3).ADMIXTURE was
also run on the west Eurasian subset of the Old World sample, which
highlights differentiation between the Middle East and Europe (Sup-
plementary Fig. 4b). Here, comparison between the ADMIXTURE-
derived component patterns for Sephardi and Ashkenazi Jews shows
that the former have only slightly greater similarity to the pattern
observed for Middle Eastern populations than do the latter.
Genetic relationships between our population samples were then
explored with the measure of allele sharing distances (ASDs)
29
.Table1
provides genetic distances between each Jewish community and its
corresponding host population, all Jewish communities, west
Eurasian Jewish communities, their respective Jewish group inferred
from the PCA, and non-Jewish Levantine populations. The Ashkenazi,
Sephardi, Moroccan, Iranian, Iraqi, Azerbaijani and Uzbekistani
Jewish communities have the lowest ASD values when compared with
their PCA-based inferred Jewish sub-cluster (Fig. 3 and Supplemen-
tary Figs 2c and 3). In all except the Sephardi Jewish community, this
ASD difference is statistically significant (P,0.01, bootstrap t-test).
ASD values between Ashkenazi, Sephardi and Caucasus Jewish popu-
lations and their respective hosts are lower than those between each
Jewish population and non-Jewish populations from the Levant. This
might be the result of a bias inherent in our calculations as a result of
the genetically more diverse non-Jewish populations of the Levant.
The Ethiopian and Indian Jewish communities show the lowest ASD
values when compared with their host population (Supplementary
Tables 2 and 3 and Supplementary Note 5).
Although uniparental markers
8,9
(Supplementary Note 6) are limited
in their capacity to uncover genetic substructure within the Middle
East, they do provide important insights into sex-specific processes that
are not unambiguously evident from the autosomal data alone. For
example, Y-chromosome data point to a unique paternal genetic link
between the Bene Israel community and the Levant, whereas the
absence of sub-Saharan African maternal lineages in Yemenite and
Moroccan Jews (in contrast to their hosts) suggests limited maternal
gene flow.
–0.10 –0.05 0 0.05
–0.05
0
0.05
Ei
g
envector 2, ei
g
envalue = 2.8
Eigenvector 1, eigenvalue = 6.1
Adygei
Lezgins
Armenians
Georgians
Chuvashi
Fr. Basque
Sardinians
Spaniards
French
Russians
Romanians
Hungarians
Lithuanians
Orcadians
Iranians
Saudis
Bedouins
Syrians
Jordanians
Druze
Turk s
Cypriots
Ashkenazi Jews
Iraqi Jews
Yemenite Jews
Sephardi Jews
Moroccan Jews o
o
oo
oo
oo
Azerbaijani Jews (o)
gg
g
g
Georgian Jews (g)
L
L
LL
L
LL
L, Lebanese
S
S
S
S, Samaritans
T
TT
T
TT
T
T, Tuscans
J
J
JJ
Iranian Jews (J)
B
B
BB
BB
B
B
B
B, Belorussians
SbJ
U
U
U, Uzb. Jews
SbJ, Sephardi Belmonte
Palestinians
Figure 2
|
PCA of west Eurasian high-density array data. Plot of kernel
densities (Supplementary Note 2) for each population sample (n.10) was
estimated on the basis of PC1 and PC2 coordinates in Supplementary Fig. 3.
Individuals from these samples were plotted by using PC1 and PC2
coordinates and were overlaid with the plot of kernel density.
Africa Middle East Europe Central, south and east Asia
Biaka P ygmies
Mbuti Pygmies
San
Bantu
Yoruba
Mandenkas
*Ethiopian J ews
*Ethiopians
Mozabi tes
*Moroccans
*Moroccan Je ws
*Egyptians
*Saudis
*Yemenese
*Yemenite Je ws
Bedouins
Palestini ans
*Syrians
*Jordanians
Druze
*Lebanese
*Samari tans
*Turks
*Iraqi Jews
*Iranian Jews
*Iranians
*Armenians
*Georgians
*Georgian Je ws
*Azerbaijani Je ws
Adyge i
*Lezgins
*Sephardi J ews
*Ashkenazi Jews
T
B
Orcadi ans
French Basque
French
*Spaniards
Tuscans
Sardinians
*Cypriots
*Romanians
*Hungarians
*Lithuanians
*Belorussians
Russians
*Chuvashs
*Uzbeks
Uygurs
Hazara
Burusho
Pathan
Brahui
Balochi
Sindhi
Makrani
*South Indians
Yakuts
Cambodians
Dai
Lahu
Miaozu
She
Han
Tuji a
Naxi
Yizu
Tu
Xibo
Oroqe n
Mongo ls
Daur
Hezhen
Japanese
*Uzbekistani Jews
*Mumbai Jews
*Cochini Jews
Figure 3
|
Population structure inferred by ADMIXTURE analysis. Each
individual is represented by a vertical (100%) stacked column of genetic
components proportions shown in colour for K58. The Jewish
communities are labelled in colour and bold. T and B further specify
Sephardi Jews from Turkey and Bulgaria, respectively. Populations
introduced for the first time in this study and analysed together with the
Human Genome Diversity Panel
18
data are marked with an asterisk.
NATURE LETTERS
3
Macmillan Publishers Limited. All rights reserved
©2010
Our PCA, ADMIXTURE and ASD analyses, which are based on
genome-wide data from a large sample of Jewish communities, their
non-Jewish host populations, and novel samples from the Middle
East, are concordant in revealing a close relationship between most
contemporary Jews and non-Jewish populations from theLevant. The
most parsimonious explanation for these observations is a common
genetic origin, which is consistent with an historical formulation of
the Jewish people as descending from ancient Hebrew and Israelite
residents of the Levant. This inference underscores the significant
genetic continuity that exists among most Jewish communities and
contemporary non-Jewish Levantine populations, despite their long-
term residence in diverse regions remote from the Levant and isola-
tion from one another. This study further uncovers genetic structure
that partitions most Jewish samples into Ashkenazi–north African–
Sephardi, Caucasus–Middle Eastern, and Yemenite subclusters
(Fig. 2). There are several mutually compatible explanations for the
observed pattern: a splintering of Jewish populations in the early
Diaspora period, an underappreciated level of contact betweenmem-
bers of each of these subclusters, and low levels of admixture with
Diaspora host populations. Equally interesting are the inferences that
can be gleaned from more distant Diaspora communities, such as the
Ethiopian and Indian Jewish communities. Strong similarities to their
neighbouring host populations may have resulted from one or more
of the following: large-scale introgression, asymmetrical sex-biased
gene flow, or religious and cultural diffusion during the process of
becoming one of the many and varied Jewish communities.
METHODS SUMMARY
Blood or buccal samples were collected with informed consent from unrelated
volunteers who self-identified as members of one of the Jewish communities or
non-Jewish populations studied here (Supplementary Note 1). The term ‘Old
World’ refers to populations of the Eastern Hemisphere, specifically Europe,
Asia and Africa. Whenever the term Jewish is not part of the population
designation, this refers to a non-Jewish population. DNA samples chosen for
the biparental analysis were genotyped on Illumina 610K or 660K bead arrays
and showed a genotyping success rate of more than 97%. Data management and
quality control were aided by PLINK 1.05 (ref. 30). For comparison, the relevant
populations from the Illumina 650K-based data set of the Human Genome
Diversity Panel, excluding relatives
18
, were included in our analysis. After iden-
tification of the intersection of genotypes from the various Bead-Arrays, quality
control (QC) and linkage disequilibrium (LD) pruning, a total of 226,839 auto-
somal single nucleotide polymorphisms (SNPs) remained for further analysis.
PCA of autosomal variation using the smartpca of the EIGENSOFT package
24
was performed (Supplementary Note 2). Samples were modelled as comprising a
mixture of major genetic components using the structure-like ADMIXTURE
program
27
, and the inferred genetic membership of each individual from this
analysis was studied (Supplementary Notes 3 and 4). ASD
29
between groups
was assessed, and a bootstrap procedure to determine the significance of differ-
ences in ASD between pairs of populations was adapted (Supplementary Note 5).
Our uniparental data was merged with previously reported data sets for
Y-chromosome and mtDNA analysis (Supplementary Note 6). A matrix of
Y-chromosome and mtDNA haplogroup frequencies was constructed, and
PCA was performed in the R environment (using the function princomp).
Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature.
Received 9 December 2009; accepted 21 April 2010.
Published online 9 June 2010.
1. Ben-Sasson, H. H. A History of the Jewish People (Harvard Univ. Press, 1976).
2. De Lange, N. Atlas of the Jewish World (Phaidon Press, 1984).
3. Mahler, R. A History of Modern Jewry (Schocken, 1971).
4. Stillman, N. A. Jews of Arab Lands: A History and Source Book (Jewish Publication
Society of America, 1979).
5. Della Pergola, S. in Papers in Jewish Demography 1997 (eds Della Pergola, S. & Even,
J.) 11
33 (The Hebrew University of Jerusalem, 1997).
6. Cavalli-Sforza, L. L., Menozzi, A. & Piazza, A. in The History and Geography of
Human Genes 4 (Princeton Univ. Press, 1994).
7. Bauchet, M. et al. Measuring European population stratification with microarray
genotype data. Am. J. Hum. Genet. 80, 948
956 (2007).
8. Behar, D. M. et al. Counting the founders: the matrilineal genetic ancestry of the
Jewish Diaspora. PLoS ONE 3, e2062 (2008).
9. Hammer, M. F. et al. Jewish and Middle Eastern non-Jewish populations share a
common pool of Y-chromosome biallelic haplotypes. Proc. Natl Acad. Sci. USA 97,
6769
6774 (2000).
10. Kopelman, N. M. et al. Genomic microsatellites identify shared Jewish ancestry
intermediate between Middle Eastern and European populations. BMC Genet. 10,
80 (2009).
11. Need, A. C., Kasperaviciute, D., Cirulli, E. T. & Goldstein, D. B. A genome-wide
genetic signature of Jewish ancestry perfectly separates individuals with and
without full Jewish ancestry in a large random sample of European Americans.
Genome Biol. 10, R7 (2009).
12. Olshen, A. B. et al. Analysis of genetic variation in Ashkenazi Jews by high density
SNP genotyping. BMC Genet. 9, 14 (2008).
13. Ostrer, H. A genetic profile of contemporary Jewish populations. Nature Rev.
Genet. 2, 891
898 (2001).
14. Price, A. L. et al. Discerning the ancestry of European Americans in genetic
association studies. PLoS Genet. 4, e236 (2008).
15. Seldin, M. F. et al. European population substructure: clustering of northern and
southern populations. PLoS Genet. 2, e143 (2006).
16. Tian, C. et al. Analysis and application of European genetic substructure using
300 K SNP information. PLoS Genet. 4, e4 (2008).
17. Abdulla, M. A. et al. Mapping human genetic diversity in Asia. Science 326,
1541
1545 (2009).
18. Li, J. Z. et al. Worldwide human relationships inferred from genome-wide patterns
of variation. Science 319, 1100
1104 (2008).
19. Jakobsson, M. et al. Genotype, haplotype and copy-number variation in worldwide
human populations. Nature 451, 998
1003 (2008).
20. Novembre, J. et al. Genes mirror geography within Europe. Nature 456, 98
101
(2008).
21. Reich, D., Thangaraj, K., Patterson, N., Price, A. L. & Singh, L. Reconstructing Indian
population history. Nature 461, 489
494 (2009).
22. Biswas, S., Scheinfeldt, L. B. & Akey, J. M. Genome-wide insights into the patterns
and determinants of fine-scale population structure in humans. Am. J. Hum. Genet.
84, 641
650 (2009).
23. Tishkoff, S. A. et al. The genetic structure and history of Africans and African
Americans. Science 324, 1035
1044 (2009).
Table 1
|
Genetic distances (ASD) between Jewish, Levantine and Diaspora host populations
Jewish community Host population Hosts Levant*All Jews West Eurasian Jews{Jewish cluster{
Ashkenazi Europe10.236 0.239I0.240 0.236 0.235
Sephardi Spain 0.236 0.238 0.239 0.236 0.235
Moroccan Morocco 0.246 0.239 0.240 0.237 0.236
Georgian Georgia 0.234 0.238 0.239 0.236 0.236
Azerbaijani Lezgin 0.238 0.240 0.241 0.238 0.237
Iranian Iran 0.239 0.239 0.240 0.237 0.236
Iraqi Syria, Iran 0.238 0.238 0.239 0.236 0.236
Uzbekistani Uzbekistan 0.243 0.238 0.239 0.236 0.235
Bene Israel India (Mumbai) 0.240 0.245 0.245 0.243 0.241
Cochini India (Kerala) 0.238 0.247 0.247 0.245 0.241
Ethiopian Ethiopia"0.245 0.253 0.255 0.254
Yemenite Yemen 0.243 0.238 0.240 0.237
*Levant populations included Bedouin, Cypriots, Druze, Jordanians, Lebanese, Palestinians, Samaritans and Syrians.
{All Jewish populations excluding Ethiopian and Indian Jews.
{Jewish communities in the same cluster as obtained from the PCA analysis (Supplementary Fig. 3) are indicated by bold, italic or underlined type under the heading Jewish community.
1Russians, Romanians, Hungarians, Belorussians, French and Lithuanians.
ISignificance throughout the table: italic entries are significantly bigger than ASD from hosts (that is, further away), bold entries are significantly smaller than ASD from hosts; see Supplementary
Table 3 for details.
"Amhara, Oromo and Tigray.
LETTERS NATURE
4
Macmillan Publishers Limited. All rights reserved
©2010
24. Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS
Genet. 2, e190 (2006).
25. Hourani, A. A History of the Arab Peoples (Faber & Faber, 1991).
26. Weiss, K. M. & Long, J. C. Non-Darwinian estimation: my ancestors, my genes’
ancestors. Genome Res. 19, 703
710 (2009).
27. Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of
ancestry in unrelated individuals. Genome Res. 19, 1655
1664 (2009).
28. Rasmussen, M. et al. Ancient human genome sequence of an extinct Palaeo-
Eskimo. Nature 463, 757
762 (2010).
29. Gao, X. & Martin, E. R. Using allele sharing distance for detecting human
population stratification. Hum. Hered. 68, 182
191 (2009).
30. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-
based linkage analyses. Am. J. Hum. Genet. 81, 559
575 (2007).
Supplementary Information is linked to the online version of the paper at
www.nature.com/nature.
Acknowledgements We thank the individuals who provided DNA samples for this
study, including the National Laboratoryfor the Genetics of Israeli Populations; Mari
Nelis, Georgi Hudjashov and Viljo Soo for conducting the autosomal genotyping;
Lauri Anton for computational help. R.V. and D.M.B. thank the European
Commission, Directorate-General for Researchfor FP7 Ecogene grant 205419. R.V.
thanks the European Union, Regional Development Fund through a Centre of
Excellence in Genomics grant and the Swedish Collegium for Advanced Studies for
support during the initial stage of this study. E.M. and Si.R. thank the Estonian
Science Foundation for grants 7858 and 7445, respectively. K.S. thanks the Arthur
and Rosalinde Gilbert Foundation fund of the American Technion Society. Sa.R.
thanks the European Union for Marie Curie International Reintegration grant
CT-2007-208019,and the Israeli Science Foundation for grant 1227/09. IPATIMUP
is an Associate Laboratory of the Portuguese Ministry of Science, Technology and
Higher Education and is partlysupported by Fundac¸a
˜o para a Cie
ˆncia ea Tecnologia,
the Portuguese Foundation for Science and Technology.
Author Contributions D.M.B. and R.V. conceived and designed the study. B.B.T.,
D.C., D.G., D.M.B., E.K.K., G.C., I.K., L.P., M.F.H., O.B., O.S., T.P. and R.V. provided
DNA samples to this study. E.M., J.P. and G.Y. screened and prepared the samples
for the autosomal genotyping. D.M.B., E.M., G.C., M.F.H. and Si.R. generated and
summarized the databasefor the uniparental analysis. B.Y., M.M. and Sa.R. designed
and applied the modelling methodologyand statistical analysis. T.P. provided expert
input regarding the relevant historical aspects. B.Y., D.M.B., K.S., M.F.H., M.M., R.V.
and Sa.R. wrote the paper. B.Y., D.M.B. and M.M. contributed equally to the paper.
All authors discussed the results and commented on the manuscript.
Author Information The array data described in this paper are deposited in the
Gene Expression Omnibus under accession number GSE21478. Reprints and
permissions information is available at www.nature.com/reprints. The authors
declare no competing financial interests. Readers are welcome to comment on the
online version of this article at www.nature.com/nature. Correspondence and
requests for materials should be addressed to D.M.B. (behardm@usernet.com),
K.S. (skorecki@tx.technion.ac.il) or R.V. (rvillems@ebc.ee).
NATURE LETTERS
5
Macmillan Publishers Limited. All rights reserved
©2010
METHODS
Sample collection. All samples reported here were derived from a buccal swab or
blood cells collected with informed consent in accordance with protocols
approved by the National Human Subjects Review Committee in Israel and
Institutional Review boards of the participating research centres. Participants
were recruited during scheduled archaeogenetics lectures addressing the general
public, genealogical societies, heritage centres and the scientific community.
Each volunteer reported ancestry by providing information on the origin of all
four grandparents. Samples were also obtained from the National Laboratory for
the Genetics of Israeli Populations (http://www.tau.ac.il/medicine/NLGIP/).
Comparative data sets for the uniparental and biparental analysis were
assembled from the literature as summarized in Supplementary Note 1 and
Supplementary Tables 1 and 4 and 5.
Genotyping autosomal markers. Illumina 610K or 660K bead arrays were used
for genotyping with standard protocols, and Bead Studio software was used to
assign genotypes. PLINK 1.05 (ref. 30) was used to perform data management
and QC operations. Samples and SNPs with success rates of less than 97% were
excluded. A total of 475 novel samples were analysed, 121 of which were from 14
Jewish communities representing most of the known geographic range of Jews
during the past 100 years. The other 354 samples were chosen from 27 non-
Jewish populations to enable paired analysis with the Jewish sample set. For
comparison, relevant populations were further included (Supplementary
Table 1) from the Illumina 650K-based data set of the Human Genome
Diversity Panel after excluding relatives as in ref. 18. Because background LD
can distort both PCA
24
and structure-like analysis
27
results, one member of any
pair of SNPs in strong LD (r
2
.0.4) in windows of 200 SNPs (sliding the window
by 25 SNPs at a time) was removed using indep-pairwise in PLINK. After iden-
tifying the intersection of genotypes from the two types of bead array (Illumina
610K and 660K), QC and LD pruning, a total of 226,839 autosomal SNPs were
chosen for all autosomal analyses.
Principal component analysis. PC analysis was performed with the smartpca
program of the EIGENSOFT package
24
. To express the relative importance of the
top two eigenvectors in the resulting PC plot, two axes were scaled by a factor
equal to the square root of the corresponding eigenvalue (Supplementary Note
2). Our analysis was repeated for the entire set of populations and for the subset
of west Eurasian populations (Supplementary Table 1). The R environment was
used to perform PCA (using the function princomp) and plot the results for all
analyses of uniparental data.
Structure-like analysis. The recently introduced structure-like approach was
applied as assembled in the program ADMIXTURE
27
(Supplementary Notes 3
and 4). ADMIXTURE was run on our global and west Eurasian data sets 100
times in parallel at K52toK510 (using random seeds). Convergence between
independent runs at the same Kwas monitored by comparing the resulting log-
likelihood scores (LLs). The minimal variation in LLs (less than 1 LL unit) within
a fraction (10%) of runs with the highest LLs was assumed to be a reasonable
proxy for inferring convergence
28
. In the global data set, convergence was
observed in the case of all explored Kvalues (K52toK510). Results from
runs at all values of Kare shown rather than restricting the reader to one chosenK
(Supplementary Note 3). To focus on population structure in the relevant
regions of the Middle East and Europe we performed analyses on a data set
restricted to west Eurasian samples. In this analysis, convergence was reached
at K52toK55; K57 and K58. Only K54 was highlighted in Supplemen-
tary Fig. 5 because components appearing at higher values of Kwere predomi-
nantly restricted to a single population and were therefore less informative for
our purposes. Judging from the distribution of LLs of the converged Kvalues, the
maximum-likelihood solutions with LLs very close to the highest LLs were also
the most frequent solutions (except for K56 of the global data set). One run
from the top LLs fraction of each converged K(from global and west Eurasian
data set) was plotted with Excel (Supplementary Fig. 4a, b).
Allele sharing distances. ASD was used for measuring genetic distances between
populations. ASD is less sensitive to small sample size than the Fixation Index
(F
ST
) and other measures
29
, and more appropriate for our goal of measuring
genetic distances between groups regardless of their internal diversity. Standard
errors of ASD values were calculated with a bootstrap approach, accounting for
variance resulting from both sample selection and site selection. ASDs between
individual Jewish populations and population groups representing a geographic
region or ethnic group were calculated. In each case, the population under
consideration was removed from all groupings with which it was compared.
To test significance of differences in pairs of ASD values in each row in
Table 1, a bootstrap approach was used (Supplementary Note 5 and Supplemen-
tary Tables 2 and 3).
Genotyping uniparental markers. Our data from the Y chromosome and
mtDNA were combined with previously published data sets from populations
of interest (Supplementary Note 6). Markers were chosen to match the phylo-
genetic level of resolution achieved in previously reported data sets. A total of
8,210 samples were assembled for Y-chromosome analysis (Supplementary
Table 4). Genotypes for these sites were determined by using multiple tech-
niques, such as allele-specific PCR, TaqMan, Kaspar and direct sequencing. A
total of 13,919 samples were assembled for mtDNA analysis (Supplementary
Table 5).
doi:10.1038/nature09103
Macmillan Publishers Limited. All rights reserved
©2010
... allen-ancient-dna-resource-aadr-downloadable-genotypes-presentday-and-ancient-dna-data). We assembled a dataset from mostly European populations for genome-wide analyses [105][106][107][108][109][110][111][112][113][114][115] . This modern set includes 10,176 individuals. ...
Article
Full-text available
The early Iron Age (800 to 450 BCE) in France, Germany and Switzerland, known as the ‘West-Hallstattkreis’, stands out as featuring the earliest evidence for supra-regional organization north of the Alps. Often referred to as ‘early Celtic’, suggesting tentative connections to later cultural phenomena, its societal and population structure remain enigmatic. Here we present genomic and isotope data from 31 individuals from this context in southern Germany, dating between 616 and 200 BCE. We identify multiple biologically related groups spanning three elite burials as far as 100 km apart, supported by trans-regional individual mobility inferred from isotope data. These include a close biological relationship between two of the richest burial mounds of the Hallstatt culture. Bayesian modelling points to an avuncular relationship between the two individuals, which may suggest a practice of matrilineal dynastic succession in early Celtic elites. We show that their ancestry is shared on a broad geographic scale from Iberia throughout Central-Eastern Europe, undergoing a decline after the late Iron Age (450 BCE to ~50 CE).
... Throughout the last few decades, microsatellite markers have been used across livestock species, and reliable results have been produced in the same context. After the dominance of microsatellite markers in genome-wide studies, SNPs have now emerged as important third-generation markers and act as a substitute for microsatellites in studies on different aspects of population genetics [32]. With the advent of density-based SNP panels, it has become extremely easy to conduct genome-wide studies on livestock species. ...
Article
The fastest way to significantly change the composition of a population is through admixture, an evolutionary mechanism. In animal breeding history, genetic admixture has provided both short-term and long-term advantages by utilizing the phenomenon of complementarity and heterosis in several traits and genetic diversity, respectively. The traditional method of admixture analysis by pedigree records has now been replaced greatly by genome-wide marker data that enables more precise estimations. Among these markers, SNPs have been the popular choice since they are cost-effective, not so laborious, and automation of genotyping is easy. Certain markers can suggest the possibility of a population's origin from a sample of DNA where the source individual is unknown or unwilling to disclose their lineage, which are called Ancestry-Informative Markers (AIMs). Revealing admixture level at the locus-specific level is termed as local ancestry and can be exploited to identify signs of recent selective response and can account for genetic drift. Considering the importance of genetic admixture and local ancestry, in this mini-review, both concepts are illustrated, encompassing basics, their estimation/identification methods, tools/- software used and their applications.
... The study of population genetics investigated the origins of these disparate populations to determine if there was a common genetic heritage. Multiple studies of autosomal DNA show that the genetics of Ashkenazi, Sephardi, and Mizrachi Jews share ancestry despite the thousands of years of separation [4,5]. ...
Article
What makes you, you? Your genes. Every human has a unique genetic code made of genes. Genes are the basic physical and functional unit of heredity and are made of DNA. Each gene ranges from a few hundred DNA bases to more than 2 million. A human being has between 20,000 to 25,000 genes. Each person inherits two copies of each gene, one copy from the mother and the other from the father. Most of our genes are the same; less than one percent of genes differ between people. These differing components are known as alleles. Alleles are forms of the same gene but with small differences in their DNA bases, and alleles are what make each one of us unique. One’s genotype is the internal, inherited genetic code inside all living organisms. A phenotype is the outside, physical manifestation of those genes. Phenotype examples include skin, eye, and hair color. They are related to all that is physical, functional, or behavioral [1,2].
... Such human groups in Asia, Africa, Europe, and 'We shall triumph like the Jews': unveiling the implicit side… America have embraced the controversial theory that they descend from the lost tribes of Israel (Parfitt and Fisher 2016). Although some groups have been admitted into Israel on the basis of 'shared ancestry' or genetic association; Ethiopian Jews, Beta Israel, Bnei Menashe of eastern India' etc. (Behar et al. 2020), the Igbo claim of being from the tribe of Gad, one of the lost tribes (Miles 2023;Lis 2009) despite some appealing framings (Bruder and Parfitt 2012) has been largely speculative (Harnischfeger 2012). In spite of this, an increasing number of mainly young Igbo claiming Jewish lineage, absolutely believe that independence of their Biafra homeland will only come through armed struggle as they hoped for Elohim's providence for victory. ...
Article
Full-text available
This study investigated the impetus for the armed separatism of the Indigenous Peoples Of Biafra (IPOB). Drawing from a mixed method design utilizing a cross-sectional survey and qualitative study, seventy five (75) agitators were purposively sampled to assess the trajectory of revivified armed struggle for the independence of Biafra in Southeast Nigeria. Informed conversations on neo-Biafra movements are mainly constructed around state repression, socio-political exclusion and terrorism. This study focused on how ancient Israel’s providential war victories are renegotiated in IPOB’s armed struggle against the Nigerian state. The article specifically assessed how the influences from Jewish triumphant war narratives and self-rule are re-enacted in the IPOB’s armed struggle towards achieving Biafra’s independence in Southeast Nigeria. The findings indicated that the more religiously motivated participants are, the more their zest and proclivity to armed struggle for an independent state of Biafra (β = 0.25; t = 2.12; P < 0.05). Hence, the renewed armed struggle for an independent state of Biafra was largely traced to religious motivation and proclivity to violence F (2, 74) = 2.26, P < 0.05. In line with the findings, conclusions were drawn and implications were examined. The study uncovers the motivation behind the reinvigoration of the IPOB movement and their perception of Elohim’s (God) backing to have been drawn from the Jews’ past armed insurrection, war successes and eventual political freedom. This account maps into the discourse on how religious beliefs frame the struggle for political independence in postcolonial settings.
... Recent studies of population genetics uncovered the early shared backgrounds and subsequent mutations of contemporary Jewish populations (Bonné-Tamir et al. 1992;Motul-sky 1995;Skorecki et al. 1997;Hammer et al. 2000;Risch et al. 2003;Behar et al. 2004aBehar et al. , 2010Bradman et al. 2004;Adams et al. 2008;Carmi et al. 2014;Yardumian and Schurr 2019). Although the historicity of the traditional Biblical account should be critically scrutinized, recent genome research allowed us to figure out the ancient antecedents of fatherhood and motherhood of present populations. ...
Article
Full-text available
As an essential prerequisite to the genealogical study of Jews, some elements of Jewish demographic history are provided in a long-term transnational perspective. Data and estimates from a vast array of sources are combined to draw a profile of Jewish populations globally, noting changes in geographical distribution, vital processes (marriages, births and deaths), international migrations, and changes in Jewish identification. Jews often anticipated the transition from higher to lower levels of mortality and fertility, or else joined large-scale migration flows that reflected shifting constraints and opportunities locally and globally. Cultural drivers typical of the Jewish minority interacted with socioeconomic and political drivers coming from the encompassing majority. The main centers of Jewish presence globally repeatedly shifted, entailing the intake within Jewish communities of demographic patterns from significantly different environments. During the 20th century, two main events reshaped the demography of the Jews globally: the Shoah (destruction) of two thirds of all Jews in Europe during World War II, and the independence of the State of Israel in 1948. Mass immigration and significant convergence followed among Jews of different geographical origins. Israel’s Jewish population grew to constitute a large share—and in the longer run—a potential majority of all Jews worldwide. Since the 19th century, and with increasing visibility during the 20th and the 21st, Jews also tended to assimilate in the respective Diaspora environments, leading to a blurring of identificational boundaries and sometimes to a numerical erosion of the Jewish population. This article concludes with some implications for Jewish genealogical studies, stressing the need for contextualization to enhance their value for personal memory and for analytic work.
Article
Full-text available
The risk of developing age-related macular degeneration (AMD) is influenced by genetic background. In 2016, the International AMD Genomics Consortium (IAMDGC) identified 52 risk variants in 34 loci, and a polygenic risk score (PRS) from these variants was associated with AMD. The Israeli population has a unique genetic composition: Ashkenazi Jewish (AJ), Jewish non-Ashkenazi, and Arab sub-populations. We aimed to perform a genome-wide association study (GWAS) for AMD in Israel, and to evaluate PRSs for AMD. Our discovery set recruited 403 AMD patients and 256 controls at Hadassah Medical Center. We genotyped individuals via custom exome chip. We imputed non-typed variants using cosmopolitan and AJ reference panels. We recruited additional 155 cases and 69 controls for validation. To evaluate predictive power of PRSs for AMD, we used IAMDGC summary-statistics excluding our study and developed PRSs via clumping/thresholding or LDpred2. In our discovery set, 31/34 loci reported by IAMDGC were AMD-associated (P < 0.05). Of those, all effects were directionally consistent with IAMDGC and 11 loci had a P-value under Bonferroni-corrected threshold (0.05/34 = 0.0015). At a 5 × 10⁻⁵ threshold, we discovered four suggestive associations in FAM189A1, IGDCC4, C7orf50, and CNTNAP4. Only the FAM189A1 variant was AMD-associated in the replication cohort after Bonferroni-correction. A prediction model including LDpred2-based PRS + covariates had an AUC of 0.82 (95% CI 0.79–0.85) and performed better than covariates-only model (P = 5.1 × 10⁻⁹). Therefore, previously reported AMD-associated loci were nominally associated with AMD in Israel. A PRS developed based on a large international study is predictive in Israeli populations.
Article
Due to its complex history of migrations and colonization of African, European, and Asian people, the Tunisian territory is an ideal area to study the effects of cultural change on the genetic structure of human populations. This study investigated genetic variation in the mitochondrial DNA of Tunisian populations to detect the possible impact of recent historical events on their gene pool. Two Arab and three Berber communities were analyzed using a comparison data set of 45 other populations comprising African, Arabian, Asian, European, and Near Eastern groups. Results were compared with those produced using a large panel of autosomal single-nucleotide polymorphisms. We observed a slight but important difference between the populations that inhabit the southern and central-northern areas of Tunisia. Furthermore, robust signatures of genetic isolation were detected in two Berber populations (Nouvelle Zraoua and Tamezret) and in the R’Baya people, a seminomadic Arab group. This investigation suggests that the genetic structure of investigated southern Tunisian populations retains signatures of historical events that occurred between the 7th and 17th centuries, particularly the trans-Saharan slave trade and the emigration of Berbers in remote areas of the south during the Arab conquest.
Article
There are various hypotheses on the origin and time of the appearance of Russian settlements in the Arctic Ocean shores of Eastern Siberia. In order to study the history of the formation of the russian old-settlers of Yakutia, we analyzed the lineages of the Y-chromosome in three groups of residents of the village of Russkoye Ust’ye, located in the delta of the Indigirka River (“Pomors”, “Cossacks” and “Zashivertsy”), comparable with the main migration waves of settlement of Russians on the Arctic coast of Eastern Siberia. For the first time, the characteristic features of the genetic structure of the population of Russkoustinans are described by the data of a genome-wide analysis of 740,000 single nucleotide polymorphisms. The results of the study to a greater extent testify in favor of the “Pomor” hypothesis of the origin of the Russkoustinians.
Article
Full-text available
Individuals differ widely in their immune responses, with age, sex and genetic factors having major roles in this inherent variability 1–6 . However, the variables that drive such differences in cytokine secretion—a crucial component of the host response to immune challenges—remain poorly defined. Here we investigated 136 variables and identified smoking, cytomegalovirus latent infection and body mass index as major contributors to variability in cytokine response, with effects of comparable magnitudes with age, sex and genetics. We find that smoking influences both innate and adaptive immune responses. Notably, its effect on innate responses is quickly lost after smoking cessation and is specifically associated with plasma levels of CEACAM6, whereas its effect on adaptive responses persists long after individuals quit smoking and is associated with epigenetic memory. This is supported by the association of the past smoking effect on cytokine responses with DNA methylation at specific signal trans -activators and regulators of metabolism. Our findings identify three novel variables associated with cytokine secretion variability and reveal roles for smoking in the short- and long-term regulation of immune responses. These results have potential clinical implications for the risk of developing infections, cancers or autoimmune diseases.
Article
Full-text available
Evolutionary event has not only altered the genetic structure of human populations but also associated with social and cultural transformation. South Asian populations were the result of migration and admixture of genetically and culturally diverse groups. Most of the genetic studies pointed to large-scale admixture event between Ancestral North Indian (ANI) and Ancestral South Indian (ASI) groups, also additional layers of recent admixture. In the present study, we have analyzed 213 individuals inhabited in South-west coast India with traditional warriors and feudal lord status and historically associated with migratory events from North/North West India and possible admixture with West Eurasian populations, whose genetic links are still missing. Analysis of autosomal SNP markers suggests that these groups possibly derived their ancestry from some groups of North West India having additional Middle Eastern genetic component. Higher distribution of West Eurasian mitochondrial haplogroups also points to female-mediated admixture. Estimation of Effective Migration Surface (EEMS) analysis indicates Central India and Godavari basin as a crucial transition zone for population migration from North and North West India to South-west coastal India. Selection screen using three distinct outlier-based approach revealed genetic signatures related to Immunity and protection from Viral infections. Thus, our study suggests that the South-west coastal groups with traditional warriors and feudal lords’ status are of a distinct lineage compared to Dravidian and Gangetic plain Indo-Europeans and are remnants of very early migrations from North West India following Godavari basin to Karnataka and Kerala.
Article
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual’s DNA can be used to infer their geographic origin with surprising accuracy—often to within a few hundred kilometres.
Article
Asia harbors substantial cultural and linguistic diversity, but the geographic structure of genetic variation across the continent remains enigmatic. Here we report a large-scale survey of autosomal variation from a broad geographic sample of Asian human populations. Our results show that genetic ancestry is strongly correlated with linguistic affiliations as well as geography. Most populations show relatedness within ethnic/linguistic groups, despite prevalent gene flow among populations. More than 90% of East Asian (EA) haplotypes could be found in either Southeast Asian (SEA) or Central-South Asian (CSA) populations and show clinal structure with haplotype diversity decreasing from south to north. Furthermore, 50% of EA haplotypes were found in SEA only and 5% were found in CSA only, indicating that SEA was a major geographic source of EA populations.