![Meredith Yeager](https://c5.rgstatic.net/m/437738464651637/images/template/default/profile/profile_default_l.jpg)
Meredith YeagerNational Cancer Institute (USA) | NCI · Human Genetics
Meredith Yeager
About
669
Publications
82,776
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
30,903
Citations
Publications
Publications (669)
Childhood radioactive iodine exposure from the Chornobyl accident increased papillary thyroid carcinoma (PTC) risk. While cervical lymph node metastases (cLNM) are well-recognized in pediatric PTC, the PTC metastatic process and potential radiation association are poorly understood. Here, we analyze cLNM occurrence among 428 PTC with genomic landsc...
Here, in a multi-ancestry genome-wide association study meta-analysis of kidney cancer (29,020 cases and 835,670 controls), we identified 63 susceptibility regions (50 novel) containing 108 independent risk loci. In analyses stratified by subtype, 52 regions (78 loci) were associated with clear cell renal cell carcinoma (RCC) and 6 regions (7 loci)...
Introduction: Persistent infection with a high-risk HPV (HR-HPV) type causes cervical cancer and many other cancers in both men and women. Our published data have shown that the distribution of specific HPV16/35 sublineages and genetic variants differ around the world and confer greater risk of cervical precancer/cancer in the populations where the...
Some evidence suggests that pediatric sarcomas have both shared and distinct genetic profiles; however, large-scale efforts to characterize germline genetic susceptibility across these malignancies are limited by their rarity. We evaluated the role of common and rare variants in the genetic etiology of the more frequent pediatric sarcomas: osteosar...
High-coverage sequencing allows the study of variants occurring at low frequencies within samples, but is susceptible to false-positives caused by sequencing error. Ion Torrent has a very low single nucleotide variant (SNV) error rate and has been employed for the majority of human papillomavirus (HPV) whole genome sequences. However, benchmarking...
Burkitt lymphoma (BL) is responsible for many childhood cancers in sub-Saharan Africa, where it is linked to recurrent or chronic infection by Epstein-Barr virus or Plasmodium falciparum. However, whether human leukocyte antigen (HLA) polymorphisms, which regulate immune response, are associated with BL has not been well investigated, which limits...
In high-income countries, mosaic chromosomal alterations in peripheral blood leukocytes are associated with an elevated risk of adverse health outcomes, including hematologic malignancies. We investigate mosaic chromosomal alterations in sub-Saharan Africa among 931 children with Burkitt lymphoma, an aggressive lymphoma commonly characterized by im...
Background:
Glyphosate is the most commonly used herbicide worldwide and has been implicated in the development of certain hematologic cancers. Although mechanistic studies in human cells and animals support the genotoxic effects of glyphosate, evidence in human populations is scarce.
Objectives:
We evaluated the association between lifetime occ...
Burkitt lymphoma (BL) is an aggressive B‐cell lymphoma that significantly contributes to childhood cancer burden in sub‐Saharan Africa. Plasmodium falciparum, which causes malaria, is geographically associated with BL, but the evidence remains insufficient for causal inference. Inference could be strengthened by demonstrating that mendelian genes k...
Cervical cancer is caused by human papillomavirus (HPV) infection, has few approved targeted therapeutics, and is the most common cause of cancer death in low-resource countries. We characterized 19 cervical and four head and neck cell lines using long-read DNA and RNA sequencing and identified the HPV types, HPV integration sites, chromosomal alte...
HPV16 is the most oncogenic type of human papillomaviruses (HPV). Integration of HPV into the human genome is an important mechanism of carcinogenesis but is absent in at least 30% of HPV16+ tumors. We applied long-read whole-genome sequencing (WGS) to cervical cancer cell lines and tumors to characterize HPV16 carcinogenesis in the absence of inte...
Background:
Relative to other pediatric cancers, survival for rhabdomyosarcoma has not improved in recent decades, suggesting the need to enhance risk stratification. Therefore, we conducted a genome-wide association study for event-free survival (EFS) and overall survival (OS) to identify genetic variants associated with outcomes in individuals w...
Background
Mapping of quantitative trait loci (QTL) associated with molecular phenotypes is a powerful approach for identifying the genes and molecular mechanisms underlying human traits and diseases, though most studies have focused on individuals of European descent. While important progress has been made to study a greater diversity of human pop...
Ewing sarcoma (EwS) is a rare bone and soft tissue malignancy driven by chromosomal translocations encoding chimeric transcription factors, such as EWSR1-FLI1, that bind GGAA motifs forming novel enhancers that alter nearby expression. We propose that germline microsatellite variation at the 6p25.1 EwS susceptibility locus could impact downstream g...
The Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial is a prospective cohort study of nearly 155,000 U.S. volunteers aged 55–74 at enrollment in 1993–2001. We developed the PLCO Atlas Project, a large resource for multi-trait genome-wide association studies (GWAS), by genotyping participants with available DNA and genomic consen...
The human papillomavirus (HPV) type 16 E7 oncogene is critical to carcinogenesis and highly conserved. Previous studies identified a preponderance of non-synonymous E7 variants amongst HPV16-positive cancer-free controls compared to those with cervical cancer. To investigate the function of E7 variants, we constructed full-length HPV16 E7 genes and...
Myelofibrosis is a rare myeloproliferative neoplasm (MPN) with high risk for progression to acute myeloid leukemia. Our integrated genomic analysis of up to 933 myelofibrosis cases identifies 6 germline susceptibility loci, 4 of which overlap with previously identified MPN loci. Virtual karyotyping identifies high frequencies of mosaic chromosomal...
The chr12q24.13 locus encoding OAS1–OAS3 antiviral proteins has been associated with coronavirus disease 2019 (COVID-19) susceptibility. Here, we report genetic, functional and clinical insights into this locus in relation to COVID-19 severity. In our analysis of patients of European (n = 2,249) and African (n = 835) ancestries with hospitalized ve...
Background
Oropharyngeal squamous cell carcinoma is frequently associated with high-risk HPV infection, which confers a good prognosis. Immunohistochemistry for p16 is used as a surrogate for HPV status, but discrepant results are occasionally seen. Here, we report a case with a unique pattern of partial loss of p16.
Methods
A 63 year old male pre...
Background: Rhabdomyosarcoma (RMS) is the most common soft tissue sarcoma in children and has one of the poorest survival rates among pediatric cancers, underscoring the need to identify factors which may be leveraged to improve therapeutic options for these individuals.
Methods: We carried out a genome-wide association study of overall survival (O...
Following the Chernobyl nuclear power plant explosion in Ukraine in 1986, increased childhood exposure to radioactive iodine (131I), which occurred primarily through contaminated food sources, has been consistently associated with increased risk of developing papillary thyroid carcinoma (PTC). Increased frequency of cervical lymph node metastases (...
Chordoma is a rare bone tumor with genetic risk factors largely unknown. We conducted a whole‐exome sequencing (WES) analysis of germline DNA from 19 familial chordoma cases in five pedigrees and 137 sporadic chordoma patients and identified 17 rare germline variants in PALB2 and BRCA2, whose products play essential roles in homologous recombinatio...
Purpose
A significant barrier to adoption of de-escalated treatment protocols for human papillomavirus-driven oropharyngeal cancer (HPV-OPC) is that few predictors of poor prognosis exist. We conducted the first large whole-genome sequencing (WGS) study to characterize the genetic variation of the HPV16 genome and to evaluate its association with H...
Background
Mapping of quantitative trait loci (QTL) associated with molecular phenotypes is a powerful approach for identifying the genes and molecular mechanisms underlying human traits and diseases. How the genetic architecture of molecular traits varies across human populations, however, has been less explored. To better understand the genetics...
The application of whole-exome sequencing (WES) has led to the identification of high and moderate-risk variants that contribute to cutaneous melanoma susceptibility. However, confirming disease-causing variants remains challenging. We applied a gene co-expression network analysis to prioritize candidate genes identified from WES of 34 melanoma-pro...
Oropharyngeal squamous cell carcinoma (SCC) is increasing in incidence and, in Western countries, strongly associated with transcriptionally-active high-risk human papillomavirus (HPV). Within HPV-positive tumors, there is wide morphologic diversity with numerous histologic subtypes of SCC. There are also variable degrees of keratinization, anaplas...
Purpose: A significant barrier to adoption of de-escalated treatment protocols for human papillomavirus-driven oropharyngeal cancer (HPV-OPC) is that few predictors of poor prognosis exist. We conducted the first large whole-genome sequencing (WGS) study to characterize the genetic variation of the HPV16 genome and to evaluate its association with...
The human papillomavirus (HPV) type 16 E7 oncogene is critical to carcinogenesis and highly conserved. Previous studies identified a preponderance of non-synonymous E7 variants amongst HPV16-positive cancer-free controls compared to those with cervical cancer. To investigate the function of E7 variants, we constructed full-length HPV16 E7 genes and...
Introduction: Myelofibrosis (MF) is a rare myeloproliferative neoplasm (MPN) characterized by bone marrow fibrosis, progressive bone marrow failure, and increased risk of acute myeloid leukemia. While MF arises from somatic driver mutations in JAK2, MPL, and CALR, some MPN patients may have a heritable component. To comprehensively examine the gene...
Integration of Human Papillomaviruses (HPV) is an important mechanism of carcinogenesis but is absent in a significant fraction of HPV16+ tumors. We applied long-read whole-genome sequencing (WGS) to cervical cancer cell lines and tumors. In two HPV16+ cell lines, we identified large tandem arrays of full-length and truncated viral genomes integrat...
Human papillomavirus (HPV) type 31 (HPV31) is closely related to the most carcinogenic type, HPV16, but only accounts for 4% of cervical cancer cases worldwide. Viral genetic and epigenetic variations have been associated with carcinogenesis for other high-risk HPV types, but little is known about HPV31. We sequenced 2093 HPV31 viral whole genomes...
APOBEC is a mutagenic source in human papillomavirus (HPV)-mediated malignancies, including HPV+ oropharyngeal squamous cell carcinoma (HPV + OPSCC), and in HPV genomes. It is unknown why APOBEC mutations predominate in HPV + OPSCC, or if the APOBEC-induced mutations observed in both human cancers and HPV genomes are directly linked. We performed s...
To improve risk stratification and treatment decisions for patients with acute myeloid leukemia (AML) undergoing hematopoietic cell transplantation (HCT). We used SNP-array data from the DISCOVeRY-BMT study to detect chromosomal aberrations in pre-HCT peripheral blood (collected 2–4 weeks before the administration of conditioning regimen) from 1974...
Genomic regions have been associated with COVID-19 susceptibility and outcomes, including the chr12q24.13 locus encoding antiviral proteins OAS1-3. Here, we report genetic, functional, and clinical insights into genetic associations within this locus. In Europeans, the risk of hospitalized vs. non-hospitalized COVID-19 was associated with a single...
Aim:
Recessive genetic variation is thought to play a role in non-Hodgkin lymphoma (NHL) etiology. Runs of homozygosity (ROH), defined based on long, continuous segments of homozygous SNPs, can be used to estimate both measured and unmeasured recessive genetic variation. We sought to examine genome-wide homozygosity and NHL risk.
Methods:
We use...
Background:
Chordoma is a rare bone cancer with an unknown etiology. TBXT is the only chordoma susceptibility gene identified to date; germline single nucleotide variants and copy number variants in TBXT have been associated with chordoma susceptibility in familial and sporadic chordoma. However, the genetic susceptibility of chordoma remains larg...
Background:
Fanconi anemia (FA) is an inherited bone marrow failure syndrome associated with characteristic dysmorphology primarily caused by biallelic pathogenic germline variants in any of 22 different DNA repair genes. There are limited data on the specific molecular causes of FA in different ethnic groups.
Methods:
We performed exome sequenc...
Admixed populations are a resource to study the global genetic architecture of complex phenotypes, which is critical, considering that non-European populations are severely underrepresented in genomic studies. Here, we study the genetic architecture of BMI in children, young adults, and elderly individuals from the admixed population of Brazil.
Lev...
Genomics of radiation-induced damage
The potential adverse effects of exposures to radioactivity from nuclear accidents can include acute consequences such as radiation sickness, as well as long-term sequelae such as increased risk of cancer. There have been a few studies examining transgenerational risks of radiation exposure but the results have...
Genomics of radiation-induced damage
The potential adverse effects of exposures to radioactivity from nuclear accidents can include acute consequences such as radiation sickness, as well as long-term sequelae such as increased risk of cancer. There have been a few studies examining transgenerational risks of radiation exposure but the results have...
The 1986 Chernobyl nuclear power plant accident increased papillary thyroid cancer (PTC) incidence in surrounding regions, particularly for 131I-exposed children. To investigate the contribution of environmental radiation to PTC characteristics and improve understanding of radiation-induced carcinogenesis, we analyzed genomic, transcriptomic, and e...
A Correction to this paper has been published: https://doi.org/10.1038/s41375-021-01140-5
We previously identified 10 lung adenocarcinoma susceptibility loci in a genome-wide association study (GWAS) conducted in the Female Lung Cancer Consortium in Asia (FLCCA), the largest genomic study of lung cancer among never-smoking women to date. Furthermore, household coal use for cooking and heating has been linked to lung cancer in Asia, espe...
Background
Pediatric cancers are the leading cause of death by disease in children despite improved survival rates overall. The contribution of germline genetic susceptibility to pediatric cancer survivors has not been extensively characterized. We assessed the frequency of pathogenic/likely pathogenic (P/LP) variants in 5,451 long-term pediatric c...
Age-related male Y and female X chromosome mosaicism is commonly observed in large population-based studies. To investigate the frequency of male X chromosome mosaicism, we scanned for deviations in chromosome X genotyping array intensity data in a population-based survey of 196,219 UK Biobank men. We detected 12 (0.006%) men with mosaic chromosome...
Key Points
Pre-HCT mosaicism is related to increased relapse risk and lower survival after unrelated HCT, independent of cytogenetics at diagnosis. Pre-HCT mosaicism could be a useful clinical tool to guide risk stratification in acute lymphoblastic leukemia patients.
Human leukocyte antigen (HLA) gene variation is associated with risk of cancers, particularly those with infectious etiology or hematopoietic origin, given its role in immune presentation. Previous studies focused primarily on HLA allele/haplotype-specific associations. To answer whether associations are driven by HLA class I (essential for T-cell...
Introduction: Allogeneic hematopoietic cell transplantation (HCT) is curative for myelofibrosis (MF). However, prognosis post HCT is variable. Identifying subsets of patients who can greatly benefit from HCT is important for clinical decision-making and patient counseling. The Dynamic International Prognostic Scoring System (DIPSS) score includes c...
Clonal mosaicism, detectable in peripheral blood, can be an important predictor of developing a hematological malignancy. We sought to determine if mosaic events, in addition to clinical and demographic variables, contributed independent information about acute lymphoblastic leukemia (ALL) patient survival and risk of relapse after allogeneic hemat...
Although next-generation sequencing has demonstrated great potential for novel gene discovery, confirming disease-causing genes after initial discovery remains challenging. Here, we applied a network analysis approach to prioritize candidate genes identified from whole-exome sequencing analysis of 98 cutaneous melanoma patients from 27 families. Us...
Background:
Ewing sarcoma (EwS) is a rare, aggressive solid tumor of childhood, adolescence and young adulthood associated with pathognomonic EWSR1-ETS fusion oncoproteins altering transcriptional regulation. Genome-wide association studies (GWAS) have identified 6 common germline susceptibility loci but have not investigated low-frequency inherit...
The immune system has several mechanisms to recognize tumor cells as foreign and eliminate them. Somatically altered genes can create peptides, or neoantigens, that class I HLA (HLA-I) molecules can present to cytotoxic T-cells. As a result, people with suppressed immune systems commonly have a higher rate of cancer. PanCancer data of 9423 tumor ex...
Background: Human papillomavirus type 16 (HPV16) is one of the most common and carcinogenic HPV types associated with high risk of anal, vagina, vulva, penis and cervical neoplastic transformations. However, many genetic variants exist within this virus and not all seem to have the same carcinogenic potential.
Aim: To determine HPV16 lineages and t...
Human papillomavirus (HPV) 16 displays substantial sequence variation; four HPV16 lineages (A, B, C, and D) have been described as well as multiple sublineages. To identify molecular events associated with HPV16 carcinogenesis, we evaluated viral variation, the integration of HPV16, and somatic mutation in 96 cervical cancer samples from Guatemala....
Intratumor heterogeneity (ITH) and tumor evolution have been well described for clear cell renal cell carcinomas (ccRCC), but they are less studied for other kidney cancer subtypes. Here we investigate ITH and clonal evolution of papillary renal cell carcinoma (pRCC) and rarer kidney cancer subtypes, integrating whole-genome sequencing and DNA meth...
Cancer risk is highly variable in carriers of the common TP53-R337H founder allele, possibly due to the influence of modifier genes. Whole-genome sequencing identified a variant in the tumor suppressor XAF1 (E134*/Glu134Ter/ rs146752602) in a subset of R337H carriers. Haplotype-defining variants were verified in 203 patients with cancer, 582 relati...
HPV35 has been found in only ∼2% of invasive cervical cancers (ICC) worldwide but up to 10% in Sub‐Saharan Africa, warranting further investigation and consideration of impact on preventive strategies. We studied HPV35 and ethnicity, in relation to the known steps in cervical carcinogenesis, using multiple large epidemiologic studies in the U.S. an...
Importance
Osteosarcoma, the most common malignant bone tumor in children and adolescents, occurs in a high number of cancer predisposition syndromes that are defined by highly penetrant germline mutations. The germline genetic susceptibility to osteosarcoma outside of familial cancer syndromes remains unclear.
Objective
To investigate the germlin...
The Transatlantic Slave Trade transported more than 9 million Africans to the Americas between the early 16th and the mid-19th centuries. We performed a genome-wide analysis using 6,267 individuals from 25 populations to infer how different African groups contributed to North-, South-American and Caribbean populations, in the context of geographic...
Background
High-resolution genome-wide SNP-arrays detect large chromosomal aberrations including copy-neutral loss of heterozygosity (CNLOH), which is not captured in conventional cytogenetics.
Methods
We used SNP-array genotyping data generated by the DISCOVeRY-BMT study to detect chromosomal aberrations in pre-HCT blood samples from 1,974 acute...
Mosaic loss of Y chromosome (mLOY) is the most frequently detected somatic copy number alteration in leukocytes of men. In this study, we investigate blood cell counts as a potential mechanism linking mLOY to disease risk in 206,353 UK males. Associations between mLOY, detected by genotyping arrays, and blood cell counts were assessed by multivaria...
HPV16 causes half of cervical cancers worldwide; for unknown reasons, most infections resolve within two years. Here, we analyze the viral genomes of 5,328 HPV16-positive case-control samples to investigate mutational signatures and the role of human APOBEC3-induced mutations in viral clearance and cervical carcinogenesis. We identify four de novo...
Western South America was one of the worldwide cradles of civilization. The well known Inca Empire was the tip of the iceberg of a cultural and biological evolutionary process that started 14-11 thousand years ago. Genetic data from 18 Peruvian populations reveal that: (1) The between-population homogenization of the central-southern Andes and its...
Severe aplastic anemia (SAA) is a rare disorder characterized by hypoplastic bone marrow and progressive pancytopenia. The etiology of acquired SAA is not understood but is likely related to abnormal immune responses and environmental exposures. We conducted a genome-wide association study of individuals with SAA genetically matched to healthy cont...
Human papillomavirus (HPV) 16 displays substantial sequence variation; four HPV16 lineages (A, B, C, D) have been described, as well as multiple sub-lineages. To identify molecular events associated with HPV16 carcinogenesis we evaluated viral variation, the integration of HPV16, and somatic mutation in 96 cervical cancer samples from Guatemala. A...
Admixed populations are a resource to study the global genetic architecture of complex phenotypes, which is critical, considering that non-European populations are severely under-represented in genomic studies. Leveraging admixture in Brazilians, whose chromosomes are mosaics of fragments of Native American, European and African origins, we used ge...
Introduction: Clinical cytogenetics is the most important prognostic test for patients with acute myeloid leukemia (AML) or myelodysplastic syndrome (MDS); however, current tools do not provide a complete genome-wide picture of somatic chromosomal aberrations in those patients. We used a high-resolution genome-wide single-nucleotide polymorphism (S...
Introduction. Acquired severe aplastic anemia (SAA) is a life-threatening disorder characterized by severe progressive pancytopenia and hypocellular bone marrow. The etiology of acquired SAA is not understood but believed to be related to abnormal immune responses to environmental exposures. We conducted a genome-wide association study (GWAS) to id...
Background:
Human papillomavirus (HPV) is a common sexually transmitted infection associated with cervical cancer that frequently occurs as a coinfection of types and subtypes. Highly similar sublineages that show over 100-fold differences in cancer risk are not distinguishable in coinfections with current typing methods.
Results:
We describe an...
Genetic susceptibility is likely involved in nasopharyngeal carcinoma (NPC), a cancer caused by Epstein-Barr virus (EBV) infection. Understanding of genetic factors involved in NPC and how they contribute to EBV-induced carcinogenesis is limited. We conducted whole-exome capture/sequencing among 251 individuals from 97 multiplex families from Taiwa...
Background:
Genetic susceptibility is associated with nasopharyngeal carcinoma (NPC). We previously identified rare variants potentially involved in familial NPC and common variants significantly associated with sporadic NPC.
Methods:
We conducted targeted gene sequencing of 20 genes [16 identified from the study of multiplex families, three ide...
Clonal hematopoiesis (CH) is an age-related accumulation of detectable somatic mutations in circulating leukocytes that is associated with diverse outcomes including hematologic cancer, solid tumors, cardiovascular disease, neurodegenerative disorders and overall mortality. The biological mechanisms by which CH is associated with disease is poorly...
We investigated whether genetic susceptibility to tuberculosis (TB) influences lung adenocarcinoma development among never-smokers using TB genome-wide association study (GWAS) results within the Female Lung Cancer Consortium in Asia. Pathway analysis with the adaptive rank truncated product method was used to assess the association between a TB-re...
Background: Known cutaneous malignant melanoma (CMM) genes account for melanoma risk in less than 40% of melanoma-prone families, suggesting the existence of additional risk genes or other modifiers. Whole exome sequencing (WES) of high-risk families usually results in the identification of a large number of potential disease-causing genes. However...
Clonal hematopoiesis (CH) is an age-related accumulation of detectable somatic mutations in circulating leukocytes that is associated with diverse outcomes including hematologic cancer, solid tumors, cardiovascular disease, neurodegenerative disorders and overall mortality. The biological mechanisms by which CH is associated with disease is poorly...
Background:
Human papillomaviruses (HPV) cause over 500 000 cervical cancers each year, most of which occur in low-resource settings. Human papillomavirus genotyping is important to study natural history and vaccine efficacy. We evaluated TypeSeq, a novel, next-generation, sequencing-based assay that detects 51 HPV genotypes, in 2 large internatio...
Renal cell carcinoma (RCC) has an undisputed genetic component and a stable 2:1 male to female sex ratio in its incidence across populations, suggesting possible sexual dimorphism in its genetic susceptibility. We conducted the first sex-specific genome-wide association analysis of RCC for men (3227 cases, 4916 controls) and women (1992 cases, 3095...
The Transatlantic Slave Trade transported more than 9 million Africans to the Americas between the early 16th and the mid-19th centuries. We performed genome-wide analysis of 6,267 individuals from 22 populations and observed an enrichment in West-African ancestry in northern latitudes of the Americas, whereas South/East African ancestry is more pr...
We have developed a new HPV genotyping assay for detection of 51 HPV genotypes by next generation sequencing (NGS). The TypeSeq assay consists of 3 PCR steps which equalize viral load and each type's amplicon copies prior to genotyping by NGS, thereby maximizing multiple type sensitivity with minimal sequencing reads. The analytical sensitivity of...
Populations in sub-Saharan Africa have historically been exposed to intense selection from chronic infection with falciparum malaria. Interestingly, populations with the highest malaria intensity can be identified by the increased occurrence of endemic Burkitt Lymphoma (eBL), a pediatric cancer that affects populations with intense malaria exposure...
Additional information and methods.
(DOCX)
Kinship and inbreeding in the Uganda (EMBLEM) and Ghana datasets.
(A) Kinship coefficients (Φij) estimates by the probabilities of IBD = 0 estimates for all pairs of individuals. The colored dots are the theoretical relatedness degree probabilities of Φij and IBD = 0. (B) The distribution of individual inbreeding coefficients estimated for all indi...
Inbreeding in the EMBLEM Uganda sample separated by Burkitt lymphoma (Cases), pilot population controls (PPCs), matched population controls (MPCs) and health-center II controls (HCII).
(TIF)
ADMIXTURE barplot representation of the individual ancestry proportions of the Pan-African populations.
(Top) The proportions of individual ancestry values were calculated using ADMIXTURE unsupervised mode with the number of ancestral K = 2 to K = 15. (Bottom) ADMIXTURE cross-validation errors as a function of K.
(TIF)
Neutral coalescent demographic model used in the PBS analysis.
Ne = effective population size, kya = thousand years ago and m = migration rate. We used the migration rates following the current ancestry profile estimated by ADMIXTURE, as 4Nemij, where 4Ne is the population effective size and mij the fraction of population i that is made up of migra...
ATP2B4 PBS value observed against neutral distribution.
PBS neutral values were generated by 10,000 simulations of plausible neutral demographic models (S15 Fig) for UNW and UNC populations respectively.
(TIF)
Gene candidates for natural selection in the Uganda North Central (UNC) population based on the outlier PBS (99.9th percentile and p-value<0.05) and xpEHH (>2) tests.
The SNP with the highest PBS value for each candidate gene is tabulated.
(XLSX)