Home
Vysaul Nyirongo

Vysaul Nyirongo

About

Publications

6,947

Reads

1,336

Citations

Publications

Figure 4. Chip Intensity (sum[X and Y] channels) density plots for...

Figure 5. Heatmaps of core deletion feature intensities. Heatmaps were...

Figure 6. Multiple-Regression Model (MRM) and Classification And...

Figure 7. Haplotype, extended haplotype homozygosity (EHH) and...

Figure 8. Circular haplotype dendrograms for polymorphisms across the...

Haplotype heterogeneity and low linkage disequilibrium reduce reliable prediction of genotypes for the ‑α3.7I form of α-thalassaemia using genome-wide microarray data

Article

Full-text available

Sep 2021

Background: The -α 3.7I -thalassaemia deletion is very common throughout Africa because it protects against malaria. When undertaking studies to investigate human genetic adaptations to malaria or other diseases, it is important to account for any confounding effects of α-thalassaemia to rule out spurious associations. Methods: In this study, we ha...

Haplotype heterogeneity and low linkage disequilibrium reduce reliable prediction of genotypes for the ‑α3.7I form of α-thalassaemia using genome-wide microarray data

Article

Dec 2020

Resistance to malaria through structural variation of red blood cell invasion receptors

Article

May 2017

The malaria parasite Plasmodium falciparum invades human red blood cells via interactions between host and parasite surface proteins. By analyzing genome sequence data from human populations, including 1269 individuals from sub-Saharan Africa, we identify a diverse array of large copy number variants affecting the host invasion receptor genes GYPA...

Resistance to malaria through structural variation of red blood cell invasion receptors

Preprint

Full-text available

Feb 2017

Plasmodium falciparum invades human red blood cells by a series of interactions between host and parasite surface proteins. Here we analyse whole genome sequence data from worldwide human populations, including 765 new genomes from across sub-Saharan Africa, and identify a diverse array of large copy number variants affecting the host invasion rece...

Figure 1—source data 2.

Data

Jan 2017

Single SNP association test results with adjustment for additive effect of G6PD+202.DOI: http://dx.doi.org/10.7554/eLife.15085.007

Figure 2—source data 1.

Data

Jan 2017

G6PDd score association test results.DOI: http://dx.doi.org/10.7554/eLife.15085.015

Supplementary file 1.

Data

Jan 2017

(A) Summary of study designs of contributing partner studies to MalariaGEN Consortial Project 1 (CP1). (B) Genotyped sample distribution. (C) Summary of 65 SNPs selected for analysis and successfully genotyped. (D) G6PD+202 female association test results. (E) G6PD+202 male association test results. (F) G6PD+202 all individuals association test res...

Supplementary file 2.

Data

Jan 2017

(A) SNP selection across G6PD region for genotyping. (B) SpectroDESIGNER assay design file for 135 G6PD locus SNPs in four multiplexes. (C) SpectroDESIGNER assay design file for 107 G6PD locus SNPs in four multiplexes. (D) SpectroDESIGNER assay design file for 68 G6PD locus SNPs in three multiplexes. DOI: http://dx.doi.org/10.7554/eLife.15085.020

Figure 1—source data 1.

Data

Jan 2017

Single SNP association test results.DOI: http://dx.doi.org/10.7554/eLife.15085.006

Characterisation of the opposing effects of G6PD deficiency on cerebral malaria and severe malarial anaemia

Article

Full-text available

Jan 2017

Glucose-6-phosphate dehydrogenase (G6PD) deficiency is believed to confer protection against Plasmodium falciparum malaria, but the precise nature of the protective effecthas proved difficult to define as G6PD deficiency has multiple allelic variants with different effects in males and females, and it has heterogeneous effects on the clinical outco...

Resistance to malaria through structural variation of red blood cell invasion receptors.

Article

Jan 2017

Admixture into and within sub-Saharan Africa

Article

Full-text available

Jun 2016

ELife digest Our genomes contain a record of historical events. This is because when groups of people are separated for generations, the DNA sequence in the two groups’ genomes will change in different ways. Looking at the differences in the genomes of people from the same population can help researchers to understand and reconstruct the historical...

Admixture into and within sub-Saharan Africa

Article

Jun 2016

Similarity between two individuals in the combination of genetic markers along their chromosomes indicates shared ancestry and can be used to identify historical connections between different population groups due to admixture. We use a genome-wide, haplotype-based, analysis to characterise the structure of genetic diversity and gene-flow in a coll...

Figure 1-figure supplement 1. Map of populations used in the analysis....

Figure 1-figure supplement 3. fineSTRUCTURE analysis of the full...

Figure 3-figure supplement 2. Comparison of weighted LD amplitude...

Figure 3-figure supplement 4. Comparison of the minimum distance to...

Figure 6-figure supplement 1. Gene-flow in Africa over the last 2,000...

Admixture Into and Within Sub-Saharan Africa

Preprint

Full-text available

Feb 2016

Understanding patterns of genetic diversity is a crucial component of medical research in Africa. Here we use haplotype-based population genetics inference to describe gene-flow and admixture in a collection of 48 African groups with a focus on the major populations of the sub-Sahara. Our analysis presents a framework for interpreting haplotype div...

A novel locus of resistance to severe malaria in a region of ancient balancing selection

Article

Full-text available

Sep 2015

The high prevalence of sickle haemoglobin in Africa shows that malaria has been a major force for human evolutionary selection, but surprisingly few other polymorphisms have been proven to confer resistance to malaria in large epidemiological studies. To address this problem, we conducted a multi-centre genome-wide association study (GWAS) of life-...

Reappraisal of known malaria resistance loci in a large multicenter study

Article

Full-text available

Nov 2014

Many human genetic associations with resistance to malaria have been reported, but few have been reliably replicated. We collected data on 11,890 cases of severe malaria due to Plasmodium falciparum and 17,441 controls from 12 locations in Africa, Asia and Oceania. We tested 55 SNPs in 27 loci previously reported to associate with severe malaria. T...

Causes of death among persons of all ages within the Kilifi Health and Demographic Surveillance System, Kenya, determined from verbal autopsies interpreted using the InterVA-4 model

Article

Full-text available

Oct 2014

Background The vast majority of deaths in the Kilifi study area are not recorded through official systems of vital registration. As a result, few data are available regarding causes of death in this population. Objective To describe the causes of death (CODs) among residents of all ages within the Kilifi Health and Demographic Surveillance System...

Verbal autopsy as a tool for identifying children dying of sickle cell disease: A validation study conducted in Kilifi district, Kenya

Article

Full-text available

Apr 2014

Sickle cell disease (SCD) is common in many parts of sub-Saharan Africa (SSA), where it is associated with high early mortality. In the absence of newborn screening, most deaths among children with SCD go unrecognized and unrecorded. As a result, SCD does not receive the attention it deserves as a leading cause of death among children in SSA. In th...

Imputation-Based Meta-Analysis of Severe Malaria in Three African Populations

Article

Full-text available

Jun 2013

Combining data from genome-wide association studies (GWAS) conducted at different locations, using genotype imputation and fixed-effects meta-analysis, has been a powerful approach for dissecting complex disease genetics in populations of European ancestry. Here we investigate the feasibility of applying the same approach in Africa, where genetic d...

Figure S1

Data

May 2013

Example of cluster plot from Malawi cohort with outlying sets of individuals. (TIF)

Figure S2

Data

May 2013

Distribution of relatedness between most-related pairs. (TIF)

Figure S3

Data

May 2013

Comparison of logistic regression (SNPTEST) and mixed model (MMM) P values. (TIF)

Figure S4

Data

May 2013

SNPs showing highly divergent P values between logistic regression and mixed model scans. (TIF)

Figure S7

Data

May 2013

Comparison of meta-analysis P values versus Bayes factors under the fixed-effect model. (TIF)

Figure S8

Data

May 2013

Quantile-quantile plots of the region-based test in the three cohort and in the meta-analysis. The genomic control inflation factor is given in the title of the plots. (TIF)

Figure S9

Data

May 2013

Manhattan plot showing –log10 P values (thresholded at 10) for additive, dominant, heterozygote, recessive, and general models, and additive model conditional on the genotype at the sickle locus rs334, across all imputed SNPs. Meta-analysis P values for all three cohorts and for the East African cohorts are also shown for additive, dominant, recess...

Figure S13

Data

May 2013

The distribution of ethnic groups in Kenyan samples that were imputed with higher or lower quality (as defined by the red line in Figure S12). The difference in the two distributions is highly significant (Fisher's exact test, P = 4×10−4), suggesting that ethnic differences contribute to the bimodal distribution of imputation quality seen in Figure...

Figure S5

Data

May 2013

–log10(P values) for test of association using the mixed model. (TIF)

Figure S12

Data

May 2013

The distribution of imputation quality (measured by type2 r2) across imputed Kenyan samples. The red line is at r2 = 0.909, and is the minimum between the two peaks. (TIF)

Table S2

Data

May 2013

Pre-imputation individual QC. (DOCX)

Figure S6

Data

May 2013

Top: signal of association in the HBB region after conditioning on the genotype at the known causal locus rs334. Bottom: signal of association in the ABO region after conditioning on the genotype at rs8176719. (TIF)

Figure S11

Data

May 2013

Example output from the imputation quality control pipeline for the Kenya imputation. a) per-SNP certainty (mean maximum posterior genotype call); b) per-SNP accuracy (type2 r2); c) per-individual type2 r2, averaged across segments; d) per-segment heterozygous call accuracy (proportion of true heterozygous calls that are correctly imputed with high...

Figure S15

Data

May 2013

Population-specific PCA analysis of Kenyan samples. (TIF)

Figure S19

Data

May 2013

a) Empirical distribution, across approximately 20,000 gene regions, of the maximum likelihood estimate of the eta parameter (see Text S2), for the region-based test. Overlaid (red line) is the assumed prior distribution under the alternative used to calculate Bayes factors in the region-based analysis. b) Scatter plot of the log10 combined Bayes F...

Table S5

Data

May 2013

Post-imputation sample exclusions. (DOCX)

Table S6

Data

May 2013

Genomic Inflation factors (λ) for logistic regression and mixed-model scans. (DOCX)

Table S8

Data

May 2013

Enrichment of low region based test P values in three previously defined sets of regions. Each P value in the table results from a one-sided binomial test for an enrichment in the number of regions with empirical P value below the given threshold. The bottom row gives a summary of the distribution of the number of SNPs in each region. Note that the...

Figure S10

Data

May 2013

Manhattan plot showing –log10 P values (thresholded at 10) for additive, dominant, heterozygote, recessive, and general models, and additive model conditional on the genotype at the sickle locus rs334, across all non-excluded genotyped SNPs. Meta-analysis P values for all three cohorts and for the East African cohorts are also shown for additive, d...

Figure S14

Data

May 2013

Population-specific PCA analysis of Gambian samples. (TIF)

Figure S16

Data

May 2013

Population-specific PCA analysis of Malawian samples. (TIF)

Figure S17

Data

May 2013

Comparison of fixed, structured, correlated and independent-effect models at the ABO and HBB loci. The height of each bar represents the posterior probability that the corresponding model is true, under the assumption that one of the models is true. (TIF)

Table S1

Data

May 2013

Details on the 3 study sites and genotyping platforms. (DOCX)

Table S4

Data

May 2013

P values for correlation between the first 5 PCs and case/control status. (DOCX)

Text S2

Data

May 2013

Supplementary statistical details. (PDF)

Figure S18

Data

May 2013

ROC curve showing empirical true positive rate (y-axis) against false positive rate (x-axis) for each method used to detect regional association (regional test with Fisher meta-analysis, regional test with Bayesian meta-analysis, best single-SNP frequentist meta-analysis in region, best single-SNP Bayes factor for each of the four choices of correl...

Table S3

Data

May 2013

Pre-imputation SNP QC. (DOCX)

Table S7

Data

May 2013

Regions showing most association in single-SNP and regional association test analyses. (XLSX)

Text S1

Data

May 2013

Details of quality control. (DOCX)

Socio-demographic Implications of HIV/AIDS in Malawi

Article

Full-text available

Aug 2012

Vysaul Nyirongo

Malawi is one of the countries in the sub-Saharan Africa with high prevalence of HIV/AIDS. This paper ana- lyzes socio-demographic effects using estimates and projections by the United Nations Population Division. It compares estimates and projections for both short term (2005-2020) and also long term (1980-2050), with the reality of HIV/AIDS and w...

Bayesian Hierarchical Alignment Methods

Chapter

Jan 2012

This chapter considers the problem of matching configurations of biological macromolecules when both alignment and superposition transformations are unknown. Alignment denotes correspondence – a bijection or mapping – between points in different structures according to some objectives or constraints. Superposition denotes rigid-body transformations...

Hierarchical Bayesian Modeling of Pharmacophores in Bioinformatics

Article

Jun 2011

One of the key ingredients in drug discovery is the derivation of conceptual templates called pharmacophores. A pharmacophore model characterizes the physicochemical properties common to all active molecules, called ligands, bound to a particular protein receptor, together with their relative spatial arrangement. Motivated by this important applica...

A global network for investigating the genomic epidemiology of malaria The Malaria Genomic Epidemiology Network Nature 2008 456 7223 732 737 10.1038/nature07632

Article

Full-text available

Dec 2008

Large-scale studies of genomic variation could assist efforts to eliminate malaria. But there are scientific, ethical and practical challenges to carrying out such studies in developing countries, where the burden of disease is greatest. The Malaria Genomic Epidemiology Network (MalariaGEN) is now working to overcome these obstacles, using a consor...

Simulating Virtual Protein C α Traces with Applications

Article

Dec 2008

We propose a simple procedure for generating virtual protein C(alpha) traces. One of the key ingredients of our method, to build a three-dimensional structure from a random sequence of amino acids, is to work directly on torsional angles of the chain which we sample from a von Mises distribution. With simple modeling of the hydrophobic effect in pr...

Statistical Pitfalls in Medical Research

Article

Full-text available

Apr 2008

In conducting and reporting of medical research, there are some common pitfalls in using statistical methodology which may result in invalid inferences being made. This paper is aimed to highlight to inexperienced statisticians or non-statistician some of the common statistical pitfalls encountered when using statistics to interpret data in medical...

Simulating Virtual Protein C

Article

Jan 2008

A global network for investigating the genomic epidemiology of malaria.

Article

Full-text available

Jan 2008

Markov Chain Monte Carlo Implementation of Rock Fracture Modelling

Article

Full-text available

Aug 2007

This paper deals with the problem of estimating fracture planes, given only the data at borehole intersections with fractures. We formulate an appropriate model for the problem and give a solution to fitting the planes using a Markov chain Monte Carlo (MCMC) implementation. The basics of MCMC are presented, with particular emphasis given to reversi...

Table 1 : Results for alcohol dehydrogenase (1hdx 1) matching against...

Table 3 : Results for alcohol dehydrogenase (1hdx 1) matching against...

Table 4 : Results for alcohol dehydrogenase (1hdx 1) matching against...

Additional file 1

Data

Full-text available

Jul 2007

Case 1 Results. Results for alcohol dehydrogenase (1hdx_1) matching against its own SCOP family. Tables 1–2: Without amino acid property. Tables 3–4: With amino acid property

Table 3 : Results for alcohol dehydrogenase matching against...

Table 4 : Results for alcohol dehydrogenase matching against...

Table 5 : Results for alcohol dehydrogenase matching against...

Table 6 : Results for alcohol dehydrogenase matching against...

Table 7 : Results for alcohol dehydrogenase matching against...

Additional file 4

Data

Full-text available

Jul 2007

Case 4 Results. Results for alcohol dehydrogenase and FAD/NAD(P)-binding domain. Tables 1–5: Without physico-chemistry. Tables 5–10: With physico-chemistry.

Table 2 : Results for 17 − β hydroxysteroid dehydrogenase matching...

Table 3 : Results for 17 − β hydroxysteroid dehydrogenase matching...

Table 4 : Results for 17 − β hydroxysteroid dehydrogenase matching...

Table 5 : Results for 17 − β hydroxysteroid dehydrogenase matching...

Table 6 : Results for alcohol dehydrogenase (1hdx 1) matching against...

Additional file 2

Data

Full-text available

Jul 2007

Case 2 Results. Results for 17 – β hydroxysteroid dehydrogenase and family. Tables 1–5: Without amino acid property. Tables 6–10: With amino acid property.

Table 2 : Results for alcohol dehydrogenase matching against its own...

Table 3 : Results for alcohol dehydrogenase matching against its own...

Table 4 : Results for alcohol dehydrogenase matching against its own...

Table 5 : Results for alcohol dehydrogenase matching against its own...

Table 6 : Results for alcohol dehydrogenase matching against its own...

Additional file 3

Data

Full-text available

Jul 2007

Case 3 Results. Results for alcohol dehydrogenase (1hdx_1) and superfamily. Tables 1–14: Without physico-chemistry. Tables 14–28: With physico-chemistry.

Figure 1: Alcohol dehydrogenase NAD-binding site (1hdx_1) matching...

Figure 2: Effect of MCMC refinement on graph matches of 1hdx_1 (Alcohol...

Figure 3: Corresponding amino acids between the NAD-binding site of...

Figure 4: Corresponding amino acids between the NAD-binding site of...

Figure 5: Effect of MCMC refinement on graph matches of 1a27_0 (17 – β...

Bayesian refinement of protein functional site matching

Article

Full-text available

Feb 2007

Matching functional sites is a key problem for the understanding of protein function and evolution. The commonly used graph theoretic approach, and other related approaches, require adjustment of a matching distance threshold a priori according to the noise in atomic positions. This is difficult to pre-determine when matching sites related by varyi...

A line finding assignment problem and rock fracture modelling

Chapter

Full-text available

Jan 2007

The paper deals with a stochastic stereologic problem of estimating fracture lines, given only the data at boreholes. We formulate an appropriate model. The problem is challenging since neither the lines (slope, intercept) are known, nor their number. We give an MCMC implementation where all the parameters are allowed to vary. We examine sensitivit...

EM algorithm, Bayesian and distance approaches to matching functional sites

Article

Jan 2005

The explosion in volume of protein structural information prior to any knowledge of protein biochemical function has made the characterisation of protein functional sites to be an area of huge interest. Structural similarity of functional sites from proteins with unknown function to those with known functions can be used to infer on the function of...

Procrustes statistics for unlabelled points and applications

Article

Jan 2004

Protein Matching Using Amino acids Information

Article

Hierarchical Bayesian modelling of pharmacophores

Article

Statistical modelling of globular proteins

Article

Protein structure simulations are important for understan ding and exploring properties of proteins and evaluating algorithms in bioinformatics. For example, computer-generated protein structures designed to mimic real a protein, decoys can be us ed to test the validity of a protein model. The model is considered correct only if is able to identify...

Statistical approaches to protein matching in Bioinformatics /

Article

Vysaul Nyirongo

Thesis (Ph.D.) -- University of Leeds (Department of Statistics), 2006.

Bayesian modelling for matching and alignment of biomolecules

Article

Network

Charles Newton
University of Oxford
Muntaser Ibrahim
University of Khartoum
Julie Makani
Muhimbili University of Health and Allied Sciences
Michael Parker
University of Oxford
Margaret Pinder
Durham University

Sophie Uyoga
KEMRI-Wellcome Trust Research Programme
Angela Allen
Subulade A Ademola (nee Olaniyan)
University of Ibadan
Moses Laman
Papua New Guinea Institute of Medical Research
Kathryn Maitland
KEMRI-Wellcome Trust Research Programme

Top co-authors

Malcolm Molyneux
Liverpool School of Tropical Medicine
Ivo Mueller
The Walter and Eliza Hall Institute of Medical Research
Edith Bougouma
Centre National de Recherche et de Formation sur le Paludisme (CNRFP)
Kalifa Bojang
Medical Research Council Unit, The Gambia Unit
Anita Ghansah
University of Ghana

All co-authors (50)

View All