Fig 3 - uploaded by Teri Manolio
Content may be subject to copyright.
Allele frequency and effect sizes for genetic variants associated with colorectal cancer. Allele frequencies and ORs are taken from published literature where available and are not depicted to scale. Associations identified through GWA or GWA follow-up studies are shown with solid colored bars; all others are shaded from dark (top) to light (bottom). 

Allele frequency and effect sizes for genetic variants associated with colorectal cancer. Allele frequencies and ORs are taken from published literature where available and are not depicted to scale. Associations identified through GWA or GWA follow-up studies are shown with solid colored bars; all others are shaded from dark (top) to light (bottom). 

Source publication
Article
Full-text available
Genome-wide association studies have broadened our understanding of the genetic architecture of cancer to include common variants, in addition to the rare variants previously identified by linkage analysis. We review current knowledge on the genetic architecture of four cancers--breast, lung, prostate and colorectal--for which the balance of common...

Context in source publication

Context 1
... is manifested. Published GWAs in cancer are summarized in supplementary Table 1A, available at Carcinogenesis Online. Over 150 associations at P , 5 Â 10 À 8 have been identified with a variety of cancer phenotypes, however, the functional implications of these variants are often unclear. As seen for other complex diseases, the vast majority of cancer-associated variants identified in GWA studies are either intronic (39%) or intergenic (52%). Effect sizes associated with these common variants are modest (median OR 1.25, 75% with OR , 1.5) (8), consistent with the idea that variants with large effect sizes could be subject to selection pressure (13), though selection pressures are thought to have changed dramatically over the course of human evolution. Associations with OR . 2 were observed for SNPs associated with prostate cancer, testicular cancer and myelo- proliferative neoplasms. Most variants identified to date are cancer type and, in some cases, subtype specific, though important exceptions include TERT , CLPTM1L , TP53 and the 8q24 region. In addition to true differences in the genetic architecture of various cancers, the tumor-specific productivity of the GWA approach may reflect the ability to assemble large sample sizes with enough statistical power to identify variants of small effect; the genetic heterogeneity of some tumor types relative to others; the relative contribution of genetic and environmental factors or the differing selection pressures on cancers showing major differences in incidence or course by age or sex. Few studies have followed-up variants from GWA studies in families, either to study unexplained disease or as genetic modi- fiers of known variants. The overwhelming majority of studies have included cancer risk as the primary GWAS endpoint. A minority of studies have looked at survival, prognosis or treatment response in individuals with cancer, with limitations due to small sample size or lack of replication (14–20). To date, the gaps in genetic architecture reflect the areas where genotyping technology is still being developed and/or applied. Thus, the contribution of rare variants other than those segregating in affected families has yet to be defined but may well be clarified as very high density SNP arrays (2 million or more SNPs), exome and whole genome sequencing become feasible in population-based studies. For reasons that are largely unknown, epidemiologic studies of some cancer types have been more successful at identifying less common, high penetrance variants and others at identifying high frequency susceptibility alleles of smaller effect. Linkage scans of prostate and testicular cancer, for example, have identified only a limited number of highly penetrant variants (21), although GWA studies of prostate cancer have identified the largest number of variants to date (1). Studies of breast and colorectal cancer have identified high-risk mutations in BRCA1 , BRCA2 and APC , MLH1 and MSH2 , respectively, and fewer susceptibility alleles overall. Below, we highlight specific results from the four most common cancers—breast, colorectal, prostate and lung. We discuss findings from both candidate gene studies and GWAS, recognizing that many GWAS findings may still be re- garded as preliminary until functional consequences are identified. Over two dozen loci and variants have been implicated in susceptibility to breast cancer (Figure 2). Fewer than 10% of breast cancer cases are attributed to rare, high-penetrance genes, with a similar estimate attributed to SNPs identified through GWA studies (supplementary Table 1B is available at Carcinogenesis Online) (1,2). BRCA1 and BRCA2 , the two most well-characterized genes, are associated with greater than a 10-fold increase of breast cancer risk relative to the general population (22). The other rare and highly penetrant loci are reported in rare hereditary cancer syndromes such as Li-Fraumeni syndrome and Cowden’s syndrome. An intermediate risk group of breast cancer variants—in ATM , BRIP1 , CHEK2 , PALB2 and RAD51C generally confer a 2- to 3-fold increased risk of breast cancer. Minor allele frequencies for these variants are typically , 0.1%, although there are some notable exceptions (for example, CHEK2 has a frequency of $ 1% in the Ashkenazi Jewish population) (2). GWA-identified variants to date have shown ORs , 1.5 and are frequent (most with risk allele frequencies of . 20%). In contrast to the rare and intermediate effect variants, where functional significance is often recognized (for example, in tumor suppressor or loss-of-function mutations), the GWA loci harbor susceptibility variants whose functions are largely unknown. In fact, some, such as the variants in 8q24.21, are in regions of the genome that are devoid of known genes. As in breast cancer, several key loci in colorectal cancer were discovered in familial or syndromic cases (including APC , MUTYH and mismatch repair genes MLH1 , MSH2 , MSH6 and PMS2 ; Figure 3, supplementary Table 1C is available at Carcinogenesis Online). To- gether, these loci account for $ 20% of the genetic predisposition to colorectal cancer (23). A handful of low to moderate penetrance variants, including APC -I1307K, BLM , HRAS1 , TGF b R1 , HFE , CCND1 and MTHFR , have also been identified in candidate gene association studies (23). GWA studies have identified a number of common variants (those at BMP4 , CDH1 , EIF3H , RHPN2 , SMAD7 , 8q24, 10p14, 11q23.1 and 20p12.3), which are associated with small increases in relative risk and which cumulatively account for $ 6% of the excess familial risk (24). Prostate cancer demonstrates strong familial clustering (25) but high penetrance genes have yet to be identified and replicated. The most likely candidate so far is BRCA2 , which is associated with a 20-fold increased risk relative to the general population. Linkage studies show a number of susceptibility loci with modest LOD scores; however, most were not replicated across studies (26).The GWA study approach has been fruitful in identifying many replicated variants, all with OR , 2 and most with OR , 1.3 (Figure 4; supplementary Table 1D is available at Carcinogenesis Online). In addition to variants in the EHBP1 , IGF1/IGF2 (11p15), ITGA6 , KLK3 , LMTK2 , MSMB , NKX3.1 , NUDT10/NUDT11 , PDLIM5 , SLC22A3 , TCF2 , THADA and TET2 genes, many of these variants are in intergenic regions at 3p12, 3q21, 8q24, 11q13, 17q24, 19q13 and 22q13.2. Furthermore, the cumulative effects of these loci explain at least 20% of the familial risk (27). Interestingly, several of these variants localize to distinct linkage disequilibrium blocks on 8q24 (28). GWA studies have identified more signals for prostate cancer than for any other cancer type. This may reflect the true underlying genetic architecture of prostate cancer (reasonably common variants with modest effect sizes) and/or the ways in which prostate cancer has been particularly amenable to the GWA approach, including reduced influence of environmental factors, limited disease subtype heterogeneity and long survival after diagnosis. Notably, the variants identified thus far do not distinguish between less and more aggressive tumor types (29), suggesting that they may be more involved in the initiation of cancer than its severity or course. Finally, that prostate cancer presents largely in older age suggests that alleles predisposing to prostate cancer may be under less se- lective pressure than those that predispose to cancers at younger ages (27) or that age-related senescence plays a key role in etiology. However, the overall picture is probably more complicated, as genotypes are selected based on the phenotypes they produce and focusing on a variant’s influence on a single cancer ignores possible pleiotropic effects (30). The etiology of lung cancer is recognized to be strongly environmental, although there is increasing evidence that genetic factors may also play a role. Family and linkage studies have identified high- penetrance variants in TP53 , RB1 and 6q23–25 (31–33). More recently, GWA studies have consistently identified three loci at genome-wide significance (Figure 5; supplementary Table 1E is available at Carcinogenesis Online)—variants at 5p15.33 ( TERT-CLPTM1L ), 6p21( BAT3, APOM )/6p22.1 ( TRNAA-UGC ) and 15q25.1 ( CHRNA3 , CHRNA4 and CHRNA5 ), which explain $ 7% of the familial risk of lung cancer (27). Interestingly, 15q25.1 variants are also associated with smoking behavior, a risk factor for lung cancer. Part of the association of 15q25.1 variants with lung cancer can probably be explained by associations with smoking behavior; however, not all studies agree whether they have some degree of independent effects (34–36). Specific variants have also been consistently identified for strong lung cancer risk factors, such as smoking quantity, smoking initiation or cessation (37,38) but have not met strict definitions of genome-wide significance. The emerging picture of the genetic architecture of cancer is complex and varies somewhat by cancer type, as discussed above. Adding to this complexity are several additional dimensions, including allelic heterogeneity, phenotypic heterogeneity and pleiotropy. Allelic heterogeneity, or the association of a single trait with multiple variants within the same gene or locus, has been described for several cancers. For example, within the 8q24 region, five loci are independently associated with prostate cancer susceptibility (28,39).Additionally, these loci may be population specific, reflecting differences in linkage disequilibrium (Figure 6) as well as disease risks conferred (39). Allelic heterogeneity is evident across the spectrum of rare and common variation, as for colorectal cancer and variants in the APC gene (23). In the presence of allelic heterogeneity, the value of any one approach, whether GWA-, family- or candidate gene-based, is limited; results from all of these study ...

Citations

... Both LFS and Cowden's syndrome have a high degree of penetrance. In contrast to the general population, this indicates that those who inherit a mutation in one of the aforementioned genes have a significantly increased lifetime chance of acquiring cancer [36]. ...
... To harness the complementary insights from population-and individual-levels, we assessed the predictive power of GWAS-identified candidate biomarkers, evaluated the predictive power of ML-identified candidate biomarkers, and developed a model based on the combination of SNPs identified using the two types. Complex disorders are influenced by the joint contributions of multiple dysfunctional genetic variants, each of which contributes to the phenotypic expression with an individual effect of varying magnitude 20,55 . While utilizing various biomarkers causes a significant impact on the phenotype, the relevance of features must still be accounted for. ...
Article
Full-text available
The genetic basis of phenotypic emergence provides valuable information for assessing individual risk. While association studies have been pivotal in identifying genetic risk factors within a population, complementing it with insights derived from predictions studies that assess individual-level risk offers a more comprehensive approach to understanding phenotypic expression. In this study, we established personalized risk assessment models using single-nucleotide polymorphism (SNP) data from 200 Korean patients, of which 100 experienced hepatitis B surface antigen (HBsAg) seroclearance and 100 patients demonstrated high levels of HBsAg. The risk assessment models determined the predictive power of the following: (1) genome-wide association study (GWAS)-identified candidate biomarkers considered significant in a reference study and (2) machine learning (ML)-identified candidate biomarkers with the highest feature importance scores obtained by using random forest (RF). While utilizing all features yielded 64% model accuracy, using relevant biomarkers achieved higher model accuracies: 82% for 52 GWAS-identified candidate biomarkers, 71% for three GWAS-identified biomarkers, and 80% for 150 ML-identified candidate biomarkers. Findings highlight that the joint contributions of relevant biomarkers significantly influence phenotypic emergence. On the other hand, combining ML-identified candidate biomarkers into the pool of GWAS-identified candidate biomarkers resulted in the improved predictive accuracy of 90%, demonstrating the capability of ML as an auxiliary analysis to GWAS. Furthermore, some of the ML-identified candidate biomarkers were found to be linked with hepatocellular carcinoma (HCC), reinforcing previous claims that HCC can still occur despite the absence of HBsAg.
... To harness the complementary insights from population-and individual-levels, we assessed the predictive power of GWAS-identified candidate biomarkers, evaluated the predictive power of ML-identified candidate biomarkers, and developed a model based on the combination of SNPs identified using the two types. Complex disorders are influenced by the joint contributions of multiple dysfunctional genetic variants, each of which contributes to the phenotypic expression with an individual effect of varying magnitude 20,48 . While utilizing various biomarkers causes a significant impact on the phenotype, the relevance of features must still be accounted for. ...
Preprint
Full-text available
The genetic basis of phenotypic emergence provides valuable information for assessing individual risk. While associationstudies have been pivotal in identifying genetic risk factors within a population, complementing it with insights derived frompredictions studies that assess individual-level risk offers a more comprehensive approach to understanding phenotypicexpression. In this study, we established personalized risk assessment models using single-nucleotide polymorphism (SNP)data from 200 Korean patients, of which 100 experienced hepatitis B surface antigen (HBsAg) seroclearance and 100 patientsdemonstrated high levels of HBsAg. The risk assessment models determined the predictive power of the following: (1)genome-wide association study (GWAS)-identified candidate biomarkers considered significant in a reference study and(2) machine learning (ML)-identified candidate biomarkers with the highest feature importance scores obtained by usingrandom forest (RF). While utilizing all features yielded 64% model accuracy, using relevant biomarkers achieved higher modelaccuracies: 82% for 52 GWAS-identified candidate biomarkers, 71% for three GWAS-identified biomarkers, and 80% for 150ML-identified candidate biomarkers. Findings highlight that the joint contributions of relevant biomarkers significantly influencephenotypic emergence. On the other hand, combining ML-identified candidate biomarkers into the pool of GWAS-identifiedcandidate biomarkers resulted in the improved predictive accuracy of 90%, demonstrating the capability of ML as an auxiliaryanalysis to GWAS. Furthermore, some of the ML-identified candidate biomarkers were found to be linked with hepatocellularcarcinoma (HCC), reinforcing previous claims that HCC can still occur despite the absence of HBsAg.
... The genetic architecture of complex phenotypes have been studied extensively for over a century, yet a relevant part of the genetics still elude us. That is because, essentially, many factors are involved in the development of such traits, both biological and environmental, which makes it harder to discover causative effects for any complex phenotype or disease [1]. Genome-wide association studies (GWAS) investigate the associations * Correspondence: s0raaldi@uni-bonn.de of low-effect single nucleotide polymorphisms (SNPs) with phenotypes. ...
... • GBS model: biomarker = covariates + GBS • PRS model: biomarker = covariates + PRS • Combined model: biomarker = covariates + GBS + PRS Our tool, GenRisk, uses PyCaret as underlying framework for prediction model generation. PyCaret is a python library that implements different machine learning models and can be used for training and testing, selecting, fine tuning and finalizing models [1] . Different models (n=17) including linear, such as ridge, elastic net and lasso regression, and non-linear models, like gradient boosting and random forest regression, are tested. ...
... For the GBS, only the gene predictors that were selected in the feature selection step for each biomarker were included. The training step was [1] https://pycaret.gitbook.io/docs/ performed on the training set, with the corresponding biomarker as target, using 10 fold cross-validation and the best performing model for each biomarker is then finalized considering the complete training cohort and applied on the independent test set. ...
Preprint
Full-text available
Background: A relevant part of the genetic architecture of complex traits is still unknown; despite the discovery of many disease-associated common variants. Polygenic risk score (PRS) models are based on theevaluation of the additive effects attributable to common variants and have been successfully implemented to assess the genetic susceptibility for many phenotypes. In contrast, burden tests are often used to identify an enrichment of rare deleterious variants in specific genes. Both kinds of genetic contributions are typically analyzed independently. Many studies suggest that complex phenotypes are influenced by both low effectcommon variants and high effect rare deleterious variants. The aim of this paper is to integrate the effect of both common and rare functional variants for a more comprehensive genetic risk modelling. Methods: We applied a framework combining gene-based scores based on the enrichment of rare functionally relevant variant with genome-wide PRS based on common variants for association analysis and prediction models. We applied our framework on UK Biobank dataset with genotyping and exome data and considered 28 blood biomarkers levels as target phenotypes. For each biomarker, an association analysis was done on full cohort using gene-based scores (GBS). The cohort was then split into 3 subsets for PRS constuction and feature selection, predictive model training, and independent evaluation, respectively.prediction models were generated including either PRS, GBS or both (combined). Results: Association analyses of the cohort were able to detect significant genes that were previously known to be associated with different biomarkers. Interestingly, the analyses also revealed heterogeneous effect sizes directionality highlighting the complexity of the blood biomarkers regulation. However, the combined models for many biomarkers show little or no improvement in prediction accuracy compared to PRS model. Conclusion: This study shows that rare variants play an important role in the genetic architecture of complex multifactorial traits such as blood biomarkers. However, despite at individual level rare deleterious variant play a strong role, our results indicate that classical common variant based PRS might be more informative to predictthe genetic susceptibility at population level. Keywords: gene associations; blood biomarkers; genetic prediction; rare variants; PRS; complex phenotypes
... Most of our candidate SNVs were found within intergenic and intronic regions and some within TFBSs. Many variants associated with cancer risk are located in intergenic and intronic regions with unknown functions [97]. This can make interpretation difficult but may suggest that modification of gene regulatory regions may contribute disproportionately to the modulation of risk (Supplement File S1). ...
Article
Full-text available
Small effective population sizes raise the probability of extinction by increasing the frequency of potentially deleterious alleles and reducing fitness. However, the extent to which cancers play a role in the fitness reduction of genetically depauperate wildlife populations is unknown. Santa Catalina island foxes (Urocyon littoralis catalinae) sampled in 2007–2008 have a high prevalence of ceruminous gland tumors, which was not detected in the population prior to a recent bottleneck caused by a canine distemper epidemic. The disease appears to be associated with inflammation from chronic ear mite (Otodectes) infections and secondary elevated levels of Staphyloccus pseudointermedius bacterial infections. However, no other environmental factors to date have been found to be associated with elevated cancer risk in this population. Here, we used whole genome sequencing of the case and control individuals from two islands to identify candidate loci associated with cancer based on genetic divergence, nucleotide diversity, allele frequency spectrum, and runs of homozygosity. We identified several candidate loci based on genomic signatures and putative gene functions, suggesting that cancer susceptibility in this population may be polygenic. Due to the efforts of a recovery program and weak fitness effects of late-onset disease, the population size has increased, which may allow selection to be more effective in removing these presumably slightly deleterious alleles. Long-term monitoring of the disease alleles, as well as overall genetic diversity, will provide crucial information for the long-term persistence of this threatened population.
... As an alternative approach, we and others hypothesize that some Mendelian disorders may represent the severely affected extreme of a spectrum of pathologic variation. For conditions like familial hypercholesterolemia 18 , hereditary breast cancer 19 , and long QT syndrome 20 , this relationship is well documented, and large biobank datasets have recently enabled investigators to examine the interplay between rare pathogenic variation and common polymorphisms 21,22 . In these examples, however, the analyses were possible because the condition of interest mapped to a univariate, quantitative phenotype. ...
Article
Full-text available
Clinical heterogeneity is common in Mendelian disease, but small sample sizes make it difficult to identify specific contributing factors. However, if a disease represents the severely affected extreme of a spectrum of phenotypic variation, then modifier effects may be apparent within a larger subset of the population. Analyses that take advantage of this full spectrum could have substantially increased power. To test this, we developed cryptic phenotype analysis, a model-based approach that infers quantitative traits that capture disease-related phenotypic variability using qualitative symptom data. By applying this approach to 50 Mendelian diseases in two cohorts, we identify traits that reliably quantify disease severity. We then conduct genome-wide association analyses for five of the inferred cryptic phenotypes, uncovering common variation that is predictive of Mendelian disease-related diagnoses and outcomes. Overall, this study highlights the utility of computationally-derived phenotypes and biobank-scale cohorts for investigating the complex genetic architecture of Mendelian diseases. The severity of rare genetic diseases often varies between individuals, but small sample sizes make it difficult to identify contributing factors. Here, the authors use biobank-scale clinical and genetic data to investigate a role for common genetic variation.
... Manolio et al. (2009) discussed this particular variant spectrum profile for the first time as an explanation for the genetic architecture of complex diseases. Similar graphs have been reported previously (Allin et al. 2014;Hindorff et al. 2011). However, for most traits and diseases, this full spectrum of frequencies and effect sizes is only reached when multiple genes are involved. ...
Article
Full-text available
In this study, we investigated the association of ACAN variants with otosclerosis, a frequent cause of hearing loss among young adults. We sequenced the coding, 5′-UTR and 3′-UTR regions of ACAN in 1497 unrelated otosclerosis cases and 1437 matched controls from six different subpopulations. The association between variants in ACAN and the disease risk was tested through single variant and gene-based association tests. After correction for multiple testing, 14 variants were significantly associated with otosclerosis, ten of which represented independent association signals. Eight variants showed a consistent association across all subpopulations. Allelic odds ratios of the variants identified four predisposing and ten protective variants. Gene-based tests showed an association of very rare variants in the 3′-UTR with the phenotype. The associated exonic variants are all located in the CS domain of ACAN and include both protective and predisposing variants with a broad spectrum of effect sizes and population frequencies. This includes variants with strong effect size and low frequency, typical for monogenic diseases, to low effect size variants with high frequency, characteristic for common complex traits. This single-gene allelic spectrum with both protective and predisposing alleles is unique in the field of complex diseases. In conclusion, these findings are a significant advancement to the understanding of the etiology of otosclerosis.
... Majority (80%) of CRC is sporadic, occurring in individuals aged 60 or more, where environmental factors such as diet and exposure to carcinogen would play a greater role than genes (Fig. 1A). Nevertheless, a monozygotic twin study has estimated that genetic factors contribute about 30% to the etiology of CRC (Lichtenstein et al., 2000); each genetic variant contributes a miniscule effect to what essentially is a polygenic trait (Hindorff et al., 2011) (Fig. 1B). The effective size and penetrance of the rare disease-causing variants or mutations in the initiating tumor suppressor gene for familial syndrome however is much greater (Fig. 1B). ...
Article
Colorectal carcinoma is a high incidence cancer and leading cause of cancer mortality worldwide. The advances in genomics and transcriptomics in the past decades have improved the detection and prevention of CRC in familial CRC syndromes. Nevertheless, the ultimate goal of personalized medicine for sporadic CRC is still not within reach due no less to the difficulty in integrating population disparity and clinical data to combat what essentially is a very heterogenous disease. This minireview highlights the achievement of the past decades and present possible direction in the hope of early detection and metastasis prevention for reducing CRC-associated morbidity and mortality.
... These conditions are influenced by the interaction between a genetic predisposition and environmental or lifestyle factors (Smith et al., 2005). As opposed to rare diseases, which are often caused by the dysfunction of a single gene, common diseases are complex traits, i.e., they are influenced by the added contribution of thousands of common genetic variants, each having a small individual effect on the phenotype (Hindorff et al., 2011). This makes studying complex diseases challenging, as their genetic architecture follows a polygenic rather than a Mendelian model (Visscher and Goddard, 2019). ...
Article
Full-text available
Genome-wide association studies (GWAS) have successfully mapped thousands of loci associated with complex traits. These associations could reveal the molecular mechanisms altered in common complex diseases and result in the identification of novel drug targets. However, GWAS have also left a number of outstanding questions. In particular, the majority of disease-associated loci lie in non-coding regions of the genome and, even though they are thought to play a role in gene expression regulation, it is unclear which genes they regulate and in which cell types or physiological contexts this regulation occurs. This has hindered the translation of GWAS findings into clinical interventions. In this review we summarize how these challenges have been addressed over the last decade, with a particular focus on the integration of GWAS results with functional genomics datasets. Firstly, we investigate how the tissues and cell types involved in diseases can be identified using methods that test for enrichment of GWAS variants in genomic annotations. Secondly, we explore how to find the genes regulated by GWAS loci using methods that test for colocalization of GWAS signals with molecular phenotypes such as quantitative trait loci (QTLs). Finally, we highlight potential future research avenues such as integrating GWAS results with single-cell sequencing read-outs, designing functionally informed polygenic risk scores (PRS), and validating disease associated genes using genetic engineering. These tools will be crucial to identify new drug targets for common complex diseases.
... These SNPs are mainly associated with the risk of prostate cancer [20][21][22][23], breast cancer, ovarian cancer, colorectal cancer, and bladder cancer [13,14,[24][25][26]. Besides these SNP variants, amplification of 8q24.21 occurs most frequently in human cancers, including ovarian [27], colorectal [28][29][30][31], breast [32][33][34][35][36], prostate [37][38][39][40][41][42][43], and others. ...
Article
Full-text available
FAM84B is a risk gene in breast and prostate cancers. Its upregulation is associated with poor prognosis of prostate cancer, breast cancer, and esophageal squamous cell carcinoma. FAM84B facilitates cancer cell proliferation and invasion in vitro, and xenograft growth in vivo. The FAM84B and Myc genes border a 1.2 Mb gene desert at 8q24.21. Co-amplification of both occurs in 20 cancer types. Mice deficient of a 430 Kb fragment within the 1.2 Mb gene desert have downregulated FAM84B and Myc expressions concurrent with reduced breast cancer growth. Intriguingly, Myc works in partnership with other oncogenes, including Ras. FAM84B shares similarities with the H-Ras-like suppressor (HRASLS) family over their typical LRAT (lecithin:retinal acyltransferase) domain. This domain contains a catalytic triad, H23, H35, and C113, which constitutes the phospholipase A 1/2 and O-acyltransferase activities of HRASLS1-5. These enzymatic activities underlie their suppression of Ras. FAM84B conserves H23 and H35 but not C113 with both histidine residues residing within a highly conserved motif that FAM84B shares with HRASLS1-5. Deletion of this motif abolishes FAM84B oncogenic activities. These properties suggest a collaboration of FAM84B with Myc, consistent with the role of the gene desert in strengthening Myc functions. Here, we will discuss recent research on FAM84B-derived oncogenic potential.