ArticlePDF Available

Transcriptomic Maps of Colorectal Liver Metastasis: Machine Learning of Gene Activation Patterns and Epigenetic Trajectories in Support of Precision Medicine

MDPI
Cancers
Authors:

Abstract and Figures

Simple Summary Liver metastasis is a significant factor contributing to mortality associated with colorectal cancer. Establishing the biological mechanisms of metastasis is crucial for refining diagnostics and identifying therapeutic windows for interventions. Currently, little is known of the processes that govern the development of liver metastases, the role of the tumor microenvironment, the role of epigenetics, and potential treatment-induced shaping effects. Machine learning-based bioinformatics has provided an important methodical option to decipher fine-granular details of the transcriptomic landscape of tumor heterogeneity and the underlying molecular mechanisms. Our molecular portrayal method has potential implications for treatment decisions, which may require personalized diagnostics. Abstract The molecular mechanisms of the liver metastasis of colorectal cancer (CRLM) remain poorly understood. Here, we applied machine learning and bioinformatics trajectory inference to analyze a gene expression dataset of CRLM. We studied the co-regulation patterns at the gene level, the potential paths of tumor development, their functional context, and their prognostic relevance. Our analysis confirmed the subtyping of five liver metastasis subtypes (LMS). We provide gene-marker signatures for each LMS, and a comprehensive functional characterization that considers both the hallmarks of cancer and the tumor microenvironment. The ordering of CRLMs along a pseudotime-tree revealed a continuous shift in expression programs, suggesting a developmental relationship between the subtypes. Notably, trajectory inference and personalized analysis discovered a range of epigenetic states that shape and guide metastasis progression. By constructing prognostic maps that divided the expression landscape into regions associated with favorable and unfavorable prognoses, we derived a prognostic expression score. This was associated with critical processes such as epithelial–mesenchymal transition, treatment resistance, and immune evasion. These factors were associated with responses to neoadjuvant treatment and the formation of an immuno-suppressive, mesenchymal state. Our machine learning-based molecular profiling provides an in-depth characterization of CRLM heterogeneity with possible implications for treatment and personalized diagnostics.
Functional characteristics of CRLM using gene set Z scores (GSZ) profiling in CRLM of various signatures: (a) Expression of gene modules (GM) extracted from an independent study [4] confirm LMS and spot clusters in this study. (b) Consensus molecular subtypes of CRC (CMS1-4) [24] resemble CRLM expression. (c) There is partial concordance between the gene interaction perturbation network subtypes (GINS1-6) signatures [28] and the LMS expression patterns. (d) Immune microenvironment types (IM1-5) relate the LMS to specific activation patterns of immune checkpoint inhibitors and HLA-DRB [3]. (e) Tumor microenvironment (TME) PanCancer signatures taken from [29] assign the LMS to different TME-subtypes. (f) Immunogenicity genes in solid cancers were mostly upregulated in LMS5 [30]. (g) Cell specific expression markers extracted from a single-cell RNA analysis of CRLM [22] activate in two clusters in LMS1 and LMS5, respectively. (i) Hallmark signatures of cancer [36] confirm the functional assignment of LMS and spots. (j) Genes on chromosomes show dose response effects of copy number gains (red frame) and losses (blue) typically observed in CRC and CRLM [41]. (k) Epigenetic signatures identify two distinct patterns across all LMS, one related to open, actively transcribed genes, the other to repressed and poised ones [42]. (h) Cell-deconvolution of CRLM transcriptomics data [43] reveals LMS-specific fractions of selected cell types. Gene sets implemented in this study are given in Supplementary Table S3.
… 
Tree trajectory analysis of CRLM heterogeneity using monocle [44]. (a) The monocle-tree divides into two major segments (Seg1–3) where Seg1 further splits into six subsegments (1.1–6). Mean expression portraits averaged over the CRLM portraits along the segments reveal changing expression patterns. (b) Samples along the tree are colored by spot expression A–F. Ellipses indicate areas of high expression of the respective spot. (c) Different LMS accumulate in different segments as marked by the ellipses and by the overlap coefficients (OC = overlap (LMS, segment)/(min_size (LMS, segment) between LMS and segments. (d) Profiles along different paths reveal smooth changes in spot expression. The log expression profiles of spots and hazard ratios (HR) use sorted CRLM along the abscissa. Pseudotime (PT)-scaled plots “virtually” compress the data along the subsegments 1.1 and 1.3 (Supplementary Figure S8a). Series of individual CRLM portraits from LMS1 and LMS5 indicate their pseudo-dynamics towards spot E (partly below and see also arrows in the portraits; for portraits of other subtypes see Supplementary Figure S8b). The blue and red dashed lines along the HR scatterplot visualize LMS1 HR-high and -low subgroups. (e) Sankey river-flow diagram between subtype and segment stratification of CRLMs. OC values greater than 0.5 show that LMS3 and, partly, LMS4 flow to Seg2, and LMS5 to Seg3 and Seg1.3. Their difference portraits resemble the epithelial patterns of Seg1.1 to overlay with Seg3 patterns. For interpretation of functional characteristics see Supplementary Figure S5 and next subsection. (f) Comparison of plasma and endothelial cell signatures (Figure 3e) in terms of biplots and their difference profile indicate that both signatures correlate but combine with different amplitudes in the CRLM (red and blue arrows and frames).
… 
This content is subject to copyright.
Citation: Ashekyan, O.; Shahbazyan,
N.; Bareghamyan, Y.; Kudryavzeva,
A.; Mandel, D.; Schmidt, M.;
Loeffler-Wirth, H.; Uduman, M.;
Chand, D.; Underwood, D.; et al.
Transcriptomic Maps of Colorectal
Liver Metastasis: Machine Learning
of Gene Activation Patterns and
Epigenetic Trajectories in Support of
Precision Medicine. Cancers 2023,15,
3835. https://doi.org/10.3390/
cancers15153835
Academic Editor: David Wong
Received: 6 July 2023
Revised: 24 July 2023
Accepted: 26 July 2023
Published: 28 July 2023
Copyright: © 2023 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
cancers
Article
Transcriptomic Maps of Colorectal Liver Metastasis: Machine
Learning of Gene Activation Patterns and Epigenetic
Trajectories in Support of Precision Medicine
Ohanes Ashekyan 1, , Nerses Shahbazyan 1, , Yeva Bareghamyan 1, Anna Kudryavzeva 1, Daria Mandel 1,
Maria Schmidt 2, Henry Loeffler-Wirth 2, Mohamed Uduman 3, Dhan Chand 3, Dennis Underwood 3,
Garo Armen 3, Arsen Arakelyan 4, Lilit Nersisyan 1and Hans Binder 1, 2,*
1Armenian Bioinformatics Institute, 3/6 Nelson Stepanyan Str., Yerevan 0062, Armenia;
ohanes.ashekyan@abi.am (O.A.); nerses.shahbazyan@abi.am (N.S.); yeva.bareghamyan@abi.am (Y.B.);
anyakudryavceva804@gmail.com (A.K.); daria.laricheva@abi.am (D.M.); lilit.nersisyan@abi.am (L.N.)
2IZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Härtelstr. 16–18,
04107 Leipzig, Germany; schmidt@izbi.uni-leipzig.de (M.S.); wirth@izbi.uni-leipzig.de (H.L.-W.)
3Agenus Inc., 3 Forbes Road, Lexington, MA 7305, USA; mohamed.uduman@agenusbio.com (M.U.);
dhan.chand@agenusbio.com (D.C.); dennis.underwood@agenusbio.com (D.U.); armen@agenusbio.com (G.A.)
4Institute of Molecular Biology of the National Academy of Sciences of the Republic of Armenia,
7 Has-Ratyan Str., Yerevan 0014, Armenia; arsen.arakelyan@abi.am
*Correspondence: hans.binder@abi.am
These authors contributed equally to this work.
Simple Summary:
Liver metastasis is a significant factor contributing to mortality associated with
colorectal cancer. Establishing the biological mechanisms of metastasis is crucial for refining di-
agnostics and identifying therapeutic windows for interventions. Currently, little is known of the
processes that govern the development of liver metastases, the role of the tumor microenvironment,
the role of epigenetics, and potential treatment-induced shaping effects. Machine learning-based
bioinformatics has provided an important methodical option to decipher fine-granular details of the
transcriptomic landscape of tumor heterogeneity and the underlying molecular mechanisms. Our
molecular portrayal method has potential implications for treatment decisions, which may require
personalized diagnostics.
Abstract:
The molecular mechanisms of the liver metastasis of colorectal cancer (CRLM) remain
poorly understood. Here, we applied machine learning and bioinformatics trajectory inference to
analyze a gene expression dataset of CRLM. We studied the co-regulation patterns at the gene level,
the potential paths of tumor development, their functional context, and their prognostic relevance.
Our analysis confirmed the subtyping of five liver metastasis subtypes (LMS). We provide gene-
marker signatures for each LMS, and a comprehensive functional characterization that considers
both the hallmarks of cancer and the tumor microenvironment. The ordering of CRLMs along a
pseudotime-tree revealed a continuous shift in expression programs, suggesting a developmental
relationship between the subtypes. Notably, trajectory inference and personalized analysis discov-
ered a range of epigenetic states that shape and guide metastasis progression. By constructing
prognostic maps that divided the expression landscape into regions associated with favorable and
unfavorable prognoses, we derived a prognostic expression score. This was associated with critical
processes such as epithelial–mesenchymal transition, treatment resistance, and immune evasion.
These factors were associated with responses to neoadjuvant treatment and the formation of an
immuno-suppressive, mesenchymal state. Our machine learning-based molecular profiling provides
an in-depth characterization of CRLM heterogeneity with possible implications for treatment and
personalized diagnostics.
Cancers 2023,15, 3835. https://doi.org/10.3390/cancers15153835 https://www.mdpi.com/journal/cancers
Cancers 2023,15, 3835 2 of 31
Keywords:
genomics analysis of liver metastases; gene expression; molecular mechanisms; tumor
heterogeneity; prognostic score; treatment resistance; self-organizing map machine learning
1. Introduction
Liver metastasis, linked with poor prognosis, is a common occurrence in various
cancers, including colorectal cancer (CRC), pancreatic cancer, breast cancer, melanoma, and
lung cancer. In CRC, the liver is the primary site of metastasis owing to the anatomical and
vascular connections between the colorectal regions and the liver [
1
]. Such metastases pose
a significant challenge for clinical intervention and represent a major cause of CRC-related
mortality. However, the molecular mechanisms that govern the molecular heterogeneity
and tumor development in liver metastasis remain poorly understood.
The tumor microenvironment (TME) significantly impacts the pathophysiology of
cancer cells metastasizing to the liver. It includes liver sinusoidal endothelial cells, Kupffer
cells, hepatic stellate cells, and parenchymal hepatocytes, as well as infiltrating stromal
and immune cells [
2
]. Interactions with the TME facilitate cancer cells to overcome the
tumor stroma, settle, and to colonize. Interestingly, colorectal cancer liver metastases
(CRLM) show high genetic concordance in key lesions, mutations, and copy number
variations (CNV) with primary CRC suggesting the liver microenvironment has a limited
influence on the mutation pattern of CRLM cells, which remain largely genetically primed
by their primary CRC origin [
3
]. This correlation of molecular characteristics between
primary and metastatic tumors is further supported by the functional and transcriptional
studies on CRLM [
3
7
]. However, genetic lesions and adaptive interactions with the TME,
while necessary, are not sufficient for cancer initiation and progression. As a third factor,
epigenetic regulation, including chromatin remodeling associated and driven by a large
set of histone- and DNA-modifying mechanisms, are essential for cancer clones to acquire
the plasticity necessary for adaptive cell fate changes towards evolutionary fitness in an
epigenetic landscape [
8
,
9
]. Currently, little is known of the evolutionary processes that
govern CRLM, the role of epigenetics, and potential treatment-induced shaping effects.
Establishing the biological mechanisms of metastasis is crucial for refining diagnostics and
identifying therapeutic windows for interventions.
Molecular subtyping has emerged as an important concept to decipher cancer het-
erogeneity in both primary and metastatic tumors [
10
]. While correlations between the
molecular subtypes of primary CRC, metastatic propensity, and responses to therapy have
been noted [
6
], it is not clear to what degree these CRC characteristics are maintained
after distant spread. Recently, Moosavi et al. developed a metastasis-oriented subtyping
framework through the transcriptomic analysis of patients with CRLMs [
5
]. Their de novo
liver metastases subtypes (LMS) recapitulated epithelial-like and mesenchymal-like tumors,
with the latter showing a strong immune and stromal component.
In this study, we leveraged machine learning-based bioinformatics to analyze the
transcriptomes of 283 CRLM [
5
], to characterize the molecular landscape with single-tumor
resolution, and to deduce cues for CRLM development under neoadjuvant treatment. We
further evaluated and extended the LMS framework, and identified subtype-specific gene
signatures, explored their functional relevance to cancer progression, and suggest possi-
ble developmental paths under epigenetic control using pseudotime inference through
in-depth transcriptomic analysis. Our approach utilizes a high-resolution molecular cartog-
raphy and portrayal method previously applied to various cancer types [
11
14
], treatment
resistance [
15
], and modes of epigenetic regulation [
16
,
17
]. In parallel, we have estab-
lished, and made available, an interactive web platform for more detailed insights into our
analyses. Through the personalized portrayal of gene expression data and the creation
of prognostic maps for CRLM, we open new avenues for personalized diagnostics and
treatment decision-making.
Cancers 2023,15, 3835 3 of 31
2. Materials and Methods
2.1. Gene Expression Data of Colon Liver Metastases (CRLM)
Microarray-based gene expression data (GeneChip Human Transcriptome Array 2.0;
HTA2.0) of 283 hepatic resections of liver metastases (CRLM) of primary colorectal (CRC)
tumors were taken from GEO-database under accession numbers GSE159216 (see also [
5
])
covering the expression of, in total, more than 67,000 probe sets. We used the classification
of CRLM-samples into five liver metastasis subtypes (LMS1-LMS5) and an “unclassified”
group, all taken from the original paper [
5
]. The latter class showed an expression signature
of healthy liver tissue and therefore it was assigned as liver-like (LIV, see Section 3). We
used preprocessed gene expression data, as described in [5].
2.2. SOM Portrayal and Downstream Functional Analysis
The gene expression data were transformed to log10-scale, then quantile normalized
and centralized (with respect to the mean log-expression of the gene averaged over all
CRLM studied). Throughout the paper, we use the terms “over-“ and “under-“expression
for positive and negative values (larger/smaller than the mean value, respectively). Ex-
pression profiles denote the vector of expression data of each transcript across all CRLM-
samples. Expression profiles were clustered using self-organizing map (SOM) machine
learning. This method transforms the ~67,000 transcript-centric profiles into 2500 meta-
gene profiles of reduced dimensionality and visualizes their expression in each CRLM
sample as two-dimensional quadratic images with 50
×
50 metagene resolution [
18
]. Meta-
genes were colored according to their expression for each CRLM from dark blue (low
expression) to maroon (high expression), thereby providing a portrait of their gene ex-
pression state. An alternative “coastline” color scale was applied to better resolve subtle
expression changes for over (red) and under (blue) expression regions. The SOM clustered
similar profiles of co-expressed genes together, appearing as spot-like regions in the por-
traits [
18
]. Subtype-specific mean portraits were generated by averaging the metagene
landscapes of all CRLM belonging to the same class. Difference portraits between subtypes
were calculated as the differences between the respective mean portraits. Details of SOM
training and parametrization have been previously described [
18
,
19
]. Bioinformatics down-
stream analyses, including class discovery and sample diversity analysis, feature selection
from metagenes and spots, and biological function-mining using gene sets analysis, were
performed as described in [
18
,
20
]. All downstream methods were implemented in the
R-package ‘oposSOM’ [
21
] and applied for analysis. Gene sets implemented specifically
for this study are provided in Supplementary Table S2. Single-cell RNAseq data from
Che et al. [
22
] (GSE178318) on CRLM were re-analyzed to extract cell-specific marker genes.
2.3. Prognostic Maps
For prognostic maps, the hazard ratio (HR) relative to the mean overall survival (OS)
of all patients was calculated for each metagene-pixel on the map. This was done by
selecting patients with a centralized metagene expression greater than zero or one standard
deviation and coloring the pixel in an HR-scale from red (indicating inferior prognosis) to
blue, as described in [11].
2.4. oposSOM Browser of Livermet-Transcriptomes
The results of transcriptome analyses of CRLM presented in this publication can be
interactively explored for further details using the oposSOM browser [
23
], available via the
IZBI-portal (www.izbi.de, accessed on 27 July 2023; https://apps.health-atlas.de/opossom-
browser/?dataset=15, accessed on 27 July 2023). For more information refer to the data
availability statement below and Supplementary Figure S1.
Cancers 2023,15, 3835 4 of 31
3. Results
3.1. SOM Portrayal Deciphers Liver Metastases Subtypes (LMS)
The self-organizing maps (SOM) algorithm was applied to transcriptomic data, gener-
ating individual SOM portraits of each of the 283 CRLM samples (Figures 1a and S2). These
portraits were group-sorted according to their previous classification [
5
] into five subtypes
LMS1–LMS5 and one “unclassified” group, LIV, which strongly resembled the transcrip-
tional characteristics of healthy liver tissue. Mean portraits of each subtype indicate the
specific expression patterns of spot-like clusters of overexpressed and under-expressed
genes, using a red-to-blue color scale, respectively (Figure 1b).
Cancers2023,15,xFORPEERREVIEW4of34
3.Results
3.1.SOMPortrayalDeciphersLiverMetastasesSubtypes(LMS)
Theself-organizingmaps(SOM)algorithmwasappliedtotranscriptomicdata,gen-
eratingindividualSOMportraitsofeachofthe283CRLMsamples(Figures1aandS2).
Theseportraitsweregroup-sortedaccordingtotheirpreviousclassication[5]intove
subtypesLMS1–LMS5andone“unclassiedgroup,LIV,whichstronglyresembledthe
transcriptionalcharacteristicsofhealthylivertissue.Meanportraitsofeachsubtypeindi-
catethespecicexpressionpaernsofspot-likeclustersofoverexpressedandunder-ex-
pressedgenes,usingared-to-bluecolorscale,respectively(Figure1b).
Thedistributionofspotnumbersrevealsthatthepersonalizedportraitsmostfre-
quentlyexpressonetothreesuchspot-clusters(Figure1c).LMS3andLMS4displaythe
broadestdistributionsofspotnumbersand,consequently,themostheterogeneousex-
pressionpaerns.Themostvariablegenesacrossthedatasetsarelocatedinsixmajorspot
modulesofthegenescoregulatedacrossthesubtypesassignedwithcapitalleersA–F
(seethesummaryandvariancemapsFigure1d).Eachmapiscomposedof2500pixels,
alsoreferredtoasmetagenes,eachcontainingupto200genes(seethepopulationmapin
Figure1d).Keygenesharboringsomaticmutationsand/orcopynumbervariations(CNV)
inCRCandCRLM[24–26]arelocatedinornearthespots(Figure1d,partbelow),which
includescanonicalcoreCRCdrivers(combinationsofAPC,KRAS,TP53,orSMAD4),
whichcancombinewithoneadditionalcandidatemetastasisdriver(TCF7L2,AMER1,or
PTPRT)[27].
SimilarityplotsoftheCRLMusingindependentcomponentanalysis(ICA,Figure
1e)andamultidimensionalnetworkpresentation(Figure1e)revealtheintermixingofthe
CRLMofnearlyallLMS,exceptforLMS5andLIV(seealsoSupplementaryFigureS3).
Thisisalsoreectedinthesilhoueeplotthatcomparesintra-versusinter-clustersimi-
laritiesofthesamples,where,exceptLMS5andLIV,alargefractionoftheCRLMshow
highpreferencesforinter-clustersimilarities.This,overall,conrmstheheterogeneous
transcriptionalpaernsofCRLMandtheirstraticationintoveLMSandaLIVgroup
enrichedinhealthylivertissue.
Figure1.SOMportrayalofLMSsubtypes.(a)IndividualgeneexpressionSOMportraitsofthe283
livermetastases(CRLM)samplestakenfrom[5](seeSupplementaryFigureS2foranenlarged
Figure 1.
SOM portrayal of LMS subtypes. (
a
) Individual gene expression SOM portraits of the
283 liver metastases (CRLM) samples taken from [
5
] (see Supplementary Figure S2 for an enlarged
portrait gallery). (
b
) Mean portraits of each of the subtypes LMS 1-5 and LIV (CRLM strongly
contaminated with liver tissue). N assigns the respective sample size. (
c
) Distribution of spot
numbers in the individual portraits of each subtype. (
d
) Supporting maps summarize the major
overexpression spots containing highly variant genes across all the portraits (assigned with capital
letters A–F; the variance of gene expression across the map; the color-coded distribution of the
number of genes in each of the 2500 pixels of the map called metagenes; and the locations of key
CRC- and CRLM-related genes on the SOM portraits. (
e
) Sample similarity relations are visualized
using Independent Component Analysis (ICA) and a multidimensional network. The silhouette plot
estimates the similarity of each sample to its own subtype cluster through silhouette scores, and
indicates the most similar subtype with a color-code bar below.
The distribution of spot numbers reveals that the personalized portraits most fre-
quently express one to three such spot-clusters (Figure 1c). LMS3 and LMS4 display the
broadest distributions of spot numbers and, consequently, the most heterogeneous expres-
sion patterns. The most variable genes across the datasets are located in six major spot
modules of the genes coregulated across the subtypes assigned with capital letters A–F
(see the summary and variance maps Figure 1d). Each map is composed of 2500 pixels,
also referred to as metagenes, each containing up to 200 genes (see the population map
Cancers 2023,15, 3835 5 of 31
in Figure 1d). Key genes harboring somatic mutations and/or copy number variations
(CNV) in CRC and CRLM [
24
26
] are located in or near the spots (Figure 1d, part below),
which includes canonical core CRC drivers (combinations of APC,KRAS,TP53, or SMAD4),
which can combine with one additional candidate metastasis driver (TCF7L2,AMER1, or
PTPRT) [27].
Similarity plots of the CRLM using independent component analysis (ICA, Figure 1e)
and a multidimensional network presentation (Figure 1e) reveal the intermixing of the
CRLM of nearly all LMS, except for LMS5 and LIV (see also Supplementary Figure S3).
This is also reflected in the silhouette plot that compares intra- versus inter-cluster simi-
larities of the samples, where, except LMS5 and LIV, a large fraction of the CRLM show
high preferences for inter-cluster similarities. This, overall, confirms the heterogeneous
transcriptional patterns of CRLM and their stratification into five LMS and a LIV group
enriched in healthy liver tissue.
3.2. LMS Are Governed by Six Major Modules of Co-Regulated Genes
The spots A–F taken from the mean LMS portraits represent modules containing
about 200 (spot A) to 623 (spot C) co-expressed genes (Figure 2a, lists of the spot genes are
provided in Supplementary Table S2). Their expression profiles reveal the overexpression
of spot A in LMS1, spots B and D in LMS3, spot C in LMS4, spot E in LMS5, and the
under-expression of spots A and C in LMS5 (Figures 2b and S3). Their functional context
was established by gene set analysis using an array of gene sets implemented in oposSOM
(see Figure 2a, Supplementary Table S1 and next subsection). The spot profiles show
the expression of the spot modules as bar plots, where the samples within each subtype
are ordered with the increasing expression values of the spot E that shows inflammatory
characteristics (Figure 2b). The negative slope of the expression values of spot D, associated
with upper crypt genes and chromosomal instability (CIN) anticorrelates with spot E.
Conversely, the expression of spot A (epithelium) positively and negatively correlates with
the inflammatory characteristics, respectively, suggesting an overlay of various immune-
related functions in LMS1 and LMS5. Overall, the distinct spot profile indicates complex co-
expression patterns governing a multitude of cellular functions, which are addressed below.
Spot F exhibits the expression characteristics of healthy liver tissue, with high values
in the LIV group and in about 35% of the CRLM samples across all LMS, suggesting
contamination with liver tissue (Figure 2b, part below). These samples are prone to LMS
misclassification, as most of them show a negative silhouette score and preference for
the LIV-group (Figure 1d). Liver (ALB) and intestine gene marker genes (KRT20) [
5
] are
located in spots F and C, respectively, defining a major transcriptomic axis between liver
and CRLM.
The receiver operating characteristic (ROC) curves indicate a high classification power
for the overexpression of spot A for LMS1 (AUC = 0.95), of spot B (0.86) and D (0.81) for
LMS3, of spot C for LMS4 (0.76), and of spot E for LMS5 (0.98) (Figure 2c). LMS2 exhibits
mixed characteristics, combining moderate AUCs for spots A, B, and C. Notably, all the
nine genes of the mini classifier distinguishing between LMS1 and other subtypes [
5
] are
located in spot A. Spot C (cycling genes) is a specific under-expression marker for LMS5
(AUC = 0.11), implying that cell-cycle activity in LMS5 is distinctly reduced compared with
the other LMS. ROCs of spot F reflect no preference for any LMS, thus supporting the view
that contaminations with healthy liver tissue are distributed over all LMS.
The combined classification power of these spots is demonstrated using ternary dia-
grams of their expressions (Figure 2d). A ternary combination of spots A, D, and E provides
the best resolution, with LMS1 distributing to the spot A corner, while LMS3 and LMS5
distribute to the D and E corners, respectively, and LMS4 to the middle. LMS2 could not be
specifically classified with single spot markers. Other combinations of spots in the ternary
diagrams show a depleted population of the B-corner, especially in combination with spots
A and D.
Cancers 2023,15, 3835 6 of 31
Cancers2023,15,xFORPEERREVIEW6of34
spotsprovidetranscriptomicmarkersignaturesofthesubtypeswheretriplecombinations
ofepithelial-(spotA),CIN(spotD),andmesenchymal/inammation(spotE)bestresolve
thesubtypesLMS1,LMS2-4,andLMS5,respectively.
Figure2.Expressionprolesofthespot-modules.(a)Theheatmapshowssample-specicexpres-
sionofeachspot,groupedbysubtypes.HierarchicalclusteringofsamplesaccordingtospotFex-
pressionisshownbelowalongwiththeexpressionofthespotFthatreectslowerandhigherliver
cellcontentinLIVlowandLIVhigh,respectively.Thepercentageofsamplesclassiedintohighand
lowspotFexpressionprolesisshownbelow.Genesetanalysisassociatesspotgeneswithbiolog-
icalfunction(seeSupplementaryTab leS1andtext).(b)Barplotsofspotexpressionprolesare
sortedwithincreasingexpressionofspotEineachsubtypetobeervisualizeco-expressionofthe
otherspotswithinammatoryresponse.(c)Receiveroperatingcharacteristiccurves(ROC)ofsub-
typeclassicationpowerofeachspotexpression.ROCcurvesbelowthediagonalline(AUC<0.5)
meanthatunderexpression(lowsensitivity)isabeerpredictorthanoverexpression(highsensi-
tivity).AUCvaluesarelistedinthetablebelow.(d)Ternarydiagramscombineexpressionofse-
lectedspottuples.
3.3.FunctionalandTMEContextandRelationtoPrimaryCRCCharacteristics
TofurthercharacterizethefunctionalcontextoftheCRLMsubtypes,weanalyzeda
varietyofgenesetsimplementedintheoposSOMsoftwarepackagewithheatmaps(Fig-
ures3andS4),withsingle-setprolesandgenesetmapsfordetailedinspection(Supple-
mentaryFigureS5),andspotenrichment.Forcomparisonwithapriorindependentgene
expressionstudyonCRLM,weconsideredtwenty-sixgenemodules(GM1-26)identied
bytheauthors[4]andcomputedtheirGSZscoresinoursamples(Figure3a).Thegene
setsrelatedtoendothelialcharacteristics,angiogenesis,andinnateimmunitywereupreg-
ulatedinLMS5,thoserelatedtocellcycle—inLMS1-4,thoserelatedtoinammationand
Figure 2.
Expression profiles of the spot-modules. (
a
) The heatmap shows sample-specific expression
of each spot, grouped by subtypes. Hierarchical clustering of samples according to spot F expression
is shown below along with the expression of the spot F that reflects lower and higher liver cell
content in LIVlow and LIVhigh, respectively. The percentage of samples classified into high and low
spot F expression profiles is shown below. Gene set analysis associates spot genes with biological
function (see Supplementary Table S1 and text). (
b
) Barplots of spot expression profiles are sorted
with increasing expression of spot E in each subtype to better visualize co-expression of the other
spots with inflammatory response. (
c
) Receiver operating characteristic curves (ROC) of subtype
classification power of each spot expression. ROC curves below the diagonal line (AUC < 0.5) mean
that underexpression (low sensitivity) is a better predictor than overexpression (high sensitivity).
AUC values are listed in the table below. (
d
) Ternary diagrams combine expression of selected
spot tuples.
Overall, the six overexpression spots provide transcriptomic marker signatures of
four (LMS1, LMS3, LMS4, LMS5) of the five subtypes. Triple combinations of spots A
(epithelium), D (CIN), and E (endothelium/mesenchyme and inflammation) best resolve
subtypes LMS1, LMS3, LMS 4, and LMS5, respectively. Therefore, the six overexpression
spots provide transcriptomic marker signatures of the subtypes where triple combinations
of epithelial- (spot A), CIN (spot D), and mesenchymal/inflammation (spot E) best resolve
the subtypes LMS1, LMS2-4, and LMS5, respectively.
3.3. Functional and TME Context and Relation to Primary CRC Characteristics
To further characterize the functional context of the CRLM subtypes, we analyzed
a variety of gene sets implemented in the oposSOM software package with heatmaps
(
Figures 3and S4
), with single-set profiles and gene set maps for detailed inspection (Sup-
Cancers 2023,15, 3835 7 of 31
plementary Figure S5), and spot enrichment. For comparison with a prior independent
gene expression study on CRLM, we considered twenty-six gene modules (GM1-26) iden-
tified by the authors [
4
] and computed their GSZ scores in our samples (Figure 3a). The
gene sets related to endothelial characteristics, angiogenesis, and innate immunity were
upregulated in LMS5, those related to cell cycle—in LMS1-4, those related to inflammation
and neoantigens found in microsatellite instable (MSI) CRC in LMS1, and those specific to
liver-tissue spread across all LMS.
Next, we analyzed the expression signatures of primary CRC subtypes in our LMS-
stratified CRLM data (Figures 3b–d and S5a). Signatures of four consensus molecular
subtypes (CMS1-4) of CRC [
24
] indicate associations between the subtypes of the primary
tumors and of liver metastases, namely CMS1 (immune-activated) with LMS1 (and to a
less degree with LMS5), CMS2 (canonical, WNT-signaling) with LMS2-4, CMS3 (metabolic)
with LMS4 (and partly LMS1), and CMS4 (mesenchymal) with LMS5 (Figure 3b, see
also [
5
]). This indicates the resemblance of transcriptional programs between LMS and
CMS subtypes, which is further supported with signatures of another gene-interaction-
perturbation-network-based subtyping (GINS1-6) [28] (Figure 3c).
To gain further insight into the immunogenetic properties of LMS, we assessed signa-
tures associated with immune microenvironmental subtypes (IM1-5). These subtypes con-
sider the activation patterns of the MHCII receptor HLA-DR and of the immune checkpoint
inhibitors (ICI) PD1 and ICOS related to T-cell states (Figure 3d) [
3
]. According to these
patterns, the inflamed IM2 type, which resembles LMS1, is characterized by deactivated
ICIs. In contrast, the LMS5-resembling IM1 and IM5 exhibit a T-cell-exhausted immune
environment. This is regulated by the hypoxia-marker SLCA1 with inferior survival risk
and treatment resistance [3].
Next, we considered a series of signatures characterizing the tumor microenvironment
(TME, Figure 3e–h). A pan-cancer classification of TME-types [
29
] shows that LMS1
corresponds to a fibrotic TME-type, LMS2–LMS4 to an immune depleted and proliferative
TME-type, and LMS5 to an immune-enriched and fibrotic TME-type (Figure 3e). For a
closer look, we mapped immunogenicity-related genes in solid cancers to the CRLM [
30
]
(Figures 3f and S4). Accordingly, the expression of MHC class II, and both immune-inhibitor
and -stimulator genes, upregulate in LMS5, while MHC class I shows slight activity in both
LMS1 and LMS5. To obtain a more specific picture, we next studied signatures extracted
from the single-cell transcriptomics data of CRLM [
22
] (Figures 3g and S6). It shows that
endothelial cell activity in LMS5 is accompanied by upregulated expression from a series of
immune cells such as B-, natural killer (NK)-, myeloid, plasma-, and plasmacytoid dendritic
(pDC) cells. It is reported that, e.g., pDCs, promote the recruitment of regulatory T cells
(Tregs) into the tumor microenvironment, leading to immune suppression and promoting
tumor growth [
31
]. Contrarily, epithelial and T cells are activated in LMS1 and, partly,
LMS4. Cell-type deconvolution refines this result in terms of the fraction of immune cells
in the TME (Figure 3h) [
32
]. Indeed, CD8+ T- and NK-cell invasion is most prominent in
LMS1, while regulatory T and B cells are enriched in LMS5. Interestingly, the frequencies
of tumor-suppressive M1- and of tumor-promoting M2-macrophages are high and low,
respectively, which suggests a pivotal role of macrophage polarization in CRLM in analogy
to primary CRC [
33
35
]. We found similar activity profiles across the LMS using specific
marker sets for M2-like (SPP1+ and MRC1 + CCL18) macrophages identified in CRLM and
possibly originating from liver-intrinsic Kupffer cells [7].
Finally, we were interested in signatures related to cancer hallmarks and copy num-
ber variations along the chromosomes and chromatin states associated with genetics and
epigenetics, respectively (Figures 3i–k and S5c–e). Cancer hallmark signatures [
36
] reveal
functional associations in a broader context (Figure 3i), which generally reflect subtype-
specific activations of the hallmark programs. For example, the epithelial–mesenchymal
transition (EMT) signature is activated in LMS5, while being downregulated in LMS1, corre-
sponding to the mesenchymal and epithelial nature of these subtypes. Gene expression and
copy number variations (CNV) are typically linked via a dose–response relationship [
37
].
Cancers 2023,15, 3835 8 of 31
We found that the genes on chromosomes 7 and 13, both showing CNV gains in primary
CRC and CRLM [
38
], accumulate in spot D and are upregulated in LMS3 (and, partly
LMS4), while the genes on, e.g., chromosome 18 showing CNV-losses in CRC and CRLM,
downregulate in these LMS (Figure 3j, see red and blue boxes, respectively). Overall, we
observed the systematic over- and under-expression of genes along chromosomes known
for their CNV gains and losses in CRC and CRLM: they accumulate in the CIN-related
LMS3 (and partly LMS4) and thus govern the genomic regulation of these LMS.
We were also interested in the footprints of (epi-)genetic effects in the CRLM tran-
scriptomes. Gene sets for different chromatin states in colonic tissue were taken from [
39
]
and mapped to the CRLM transcriptomes (Figures 3k and S5e, see also [
40
]). Genes with
active (TssA) and with repressed and poised (ReprPC and TssP, respectively) promoters
were upregulated in a virtually exclusive fashion in each of the LMS. The percentage of
CRLM with highly expressed genes with TssA-promoters increased from 7% in LMS1 to
48% in LMS4, meaning that the fraction of open chromatin was markedly larger in the
proliferative LMS4 compared with epithelial LMS1. In turn, closed chromatin associated
genes with repressed and poised promoters had the biggest fractions in LMS1 and LMS5
with epithelial and endothelial characteristics, respectively.
In summary, gene set analysis specifies the context of the LMS in terms of cellular
programs related to epithelial- (LMS1), endothelial/mesenchymal- (LMS5), proliferative
and virtually immune desert TME (LMS3 and 4), and immune cell composition, as well as
chromatin states and CNV. These mechanisms markedly resemble molecular mechanisms
observed in CRC-subtypes, suggesting that CRLM partly preserves the characteristics of
the primary tumors. Pathway activities can be browsed interactively using the oposSOM
web tool (see above). As worked example, activity patterns of the WNT-pathway for each
LMS is provided in Supplementary Figure S7 showing specific activation in LMS2.
Cancers2023,15,xFORPEERREVIEW8of34
knownfortheirCNVgainsandlossesinCRCandCRLM:theyaccumulateintheCIN-
relatedLMS3(andpartlyLMS4)andthusgovernthegenomicregulationoftheseLMS.
Wewerealsointerestedinthefootprintsof(epi-)geneticeectsintheCRLMtran-
scriptomes.Genesetsfordierentchromatinstatesincolonictissueweretakenfrom[39]
andmappedtotheCRLMtranscriptomes(Figures3kandS5e,seealso[40]).Geneswith
active(TssA)andwithrepressedandpoised(ReprPCandTss P,respectively)promoters
wereupregulatedinavirtuallyexclusivefashionineachoftheLMS.Thepercentageof
CRLMwithhighlyexpressedgeneswithTssA-promotersincreasedfrom7%inLMS1to
48%inLMS4,meaningthatthefractionofopenchromatinwasmarkedlylargerinthe
proliferativeLMS4comparedwithepithelialLMS1.Inturn,closedchromatinassociated
geneswithrepressedandpoisedpromotershadthebiggestfractionsinLMS1andLMS5
withepithelialandendothelialcharacteristics,respectively.
Insummary,genesetanalysisspeciesthecontextoftheLMSintermsofcellular
programsrelatedtoepithelial-(LMS1),endothelial/mesenchymal-(LMS5),proliferative
andvirtuallyimmunedesertTME(LMS3and4),andimmunecellcomposition,aswellas
chromatinstatesandCNV.Thesemechanismsmarkedlyresemblemolecularmechanisms
observedinCRC-subtypes,suggestingthatCRLMpartlypreservesthecharacteristicsof
theprimarytumors.PathwayactivitiescanbebrowsedinteractivelyusingtheoposSOM
webtool(seeabove).Asworkedexample,activitypaernsoftheWNT-pathwayforeach
LMSisprovidedinSupplementaryFigureS7showingspecicactivationinLMS2.
Figure3.FunctionalcharacteristicsofCRLMusinggenesetZscores(GSZ)prolinginCRLMof
varioussignatures:(a)Expressionofgenemodules(GM)extractedfromanindependentstudy[4]
conrmLMSandspotclustersinthisstudy.(b)ConsensusmolecularsubtypesofCRC(CMS1-4)
[24]resembleCRLMexpression.(c)Thereispartialconcordancebetweenthegeneinteractionper-
turbationnetworksubtypes(GINS1-6)signatures[28]andtheLMSexpressionpaerns.(d)Immune
microenvironmenttypes(IM1-5)relatetheLMStospecicactivationpaernsofimmunecheck-
pointinhibitorsandHLADRB[3].(e)Tumormicroenvironment(TME)PanCancersignaturestaken
from[29]assigntheLMStodierentTME-subtypes.(f)Immunogenicitygenesinsolidcancerswere
mostlyupregulatedinLMS5[30].(g)Cellspecicexpressionmarkersextractedfromasingle-cell
RNAanalysisofCRLM[22]activateintwoclustersinLMS1andLMS5,respectively.(i)Hallmark
signaturesofcancer[36]conrmthefunctionalassignmentofLMSandspots.(j)Genesonchromo-
somesshowdoseresponseeectsofcopynumbergains(redframe)andlosses(blue)typically
Figure 3.
Functional characteristics of CRLM using gene set Z scores (GSZ) profiling in CRLM of
various signatures: (
a
) Expression of gene modules (GM) extracted from an independent study [
4
]
confirm LMS and spot clusters in this study. (
b
) Consensus molecular subtypes of CRC (CMS1-
4) [
24
] resemble CRLM expression. (
c
) There is partial concordance between the gene interaction
perturbation network subtypes (GINS1-6) signatures [
28
] and the LMS expression patterns. (
d
) Immune
Cancers 2023,15, 3835 9 of 31
microenvironment types (IM1-5) relate the LMS to specific activation patterns of immune check-
point inhibitors and HLA-DRB [
3
]. (
e
) Tumor microenvironment (TME) PanCancer signatures taken
from [
29
] assign the LMS to different TME-subtypes. (
f
) Immunogenicity genes in solid cancers were
mostly upregulated in LMS5 [
30
]. (
g
) Cell specific expression markers extracted from a single-cell
RNA analysis of CRLM [
22
] activate in two clusters in LMS1 and LMS5, respectively. (
i
) Hallmark
signatures of cancer [
36
] confirm the functional assignment of LMS and spots. (
j
) Genes on chro-
mosomes show dose response effects of copy number gains (red frame) and losses (blue) typically
observed in CRC and CRLM [
41
]. (
k
) Epigenetic signatures identify two distinct patterns across all
LMS, one related to open, actively transcribed genes, the other to repressed and poised ones [
42
].
(
h
) Cell-deconvolution of CRLM transcriptomics data [
43
] reveals LMS-specific fractions of selected
cell types. Gene sets implemented in this study are given in Supplementary Table S3.
3.4. Trajectory Inference Indicates Continuous Alterations of Cellular States along Developmental
Paths and Gradual Changes of the TME
We performed trajectory analysis on the SOM portraits using the Monocle method [
44
],
which sorts CRLM in low-dimensional space, an approach usually applied for inferring
developmental paths in single-cell RNAseq settings. The application of this method to bulk
expression data enables the extraction of potential paths of cancer progression [
13
]. The
obtained tree divides into three major segments (Seg1–3) where Seg1 further subdivides
into six subsegments (Seg1.1–1.6, Figure 4a). The mean portraits of the (sub)segments reveal
systematic alterations of the expression patterns. To better characterize these changes, we
visualize spot expression as well as the accumulation of the different LMS along the tree
(Figure 4b,c, respectively). LMS2, LMS5 (partly), and LMS3 samples accumulate near
the ends of Seg1, Seg2, and Seg3, respectively, paralleled by the high expression of the
respective marker spots A, D, and E, respectively. LMS4 CRLM distributes over wider
ranges of the tree and the tumors of LMS5 also accumulate in Seg1.3, which is also described
with the expression of spot E.
To better understand expression changes along the three paths connecting different
tip-points we generated spot profiles along the segments (Figure 4d). They reveal that
the tree sorts the CRLM according to gradually changing expression levels, e.g., of the
linearly increasing spot D expression from Seg1.1 to Seg2 opposed by its decay along
Seg3, particularly associated with the accumulation of LMS5 (Figure 4d, middle column,
Movie S1, a movie of the CRLM-portraits along the trajectory from Seg1.1 to Seg3 is pro-
vided in the supplementary file). Hence, tree analysis shows that the activity levels of
biological characteristics such as epithelium (spot A), proliferation (spot C), CIN (spot
D), and endothelium/inflammation (spot E) changes continuously and not abruptly be-
tween the CRLM, which possibly reflects developmental dynamics rather than static states.
Interestingly, these continuous changes are overlaid by the typical differences between
the subtype expression, e.g., of the high expression of spot A in LMS1 and of the lower
expression in LMS2-4 along Seg1 and Seg2.
Note that these expression differences associate with the overall survival hazard
ratio (HR) and thus with prognosis, which also changes smoothly along the trajectories.
Interestingly, CRLM samples of LMS1 split into two subgroups described with a high and
low spot A expression and HR values. A series of individual portraits along the trajectory
Seg1.1 towards Seg3 reveals that the expression patterns dynamically shift towards spot E,
via the lower right part of the map (plasticity plateau, see next subsection), or rather, via its
upper edge (Figure 4d, part below). Hence, the trajectories refer to (pseudo-) dynamically
changing expression patterns beyond the spots, which overall illustrates the diversity of
the transition states linking the states at the tips of the tree, which can be interpreted as
archetypic expression states (Supplementary Figure S8).
Cancers 2023,15, 3835 10 of 31
Cancers2023,15,xFORPEERREVIEW11of34
Figure4.TreetrajectoryanalysisofCRLMheterogeneityusingmonocle[44].(a)Themonocle-tree
dividesintotwomajorsegments(Seg1–3)whereSeg1furthersplitsintosixsubsegments(1.1–6).
MeanexpressionportraitsaveragedovertheCRLMportraitsalongthesegmentsrevealchanging
expressionpaerns.(b)SamplesalongthetreearecoloredbyspotexpressionA–F.Ellipsesindicate
areasofhighexpressionoftherespectivespot.(c)DierentLMSaccumulateindierentsegments
asmarkedbytheellipsesandbytheoverlapcoecients(OC=overlap(LMS,segment)/(min_size
(LMS,segment)betweenLMSandsegments.(d)Prolesalongdierentpathsrevealsmooth
changesinspotexpression.Thelogexpressionprolesofspotsandhazardratios(HR)usesorted
CRLMalongtheabscissa.Pseudotime(PT)-scaledplotsvirtuallycompressthedataalongthe
subsegments1.1and1.3(SupplementaryFigureS8a).SeriesofindividualCRLMportraitsfrom
LMS1andLMS5indicatetheirpseudo-dynamicstowardsspotE(partlybelowandseealsoarrows
intheportraits;forportraitsofothersubtypesseeSupplementaryFigureS8b).Theblueandred
dashedlinesalongtheHRscaerplotvisualizeLMS1HR-highand-lowsubgroups.(e)Sankey
river-owdiagrambetweensubtypeandsegmentstraticationofCRLMs.OCvaluesgreaterthan
0.5showthatLMS3and,partly,LMS4owtoSeg2,andLMS5toSeg3andSeg1.3.Theirdierence
portraitsresembletheepithelialpaernsofSeg1.1tooverlaywithSeg3paerns.Forinterpretation
offunctionalcharacteristicsseeSupplementaryFigureS5andnextsubsection.(f)Comparisonof
plasmaandendothelialcellsignatures(Figure3e)intermsofbiplotsandtheirdierenceprole
indicatethatbothsignaturescorrelatebutcombinewithdierentamplitudesintheCRLM(redand
bluearrowsandframes).
Figure 4.
Tree trajectory analysis of CRLM heterogeneity using monocle [
44
]. (
a
) The monocle-tree
divides into two major segments (Seg1–3) where Seg1 further splits into six subsegments (1.1–6).
Mean expression portraits averaged over the CRLM portraits along the segments reveal changing
expression patterns. (
b
) Samples along the tree are colored by spot expression A–F. Ellipses indicate
areas of high expression of the respective spot. (
c
) Different LMS accumulate in different segments
as marked by the ellipses and by the overlap coefficients (OC = overlap (LMS, segment)/(min_size
(LMS, segment) between LMS and segments. (
d
) Profiles along different paths reveal smooth changes
in spot expression. The log expression profiles of spots and hazard ratios (HR) use sorted CRLM
along the abscissa. Pseudotime (PT)-scaled plots “virtually” compress the data along the subsegments
1.1 and 1.3 (Supplementary Figure S8a). Series of individual CRLM portraits from LMS1 and LMS5
indicate their pseudo-dynamics towards spot E (partly below and see also arrows in the portraits; for
portraits of other subtypes see Supplementary Figure S8b). The blue and red dashed lines along the
HR scatterplot visualize LMS1 HR-high and -low subgroups. (
e
) Sankey river-flow diagram between
subtype and segment stratification of CRLMs. OC values greater than 0.5 show that LMS3 and, partly,
LMS4 flow to Seg2, and LMS5 to Seg3 and Seg1.3. Their difference portraits resemble the epithelial
patterns of Seg1.1 to overlay with Seg3 patterns. For interpretation of functional characteristics
see Supplementary Figure S5 and next subsection. (
f
) Comparison of plasma and endothelial cell
signatures (Figure 3e) in terms of biplots and their difference profile indicate that both signatures
correlate but combine with different amplitudes in the CRLM (red and blue arrows and frames).
Cancers 2023,15, 3835 11 of 31
The river plot visualizes the distribution of LMS between the segments (Figure 4e).
A comparison of the mean portraits on both sides of the plots indicates more diverse
and partly different spot patterns, particularly along Seg1, which suggests a fine granular
activation of cellular programs in the CRLM not resolved in the LMS strata. One finds
CRLM dominated by LMS1 in Seg1.1 and that LMS4 distributes broadly over Seg1.1, 1.2,
1.4–1.6, and 1.3, all of which show footprints of oxidative phosphorylation (oxphos) and
proliferation. To better understand the bimodal split of LMS5 between Seg3 and Seg1.3 we
calculated the difference portrait between them. It reveals that Seg1.3 can be understood
as a superposition of the transcriptional programs of Seg3 and Seg1.2, i.e., of endothelial,
epithelial, and proliferative gene functions, i.e., as a mixture of LMS3 and, to a lesser degree,
of LMS1, which reflects a transition state between both LMS.
Moreover, Seg1.3 and Seg3 exhibit minor differences in the position of spot E in the
right upper corner of the map, which also becomes evident in the difference map (see red
and blue arrow in Figure 4e). Spot E is associated with the TME, which contains endothe-
lial/CAF and inflammatory cells. The biplot of both compounds (where we selected plasma
cells (PC) as a proxy of the immune compound) shows an overall linear relation between
them with maximal values in LMS5 (Figure 4f, left part). The individual CRLM values
considerably scatter with large deviations into positive (CAF dominance, red dashed frame
and arrow) and negative (PC dominance, blue dashed frame and arrow) directions. These
variations reflect relative PC-dominance in LMS1, partly in LMS4, and CAF dominance in
LMS5 and partly in LMS3 (difference plot in Figure 4f, right part).
When considering the marker genes of the other immune cells, we observe that B
cells behave similarly to PC while mast cells more resemble CAFs, which overall reflects
a CRLM-specific change in the cell communities of the TME (Supplementary Figure S9).
Notably, the LMS5 tumors in Seg1.3 and Seg3 refer to a more endothelium/CAF-enriched
and -depleted TME, respectively (Supplementary Figure S9).
In conclusion, we observe a modified ordering of CRLMS along the monocle-tree,
revealing continuously changing expression levels between the LMS, rather than clear-cut
expression differences in most cases. This suggests developmental relationships among the
tumors. Their ordering is governed by epithelial LMS1, CIN-affected LMS3, and immune
cell-enriched LMS5 accumulating near the ends of the three major branches of the tree.
These can be interpreted as archetypic expression states. LMS5 further splits into substrata
enriched more with immune cells or CAFs, where the latter subgroup shares endothelial
functions with LMS1 and also proliferative activity with LMS4.
3.5. High-Resolution Expression Cartography Deciphers an Interplay between Genetic and
Epigenetic Regulation of Metastasis
Next, we sought to better understand the fine-grained details of the expression land-
scape that became visible along the monocle tree. The LMS-averaged mean portraits
provided a transcriptomic landscape with six major overexpression spot modules A–F, each
associating with a specific functional context and a specific differential expression in virtu-
ally all of the subtypes (Figure 5a and also Figures 1and 2). The oposSOM software offers
an alternative segmentation of the expression landscape, such as through a personalized
summary map. This approach leverages the individual expression portraits of CRLM, thus
enabling the detection of less frequent and subtler overexpression patterns independent of
LMS classification (see Supplementary Figure S10 for details). This method reveals a richer,
more structured overexpression landscape with additional details. These include a “plas-
ticity plateau” in the right lower corner as well as an “RNA-protein molecular processing
ridge” of modules assigned to functions such as RNA-processing, de-ubiquitination, and
proteasome in the middle of the map (Figure 5b). The former plateau is associated with
the epigenetic mechanisms of cellular reprogramming into plastic states, such as poised
and repressed gene promoters in colonic tissue, targets of PRC2 (polycomb repressive
complex 2) and of the CpG island methylation phenotype (CIMP) [
45
47
], the genes re-
sponsive for thretionin treatment via rhodopsin receptors [
48
], and gene signatures related
Cancers 2023,15, 3835 12 of 31
to keratinization [
49
,
50
] (Supplementary Figure S5). The “processing ridge” shows distinct
expression modules associating with CRC- and CRLM-related mechanisms, such as protein
degradation and de-ubiquitination [
51
53
], RNA-processing, guanyl-exchange [
54
], and
ribosomal assembly [55], as well as the adaptation of the Golgi apparatus in cancer [56].
Weighted-topology overlap (wto-) networks [
57
] between the spots reveal anticor-
relation between spot E and the other spots on the one hand and between the plasticity
plateau and the proliferative spot C on the other, i.e., overall between the left and right
side of the map (see the wto-correlation maps in Figure 5a,b). This reflects the antagonism
between the inflammatory and endothelial contexts (spot E and LMS5) and the epithelial
and proliferative functions (spot A–D). Indeed, biplots of the gene set activities “cell-cycle”
versus PRC2-targets” reflect a negative slope between LMS5 and the other LMS while the
correlation of “cell cycle” versus “translation” and “oxphos” is positive (Figure 5c, left part).
Profiles and maps of these gene sets provide further interesting details (Figure 5c, right
part): genes of the set “cell cycle” accumulate in spot C and genes of the sets translation
and oxphos accumulate in the three characteristic areas, including spot B (see also Supple-
mentary Figure S5). All these functions associate with low expression in LMS5 and high
expression in the other LMS, however with subtle mutual differences, e.g., the moderate
downregulation of oxphos activity in LMS3.
The genes EZH2 and SUZ12, both encoding components of the PRC2 complex, also
locate near spot C, while, in contrast, the PRC2-target genes accumulate in characteristic
areas in the right part of the map near spots E, partly A, and in the “plasticity plateau”.
They are associated with upregulation in LMS5 and partly in LMS1. Genes activated in the
CIMP (CpG island methylator phenotype) accumulate in similar regions, thus suggesting
the promoter hypomethylation of PRC2-targets. Similar gene accumulation patterns in the
right part of the map were found for repressed and poised promoters in the colon as well
as so-called low-expression transcription factors (low-TF) which associate with virtually
deactivated chromatin states in a wider sense (Figures 5c and S5e) [
58
]. It was recently
reported that the downregulation of EZH2, the catalytic subunit of PRC2, results in increased
DNA replication initiation through the loss of repressive histone methylation marks from
bivalent, poised promoters retaining the ability to be activated as differentiation proceeds [
59
].
It involves thousands of genomic loci of DNA replication origins scattered throughout the
genome that associate with open euchromatin regions and the regulation of pluripotency
and differentiation-related genes. In contrast, so-called high expression-TF genes and active
promoter states accumulate in the left part of the map including spots B–D, in activated
cycling and metabolically active states. Importantly, key mutated genes in CRC and CRLM
also locate in the left high-expression part of the map in support of their driver function for
transcriptome programs (Figure 1c). The bimodal expression of repressed/poised versus
active/transcribed chromatin states, evident in the heatmap in Figure 3k, thus transforms into
virtually left-to-right antagonisms of gene expression in the SOM landscape.
Comparing the averaged expression portraits along the tree segments (Figure 4a) with
the personalized landscape (Figure 5b) enables the inference of a trajectory in the gene-
expression landscape [
60
]. Accordingly, it links expression modules related to epithelial
functions (Seg1) with intestine-developmental (Seg2) and inflammatory and mesenchyme
(Seg3) functions. Moreover, it suggests an additional path through the plasticity plateau
towards the LMS5 tumors accumulating in Seg1.3 (Figure 5a). Interestingly, this “epigenetic”
path includes a region enriched in G-protein-coupled receptors (Figure 5c), which play a
crucial role as signaling transducers in the development of CRC [
61
] in promoting EMT [
62
],
where particular G-protein-coupled receptors (GPCRs) such as ADGRF5 [
63
] and ARRB1
and ARRB2 [
64
] were identified as EMT- and poor survival markers in CRC. These genes
locate near spot E and upregulate in LMS5. The GPCR LGR5 as an exception locates
in the high expression side of the landscapes in spot C, upregulated in highly cycling
CRLM, which agrees with the role of LGR5 as a marker for proliferative stem-like cells in
CRC [
65
,
66
]. In summary, high-resolution cartography of the CRLM transcriptomes reveals
a network of cellular programs governed by genetic lesions (mutations and CNVs) and by
Cancers 2023,15, 3835 13 of 31
epigenetic chromatin remodeling leading to the antagonistic activation of epithelial-like
proliferative cancer cells or inflammatory, mesenchyme-like cells. This provides rationales
for the sequential activation of transcriptional programs and the ordering of CRLM along
the segments of the monocle-tree.
Cancers2023,15,xFORPEERREVIEW14of34
Figure5.TranscriptomiclandscapeofCRLM.(a)LMSlandscapeexpressespeaksA–Fupregulated
inasubtype-specicfashion.Blueregionsassignareasof,onaverage,downregulatedgenomicpro-
gramsineachofthesubtypes.CMS-resemblanceisshowninthesmallmap.Theweighted-topology
overlap(WTO)networkshowscorrelationsbetweenspotsrevealingthatSpotEoverallanticorre-
lateswiththeotherspots.(b).The”personalized”landscapeconsidersupregulatedspotsofindi-
vidualsamples,independentofLMSsubtyping.Itrevealedamoregranularexpressionlandscape
and,particularly,a“plasticityplateau”relatedtovariousepigeneticmechanismspossiblypromot-
ingcellularreprogramming.TheWTOnetworkshowsassociationbetweentheprograms,mostly
demonstratinganticorrelationoftheplasticityplateauwiththeproliferativeprogramsnearspotC.
(c)Biplotsofcellcycleexpressionsignature[67]withPRC2targets[68]indicateoverallnegative
correlationsandpositivecorrelationswithtranslationandoxidativephosphorylation(oxphos)[36].
Thegenesofthelaerfunctionalitiesaccumulateinregionsalongtheleftedgeofthemap(red
circles)whilePRC2targetsoccupyspecicareasintherightpart,which,notably,agreewithareas
occupiedbymethylatedgenesoftheCRC-CIMPtype[69]andalsoG-protein-coupledreceptors
(ReactomeGPCR-signaling).
3.6.PortrayalofClinical,Mutational,andTelomereMaintenanceCharacteristics
Next,wecharacterizedtheCRLMafterstraticationintothedierentLMSwithre-
specttoselectedclinicalitemsandmutations,aswellastelomeremaintenance(TM)
Figure 5.
Transcriptomic landscape of CRLM. (
a
) LMS landscape expresses peaks A–F upregulated in a
subtype-specific fashion. Blue regions assign areas of, on average, downregulated genomic programs in
each of the subtypes. CMS-resemblance is shown in the small map. The weighted-topology overlap
(WTO) network shows correlations between spots revealing that Spot E overall anticorrelates with the
other spots. (
b
). The ”personalized” landscape considers upregulated spots of individual samples,
independent of LMS subtyping. It revealed a more granular expression landscape and, particularly, a
“plasticity plateau” related to various epigenetic mechanisms possibly promoting cellular reprogram-
ming. The WTO network shows association between the programs, mostly demonstrating anticorrelation
of the plasticity plateau with the proliferative programs near spot C. (
c
) Biplots of cell cycle expression
signature [
67
] with PRC2 targets [
68
] indicate overall negative correlations and positive correlations
with translation and oxidative phosphorylation (oxphos) [
36
]. The genes of the latter functionalities
accumulate in regions along the left edge of the map (red circles) while PRC2 targets occupy specific
areas in the right part, which, notably, agree with areas occupied by methylated genes of the CRC-CIMP
type [69] and also G-protein-coupled receptors (Reactome GPCR-signaling).
Cancers 2023,15, 3835 14 of 31
3.6. Portrayal of Clinical, Mutational, and Telomere Maintenance Characteristics
Next, we characterized the CRLM after stratification into the different LMS with
respect to selected clinical items and mutations, as well as telomere maintenance (TM)
pathway activities by means of SOM portrayal and overall survival (OS) curves (Figure 6).
LMS1 has an inferior prognosis compared with the other LMS2-5 with a virtually identical
prognosis (see also [
5
]). These OS-relations partially align with the comparable CMS,
namely the inferior prognosis of CMS1 CRC compared with CMS2-4 [
70
]. However,
this deviates from other studies that reported a poorer prognosis for LMS5-resembling
CMS4 [
24
,
28
], possibly due to the early stage, non-metastatic nature of these CRC cohorts.
Furthermore, males, compared with females, showed a higher incidence (67% versus 33%)
as well as an inferior prognosis. This sexual bimorphism is in-line with previous reports on
CRC [
71
73
] and liver metastases possibly due to the regulation of the TME [
74
]. The sex-
stratified portraits indicate the increased expression of spot D (CIN) in women and of spot
B (PRC2, metabolism) in men together with an increased expression level of the plasticity
plateau. Sex-specific biomarkers for CRC taken from [
75
], located in spot D (CLDN1,
ANAPC7), are upregulated for women, and in/near spot B (ESM1,GUCA2A,VWA2) are
upregulated for men, which further supports the analogy between CRC and CRLM.
Cancers2023,15,xFORPEERREVIEW16of34
Figure6.Subtypes,selectedclinicalcharacteristics,somaticmutations,andtelomeremaintenance
pathwayactivation.Therespectivecaseswereshowninsamplespaceassimilaritynets,asmean
portraitsaveragedovertherespectivesamples,andintermsofprognosisusingoverallsurvival
(OS)curves.ThetableliststhepercentagesofCRLMintherespectivestrata(dataweretakenfrom
[5]).Genelocationsaremappedascrossesintheportraits.Sexualmarkergenesweretakenfrom
[75].TheredarrowsindicateupregulatedspotE(inammation).Analysisoftelomere-maintenance
mechanismsincludesviolinplotsofthepathwaysignalow(PSF)activityoftelomerase(T)and
alternative(A)telomeremaintenanceusingTMpathwaysfrom[76],abiplotofT-andA-TMPSF
valuesandtheirstraticationaccordingtothefourquadrants(a,t…lowvalues,A,T…highvalues),
similarityplot,meanportraits,andOS-curves.
3.7.PrognosticMapsProvideanHRScorealongtheEMTAxisRelatedtoTreatment
Weusedthesurvivaldatafromthecohorttogenerateprognosticmaps,whichdis-
playthehazardratio(HR)foreachpixel(metagene)inaredtobluecolorscale.TheHR
activestatuswasestimatedbyapplyingtwothresholdseingsforoverexpression:either
largerthanthemeanoronestandarddeviationawayfromthemeanofmetageneexpres-
sion(Figure7a,band[11]fordetails).Inferiorprognosis(indicatedbyredareasinthe
map)isassociatedwiththeupregulationofgenesinandaroundspotAandtheprocessing
ridgeinthemiddleofthemap.Theseoverexpressionstatesarepredominatelyfoundin
LMS1.Goodprognosis(blueareas)isfoundaroundspotE,associatingwithLMS5,and
alsospotB,C,andDupregulatedinLMS2-4.Adetailedcomparisonoftheprognostic
mapswiththeoverexpressionmapsrevealsthatmetagenesofmaximumandminimum
HR(HRmaxandHRmin,respectively;forlistsofgenesseeSupplementaryTable S4)
slightlydeviatefromtheoverexpressionspot-areasofmaximumexpression,particularly
ofspotAandE,respectively.Eachofthosemetagenescontainabout80transcripts(red
andbluetrianglesinFigure7a).TheirprolesupregulateinLMS1(maxHR)andLMS5
(minHR),respectively(Figure7b).Wedenedaprognosticscorebycalculatingthelog-
expressiondierence(HR=maxHR−minHR),rankedtheCRLMwithincreasingHR-
score,andcalculatedtheOS-survivalcurvesforthehigh-andlow-riskgroups(topand
Figure 6.
Subtypes, selected clinical characteristics, somatic mutations, and telomere maintenance
pathway activation. The respective cases were shown in sample space as similarity nets, as mean
portraits averaged over the respective samples, and in terms of prognosis using overall survival (OS)
curves. The table lists the percentages of CRLM in the respective strata (data were taken from [
5
]).
Gene locations are mapped as crosses in the portraits. Sexual marker genes were taken from [
75
].
The red arrows indicate upregulated spot E (inflammation). Analysis of telomere-maintenance
mechanisms includes violin plots of the pathway signal flow (PSF) activity of telomerase (T) and
alternative (A) telomere maintenance using TM pathways from [
76
], a biplot of T- and A-TM PSF
values and their stratification according to the four quadrants (a, t
. . .
low values, A, T
. . .
high values),
similarity plot, mean portraits, and OS-curves.
Cancers 2023,15, 3835 15 of 31
Extrahepatic disease accompanying CRLM and synchronous metastasis (within 6 months
of CRC diagnosis) are both associated with an average upregulation of spot E (inflamma-
tion). KRAS and TP53 somatic mutations associate with the expression changes of spot
B, which harbors both genes in its neighborhood, acting in opposite directions, however,
namely in up- and downregulation, respectively, which is in accordance with the major
functions of KRAS as oncogene and of TP53 as a tumor suppressor. These mutations, as
well as that of NRAS, on average activate spot E and thus the endothelial immune respon-
sive LMS5 and, on the other hand, associate with the downregulation of proliferation and
CIN characteristics (spot C and D, respectively) and of the epithelial signature (spot A)
for mutated KRAS. The mean LMS5/spot E characteristics of CRLM with mutated KRAS
and/or TP53 align with the increased mutation frequencies of these genes along a hypoxia
axis that associates with the activation of spot E [3].
Finally, we compared the pathway signal flow (PSF) activation values of the telomere
maintenance pathways via telomerase-dependent (TEL) and alternative (ALT) mechanisms
(Figure 6below, [
76
,
77
]). The ALT-versus-TEL activity biplot shows that the double-low
quadrant “at” (a
. . .
ALT-low, t
. . .
TEL-low) enriches LMS5 with the marked upregulation
of spot E and downregulated proliferation of spot C. The “At” quadrant (A
. . .
ALT-high,
TEL-low) associates with activated proliferation (spot C), while the “aT”-quadrant up-
regulates the plasticity plateau. Poor and better prognoses associate with “at” and “At”
quadrants, respectively, i.e., with the vertical ALT-axis. Interestingly, the high-ALT “At”
quadrant and the high TEL quadrant “aT” associate with the downregulation of ATRX
and the upregulation of TERT, respectively (see gene locations marked by crosses in the
At and aT portraits), which are also known to switch-on ALT and TEL TM in other cancer
types, such as gliomas [
78
]. In summary, the stratification of CRLM according to selected
clinical items, the mutations of key genes as well as TM, reveals a dualism of expression
between LMS5-like and non-LMS5-like patterns with activated spot E versus spot A and C,
respectively, which reflects the overall antagonism between mesenchyme/inflammatory
and proliferative/metabolic cellular functions where, e.g., mutated (TP53,KRAS) CRLM
are more prone to the former states.
3.7. Prognostic Maps Provide an HR-Score along the EMT-Axis Related to Treatment
We used the survival data from the cohort to generate prognostic maps, which display
the hazard ratio (HR) for each pixel (metagene) in a red to blue color scale. The HR active
status was estimated by applying two threshold settings for overexpression: either larger
than the mean or one standard deviation away from the mean of metagene expression
(Figure 7a,b and [
11
] for details). Inferior prognosis (indicated by red areas in the map) is
associated with the upregulation of genes in and around spot A and the processing ridge
in the middle of the map. These overexpression states are predominately found in LMS1.
Good prognosis (blue areas) is found around spot E, associating with LMS5, and also spot
B, C, and D upregulated in LMS2-4. A detailed comparison of the prognostic maps with
the overexpression maps reveals that metagenes of maximum and minimum HR (HRmax
and HRmin, respectively; for lists of genes see Supplementary Table S4) slightly deviate
from the overexpression spot-areas of maximum expression, particularly of spot A and E,
respectively. Each of those metagenes contain about 80 transcripts (red and blue triangles
in Figure 7a). Their profiles upregulate in LMS1 (maxHR) and LMS5 (minHR), respectively
(Figure 7b). We defined a prognostic score by calculating the log-expression difference
(
HR = maxHR
minHR), ranked the CRLM with increasing
HR-score, and calculated
the OS-survival curves for the high- and low-risk groups (top and bottom 50% or 25% of
CRLM), which resulted in OS-curves resembling those between LMS1 and the other three
subtypes (compare with Figure 6).
Substituting the genes taken from the maxHR and minHR metagenes by genes of
spot A and spot E results in similar OS-curves, indicating that limiting the prognosis of
CRLM patients is well reflected by the subtypes LMS1 (most inferior) and LMS5 (better).
Key genes frequently mutated and/or deregulated in CRC and CRLM, including APC,
Cancers 2023,15, 3835 16 of 31
FBXW7,SMAD4,PIK3CA,WNT5A, and ATRX, are found adjacent to the minHR metagene,
suggesting that their upregulation is associated with better prognosis (compare Figure 7a,
left map and Figure 1c). Further, the minHR area enriches genes of the set “Golgi cis-
cisternae” (gene ontology (GO) cellular component) playing a role in sustaining invasion
and metastasis such as the intracellular signaling platform, lipid biosynthesis, protein se-
cretion, the formation of extracellular vesicles, and the overall supporting of the adaptation
of cancer cells to acquire a mesenchymal-like migratory phenotype via extracellular matrix
remodeling ([
56
] and references cited therein). The maxHR regions are enriched in genes of
the epithelial functional context and bad prognosis in cholangiocarcinoma [
79
] as well as
nasopharyngeal- [
80
] carcinomas (Supplementary Figure S5). A plot along the
HR score
(after averaging and smoothing) indicates that HR increases, on average, only at larger
values of the score, which are enriched with LMS1 cases (Figure 7d).
The expression profiles of gene signatures related to neoadjuvant chemotherapy and
treatment resistance in CRC and CRLM show activation in LMS5 and partly in LMS1
suggesting a treatment- and metastasis-driven shift of function towards an immuno-
suppressive phenotype (Figure 7e, left part) [
6
]. A detailed inspection of the signature
profiles and their gene maps reveals close similarity with those of the hallmark set “hy-
poxia” which was recently identified as a driving process in CRC liver metastasis and
chemotherapy-treatment resistance (Figure 7e, right part) [
3
]. The hypoxia signature genes
distribute over spots E and A, where the latter hosts the key regulator SLC2A, which
promotes immunosuppression via M2-macrophage enrichment combined with immune
escape via specific CD4+ T-cell exhaustion [
3
] (Figure 7e, right part). The virtually identi-
cal profiles and maps of the hallmark genes “epithelial-mesenchymal transition” (EMT)
and “angiogenesis” reflect the close relations between these functions: both blood vessel
formation (angiogenesis) [
81
] as well as senescence in a secretory cell phenotype [
82
] fa-
cilitate metastasis and treatment resistance. The respective marker genes accumulate in
and around spot E together with immuno-suppressors and hallmark “EMT” genes thus
indicating co-regulation. Interestingly, the PIEZO1 gene, coding a mechano-sensitive sensor
membrane-channel protein also locates in the endothelial/CAF-rich left region of spot E.
The mechano-sensitivity of tumor and of TME-cells such as macrophages is important
for their ability to transduce mechanical forces into biochemical signals associated with
signaling pathways involved in cancer metastasis and EMT, such as angiogenesis, cell
migration, intravasation, and proliferation [
83
85
], as well as cytokinesis and endosome
trafficking [
86
]. The proper processing of mechanical stress is an essential function to
master the changing cell architecture and extracellular-matrix (ECM) organization along
the EMT-path [87].
In conclusion, the prognostic maps divide the expression landscape into regions of
better and worse prognosis, which are associated with the functional context of the affected
spots assigned above. These maps enable the extraction of a prognostic expression score
along the EMT-axis between spots A and E and their adjacent regions. Treatment resistance,
immune evasion, and a series of EMT-related processes such as hypoxia, angiogenesis,
senescence, and micro-mechanic sensitivity associate with neoadjuvant treatment and the
formation of an immuno-suppressive, mesenchymal state of CRLMS.
3.8. Intra-Patient Heterogeneity of Metastasis
More than one CRLM samples were collected from nearly fifty patients at different
time-points, [
5
] providing a set of multiple CRLM (mCRLM) which enables a glimpse of the
intra-patient mCRLM heterogeneity. A river flow plot links the intra-patient mCRLM with
their LMS and segment classes revealing no unique assignment in many cases (
Figure 8a
).
For an alternative perspective, we applied the continuous
HR prognostic score for het-
erogeneity analysis in terms of a mean-value ranked plot of the mCRLM (Figure 8b). Its
slope and vertical spread closely resemble the respective plot of the individual CRLM
(compare with Figure 7c). The standard deviation of
HR for each mCRLM overall forms
Cancers 2023,15, 3835 17 of 31
a relatively narrow range of intra-patient variability (see box-heights or separate plot in
Figure 8b below).
Cancers2023,15,xFORPEERREVIEW18of34
Figure7.Prognosticmapsandscoresandtheirrelationtotreatmentandtheepithelial–mesenchy-
maltransition(EMT).(a)Theprognosticmapcolorcodesassociationsbetweenmetageneexpression
andthehazardratio(HR)relativetotheoverallsurvivalaveragedoverallCRLMinred(highex-
pression,highHR)toblue(highexpression,lowHR).Theleftmapreferstoalowerandtheright
maptoahigherexpressionthreshold.MetagenesofmaximumandminimumHR(HRmaxand
HRmin,respectively)areindicatedbyredandbluetrianglesintheleftmap.(b)Prolesofthe
HRminandHRmaxmetageneexpression.Theoverallsurvival(OS)-curvesofpatientswithacti-
vatedandnon-activatedexpressionarevirtuallymirrorsymmetricalforHRminandHRmaxre-
gardinggoodandinferiorprognoses.(c)ProlesofgenesfromHRmaxandHRminmetagenesare
combinedintoaHRscore.PositiveandnegativeHRvalueswerestronglyenrichedwithinLMS1
andLMS5CRLMs.A25-percentileselectionalongthescoreprovidesslightlymorediscriminative
OScurvesthanthe50-percentileselectionintogoodandbadprognosisCRLM.Usinggenesofspots
AandEforthescore(spot)providessimilarresults.(d)TheHR-versus-(HR-)scoreplotreveals,
onaverage,anon-linearrelationwhereHRincreasesvirtuallyonlyatlargerscorevaluesatin-
creasedLMS1content.(e)Genesetanalysissupportsthefunctionalinterpretationofneoadjuvant
chemotherapytreatmentresistanceandbadprognosisintermsofEMT:theheatmapconsiders
genespromotingmetastasisviaanangiogenicmechanism[88],“jackpotgenesoftreatmentre-
sistancedrivenbyepigenetics[89],genesupregulatedafterFOLFIRItreatment[90],andgenesup-
ordownregulatedafterchemotherapyofCRLMpatientsandinvitro5-Fluorouracil(5-FU)[6].x–y
biplotsofthetreatmenteect(x—downregulatedgenes,y—upregulatedgenes)showashiftto-
wardsimmunosuppressiveLMS5.Theup-prolesandgenemapsresemblethoseofthehallmark
hypoxia[36],promotingviciousandimmuno-suppressivemetastasis[3].HallmarkgenesofEMT-
andangiogenesisaccumulateinspotEtogetherwithimmunecheckpointinhibitors[81]andsenes-
cencegeneswhichfacilitateresistanceagainstimmunotherapy[82].
3.8.IntraPatientHeterogeneityofMetastasis
MorethanoneCRLMsampleswerecollectedfromnearlyftypatientsatdierent
time-points,[5]providingasetofmultipleCRLM(mCRLM)whichenablesaglimpseof
theintra-patientmCRLMheterogeneity.Ariverowplotlinkstheintra-patientmCRLM
withtheirLMSandsegmentclassesrevealingnouniqueassignmentinmanycases
Figure 7.
Prognostic maps and scores and their relation to treatment and the epithelial–mesenchymal
transition (EMT). (
a
) The prognostic map color codes associations between metagene expression and
the hazard ratio (HR) relative to the overall survival averaged over all CRLM in red (high expression,
high HR) to blue (high expression, low HR). The left map refers to a lower and the right map to
a higher expression threshold. Metagenes of maximum and minimum HR (HRmax and HRmin,
respectively) are indicated by red and blue triangles in the left map. (
b
) Profiles of the HRmin
and HRmax metagene expression. The overall survival (OS)-curves of patients with activated and
non-activated expression are virtually mirror symmetrical for HRmin and HRmax regarding good
and inferior prognoses. (
c
) Profiles of genes from HRmax and HRmin metagenes are combined into
a
HR score. Positive and negative
HR values were strongly enriched within LMS1 and LMS5
CRLMs. A 25-percentile selection along the score provides slightly more discriminative OS curves
than the 50-percentile selection into good and bad prognosis CRLM. Using genes of spots A and E for
the score (
spot) provides similar results. (
d
) The HR-versus-(
HR-)score plot reveals, on average,
a non-linear relation where HR increases virtually only at larger score values at increased LMS1
content. (
e
) Gene set analysis supports the functional interpretation of neoadjuvant chemotherapy
treatment resistance and bad prognosis in terms of EMT: the heatmap considers genes promoting
metastasis via an angiogenic mechanism [
88
], “jackpot” genes of treatment resistance driven by epi-
genetics [
89
], genes upregulated after FOLFIRI treatment [
90
], and genes up- or downregulated after
chemotherapy of CRLM patients and
in vitro
5-Fluorouracil (5-FU) [
6
]. x–y biplots of the treatment
effect (
x—downregulated
genes, y—upregulated genes) show a shift towards immunosuppressive
LMS5. The up-profiles and gene maps resemble those of the hallmark hypoxia [
36
], promoting vicious
and immuno-suppressive metastasis [
3
]. Hallmark genes of EMT- and angiogenesis accumulate
in spot E together with immune checkpoint inhibitors [
81
] and senescence genes which facilitate
resistance against immunotherapy [82].
Cancers 2023,15, 3835 18 of 31
Figure 8.
Intra-patient heterogeneity of CRLM: (
a
) Flow diagram of multiple CRLM (mCRLM) from
the same patient towards LMS and segment classifications to the left and to the right, respectively.
(
b
) Mean
HR score of the mCRLM averaged per patient and ranked from the left to the right. The
boxes refer to the standard deviation (SD) around the median per CRLM and enlarged plot of the
standard deviation. (
c
) SOM portraits of the mCRLM per patient sorted by part (
b
) (see patient no.
and arrows). The color bars assign the LMS.
The overall inter-patient heterogeneity of the prognostic score exceeds the intra-patient
heterogeneity by roughly one order of magnitude, considering the change in
HR across
its linear range of the plot (
∆∆
HR ~1) as a measure of the former and the mean SD (~0.1)
as a rough estimate of the latter effect. About 50% of the mCRLM from the same patient
belong to different subtypes [
5
], a result that suggests a relatively large intra-patient het-
erogeneity in contrast to our estimation. Inspection of the individual portraits (with more
than three mCRLM per patient) reveals that part of the mCRLM of the same patient with
different LMS memberships show very similar expression patterns (see, e.g., pat. 467,
141, 61, 47, Figure 8b). Moreover, the distinction between LMS2–LMS3 is relatively uncer-
tain, which then also applies to the heterogeneity estimation based on LMS-membership
(Figures 1d and 2c).
On the other hand, the SD increases towards the end of the
HR scale. Selected
portraits from these regions (e.g., pat. 503 from the left and pat. 89, 16 from the right
side) show relatively diverse patterns despite their unique LMS assignment, suggesting a
relatively high variability of the underlying transcriptional states, particularly in the low-
expression epigenetic effects. Notably, the portraits of the right, high inferior prognosis side
reveal an enrichment of CRLM with activated “plasticity plateau” referring to high (red)
HR areas in the prognostic map (Figure 7a). In summary, the transcriptomes of mCRLM
from the same patient are relatively homogeneous based on the prognostic
HR score and
Cancers 2023,15, 3835 19 of 31
its variability. mCRLM of inferior prognosis is variant regarding their epigenetic context
showing either the activation of spot A (epithelium) and/or of the plasticity plateau.
4. Discussion
4.1. Subtyping Stratifies CRLM along Cancer Hallmarks and TME Education
We reanalyzed transcriptome data of liver metastases using SOM portrayal, a machine
learning method that generates individual images of each tumor sample, reduces the
dimensions of the transcriptome-wide expression data, and enables straightforward and
intuitive downstream analysis [
21
]. The previously identified five LMS [
5
] were used to
provide subtype-specific SOM portraits revealing six major modules of co-regulated genes.
These related to molecular hallmarks of cancer, namely avoiding immune destruction (spot
A), tumor-promoting inflammation and angiogenesis (spot E), sustaining proliferation
(spot C), deregulating cellular energetics, metabolic activity and DNA repair (spot B),
and CIN-related genomic instability (spot D) (Figure 9) [
91
]. The spot-genes provide
robust signatures for classifying the LMS, meeting a trade-off criterion of balancing the
interpretability and specificity of molecular markers [92].
The functional characteristics of the LMS align with known molecular CRC subtypes,
suggesting that metastasis, at least partly, maintains characteristics of primary tumors,
which could inform subtype-targeted therapy for both metastatic and non-metastatic tu-
mors. A high concordance of genetic key lesions, mutations, and CNV between primary
and metastatic tumors supports this observation. This suggests that there is minimal impact
of the liver microenvironment on CRLM mutation patterns, but rather neutral evolution
mechanisms of early metastasis due to the high fitness of the metastatic clones [
3
,
27
]. On
the other hand, the liver facilitates metastatic expansion from other organs because of its
complex immune system that dampens immunity to cancer-related neoantigens, promotes
immune editing, and changes TME-characteristics of the metastatic compartment com-
pared with the primary tumors [
7
,
93
,
94
] thus offering a tumor-supporting (pre-)metastatic
niche [
95
]. Interestingly, a recent PanCancer metastasis study on 4000 metastatic tumors
across thirty-two cancer types taken from the cancer genome atlas (TCGA) reported four
major molecular groups of metastasis (s1-4) [
10
] partly associating with the function of our
spot modules, namely proliferation, DNA repair and CIN (spot C and D) in s1, metabolism
(spot B and C) in s2, and inflammation and immunity (spot E) in s4. The epigenetic subtype
s3 can be related to the ”plasticity plateau” and, partly, spot B-accumulating PRC2 targets
observed across all LMS with elevated expression. Extensive drug testing in cell lines has
suggested therapeutic options for s1 (e.g., targeting MYC) and s3 (e.g., targeting histone
acetylation and methylation as well as TERT and EZH2). Potential treatment options for
primary CRC were recently reviewed [
96
] and include immune and adjuvant therapy de-
pending on the CIMP-status for CMS1 (and possibly LMS1), targeted treatment to MAPK-
and WNT-pathways, MYC-expression or glycolysis metabolism for CMS2 and/or CMS3
(and possibly LMS2-4), TGFbeta, and also cancer stemness inhibitors for CMS4 (and possibly
LMS5). We also studied telomere maintenance pathways in CRLM and found decreased
activity in both TEL and ALT TM in LMS5, but increased ALT, which directly correlates
with the upregulation of spot C (proliferation, stemness). Changes in TM activity are a
hallmark of cancer modulation in CRC [
77
] and are suggested as markers of response
to therapy [97].
Our analysis identified factors driving CRLM characteristics towards the immune-
suppressive spot-module E, namely accompanying extrahepatic disease(s), synchronous
metastasis, and TP53,NRAS, and KRAS mutations, where the frequency of these mutations
is reported to increase in immuno-suppressive immunotypes expressing spot E in our
data [
3
]. Moreover, neoadjuvant chemotherapy also induces a shift of CRLM and of derived
tumor cell cultures towards a mesenchymal phenotype [
5
,
6
], which is supported by the
LMS5-resemblance of the extracted expression signature. However, it is important to note
that cancer therapies, particularly platinum-based/5-FU therapies, can cause significant
changes in the tumor genome, and introduce an evolutionary bottleneck that selects for
Cancers 2023,15, 3835 20 of 31
known therapy-resistant drivers [
98
] and induces mutational burdens exceeding aging-
related mutations accumulating in the same time-span by several orders of magnitude [
99
].
However, the impact of this mutational “toxicity” on tumor development is not entirely
clear. Early CRC drivers were found to be enriched in CRLM, which can harbor private
mutations (PTPRT, and to a lesser degree, AMER1 and TCLF1), also suggesting stringent
evolutionary selection mechanisms for the part of the CRLM [
27
] that is possibly governed
by treatment [25].
Cancers2023,15,xFORPEERREVIEW23of34
Figure9.Aholisticviewoncolorectalcancerlivermetastasis:molecularheterogeneity,tumormi-
croenvironment,geneticandepigeneticmodesofregulation,prognosis,tumordevelopment,and
possibletreatmentoptions.Seediscussion.
4.2.TrajectoryInferenceandPersonalizedAnalysisDiscoveraHiddenUniverseofContinuous
EpigeneticStatesShapingMetastasis
LMS1-5subtypeswereidentiedbyclassdiscoveryusingnon-negativematrixfac-
torizationfollowedbyclusternumberoptimizationbasedongeneswithhighspecicex-
pressionlevelsacrossthesubtypes[5].Consequently,theLMS-specicspotmarkerswere
biasedtowardshighlyvariantgenes,whichcanunintentionallyneglectlesservariantgene
clusters.Ourpersonalizedlandscape-approachconsidersindividualportraitsbeyondthe
LMS-strataandthereforeextractsadditionalfeaturesappearingasweaklyvariantspot
Figure 9.
A holistic view on colorectal cancer liver metastasis: molecular heterogeneity, tumor
microenvironment, genetic and epigenetic modes of regulation, prognosis, tumor development, and
possible treatment options. See discussion.
Cancers 2023,15, 3835 21 of 31
Genetic alterations and chromosomal instability are necessary but insufficient for
cancer initiation, progression, and metastasis. Instead, tumors, and particularly CRLM,
are rather complex ecosystems involving cancer cells and the TME [
94
]. Our analysis
suggests an interaction between the TME and stroma-mediated immune suppression and
treatment resistance during CRLM development. Using a series of TME-related signatures,
as well as the single-cell deconvolution of bulk expression data, we found indications of an
immunogenic pro-tumoral and tumor-suppressive TME in LMS1 and LMS5, respectively,
and of predominantly immune desert properties in LMS2-4. However, part of the CRLM of
the immune-desert subtypes show a medium-level expression of spot E, thus also reflecting
moderate inflammatory, immuno-suppressive characteristics in these LMS. We found that
spot E overlays signatures of cancer-associated fibroblasts, of the stroma and of different
immune cells such as B-, plasma, pDC, and mast cells reflecting the overall considerable het-
erogeneity and changing composition of the TME-cell communities not explicitly resolved
in the LMS stratification. This suggests the education of the TME towards stroma-mediated
immune suppression and treatment resistance upon CRLM development [
100
,
101
]. Further,
spot E co-expresses with a series of immune checkpoint inhibitors such as CTLA4,PDL1,
and TIM3, whose increased expression associates with T-cell exhaustion, macrophage re-
education from M2 towards M1, and decaying CD8+ expression in agreement with [
3
,
7
,
102
].
The central role of tumor-associated macrophages (TAMs) in the TME is substantiated
by the evolutionary consequences TMA impose through positive or negative selective
pressure exerted by/on tumor cells [
35
]. TAM complexity is paving a possibly way for
the design of novel TAM-targeting therapies to harness their anti-tumoral potential [
103
],
e.g., through CCR5-inhibitor treatment reprogramming TAMs toward a pro-inflammatory
phenotype in metastatic CRC patients [
104
]. Overall, transcriptomic portrayal confirms
the previous subtyping of CRLM, deepens the understanding of their functional context,
provides marker signatures for the LMS, and delineates the remodeling of the TME from
an immuno-active to an immuno-suppressive state.
4.2. Trajectory Inference and Personalized Analysis Discover a Hidden Universe of Continuous
Epigenetic States Shaping Metastasis
LMS 1-5 subtypes were identified by class discovery using non-negative matrix fac-
torization followed by cluster number optimization based on genes with high specific
expression levels across the subtypes [
5
]. Consequently, the LMS-specific spot markers
were biased towards highly variant genes, which can unintentionally neglect lesser variant
gene clusters. Our personalized landscape-approach considers individual portraits beyond
the LMS-strata and therefore extracts additional features appearing as weakly variant
spot modules not detected in the LMS-centered analysis, akin to a “dark matter” of gene
expression (in analogy to the not-visible matter in the universe).
Notably, our approach reveals the activation of regulatory programs with epigenetic
impact such as PRC2-targetted genes with poised and repressed promoters in a healthy
colon which change their promoter status in CRLM through chromatin remodeling [
17
].
We found that these epigenetic modes antagonistically regulate compared with a part of the
high-expression programs including, e.g., genes with active promoters governing functions
such as cell cycle, DNA repair, and oxphos metabolism. The activation of such epigenetic
modes is associated with the ability of cancer cells to acquire plasticity regarding their fates.
It enables them to transit between high-expression states and low-expression states, such
as along the epithelial–mesenchymal transition axis taking place between LMS1 and LMS5.
We observed such epigenetic states in all subtypes, which suggests that all of them are
prone to fate decisions via plasticity-driven changes in their state.
The balancing between active and proliferative genes on the left, and repressed, plastic
chromatin states on the right enables evolutionary adaptation to the TME and drives not
only the adenoma-to-carcinoma transition of CRC but also promotes metastasis, e.g., via
the EMT [
45
]. Components of the PRC2 such as EZH2 and SUZ12 (located in spot C) can
facilitate these transitions and promote CRC stem-like cells (as indicated by the stemness
Cancers 2023,15, 3835 22 of 31
marker LGR5 in spot C, see below) via repressive interactions with their targets (located in
the plasticity plateau) in concert with changes to histone and DNA methylation in CIMP
genes. Notably, the bimodality between active and repressed chromatin states and their
association with proliferative and immunogenic states was described for other tumors
such as lymphomas [
92
] and gliomas [
11
]. In a more general sense, the interplay between
high-expression-genetic and low-expression-epigenetic states, as identified in the CRLM
expression landscape, is a key condition for the reorganization of transcriptional programs
that facilitate tumor development via the emergence of new attractor states better adapted
to the changing TME [
8
]. Epigenetic mechanisms also affect antitumor and protumor
immunity and the TME immune and CAF cell composition, thus facilitating immune state
transitions during tumor development [
103
]. An interesting argument supporting the role
of epigenetics in metastasis comes from the result that mutational patterns of passenger
mutations in metastatic tumors outperform those of driver mutations in predicting their
primary tumor of origin, presumably because the regional density of somatic passenger
mutations reflects chromatin accessibility to DNA-repair complexes, which in turn relates
to the epigenetic state of the cancer cell [105].
Trajectory inference revealed possible developmental paths from epithelial (as seen in
LMS1) to mesenchymal (in LMS5) characteristics, which could lead to enhanced metastatic
fitness through an interplay of genetic and epigenetic factors [
106
]. These trajectories are
governed by continuous alterations in spot expressions suggesting that the LMS subtypes
are “connected” through a continuum of intermediate states. This continuum combines the
“archetypic” LMS characteristics and epigenetic regulatory modes such as the plasticity
plateau with the LMS spots in diverse ways and with varying degrees of influence. Further,
the transition to metastasis is thought to be facilitated by the presence of proliferative
CRC cancer stem cells (CSCs). These CSCs constitute presumably a small population of
highly tumorigenic cells, possessing pluripotency and self-renewal properties that drive
metastasis and treatment resistance. This process is sustained by a dynamic TME [
107
].
Complex interactions between the TME and CSC not only maintain stemness but also fuels
tumor evolution into aggressive, invasive, migratory phenotypes [100], a process thought
to be under the influence of epigenetic control [
108
]. The CSC marker, LGR5, also a marker
for normal intestinal stem cells, suggests that the stem-cell program in the colon could
be conserved in CRC [
106
,
109
] and CRLM. Additional factors such as CRC-typical CIN
(e.g., in spot D), which boost metastasis, amplify oncogenic signaling, leading to more
aggressive cancer clones, punctuated tumor evolution [
110
], and immunoediting in the
TME context [111].
The oscillation between hallmark-related archetypic genetic and plastic epigenetic
states suggests that high-transcription activity stabilizes the genetic states, while low-
transcription promotes transitions in response to developmental or environmental cues [
112
].
Open chromatin states, associated with high expression, may inhibit differentiation and pro-
mote tumor growth via proliferative, metabolic, and CIN hallmarks, primarily in LMS2-4.
Conversely, permissive or “plastic” states may allow cell fate transitions, providing a fitness
advantage along the EMT-trajectory from Seg1.1 (mainly LMS1) to Seg3 (mainly LMS5)
of the monocle tree. We recently showed in melanoma cell lines that targeted treatment
can shift cell physiology from high-transcription to plastic, low-transcription epigenetic
states [
15
]. In summary, combining trajectory inference with Self-Organizing Map (SOM)
portrayal reveals a continuum of transcriptomic states that divides into archetypic hallmark-
states and intermediate transition states driven by epigenetic plasticity and characterized
by diverse combinations of these characteristics.
4.3. Towards Precision Diagnostics and Treatment Decisions of CRLM
This dual continuous and plastic characteristic of CRLM heterogeneity necessitates a
fine granular diagnostic scheme that surpasses the somewhat generalized LMS stratification.
Our portrayal method provides unique molecular images of each tumor, enabling per-
sonalized diagnosis through the intuitive visual interpretation of spot patterns, which are
Cancers 2023,15, 3835 23 of 31
interpretable in terms of activated cellular programs. Hence, these personalized portraits
can be evaluated by experts in a diagnostic pipeline to assess the specific transcriptional
state of a CRLM sample. Such “manual” inspection of molecular portraits could be au-
tomated, for example, through neuronal network machine learning, in future stages of
technical development (see [
92
] for a proof of principle application to another cancer type).
Moreover, transcriptomic maps can be transformed into prognostic landscapes visualiz-
ing the association between gene expression, cellular programs, and their corresponding
HR. To adequately account for the continuous nature of CRLM heterogeneity within the
transcriptomic landscape, we applied a one-dimensional HR-score along the EMT axis as a
proof of principle solution that links the best and worst HR metagenes in the landscape.
This scoring method can be extended to multidimensional molecular coordinates, as identi-
fied by the monocle-tree analysis or by combining different spots for prognostics within a
coordinate system of continuous scores [113].
Another implication of the continuous developmental trajectories of CRLM, which
are governed by epigenetics, is that treatment will require molecular surveillance and
adaptation to evolving states. The dynamic cellular composition and functional character-
istics of the immune landscape along the trajectory of cancer development are expected
to impact therapeutic efficacy and clinical outcomes. Targeting epigenetic modifiers to
remodel the tumor-immune microenvironment holds great potential as an integral part of
anticancer regimens [103].
The so-called oligometastatic hypothesis provides an alternative perspective on the
development and treatment of metastasis, suggesting that the early and aggressive treat-
ment of metastases may prevent the further spread of cancer and even achieve long-term
remission [
104
]. When applied to CRLM, it shows that a canonical, proliferative subtype
resembling LMS3-4, with an elevated CIN-signature score [
114
] of genes accumulating in
and around spot C is associated with increased metastatic propensity, and inferior overall
survival [
115
117
] (Supplementary Figure S5d). Surprisingly, a specific subgroup of these
CRLM, also with a high CIN-score, exhibits the opposite effect. They demonstrate a thera-
peutic vulnerability to DNA-damaging therapies, leading to improved treatment responses
and low clinical risk with a 10-year survival rate of 95% after resection [
116
]. However,
the challenge remains how to differentiate high-CIN-score tumors likely to respond to
DNA-damaging therapies from those unlikely to respond. Our results show that CRLM
with an upregulated CIN-score (spot C, proliferation up) widely distributed over LMS1–
LMS4 with a broad spectrum of functional and genomic characteristics associated with
activated proliferation. This provides a basis for further studies to distinguish responders
from non-responders to DNA-damaging therapy.
4.4. Limitations and Open Points
Our analysis has several key limitations. Firstly, it does not directly consider patient-
paired CRLM and primary CRC, making it unclear how the molecular characteristics of the
primary CRC are maintained after distant spread. While the mapping of CRC signatures
suggests strong parallels, this remains an area requiring further clarification. On the other
hand, the analysis of multiple CRLM based on the metagene HR-score suggests small
intra-patient variability that indirectly implies a relatively homogeneous metastatic spread.
Secondly, our analysis is based on a cross-sectional cohort of bulk tumor samples
without longitudinal follow-up data on tumor development over time. However, the
observed cross-sectional heterogeneity can be interpreted in terms of a series of devel-
opmental CRLM states existing in the different patients along the pseudotime scale,
which simulates developmental dynamics [
44
]. Alternatively, it could reflect the intra-
tumoral CRLM heterogeneity of microscopic lesions all co-existing at the same stage of
tumor progression.
Both interpretations suggest that heterogeneous micro-lesions may change their com-
position along developmental trajectories of metastasis. This view is supported by spatial
transcriptomics of CRLM, which show that cellular neighborhood archetypes reflecting
Cancers 2023,15, 3835 24 of 31
different stages of tumor progression and resembling our LMS-types are active simulta-
neously in different spatial microregions of the same tumor [
118
]. Other studies have
identified distinct modes of micro-vessel vascularization, specific metabolic alterations, and
aWNT-signaling signature with a prognostic impact co-existing in the same CRLM [
119
].
Interestingly, a recent spatial transcriptomics study of synchronous resections of primary
CRC and matched CRLM suggests that the stromal-versus-inflammatory balance of the
invasive edge of CRLM samples impacts prognosis. Specifically, it reported that an increase
in LMS5-like features (increased Tregs and stromal composition) compared with inflam-
matory ones is associated with a worse prognosis [
120
]. Another study along this line
suggested that observable histopathological growth patterns (HGPs) can be interpreted
in terms of molecular mechanisms [
121
]. These patterns differentiate between LMS5-like
lesions (desmoplastic HGP, enriched in EMT, angiogenesis, stroma, and immune signa-
tures) and lesions resembling LMS3-4 (replacement HGP, enriched in metabolism, cell
cycle, and DNA damage repair signatures), as well as combinations of both. This implies
that microscopic imaging in combination with spatial transcriptomics can support molec-
ular diagnostics. Together, this data suggests that the heterogeneity of liver metastases
appears to distribute on a microscopic scale, both spatially and temporally. This implies
that different LMS-states and cellular TME-communities coexist in the same tumor and
change their composition upon development, usually towards immune-suppressive and
treatment-resistant states. Further investigation into these dynamics is necessary to fully
understand their implications for treatment and prognosis.
5. Conclusions
Machine learning using an omics portrayal of CRLM provides a comprehensive and
detailed understanding of the molecular heterogeneity underlying this complex disease.
Our study revealed a shift from treatment-sensitive to treatment-resistant tumors that are
guided by genetic, epigenetic, and microenvironmental factors. Our findings encourage
further studies to better understand the micro-spatial heterogeneity of the CRLM, the
underlying epigenetic mechanisms, the changing cellular communities in the TME inter-
acting with the tumor cells, and also possible genetic determinants. Understanding this
heterogeneity is crucial for tailoring effective treatment strategies, where precision medicine
is one promising approach. It must involve the genomic profiling of individual tumors in
follow-up settings to identify specific alterations and vulnerabilities in space and time to
tackle the diverse manifestations of CRLM and provide better control of disease progres-
sion. This information can guide the selection of targeted therapies, immunotherapies, or
combined treatments.
Supplementary Materials:
The following supporting information can be downloaded at: https:
//www.mdpi.com/article/10.3390/cancers15153835/s1, Figure S1: Browsing the livermets-dataset
using the oposSOM-browser; Figure S2. Gallery of SOM portraits in standard and coastline scales;
Figure S3: Sorting the portraits; Figure S4: Heatmaps of the gene set categories gene ontology
biological process (GO BP) and immunome (taken from [4]); Figure S5: Detailed functional analysis
using gene set maps and profiles; Figure S6: Single cell analysis of CRLM data for cell marker gene
extraction and mapping to CRLM data; Figure S7. Pathway activity analysis of WNT-signaling;
Figure S8: Pseudo-dynamics analysis of CRLM development; Figure S9: Heterogeneity of CAF and
immune cell expression across the LMS; Figure S10. Comparison of different segmentation methods
for visualizing the expression land-scape, extracting expression spot-modules and characterizing
them in terms of spot implications and mutual correlations between them; Table S1. Spot module
characteristics; Table S2: is provided as excel sheet. It contains lists of spot genes A-F; Table S3:
Collection of gene sets specifically implemented in oposSOM in this publication; Table S4: is provided
as supplementary excel sheet. List of genes with maximum and minimum HR, list of delta-HR ranked
CRLM. See also Figure 7c. References [122130] are cited in the supplementary materials. Movie S1:
Tree-trajectory-Seg1-to-Seg3.
Cancers 2023,15, 3835 25 of 31
Author Contributions:
Conception: H.B., A.A., L.N., G.A., D.C. and D.U.; methodology: H.B., A.A.,
M.S., O.A. and N.S.; software: H.L.-W. and M.S.; analysis and interpretation: all authors. All authors
have read and agreed to the published version of the manuscript.
Funding:
A.A.’s contribution to this work was supported by the Science Committee of Armenia
(project number 21AG-1F021).
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement:
An interactive online browser is provided, which provides different
views on the CRLM-data set: https://apps.health-atlas.de/opossom-browser/?dataset=15 and
www.izbi.de (see also Supplementary Figure S1).
Acknowledgments:
The Armenian Bioinformatics Institute (ABI) thanks Agenus Inc. very much
for their support in establishing Cancer Bioinformatics in Armenia. O.A., N.S., and A.K. appreciate
the support of a stipendium paid by Agenus Inc. (Lexington, MA, USA). We acknowledge support
from the German Research Foundation (DFG) and Universität Leipzig within the program of Open
Access Publishing.
Conflicts of Interest:
O.A., N.S., and A.K. were supported by a stipendium paid by Agenus Inc
(Lexington, MA, USA) for their scientific studies. G.A., D.U., D.C., and M.U. were from Agenus
Inc (Lexington, MA), a biotech company with a focus on immunotherapy of cancer, which supports
bioinformatics research in the frame of its support-program for young scientists in Armenia. The
other authors declare no conflict of interest.
Abbreviations
ALT Alternative lengthening of telomeres
AUC Area Under the ROC Curve
CAF Cancer-associated fibroblast
CIN Chromosomal instability
CRC Colorectal cancer
CRLM Colorectal liver metastases
CMS Consensus molecular subtyping
CNV Copy number variations
CIMP CPG island methylation phenotype
EPC Endothelial progenitor cell
ECM Extracellular matrix
FU Fluorouracil
GM Gene module
GO Gene ontology
GINS Gene-interaction-perturbation-network-based subtyping
GPCR G-protein coupled receptor
HR Hazard ratio
HepSC Hepatic stellate cell
HGP Histopathological growth patterns
ICI Immune checkpoint inhibitor
ICA Independent component analysis
KEGG Kyoto Encyclopedia of Genes and Genomes
LIV Liver
LM Liver metastases
LMS Liver metastasis subtype
LSEC Liver sinusoidal endothelial cell
MSI Microsatellite instable
NK Natural killer
OS Overall survival
OC lOverlap coefficient
PSF Pathway signal flow
Cancers 2023,15, 3835 26 of 31
PC Plasma cells
pDC Plasmacytoid dendritic cells
PRC2 Polycomb repressive complex 2
PCA Principal component analysis
PT Pseudotime
ROC Receiver operator characteristic
SOM Self-organizing map
scRNAseq Single cell RNA sequencing
TAF Tumor associated macrophages
TEL Telomerase
TM Telomere maintenance
TCGA The cancer genome atlas
TF Transcription factor
TME Tumor microenvironment
UMAP Uniform manifold approximation and projection for dimension
reduction
UMI Unique molecular identifier
WTO Weighted topological overlap
References
1.
Tsilimigras, D.I.; Brodt, P.; Clavien, P.A.; Muschel, R.J.; D’Angelica, M.I.; Endo, I.; Parks, R.W.; Doyle, M.; de Santibañes, E.;
Pawlik, T.M. Liver metastases. Nat. Rev. Dis. Primers 2021,7, 27. [CrossRef] [PubMed]
2.
Vidal-Vanaclocha, F. The liver prometastatic reaction of cancer patients: Implications for microenvironment-dependent colon
cancer gene regulation. Cancer Microenviron. 2011,4, 163–180. [CrossRef] [PubMed]
3. Yang, S.; Qian, L.; Li, Z.; Li, Y.; Bai, J.; Zheng, B.; Chen, K.; Qiu, X.; Cai, G.; Wang, S.; et al. Integrated Multi-Omics Landscape of
Liver Metastases. Gastroenterology 2023,164, 407–423.e17. [CrossRef] [PubMed]
4.
Liu, J.; Cho, Y.B.; Hong, H.K.; Wu, S.; Ebert, P.J.; Bray, S.M.; Wong, S.S.; Ting, J.C.; Calley, J.N.; Whittington, C.F.; et al. Molecular
dissection of CRC primary tumors and their matched liver metastases reveals critical role of immune microenvironment, EMT
and angiogenesis in cancer metastasis. Sci. Rep. 2020,10, 10725. [CrossRef] [PubMed]
5.
Moosavi, S.H.; Eide, P.W.; Eilertsen, I.A.; Brunsell, T.H.; Berg, K.C.G.; Røsok, B.I.; Brudvik, K.W.; Bjørnbeth, B.A.; Guren, M.G.;
Nesbakken, A.; et al. De novo transcriptomic subtyping of colorectal cancer liver metastases in the context of tumor heterogeneity.
Genome Med. 2021,13, 143. [CrossRef] [PubMed]
6.
Trumpi, K.; Ubink, I.; Trinh, A.; Djafarihamedani, M.; Jongen, J.M.; Govaert, K.M.; Elias, S.G.; van Hooff, S.R.; Medema, J.P.; Lacle,
M.M.; et al. Neoadjuvant chemotherapy affects molecular classification of colorectal tumors. Oncogenesis
2017
,6, e357. [CrossRef]
7.
Wu, Y.; Yang, S.; Ma, J.; Chen, Z.; Song, G.; Rao, D.; Cheng, Y.; Huang, S.; Liu, Y.; Jiang, S.; et al. Spatiotemporal Immune
Landscape of Colorectal Cancer Liver Metastasis at Single-Cell Level. Cancer Discov. 2022,12, 134–153. [CrossRef]
8. Feinberg, A.P.; Levchenko, A. Epigenetics as a mediator of plasticity in cancer. Science 2023,379, eaaw3835. [CrossRef]
9. Waddington, C.H. The Strategy of the Genes; Routledge: Abingdon, UK, 2014.
10.
Zhang, Y.; Chen, F.; Creighton, C.J. Pan-cancer molecular subtypes of metastasis reveal distinct and evolving transcriptional
programs. Cell Rep. Med. 2023,4, 100932. [CrossRef]
11.
Willscher, E.; Hopp, L.; Kreuz, M.; Schmidt, M.; Hakobyan, S.; Arakelyan, A.; Hentschel, B.; Jones, D.T.W.; Pfister, S.M.; Loeffler,
M.; et al. High-Resolution Cartography of the Transcriptome and Methylome Landscapes of Diffuse Gliomas. Cancers
2021
,
13, 3198. [CrossRef]
12.
Loeffler-Wirth, H.; Kreuz, M.; Hopp, L.; Arakelyan, A.; Haake, A.; Cogliatti, S.B.; Feller, A.C.; Hansmann, M.-L.; Lenze, D.; Möller,
P.; et al. A modular transcriptome map of mature B cell lymphomas. Genome Med. 2019,11, 27. [CrossRef]
13.
Kunz, M.; Löffler-Wirth, H.; Dannemann, M.; Willscher, E.; Doose, G.; Kelso, J.; Kottek, T.; Nickel, B.; Hopp, L.; Landsberg, J.; et al.
RNA-seq analysis identifies different transcriptomic types and developmental trajectories of primary melanomas. Oncogene
2018
,
37, 6136–6151. [CrossRef]
14.
Binder, H.; Hopp, L.; Schweiger, M.R.; Hoffmann, S.; Jühling, F.; Kerick, M.; Timmermann, B.; Siebert, S.; Grimm, C.; Nersisyan,
L.; et al. Genomic and transcriptomic heterogeneity of colorectal tumours arising in Lynch syndrome. J. Pathol.
2017
,243, 242–254.
[CrossRef]
15.
Schmidt, M.; Mortensen, L.S.; Loeffler-Wirth, H.; Kosnopfel, C.; Krohn, K.; Binder, H.; Kunz, M. Single-cell trajectories of
melanoma cell resistance to targeted treatment. Cancer Biol. Med. 2022,19, 56–73. [CrossRef]
16.
Hopp, L.; Willscher, E.; Wirth-Loeffler, H.; Binder, H. Function Shapes Content: DNA-Methylation Marker Genes and their Impact
for Molecular Mechanisms of Glioma. J. Cancer Res. Updates 2015,4, 127–148. [CrossRef]
17.
Hopp, L.; Löffler-Wirth, H.; Binder, H. Epigenetic Heterogeneity of B-Cell Lymphoma: DNA Methylation, Gene Expression and
Chromatin States. Genes 2015,6, 812–840. [CrossRef] [PubMed]
18.
Wirth, H.; Löffler, M.; von Bergen, M.; Binder, H. Expression cartography of human tissues using self organizing maps. BMC
Bioinform. 2011,12, 306. [CrossRef]
Cancers 2023,15, 3835 27 of 31
19.
Binder, H.; Wirth, H. Analysis of large-scale OMIC data using Self Organizing Maps. In Encyclopedia of Information Science and
Technology Third Edition; Khosrow-Pour, M., Ed.; IGI Global: Hershey, PA, USA, 2014; pp. 1642–1654.
20.
Wirth, H.; von Bergen, M.; Binder, H. Mining SOM expression portraits: Feature selection and integrating concepts of molecular
function. BioData Min. 2012,5, 18. [PubMed]
21.
Löffler-Wirth, H.; Kalcher, M.; Binder, H. oposSOM: R-package for high-dimensional portraying of genome-wide expression
landscapes on bioconductor. Bioinformatics 2015,31, 3225–3227. [CrossRef] [PubMed]
22.
Che, L.-H.; Liu, J.-W.; Huo, J.-P.; Luo, R.; Xu, R.-M.; He, C.; Li, Y.-Q.; Zhou, A.-J.; Huang, P.; Chen, Y.-Y.; et al. A single-cell
atlas of liver metastases of colorectal cancer reveals reprogramming of the tumor microenvironment in response to preoperative
chemotherapy. Cell Discov. 2021,7, 80. [CrossRef] [PubMed]
23.
Loeffler-Wirth, H.; Reikowski, J.; Hakobyan, S.; Wagner, J.; Binder, H. oposSOM-Browser: An interactive tool to explore omics
data landscapes in health science. BMC Bioinform. 2020,21, 465. [CrossRef]
24.
Guinney, J.; Dienstmann, R.; Wang, X.; de Reynies, A.; Schlicker, A.; Soneson, C.; Marisa, L.; Roepman, P.; Nyamundanda, G.;
Angelino, P.; et al. The consensus molecular subtypes of colorectal cancer. Nat. Med. 2015,21, 1350–1356. [CrossRef] [PubMed]
25.
Mendelaar, P.A.J.; Smid, M.; van Riet, J.; Angus, L.; Labots, M.; Steeghs, N.; Hendriks, M.P.; Cirkel, G.A.; van Rooijen, J.M.; Ten
Tije, A.J.; et al. Whole genome sequencing of metastatic colorectal cancer reveals prior treatment effects and specific metastasis
features. Nat. Commun. 2021,12, 574. [CrossRef] [PubMed]
26.
Huang, D.; Sun, W.; Zhou, Y.; Li, P.; Chen, F.; Chen, H.; Xia, D.; Xu, E.; Lai, M.; Wu, Y.; et al. Mutations of key driver genes in
colorectal cancer progression and metastasis. Cancer Metastasis Rev. 2018,37, 173–187. [CrossRef] [PubMed]
27.
Hu, Z.; Ding, J.; Ma, Z.; Sun, R.; Seoane, J.A.; Scott Shaffer, J.; Suarez, C.J.; Berghoff, A.S.; Cremolini, C.; Falcone, A.; et al.
Quantitative evidence for early metastatic seeding in colorectal cancer. Nat. Genet. 2019,51, 1113–1122. [CrossRef] [PubMed]
28.
Liu, Z.; Weng, S.; Dang, Q.; Xu, H.; Ren, Y.; Guo, C.; Xing, Z.; Sun, Z.; Han, X. Gene interaction perturbation network deciphers a
high-resolution taxonomy in colorectal cancer. eLife 2022,11, e81114. [CrossRef]
29.
Bagaev, A.; Kotlov, N.; Nomie, K.; Svekolkin, V.; Gafurov, A.; Isaeva, O.; Osokin, N.; Kozlov, I.; Frenkel, F.; Gancharova,
O.; et al. Conserved pan-cancer microenvironment subtypes predict response to immunotherapy. Cancer Cell
2021
,39, 845–865.e7.
[CrossRef]
30.
Charoentong, P.; Finotello, F.; Angelova, M.; Mayer, C.; Efremova, M.; Rieder, D.; Hackl, H.; Trajanoski, Z. Pan-cancer Im-
munogenomic Analyses Reveal Genotype-Immunophenotype Relationships and Predictors of Response to Checkpoint Blockade.
Cell Rep. 2017,18, 248–262. [CrossRef] [PubMed]
31.
Zhou, B.; Lawrence, T.; Liang, Y. The Role of Plasmacytoid Dendritic Cells in Cancers. Front. Immunol.
2021
,12, 749190. [CrossRef]
32.
Finotello, F.; Mayer, C.; Plattner, C.; Laschober, G.; Rieder, D.; Hackl, H.; Krogsdam, A.; Loncova, Z.; Posch, W.; Wilflingseder,
D.; et al. Molecular and pharmacological modulators of the tumor immune contexture revealed by deconvolution of RNA-seq
data. Genome Med. 2019,11, 34. [CrossRef]
33.
Cortese, N.; Soldani, C.; Franceschini, B.; Barbagallo, M.; Marchesi, F.; Torzilli, G.; Donadon, M. Macrophages in Colorectal Cancer
Liver Metastases. Cancers 2019,11, 633. [PubMed]
34.
Konstantinov, A.S.; Kovaleva, O.V.; Samoilova, D.V.; Shelekhova, K.V. Role of macrophages in progression of colorectal cancer: A
contrast with the traditional paradigm. Int. J. Clin. Exp. Pathol. 2022,15, 403–411. [PubMed]
35.
Kloosterman, D.J.; Akkari, L. Macrophages at the interface of the co-evolving cancer ecosystem. Cell
2023
,186, 1627–1651.
[CrossRef] [PubMed]
36.
Liberzon, A.; Birger, C.; Thorvaldsdóttir, H.; Ghandi, M.; Mesirov, J.P.; Tamayo, P. The Molecular Signatures Database Hallmark
Gene Set Collection. Cell Syst. 2015,1, 417–425. [PubMed]
37.
Binder, H.; Schmidt, M.; Hopp, L.; Davitavyan, S.; Arakelyan, A.; Loeffler-Wirth, H. Integrated Multi-Omics Maps of Lower-Grade
Gliomas. Cancers 2022,14, 2797.
38.
Tan, E.S.; Knepper, T.C.; Wang, X.; Permuth, J.B.; Wang, L.; Fleming, J.B.; Xie, H. Copy Number Alterations as Novel Biomarkers
and Therapeutic Targets in Colorectal Cancer. Cancers 2022,14, 2223.
39.
Kundaje, A.; Meuleman, W.; Ernst, J.; Bilenky, M.; Yen, A.; Heravi-Moussavi, A.; Kheradpour, P.; Zhang, Z.; Wang, J.; Ziller,
M.J.; et al. Integrative analysis of 111 reference human epigenomes. Nature 2015,518, 317–330. [CrossRef]
40.
Gerber, T.; Willscher, E.; Loeffler-Wirth, H.; Hopp, L.; Schadendorf, D.; Schartl, M.; Anderegg, U.; Camp, G.; Treutlein, B.; Binder,
H.; et al. Mapping heterogeneity in patient-derived melanoma cultures by single-cell RNA-seq. Oncotarget
2017
,8, 846–862.
[CrossRef]
41.
Sobral, D.; Martins, M.; Kaplan, S.; Golkaram, M.; Salmans, M.; Khan, N.; Vijayaraghavan, R.; Casimiro, S.; Fernandes, A.;
Borralho, P.; et al. Genetic and microenvironmental intra-tumor heterogeneity impacts colorectal cancer evolution and metastatic
development. Commun. Biol. 2022,5, 937. [CrossRef]
42.
Ernst, J.; Kellis, M. Chromatin-state discovery and genome annotation with ChromHMM. Nat. Protoc.
2017
,12, 2478–2492.
[CrossRef] [PubMed]
43.
Merotto, L.; Sturm, G. Immunedeconv: Methods for Immune Cell Deconvolution. 2022. Available online: https:/omnideconv.
org/immunedeconv (accessed on 25 July 2023).
44.
Trapnell, C.; Cacchiarelli, D.; Grimsby, J.; Pokharel, P.; Li, S.; Morse, M.; Lennon, N.J.; Livak, K.J.; Mikkelsen, T.S.; Rinn, J.L. The
dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol.
2014
,
32, 381–386. [CrossRef]
Cancers 2023,15, 3835 28 of 31
45.
Lorzadeh, A.; Romero-Wolf, M.; Goel, A.; Jadhav, U. Epigenetic Regulation of Intestinal Stem Cells and Disease: A Balancing Act
of DNA and Histone Methylation. Gastroenterology 2021,160, 2267–2282. [CrossRef] [PubMed]
46.
Chiacchiera, F.; Rossi, A.; Jammula, S.; Zanotti, M.; Pasini, D. PRC2 preserves intestinal progenitors and restricts secretory lineage
commitment. EMBO J. 2016,35, 2301–2314. [CrossRef] [PubMed]
47.
Hahn, M.A.; Li, A.X.; Wu, X.; Yang, R.; Drew, D.A.; Rosenberg, D.W.; Pfeifer, G.P. Loss of the Polycomb Mark from Bivalent
Promoters Leads to Activation of Cancer-Promoting Genes in Colorectal Tumors. Cancer Res.
2014
,74, 3617–3629. [CrossRef]
[PubMed]
48.
Applegate, C.C.; Lane, M.A. Role of retinoids in the prevention and treatment of colorectal cancer. World J. Gastrointest. Oncol.
2015,7, 184–203. [CrossRef]
49.
Majumdar, D.; Tiernan, J.P.; Lobo, A.J.; Evans, C.A.; Corfe, B.M. Keratins in colorectal epithelial function and disease. Int. J. Exp.
Pathol. 2012,93, 305–318. [CrossRef]
50.
Lähdeniemi, I.A.K.; Misiorek, J.O.; Antila, C.J.M.; Landor, S.K.J.; Stenvall, C.-G.A.; Fortelius, L.E.; Bergström, L.K.; Sahlgren, C.;
Toivola, D.M. Keratins regulate colonic epithelial cell differentiation through the Notch1 signalling pathway. Cell Death Differ.
2017,24, 984–996. [CrossRef] [PubMed]
51.
Zhou, Y.; Wu, J.; Fu, X.; Du, W.; Zhou, L.; Meng, X.; Yu, H.; Lin, J.; Ye, W.; Liu, J.; et al. OTUB1 promotes metastasis and serves as a
marker of poor prognosis in colorectal cancer. Mol. Cancer 2014,13, 258. [CrossRef]
52.
Kim, E.K.; Song, M.J.; Jung, Y.; Lee, W.S.; Jang, H.H. Proteomic Analysis of Primary Colon Cancer and Synchronous Solitary Liver
Metastasis. Cancer Genom. Proteom. 2019,16, 583–592. [CrossRef]
53.
Voutsadakis, I.A. The ubiquitin-proteasome system in colorectal cancer. Biochim. Et Biophys. Acta BBA—Mol. Basis Dis.
2008
,
1782, 800–808. [CrossRef] [PubMed]
54.
Prasad, H.; Mathew, J.K.K.; Visweswariah, S.S. Receptor Guanylyl Cyclase C and Cyclic GMP in Health and Disease: Perspectives
and Therapeutic Opportunities. Front. Endocrinol. 2022,13, 911459. [CrossRef]
55.
Nait Slimane, S.; Marcel, V.; Fenouil, T.; Catez, F.; Saurin, J.C.; Bouvet, P.; Diaz, J.J.; Mertani, H.C. Ribosome Biogenesis Alterations
in Colorectal Cancer. Cells 2020,9, 2361. [CrossRef]
56.
Bui, S.; Mejia, I.; Díaz, B.; Wang, Y. Adaptation of the Golgi Apparatus in Cancer Cell Invasion and Metastasis. Front. Cell Dev.
Biol. 2021,9, 806482. [CrossRef]
57.
Hopp, L.; Wirth, H.; Fasold, M.; Binder, H. Portraying the expression landscapes of cancer subtypes: A glioblastoma multiforme
and prostate cancer case study. Syst. Biomed. 2013,1, 99–121.
58.
Hebenstreit, D.; Fang, M.; Gu, M.; Charoensawan, V.; van Oudenaarden, A.; Teichmann, S.A. RNA sequencing reveals two major
classes of gene expression levels in metazoan cells. Mol. Syst. Biol. 2011,7, 497. [CrossRef]
59.
Prorok, P.; Forouzanfar, F.; Murugarren, N.; Peiffer, I.; Charton, R.; Akerman, I.; Méchali, M. Loss of Ezh2 function remodels the
DNA replication initiation landscape. Cell Rep. 2023,42, 112280. [CrossRef]
60.
Schmidt, M.; Loeffler-Wirth, H.; Binder, H. Developmental scRNAseq Trajectories in Gene- and Cell-State Space—The Flatworm
Example. Genes 2020,11, 1214.
61.
Zeng, Z.; Ma, C.; Chen, K.; Jiang, M.; Vasu, R.; Liu, R.; Zhao, Y.; Zhang, H. Roles of G Protein-Coupled Receptors (GPCRs) in
Gastrointestinal Cancers: Focus on Sphingosine 1-Shosphate Receptors, Angiotensin II Receptors, and Estrogen-Related GPCRs.
Cells 2021,10, 2988.
62.
Kumari, N.; Reabroi, S.; North, B.J. Unraveling the Molecular Nexus between GPCRs, ERS, and EMT. Mediat. Inflamm.
2021
,
2021, 6655417. [CrossRef]
63.
Kang, H.; Fichna, J.; Matlawska-Wasowska, K.; Jacenik, D. The Expression Pattern of Adhesion G Protein-Coupled Receptor
F5 Is Related to Cell Adhesion and Metastatic Pathways in Colorectal Cancer&mdash;Comprehensive Study Based on In Silico
Analysis. Cells 2022,11, 3876.
64.
Song, Q.; Han, Z.; Wu, X.; Wang, Y.; Zhou, L.; Yang, L.; Liu, N.; Sui, H.; Cai, J.; Ji, Q.; et al.
β
-Arrestin1 Promotes Colorectal Cancer
Metastasis Through GSK-3
β
/
β
-Catenin Signaling- Mediated Epithelial-to-Mesenchymal Transition. Front. Cell Dev. Biol.
2021
,
9, 650067. [CrossRef]
65.
Hirsch, D.; Barker, N.; McNeil, N.; Hu, Y.; Camps, J.; McKinnon, K.; Clevers, H.; Ried, T.; Gaiser, T. LGR5 positivity defines
stem-like cells in colorectal cancer. Carcinogenesis 2014,35, 849–858. [CrossRef] [PubMed]
66.
Xu, L.; Lin, W.; Wen, L.; Li, G. Lgr5 in cancer biology: Functional identification of Lgr5 in cancer progression and potential
opportunities for novel therapy. Stem Cell Res. Ther. 2019,10, 219. [CrossRef] [PubMed]
67.
Whitfield, M.L.; Sherlock, G.; Saldanha, A.J.; Murray, J.I.; Ball, C.A.; Alexander, K.E.; Matese, J.C.; Perou, C.M.; Hurt, M.M.;
Brown, P.O.; et al. Identification of Genes Periodically Expressed in the Human Cell Cycle and Their Expression in Tumors. Mol.
Biol. Cell 2002,13, 1977–2000. [CrossRef] [PubMed]
68.
Ben-Porath, I.; Thomson, M.W.; Carey, V.J.; Ge, R.; Bell, G.W.; Regev, A.; Weinberg, R.A. An embryonic stem cell-like gene
expression signature in poorly differentiated aggressive human tumors. Nat. Genet. 2008,40, 499–507. [CrossRef] [PubMed]
69.
Binder, H.; Hopp, L.; Lembcke, K.; Wirth, H. Personalized Disease Phenotypes from Massive OMICs Data. In Big Data Analytics
in Bioinformatics and Healthcare; Baoying, W., Ruowang, L., William, P., Eds.; IGI Global: Hershey, PA, USA, 2015; pp. 359–378.
[CrossRef]
Cancers 2023,15, 3835 29 of 31
70.
Mooi, J.K.; Wirapati, P.; Asher, R.; Lee, C.K.; Savas, P.; Price, T.J.; Townsend, A.; Hardingham, J.; Buchanan, D.; Williams, D.; et al.
The prognostic impact of consensus molecular subtypes (CMS) and its predictive effects for bevacizumab benefit in metastatic
colorectal cancer: Molecular analysis of the AGITG MAX clinical trial. Ann. Oncol. 2018,29, 2240–2246. [CrossRef] [PubMed]
71.
Majek, O.; Gondos, A.; Jansen, L.; Emrich, K.; Holleczek, B.; Katalinic, A.; Nennecke, A.; Eberle, A.; Brenner, H. Sex differences in
colorectal cancer survival: Population-based analysis of 164,996 colorectal cancer patients in Germany. PLoS ONE
2013
,8, e68077.
[CrossRef]
72.
Abancens, M.; Bustos, V.; Harvey, H.; McBryan, J.; Harvey, B.J. Sexual Dimorphism in Colon Cancer. Front. Oncol.
2020
,10, 607909.
[CrossRef]
73.
White, A.; Ironmonger, L.; Steele, R.J.C.; Ormiston-Smith, N.; Crawford, C.; Seims, A. A review of sex-related differences in
colorectal cancer incidence, screening uptake, routes to diagnosis, cancer stage and survival in the UK. BMC Cancer
2018
,18, 906.
[CrossRef]
74.
Milette, S.; Hashimoto, M.; Perrino, S.; Qi, S.; Chen, M.; Ham, B.; Wang, N.; Istomine, R.; Lowy, A.M.; Piccirillo, C.A.; et al.
Sexual dimorphism and the role of estrogen in the immune microenvironment of liver metastases. Nat. Commun.
2019
,10, 5745.
[CrossRef]
75.
Hases, L.; Ibrahim, A.; Chen, X.; Liu, Y.; Hartman, J.; Williams, C. The Importance of Sex in the Discovery of Colorectal Cancer
Prognostic Biomarkers. Int. J. Mol. Sci. 2021,22, 1354. [CrossRef] [PubMed]
76.
Nersisyan, L.; Simonyan, A.; Binder, H.; Arakelyan, A. Telomere Maintenance Pathway Activity Analysis Enables Tissue- and
Gene-Level Inferences. Front. Genet. 2021,12, 662464. [CrossRef]
77.
Nersisyan, L.; Hopp, L.; Loeffler-Wirth, H.; Galle, J.; Loeffler, M.; Arakelyan, A.; Binder, H. Telomere Length Maintenance and
Its Transcriptional Regulation in Lynch Syndrome and Sporadic Colorectal Carcinoma. Front. Oncol.
2019
,9, 1172. [CrossRef]
[PubMed]
78.
Barthel, F.P.; Wei, W.; Tang, M.; Martinez-Ledesma, E.; Hu, X.; Amin, S.B.; Akdemir, K.C.; Seth, S.; Song, X.; Wang, Q.; et al.
Systematic analysis of telomere length and somatic alterations in 31 cancer types. Nat. Genet.
2017
,49, 349. [CrossRef] [PubMed]
79.
Andersen, J.B.; Spee, B.; Blechacz, B.R.; Avital, I.; Komuta, M.; Barbour, A.; Conner, E.A.; Gillen, M.C.; Roskams, T.; Roberts,
L.R.; et al. Genomic and genetic characterization of cholangiocarcinoma identifies therapeutic targets for tyrosine kinase inhibitors.
Gastroenterology 2012,142, 1021–1031.e15. [CrossRef] [PubMed]
80.
Dodd, L.E.; Sengupta, S.; Chen, I.H.; den Boon, J.A.; Cheng, Y.J.; Westra, W.; Newton, M.A.; Mittl, B.F.; McShane, L.; Chen,
C.J.; et al. Genes involved in DNA repair and nitrosamine metabolism and those located on chromosome 14q32 are dysregulated
in nasopharyngeal carcinoma. Cancer Epidemiol. Biomark. Prev. 2006,15, 2216–2225. [CrossRef]
81.
Parikh, A.R.; Szabolcs, A.; Allen, J.N.; Clark, J.W.; Wo, J.Y.; Raabe, M.; Thel, H.; Hoyos, D.; Mehta, A.; Arshad, S.; et al. Radiation
therapy enhances immunotherapy response in microsatellite stable colorectal and pancreatic adenocarcinoma in a phase II trial.
Nat. Cancer 2021,2, 1124–1135. [CrossRef]
82.
Matsuda, S.; Revandkar, A.; Dubash, T.D.; Ravi, A.; Wittner, B.S.; Lin, M.; Morris, R.; Burr, R.; Guo, H.; Seeger, K.; et al. TGF-
β
in the microenvironment induces a physiologically occurring immune-suppressive senescent state. Cell Rep.
2023
,42, 112129.
[CrossRef]
83.
He, J.; Shan, S.; Li, Q.; Fang, B.; Xie, Y. Mechanical Stretch Triggers Epithelial-Mesenchymal Transition in Keratinocytes Through
Piezo1 Channel. Front. Physiol. 2022,13, 745572. [CrossRef]
84.
Dombroski, J.A.; Hope, J.M.; Sarna, N.S.; King, M.R. Channeling the Force: Piezo1 Mechanotransduction in Cancer Metastasis.
Cells 2021,10, 2815. [CrossRef]
85.
De Felice, D.; Alaimo, A. Mechanosensitive Piezo Channels in Cancer: Focus on altered Calcium Signaling in Cancer Cells and in
Tumor Progression. Cancers 2020,12, 1780. [CrossRef] [PubMed]
86.
Carrillo-Garcia, J.; Herrera-Fernández, V.; Serra, S.A.; Rubio-Moscardo, F.; Vogel-Gonzalez, M.; Doñate-Macian, P.; Hevia, C.F.;
Pujades, C.; Valverde, M.A. The mechanosensitive Piezo1 channel controls endosome trafficking for an efficient cytokinetic
abscission. Sci. Adv. 2021,7, eabi7785. [CrossRef] [PubMed]
87.
Sutherland, T.E.; Dyer, D.P.; Allen, J.E. The extracellular matrix and the immune system: A mutually dependent relationship.
Science 2023,379, eabp8964. [CrossRef] [PubMed]
88.
Hübers, C.; Abdul Pari, A.A.; Grieshober, D.; Petkov, M.; Schmidt, A.; Messmer, T.; Heyer, C.M.; Schölch, S.; Kapel, S.S.;
Gengenbacher, N.; et al. Primary tumor–derived systemic nANGPTL4 inhibits metastasis. J. Exp. Med.
2022
,220, e20202595.
[CrossRef] [PubMed]
89.
Shaffer, S.M.; Dunagin, M.C.; Torborg, S.R.; Torre, E.A.; Emert, B.; Krepler, C.; Beqiri, M.; Sproesser, K.; Brafford, P.A.; Xiao,
M.; et al. Rare cell variability and drug-induced reprogramming as a mode of cancer drug resistance. Nature
2017
,546, 431–435.
[CrossRef]
90.
Del Rio, M.; Molina, F.; Bascoul-Mollevi, C.; Copois, V.; Bibeau, F.; Chalbos, P.; Bareil, C.; Kramar, A.; Salvetat, N.; Fraslon, C.; et al.
Gene expression signature in advanced colorectal cancer patients select drugs and response for the use of leucovorin, fluorouracil,
and irinotecan. J. Clin. Oncol. 2007,25, 773–780. [CrossRef]
91. Hanahan, D. Hallmarks of Cancer: New Dimensions. Cancer Discov. 2022,12, 31–46. [CrossRef]
92.
Loeffler-Wirth, H.; Kreuz, M.; Schmidt, M.; Ott, G.; Siebert, R.; Binder, H. Classifying Germinal Center Derived Lymphomas-
Navigate a Complex Transcriptional Landscape. Cancers 2022,14, 3434. [CrossRef]
Cancers 2023,15, 3835 30 of 31
93.
Ciner, A.T.; Jones, K.; Muschel, R.J.; Brodt, P. The unique immune microenvironment of liver metastases: Challenges and
opportunities. Semin. Cancer Biol. 2021,71, 143–156. [CrossRef]
94.
de Visser, K.E.; Joyce, J.A. The evolving tumor microenvironment: From cancer initiation to metastatic outgrowth. Cancer Cell
2023,41, 374–403. [CrossRef]
95.
Wang, H.; Pan, J.; Barsky, L.; Jacob, J.C.; Zheng, Y.; Gao, C.; Wang, S.; Zhu, W.; Sun, H.; Lu, L.; et al. Characteristics of pre-metastatic
niche: The landscape of molecular and cellular pathways. Mol. Biomed. 2021,2, 3. [CrossRef] [PubMed]
96.
Singh, M.P.; Rai, S.; Pandey, A.; Singh, N.K.; Srivastava, S. Molecular subtypes of colorectal cancer: An emerging therapeutic
opportunity for personalized medicine. Genes Dis. 2021,8, 133–145. [CrossRef]
97.
Zhang, Y.; Luo, S.; Jia, Y.; Zhang, X. Telomere maintenance mechanism dysregulation serves as an early predictor of adjuvant
therapy response and a potential therapeutic target in human cancers. Int. J. Cancer 2022,151, 313–327. [CrossRef] [PubMed]
98.
Martínez-Jiménez, F.; Movasati, A.; Brunner, S.R.; Nguyen, L.; Priestley, P.; Cuppen, E.; Van Hoeck, A. Pan-cancer whole-genome
comparison of primary and metastatic solid tumours. Nature 2023,618, 333–341. [CrossRef] [PubMed]
99.
Pich, O.; Muiños, F.; Lolkema, M.P.; Steeghs, N.; Gonzalez-Perez, A.; Lopez-Bigas, N. The mutational footprints of cancer therapies.
Nat. Genet. 2019,51, 1732–1740. [CrossRef] [PubMed]
100.
Nallasamy, P.; Nimmakayala, R.K.; Parte, S.; Are, A.C.; Batra, S.K.; Ponnusamy, M.P. Tumor microenvironment enriches the
stemness features: The architectural event of therapy resistance and metastasis. Mol. Cancer 2022,21, 225. [CrossRef] [PubMed]
101.
Patras, L.; Shaashua, L.; Matei, I.; Lyden, D. Immune determinants of the pre-metastatic niche. Cancer Cell
2023
,41, 546–572.
[CrossRef] [PubMed]
102.
Peranzoni, E.; Lemoine, J.; Vimeux, L.; Feuillet, V.; Barrin, S.; Kantari-Mimoun, C.; Bercovici, N.; Guérin, M.; Biton, J.; Ouakrim,
H.; et al. Macrophages impede CD8 T cells from reaching tumor cells and limit the efficacy of anti–PD-1 treatment. Proc. Natl.
Acad. Sci. USA 2018,115, E4041–E4050. [CrossRef]
103.
Tien, F.-M.; Lu, H.-H.; Lin, S.-Y.; Tsai, H.-C. Epigenetic remodeling of the immune landscape in cancer: Therapeutic hurdles and
opportunities. J. Biomed. Sci. 2023,30, 3. [CrossRef]
104.
Halama, N.; Zoernig, I.; Berthel, A.; Kahlert, C.; Klupp, F.; Suarez-Carmona, M.; Suetterlin, T.; Brand, K.; Krauss, J.; Lasitschka,
F.; et al. Tumoral Immune Cell Exploitation in Colorectal Cancer Metastases Can Be Targeted Effectively by Anti-CCR5 Therapy
in Cancer Patients. Cancer Cell 2016,29, 587–601. [CrossRef]
105.
Jiao, W.; Atwal, G.; Polak, P.; Karlic, R.; Cuppen, E.; Danyi, A.; de Ridder, J.; van Herpen, C.; Lolkema, M.P.; Steeghs, N.; et al. A
deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns. Nat. Commun.
2020,11, 728. [CrossRef]
106.
Fares, J.; Fares, M.Y.; Khachfe, H.H.; Salhab, H.A.; Fares, Y. Molecular principles of metastasis: A hallmark of cancer revisited.
Signal Transduct. Target. Ther. 2020,5, 28. [CrossRef]
107.
Kilmister, E.J.; Koh, S.P.; Weth, F.R.; Gray, C.; Tan, S.T. Cancer Metastasis and Treatment Resistance: Mechanistic Insights and
Therapeutic Targeting of Cancer Stem Cells and the Tumor Microenvironment. Biomedicines 2022,10, 2988. [CrossRef]
108.
Binder, H.; Steiner, L.; Wirth, H.; Rohlf, T.; Prohaska, S.; Galle, J. Transcriptional regulation by histone modifications: Towards a
theory of chromatin re-organization during stem cell differentiation. Phys. Biol. 2013,10, 026006. [CrossRef]
109.
Morgan, R.G.; Mortensson, E.; Williams, A.C. Targeting LGR5 in Colorectal Cancer: Therapeutic gold or too plastic? Br. J. Cancer
2018,118, 1410–1418. [CrossRef]
110.
Cross, W.C.; Graham, T.A.; Wright, N.A. New paradigms in clonal evolution: Punctuated equilibrium in cancer. J. Pathol.
2016
,
240, 126–136. [CrossRef]
111.
Angelova, M.; Mlecnik, B.; Vasaturo, A.; Bindea, G.; Fredriksen, T.; Lafontaine, L.; Buttard, B.; Morgand, E.; Bruni, D.; Jouret-
Mourin, A.; et al. Evolution of Metastases in Space and Time under Immune Selection. Cell
2018
,175, 751–765.e16. [CrossRef]
[PubMed]
112.
Flavahan, W.A.; Gaskell, E.; Bernstein, B.E. Epigenetic plasticity and the hallmarks of cancer. Science
2017
,357, eaal2380. [CrossRef]
113.
Avagyan, S.; Binder, H. Subtyping or not subtyping—Quo vadis for precision medicine of colorectal cancer. Transl. Cancer Res.
2023,12, 321–339. [CrossRef]
114.
Carter, S.L.; Eklund, A.C.; Kohane, I.S.; Harris, L.N.; Szallasi, Z. A signature of chromosomal instability inferred from gene
expression profiles predicts clinical outcome in multiple human cancers. Nat. Genet. 2006,38, 1043–1048. [CrossRef]
115.
Spurr, L.F.; Martinez, C.A.; Katipally, R.R.; Iyer, S.C.; Pugh, S.A.; Bridgewater, J.A.; Primrose, J.N.; Domingo, E.; Maughan, T.S.;
D’Angelica, M.I.; et al. A proliferative subtype of colorectal liver metastases exhibits hypersensitivity to cytotoxic chemotherapy.
Npj Precis. Oncol. 2022,6, 72. [CrossRef] [PubMed]
116.
Pitroda, S.P.; Khodarev, N.N.; Huang, L.; Uppal, A.; Wightman, S.C.; Ganai, S.; Joseph, N.; Pitt, J.; Brown, M.; Forde, M.; et al.
Integrated molecular subtyping defines a curable oligometastatic state in colorectal liver metastasis. Nat. Commun.
2018
,9, 1793.
[CrossRef] [PubMed]
117.
Katipally, R.R.; Martinez, C.A.; Pugh, S.A.; Bridgewater, J.A.; Primrose, J.N.; Domingo, E.; Maughan, T.S.; Talamonti, M.S.;
Posner, M.C.; Weichselbaum, R.R.; et al. Integrated Clinical-Molecular Classification of Colorectal Liver Metastases: A Biomarker
Analysis of the Phase 3 New EPOC Randomized Clinical Trial. JAMA Oncol. 2023. [CrossRef]
118.
Avraham-Davidi, I.; Mages, S.; Klughammer, J.; Moriel, N.; Imada, S.; Hofree, M.; Murray, E.; Chen, J.; Pelka, K.; Mehta, A.; et al.
Integrative single cell and spatial transcriptomics of colorectal cancer reveals multicellular functional units that support tumor
progression. bioRxiv, 2022. [CrossRef]
Cancers 2023,15, 3835 31 of 31
119.
Fleischer, J.R.; Schmitt, A.M.; Haas, G.; Xu, X.; Zeisberg, E.M.; Bohnenberger, H.; Küffer, S.; Teuwen, L.-A.; Karras, P.J.; Beißbarth,
T.; et al. Molecular differences of angiogenic versus vessel co-opting colorectal cancer liver metastases at single-cell resolution.
Mol. Cancer 2023,22, 17. [CrossRef]
120.
Wood, C.S.; Pennel, K.A.F.; Leslie, H.; Legrini, A.; Cameron, A.J.; Melissourgou-Syka, L.; Quinn, J.A.; van Wyk, H.C.; Hay, J.;
Roseweir, A.K.; et al. Spatially Resolved Transcriptomics Deconvolutes Prognostic Histological Subgroups in Patients with
Colorectal Cancer and Synchronous Liver Metastases. Cancer Res. 2023,83, 1329–1344. [CrossRef]
121.
Hu, M.; Chen, Z.; Hu, D.; Xi, S.; Wang, D.; Zhang, X.; Fong, W.P.; Wen, L.; Cai, Y.; Yuan, Y.; et al. Delineating the molecular
landscape of different histopathological growth patterns in colorectal cancer liver metastases. Front. Immunol.
2022
,13, 1045329.
[CrossRef]
122.
Nersisyan, L.; Loeffler-Wirth, H.; Arakelyan, A.; Binder, H. Gene set- and pathway- centered knowledge discovery assigns
transcriptional activation patterns in brain, blood and colon cancer A bioinformatics perspective. J. Bioinform. Knowl. Min.
2016
,
4, 46–70.
123.
Angelova, M.; Charoentong, P.; Hackl, H.; Fischer, M.L.; Snajder, R.; Krogsdam, A.M.; Waldner, M.J.; Bindea, G.; Mlecnik, B.;
Galon, J.; et al. Characterization of the immunophenotypes and antigenomes of colorectal cancers reveals distinct tumor escape
mechanisms and novel targets for immunotherapy. Genome Biol. 2015,16, 64. [CrossRef]
124.
Kosinski, C.; Li, V.S.; Chan, A.S.; Zhang, J.; Ho, C.; Tsui, W.Y.; Chan, T.L.; Mifflin, R.C.; Powell, D.W.; Yuen, S.T.; et al. Gene
expression patterns of human colon tops and basal crypts and BMP antagonists as intestinal stem cell niche factors. Proc. Natl.
Acad. Sci. USA 2007,104, 15418–15423. [CrossRef] [PubMed]
125.
Olsen, J.; Gerds, T.A.; Seidelin, J.B.; Csillag, C.; Bjerrum, J.T.; Troelsen, J.T.; Nielsen, O.H. Diagnosis of ulcerative colitis before
onset of inflammation by multivariate modeling of genome-wide gene expression data. Inflamm. Bowel Dis.
2009
,15, 1032–1038.
[CrossRef] [PubMed]
126.
Hao, Y.; Hao, S.; Andersen-Nissen, E.; Mauck, W.M.; Zheng, S.; Butler, A.; Lee, M.J.; Wilk, A.J.; Darby, C.; Zager, M.; et al.
Integrated analysis of multimodal single-cell data. Cell 2021,184, 3573–3587.e29. [CrossRef] [PubMed]
127.
Zeng, Z.S.; Shu, W.P.; Cohen, A.M.; Guillem, J.G. Matrix metalloproteinase-7 expression in colorectal cancer liver metastases:
Evidence for involvement of MMP-7 activation in human cancer metastases. Clin. Cancer Res. 2002,8, 144–148. [PubMed]
128.
Bakker, E.R.; Das, A.M.; Helvensteijn, W.; Franken, P.F.; Swagemakers, S.; van der Valk, M.A.; ten Hagen, T.L.; Kuipers, E.J.;
van Veelen, W.; Smits, R. Wnt5a promotes human colon cancer cell migration and invasion but does not augment intestinal
tumorigenesis in Apc 1638N mice. Carcinogenesis 2013,34, 2629–2638. [CrossRef]
129.
Wang, B.; Tang, Z.; Gong, H.; Zhu, L.; Liu, X. Wnt5a promotes epithelial-to-mesenchymal transition and metastasis in non-small-
cell lung cancer. Biosci. Rep. 2017,37, BSR20171092. [CrossRef]
130.
Giannakis, M.; Hodis, E.; Jasmine Mu, X.; Yamauchi, M.; Rosenbluh, J.; Cibulskis, K.; Saksena, G.; Lawrence, M.S.; Qian, Z.R.;
Nishihara, R.; et al. RNF43 is frequently mutated in colorectal and endometrial cancers. Nat. Genet.
2014
,46, 1264–1266. [CrossRef]
Disclaimer/Publisher’s Note:
The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.
... Furthermore, it is important to acknowledge the value of transcriptome analyses, specifically in CRC liver metastasis. For instance, the application of machine learning coupled with bioinformatic analysis allowed insights into gene expression patterns, functional significance and prognostic relevance in CRC liver metastasis patients [28]. This investigation utilised microarray-based gene expression data obtained from hepatic resections of liver metastases from primary colorectal tumours [28]. ...
... For instance, the application of machine learning coupled with bioinformatic analysis allowed insights into gene expression patterns, functional significance and prognostic relevance in CRC liver metastasis patients [28]. This investigation utilised microarray-based gene expression data obtained from hepatic resections of liver metastases from primary colorectal tumours [28]. However, relying solely on transcriptome data from hepatic resections of liver metastases overlooks crucial information about changes in primary tumours. ...
Article
Full-text available
Sequencing-based genome-wide DNA methylation, gene expression studies and associated data on paired colorectal cancer (CRC) primary and liver metastasis are very limited. We have profiled the DNA methylome and transcriptome of matched primary CRC and liver metastasis samples from the same patients. Genome-scale methylation and expression levels were examined using Reduced Representation Bisulfite Sequencing (RRBS) and RNA-Seq, respectively. To investigate DNA methylation and expression patterns, we generated a total of 1.01 × 109 RRBS reads and 4.38 x 108 RNA-Seq reads from the matched cancer tissues. Here, we describe in detail the sample features, experimental design, methods and bioinformatic pipeline for these epigenetic data. We demonstrate the quality of both the samples and sequence data obtained from the paired samples. The sequencing data obtained from this study will serve as a valuable resource for studying underlying mechanisms of distant metastasis and the utility of epigenetic profiles in cancer metastasis.
... SOM portrayal considers the multidimensional nature of gene regulation and pursues a modular view on co-expression, reduces dimensionality and, most importantly, supports visual perception in terms of individual, case-specific expression portraits. The pipeline has been applied to a series of data types and issues, e.g., in the context of molecular oncology Loeffler-Wirth et al., 2022;Ashekyan et al., 2023) and health-related population studies (Nikoghosyan et al., 2019;Schmidt et al., 2020a), which all have proven the analytic strength of the methods in complex, multi-dimensional omics data. In the context of vine genomics, oposSOM has been applied so-far as "SOMmelier" to microarray SNP data to discover the dissemination history of Vitis vinifera as seen by vine genomes (Nikoghosyan et al., 2020). ...
Article
Full-text available
Introduction The escalating challenge of climate change has underscored the critical need to understand cold defense mechanisms in cultivated grapevine Vitis vinifera. Temperature variations can affect the growth and overall health of vine. Methods We used Self Organizing Maps machine learning method to analyze gene expression data from leaves of five Vitis vinifera cultivars each treated by four different temperature conditions. The algorithm generated sample-specific “portraits” of the normalized gene expression data, revealing distinct patterns related to the temperature conditions applied. Results Our analysis unveiled a connection with vitamin B1 (thiamine) biosynthesis, suggesting a link between temperature regulation and thiamine metabolism, in agreement with thiamine related stress response established in Arabidopsis before. Furthermore, we found that epigenetic mechanisms play a crucial role in regulating the expression of stress-responsive genes at low temperatures in grapevines. Discussion Application of Self Organizing Maps portrayal to vine transcriptomics identified modules of coregulated genes triggered under cold stress. Our machine learning approach provides a promising option for transcriptomics studies in plants.
Article
Full-text available
Molecular subtyping; colorectal cancer (CRC); precision oncology; tumor microenvironment (TME); extracellular matrix (ECM)-based scoring
Article
Full-text available
Metastatic cancer remains an almost inevitably lethal disease1–3. A better understanding of disease progression and response to therapies therefore remains of utmost importance. Here we characterize the genomic differences between early-stage untreated primary tumours and late-stage treated metastatic tumours using a harmonized pan-cancer analysis (or reanalysis) of two unpaired primary⁴ and metastatic⁵ cohorts of 7,108 whole-genome-sequenced tumours. Metastatic tumours in general have a lower intratumour heterogeneity and a conserved karyotype, displaying only a modest increase in mutations, although frequencies of structural variants are elevated overall. Furthermore, highly variable tumour-specific contributions of mutational footprints of endogenous (for example, SBS1 and APOBEC) and exogenous mutational processes (for example, platinum treatment) are present. The majority of cancer types had either moderate genomic differences (for example, lung adenocarcinoma) or highly consistent genomic portraits (for example, ovarian serous carcinoma) when comparing early-stage and late-stage disease. Breast, prostate, thyroid and kidney renal clear cell carcinomas and pancreatic neuroendocrine tumours are clear exceptions to the rule, displaying an extensive transformation of their genomic landscape in advanced stages. Exposure to treatment further scars the tumour genome and introduces an evolutionary bottleneck that selects for known therapy-resistant drivers in approximately half of treated patients. Our data showcase the potential of pan-cancer whole-genome analysis to identify distinctive features of late-stage tumours and provide a valuable resource to further investigate the biological basis of cancer and resistance to therapies.
Article
Full-text available
Unlabelled: Strong immune responses in primary colorectal cancer correspond with better patient survival following surgery compared with tumors with predominantly stromal microenvironments. However, biomarkers to identify patients with colorectal cancer liver metastases (CRLM) with good prognosis following surgery for oligometastatic disease remain elusive. The aim of this study was to determine the practical application of a simple histological assessment of immune cell infiltration and stromal content in predicting outcome following synchronous resection of primary colorectal cancer and CRLM and to interrogate the underlying functional biology that drives disease progression. Samples from patients undergoing synchronous resection of primary colorectal cancer and CRLM were evaluated in detail through histological assessment, panel genomic and bulk transcriptomic assessment, IHC, and GeoMx spatial transcriptomics (ST) analysis. High immune infiltration of metastases was associated with improved cancer-specific survival. Bulk transcriptomic analysis was confounded by stromal content, but ST demonstrated that the invasive edge of the metastases of long-term survivors was characterized by adaptive immune cell populations enriched for type II IFN signaling and MHC-class II antigen presentation. In contrast, patients with poor prognosis demonstrated increased abundance of regulatory T cells and neutrophils with enrichment of Notch and TGFβ signaling pathways at the metastatic tumor center. In summary, histological assessment can stratify outcomes in patients undergoing synchronous resection of CRLM, suggesting that it has potential as a prognostic biomarker. Furthermore, ST analysis has revealed significant intratumoral and interlesional heterogeneity and identified the underlying transcriptomic programs driving each phenotype. Significance: Spatial transcriptomics uncovers heterogeneity between patients, between matched lesions in the same patient, and within individual lesions and identifies drivers of metastatic progression in colorectal cancer with reactive and suppressed immune microenvironments.
Article
Full-text available
TGF-β induces senescence in embryonic tissues. Whether TGF-β in the hypoxic tumor microenvironment (TME) induces senescence in cancer and how the ensuing senescence-associated secretory phenotype (SASP) remodels the cellular TME to influence immune checkpoint inhibitor (ICI) responses are unknown. We show that TGF-β induces a deeper senescent state under hypoxia than under normoxia; deep senescence correlates with the degree of E2F suppression and is marked by multinucleation, reduced reentry into proliferation, and a distinct 14-gene SASP. Suppressing TGF-β signaling in tumors in an immunocompetent mouse lung cancer model abrogates endogenous senescent cells and suppresses the 14-gene SASP and immune infiltration. Untreated human lung cancers with a high 14-gene SASP display immunosuppressive immune infiltration. In a lung cancer clinical trial of ICIs, elevated 14-gene SASP is associated with increased senescence, TGF-β and hypoxia signaling, and poor progression-free survival. Thus, TME-induced senescence may represent a naturally occurring state in cancer, contributing to an immune-suppressive phenotype associated with immune therapy resistance.
Article
Importance: Personalized treatment approaches for patients with oligometastatic colorectal liver metastases are critically needed. We previously defined 3 biologically distinct molecular subtypes of colorectal liver metastases: (1) canonical, (2) immune, and (3) stromal. Objective: To independently validate these molecular subtypes in the phase 3 New EPOC randomized clinical trial. Design, setting, and participants: This retrospective secondary analysis of the phase 3 New EPOC randomized clinical trial included a bi-institutional discovery cohort and multi-institutional validation cohort. The discovery cohort comprised patients who underwent hepatic resection for limited colorectal liver metastases (98% received perioperative chemotherapy) from May 31, 1994, to August 14, 2012. The validation cohort comprised patients who underwent hepatic resection for liver metastases with perioperative chemotherapy (fluorouracil, oxaliplatin, and irinotecan based) with or without cetuximab from February 26, 2007, to November 1, 2012. Data were analyzed from January 18 to December 10, 2021. Interventions: Resected metastases underwent RNA sequencing and microRNA (miRNA) profiling in the discovery cohort and messenger RNA and miRNA profiling with microarray in the validation cohort. Main outcomes and measures: A 31-feature (24 messenger RNAs and 7 miRNAs) neural network classifier was trained to predict molecular subtypes in the discovery cohort and applied to the validation cohort. Integrated clinical-molecular risk groups were designated based on molecular subtypes and the clinical risk score. The unique biological phenotype of each molecular subtype was validated using gene set enrichment analyses and immune deconvolution. The primary clinical end points were progression-free survival (PFS) and overall survival (OS). Results: A total of 240 patients were included (mean [range] age, 63.0 [56.3-68.0] years; 151 [63%] male), with 93 in the discovery cohort and 147 in the validation cohort. In the validation cohort, 73 (50%), 28 (19%), and 46 (31%) patients were classified as having canonical, immune, and stromal metastases, respectively. The biological phenotype of each subtype was concordant with the discovery cohort. The immune subtype (best prognosis) demonstrated 5-year PFS of 43% (95% CI, 25%-60%; hazard ratio [HR], 0.37; 95% CI, 0.20-0.68) and OS of 63% (95% CI, 40%-79%; HR, 0.38; 95% CI, 0.17-0.86), which was statistically significantly higher than the canonical subtype (worst prognosis) at 14% (95% CI, 7%-23%) and 43% (95% CI, 32%-55%), respectively. Adding molecular subtypes to the clinical risk score improved prediction (the Gönen and Heller K for discrimination) from 0.55 (95% CI, 0.49-0.61) to 0.62 (95% CI, 0.57-0.67) for PFS and 0.59 (95% CI, 0.52-0.66) to 0.63 (95% CI, 0.56-0.70) for OS. The low-risk integrated group demonstrated 5-year PFS of 44% (95% CI, 20%-66%; HR, 0.38; 95% CI, 0.19-0.76) and OS of 78% (95% CI, 44%-93%; HR, 0.26; 95% CI, 0.08-0.84), superior to the high-risk group at 16% (95% CI, 10%-24%) and 43% (95% CI, 32%-52%), respectively. Conclusions and relevance: In this prognostic study, biologically derived colorectal liver metastasis molecular subtypes and integrated clinical-molecular risk groups were highly prognostic. This novel molecular classification warrants further study as a possible predictive biomarker for personalized systemic treatment for colorectal liver metastases. Trial registration: isrctn.org Identifier: ISRCTN22944367.
Article
In metazoan cells, DNA replication initiates from thousands of genomic loci scattered throughout the genome called DNA replication origins. Origins are strongly associated with euchromatin, particularly open genomic regions such as promoters and enhancers. However, over a third of transcriptionally silent genes are associated with DNA replication initiation. Most of these genes are bound and repressed by the Polycomb repressive complex-2 (PRC2) through the repressive H3K27me3 mark. This is the strongest overlap observed for a chromatin regulator with replication origin activity. Here, we asked whether Polycomb-mediated gene repression is functionally involved in recruiting DNA replication origins to transcriptionally silent genes. We show that the absence of EZH2, the catalytic subunit of PRC2, results in increased DNA replication initiation, specifically in the vicinity of EZH2 binding sites. The increase in DNA replication initiation does not correlate with transcriptional de-repression or the acquisition of activating histone marks but does correlate with loss of H3K27me3 from bivalent promoters.
Article
Macrophages are versatile and heterogeneous innate immune cells undertaking central functions in balancing immune responses and tissue repair to maintain homeostasis. This plasticity, once co-opted by malignant outgrowth, orchestrates manifold reciprocal interactions within the tumor microenvironment, fueling the evolution of the cancer ecosystem. Here, we review the multilayered sources of influence that jointly underpin and longitudinally shape tumor-associated macrophage (TAM) phenotypic states in solid neoplasms. We discuss how, in response to these signals, TAMs steer tumor evolution in the context of natural selection, biological dispersion, and treatment resistance. A number of research frontiers to be tackled are laid down in this review to therapeutically exploit the complex roles of TAMs in cancer. Building upon knowledge obtained from currently applied TAM-targeting strategies and using next generation technologies, we propose conceptual advances and novel therapeutic avenues to rewire TAM multifaceted regulation of the co-evolving cancer ecosystem.
Article
Primary tumors actively and specifically prime pre-metastatic niches (PMNs), the future sites of organotropic metastasis, preparing these distant microenvironments for disseminated tumor cell arrival. While initial studies of the PMN focused on extracellular matrix alterations and stromal reprogramming, it is increasingly clear that the far-reaching effects of tumors are in great part achieved through systemic and local PMN immunosuppression. Here, we discuss recent advances in our understanding of the tumor immune microenvironment and provide a comprehensive overview of the immune determinants of the PMN's spatiotemporal evolution. Moreover, we depict the PMN immune landscape, based on functional pre-clinical studies as well as mounting clinical evidence, and the dynamic, reciprocal crosstalk with systemic changes imposed by cancer progression. Finally, we outline emerging therapeutic approaches that alter the dynamics of the interactions driving PMN formation and reverse immunosuppression programs in the PMN ensuring early anti-tumor immune responses.
Article
Cancers represent complex ecosystems comprising tumor cells and a multitude of non-cancerous cells, embedded in an altered extracellular matrix. The tumor microenvironment (TME) includes diverse immune cell types, cancer-associated fibroblasts, endothelial cells, pericytes, and various additional tissue-resident cell types. These host cells were once considered bystanders of tumorigenesis but are now known to play critical roles in the pathogenesis of cancer. The cellular composition and functional state of the TME can differ extensively depending on the organ in which the tumor arises, the intrinsic features of cancer cells, the tumor stage, and patient characteristics. Here, we review the importance of the TME in each stage of cancer progression, from tumor initiation, progression, invasion, and intravasation to metastatic dissemination and outgrowth. Understanding the complex interplay between tumor cell-intrinsic, cell-extrinsic, and systemic mediators of disease progression is critical for the rational development of effective anti-cancer treatments.
Article
For decades, immunologists have studied the role of circulating immune cells in host protection, with a more recent appreciation of immune cells resident within the tissue microenvironment and the intercommunication between nonhematopoietic cells and immune cells. However, the extracellular matrix (ECM), which comprises at least a third of tissue structures, remains relatively underexplored in immunology. Similarly, matrix biologists often overlook regulation of complex structural matrices by the immune system. We are only beginning to understand the scale at which ECM structures determine immune cell localization and function. Additionally, we need to better understand how immune cells dictate ECM complexity. This review aims to highlight the potential for biological discovery at the interface of immunology and matrix biology.