Fig 3 - uploaded by Marcelo Brigido
Content may be subject to copyright.
Distribution of ZDRs throughout human chromosome 14: black lines show ZDR clusters around the TSS in contrast to random quasi-uniform distibution depicted in light grey. 

Distribution of ZDRs throughout human chromosome 14: black lines show ZDR clusters around the TSS in contrast to random quasi-uniform distibution depicted in light grey. 

Source publication
Conference Paper
Full-text available
Z-DNA is an alternative conformation of the DNA molecule implied in regulation of gene expression. However, the exact role of this structure in cell metabolism is not yet fully understood. Here we present a novel Z-DNA analysis workflow using the R software environment which aims to investigate Z-DNA forming regions (ZDRs) throughout the genome. It...

Context in source publication

Context 1
... with PolII and therefore change the occupation rates of the enzyme. This analysis was carried out using public data retrieved from the NCBI Short Reads Archive (SRA) [11], particularly from Joseph et al. [9]. Reads derived from Illumina platform were divided in two groups: (i) cells treated with estradiol and (ii) untreated cells (control). Both groups were then aligned to the human chromosome 14 by the Bowtie aligner [10] with default parameters, except for --best which reports only the best alignments (those with fewer mismatches). The aligned reads were filtered out by a process called “peak calling” which identifies genome enriched areas where reads cluster together [14]. This process was performed in R by the BayesPeak package [4] and the differential occupation fold-change analysis was calculated by the DESeq package. [2] To further investigate the distribution of ZDRs in relation to genomic structure features, we searched for ZDRs within exons, introns, UTRs and intergenic regions. Although this analysis used the same ENCODE annotation file, it differed by being processed through the GenomicFeatures [5] suite so that the genes’ annotation would be subdivided into their features. Then, those features were matched against the ZDRs to return an overall percentage of ZDRs relative to each genomic feature. As stated above, the first step of our analysis was to correlate ZDRs found by Z-Catcher with the gene annotation from ENCODE . Figure 3 shows that our approach was able to reproduce literature findings by showing an overall clus- tering of ZDRs around TSSs [22], as opposed to randomized distances to each TSS. Once the ZDRs are not equally spread over the genome, this distribution suggests that the ZDRs may play a role in [17] or be dependent upon transcription events [12]. To further address this issue, we looked deeper into the distribution and plotted the exact location of ZDRs relative to their nearest TSS and its respective transcript. With this analysis, we wanted to investigate if there was any bias towards specific ZDR hotspots around or within transcripts. Indeed it is possible to observe in Figure 4 that ZDRs seem to be more concen- trated upstream of TSSs, which would corroborate the hypothesis on the Z-DNA relationship with transcription events. To date, no Z-DNA mapping approach has focused on the ZDR distribution throughout the genome in relation to its genomic features. Taking that into account, it is important to further investigate this correlation, since it may re- veal some unknown distribution pattern and may also help to elucidate Z-DNA function. Some works had suggested that the presence of ZDRs within introns would enable and guide the coupling of proteins from the ADAR1 family, which are responsible for mRNA editing [1]. These proteins are not only known to be present in Z-DNA binding sites with high affinity but also to be responsible for the deamination of adenosines to inosines (which are translated as guanines). These editing events act as a source of phenotypic variation [7] and could play an important role at modulation of the nervous system [20]. If it is found that this interaction is dependent on Z-DNA formation, an important function for ZDRs would be revealed. ZDRs found inside transcripts lie almost exclusively within introns, which account for roughly 18% of the total number of detected ZDRs (Figure 4). Con- sidering that genes are composed mostly by intronic sequences, this percentage may not represent a strict preference of the ZDRs’ distribution. Anyhow, in Table 1 it is possible to see five of the transcripts and their associated genes, which exhibit the largest number of ZDRs within introns. Such genes may be good candidates for further investigation of ADAR1 family mechanism of action, which would contribute to understand the potential role of Z-DNA guiding RNA editing enzymes. Even though we were able to show the correlation between ZDRs and TSSs, it still remained unclear if it represents only a by-product of gene transcription events or if ZDRs indeed act as gene expression regulators. To address this point, we analysed whether ZDR presence could modify RNA Polymerase II occupation of transcription start sites. Our dataset of PolII was taken from a ChIP-Seq experiment which analysed the occupation of the promoter region of the ER- α estrogen receptor of MCF-7 cells in two conditions: activated (with presence of its ligand, estradiol) and inac- tivated (controlled set). With this analysis, we were able to investigate whether the differential enrichment of the PolII tags in specific locations may correlate to the presence of ZDRs. We divided our fold-change results in two groups, one with transcripts with up-regulated occupation in the activated condition (fold-change ≥ 2) and the other with transcripts with down-regulated occupation (fold-change < 0.5). Next, we ranked the 100 topmost differentially occupied transcripts of each group and analysed their ZDRs related position. Figure 5 shows that the number of ZDRs overlapped by PolII tags at the 100 topmost differentially occupied transcripts exhibits an interesting pattern. Those ZDRs present in up-regulated genes tend to gather upstream of TSSs while those ZDRs present in down-regulated genes exhibit a weak tendency to gather downstream. This may suggest a positive correlation between ZDRs position and the activation state of genes, consider- ing that the formation of ZDRs upstream could favour gene transcription, while ZDRs downstream could inhibit it. Nevertheless, our data need additional experimental evidence and further statistical analysis to confirm these statements. In this work we developed a workflow for analysis of ZDR regions in animal cells that merges in silico data with experimental ones. The workflow uses several bioconductor packages and retrieves biological data from high throughput sequencing. Hence, one could easily correlate data from ChIP-seq and RNA-seq to ZDR regions in whole chromosomes. Our case study focused on the human chromosome 14, and the results showed that our workflow approach was able to conduct ZDR distribution analysis that corroborates previous studies. It brought as well new information on how those ZDRs spread over the chromosomal sequences. The role of Z-DNA in gene regulation has been debated for a long time. In our case study we showed that the majority of ZDRs appear upstream of the transcripts. We also showed that when accounted for internal genomic features, ZDRs tend to concentrate in introns rather than exons. Although this was expected, it showed that our approach is able to succesfully detect ZDRs’ distribution within transcripts. Hence, one could investigate in other human chromosomes or another species genome the hypothesis of ZDRs serving as anchor sites for Z-DNA binding factors such as ADAR1, which is responsible for RNA edition. The comparison of ZDRs prediction to PolII occupancy in steroid regulated genes suggests differences in ZDRs positioning in relation to TSSs. Up regulated genes seem to concentrate ZDRs upstream of TSSs as opposed to down regulated genes that tends to concentrate ZDRs downstream. However, further experimental studies and statistical investigation are still necessary to convinc- ingly correlate ZDRs to gene expression. We are presently working to assemble the R scripts developed for this workflow in a user friendly R package, where the user will be able to perform similar analysis as those previously shown. Our goal is to deliver an easy and fast way to perform basic distribution analysis associated to biological information in different kinds of genomes, allowing for an efficient computing platform for the Z-DNA biology ...

Similar publications

Presentation
course and tutorial used for teaching
Article
Full-text available
Metastasis is the main cause of breast cancer‑related mortalities. The present study aimed to uncover the relevant molecular mechanisms of breast cancer metastasis and to explore potential biomarkers that may be used for prognosis. Expression profile microarray data GSE8977, which contained 22 stroma samples (15 were from normal breast and 7 were f...
Article
Full-text available
Sorafenib is an inhibitor of a variety of tyrosine kinase receptors used to treat various cancers including hepatocellular, renal cell and thyroid carcinoma. It has been shown to change various targets associated with osteosarcoma, but the detailed mechanism remains unclear. In order to identify key genes, enriched pathways and important modules du...
Preprint
Full-text available
Mitochondria are organelles whose main role is energy production and might influence obesity. They are the only organelles with their own genome. Here we have genotyped 435 patients with type 1 diabetes using Illumina Infinium Omni Express Exome-8 v1.4 arrays and performed mitoGWAS on BMI. We have analyzed additive interactions between mitochondria...

Citations

... Interestingly, three ZDRs co-localize to a single intronic sequence, suggesting that they came from a ZDR hot spot, a genomic region with high propensity to assume the Z-DNA conformation [11]. This result is in contrast to a previous study that predicted in silico ZDR and found that the majority of them (77 %) were outside gene loci [34]. Although only a small number of ZDR were tested, their distribution of in vivo Z-DNA regions may reflect particular physiological states and may not match the distribution of ZDR found in silico, a situation that has also been reported by others [11]. ...
Article
Full-text available
Left-handed Z-DNA is a physiologically unstable DNA conformation, and its existence in vivo can be attributed to localized torsional distress. Despite evidence for the existence of Z-DNA in vivo, its precise role in the control of gene expression is not fully understood. Here, an in vivo probe based on an anti-Z-DNA intrabody is proposed for native Z-DNA detection. The probe was used for chromatin immunoprecipitation of potential Z-DNA-forming sequences in the human genome. One of the isolated putative Z-DNA-forming sequences was cloned upstream of a reporter gene expression cassette under control of the CMV promoter. The reporter gene encoded an antibody fragment fused to GFP. Transient co-transfection of this vector along with the Z-probe coding vector improved reporter gene expression. This improvement was demonstrated by measuring reporter gene mRNA and protein levels and the amount of fluorescence in co-transfected CHO-K1 cells. These results suggest that the presence of the anti-Z-DNA intrabody can interfere with a Z-DNA-containing reporter gene expression. Therefore, this in vivo probe for the detection of Z-DNA could be used for global correlation of Z-DNA-forming sequences and gene expression regulation.