One of the clearest evidences that emerged from the eukaryotic genome sequencing projects was the high content in repetitive DNA sequences that these genomes harbour, being the extraordinary genome size variation found between taxa attributed to the differential amplification and deletion of these sequence families. However, despite its abundance, the role(s) that these sequences play in the genomes has always been shrouded in great mystery, as unlike genes, it was never assigned to them the ability to code proteins (at least proteins that are not involved in their own replication and genomic integration, as is the case of Transposable Elements). For this reason, these sequences were initially designated as “junk” DNA, with no function assigned. Presently, these sequences have won the deserved respect and are now regarded as a crucial fraction of eukaryotic genomes, recognized as important regulatory elements and also as being implicated in the occurrence of chromosomal rearrangements, with an important role in genome evolution. This was, precisely, the main goal of this work: to contribute to the understanding of the repetitive sequences significance in the evolution of eukaryotic genomes. For this purpose, it was analysed the repetitive genomic fraction of five Cricetidae/Muridae Rodentia species, with very distinct karyotypes, regarding tandem and dispersed repeats: Satellite DNAs (satDNAs), Interstitial Telomeric Sequences (ITSs) and LINE-1 Retrotransposons. A detailed analysis about the distribution and molecular nature of the Constitutive Heterochromatin (CH) of these rodent genomes was also performed.
The integration of all data allowed to understand how the five studied genomes evolved and to reconstruct the chromosomal evolutionary events elapsed, where the repetitive sequences were unquestionably involved. Indeed, all the results obtained here converge to this same conclusion. Namely, a strong association was observed between both the distribution and the level of CH heterogeneity with the evolutionary pathway that these karyotypes followed. In fact, for two of these species, a detailed analysis on the location of evolutionary breakpoint and CH regions revealed a very high coincidence between them. Other works focused on the evolution of the other three species, reported a similar relationship. Therefore, the repeats located in heterochromatin seem to be highly involved in the occurrence of chromosomal rearrangements, either by promoting directly chromosome reorganizations and/or because correspond to fragile regions prone to chromosome breakage. The analysis of the repeats located in CH regions performed here, namely satDNAs (exclusively heterochromatic), ITSs (mainly located in CH regions) and LINE-1 (frequently located in CH), really suggest the repetitive sequences as a driving force in the occurrence of chromosomal rearrangements. The repetitive nature per se of the different classes of repeats studied favours recombinational events between homologous sequences in non-homologous regions, which may culminate in chromosomal restructurings. Nevertheless the role in the origin of chromosomal reorganizations is particularly proposed for satDNAs, due to its characteristic evolutionary mode (concerted evolution) generally marked by rapid sequence mutations, copy number variations and/or intragenomic movements, driven by different recombinational events (as unequal crossing-over or rolling circle replication/reinsertion), that may induce chromosome breakage.
Additionally, beyond its important function in genome restructuring, the data obtained in this work also suggest other roles to repetitive sequences. An analysis devoted to the transcriptional activity of some of the studied satDNAs supports the role of these sequences in many other functions, as in control of gene expression, chromatin remodelation, cellular response to stress and centromeric function. LINE-1 sequences as well have important functions in control of gene expression, acting in gene imprinting and in X-chromosome inactivation. Thereby, despite initially considered useless genomic elements, in the light of all this data, it is impossible to deny that repetitive sequences are crucial for proper functioning and evolution of eukaryotic genomes, dethroning to our view the importance given in the past just to the protein-coding sequences. It is really now difficult to understand how these sequences, so abundant in eukaryotic genomes, may have been considered unnecessary, just because a coding capacity was not reported. After all, the protein-coding sequences only account for a tiny part of genomes (~1,5% of the human genome).
The present thesis resulted in the elaboration of seven articles that are published, submitted or in preparation for submission to indexed international scientific journals.
KEY WORDS: Repetitive DNA Sequences, Chromosomal Evolution, Tandem Repeats, Long Interspersed Nuclear Elements-1, Rodentia