At least 40 human disorders, including Huntington disease and myotonic dystrophy, are caused by the expansion of a simple sequence repeat. A larger number of repeats is associated with a more severe form of the disease. Expanded disease-associated alleles are highly unstable and frequently expand during intergenerational transmission, accounting for the anticipation observed in these disorders. Expanded disease-associated alleles are also unstable in the soma, in a process that is age-dependent, tissue-specific, and expansion-biased. Notably, large expansions accumulate in affected tissues, such as the brain in Huntington disease and muscle in myotonic dystrophy, driving the tissue specificity and progressive nature of the symptoms. We have also established that residual variation in age at onset and disease severity not accounted for by inherited repeat length is associated with residual variation in somatic expansion rates not accounted for by inherited repeat length and age (i.e., individuals in whom the repeat expands more rapidly develop earlier and more severe symptoms than expected) [3-6]. As such, preventing somatic expansion is a novel therapeutic approach in these disorders. Insights from animal and cell models have revealed that expansions are critically dependent on the DNA mismatch repair pathway. Using candidate gene and genome-wide association studies, we have also revealed that common polymorphisms in some DNA repair genes modify the rate of somatic expansion and disease severity in both myotonic dystrophy type 1 and Huntington disease [5,7-10]. Powerful as such approaches are, the application of genome-wide association studies in the repeat expansion disorders is limited by the rarity of the conditions, which generally precludes the assembly of the very large cohorts needed to conduct them.
Although expansions at the disease-associated loci are rare, at least one locus, ERDA1, presents with a high frequency of expanded alleles (~20%) in the general population. These alleles are not associated with a disease state, but are genetically unstable. We hypothesise that, as we have done at the Huntington disease [5,11] and myotonic dystrophy type 1 loci, we can use high-throughput ultra-deep sequencing to derive individual-specific measures of mutational dynamics that act as biomarkers of genetic instability and can be used as molecular phenotypes in genome-wide association studies. To this end, the student will address the following aims:
- Develop an assay. Develop a high-throughput ultra-deep sequencing assay for the triplet repeat at the ERDA1 locus.
- Determine the range of ERDA1 alleles present in the general population. Sequence large numbers of alleles in the general population (from the Generation Scotland collection) to derive the allele length distribution and identify potential variant repeats, which we have shown to have a profound effect as cis-acting modifiers of somatic mutational dynamics and disease severity in myotonic dystrophy type 1 and Huntington disease [5,12].
- Measure somatic instability. The data generated will be used to quantify the degree of somatic mosaicism in the general population and determine the role of sequence purity, allele length and age in mediating the degree of somatic instability.
- Identify therapeutic targets. After correcting for sequence purity, allele length and age, residual variation in somatic mosaicism will be used as a molecular phenotype in a genome-wide association study in the Generation Scotland cohort. The results are expected to highlight novel candidate therapeutic targets for the repeat expansion disorders.
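As an illustration of the correction step in the final aim, the following is a minimal sketch of how a residual phenotype could be derived. It assumes an ordinary least-squares model with an intercept; the project description does not specify the statistical model, and a real analysis may well use transformed variables or interaction terms. The function names, the covariate order (allele length, age, sequence purity) and the example values are all hypothetical.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]; copy rows so the inputs are not modified.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("covariates are collinear; model cannot be fitted")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def residual_phenotype(
    instability: Sequence[float],
    covariates: Sequence[Sequence[float]],
) -> List[float]:
    """Regress instability on the covariates (plus intercept); return residuals.

    Each covariate row is assumed to hold (allele length, age, sequence purity)
    for one individual. The residual is the part of that individual's somatic
    instability not explained by these covariates.
    """
    rows = [[1.0] + [float(v) for v in c] for c in covariates]
    k = len(rows[0])
    # Normal equations: (X^T X) beta = X^T y.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, instability)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    return [
        y - sum(b * v for b, v in zip(beta, r))
        for r, y in zip(rows, instability)
    ]


if __name__ == "__main__":
    # Made-up values for illustration only; not real data.
    covs = [(50, 30, 1.0), (55, 45, 0.9), (60, 25, 1.0),
            (65, 60, 0.8), (70, 40, 0.95), (52, 70, 1.0)]
    scores = [4.1, 5.2, 4.6, 7.0, 6.3, 5.9]
    for individual, res in enumerate(residual_phenotype(scores, covs)):
        print(individual, round(res, 3))
```

In this sketch, a positive residual marks an individual whose repeat is more unstable than their allele length, age and sequence purity would predict. These per-individual residuals are the kind of quantity that could then serve as the molecular phenotype in the association study.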
The studentship will provide training in state-of-the-art DNA sequencing technologies, bioinformatics (including use of the Galaxy platform), computational modelling, and core skills such as experimental design and the use of mathematics and statistics in handling large datasets.