Queen Mary, University of London

Graduate Student, School of Biological and Chemical Sciences

James Cotton
Richard Nichols

About

For any two species, there is not actually a single time back to their common ancestor. In fact we expect to see different times for different loci, because of the variation in times back to a common ancestor of two lineages in the common ancestral population (coalescence process). The differences in timing can be so large that, in some cases, different loci may actually show different phylogenetic branching patterns. On top of these differences in timing, different loci can have different mutation rates. How then are we to combine information from different loci?

The issue of rate variation has been addressed in more recent molecular-dating techniques but since we know little of the underlying pattern of rate variation, it is challenging to evaluate its accuracy and precision. Techniques such as bayesian relaxed clock methods and autocorrelated models attempt to model the pattern of variation between lineages without trying to understand the process, ie. the causes of this variation.

All molecular genetic inferences need to take this effect into account, yet most attempts ignore this huge variation in timing completely. It will affect fundamental questions such as "Does 'speciation' occur at the same time for different parts of the genome?", "Do different loci have different rates of evolution?", "Are there mutation hotspots?", "Does the rate of substitution vary along different evolutionary branches?" and "What was the ancestral population size?".

The genomic data that have recently accumulated provide an exciting and unexploited opportunity to investigate the fundamental evolutionary processes affecting the variation in substitution rate between loci, as orthologous sequences can now be obtained from many species. This abundance of genomic data will also allow us to use a large number of loci to estimate the variation within the species due to stochastic noise (coalescence process).

I am currently working on a model which attempts to sample parameters such as the divergence time of the species and the rate of coalecence while taking into account variation in the coalescence process, variation in the rate of mutation between loci and the stochastic nature of the substitution process. This will allow me to estimate the ancestral population size of the species in question and the time of their divergence. The model can then be expanded to include rate of recombination, non-silent sites, functionality and positions in the genome (ie. active sites will have different evolutionary constraints than non-essential parts of a protein sequence).

 

x

Log In

or reset password

Reset Password

Enter the email address you signed up with, and we'll send a reset password email to that address

Academia © 2012