I had to wait for a long time to actually write about a project of my own. I had been thinking about this problem for at least 2 years before finding the solution. The problem I wanted to address was a classic question: how can you map a mutation? If you do a selection and find a mutant which exerts your phenotype of interest, how would you go around finding it? The classic approach involves painstakingly going through a set of known markers around the genome and calculate the linkage between the known marker and your site of mutation. Then choose the closest marker and redo the linkage analysis with more markers close to that site, until the location becomes small enough for direct PCR amplification and sequencing. But this whole protocol, takes weeks if not months and it is quite labor-intensive. Especially, if your mutant carries multiple effective mutations, finding them becomes more complicated.
With the advent of whole-genome sequencing or re-sequencing of bacterial genomes can help us find all the mutations in the genome. But in many cases, it is not obvious whether a mutation is responsible in eliciting a certain phenotype or not. In other words, every single mutation should then be tested for its effect on the phenotype of interest.
To tackle this problem, we developed ADAM as a whole-genome approach that enables you to find all the adaptive mutations (and not silent mutations) in a matter of days. ADAM, uses parallel, genome-wide linkage analysis to simultaneously identify all mutated loci with direct fitness contributions. ADAM has three components:
1. A library of selectable markers embedded in the DNA of the parental strain.
2. A mechanism for transferring markers from the parental strain's library into the evolved strain in such a way that DNA from the parental strain adjacent to the marker replaces the corresponding DNA in the evolved strain.
3. A method for measuring the frequency of markers throughout the genome.
As the first component, we used a high coverage library of kanamycin-marked transposon insertions all across the wild-type genome. We then trasnfer the markers from the WT genome to the evolved background using P1 transduction. In this step, if a transposon is close to a site of mutation, the recombination results in the correction of this mutation back to the WT genotype. Now, in this secondary transposon library in the mutant background, the markers whose introduction to the genome has been accompanied by the loss of an adaptive mutation will be at a relative disadvantage under the selective conditions. To find the location of these transposons, we compare the frequency of transposon insertion events in each locus under selective and non-selective conditions. Adaptive mutations can then be discovered in locations in which a stretch of loci show marker depletion upon selection. For this step, we use a microarray-based whole genome footprinting.
We used ADAM to find a known CmlR cassette as a proof-of-principle. We then used this method to identify adaptive mutations in lab-evolved strains (growth on Asn and ethanol tolerance).