Click here to return to the Mouse Strain Comparison application

Requirements:

One of
  • Firefox 3.0 or Greater
  • Internet Explorer 7 or Greater
and JavaScript must be enabled (this is typically the default setting for Firefox and Internet Explorer).

Overview:

Using this application in the simplest possible way you can determine which genomic intervals are either a match or a miss-match between two selected strains. Beyond that you can build a more complicated expression by AND'ing and OR'ing the pairwise strain comparisons together. For example if I want to see all of the interval where B6 matches C3H, but where both of these are different from NZO I would create an expression like: (B6 = C3H) AND (B6 ≠ NZO). Note that because of the equivalence constraint on B6 and C3H the expression (B6 = C3H) AND (C3H ≠ NZO) will give the same exact results.

Genome Selection:

The genome data that we currently have available is the imputed SNP dataset that is available from the CGD data site.

Defining Equivalence:

The most important thing to keep in mind when you are defining the equivalence parameters for strains is that unequal intervals are just the inverse of equivalent intervals. What this means is that when you specify that 10 SNPs in a row must match before an interval is defined as "equivalent" that guarantees that every interval returned by the expression (B6 = C3H) will span at least 10 SNPs, but it does not mean that every interval returned by (B6 ≠ C3H) will span 10 SNPs . In fact they can be single SNP intervals!

Building the Expression:

The expression builder lets you create boolean strain comparison expressions by AND'ing and OR'ing together pairwise strain expressions. You can insert new pairwise expressions by clicking the (+) button or remove them by clicking on the (-) button. You can also use the drop-down options to change the strains being compared, the relationship you are looking for (= or ≠) and whether a given row will be AND'd or OR'd with the next pairwise expression.

Let us take a look at an example of how this can be used. If you have four "high" strains (ie. have high values for a phenotype of interest) named "A", "B", "C" and "D" and you have three low strains named "X", "Y" and "Z", you may want to examine the regions where all the high strains match, all of the low strains match AND none of the high strains match the low strains. We can test this relationship with the following expression: (A=B) AND (A=C) AND (A=D) AND (X=Y) AND (X=Z) AND (X≠A)

The Graph:

The graph has a "refresh" button that allows you to update the graph whenever parameters have been modified. This button is only enabled if there are changes that have not yet been applied to the graph. There is also a "busy indicator" that lets you know whether or not the graph is being updated. If you want to save the graph you should be able to right click on it for download.

Data:

The data links allow you to download a comma-seperated files which contain all of the intervals that were specified. You can either download the entire genome at once, or a single chromosome at a time. Note that these files can be used directly as input for the Genome Interval Overlap Calculator.