Bait Tiling masking strategies and stringency levels |
|
In a Bait Tiling job, eArray can exclude baits that cover repetitive sequences within the tiling region. Depending on the genome species and stringency level you select, eArray uses one or more of the following masking tools to determine if a sequence is repetitive.
RepeatMasker – eArray obtains these sequences directly from UCSC (http://hgdownload.cse.ucsc.edu/downloads.html). They are generated using RepeatMasker and Tandem Repeat Finder (with a period of 12 or less).
WindowMasker – To generate these sequences, eArray obtains the unmasked sequences from UCSC and then masks repetitive regions using the WindowMasker tool from NCBI with its default parameters.
Uniqueness 35 track – eArray uses this Duke Uniqueness track to find 35mer sequences that occur more than 5 times in the tiling interval.
You have 4 options for the level of stringency eArray uses in designating repetitive regions: Most stringent, Moderately stringent, Least stringent, and No masking. The default masking selection is Moderately stringent masking.
When you select No masking, eArray does not mask any sequences and creates baits across the entire tiling interval.
When you select a stringency of Most, Moderate or Least, eArray masks sequences based on one or more of the masking tools described above. Because different species may have different masked sequence sets available, the criteria for the stringency options are dependent on the species you specify.
For the H. sapiens genome, the stringency criteria are:
Least stringent masking – A sequence must be masked by RepeatMasker, WindowMasker, and the Duke Uniqueness 35 track in order to be masked by eArray. Because a sequence must be identified as repetitive by all 3 masking tools, this option results in the least stringent masking of your tiling interval.
Moderately stringent masking – A sequence must be masked by RepeatMasker and WindowsMasker in order to be masked by eArray. It does not need to be in the Duke Uniqueness 35 sequence set.
Most stringent masking – A sequence must be masked by RepeatMasker in order to be masked by eArray. It does not need to be in the WindowMasker or Duke Uniqueness 35 sequence sets.
For non-human genomes, such as mouse (M. musculus) and rat (R. norvegicus), the Least stringent option and the Moderately stringent option use the same criteria because the Duke Uniqueness 35 track is not available. Some genomes also do not have a RepeatMasker sequence set available (e.g. Arabidopsis thaliana). For these species, all 3 stringency options use the same criteria. Consult the table below for a complete list of the criteria for each stringency level by species.
|
Least stringent |
Moderately stringent |
Most stringent |
A. thaliana |
WindowMasker |
WindowMasker |
WindowMasker |
B. taurus |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
B. taurus_UMD |
WindowMasker |
WindowMasker |
WindowMasker |
C. elegans |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
C. familiaris |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
C. jacchus |
WindowMasker |
WindowMasker |
WindowMasker |
D. melanogaster |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
D. rerio |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
G. gallus |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
H. sapiens |
WindowMasker RepeatMasker Uniqueness 35 |
WindowMasker RepeatMasker |
RepeatMasker |
M. mulatta |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
M. musculus |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
O. cuniculus |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
O. latipes |
WindowMasker |
WindowMasker |
WindowMasker |
O. sativa |
WindowMasker |
WindowMasker |
WindowMasker |
R. norvegicus |
WindowMasker RepeatMasker |
WindowMasker RepeatMasker |
RepeatMasker |
S. cerevisiae |
WindowMasker |
WindowMasker |
WindowMasker |
S. pombe |
WindowMasker |
WindowMasker |
WindowMasker |