Manual Identification Protocol

The manual identification protocol is used to identify sequence tags that do not satisfy the criteria specified in the automated annotation protocol. It allows for the identification of sequence tags that otherwise would be left unidentified by automated methods. The sequence tags that require manual identification contain one or more regions we describe as "questionable sequence".

Questionable sequence is defined as sequence derived from a poor quality section of an ABI trace file. This section may have long stretches of homopolymeric regions, poor resolution, overlapping peaks, or peaks obscured by background noise. Homopolymeric regions occur because the polymerase has difficulty processing a repeated stretch of one base type, sometimes referred to as slippage or "stuttering". The most common case is slippage occurring during a poly-A stretch, causing sequencing irregularities downstream of the poly-A region. Another cause of poor quality or questionable sequence is poor resolution in the gel. Poor resolution is more common near the bottom of the gel, where it becomes more difficult to call bases accurately. Also, overlapping peaks may indicate multiple products. This type of noise is especially problematic when signal strengths are low.

If questionable sequence occurs

For all manually edited sequences, we urge you to examine the ABI trace file and to BLAST the sequence tag in order to understand the basis for the identification. In addition, the manually identified sequence tags are regularly subjected to the automated process and may be updated.

Because these cell lines contain anomalies in their ABI trace files, resequencing of manually identified cell lines is strongly recommended before use in biological experiments.

 


Copyright 2001 Regents of the University of California. All rights reserved.