Genomic sequencing strategies

– 2 major strategies:

1.  prepare genomic library (~15-fold coverage of genome)

(e.g. 100-300 kb inserts in BACs)

– less rearrangements

– map to find order, overlaps

– restriction maps, chromosome walking techniques

– map clones with minimal overlaps

– fewest clones needed

– e.g. sequence-tagged connectors (STC)

– sequence ends of all BACs

– use unique sequence from 1 clone

– find overlapping clones

– restriction map

– find clones with minimal overlaps

– or sequence-tagged sites (STS)

– unique regions amplified by PCR

– find clones with fewest of same STS

– minimal overlap

– completely sequence chosen clones

– shotgun clone fragments of BAC into M13 vectors

–>5-fold duplication of sequence (~3,000 clones)

– sequence individual clones

– assemble map from sequence overlap

– assemble each sequence into larger map by overlap

Advantage:  minimizes duplication of sequence efforts

Disadvantage:  mapping clones takes lots of time

2.  newer strategy

– random shotgun clone entire genome into M13 vectors

– ~60 million clones for human genome

– assemble into overlapping sequence by computer analysis

Advantage:  generates large amounts of sequence fast

Disadvantage:  fragmentary sequence

– massive computer job to assemble

– but simplified by identifying known markers in sequence

– e.g. genes, BACs, STS

– allow sequences to be mapped to known sites

shotgun cloning (for either strategy above)

– generate library of ~0.8-1.5 kb M13 vector clones

–with >5-fold duplication of desired sequence

– randomly distributed through sequence

1.  ligate ends of large DNA fragments (~100 kb)

– stops ends from being underrepresented in library

2.  generate random fragments

Methods:

– shearing – break DNA up physically

– using sonication (high speed vibrations)

– or nebulization

(force DNA solution through high pressure siphon)

– droplets contain sheared DNA

Advantages:  totally random fragments

– independent of sequence

Disadvantages – DNA ends frayed & uneven

– repair with T4 DNA polymerase & blunt end clone

– DNase I cleavage – endonuclease

– cleaves (fairly) randomly in DNA sequences

– partial restriction endonuclease cleavage – e.g. CviJI

– cleaves at PyGCPy and PuGCPu – blunt ends

– sites ~32 bp apart (on average)

– blunt ends for cloning

3.  clone in M13 vectors – clones in both directions

4.  sequence and align on computer to find overlaps