X1-11 | X12-20 | 2L | 2R | 3L | 3R | 4 | Main |
The Release 2 (October 2000) sequence of the Drosophila genome, produced by the BDGP and Celera Genomics, is a whole-genome shotgun sequence assembly with ~1300 gaps and a limited number of regions of low sequence quality. We are in the process of producing Release 3. Our aims for Release 3 are:
For Release 3, Celera has provided the BDGP with the primary sequence trace data from the whole-genome shotgun project, with the traces sorted in the order in which they occur in the Release 2 sequence assembly. We are producing Release 3 by a six-step process.
Chromosome arms 2L, 2R, 3R, and 4 and numbered divisions 12-20 of the X chromosome are being finished at Lawrence Berkeley National Laboratory; questions on this portion of the genome sequence may be directed to [email protected]. Chromosome arm 3L and numbered divisions 1-11 of the X chromosome are being finished at the Human Genome Sequencing Center at Baylor College of Medicine; questions on this portion of the genome sequence may be directed to David Wheeler ([email protected]) and Steve Scherer ([email protected]).
The finishing strategy is as follows:
Once sequences have passed through steps 1-5, we submit the BAC-based sequences to GenBank, and we make the BAC-based and concatenated large sequence contigs publicly available through this web site. This Release 2.5 sequence is of high quality and contains no gaps.
Note on accession numbers: BACs that are verified and submitted to NCBI have their GenBank accession numbers displayed next to the BAC name on the assembly page. BACs that are part of finished regions but do not yet have accession numbers are marked with an asterisk. These are generally finished to Phase 3 standards but have not completed all of the quality control checks. Some of these BACs may have old records available at GenBank, but the sequence in those records may differ from the working sequence available here.
Click on any of the chromosome arm names to see the clones and assembled segments for that arm.
Arm | Bases in assembled segments |
---|---|
X1-11 | 11405022 |
X12-20 | 8788192 |
2L | 22205349 |
2R | 20300755 |
3L | 23088932 |
3R | 27902919 |
4 | 1236870 |
# segments | 37 |
Total bases in segments | 114928039 |
Average segment size | 3106163 |
Largest segment size | 27902919 |
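
The summary rows of the table can be cross-checked against the per-arm figures. The following is a minimal Python sketch, not part of the BDGP pipeline: the variable names are illustrative, and it assumes that the average segment size is the total divided by the number of segments, truncated to whole bases.

```python
# Per-arm base counts copied from the table above.
arm_bases = {
    "X1-11": 11405022,
    "X12-20": 8788192,
    "2L": 22205349,
    "2R": 20300755,
    "3L": 23088932,
    "3R": 27902919,
    "4": 1236870,
}
num_segments = 37  # value of the "# segments" row

# Sum of all arms; should match "Total bases in segments".
total_bases = sum(arm_bases.values())

# Assumption: the average is truncated (not rounded) to whole bases.
average_segment = total_bases // num_segments

# Arm with the most assembled bases.
largest_arm = max(arm_bases, key=arm_bases.get)

print("Total bases:", total_bases)
print("Average segment size:", average_segment)
print("Largest arm:", largest_arm, arm_bases[largest_arm])
```

Under these assumptions the computed total and average agree with the table. The largest-segment figure equals the 3R total, which is consistent with that arm being assembled as a single segment, although the table alone does not confirm this.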