The DNA sequence of chromosome I of an African trypanosome: gene content, chromosome organisation, recombination and polymorphism.
Hall N., Berriman M., Lennard NJ., Harris BR., Hertz-Fowler C., Bart-Delabesse EN., Gerrard CS., Atkin RJ., Barron AJ., Bowman S., Bray-Allen SP., Bringaud F., Clark LN., Corton CH., Cronin A., Davies R., Doggett J., Fraser A., Grüter E., Hall S., Harper AD., Kay MP., Leech V., Mayes R., Price C., Quail MA., Rabbinowitsch E., Reitter C., Rutherford K., Sasse J., Sharp S., Shownkeen R., MacLeod A., Taylor S., Tweedie A., Turner CMR., Tait A., Gull K., Barrell B., Melville SE.
The African trypanosome, Trypanosoma brucei, causes sleeping sickness in humans in sub-Saharan Africa. Here we report the sequence and analysis of the 1.1 Mb chromosome I, which encodes approximately 400 predicted genes organised into directional clusters, of which more than 100 are located in the largest cluster of 250 kb. A 160-kb region consists primarily of three gene families of unknown function, one of which contains a hotspot for retroelement insertion. We also identify five novel gene families. Indeed, almost 20% of predicted genes are members of families. In some cases, tandemly arrayed genes are 99-100% identical, suggesting an active process of amplification and gene conversion. One end of the chromosome consists of a putative bloodstream-form variant surface glycoprotein (VSG) gene expression site that appears truncated and degenerate. The other chromosome end carries VSG and expression site-associated genes and pseudogenes over 50 kb of subtelomeric sequence where, unusually, the telomere-proximal VSG gene is oriented away from the telomere. Our analysis includes the cataloguing of minor genetic variations between the chromosome I homologues and an estimate of crossing-over frequency during genetic exchange. Genetic polymorphisms are exceptionally rare in sequences located within and around the strand-switches between several gene clusters.