Large-scale statistical analysis of secondary xylem ESTs in pine.

Pavy N.; Laroche J.; Bousquet J.; Mackay J.

Large-scale statistical analysis of secondary xylem ESTs in pine.

Pavy N., Laroche J., Bousquet J., Mackay J.

A computational analysis of pine transcripts was conducted to contribute to the functional annotation of conifer sequences. A statistical analysis of expressed sequential tags(ESTs) belonging the 7732 contigs in the TIGR Pinus Gene Index (PGI1.0) identified 260 differentially represented gene sequences across six cDNA libraries from loblolly pine secondary xylem. Cluster analysis of this subset of contigs resulted in five groups representing genes preferentially represented in one of the xylem samples (compression wood, plannings, root xylem, latewood) and one group containing mostly genes simultaneously present in compression and side wood libraries. To complement the sequence annotation, 27 cDNA clones representing selected transcripts were completely sequenced. Several genes were identified that could represent putative markers for xylem from different organs, at different stages of development. Several sequences encoding regulatory proteins were over-represented in root xylem as opposed to the other xylem samples. Some of them belonged to known families of plant transcription factors, but two genes were previously uncharacterized in plants. One transcript was homologous to the gene encoding the Smad4 interacting factor, a key co-activator in TGFbeta (transforming growth factor) signalling in animals. Thus, the digital analysis of pine ESTs highlighted a putative gene function of potentially broad interest but that has yet to be investigated in plants. More generally, this study showed that the application of numerical approaches to EST databases should be helpful in establishing priorities among genes to consider for targeted functional studies. Thus, we illustrated the potential of extracting information from conifer sequences already accessible through well-structured public databases.

Original publication

DOI

10.1007/s11103-004-6969-7

Type

Journal article

Journal

Plant Mol Biol

Publication Date

01/2005

Volume

Pages

203 - 224

Keywords

Amino Acid Sequence, Cluster Analysis, DNA, Complementary, Data Interpretation, Statistical, Expressed Sequence Tags, Gene Expression Profiling, Gene Expression Regulation, Plant, Gene Library, Molecular Sequence Data, Pinus, Plant Structures, Reproducibility of Results, Reverse Transcriptase Polymerase Chain Reaction, Sequence Alignment, Sequence Analysis, DNA, Sequence Homology, Amino Acid

Cookies on this website

Large-scale statistical analysis of secondary xylem ESTs in pine.

Pavy N., Laroche J., Bousquet J., Mackay J.

DOI

Type

Journal

Publication Date

Volume

Pages

Keywords