Publications - Published papers

Please find below publications of our group. Currently, we list 53 papers. Some of the publications are in collaboration with the group of Peter Stadler and are also listed in the publication list for his group. Access to published papers (access) is restricted to our local network and chosen collaborators.
If you have problems accessing electronic information, please let us know:

©NOTICE: All papers are copyrighted by the authors; If you would like to use all or a portion of any paper, please contact the author.

Quantitative Comparison of Genomic-Wide Protein Domain Distributions

Arli A. Parikesit , Peter F. Stadler & Sonja J. Prohaska


PREPRINT 10-034: [ PDF ]


GCB2010 conference proceeding. Vol P-173: pp 93-102


Investigations into the origins and evolution of regulatory mechanisms require quantitative estimates of the abundance and co-occurrence of functional protein domains among distantly related genomes. Currently available databases, such as the SUPERFAMILY, are not designed for quantitative comparisons since they are built upon transcript and protein annotations provided by the various different genome annotation projects. Large biases are introduced by the differences in genome annotation protocols, which strongly depend on the availability of transcript information and well-annotated closely related organisms. Here we show that the combination of de novo gene predictors and subsequent HMM-based annotation of SCOP domains in the predicted peptides leads to consistent estimates with acceptable accuracy that in particular can be utilized for systematic studies of the evolution of protein domain occurrences and co-occurrences. As an application, we considered four major classes of DNA binding domains: zink-finger, leucine-zipper, winged-helix, and HMG-box. We found that different types of DNA binding domains systematically avoid each other throughout the evolution of Eukarya. In contrast, DNA binding domains belonging to the same superfamily readily co-occur in the same protein.