Show simple item record

Depositordc.contributorBarker, Daniel
Funderdc.contributor.otherNSF - National Science Foundationen_UK
Data Creatordc.creatorDarby, Charlotte
Data Creatordc.creatorStolzer, Maureen
Data Creatordc.creatorRopp, Patrick
Data Creatordc.creatorBarker, Daniel
Data Creatordc.creatorDurand, Dannie
Citationdc.identifier.citationDarby, Charlotte; Stolzer, Maureen; Ropp, Patrick; Barker, Daniel; Durand, Dannie. (2016). Preprocessed gene trees from "Xenolog Classification", [dataset].
Persistent Identifierdc.identifier.uri
Persistent Identifierdc.identifier.uri
Dataset Description (abstract)dc.description.abstractPreprocessed gene trees from 49 taxa of Cyanobacteria and 16 Proteobacteria used by Darby et al. (2017, Bioinformatics 33:640-649; doi:10.1093/bioinformatics/btw686) are provided here. These consist of trees for 13623 of the gene families of Latysheva et al. (2012, doi:10.1093/bioinformatics/bts008). Using Notung- (Stolzer et al. 2012, doi:10.1093/bioinformatics/bts386), unrooted trees with bootstrap support (out of 200; doi:10.7488/ds/1485) were preprocessed with the following steps: (1) root with DTL (duplication, transfer, loss) model --costdup 3 --costtrans 2.5 --costloss 2; (2) rearrange with DL model --costdup 3 --costloss 2 --threshold 90%; (3) reroot with DTL model --costdup 3 --costtrans 2.5 --costloss 2. Both zip files contain the output from this pipeline. The Newick format trees reflect the tree topology after this process. The Notung format trees also include the reconciliation, and other meta data. The Notung format is described in detail in Appendix A (File Formats) of the Notung manual ( ## Acknowledgment ## This work was funded by the National Science Foundation grant DBI-1262593. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.en_UK
Dataset Description (TOC) 13623 preprocessed trees in Newick format. The start of each filename gives the orthologous group number. Tip labels in trees consist of the protein accession, followed by an underscore, then the three-letter abbreviation for the taxon. Orthologous group numbers and taxon abbreviations are as in Darby et al. (2016, "Xenolog classification", Bioinformatics, submitted) and Latysheva et al. (2012, 10.1093/bioinformatics/bts008). 13623 preprocessed trees in Notung format. The start of each filename gives the orthologous group number. Tips are labelled as in
Relation (Is Referenced By)dc.relation.isreferencedby
Relation (Is Referenced By)dc.relation.isreferencedbyBarker, D., Darby, C., Stolzer, M., Ropp, P., & Durand, D. (2016). Xenolog Classification. Bioinformatics. DOI: 10.1093/bioinformatics/btw686en
Rightsdc.rightsCopyright and related rights waived via CC0 1.0 Public Domain Dedication (
Subjectdc.subjectGene familiesen_UK
Subjectdc.subjectGene tree-species tree reconciliationen_UK
Subject Classificationdc.subject.classificationBiological Sciencesen_UK
Titledc.titlePreprocessed gene trees from "Xenolog Classification"en_UK

Download All
zip file MD5 Checksum: 842c5b75421325c339a4bfe7e079950d

Files in this item


This item appears in the following Collection(s)

  • Barker Group
    Bioinformatics education, phylogeny reconstruction and philosophy of science.

Show simple item record