Open Tree of Life reference taxonomy version 2.9
Version 2.9 draft 12 was generated on 12 October 2015. Draft 12 is the final draft in the 2.9 series of drafts.
Download (gzipped tar file, 91 Mbyte)
All files are encoded UTF-8. For documentation about file formats, see the documentation in the reference taxonomy
taxonomy.tsv: The file that contains the taxonomy.
synonyms.tsv: The list of synonyms.
conflicts.tsv: Report on taxa from input taxonomies that are
hidden because they are paraphyletic with respect to a higher
taxon from a higher priority input taxonomy. Number in first column is depth in taxonomic tree of
nearest common ancestor of its children.
deprecated.tsv: List all taxon ids occurring in phylesystem studies that have been deprecated since previous version.
log.tsv: Debugging information related to homonym resolution.
version.tsv: The version of OTT.
forwards.tsv: Forwarding pointers - a list of OTT ids that are
retired and should be replaced by new ones (usually due to
weaklog.csv: internal debugging tool
The reference taxonomy is an algorithmic combination of several
source taxonomies. For code,
source code repository.
Version 2.9 draft 12 was generated using
Any errors in OTT
should be assumed to have been introduced by the Open Tree of Life
project until confirmed as originating in the source taxonomy.
Download locations are for the particular versions used to construct
OTT 2.9. For new work, current versions of these sources should be
DS Hibbett, M Binder, JF Bischoff, M Blackwell, et al.
A higher-level phylogenetic classification of the Fungi.
Mycological Research 111(5):509-547, 2007.
Newick string with revisions
archived at http://figshare.com/articles/Fungal_Classification_2015/1465038.
Download location: http://purl.org/opentree/ott/??TBD??
Taxonomy from: SILVA 16S ribosomal RNA database, version 115.
See: Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J,
Glöckner FO (2013) The SILVA ribosomal RNA gene database project:
improved data processing and web-based tools.
Nucleic Acids Research 41 (D1): D590-D596.
Web site: http://www.arb-silva.de/.
Download location: ftp://ftp.arb-silva.de/release_115/Exports/tax_ranks_ssu_115.csv.
Download location: derived from database query result files provided by Paul
Kirk, 7 April 2014 (personal communication).
Web site: http://www.indexfungorum.org/.
Download location (converted to OTT format): http://purl.org/opentree/ott/??TBD??.
Schäferhoff, B., Fleischmann, A., Fischer, E., Albach, D. C., Borsch,
T., Heubl, G., and Müller, K. F. (2010). Towards resolving Lamiales
relationships: insights from rapidly evolving chloroplast
BMC evolutionary biology 10(1), 352..
Manually transcribed from the paper and converted to OTT format.
Download location: http://purl.org/opentree/ott/ott2.8/inputs/lamiales-20140118.tsv
World Registry of Marine Species (WoRMS) - harvested from web site using web API over several days ending around 1 October 2015.
NCBI Taxonomy, from the
US National Center on Biotechnology Information.
Web site: http://www.ncbi.nlm.nih.gov/Taxonomy/.
Current version download location:
For OTT 2.9 we used a version downloaded from NCBI on 6 October 2015.
Download location: http://purl.org/opentree/ott/??TBD??.
GBIF Backbone Taxonomy, from the
Global Biodiversity Information facility.
Current version download location:
We used a version dated 2013-07-02.
Download location: http://purl.org/opentree/gbif-backbone-2013-07-02.zip.
Interim Register of Marine and Nonmarine Genera (IRMNG), from CSIRO.
Current version download location:
We used a version dated 2014-01-31. Download location:
Taxon identifiers are carried over from OTT 2.8 when possible
It has been requested that we relay the following statement:
REUSE OF IRMNG CONTENT:
The Open Tree Taxonomy does not reproduce its sources in their
entirety or in their original form of expression, but only uses
limited information expressed in them. See "Scientific names of
organisms: attribution, rights, and licensing" (http://dx.doi.org/10.1186/1756-0500-7-79)
regarding use of taxonomic information and attribution.
Where taxonomies conflict regarding taxon relationships, they are
resolved in favor of the higher priority taxonomy. The priority
ordering is as given above, with the following exceptions:
The non-Fungi content of Index Fungorum is separated from the Fungi
content and given a priority lower than NCBI but higher than GBIF.
The non-Malacostraca content of WoRMS is separated from the
Malacostraca content and given a priority lower than NCBI but higher
Changes since OTT 2.8 (a.k.a 2.8draft5) which was built on 11 June 2014:
- Identifiers: 3528349
- Visible: 2628944
- Synonyms: 867366
- In deprecated file (used in phylesystem): 2451
- In deprecated file (used in synthesis): 368
- Source taxa dissolved due to conflict (conflicts.tsv): 1054
- unplaced - similar to incertae_sedis (this means a child of an
inconsistent taxon, where t is inconsistent if it occurs in a
lower-priority taxonomy but is inconsistent with the higher-priority
taxonomies. 'tattered' is now deprecated)
- unplaced_inherited - descends from a placed taxon.
- inconsistent (formerly 'tattered') - taxon in lower priority
taxonomy that is inconsistent (see above).
- merged - this taxon was consistent with another and got folded
into it. Taxon is hidden, children aren't. Taxon may be
revived if it's learned later that the it is actually different.
- was_container - treat same as incertae_sedis, merged, and
inconsistent - the 'taxon' was formerly a 'bucket' but is now empty and is
preserved as a placeholder.
- extinct - replaces extinct_direct.
- major_rank_conflict - replaces major_rank_conflict_direct.
- incertae_sedis - (former) child of an incertae sedis container.
- sibling_lower is deprecated, that information is not recorded (but you can
always tell, just by looking at ranks of the siblings). sibling_higher
- Deprecated: tattered, tattered_inherited
Specific content changes (inputs):
- Added WoRMS
- Updated Hibbett 2007 from http://figshare.com/articles/Fungal_Classification_2015/1465038
- Minor IF update (to 7 April 2014 and modified processing software)
- Minor GBIF update (same origin content, modified processing, much faster)
- NCBI update (6 October 2015)
- Fixes for many bugs reported in feedback and reference-taxonomy repos (see milestones)
Generic content changes (processing):
- 'Lumping' is now allowed more promiscuously than before. E.g. if NCBI
has names A and B with B a synonym of A, and GBIF has A and B as separate
taxa, then GBIF's A and B will both map to NCBI's A.
- New file forwards.tsv gives replacement ids for some ids that no
longer exist in the taxonomy. E.g. if A and B were separate in an earlier
version of OTT, and 'lumped' in this version, then there will be
a row in forwards.tsv mapping B's old id to A's id.
- The "unique names" column shows the highest distinguishing taxon, e.g. "Morganella
(genus in kingdom Fungi)" instead of the lowest "Morganella (genus in family
- Somewhat more informative deprecated.tsv
- Deprecated.tsv file is now restricted to taxa mentioned in phylesystem,
and includes not only deprecated ids but also newly hidden ids (those
that were not hidden in 2.8, but are hidden now)
- As a heuristic, taxa that come only from PaleoDB are marked extinct
- 'skeleton' feature replaces 'pinning' for homonym separation (see
tax/skel/ for list of barrier nodes)