OMA

Keyword Protein Sequence Group Entry

OMA Download – April 2009

The entire OMA database is available for download in several formats. It is also possible to download each group separately. This option is available in the group view. Please read our terms and conditions before integrating OMA data into your own research or database.

Orthology Relationships
The orthology relationships are available in two types: groups or pairs of orthologs. The information is given in terms of OMA identifiers (of the form HUMAN04376).
OMA groups: downloadoma-groups.txt.gz
Pairwise orthologs: downloadoma-pairs.txt.gz (all pairs)
Sequences
All sequences with the corresponding OMA identifiers can be downloaded in fasta files. The proteins are all in one file, while the coding DNA is split into two files, one for the Eukaryotes and one for the Prokaryotes.
Protein sequences: downloadoma-seqs.fa.gz
cDNA Eukaryotes: downloadeukaryotes.cdna.fa.gz
cDNA Prokaryotes: downloadprokaryotes.cdna.fa.gz
Identifier Mapping
Mappings of the OMA identifier to various other databases are available:
Mapping to UniProt: downloadoma-uniprot.txt.gz
Mapping to Ensembl: downloadoma-ensembl.txt.gz
Mapping to NCBI: downloadoma-ncbi.txt.gz
Mapping to GO: downloadoma-go.txt.gz
Mapping to Wormbase: downloadoma-wormbase.txt.gz
Yeast mapping: downloadoma-yeast.txt.gz
Mapping to JGI: downloadoma-jgi.txt.gz