... | ... | @@ -3,6 +3,7 @@ title: Functional annotations |
|
|
---
|
|
|
|
|
|
|
|
|
|
|
|
Functional annotation helps bringing biological meaning to genetic sequences. Functional annotation is usually obtained through protein sequence similarity. Indeed, across two organisms, if two sequences are very similar, one can infer that they can encode for the same biological function.
|
|
|
There are several main parameters that will impact the process of functional annotation:
|
|
|
- how distant is the species which was actually annotated with experimental data (the reference)
|
... | ... | @@ -17,8 +18,13 @@ In Ortho_KB, we integrate functional annotation from EggNOG, MapMan, TRAPID and |
|
|
## EggNOG
|
|
|
|
|
|
First, install emapper, which can be done for example with conda:
|
|
|
|
|
|
`conda create -n emapper -c bioconda eggnog-mapper`
|
|
|
|
|
|
Then create the eggNOG database which contains ortholog groups and the functional annotation. In our case, we will chose the taxonomic group Viridiplantae, whose code is 33090 (see [http://eggnog5.embl.de/#/app/downloads](http://eggnog5.embl.de/#/app/downloads))
|
|
|
|
|
|
`download_eggnog_data.py --data_dir eggnog_db -F -P -M -H --dbname 33090`
|
|
|
|
|
|
Finally, the emapper program can be run to annotate each genome:
|
|
|
|
|
|
`emapper.py --cpu 10 -i $input --output ${prefix}.tsv -m diamond --data_dir eggnog_db --database 33090` |
|
|
\ No newline at end of file |