Reference databases
GS-24-prok
The GS-24-prok database (default) is a 2024 gold standard (GS) set of reference proteins from only Bacteria and Archaea, with experimentally annotated molecular functions (confirmed E.C. annotations). It includes 4,577 manually curated protein sequences and 2,159 E.C. annotations.
GS-24-bac
The GS-24-bac database is a 2024 gold standard set of reference proteins from only Bacteria, with experimentally annotated molecular functions (confirmed E.C. annotations). It includes 4,076 manually curated protein sequences and 2,021 E.C. annotations.
GS-24-all
The GS-24-all database is a 2024 gold standard set of reference proteins from Eukaryota, Archaea, Bacteria and Viruses, with experimentally annotated molecular functions (confirmed E.C. annotations). It includes 12,491 manually curated protein sequences and 3,686 E.C. annotations.
GS-21-bac
The GS-21-bac database is a 2021 gold standard set of reference proteins from only Bacteria, with experimentally annotated molecular functions (confirmed E.C. annotations). It includes 7,288 manually curated protein sequences and 1,882 E.C. annotations.
GS-21-all
The GS-21-all database is a 2021 gold standard set of reference proteins from Eukaryota, Archaea, Bacteria and Viruses, with experimentally annotated molecular functions (confirmed E.C. annotations). It includes 15,524 manually curated protein sequences and 2,716 E.C. annotations.
GS
The gold standard (GS) database is the original mi-faser database. It includes 2,810 reference proteins and 1,257 experimentally annotated molecular functions (confirmed E.C. annotations).
GS+
The GS+ database is an expansion of the original GS database. It includes an additional 55 manually curated protein sequences, introducing 28 new E.C.s that represent important microbial functions in the environment.