EukHeist: Recovering Eukaryotic MAGs from Metagenomes

EukHeist: Recovering Eukaryotic MAGs from Metagenomes

Eukaryotic microbes, or protists, are key drivers of marine ecosystems, with roles ranging from primary producers (phytoplankton) to consumers (heterotrophs or mixotrophs). As with their bacteria and archaea, many protists are difficult or impossible to culture. This has limited our ability to directly interrogate their biology in the laboratory and has led us to underestimate their diversity across marine ecosystems. Molecular and genomic approaches, particularly those applied to whole, mixed communities (e.g. metagenomics, metatranscriptomics), have shed light on the ecological roles, evolutionary histories, and physiological capabilities of these organisms.

EukHeist presents a scalable and reproducible bioinformatic pipeline to facilitate the retrieval of eukaryotic metagenome assembled genomes (MAGs) from mixed metagenomes, called EUKHeist (https://github.com/AlexanderLabWHOI/EUKHeist). EUKHeist streamlines and automates the discovery, recovery, quality assessment, and analysis of eukaryotic metagenome assembled genomes (MAGs) from mixed community metagenomes. EukHeist incorporates taxonoßßmic annotation through EUKulele (https://github.com/AlexanderLabWHOI/EUKulele) as well as scalable structural and functional annotation of coding regions in eukaryotic MAGs with EukMetaSanity (https://github.com/cjneely10/EukMetaSanity).

Project repository: https://github.com/AlexanderLabWHOI/EUKHeist

Funding for this project provided by the Simons Foundation and WHOI