Descriptive information about these mouse, human and termite metagenomes
can be found in the GOLD database under Gm00071, Gm00052, Gm00013 GOLD IDs, respectively. Within IMG/M the “”Compare Genomes”" tool was chosen to extract COG and Pfam protein profiles from the swine, mouse, human, and termite gut microbiomes. These profiles were then normalized for sequencing coverage by calculating the percent distribution, prior to downstream statistical analysis. To find over-abundant or unique functions to a given metagenomic dataset, a two-way Crizotinib datasheet hierarchical clustering of normalized COG and Pfam abundances was performed using the Bioinformatics Toolbox with Matlab SB273005 molecular weight version 2009a. Additionally, to determine if unique or overabundant functions were statistically meaningful, the binomial test within the Shotgun FunctionalizeR program was employed [38]. The GS20 and FLX pig fecal datasets were also compared against gut metagenomes available within the MG-RAST metagenomic annotation pipeline. The two pig fecal metagnonomic datasets were compared against the following MG-RAST metagenomic projects: cow rumen (Cow Rumen Project: 444168.3), chicken cecum (FS-CAP
Project:4440285.3), human infant subjects In-A, In-B, In-D, In-E, In-M and In-R (Human Faeces Projects: 4440946.3, 4440945.3, 4440948.3, 4440950.3, 4440949.3, 4440951.3), human adult subjects F1-S, F1-T, F1-U, F2-V, F2-W, F2-X,
and F2-Y (Human Faeces Projects: 4440939.9, 4440941.3, 4440940.3, 4440942.3, LOXO-101 concentration 4440943.3, 4440944.3, and 4440947.3), healthy fish gut (Fish Gut Project: 4441695.3), and lean mouse cecum (Human Faeces Project: 4440463.3). Within MG-RAST, phylogenetic information was extracted from these gut metagenomes using RDP [31], SILVA SSU [32], and Greengenes[33] databases (e-value less than 1 × 10-5 and a sequence match length greater than 50 nucleotides). These taxonomic profiles were then normalized for differences in sequencing coverage by calculating percent distribution, Epigenetics inhibitor prior to downstream statistical analysis. A non-parametric Wilcoxon exact test was used to statistically compare the taxonomic composition in any two metagenomes. Additionally, within MG-RAST, the functional annotations (hits to SEED Subsystems) were extracted (e-value less than 1 × 10-5 and a sequence match length greater than 50 nucleotides) to compare functional attributes across these gut metagenomes. In order to identify statistically significant and biologically meaningful differences between the swine gut and other endiobiotic microbiomes, we employed the two-way Fisher’s exact test with a Benjamin-Hochberg FDR multiple test correction within STAMP v1.0.