Table of Content
I.	Project Summary
II.	Workflow Checklist
III.	NGS Sequencing Results
IV.	Complete Report Download
V.	Raw Sequence Data Download
VI.	Analysis - DADA2 Read Processing
	Sample Meta Info
	Read Count by Sample
VII.	Analysis - Read Taxonomy Assignment
	Taxonomy Barplots
VIII.	Analysis - Alpha Diversity
IX.	Analysis - Beta Diversity
X.	Analysis - Differential Abundance
	ANCOM Result
	LEfSe Result
XI.	Analysis - Heatmap Profile
XII.	Analysis - Network Association
XIII.	Disclaimer

(Click to navigate)

FOMC Service Report

16S rRNA Gene V1V3 Amplicon Sequencing

Version V1.43

Version History

The Forsyth Institute, Cambridge, MA, USA

October 09, 2023

Project ID: 20231003_dada2_trim

I. Project Summary

Project 20231003_dada2_trim services include NGS sequencing of the V1V3 region of the 16S rRNA gene amplicons from the samples. First and foremost, please download this report, as well as the sequence raw data from the download links provided below. These links will expire after 60 days. We cannot guarantee the availability of your data after 60 days.

Full Bioinformatics analysis service was requested. We provide many analyses, starting from the raw sequence quality and noise filtering, pair reads merging, as well as chimera filtering for the sequences, using the DADA2 denosing algorithm and pipeline.

We also provide many downstream analyses such as taxonomy assignment, alpha and beta diversity analyses, and differential abundance analysis.

For taxonomy assignment, most informative would be the taxonomy barplots. We provide an interactive barplots to show the relative abundance of microbes at different taxonomy levels (from Phylum to species) that you can choose.

If you specify which groups of samples you want to compare for differential abundance, we provide both ANCOM and LEfSe differential abundance analysis.

II. Workflow Checklist

☑	1.	Sample Received
☑	2.	Sample Quality Evaluated
☑	3.	Sample Prepared for Sequencing
☑	4.	Next-Gen Sequencing
☑	5.	Sequence Quality Check
☑	6.	Absolute Abundance
☑	7.	Report and Raw Sequence Data Available for Download
☑	8.	Bioinformatics Analysis - Reads Processing (DADA2 Quality Trimming, Denoising, Paired Reads Merging)
☑	9.	Bioinformatics Analysis - Reads Taxonomy Assignment
☑	10.	Bioinformatics Analysis - Alpha Diversity Analysis
☑	11.	Bioinformatics Analysis - Beta Diversity Analysis
☑	12.	Bioinformatics Analysis - Differential Abundance Analysis
☑	13.	Bioinformatics Analysis - Heatmap Profile
☑	14.	Bioinformatics Analysis - Network Association

III. NGS Sequencing

The samples were processed and analyzed with the ZymoBIOMICS® Service: Targeted Metagenomic Sequencing (Zymo Research, Irvine, CA).

DNA Extraction: If DNA extraction was performed, one of three different DNA extraction kits was used depending on the sample type and sample volume and were used according to the manufacturer’s instructions, unless otherwise stated. The kit used in this project is marked below:

☐	ZymoBIOMICS® DNA Miniprep Kit (Zymo Research, Irvine, CA)
☐	ZymoBIOMICS® DNA Microprep Kit (Zymo Research, Irvine, CA)
☐	ZymoBIOMICS®-96 MagBead DNA Kit (Zymo Research, Irvine, CA)
☑	N/A (DNA Extraction Not Performed)
Elution Volume: 50µL
Additional Notes: NA

Targeted Library Preparation: The DNA samples were prepared for targeted sequencing with the Quick-16S™ NGS Library Prep Kit (Zymo Research, Irvine, CA). These primers were custom designed by Zymo Research to provide the best coverage of the 16S gene while maintaining high sensitivity. The primer sets used in this project are marked below:

☐	Quick-16S™ Primer Set V1-V2 (Zymo Research, Irvine, CA)
☑	Quick-16S™ Primer Set V1-V3 (Zymo Research, Irvine, CA)
☐	Quick-16S™ Primer Set V3-V4 (Zymo Research, Irvine, CA)
☐	Quick-16S™ Primer Set V4 (Zymo Research, Irvine, CA)
☐	Quick-16S™ Primer Set V6-V8 (Zymo Research, Irvine, CA)
☐	Other: NA
Additional Notes: NA

The sequencing library was prepared using an innovative library preparation process in which PCR reactions were performed in real-time PCR machines to control cycles and therefore limit PCR chimera formation. The final PCR products were quantified with qPCR fluorescence readings and pooled together based on equal molarity. The final pooled library was cleaned up with the Select-a-Size DNA Clean & Concentrator™ (Zymo Research, Irvine, CA), then quantified with TapeStation® (Agilent Technologies, Santa Clara, CA) and Qubit® (Thermo Fisher Scientific, Waltham, WA).

Control Samples: The ZymoBIOMICS® Microbial Community Standard (Zymo Research, Irvine, CA) was used as a positive control for each DNA extraction, if performed. The ZymoBIOMICS® Microbial Community DNA Standard (Zymo Research, Irvine, CA) was used as a positive control for each targeted library preparation. Negative controls (i.e. blank extraction control, blank library preparation control) were included to assess the level of bioburden carried by the wet-lab process.

Sequencing: The final library was sequenced on Illumina® MiSeq™ with a V3 reagent kit (600 cycles). The sequencing was performed with 10% PhiX spike-in.

Absolute Abundance Quantification*: A quantitative real-time PCR was set up with a standard curve. The standard curve was made with plasmid DNA containing one copy of the 16S gene and one copy of the fungal ITS2 region prepared in 10-fold serial dilutions. The primers used were the same as those used in Targeted Library Preparation. The equation generated by the plasmid DNA standard curve was used to calculate the number of gene copies in the reaction for each sample. The PCR input volume (2 µl) was used to calculate the number of gene copies per microliter in each DNA sample.
The number of genome copies per microliter DNA sample was calculated by dividing the gene copy number by an assumed number of gene copies per genome. The value used for 16S copies per genome is 4. The value used for ITS copies per genome is 200. The amount of DNA per microliter DNA sample was calculated using an assumed genome size of 4.64 x 10⁶ bp, the genome size of Escherichia coli, for 16S samples, or an assumed genome size of 1.20 x 10⁷ bp, the genome size of Saccharomyces cerevisiae, for ITS samples. This calculation is shown below:

Calculated Total DNA = Calculated Total Genome Copies × Assumed Genome Size (4.64 × 10⁶ bp) ×
Average Molecular Weight of a DNA bp (660 g/mole/bp) ÷ Avogadro’s Number (6.022 x 10²³/mole)

* Absolute Abundance Quantification is only available for 16S and ITS analyses.

The absolute abundance standard curve data can be viewed in Excel here:

The absolute abundance standard curve is shown below:

Absolute Abundance Standard Curve

IV. Complete Report Download

The complete report of your project, including all links in this report, can be downloaded by clicking the link provided below. The downloaded file is a compressed ZIP file and once unzipped, open the file “REPORT.html” (may only shown as "REPORT" in your computer) by double clicking it. Your default web browser will open it and you will see the exact content of this report.

Please download and save the file to your computer storage device. The download link will expire after 60 days upon your receiving of this report.

Complete report download link:

To view the report, please follow the following steps:

1. Download the .zip file from the report link above.

2. Extract all the contents of the downloaded .zip file to your desktop.

3. Open the extracted folder and find the "REPORT.html" (may shown as only "REPORT").

4. Open (double-clicking) the REPORT.html file. Your default browser will open the top age of the complete report. Within the report, there are links to view all the analyses performed for the project.

V. Raw Sequence Data Download

The raw NGS sequence data is available for download with the link provided below. The data is a compressed ZIP file and can be unzipped to individual sequence files. Since this is a pair-end sequencing, each of your samples is represented by two sequence files, one for READ 1, with the file extension “*_R1.fastq.gz”, another READ 2, with the file extension “*_R1.fastq.gz”. The files are in FASTQ format and are compressed. FASTQ format is a text-based data format for storing both a biological sequence and its corresponding quality scores. Most sequence analysis software will be able to open them. The Sample IDs associated with the R1 and R2 fastq files are listed in the table below:


Sample ID Original Sample ID Read 1 File Name Read 2 File Name
F12829.S10 original sample ID here zr12829_10V1V3_R1.fastq.gz zr12829_10V1V3_R2.fastq.gz
F12829.S11 original sample ID here zr12829_11V1V3_R1.fastq.gz zr12829_11V1V3_R2.fastq.gz
F12829.S12 original sample ID here zr12829_12V1V3_R1.fastq.gz zr12829_12V1V3_R2.fastq.gz
F12829.S13 original sample ID here zr12829_13V1V3_R1.fastq.gz zr12829_13V1V3_R2.fastq.gz
F12829.S14 original sample ID here zr12829_14V1V3_R1.fastq.gz zr12829_14V1V3_R2.fastq.gz
F12829.S15 original sample ID here zr12829_15V1V3_R1.fastq.gz zr12829_15V1V3_R2.fastq.gz
F12829.S16 original sample ID here zr12829_16V1V3_R1.fastq.gz zr12829_16V1V3_R2.fastq.gz
F12829.S17 original sample ID here zr12829_17V1V3_R1.fastq.gz zr12829_17V1V3_R2.fastq.gz
F12829.S18 original sample ID here zr12829_18V1V3_R1.fastq.gz zr12829_18V1V3_R2.fastq.gz
F12829.S19 original sample ID here zr12829_19V1V3_R1.fastq.gz zr12829_19V1V3_R2.fastq.gz
F12829.S01 original sample ID here zr12829_1V1V3_R1.fastq.gz zr12829_1V1V3_R2.fastq.gz
F12829.S20 original sample ID here zr12829_20V1V3_R1.fastq.gz zr12829_20V1V3_R2.fastq.gz
F12829.S21 original sample ID here zr12829_21V1V3_R1.fastq.gz zr12829_21V1V3_R2.fastq.gz
F12829.S22 original sample ID here zr12829_22V1V3_R1.fastq.gz zr12829_22V1V3_R2.fastq.gz
F12829.S23 original sample ID here zr12829_23V1V3_R1.fastq.gz zr12829_23V1V3_R2.fastq.gz
F12829.S24 original sample ID here zr12829_24V1V3_R1.fastq.gz zr12829_24V1V3_R2.fastq.gz
F12829.S25 original sample ID here zr12829_25V1V3_R1.fastq.gz zr12829_25V1V3_R2.fastq.gz
F12829.S26 original sample ID here zr12829_26V1V3_R1.fastq.gz zr12829_26V1V3_R2.fastq.gz
F12829.S27 original sample ID here zr12829_27V1V3_R1.fastq.gz zr12829_27V1V3_R2.fastq.gz
F12829.S28 original sample ID here zr12829_28V1V3_R1.fastq.gz zr12829_28V1V3_R2.fastq.gz
F12829.S29 original sample ID here zr12829_29V1V3_R1.fastq.gz zr12829_29V1V3_R2.fastq.gz
F12829.S02 original sample ID here zr12829_2V1V3_R1.fastq.gz zr12829_2V1V3_R2.fastq.gz
F12829.S30 original sample ID here zr12829_30V1V3_R1.fastq.gz zr12829_30V1V3_R2.fastq.gz
F12829.S31 original sample ID here zr12829_31V1V3_R1.fastq.gz zr12829_31V1V3_R2.fastq.gz
F12829.S32 original sample ID here zr12829_32V1V3_R1.fastq.gz zr12829_32V1V3_R2.fastq.gz
F12829.S33 original sample ID here zr12829_33V1V3_R1.fastq.gz zr12829_33V1V3_R2.fastq.gz
F12829.S34 original sample ID here zr12829_34V1V3_R1.fastq.gz zr12829_34V1V3_R2.fastq.gz
F12829.S35 original sample ID here zr12829_35V1V3_R1.fastq.gz zr12829_35V1V3_R2.fastq.gz
F12829.S36 original sample ID here zr12829_36V1V3_R1.fastq.gz zr12829_36V1V3_R2.fastq.gz
F12829.S03 original sample ID here zr12829_3V1V3_R1.fastq.gz zr12829_3V1V3_R2.fastq.gz
F12829.S04 original sample ID here zr12829_4V1V3_R1.fastq.gz zr12829_4V1V3_R2.fastq.gz
F12829.S05 original sample ID here zr12829_5V1V3_R1.fastq.gz zr12829_5V1V3_R2.fastq.gz
F12829.S06 original sample ID here zr12829_6V1V3_R1.fastq.gz zr12829_6V1V3_R2.fastq.gz
F12829.S07 original sample ID here zr12829_7V1V3_R1.fastq.gz zr12829_7V1V3_R2.fastq.gz
F12829.S08 original sample ID here zr12829_8V1V3_R1.fastq.gz zr12829_8V1V3_R2.fastq.gz
F12829.S09 original sample ID here zr12829_9V1V3_R1.fastq.gz zr12829_9V1V3_R2.fastq.gz

Please download and save the file to your computer storage device. The download link will expire after 60 days upon your receiving of this report.

Raw sequence data download link:

VI. Analysis - DADA2 Read Processing

What is DADA2?

DADA2 is a software package that models and corrects Illumina-sequenced amplicon errors. DADA2 infers sample sequences exactly, without coarse-graining into OTUs, and resolves differences of as little as one nucleotide. DADA2 identified more real variants and output fewer spurious sequences than other methods.

DADA2’s advantage is that it uses more of the data. The DADA2 error model incorporates quality information, which is ignored by all other methods after filtering. The DADA2 error model incorporates quantitative abundances, whereas most other methods use abundance ranks if they use abundance at all. The DADA2 error model identifies the differences between sequences, eg. A->C, whereas other methods merely count the mismatches. DADA2 can parameterize its error model from the data itself, rather than relying on previous datasets that may or may not reflect the PCR and sequencing protocols used in your study.

DADA2 Publication: Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJ, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016 Jul;13(7):581-3. doi: 10.1038/nmeth.3869. Epub 2016 May 23. PMID: 27214047; PMCID: PMC4927377.

DADA2 Software Package is available as an R package at : https://benjjneb.github.io/dada2/index.html

Analysis Procedures:

DADA2 pipeline includes several tools for read quality control, including quality filtering, trimming, denoising, pair merging and chimera filtering. Below are the major processing steps of DADA2:

Step 1. Read trimming based on sequence quality The quality of NGS Illumina sequences often decreases toward the end of the reads. DADA2 allows to trim off the poor quality read ends in order to improve the error model building and pair mergicing performance.

Step 2. Learn the Error Rates The DADA2 algorithm makes use of a parametric error model (err) and every amplicon dataset has a different set of error rates. The learnErrors method learns this error model from the data, by alternating estimation of the error rates and inference of sample composition until they converge on a jointly consistent solution. As in many machine-learning problems, the algorithm must begin with an initial guess, for which the maximum possible error rates in this data are used (the error rates if only the most abundant sequence is correct and all the rest are errors).

Step 3. Infer amplicon sequence variants (ASVs) based on the error model built in previous step. This step is also called sequence "denoising". The outcome of this step is a list of ASVs that are the equivalent of oligonucleotides.

Step 4. Merge paired reads. If the sequencing products are read pairs, DADA2 will merge the R1 and R2 ASVs into single sequences. Merging is performed by aligning the denoised forward reads with the reverse-complement of the corresponding denoised reverse reads, and then constructing the merged “contig” sequences. By default, merged sequences are only output if the forward and reverse reads overlap by at least 12 bases, and are identical to each other in the overlap region (but these conditions can be changed via function arguments).

Step 5. Remove chimera. The core dada method corrects substitution and indel errors, but chimeras remain. Fortunately, the accuracy of sequence variants after denoising makes identifying chimeric ASVs simpler than when dealing with fuzzy OTUs. Chimeric sequences are identified if they can be exactly reconstructed by combining a left-segment and a right-segment from two more abundant “parent” sequences. The frequency of chimeric sequences varies substantially from dataset to dataset, and depends on on factors including experimental procedures and sample complexity.

Results

1. Read Quality Plots NGS sequence analaysis starts with visualizing the quality of the sequencing. Below are the quality plots of the first sample for the R1 and R2 reads separately. In gray-scale is a heat map of the frequency of each quality score at each base position. The mean quality score at each position is shown by the green line, and the quartiles of the quality score distribution by the orange lines. The forward reads are usually of better quality. It is a common practice to trim the last few nucleotides to avoid less well-controlled errors that can arise there. The trimming affects the downstream steps including error model building, merging and chimera calling. FOMC uses an empirical approach to test many combinations of different trim length in order to achieve best final amplicon sequence variants (ASVs), see the next section “Optimal trim length for ASVs”.

Quality plots for all samples:

quality_plots_21-36.pdf

quality_plots_1-20.pdf

2. Optimal trim length for ASVs The final number of merged and chimera-filtered ASVs depends on the quality filtering (hence trimming) in the very beginning of the DADA2 pipeline. In order to achieve highest number of ASVs, an empirical approach was used -

Create a random subset of each sample consisting of 5,000 R1 and 5,000 R2 (to reduce computation time)
Trim 10 bases at a time from the ends of both R1 and R2 up to 50 bases
For each combination of trimmed length (e.g., 300x300, 300x290, 290x290 etc), the trimmed reads are subject to the entire DADA2 pipeline for chimera-filtered merged ASVs
The combination with highest percentage of the input reads becoming final ASVs is selected for the complete set of data

Below is the result of such operation, showing ASV percentages of total reads for all trimming combinations (1st Column = R1 lengths in bases; 1st Row = R2 lengths in bases):

R1/R2	281	271	261	251	241	231
321	65.99%	66.44%	66.90%	67.41%	63.61%	56.88%
311	66.32%	66.78%	67.27%	63.36%	55.06%	38.76%
301	66.30%	66.78%	62.80%	54.41%	37.23%	19.88%
291	66.31%	62.38%	53.89%	36.65%	19.13%	9.72%
281	62.19%	53.67%	36.50%	18.75%	9.35%	7.27%
271	53.42%	36.66%	18.76%	8.99%	6.97%	2.12%

Based on the above result, the trim length combination of R1 = 321 bases and R2 = 251 bases (highlighted red above), was chosen for generating final ASVs for all sequences. This combination generated highest number of merged non-chimeric ASVs and was used for downstream analyses, if requested.

3. Error plots from learning the error rates After DADA2 building the error model for the set of data, it is always worthwhile, as a sanity check if nothing else, to visualize the estimated error rates. The error rates for each possible transition (A→C, A→G, …) are shown below. Points are the observed error rates for each consensus quality score. The black line shows the estimated error rates after convergence of the machine-learning algorithm. The red line shows the error rates expected under the nominal definition of the Q-score. The ideal result would be the estimated error rates (black line) are a good fit to the observed rates (points), and the error rates drop with increased quality as expected.

Forward Read R1 Error Plot

Reverse Read R2 Error Plot

The PDF version of these plots are available here:

4. DADA2 Result Summary The table below shows the summary of the DADA2 analysis, tracking paired read counts of each samples for all the steps during DADA2 denoising process - including end-trimming (filtered), denoising (denoisedF, denoisedF), pair merging (merged) and chimera removal (nonchim).

Sample ID	F12829.S01	F12829.S02	F12829.S03	F12829.S04	F12829.S05	F12829.S06	F12829.S07	F12829.S08	F12829.S09	F12829.S10	F12829.S11	F12829.S12	F12829.S13	F12829.S14	F12829.S15	F12829.S16	F12829.S17	F12829.S18	F12829.S19	F12829.S20	F12829.S21	F12829.S22	F12829.S23	F12829.S24	F12829.S25	F12829.S26	F12829.S27	F12829.S28	F12829.S29	F12829.S30	F12829.S31	F12829.S32	F12829.S33	F12829.S34	F12829.S35	F12829.S36	Row Sum	Percentage
input	149,561	142,276	145,265	146,982	152,122	122,929	209,765	247,101	216,437	172,726	168,829	157,995	188,652	161,054	158,976	161,480	164,988	153,076	183,077	265,336	249,126	189,466	142,667	188,331	149,564	152,305	135,151	139,725	135,302	190,298	198,710	294,636	258,515	157,279	127,909	140,066	6,317,677	100.00%
filtered	118,837	112,689	115,822	115,454	120,835	97,144	166,389	196,109	171,583	136,623	133,853	124,580	149,465	127,972	125,623	127,933	130,775	120,859	144,872	210,326	196,456	149,541	113,016	149,394	118,949	120,516	107,168	110,144	106,551	150,951	157,669	234,027	203,872	124,390	101,192	110,582	5,002,161	79.18%
denoisedF	117,684	111,516	114,606	114,796	119,857	96,343	166,101	195,501	171,059	136,258	133,404	124,272	147,867	126,666	124,148	127,050	129,925	120,042	144,547	209,476	195,907	148,902	112,533	148,986	117,586	119,178	105,881	109,434	105,786	149,890	157,407	232,134	203,321	123,868	100,895	110,169	4,972,995	78.72%
denoisedR	116,252	110,386	113,351	113,487	118,510	95,219	164,359	193,247	169,334	134,731	132,126	122,966	146,633	125,622	123,005	125,589	128,705	118,781	143,171	207,918	194,056	147,572	111,447	147,413	116,398	117,883	104,893	108,130	104,626	148,514	155,966	231,003	201,220	122,928	99,750	108,916	4,924,107	77.94%
merged	109,929	104,495	107,161	109,613	113,173	91,070	143,657	186,681	164,196	132,328	129,730	120,880	138,498	118,825	115,433	119,871	123,585	114,039	133,365	201,105	188,464	143,979	108,633	143,786	109,167	110,906	98,147	103,617	100,428	142,617	147,004	224,449	197,228	119,632	97,231	106,317	4,719,239	74.70%
nonchim	101,638	96,402	98,461	103,390	101,142	82,753	142,364	177,594	155,406	115,894	112,506	107,050	128,659	109,798	105,229	108,422	114,525	104,381	129,115	195,997	183,273	123,035	90,129	122,432	97,126	99,884	86,658	94,986	90,988	127,874	140,094	216,540	191,486	99,004	83,128	89,364	4,326,727	68.49%

This table can be downloaded as an Excel table below:

5. DADA2 Amplicon Sequence Variants (ASVs). A total of 2072 unique merged and chimera-free ASV sequences were identified, and their corresponding read counts for each sample are available in the "ASV Read Count Table" with rows for the ASV sequences and columns for sample. This read count table can be used for microbial profile comparison among different samples and the sequences provided in the table can be used to taxonomy assignment.

The table can be downloaded from this link:

Sample Meta Information

Download Sample Meta Information

#SampleID	SampleName	Method	HL	Source	Group
F12829.S01	MA1	Masterpure	T	SUPA	SUPA
F12829.S02	MA2	Masterpure	F	SUPA	SUPA
F12829.S03	MA3	Masterpure	F	SUPA	SUPA
F12829.S04	MB1	Masterpure	T	SUPB	SUPB
F12829.S05	MB2	Masterpure	F	SUPB	SUPB
F12829.S06	MB3	Masterpure	F	SUPB	SUPB
F12829.S07	MOM1	Masterpure	T	OM	OM
F12829.S08	MOM2	Masterpure	F	OM	OM
F12829.S09	MOM3	Masterpure	F	OM	OM
F12829.S10	MZM1	Masterpure	T	ZM	ZM
F12829.S11	MZM2	Masterpure	F	ZM	ZM
F12829.S12	MZM3	Masterpure	F	ZM	ZM
F12829.S13	PA1	PowerSoil	T	SUPA	SUPA
F12829.S14	PA2	PowerSoil	F	SUPA	SUPA
F12829.S15	PA3	PowerSoil	F	SUPA	SUPA
F12829.S16	PB1	PowerSoil	T	SUPB	SUPB
F12829.S17	PB2	PowerSoil	F	SUPB	SUPB
F12829.S18	PB3	PowerSoil	F	SUPB	SUPB
F12829.S19	POM1	PowerSoil	T	OM	OM
F12829.S20	POM2	PowerSoil	F	OM	OM
F12829.S21	POM3	PowerSoil	F	OM	OM
F12829.S22	PZM1	PowerSoil	T	ZM	ZM
F12829.S23	PZM2	PowerSoil	F	ZM	ZM
F12829.S24	PZM3	PowerSoil	F	ZM	ZM
F12829.S25	ZA1	Zymo	T	SUPA	SUPA
F12829.S26	ZA2	Zymo	F	SUPA	SUPA
F12829.S27	ZA3	Zymo	F	SUPA	SUPA
F12829.S28	ZB1	Zymo	T	SUPB	SUPB
F12829.S29	ZB2	Zymo	F	SUPB	SUPB
F12829.S30	ZB3	Zymo	F	SUPB	SUPB
F12829.S31	ZOM1	Zymo	T	OM	OM
F12829.S32	ZOM2	Zymo	F	OM	OM
F12829.S33	ZOM3	Zymo	F	OM	OM
F12829.S34	ZZM1	Zymo	T	ZM	ZM
F12829.S35	ZZM2	Zymo	F	ZM	ZM
F12829.S36	ZZM3	Zymo	F	ZM	ZM

ASV Read Counts by Samples

#Sample ID	Read Count
F12829.S06	82,753
F12829.S35	83,128
F12829.S27	86,658
F12829.S36	89,364
F12829.S23	90,129
F12829.S29	90,988
F12829.S28	94,986
F12829.S02	96,402
F12829.S25	97,126
F12829.S03	98,461
F12829.S34	99,004
F12829.S26	99,884
F12829.S05	101,142
F12829.S01	101,638
F12829.S04	103,390
F12829.S18	104,381
F12829.S15	105,229
F12829.S12	107,050
F12829.S16	108,422
F12829.S14	109,798
F12829.S11	112,506
F12829.S17	114,525
F12829.S10	115,894
F12829.S24	122,432
F12829.S22	123,035
F12829.S30	127,874
F12829.S13	128,659
F12829.S19	129,115
F12829.S31	140,094
F12829.S07	142,364
F12829.S09	155,406
F12829.S08	177,594
F12829.S21	183,273
F12829.S33	191,486
F12829.S20	195,997
F12829.S32	216,540

VII. Analysis - Read Taxonomy Assignment

Read Taxonomy Assignment - Methods

The species-level, open-reference 16S rRNA NGS reads taxonomy assignment pipeline

Version 20210310

1. Raw sequences reads in FASTA format were BLASTN-searched against a combined set of 16S rRNA reference sequences. It consists of MOMD (version 0.1), the HOMD (version 15.2 http://www.homd.org/index.php?name=seqDownload&file&type=R ), HOMD 16S rRNA RefSeq Extended Version 1.1 (EXT), GreenGene Gold (GG) (http://greengenes.lbl.gov/Download/Sequence_Data/Fasta_data_files/gold_strains_gg16S_aligned.fasta.gz) , and the NCBI 16S rRNA reference sequence set (https://ftp.ncbi.nlm.nih.gov/blast/db/16S_ribosomal_RNA.tar.gz). These sequences were screened and combined to remove short sequences (<1000nt), chimera, duplicated and sub-sequences, as well as sequences with poor taxonomy annotation (e.g., without species information). This process resulted in 1,015 from HOMD V15.22, 495 from EXT, 3,940 from GG and 18,044 from NCBI, a total of 25,120 sequences. Altogether these sequence represent a total of 15,601 oral and non-oral microbial species.

The NCBI BLASTN version 2.7.1+ (Zhang et al, 2000) was used with the default parameters. Reads with ≥ 98% sequence identity to the matched reference and ≥ 90% alignment length (i.e., ≥ 90% of the read length that was aligned to the reference and was used to calculate the sequence percent identity) were classified based on the taxonomy of the reference sequence with highest sequence identity. If a read matched with reference sequences representing more than one species with equal percent identity and alignment length, it was subject to chimera checking with USEARCH program version v8.1.1861 (Edgar 2010). Non-chimeric reads with multi-species best hits were considered valid and were assigned with a unique species notation (e.g., spp) denoting unresolvable multiple species.

2. Unassigned reads (i.e., reads with < 98% identity or < 90% alignment length) were pooled together and reads < 200 bases were removed. The remaining reads were subject to the de novo operational taxonomy unit (OTU) calling and chimera checking using the USEARCH program version v8.1.1861 (Edgar 2010). The de novo OTU calling and chimera checking was done using 98% as the sequence identity cutoff, i.e., the species-level OTU. The output of this step produced species-level de novo clustered OTUs with 98% identity. Representative reads from each of the OTUs/species were then BLASTN-searched against the same reference sequence set again to determine the closest species for these potential novel species. These potential novel species were pooled together with the reads that were signed to specie-level in the previous step, for down-stream analyses.

Reference:
Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010 Oct 1;26(19):2460-1. doi: 10.1093/bioinformatics/btq461. Epub 2010 Aug 12. PubMed PMID: 20709691.

3. Designations used in the taxonomy:

	1) Taxonomy levels are indicated by these prefixes:
	
	   k__: domain/kingdom
	   p__: phylum
	   c__: class
	   o__: order
	   f__: family
	   g__: genus  
	   s__: species
	
	   Example: 
	
	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__Blautia;s__faecis
		
	2) Unique level identified – known species:
	   
	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__Roseburia;s__hominis
	
	   The above example shows some reads match to a single species (all levels are unique)
	
	3) Non-unique level identified – known species:

	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__Roseburia;s__multispecies_spp123_3
	   
	   The above example “s__multispecies_spp123_3” indicates certain reads equally match to 3 species of the 
	   genus Roseburia; the “spp123” is a temporally assigned species ID.
	
	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__multigenus;s__multispecies_spp234_5
	   
	   The above example indicates certain reads match equally to 5 different species, which belong to multiple genera.; 
	   the “spp234” is a temporally assigned species ID.
	
	4) Unique level identified – unknown species, potential novel species:
	   
	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__Roseburia;s__ hominis_nov_97%
	   
	   The above example indicates that some reads have no match to any of the reference sequences with 
	   sequence identity ≥ 98% and percent coverage (alignment length)  ≥ 98% as well. However this groups 
	   of reads (actually the representative read from a de novo  OTU) has 96% percent identity to 
	   Roseburia hominis, thus this is a potential novel species, closest to Roseburia hominis. 
	   (But they are not the same species).
	
	5) Multiple level identified – unknown species, potential novel species:
	   k__Bacteria;p__Firmicutes;c__Clostridia;o__Clostridiales;f__Lachnospiraceae;g__Roseburia;s__ multispecies_sppn123_3_nov_96%
	
	   The above example indicates that some reads have no match to any of the reference sequences 
	   with sequence identity ≥ 98% and percent coverage (alignment length)  ≥ 98% as well. 
	   However this groups of reads (actually the representative read from a de novo  OTU) 
	   has 96% percent identity equally to 3 species in Roseburia. Thus this is no single 
	   closest species, instead this group of reads match equally to multiple species at 96%. 
	   Since they have passed chimera check so they represent a novel species. “sppn123” is a 
	   temporary ID for this potential novel species.

4. The taxonomy assignment algorithm is illustrated in this flow char below:

Read Taxonomy Assignment - Result Summary *

Code	Category	MPC=0% (>=1 read)	MPC=0.01%(>=432 reads)
A	Total reads	4,326,727	4,326,727
B	Total assigned reads	4,323,326	4,323,326
C	Assigned reads in species with read count < MPC	0	28,358
D	Assigned reads in samples with read count < 500	0	0
E	Total samples	36	36
F	Samples with reads >= 500	36	36
G	Samples with reads < 500	0	0
H	Total assigned reads used for analysis (B-C-D)	4,323,326	4,294,968
I	Reads assigned to single species	3,519,851	3,498,298
J	Reads assigned to multiple species	406,100	404,238
K	Reads assigned to novel species	397,375	392,432
L	Total number of species	637	220
M	Number of single species	389	193
N	Number of multi-species	27	8
O	Number of novel species	221	19
P	Total unassigned reads	3,401	3,401
Q	Chimeric reads	42	42
R	Reads without BLASTN hits	169	169
S	Others: short, low quality, singletons, etc.	3,190	3,190
	A=B+P=C+D+H+Q+R+S
	E=F+G
	B=C+D+H
	H=I+J+K
	L=M+N+O
	P=Q+R+S

* MPC = Minimal percent (of all assigned reads) read count per species, species with read count < MPC were removed.

* Samples with reads < 500 were removed from downstream analyses.

* The assignment result from MPC=0.1% was used in the downstream analyses.

Read Taxonomy Assignment - ASV Species-Level Read Counts Table

This table shows the read counts for each sample (columns) and each species identified based on the ASV sequences. The downstream analyses were based on this table.

SPID	Taxonomy	F12829.S01	F12829.S02	F12829.S03	F12829.S04	F12829.S05	F12829.S06	F12829.S07	F12829.S08	F12829.S09	F12829.S10	F12829.S11	F12829.S12	F12829.S13	F12829.S14	F12829.S15	F12829.S16	F12829.S17	F12829.S18	F12829.S19	F12829.S20	F12829.S21	F12829.S22	F12829.S23	F12829.S24	F12829.S25	F12829.S26	F12829.S27	F12829.S28	F12829.S29	F12829.S30	F12829.S31	F12829.S32	F12829.S33	F12829.S34	F12829.S35	F12829.S36
SP10	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;gingivalis	590	595	590	912	569	608	0	0	0	0	0	0	743	554	549	1150	1196	881	0	0	0	0	0	0	320	394	363	683	547	790	0	0	0	0	0	0
SP100	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;catoniae	206	159	191	221	192	145	0	0	0	0	0	0	221	184	193	197	210	153	0	0	0	0	0	0	108	170	120	146	105	149	0	0	0	0	0	0
SP102	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Alloprevotella;tannerae	852	617	758	0	0	0	0	6	0	0	0	0	851	583	637	0	0	0	0	0	0	0	0	0	332	430	348	0	0	0	0	0	0	0	0	0
SP104	Bacteria;Firmicutes;Clostridia;Eubacteriales;Lachnospiraceae;Lachnospiraceae_[G-3];bacterium HMT100	136	112	120	188	154	136	0	0	0	0	0	0	160	121	87	206	142	169	0	0	0	0	0	0	90	81	60	107	97	142	0	0	0	0	0	0
SP109	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT472	356	384	350	104	116	59	0	0	0	0	0	0	409	253	330	103	110	78	0	0	0	0	0	0	170	222	143	50	71	88	0	0	0	0	0	0
SP11	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT169	764	674	876	1165	998	850	0	45	6	0	0	0	1244	1109	1103	2137	2313	2357	0	0	0	0	0	0	1179	1126	979	2289	2265	3246	0	101	76	0	0	0
SP110	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;trevisanii	100	111	80	0	0	0	0	0	0	0	0	0	109	85	71	0	0	0	0	0	0	0	0	0	61	88	73	0	0	0	0	0	0	0	0	0
SP111	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Tannerella;serpentiformis	0	29	18	67	71	72	0	0	0	0	0	0	16	22	0	74	77	68	0	0	0	0	0	0	0	0	0	45	36	57	0	0	0	0	0	0
SP114	Bacteria;Actinobacteria;Actinobacteria;Propionibacteriales;Propionibacteriaceae;Arachnia;propionica	135	133	136	43	42	29	0	0	0	0	0	0	335	314	242	81	88	77	0	0	0	0	0	0	248	297	266	85	91	102	0	0	0	0	0	0
SP115	Bacteria;Firmicutes;Clostridia;Clostridiales;Peptostreptococcaceae_[XI];Peptostreptococcaceae_[XI][G-7];[Eubacterium]_yurii_subsps._yurii_&_margaretiae	53	30	27	307	211	252	0	0	0	0	0	0	56	23	32	262	286	232	0	0	0	0	0	0	30	13	29	186	185	270	0	0	0	0	0	0
SP119	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;nigrescens	688	504	626	1008	604	581	0	0	0	0	0	0	603	428	452	789	597	656	0	0	0	0	0	0	301	319	241	475	390	547	0	0	0	0	0	0
SP12	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;nucleatum	3296	2953	3034	1519	1227	1079	10	2113	1447	0	0	0	3627	2645	2860	1546	1448	1156	0	1151	3360	0	0	0	1908	2015	1769	885	848	1231	2	2692	2720	0	0	0
SP120	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;buccalis	694	572	615	499	461	386	0	0	0	0	0	0	904	686	614	521	480	532	0	0	0	0	0	0	423	492	456	417	367	519	0	0	0	0	0	0
SP128	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Bergeyella;sp. HMT900	75	79	80	12	0	6	0	0	0	0	0	0	95	71	65	16	9	5	0	0	0	0	0	0	31	46	26	5	4	7	0	0	0	0	0	0
SP129	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Aggregatibacter;sp. HMT513	265	275	270	0	0	0	0	0	0	0	0	0	394	235	239	0	0	0	0	0	0	0	0	0	173	181	166	0	0	0	0	0	0	0	0	0
SP131	Bacteria;Proteobacteria;Betaproteobacteria;Burkholderiales;Comamonadaceae;Acidovorax;temperans	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	221	416	0	0	0
SP132	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT897	95	105	108	0	0	0	0	0	0	0	0	0	716	433	461	0	0	0	0	0	0	0	0	0	443	477	376	0	0	0	0	0	0	0	0	0
SP133	Bacteria;Bacteroidota;Bacteroidia;Bacteroidales;Bacteroidaceae;Phocaeicola;vulgatus	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	1194	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP135	Bacteria;Proteobacteria;Gammaproteobacteria;Cardiobacteriales;Cardiobacteriaceae;Cardiobacterium;valvarum	17	16	24	100	69	80	0	0	0	0	0	0	29	45	44	83	84	68	0	0	0	0	0	0	5	17	8	44	57	68	0	0	0	0	0	0
SP136	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Stomatobaculum;sp. HMT097	85	54	83	16	19	20	0	0	0	0	0	0	60	70	46	17	21	12	0	0	0	0	0	0	57	78	64	16	0	30	0	0	0	0	0	0
SP14	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;pasteri	1174	1092	1111	46	85	49	0	34	0	0	0	0	1116	965	961	38	122	32	0	0	0	0	0	0	633	748	558	21	7	52	0	0	0	0	0	0
SP143	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT170	88	75	101	279	214	229	0	0	0	0	0	0	163	143	124	702	808	888	0	0	0	0	0	0	152	114	133	934	960	1359	0	0	0	0	0	0
SP144	Bacteria;Firmicutes;Clostridia;Clostridiales;Peptostreptococcaceae_[XI];Peptostreptococcus;stomatis	16	16	18	0	0	0	0	0	0	0	0	0	88	56	99	0	0	0	0	0	0	0	0	0	84	69	82	0	0	0	0	0	0	0	0	0
SP145	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;loescheii	165	172	170	88	105	81	0	0	0	0	0	0	221	203	189	75	108	68	0	0	0	0	0	0	94	81	75	67	60	105	0	0	0	0	0	0
SP146	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;hofstadii	137	150	157	0	0	0	0	0	0	0	0	0	166	156	137	0	0	0	0	0	0	0	0	0	96	74	98	0	0	0	0	0	0	0	0	0
SP148	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;salivarius	41	0	29	631	611	518	0	0	0	0	0	0	166	202	160	3433	3296	3602	0	0	0	0	0	0	140	175	176	2745	2923	3759	0	0	0	0	0	0
SP15	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT221	1280	1167	1290	0	0	0	0	0	0	0	0	0	1539	1379	1502	0	0	0	0	0	0	0	0	0	957	1249	1075	0	0	0	0	0	0	0	0	0
SP150	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;sp. HMT780	625	539	574	0	0	0	0	0	0	0	0	0	477	368	363	0	0	0	0	0	0	0	0	0	219	252	232	0	0	0	0	0	0	0	0	0
SP151	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Lachnoanaerobaculum;gingivalis	0	0	0	194	64	119	0	0	0	0	0	0	0	0	0	107	171	179	0	0	0	0	0	0	0	0	0	62	57	128	0	0	0	0	0	0
SP152	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT417	109	99	122	16	0	8	0	0	0	0	0	0	752	178	146	14	19	0	0	0	0	0	0	0	155	166	166	14	0	22	0	0	0	0	0	0
SP155	Bacteria;Firmicutes;Negativicutes;Selenomonadales;Selenomonadaceae;Selenomonas;sputigena	88	62	93	135	119	74	0	0	0	0	0	0	102	65	46	85	89	75	0	0	0	0	0	0	48	36	36	33	47	68	0	0	0	0	0	0
SP156	Bacteria;Firmicutes;Negativicutes;Veillonellales;Veillonellaceae;Megasphaera;micronuciformis	85	52	81	6	0	2	0	0	0	0	0	0	83	60	33	7	0	0	0	0	0	0	0	0	50	48	34	0	0	0	0	0	0	0	0	0
SP157	Bacteria;Proteobacteria;Gammaproteobacteria;Oceanospirillales;Halomonadaceae;Halomonas;alkaliantarctica	0	0	0	0	0	0	0	118	44	0	0	0	0	0	0	0	0	0	0	258	252	0	0	0	0	0	0	0	0	0	0	363	342	0	0	0
SP158	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT864	234	193	188	0	0	0	0	0	0	0	0	0	250	198	210	0	0	0	0	0	0	0	0	0	105	149	119	0	0	0	0	0	0	0	0	0
SP159	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;gerencseriae	139	136	169	22	21	33	0	0	0	0	0	0	287	331	295	76	91	64	0	0	0	0	0	0	368	314	307	68	77	111	0	0	0	0	0	0
SP161	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-6];bacterium HMT870	237	204	217	61	59	52	0	0	0	0	0	0	172	177	151	53	67	43	0	0	0	0	0	0	124	128	84	42	27	50	0	0	0	0	0	0
SP163	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;sp. HMT204	121	84	112	81	76	67	0	0	0	0	0	0	140	125	70	95	90	84	0	0	0	0	0	0	59	73	60	61	47	104	0	0	0	0	0	0
SP164	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT349	138	168	182	0	0	0	0	0	0	0	0	0	140	106	74	0	0	0	0	0	0	0	0	0	50	115	43	0	0	0	0	0	0	0	0	0
SP165	Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacterales;Enterobacteriaceae;Salmonella;enterica	0	0	0	0	0	0	0	1515	7	25883	25924	23319	0	0	0	0	0	0	0	0	0	14716	10114	18712	0	0	0	0	0	0	0	0	0	9630	6892	8899
SP167	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;salivae	217	200	221	9	9	15	0	0	0	0	0	0	230	235	215	14	8	0	0	0	0	0	0	0	93	108	109	0	0	10	0	0	0	0	0	0
SP168	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;mitis	3249	4848	4246	2103	2636	2040	0	0	0	0	0	0	4485	5017	4718	2225	2368	2504	0	0	0	0	0	0	4805	5357	4603	2032	2275	2971	0	0	0	0	0	0
SP169	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Haemophilus;haemolyticus	373	283	338	35	13	18	0	0	0	0	0	0	376	312	326	34	39	26	0	0	0	0	0	0	176	241	173	17	12	28	0	0	0	0	0	0
SP17	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Stomatobaculum;longum	214	216	259	0	8	0	0	0	0	0	0	0	239	209	167	7	0	17	0	0	0	0	0	0	228	266	200	10	16	9	0	0	0	0	0	0
SP171	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Tannerella;sp. HMT286	127	133	133	0	0	0	0	0	0	0	0	0	226	110	121	0	0	0	0	0	0	0	0	0	90	90	74	0	0	0	0	0	0	0	0	0
SP174	Bacteria;Proteobacteria;Epsilonproteobacteria;Campylobacterales;Campylobacteraceae;Campylobacter;showae	104	96	109	503	453	340	0	0	0	0	0	0	139	121	101	499	498	337	0	0	0	0	0	0	39	62	60	314	271	338	0	0	0	0	0	0
SP175	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;nucleatum_subsp._animalis	678	590	675	93	151	171	0	0	0	0	0	0	721	589	589	95	147	116	0	0	0	0	0	0	395	506	389	97	90	117	0	0	0	0	0	0
SP178	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;atypica	52	54	38	84	56	77	0	0	0	0	0	0	37	40	29	52	76	51	0	0	0	0	0	0	24	25	0	32	42	55	0	0	0	0	0	0
SP18	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Alloprevotella;sp. HMT308	69	53	80	255	187	180	0	0	0	0	0	0	69	50	60	215	216	169	0	0	0	0	0	0	42	39	29	121	118	179	0	0	0	0	0	0
SP181	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Eikenella;corrodens	130	121	142	186	114	124	0	0	0	0	0	0	184	134	161	184	155	117	0	0	0	0	0	0	81	95	82	119	90	141	0	0	0	0	0	0
SP183	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT475	0	0	0	254	303	183	0	0	0	0	0	0	0	0	0	184	208	173	0	0	0	0	0	0	0	0	0	73	100	142	0	0	0	0	0	0
SP185	Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacterales;Enterobacteriaceae;Shigella;boydii	0	0	0	0	0	0	0	187	0	3368	3336	3294	0	0	0	0	0	0	0	0	0	2125	1428	2554	0	0	0	0	0	0	0	0	0	1285	1025	1243
SP188	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;wadei	1972	1657	1841	428	398	308	0	0	19	0	0	0	2247	2194	2206	394	396	387	0	0	0	0	0	0	1362	1733	1802	301	245	463	0	0	0	0	0	0
SP19	Bacteria;Actinobacteria;Actinomycetia;Corynebacteriales;Corynebacteriaceae;Corynebacterium;matruchotii	2861	2690	2694	74	56	60	0	568	406	0	0	0	5551	4386	4223	154	204	74	0	254	0	0	0	0	2745	2869	2677	77	30	144	0	269	394	0	0	0
SP190	Bacteria;Firmicutes;Bacilli;Bacillales;Bacillaceae;Bacillus;halotolerans	0	0	0	0	0	0	0	1704	44	42576	42180	40588	0	0	0	0	0	0	0	0	0	23586	18316	28223	0	0	0	0	0	0	0	0	0	18733	15065	17532
SP198	Bacteria;Proteobacteria;Betaproteobacteria;Burkholderiales;Burkholderiaceae;Ralstonia;pickettii	0	0	0	0	0	0	0	0	28	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	510	347	0	0	0
SP199	Bacteria;Proteobacteria;Betaproteobacteria;Burkholderiales;Comamonadaceae;Ottowia;sp. HMT894	143	139	153	789	493	548	0	0	0	0	0	0	227	242	197	621	703	521	0	0	0	0	0	0	87	124	91	387	312	534	0	0	0	0	0	0
SP2	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;leadbetteri	561	521	547	5059	4221	3492	0	0	0	0	0	0	738	578	584	4515	4463	3540	0	0	0	0	0	0	292	445	279	2675	2152	3030	0	0	0	0	0	0
SP200	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Alloprevotella;sp. HMT914	17	6	13	79	62	55	0	0	0	0	0	0	0	7	7	70	57	47	0	0	0	0	0	0	0	10	7	30	35	44	0	0	0	0	0	0
SP201	Bacteria;Firmicutes;Negativicutes;Selenomonadales;Selenomonadaceae;Selenomonas;artemidis	531	449	470	282	239	231	0	0	0	0	0	0	540	310	382	237	238	207	0	0	0	0	0	0	255	282	208	148	98	135	0	0	0	0	0	0
SP203	Bacteria;Firmicutes;Bacilli;Lactobacillales;Aerococcaceae;Abiotrophia;defectiva	61	62	51	1438	1689	1376	0	0	0	0	0	0	90	81	79	2081	2635	2498	0	0	0	0	0	0	125	84	81	3409	3646	4975	0	0	0	0	0	0
SP204	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Aggregatibacter;segnis	368	307	378	0	0	0	0	0	0	0	0	0	413	333	306	0	0	0	0	0	0	0	0	0	199	264	230	0	0	0	0	0	0	0	0	0
SP205	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;mutans	34	34	40	0	0	0	0	141	20	0	0	0	395	370	388	0	0	0	0	105	132	0	0	0	401	378	379	0	0	0	0	535	1585	0	0	0
SP207	Bacteria;Firmicutes;Negativicutes;Veillonellales;Veillonellaceae;Anaeroglobus;geminatus	19	16	0	66	50	43	0	0	0	0	0	0	13	8	22	61	42	42	0	0	0	0	0	0	0	5	0	18	26	39	0	0	0	0	0	0
SP21	Bacteria;Proteobacteria;Alphaproteobacteria;Hyphomicrobiales;Phyllobacteriaceae;Phyllobacterium;myrsinacearum	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	575	339	0	0	0
SP211	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;oris	264	254	279	698	537	541	0	0	11	0	0	0	520	563	441	1795	2153	2175	0	0	0	0	0	0	541	522	515	2314	2319	3404	0	0	0	0	0	0
SP217	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;georgiae	28	26	29	9	4	0	0	44	0	0	0	0	50	63	46	10	20	17	0	0	0	0	0	0	60	71	64	14	23	27	0	0	0	0	0	0
SP22	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT212	198	167	180	2890	3007	2475	0	0	0	0	0	0	251	270	217	2711	2889	2078	0	0	0	0	0	0	133	151	115	1637	1475	2324	0	0	0	0	0	0
SP221	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;sp. HMT180	1591	1615	1626	204	196	117	0	0	0	0	0	0	2286	2190	1848	241	245	260	0	0	0	0	0	0	2525	2386	2036	269	261	405	0	0	0	0	0	0
SP224	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT414	135	86	111	0	0	0	0	0	0	0	0	0	463	364	399	0	0	0	0	0	0	0	0	0	329	309	320	0	0	0	0	0	0	0	0	0
SP227	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;gordonii	3058	2090	3065	1783	1726	1876	0	51	0	0	0	0	3191	3373	3055	4599	5128	5238	0	0	0	0	0	0	3220	3172	2907	4905	4948	7111	0	0	0	0	0	0
SP229	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT392	226	205	202	1269	1384	1069	0	0	24	0	0	0	212	276	178	1439	1423	1111	0	0	0	0	0	0	97	150	123	805	692	1075	0	0	0	0	0	0
SP23	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT171	121	119	133	113	106	76	0	0	0	0	0	0	221	209	227	279	276	341	0	0	0	0	0	0	265	147	205	391	380	513	0	0	0	0	0	0
SP230	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria;flava	233	191	212	2282	1386	1803	0	0	0	0	0	0	236	164	169	3291	3334	2449	0	0	0	0	0	0	91	136	99	1838	1746	2235	0	0	0	0	0	0
SP231	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;sp. HMT178	243	281	243	5	0	0	0	24	0	0	0	0	703	689	563	14	23	23	0	0	0	0	0	0	824	721	766	10	6	12	0	0	0	0	0	0
SP234	Bacteria;Actinobacteria;Coriobacteriia;Coriobacteriales;Atopobiaceae;Lancefieldella;parvula	67	44	53	121	127	119	0	0	0	0	0	0	26	40	14	45	59	71	0	0	0	0	0	0	29	23	18	51	61	99	0	0	0	0	0	0
SP236	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Haemophilus;parainfluenzae	649	620	622	129	77	71	0	0	0	0	0	0	773	631	695	109	104	92	0	0	0	0	0	0	469	470	385	57	51	102	0	0	0	0	0	0
SP238	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria;elongata	338	395	369	0	0	0	0	0	0	0	0	0	566	525	369	0	0	0	0	0	0	0	0	0	202	293	206	0	0	0	0	0	0	0	0	0
SP239	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;vestibularis	0	31	22	0	0	0	0	0	0	0	0	0	132	117	101	0	0	0	0	0	0	0	0	0	91	162	99	0	0	0	0	0	0	0	0	0
SP245	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT225	0	0	0	386	453	371	0	0	0	0	0	0	0	0	4	369	353	300	0	0	0	0	0	0	0	5	0	210	210	289	0	0	0	0	0	0
SP247	Bacteria;Spirochaetes;Spirochaetia;Spirochaetales;Treponemataceae;Treponema;sp. HMT237	76	66	60	0	0	0	0	0	0	0	0	0	69	54	41	0	0	0	0	0	0	0	0	0	30	35	34	0	0	0	0	0	0	0	0	0
SP25	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT175	887	810	919	92	94	59	0	50	0	0	0	0	1189	1029	949	233	243	330	0	0	0	0	0	0	1112	1037	868	299	333	493	0	0	0	0	0	0
SP250	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;constellatus	15	0	16	47	41	32	0	0	0	0	0	0	90	72	59	116	137	164	0	0	0	0	0	0	67	70	50	140	115	216	0	0	0	0	0	0
SP257	Bacteria;Actinobacteria;Actinomycetia;Micrococcales;Micrococcaceae;Rothia;mucilaginosa	240	402	275	31	50	41	0	0	0	0	0	0	284	245	250	50	54	71	0	0	0	0	0	0	332	378	337	95	127	125	0	0	0	0	0	0
SP258	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT902	100	95	83	0	0	0	0	0	0	0	0	0	118	77	89	0	0	0	0	0	0	0	0	0	58	62	46	0	0	0	0	0	0	0	0	0
SP26	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Lachnoanaerobaculum;saburreum	451	424	492	10	0	10	0	29	0	0	0	0	583	582	513	15	24	22	0	0	0	0	0	0	421	485	418	12	19	15	0	0	0	0	0	0
SP260	Bacteria;Firmicutes;Bacilli;Lactobacillales;Enterococcaceae;Enterococcus;faecalis	0	0	0	0	0	0	0	162	0	2384	2314	2464	0	0	0	0	0	0	0	0	0	12210	9152	11649	0	0	0	0	0	0	0	0	0	7415	7712	8014
SP263	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;histicola	64	52	44	23	41	31	0	0	0	0	0	0	66	60	57	20	31	14	0	0	0	0	0	0	36	52	31	11	8	15	0	0	0	0	0	0
SP264	Bacteria;Actinobacteria;Actinomycetia;Micrococcales;Micrococcaceae;Rothia;dentocariosa	2861	3479	3258	595	593	539	0	0	20	0	0	0	3666	3724	3379	702	913	838	0	0	0	0	0	0	3054	3488	2872	763	707	1238	0	0	0	0	0	0
SP267	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;copri	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	525	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP268	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT347	360	403	296	17	34	33	0	0	0	0	0	0	411	307	303	13	13	15	0	0	0	0	0	0	238	268	202	14	15	19	0	0	0	0	0	0
SP27	Bacteria;Firmicutes;Clostridia;Clostridiales;Peptoniphilaceae;Parvimonas;sp. HMT110	169	143	140	172	169	135	0	0	0	0	0	0	172	169	165	160	157	169	0	0	0	0	0	0	73	71	124	174	204	259	0	0	0	0	0	0
SP271	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;sobrinus	0	0	0	167	92	83	0	0	0	0	0	0	0	0	0	1512	1732	1909	0	0	136	0	0	0	0	0	0	1688	1687	2674	0	0	0	0	0	0
SP272	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT448	1026	1002	1035	0	0	0	0	12	0	0	0	0	2350	2588	2153	0	0	0	0	0	0	0	0	0	2575	2608	2282	0	0	0	0	0	0	0	0	0
SP273	Bacteria;Absconditabacteria_(SR1);Absconditabacteria_(SR1)_[C-1];Absconditabacteria_(SR1)_[O-1];Absconditabacteria_(SR1)_[F-1];Absconditabacteria_(SR1)_[G-1];bacterium HMT874	59	59	63	0	0	0	0	0	0	0	0	0	64	66	81	0	0	0	0	0	0	0	0	0	49	44	44	0	0	0	0	0	0	0	0	0
SP274	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Alloprevotella;sp. HMT473	5474	4593	5388	0	0	0	0	21	15	0	0	0	5127	3994	4055	0	0	0	0	0	0	0	0	0	2546	2888	2281	0	0	0	0	0	0	0	0	0
SP275	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT335	167	174	160	0	0	0	0	13	0	0	0	0	202	189	143	0	0	0	0	0	0	0	0	0	97	141	86	0	0	0	0	0	0	0	0	0
SP276	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Bergeyella;sp. HMT322	224	200	204	349	363	206	0	0	0	0	0	0	216	174	161	353	393	272	0	0	0	0	0	0	96	122	107	212	183	257	0	0	0	0	0	0
SP278	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;cristatus_clade_578	291	346	343	22	28	22	0	0	0	0	0	0	522	581	497	20	31	17	0	0	0	0	0	0	647	614	593	13	18	80	0	0	0	0	0	0
SP285	Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacterales;Enterobacteriaceae;Escherichia;coli	0	0	0	0	0	0	0	183	29	6740	6833	6295	0	0	0	0	0	0	0	0	0	4138	2914	5078	0	0	0	0	0	0	0	0	0	2677	2073	2564
SP294	Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacterales;Enterobacteriaceae;Shigella;sonnei	0	0	0	0	0	0	0	968	20	13623	13496	12365	0	0	0	0	0	0	0	0	0	8182	5492	9838	0	0	0	0	0	0	0	0	0	5026	4052	5040
SP295	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT942	71	74	77	0	0	0	0	0	0	0	0	0	89	50	78	0	0	0	0	0	0	0	0	0	34	41	45	0	0	0	0	0	0	0	0	0
SP296	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;dentalis	209	196	227	0	0	7	0	25	0	0	0	0	1061	996	917	6	10	0	0	0	0	0	0	0	929	933	825	7	6	6	0	0	0	0	0	0
SP298	Bacteria;Proteobacteria;Betaproteobacteria;Burkholderiales;Oxalobacteraceae;Herbaspirillum;huttiense	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	166	495	0	0	0
SP299	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;sp. HMT066	0	0	0	12	37	15	0	0	0	0	0	0	12	15	7	34	44	40	0	0	0	0	0	0	14	13	10	44	67	84	0	0	0	0	0	0
SP3	Bacteria;Firmicutes;Negativicutes;Selenomonadales;Selenomonadaceae;Selenomonas;noxia	434	342	415	0	0	0	0	0	0	0	0	0	426	284	331	0	0	0	0	0	0	0	0	0	205	174	203	0	0	0	0	0	0	0	0	0
SP302	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT338	171	106	136	0	0	0	0	0	0	0	0	0	134	124	127	0	0	0	0	0	0	0	0	0	97	99	72	0	0	0	0	0	0	0	0	0
SP308	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;anginosus	82	84	80	0	0	0	0	0	0	0	0	0	277	293	253	0	0	0	0	0	0	0	0	0	250	331	310	0	0	0	0	0	0	0	0	0
SP31	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;gingivalis	0	0	0	0	0	0	345	87909	87656	0	0	0	0	0	0	0	0	0	237	66622	78377	0	0	0	0	0	0	0	0	0	185	104322	89719	0	0	0
SP311	Bacteria;Firmicutes;Clostridia;Clostridiales;Peptostreptococcaceae_[XI];Peptostreptococcaceae_[XI][G-9];[Eubacterium]_brachy	52	35	34	21	6	20	0	0	0	0	0	0	89	134	66	33	45	33	0	0	0	0	0	0	133	75	85	22	34	54	0	0	0	0	0	0
SP316	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Bacteroidales_[F-2];Bacteroidales_[G-2];bacterium HMT274	38	62	55	0	0	0	0	0	0	0	0	0	127	45	47	0	0	0	0	0	0	0	0	0	24	56	67	0	0	0	0	0	0	0	0	0
SP319	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;johnsonii	10	10	16	73	86	65	0	0	0	0	0	0	53	58	49	232	215	270	0	0	0	0	0	0	57	46	37	231	230	333	0	0	0	0	0	0
SP32	Bacteria;Firmicutes;Negativicutes;Veillonellales;Veillonellaceae;Dialister;invisus	247	209	214	99	69	85	0	0	0	0	0	0	293	191	237	141	90	95	0	0	0	0	0	0	153	184	135	51	50	50	0	0	0	0	0	0
SP321	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;intermedius	156	169	188	225	298	244	0	0	0	0	0	0	342	360	326	413	430	514	0	0	0	0	0	0	398	409	338	447	544	641	0	0	0	0	0	0
SP322	Bacteria;Firmicutes;Negativicutes;Selenomonadales;Selenomonadaceae;Selenomonas;sp. HMT920	0	4	14	95	70	47	0	0	0	0	0	0	8	14	2	61	67	58	0	0	0	0	0	0	5	10	0	30	35	47	0	0	0	0	0	0
SP323	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;sp. HMT278	322	263	304	98	105	86	0	0	0	0	0	0	316	239	221	96	102	86	0	0	0	0	0	0	127	193	137	50	44	76	0	0	0	0	0	0
SP326	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Kingella;oralis	590	547	530	141	148	97	0	0	0	0	0	0	697	545	562	148	168	134	0	0	0	0	0	0	376	345	296	87	78	151	0	0	0	0	0	0
SP327	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;hongkongensis	333	324	315	874	1054	736	0	0	0	0	0	0	422	355	315	802	662	602	0	0	0	0	0	0	181	217	147	513	440	635	0	0	0	0	0	0
SP33	Bacteria;Firmicutes;Bacilli;Bacillales;Bacillaceae;Allobacillus;halotolerans	749	0	0	469	0	0	10801	1479	477	205	0	0	2677	0	0	1373	0	0	61317	753	1104	799	0	0	3017	0	0	1899	0	0	90712	2163	2318	1393	0	0
SP331	Bacteria;Firmicutes;Clostridia;Eubacteriales;Oscillospiraceae;Faecalibacterium;prausnitzii	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	439	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SP332	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT863	59	59	42	0	0	0	0	17	0	0	0	0	65	87	63	0	0	0	0	0	0	0	0	0	31	42	34	0	0	0	0	0	0	0	0	0
SP337	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;oralis_subsp._tigurinus_clade_070	247	267	316	0	0	0	0	0	0	0	0	0	846	864	804	0	0	0	0	0	0	0	0	0	819	962	770	0	0	0	0	0	0	0	0	0
SP34	Bacteria;Deinococcus-Thermus;Deinococci;Trueperales;Trueperaceae;Truepera;radiovictrix	0	3	4	0	0	0	49	24282	26208	0	0	0	19	7	7	4	6	6	213	81459	59604	5	4	0	5	3	5	4	5	10	198	44119	40847	3	0	0
SP346	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT324	113	116	119	0	0	0	0	0	0	0	0	0	162	90	120	0	0	0	0	0	0	0	0	0	77	115	41	0	0	0	0	0	0	0	0	0
SP349	Bacteria;Firmicutes;Bacilli;Bacillales;Staphylococcaceae;Staphylococcus;warneri	0	0	0	0	0	0	9	1865	605	0	0	0	0	0	0	0	0	0	0	991	235	0	0	0	0	0	0	0	0	0	0	2474	1929	0	0	0
SP35	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria;subflava	0	0	0	0	0	0	29	8699	7963	0	0	0	0	0	0	0	0	0	30	7621	8856	0	0	0	0	0	0	2	0	0	24	15001	11650	0	0	0
SP350	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria;cinerea	854	757	848	0	0	0	0	0	0	0	0	0	789	684	653	0	0	0	0	0	0	0	0	0	425	477	387	0	0	0	0	0	0	0	0	0
SP356	Bacteria;Actinobacteria;Actinomycetia;Micrococcales;Micrococcaceae;Rothia;aeria	46	51	52	272	193	162	0	0	0	0	0	0	49	60	56	226	239	235	0	0	0	0	0	0	31	61	35	225	274	333	0	0	0	0	0	0
SP358	Bacteria;Actinobacteria;Coriobacteriia;Coriobacteriales;Atopobiaceae;Olsenella;sp. HMT807	24	17	33	26	31	22	0	0	0	0	0	0	254	186	189	74	38	36	0	0	0	0	0	0	196	165	189	38	51	52	0	0	0	0	0	0
SP36	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;denticola	73	71	81	136	121	68	0	7	10	0	0	0	79	54	74	168	95	83	0	0	0	0	0	0	38	61	68	45	51	97	0	0	0	0	0	0
SP369	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;meyeri	27	31	43	0	26	0	0	49	136	0	0	0	40	34	35	42	23	32	0	0	0	0	0	0	39	42	17	53	57	61	0	0	0	0	0	0
SP375	Bacteria;Firmicutes;Clostridia;Clostridiales;Peptoniphilaceae;Parvimonas;micra	234	193	292	0	0	0	0	0	0	0	0	0	288	260	255	0	0	0	0	0	0	0	0	0	310	267	212	0	0	0	0	0	0	0	0	0
SP378	Bacteria;Actinobacteria;Coriobacteriia;Coriobacteriales;Atopobiaceae;Lancefieldella;rimae	32	46	45	44	89	52	0	0	0	0	0	0	24	21	26	38	37	48	0	0	0	0	0	0	39	21	28	46	29	56	0	0	0	0	0	0
SP379	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;endodontalis	362	368	318	0	0	0	0	0	0	0	0	0	424	272	369	0	0	0	0	0	0	0	0	0	212	217	160	0	0	0	0	0	0	0	0	0
SP38	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;granulosa	537	475	502	74	33	53	0	30	0	0	0	0	679	501	529	62	67	72	0	0	0	0	0	0	290	344	286	48	43	52	0	0	0	0	0	0
SP380	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;goodfellowii	134	79	92	0	0	0	0	0	0	0	0	0	123	76	78	0	0	0	0	0	0	0	0	0	51	70	52	0	0	0	0	0	0	0	0	0
SP384	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;sp. HMT423	4067	6652	5303	1857	2141	1672	0	111	0	0	0	0	6929	7384	6758	2910	3196	3052	0	0	0	0	0	0	7774	8336	7759	2950	2867	4027	0	0	0	0	0	0
SP396	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;intermedia	610	679	853	214	179	124	0	0	0	0	0	0	679	482	486	164	148	126	0	0	0	0	0	0	385	380	223	80	76	107	0	0	0	0	0	0
SP397	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT348	426	454	463	63	87	74	0	0	0	0	0	0	362	336	300	52	67	50	0	0	0	0	0	0	202	304	229	19	28	74	0	0	0	0	0	0
SP398	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT346	552	542	504	242	219	168	0	0	73	0	0	0	530	448	446	153	154	133	0	0	0	0	0	0	375	421	373	94	104	167	0	0	0	0	0	0
SP4	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Peptidiphaga;sp. HMT183	347	299	348	0	0	0	0	7	0	0	0	0	938	889	861	8	13	0	0	0	0	0	0	0	1143	875	982	7	17	17	0	0	0	0	0	0
SP404	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;sp. HMT877	84	77	75	0	0	0	0	0	0	0	0	0	224	173	208	0	0	0	0	0	0	0	0	0	262	223	233	0	0	0	0	0	0	0	0	0
SP41	Bacteria;Proteobacteria;Betaproteobacteria;Burkholderiales;Burkholderiaceae;Lautropia;mirabilis	90	106	80	3397	4061	2852	0	4	0	0	0	0	76	45	85	2319	2618	2401	0	0	0	0	0	0	79	110	72	3059	3282	4812	0	0	0	0	0	0
SP411	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;cristatus	214	231	214	0	0	0	0	0	0	0	0	0	382	427	352	0	0	0	0	0	0	0	0	0	427	409	369	0	0	0	0	0	0	0	0	0
SP412	Bacteria;Actinobacteria;Actinobacteria;Bifidobacteriales;Bifidobacteriaceae;Scardovia;wiggsiae	0	0	0	90	152	81	0	0	0	0	0	0	0	0	0	112	102	129	0	0	0	0	0	0	0	0	0	132	136	186	0	0	0	0	0	0
SP413	Bacteria;Firmicutes;Erysipelotrichia;Erysipelotrichales;Erysipelotrichaceae;Solobacterium;moorei	57	62	68	74	69	52	0	0	0	0	0	0	78	43	52	51	40	45	0	0	0	0	0	0	56	60	41	53	39	74	0	0	0	0	0	0
SP42	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT314	451	397	406	0	0	0	0	0	0	0	0	0	515	325	363	0	0	0	0	0	0	0	0	0	266	256	170	0	0	0	0	0	0	0	0	0
SP423	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sp. HMT412	283	229	294	0	0	0	0	0	0	0	0	0	295	253	270	0	0	0	0	0	0	0	0	0	141	188	149	0	0	0	0	0	0	0	0	0
SP424	Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria;flavescens	116	88	87	0	0	0	0	0	0	0	0	0	96	73	77	0	0	0	0	0	0	0	0	0	74	49	60	0	0	0	0	0	0	0	0	0
SP426	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Propionibacteriaceae;Cutibacterium;acnes	0	0	0	0	0	0	14	1145	177	0	0	0	0	0	0	0	9	0	4	694	279	0	0	0	0	0	0	0	0	0	0	174	0	0	0	0
SP430	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;endodontalis	175	145	211	0	0	0	0	0	0	0	0	0	195	186	128	0	0	0	0	0	0	0	0	0	86	124	72	0	0	0	0	0	0	0	0	0
SP44	Bacteria;Firmicutes;Bacilli;Lactobacillales;Lactobacillaceae;Limosilactobacillus;fermentum	0	0	0	0	0	0	0	229	0	3197	3364	3806	2	0	0	0	0	0	0	0	0	12399	9705	10032	0	0	0	0	0	3	0	0	0	20409	15549	15182
SP46	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Lachnoanaerobaculum;umeaense	90	86	79	0	0	7	0	0	0	0	0	0	101	75	76	0	0	0	0	0	0	0	0	0	63	71	57	0	0	4	0	0	0	0	0	0
SP49	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;oulorum	502	425	502	131	89	88	0	0	0	0	0	0	636	473	421	95	81	67	0	0	0	0	0	0	282	286	254	40	46	62	0	0	0	0	0	0
SP5	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;parvula	3400	3187	3253	1595	1392	1299	0	45	0	0	0	0	2639	2415	2515	1023	1312	1054	0	0	0	0	0	0	1595	2062	1630	670	715	977	0	0	0	0	0	0
SP52	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;maculosa	106	81	109	6	9	2	0	0	0	0	0	0	91	60	59	10	4	7	0	0	0	0	0	0	53	63	44	6	0	6	0	0	0	0	0	0
SP53	Bacteria;Firmicutes;Bacilli;Bacillales;Gemellaceae;Gemella;haemolysans	287	264	349	115	89	93	0	0	0	0	0	0	703	765	672	174	195	213	0	0	0	0	0	0	620	743	607	166	170	264	0	0	0	0	0	0
SP54	Bacteria;Proteobacteria;Alphaproteobacteria;Sphingomonadales;Erythrobacteraceae;Qipengyuania;seohaensis	0	0	0	0	0	0	3	795	470	0	0	0	0	0	0	0	0	0	0	555	964	0	0	0	0	0	0	0	0	0	0	1020	336	0	0	0
SP55	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Capnocytophaga;sputigena	30	23	17	1418	807	1017	0	0	0	0	0	0	31	24	15	1573	1629	1323	0	0	0	0	0	0	17	14	10	939	806	1114	0	0	0	0	0	0
SP56	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;naeslundii	194	178	192	123	81	76	0	15	0	0	0	0	544	570	517	321	374	381	0	0	0	0	0	0	613	548	532	409	398	604	0	0	0	0	0	0
SP58	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT300	140	101	117	116	84	59	0	0	0	0	0	0	149	105	87	87	64	57	0	0	0	0	0	0	72	82	83	28	37	56	0	0	0	0	0	0
SP59	Bacteria;Firmicutes;Bacilli;Bacillales;Listeriaceae;Listeria;monocytogenes	0	0	0	0	0	0	0	98	0	3422	3159	3501	0	0	0	0	0	0	0	0	0	16386	12531	11950	0	0	0	0	0	0	0	0	0	14198	13869	13595
SP6	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;dispar	887	733	739	0	31	0	0	0	0	0	0	0	625	626	571	0	0	0	0	0	0	0	0	0	375	476	338	0	0	0	0	0	0	0	0	0
SP60	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;sp. HMT064	2719	3823	3540	38	46	26	0	31	0	0	0	0	4553	4864	4536	28	14	48	0	0	0	0	0	0	5265	5721	4988	35	49	39	0	0	0	0	0	0
SP62	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Catonella;morbi	131	102	135	167	186	132	0	0	0	0	0	0	148	95	102	132	142	161	0	0	0	0	0	0	73	70	55	108	106	146	0	0	0	0	0	0
SP63	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT525	32	24	23	37	36	33	0	0	0	0	0	0	136	116	136	171	158	148	0	0	0	0	0	0	200	133	132	154	141	272	0	0	0	0	0	0
SP64	Bacteria;Firmicutes;Clostridia;Clostridiales;Lachnospiraceae_[XIV];Oribacterium;sp. HMT078	186	165	220	10	12	6	0	0	0	0	0	0	230	209	174	0	9	8	0	0	0	0	0	0	189	200	174	6	0	9	0	0	0	0	0	0
SP65	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;melaninogenica	1279	1071	1239	2569	2517	1840	0	0	0	0	0	0	1063	959	993	1801	1793	1595	0	0	0	0	0	0	548	727	514	1009	1009	1363	0	0	0	0	0	0
SP66	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;oralis_subsp._tigurinus_clade_071	650	1351	941	194	318	264	0	34	0	0	0	0	1581	1766	1624	467	455	513	0	0	0	0	0	0	1922	2002	1842	436	460	650	0	0	0	0	0	0
SP67	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;saccharolytica	56	60	65	65	73	39	0	0	0	0	0	0	74	32	53	42	38	44	0	0	0	0	0	0	22	23	25	35	22	28	0	0	0	0	0	0
SP68	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT957	36	48	52	5382	6855	4387	0	0	0	0	0	0	43	36	38	5158	5381	4397	0	0	0	0	0	0	40	41	35	3353	3392	4693	0	0	0	0	0	0
SP69	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;oralis_subsp._dentisani_clade_058	323	338	339	11	25	14	0	0	0	0	0	0	420	441	374	8	15	15	0	0	0	0	0	0	378	474	346	16	11	21	0	0	0	0	0	0
SP70	Bacteria;Firmicutes;Bacilli;Lactobacillales;Carnobacteriaceae;Granulicatella;adiacens	2339	2057	1843	1137	1235	1165	0	29	0	0	0	0	2381	2307	2140	1285	1415	1349	0	0	0	0	0	0	2383	2640	2056	1325	1176	1820	0	0	0	0	0	0
SP71	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;massiliensis	381	359	379	0	0	0	0	0	0	0	0	0	901	802	753	42	40	53	0	0	0	0	0	0	898	795	756	36	34	49	0	0	0	0	0	0
SP72	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;chosunense	858	912	934	28	23	0	0	22	0	0	0	0	809	879	786	0	5	25	8	0	0	0	0	0	767	975	827	0	0	20	0	0	0	0	0	0
SP77	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;oralis	935	2302	1356	101	134	98	0	0	0	0	0	0	1790	2036	1695	142	139	205	0	0	0	0	0	0	1974	2081	1949	188	214	257	0	0	0	0	0	0
SP78	Bacteria;Proteobacteria;Gammaproteobacteria;Cardiobacteriales;Cardiobacteriaceae;Cardiobacterium;hominis	71	56	77	26	12	12	0	0	0	0	0	0	100	60	61	12	10	12	0	0	0	0	0	0	29	42	41	0	6	16	0	0	0	0	0	0
SP80	Bacteria;Spirochaetes;Spirochaetia;Spirochaetales;Treponemataceae;Treponema;socranskii	74	52	92	26	24	15	0	0	0	0	0	0	63	37	48	34	21	19	0	0	0	0	0	0	36	39	34	11	17	21	0	0	0	0	0	0
SP81	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;sanguinis	539	481	559	3120	2990	2630	0	0	0	0	0	0	1126	1100	1018	6625	7791	7989	0	0	0	0	0	0	1130	1242	1053	9332	8953	11651	0	0	0	0	0	0
SP82	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Peptidiphaga;gingivicola	63	73	106	0	0	0	0	28	0	0	0	0	179	192	175	0	0	0	0	0	0	0	0	0	222	238	147	0	0	0	0	0	0	0	0	0
SP83	Bacteria;Proteobacteria;Epsilonproteobacteria;Campylobacterales;Campylobacteraceae;Campylobacter;gracilis	1058	930	1007	300	108	184	0	0	0	0	0	0	1290	885	953	283	256	201	0	0	0	0	0	0	640	705	567	134	134	152	0	0	0	0	0	0
SP84	Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Actinomycetaceae;Schaalia;odontolytica	155	134	121	4530	4536	3504	0	0	0	0	0	0	183	204	200	7130	8183	8930	0	0	0	0	0	0	231	251	188	10985	11611	16576	0	0	0	0	0	0
SP86	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Haemophilus;sp. HMT036	325	276	345	23	16	20	0	0	0	0	0	0	365	299	295	0	0	13	0	0	0	0	0	0	184	250	162	0	13	12	0	0	0	0	0	0
SP87	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;hwasookii	10	11	24	2739	2497	2134	0	0	0	0	0	0	12	13	14	2949	2902	2025	0	0	0	0	0	0	0	0	0	1632	1435	2029	0	0	0	0	0	0
SP89	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;nucleatum_subsp._vincentii	533	601	609	12	0	0	0	0	0	0	0	0	709	594	519	0	0	0	0	0	0	0	0	0	405	344	321	0	0	0	0	0	0	0	0	0
SP9	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;oris	477	471	496	604	530	452	65	24473	18076	0	0	0	448	332	344	488	418	399	39	15758	16098	0	0	0	251	258	156	225	191	359	23	24337	20895	0	0	0
SP90	Bacteria;Firmicutes;Clostridia;Clostridiales;Ruminococcaceae;Ruminococcaceae_[G-1];bacterium HMT075	146	127	171	684	807	554	0	0	0	0	0	0	168	100	112	528	520	532	0	0	0	0	0	0	74	64	78	356	330	432	0	0	0	0	0	0
SP91	Bacteria;Fusobacteria;Fusobacteriia;Fusobacteriales;Leptotrichiaceae;Leptotrichia;sp. HMT215	455	480	512	73	66	61	0	33	0	0	0	0	460	433	408	31	49	40	0	0	0	0	0	0	280	377	308	12	24	41	0	0	0	0	0	0
SP94	Bacteria;Firmicutes;Negativicutes;Selenomonadales;Selenomonadaceae;Selenomonas;infelix	44	56	51	62	51	43	0	0	0	0	0	0	73	54	39	35	36	39	0	0	0	0	0	0	32	38	42	17	22	25	0	0	0	0	0	0
SP95	Bacteria;Proteobacteria;Gammaproteobacteria;Pseudomonadales;Pseudomonadaceae;Pseudomonas;aeruginosa	0	0	0	0	0	0	0	745	37	10101	10666	10060	0	0	0	0	0	0	0	0	0	5248	3696	7005	0	0	0	0	0	0	0	0	4	3364	2662	3144
SP96	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;sp. HMT317	627	530	578	10	0	0	0	0	0	0	0	0	724	462	559	0	0	0	0	0	0	0	0	0	283	350	343	5	0	0	0	0	0	0	0	0
SP97	Bacteria;Firmicutes;Bacilli;Lactobacillales;Carnobacteriaceae;Granulicatella;elegans	688	656	659	297	309	253	0	0	0	0	0	0	503	512	516	169	218	208	0	0	0	0	0	0	378	438	333	129	152	198	0	0	0	0	0	0
SP98	Bacteria;Proteobacteria;Epsilonproteobacteria;Campylobacterales;Campylobacteraceae;Campylobacter;concisus	524	519	526	43	46	43	0	0	0	0	0	0	744	466	442	38	38	34	0	0	0	0	0	0	309	360	312	8	15	18	0	0	0	0	0	0
SP99	Bacteria;Firmicutes;Bacilli;Bacillales;Gemellaceae;Gemella;morbillorum	192	170	220	235	181	248	0	0	0	0	0	0	317	330	312	518	589	565	0	0	0	0	0	0	338	330	288	674	584	858	0	0	0	0	0	0
SPN108	Bacteria;Bacteroidota;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Frondibacter;mangrovi_nov_92.484%	4306	6	0	4005	0	0	131030	14646	9438	3167	0	0	4472	0	0	2280	0	0	67205	11550	12127	1549	0	0	1983	0	0	1462	0	0	48943	14464	14420	945	0	0
SPN109	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT171 nov_96.813%	41	48	68	19	24	18	0	0	0	0	0	0	172	184	212	33	54	52	0	0	0	0	0	0	314	222	223	49	71	63	0	0	0	0	0	0
SPN121	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;israelii_nov_94.882%	98	55	71	0	0	0	0	0	0	0	0	0	119	131	143	0	0	0	0	0	0	0	0	0	141	169	143	0	0	0	0	0	0	0	0	0
SPN133	Bacteria;Firmicutes;Clostridia;Eubacteriales;Oscillospiraceae;Faecalibacterium;prausnitzii_nov_96.976%	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	986	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
SPN141	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;veroralis_nov_97.342%	145	124	109	0	0	0	0	0	0	0	0	0	136	123	129	0	0	0	0	0	0	0	0	0	78	74	78	0	0	0	0	0	0	0	0	0
SPN142	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-1];bacterium HMT957 nov_97.550%	125	113	145	0	0	0	0	0	0	0	0	0	119	94	109	0	0	0	0	0	0	0	0	0	78	108	87	0	0	0	0	0	0	0	0	0
SPN153	Bacteria;Firmicutes;Clostridia;Eubacteriales;Lachnospiraceae;Lacrimispora;xylanolytica_nov_88.613%	54	29	63	10	10	15	0	0	0	0	0	0	139	98	89	43	40	36	0	0	0	0	0	0	61	73	66	33	24	27	0	0	0	0	0	0
SPN164	Bacteria;Bacteroidetes;Flavobacteriia;Flavobacteriales;Flavobacteriaceae;Bergeyella;zoohelcum_nov_92.593%	115	89	85	0	0	0	0	0	0	0	0	0	146	153	100	0	0	0	0	0	0	0	0	0	70	61	34	0	0	0	0	0	0	0	0	0
SPN176	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT169 nov_97.992%	99	66	95	0	0	0	0	0	0	0	0	0	98	102	91	0	0	0	0	0	0	0	0	0	82	86	65	0	0	0	0	0	0	0	0	0
SPN185	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;israelii_nov_96.647%	0	0	0	20	20	23	0	0	0	0	0	0	0	18	0	115	100	87	0	0	0	0	0	0	0	17	0	142	71	116	0	0	0	0	0	0
SPN194	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;naeslundii_nov_97.760%	0	0	0	24	23	19	0	0	31	0	0	0	0	0	0	51	87	89	0	0	0	0	0	0	0	0	0	96	80	113	0	0	0	0	0	0
SPN204	Bacteria;Proteobacteria;Epsilonproteobacteria;Campylobacterales;Campylobacteraceae;Campylobacter;concisus_nov_97.397%	75	72	87	0	0	0	0	0	0	0	0	0	64	47	64	0	0	0	0	0	0	0	0	0	30	55	30	0	0	0	0	0	0	0	0	0
SPN3	Bacteria;Actinobacteria;Actinomycetia;Actinomycetales;Actinomycetaceae;Actinomyces;sp. HMT175 nov_97.746%	886	871	928	0	0	0	0	85	0	0	0	0	1360	1342	1237	0	0	0	0	0	0	0	0	0	1397	1327	1246	0	0	0	0	0	0	0	0	0
SPN43	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Porphyromonadaceae;Porphyromonas;sp. HMT284 nov_97.746%	130	107	132	0	0	0	0	0	0	0	0	0	146	111	107	0	0	0	0	0	0	0	0	0	63	122	68	0	0	0	0	0	0	0	0	0
SPN52	Bacteria;Actinobacteria;Actinomycetia;Corynebacteriales;Corynebacteriaceae;Corynebacterium;matruchotii_nov_97.959%	826	718	771	0	0	0	0	0	0	0	0	0	1688	1455	1306	0	0	0	0	0	0	0	0	0	833	1047	971	0	0	0	0	0	0	0	0	0
SPN64	Bacteria;Saccharibacteria_(TM7);Saccharibacteria_(TM7)_[C-1];Saccharibacteria_(TM7)_[O-1];Saccharibacteria_(TM7)_[F-1];Saccharibacteria_(TM7)_[G-6];bacterium HMT870 nov_96.994%	706	606	654	0	0	0	0	0	0	0	0	0	523	483	466	0	0	0	0	0	0	0	0	0	352	357	361	0	0	0	0	0	0	0	0	0
SPN74	Bacteria;Actinobacteria;Actinomycetia;Propionibacteriales;Nocardioidaceae;Aeromicrobium;panaciterrae_nov_96.104%	0	0	0	0	0	0	0	725	398	0	0	0	0	0	0	0	0	0	0	365	413	0	0	0	0	0	0	0	0	0	0	716	347	0	0	0
SPN85	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;sp. HMT780 nov_97.228%	456	370	431	0	0	0	0	0	0	0	0	0	326	278	335	0	0	0	0	0	0	0	0	0	201	222	189	0	0	0	0	0	0	0	0	0
SPN96	Bacteria;Actinobacteria;Actinomycetia;Streptosporangiales;Nocardiopsaceae;Nocardiopsis;nikkonensis_nov_97.257%	0	0	0	0	0	0	6	531	601	0	0	0	0	0	0	0	0	0	3	198	353	0	0	0	0	0	0	0	0	0	0	413	459	0	0	0
SPP1	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;multispecies_spp1_2	7	0	0	86	108	103	0	0	0	0	0	0	0	0	0	77	96	116	0	0	0	0	0	0	0	0	19	88	138	93	0	0	0	0	0	0
SPP11	Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae;Streptococcus;multispecies_spp11_2	0	0	0	1444	1781	1378	0	0	0	0	0	0	0	0	0	1192	1427	1575	0	0	0	0	0	0	0	0	0	1281	1313	1729	0	0	0	0	0	0
SPP14	Bacteria;Proteobacteria;Gammaproteobacteria;Pasteurellales;Pasteurellaceae;Haemophilus;multispecies_spp14_2	363	273	318	0	0	0	0	0	0	0	0	0	330	274	284	0	0	0	0	0	0	0	0	0	169	197	179	0	0	0	0	0	0	0	0	0
SPP21	Bacteria;Bacteroidetes;Bacteroidia;Bacteroidales;Prevotellaceae;Prevotella;multispecies_spp21_2	113	124	109	0	0	0	0	0	0	0	0	0	130	95	112	0	0	0	0	0	0	0	0	0	76	68	68	0	0	0	0	0	0	0	0	0
SPP25	Bacteria;Firmicutes;Clostridia;Negativicutes;Veillonellaceae;Veillonella;multispecies_spp25_2	11022	10046	10487	31676	35405	28667	0	33	49	0	0	0	8508	7680	7785	21129	24981	20592	0	131	0	0	0	0	5321	6241	5302	13357	13580	18729	0	0	0	0	0	0
SPP4	Bacteria;Firmicutes;Bacilli;Bacillales;Staphylococcaceae;Staphylococcus;multispecies_spp4_3	0	0	0	0	0	0	0	136	0	1201	1210	1335	0	0	0	0	0	0	0	0	0	21688	16772	17358	0	0	0	0	0	0	0	0	0	13893	14221	14131
SPP5	Bacteria;Fusobacteria;Fusobacteria;Fusobacteriales;Fusobacteriaceae;Fusobacterium;multispecies_spp5_2	418	336	379	97	0	0	0	22	0	0	0	0	462	346	381	73	0	0	0	0	0	0	0	0	245	316	268	0	0	0	0	0	0	0	0	0
SPP9	Bacteria;Proteobacteria;Gammaproteobacteria;Oceanospirillales;Halomonadaceae;Halomonas;multispecies_spp9_2	0	0	0	0	0	0	0	45	114	0	0	0	0	0	0	0	0	0	0	209	226	0	0	0	0	0	0	0	0	0	0	245	57	0	0	0

Download OTU Tables at Different Taxonomy Levels
Phylum	Count*:	Relative**:	CLR***:
Class	Count*:	Relative**:	CLR***:
Order	Count*:	Relative**:	CLR***:
Family	Count*:	Relative**:	CLR***:
Genus	Count*:	Relative**:	CLR***:
Species	Count*:	Relative**:	CLR***:
* Read count
** Relative abundance (count/total sample count)
*** Centered log ratio transformed abundance

;

The species listed in the table has full taxonomy and a dynamically assigned species ID specific to this report. When some reads match with the reference sequences of more than one species equally (i.e., same percent identiy and alignmnet coverage), they can't be assigned to a particular species. Instead, they are assigned to multiple species with the species notaton "s__multispecies_spp2_2". In this notation, spp2 is the dynamic ID assigned to these reads that hit multiple sequences and the "_2" at the end of the notation means there are two species in the spp2.

You can look up which species are included in the multi-species assignment, in this table below:

Another type of notation is "s__multispecies_sppn2_2", in which the "n" in the sppn2 means it's a potential novel species because all the reads in this species have < 98% idenity to any of the reference sequences. They were grouped together based on de novo OTU clustering at 98% identity cutoff. And then a representative sequence was chosed to BLASTN search against the reference database to find the closest match (but will still be < 98%). This representative sequence also matched equally to more than one species, hence the "spp" was given in the label.

Taxonomy Bar Plots for All Samples

Taxonomy Bar Plots for Individual Comparison Groups

Comparison No.	Comparison Name	Families		Genera		Species
Comparison 1	SUPA vs SUPB vs OM vs ZM	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 2	T vs F	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	PDF	SVG	PDF	SVG	PDF	SVG

VIII. Analysis - Alpha Diversity

In ecology, alpha diversity (α-diversity) is the mean species diversity in sites or habitats at a local scale. The term was introduced by R. H. Whittaker[1][2] together with the terms beta diversity (β-diversity) and gamma diversity (γ-diversity). Whittaker's idea was that the total species diversity in a landscape (gamma diversity) is determined by two different things, the mean species diversity in sites or habitats at a more local scale (alpha diversity) and the differentiation among those habitats (beta diversity).

References:
Whittaker, R. H. (1960) Vegetation of the Siskiyou Mountains, Oregon and California. Ecological Monographs, 30, 279–338. doi:10.2307/1943563
Whittaker, R. H. (1972). Evolution and Measurement of Species Diversity. Taxon, 21, 213-251. doi:10.2307/1218190

Alpha Diversity Analysis by Rarefaction

Diversity measures are affected by the sampling depth. Rarefaction is a technique to assess species richness from the results of sampling. Rarefaction allows the calculation of species richness for a given number of individual samples, based on the construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the number of samples. Rarefaction curves generally grow rapidly at first, as the most common species are found, but the curves plateau as only the rarest species remain to be sampled.

References:
Willis AD. Rarefaction, Alpha Diversity, and Statistics. Front Microbiol. 2019 Oct 23;10:2407. doi: 10.3389/fmicb.2019.02407. PMID: 31708888; PMCID: PMC6819366.

Boxplot of Alpha-diversity Indices

The two main factors taken into account when measuring diversity are richness and evenness. Richness is a measure of the number of different kinds of organisms present in a particular area. Evenness compares the similarity of the population size of each of the species present. There are many different ways to measure the richness and evenness. These measurements are called "estimators" or "indices". Below is a diversity of 3 commonly used indices showing the values for all the samples (dots) and in groups (boxes).

Alpha Diversity Box Plots for All Groups

Alpha Diversity Box Plots for Individual Comparisons

Comparison 1	SUPA vs SUPB vs OM vs ZM	View in PDF	View in SVG
Comparison 2	T vs F	View in PDF	View in SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	View in PDF	View in SVG

Group Significance of Alpha-diversity Indices

To test whether the alpha diversity among different comparison groups are different statistically, we use the Kruskal Wallis H test provided the "alpha-group-significance" fucntion in the QIIME 2 "diversity" package. Kruskal Wallis H test is the non-parametric alternative to the One Way ANOVA. Non-parametric means that the test doesn’t assume your data comes from a particular distribution. The H test is used when the assumptions for ANOVA aren’t met (like the assumption of normality). It is sometimes called the one-way ANOVA on ranks, as the ranks of the data values are used in the test rather than the actual data points. The H test determines whether the medians of two or more groups are different.

Below are the Kruskal Wallis H test results for each comparison based on three different alpha diversity measures: 1) Observed species (features), 2) Shannon index, and 3) Simpson index.

Comparison 1.	SUPA vs SUPB vs OM vs ZM	Observed Features	Shannon Index	Simpson Index
Comparison 2.	T vs F	Observed Features	Shannon Index	Simpson Index
Comparison 3.	Masterpure vs PowerSoil vs Zymo	Observed Features	Shannon Index	Simpson Index

IX. Analysis - Beta Diversity

NMDS and PCoA Plots

Beta diversity compares the similarity (or dissimilarity) of microbial profiles between different groups of samples. There are many different similarity/dissimilarity metrics. In general, they can be quantitative (using sequence abundance, e.g., Bray-Curtis or weighted UniFrac) or binary (considering only presence-absence of sequences, e.g., binary Jaccard or unweighted UniFrac). They can be even based on phylogeny (e.g., UniFrac metrics) or not (non-UniFrac metrics, such as Bray-Curtis, etc.).

For microbiome studies, species profiles of samples can be compared with the Bray-Curtis dissimilarity, which is based on the count data type. The pair-wise Bray-Curtis dissimilarity matrix of all samples can then be subject to either multi-dimensional scaling (MDS, also known as PCoA) or non-metric MDS (NMDS).

MDS/PCoA is a scaling or ordination method that starts with a matrix of similarities or dissimilarities between a set of samples and aims to produce a low-dimensional graphical plot of the data in such a way that distances between points in the plot are close to original dissimilarities.

NMDS is similar to MDS, however it does not use the dissimilarities data, instead it converts them into the ranks and use these ranks in the calculation.

In our beta diversity analysis, Bray-Curtis dissimilarity matrix was first calculated and then plotted by the PCoA and NMDS separately. Below are beta diveristy results for all groups together:

NMDS and PCoA Plots for All Groups

The above PCoA and NMDS plots are based on count data. The count data can also be transformed into centered log ratio (CLR) for each species. The CLR data is no longer count data and cannot be used in Bray-Curtis dissimilarity calculation. Instead CLR can be compared with Euclidean distances. When CLR data are compared by Euclidean distance, the distance is also called Aitchison distance.

Below are the NMDS and PCoA plots of the Aitchison distances of the samples:

NMDS and PCoA Plots for Individual Comparisons

Comparison No.	Comparison Name	NMDA				PCoA
Comparison No.	Comparison Name	Bray-Curtis		CLR Euclidean		Bray-Curtis		CLR Euclidean
Comparison 1	SUPA vs SUPB vs OM vs ZM	PDF	SVG	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 2	T vs F	PDF	SVG	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	PDF	SVG	PDF	SVG	PDF	SVG	PDF	SVG

Interactive 3D PCoA Plots - Bray-Curtis Dissimilarity

Interactive 3D PCoA Plots - Euclidean Distance

Interactive 3D PCoA Plots - Correlation Coefficients

Group Significance of Beta-diversity Indices

To test whether the between-group dissimilarities are significantly greater than the within-group dissimilarities, the "beta-group-significance" function provided in the QIIME 2 "diversity" package was used with PERMANOVA (permutational multivariate analysis of variance) as the group significant testing method.

Three beta diversity matrics were used: 1) Bray–Curtis dissimilarity 2) Correlation coefficient matrix , and 3) Aitchison distance (Euclidean distance between clr-transformed compositions).

Comparison 1.	SUPA vs SUPB vs OM vs ZM	Bray–Curtis	Correlation	Aitchison
Comparison 2.	T vs F	Bray–Curtis	Correlation	Aitchison
Comparison 3.	Masterpure vs PowerSoil vs Zymo	Bray–Curtis	Correlation	Aitchison

X. Analysis - Differential Abundance

16S rRNA next generation sequencing (NGS) generates a fixed number of reads that reflect the proportion of different species in a sample, i.e., the relative abundance of species, instead of the absolute abundance. In Mathematics, measurements involving probabilities, proportions, percentages, and ppm can all be thought of as compositional data. This makes the microbiome read count data “compositional” (Gloor et al, 2017). In general, compositional data represent parts of a whole which only carry relative information (http://www.compositionaldata.com/).

The problem of microbiome data being compositional arises when comparing two groups of samples for identifying “differentially abundant” species. A species with the same absolute abundance between two conditions, its relative abundances in the two conditions (e.g., percent abundance) can become different if the relative abundance of other species change greatly. This problem can lead to incorrect conclusion in terms of differential abundance for microbial species in the samples.

When studying differential abundance (DA), the current better approach is to transform the read count data into log ratio data. The ratios are calculated between read counts of all species in a sample to a “reference” count (e.g., mean read count of the sample). The log ratio data allow the detection of DA species without being affected by percentage bias mentioned above

In this report, a compositional DA analysis tool “ANCOM” (analysis of composition of microbiomes) was used. ANCOM transforms the count data into log-ratios and thus is more suitable for comparing the composition of microbiomes in two or more populations. "ANCOM" generates a table of features with W-statistics and whether the null hypothesis is rejected. The “W” is the W-statistic, or number of features that a single feature is tested to be significantly different against. Hence the higher the "W" the more statistical sifgnificant that a feature/species is differentially abundant.

References:

Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome Datasets Are Compositional: And This Is Not Optional. Front Microbiol. 2017 Nov 15;8:2224. doi: 10.3389/fmicb.2017.02224. PMID: 29187837; PMCID: PMC5695134.

Mandal S, Van Treuren W, White RA, Eggesbø M, Knight R, Peddada SD. Analysis of composition of microbiomes: a novel method for studying microbial composition. Microb Ecol Health Dis. 2015 May 29;26:27663. doi: 10.3402/mehd.v26.27663. PMID: 26028277; PMCID: PMC4450248.

Lin H, Peddada SD. Analysis of compositions of microbiomes with bias correction. Nat Commun. 2020 Jul 14;11(1):3514. doi: 10.1038/s41467-020-17041-7. PMID: 32665548; PMCID: PMC7360769.

ANCOM Differential Abundance Analysis

ANCOM Results for Individual Comparisons

Comparison No.	Comparison Name
Comparison 1.	SUPA vs SUPB vs OM vs ZM
Comparison 2.	T vs F
Comparison 3.	Masterpure vs PowerSoil vs Zymo

ANCOM-BC2 Differential Abundance Analysis

Starting with version V1.2, we include the results of ANCOM-BC (Analysis of Compositions of Microbiomes with Bias Correction) (Lin and Peddada 2020). ANCOM-BC is an updated version of "ANCOM" that:
(a) provides statistically valid test with appropriate p-values,
(b) provides confidence intervals for differential abundance of each taxon,
(c) controls the False Discovery Rate (FDR),
(d) maintains adequate power, and
(e) is computationally simple to implement.

The bias correction (BC) addresses a challenging problem of the bias introduced by differences in the sampling fractions across samples. This bias has been a major hurdle in performing DA analysis of microbiome data. ANCOM-BC estimates the unknown sampling fractions and corrects the bias induced by their differences among samples. The absolute abundance data are modeled using a linear regression framework.

Starting with version V1.43, ANCOM-BC2 is used instead of ANCOM-BC, So that multiple pairwise directional test can be performed (if there are more than two gorups in a comparison). When performing pairwise directional test, the mixed directional false discover rate (mdFDR) is taken into account. The mdFDR is the combination of false discovery rate due to multiple testing, multiple pairwise comparisons, and directional tests within each pairwise comparison. The mdFDR is adopted from (Guo, Sarkar, and Peddada 2010; Grandhi, Guo, and Peddada 2016). For more detail explanation and additional features of ANCOM-BC2 please see author's documentation.

References:

Lin H, Peddada SD. Analysis of compositions of microbiomes with bias correction. Nat Commun. 2020 Jul 14;11(1):3514. doi: 10.1038/s41467-020-17041-7. PMID: 32665548; PMCID: PMC7360769.

Guo W, Sarkar SK, Peddada SD. Controlling false discoveries in multidimensional directional decisions, with applications to gene expression data on ordered categories. Biometrics. 2010 Jun;66(2):485-92. doi: 10.1111/j.1541-0420.2009.01292.x. Epub 2009 Jul 23. PMID: 19645703; PMCID: PMC2895927.

Grandhi A, Guo W, Peddada SD. A multiple testing procedure for multi-dimensional pairwise comparisons with application to gene expression studies. BMC Bioinformatics. 2016 Feb 25;17:104. doi: 10.1186/s12859-016-0937-5. PMID: 26917217; PMCID: PMC4768411.

ANCOM-BC Results for Individual Comparisons

Comparison No.	Comparison Name
Comparison 1.	SUPA vs SUPB vs OM vs ZM
Comparison 2.	T vs F
Comparison 3.	Masterpure vs PowerSoil vs Zymo

LEfSe - Linear Discriminant Analysis Effect Size

LEfSe (Linear Discriminant Analysis Effect Size) is an alternative method to find "organisms, genes, or pathways that consistently explain the differences between two or more microbial communities" (Segata et al., 2011). Specifically, LEfSe uses rank-based Kruskal-Wallis (KW) sum-rank test to detect features with significant differential (relative) abundance with respect to the class of interest. Since it is rank-based, instead of proportional based, the differential species identified among the comparison groups is less biased (than percent abundance based).

Reference:

Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C. Metagenomic biomarker discovery and explanation. Genome Biol. 2011 Jun 24;12(6):R60. doi: 10.1186/gb-2011-12-6-r60. PMID: 21702898; PMCID: PMC3218848.

SUPA vs SUPB vs OM vs ZM

LEfSe Results for All Comparisons

Comparison No.	Comparison Name
Comparison 1.	SUPA vs SUPB vs OM vs ZM
Comparison 2.	T vs F
Comparison 3.	Masterpure vs PowerSoil vs Zymo

XI. Analysis - Heatmap Profile

Species vs Sample Abundance Heatmap for All Samples

Heatmaps for Individual Comparisons

A) Two-way clustering - clustered on both columns (Samples) and rows (organism)

Comparison No.	Comparison Name	Family Level		Genus Level		Species Level
Comparison 1	SUPA vs SUPB vs OM vs ZM	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 2	T vs F	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	PDF	SVG	PDF	SVG	PDF	SVG

B) One-way clustering - clustered on rows (organism) only

Comparison No.	Comparison Name	Family Level		Genus Level		Species Level
Comparison 1	SUPA vs SUPB vs OM vs ZM	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 2	T vs F	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	PDF	SVG	PDF	SVG	PDF	SVG

C) No clustering

Comparison No.	Comparison Name	Family Level		Genus Level		Species Level
Comparison 1	SUPA vs SUPB vs OM vs ZM	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 2	T vs F	PDF	SVG	PDF	SVG	PDF	SVG
Comparison 3	Masterpure vs PowerSoil vs Zymo	PDF	SVG	PDF	SVG	PDF	SVG

XII. Analysis - Network Association

To analyze the co-occurrence or co-exclusion between microbial species among different samples, network correlation analysis tools are usually used for this purpose. However, microbiome count data are compositional. If count data are normalized to the total number of counts in the sample, the data become not independent and traditional statistical metrics (e.g., correlation) for the detection of specie-species relationships can lead to spurious results. In addition, sequencing-based studies typically measure hundreds of OTUs (species) on few samples; thus, inference of OTU-OTU association networks is severely under-powered. Here we use SPIEC-EASI (SParse InversE Covariance Estimation for Ecological Association Inference), a statistical method for the inference of microbial ecological networks from amplicon sequencing datasets that addresses both of these issues (Kurtz et al., 2015). SPIEC-EASI combines data transformations developed for compositional data analysis with a graphical model inference framework that assumes the underlying ecological association network is sparse. SPIEC-EASI provides two algorithms for network inferencing – 1) Meinshausen-Bühlmann's neighborhood selection (MB method) and inverse covariance selection (GLASSO method, i.e., graphical least absolute shrinkage and selection operator). This is fundamentally distinct from SparCC, which essentially estimate pairwise correlations. In addition to these two methods, we provide the results of a third method - SparCC (Sparse Correlations for Compositional Data)(Friedman & Alm 2012), which is also a method for inferring correlations from compositional data. SparCC estimates the linear Pearson correlations between the log-transformed components.

References:

Kurtz ZD, Müller CL, Miraldi ER, Littman DR, Blaser MJ, Bonneau RA. Sparse and compositionally robust inference of microbial ecological networks. PLoS Comput Biol. 2015 May 7;11(5):e1004226. doi: 10.1371/journal.pcbi.1004226. PMID: 25950956; PMCID: PMC4423992.

Friedman J, Alm EJ. Inferring correlation networks from genomic survey data. PLoS Comput Biol. 2012;8(9):e1002687. doi: 10.1371/journal.pcbi.1002687. Epub 2012 Sep 20. PMID: 23028285; PMCID: PMC3447976.

SPIEC-EASI Network Inference by Neighborhood Selection (MB Method)

Association Network Inference by SparCC

XIII. Disclaimer

The results of this analysis are for research purpose only. They are not intended to diagnose, treat, cure, or prevent any disease. Forsyth and FOMC are not responsible for use of information provided in this report outside the research area.