Readme for Data files associated with "The impact of mercury selection and conjugative genetic elements on community structure and resistance gene transfer" by Hall, Pärnänen, Harrison, Virta, and Brockhurst.

May 2020

Five files:

SOILCOM_Dynamics_Data.csv

Comma-separated values file showing the population dynamics data presented in Figures 1, S1, and S2. Columns are as follows: 'replicate' indicates replicate (values a, b, c, d, e, f); timepoint indicates transfer number, with 0 indicating the start of the experiment; trt is a two-digit treatment code describing plasmid, mercury, and natural community treatment, with the first digit indicating plasmid status (1 = no pQBR plasmid added, 2 = pQBR57, 3 = pQBR103) and the second digit indicating mercury and natural community status (1 and 4 = 0 µg/g, 2 and 5 = 16 µg/g, 3 and 6= 64 µg/g; 1, 2, 3 = P. fluorescens SBW25 alone; 4, 5, 6 = with natural community); plasmid indicates starting plasmid treatment (values 'plasmid-', 'pQBR57', 'pQBR103'); mercury indicates mercury treatment (added mercury concentration in Hg2+ µg/g); enc indicates addition or not of natural community (values 'alone' or 'with_enc'); type indicates the population being measured (values are 'SBW25_total' = P. fluorescens SBW25; 'SBW25_hgr' = mercury-resistant P. fluorescens SBW25; 'ENC' = natural community; 'ENC_hgr' = mercury-resistant natural community); density is the density in cfu/g soil estimated from plate counts.

Short-read sequencing data

Short-read sequencing data associated with this project is available at the European Nucleotide Archive PRJEB34647. A table linking samples with metadata is provided in the file SOILCOM_Short_Read_Data.csv.

Analysis scripts

Sample analysis scripts in Rmarkdown format are provided at raw .Rmd and formatted .html files in subdirectories in SOILCOM_Analysis_Scripts.zip. Along with the raw data they can reproduce the figures and statistics presented in the manuscript.

Amplicon analysis output for plotting

The following tables represent the output of processing the short-read sequencing data described above. They are provided in a compressed .zip archive, SOILCOM_amplicon_processed_data.zip. Intermediate files used in their production, as well as sample analysis scripts, are provided in SOILCOM_amplicons.zip.

3_faith-pd-group-significance-hg-cat.tsv

Data presented in Figure 3, showing Faith's phylogenetic diversity for the analysed populations.

3supp_evenness-group-significance-hg-cat.tsv

Data presented in Figure S3, showing Pielou's evenness for the analysed populations.

3supp_shannon-group-significance-hg-cat.tsv

Data presented in Figure S3, showing Shannon's H for the analysed populations.

4_unweighted_unifrac_distance-matrix.tsv

Distance matrix used to prepare Figure 4, showing unweighted UniFrac distances between populations. Columns and rows are named using the trt format described above.

4supp_bray_curtis_distance-matrix.tsv

Distance matrix used to prepare Figure S4, showing Bray-Curtis distances between populations. Columns and rows are named using the trt format described above.

4supp_weighted_unifrac_distance-matrix.tsv

Distance matrix used to prepare Figure S4, showing weighted UniFrac distances between populations. Columns and rows are named using the trt format described above.

6_epic-16S-filtered-table_.tsv

Table showing ASV counts for each population that was used to prepare Figure 6. Populations (columns) are named using the trt format described above, rows refer to unique ASV identifiers that can be interpreted taxonomically using 6_epic_taxonomy.tsv.

6_epic_taxonomy.tsv

Taxonomy assignments for ASVs using the GreenGenes database.

6supp_epic_w_16S_bray_curtis_distance-matrix.tsv

Distance matrix used to prepare Figure S6, showing Bray-Curtis distances between populations. Columns and rows are named using the trt format described above.

6supp_epic_w_16S_unweighted_unifrac_distance-matrix.tsv

Distance matrix used to prepare Figure S6, showing unweighted UniFrac distances between populations. Columns and rows are named using the trt format described above.

6supp_epic_w_16S__weighted_unifrac_distance-matrix.tsv

Distance matrix used to prepare Figure S6, showing weighted UniFrac distances between populations. Columns and rows are named using the trt format described above.