Bacteria with functional DNA repair systems are expected to have low mutation rates due to strong natural selection for genomic stability. However, our study of the wild-type Streptococcus pneumoniae D39, a pathogen responsible for many common diseases, revealed a high spontaneous mutation rate of 0.02 per genome per cell division in mutation-accumulation (MA) lines. This rate is orders of magnitude higher than that of other non-mutator bacteria and is characterized by a high mutation bias in the A/T direction. The high mutation rate may have resulted from a reduction in the overall efficiency of selection, conferred by the tiny effective population size in nature. In line with this, S. pneumoniae D39 also exhibited the lowest DNA mismatch-repair (MMR) efficiency among bacteria. Treatment with the antibiotic penicillin did not elevate the mutation rate, as penicillin did not induce DNA damage and S. pneumoniae lacks a stress response pathway. Our findings suggested that the MA results are applicable to within-host scenarios and provide insights into pathogen evolution.
Streptococcus pneumoniae is a human nasopharyngeal pathogen that causes diseases, including strep throat, pneumonia, meningitis, and otitis media, particularly in young children and the elderly. The virulence factors and pathogenesis of this Gram-positive pathogenic bacteria have been extensively studied (Kadioglu et al. 2008; Mitchell and Mitchell 2010; Musher 1992; Van der Poll and Opal 2009). However, information on how the initial nasopharyngeal colonization process influences within-host evolution and future transmission, is currently lacking (Chaguza et al. 2020; Green et al. 2021). Within the hosts, the evolution of this species is driven by multiple selective forces (clonal expansion, transmission pressure, host immunity), which may complicate the evolutionary patterns of this pathogen. To reveal the evolutionary patterns of S. pneumoniae, we examined spontaneous mutational features at the whole-genome level without strong selective drivers. To do so, we conducted multiple mutation accumulation experiments (MA) combined with whole-genome-sequencing (WGS) (Keightley and Halligan 2009; Lee et al. 2012; Lynch et al. 2008).
Cells possess multiple mechanisms to repair DNA damage and thus prevent mutations. However, the repair patterns of canonical repair pathways remain unclear in S. pneumoniae. One such pathway is DNA mismatch repair (MMR), which is composed of a few well-conserved proteins (MutS/MutL/MutH), functioning in the post-replicative repair of mismatched DNA (Jun et al. 2006; Kolodner and Marsischky 1999; Kunkel and Erie 2005; Li 2008; Modrich and Lahue 1996). Other repair pathways, involved with a nucleotide triphosphate hydrolase MutT (named as MutX in S. pneumoniae) and an A/G-specific adenine glycosylase MutY play important roles in repairing DNA oxidative damages (Ames and Gold 1991; Mejean et al. 1994; Nghiem et al. 1988; Tajiri et al. 1995). Considering that S. pneumoniae is catalase-deficient (Pericone et al. 2002), it might be subjected to high mutation pressure. Uncovering the repair patterns of the MMR and the oxidative damage repair pathways (MutX and MutY) can provide insights into the mutation spectrum determinants and the long-term evolution of S. pneumoniae.
During clinical treatments of diseases caused by S. pneumoniae an increasing number of strains have developed resistance to clinically-recommended antibiotics such as the β-lactam penicillin, one of the most widely-used antibiotics for treating S. pneumoniae (Jacobs 1992; Van de Beek et al. 2006; Van der Poll and Opal 2009; Yu et al. 2003). Currently, our knowledge of the mutational processes underlying acquisition of resistance under penicillin exposure is limited for this pathogen. Particularly lacking is information on the influence of penicillin on the mutation rate, as opposed to the ability of natural selection to promote resistance mutations. It has been reported that bacteria are often exposed to sublethal (i.e., subinhibitory) concentrations of certain antibiotics, which can not only promote the emergence and spread of resistant bacteria in humans and animals but may even induce multi-drug resistant mutations, posing a threat to human health (Andersson and Hughes 2014; Baquero et al. 2008; Carmeli et al. 1999; Jiang et al. 2013; Jørgensen et al. 2013; Long et al. 2016; Zurek and Ghosh 2014). De novo mutations are the main source of antibiotic resistance (Woodford and Ellington 2007). Based on above, uncovering the effects of penicillin treatment on genome-wide mutations of S. pneumoniae may provide essential guidance for clinical disease treatment.
In this study, we determined sublethal concentrations of penicillin in vitro on the wild-type S. pneumoniae D39 and three mutant strains constructed from the wild-type: ΔmutS (MMR-deficient), ΔmutX (MutX-deficient) and ΔmutY (MutY-deficient). Then, we applied the MA-WGS strategy to investigate spontaneous mutational features at whole-genome level and the repair patterns of the classical DNA damage repair systems, as well as the effects on genomic mutations upon penicillin treatment. Our results uncovered the spontaneous mutational features of S. pneumoniae in different genetic backgrounds at the whole-genome level, including the rate, molecular spectrum of genomic mutations, and repair patterns of the aforementioned repair systems. This study also provided quantitative insights into the response of this pathogen to penicillin treatment.
Materials and methods
Bacterial strains and growth conditions
Streptococcus pneumoniae strains were derived from the encapsulated serotype-2 D39W progenitor strain IU1690 (Lanie et al. 2007; Slager et al. 2018). For cultures in liquid broth, bacteria were grown statically in Becton-Dickinson brain heart infusion (BHI) broth at 37 ℃ in an atmosphere of 5% CO2, and growth was monitored by OD620 as described before (Tsui et al. 2016). During growth for mutation accumulation (MA) procedure, colonies were grown at 37 ℃, 5% CO2 on brain–heart infusion agar plates (BHI, Oxoid) supplemented with 5% (vol/vol) defibrinated sheep blood. Streaked colonies on BHI agar plates appeared as ~ 1 mm-diameter smooth colonies with a grass–green colored hemolytic ring around them. Stock solutions of penicillin G sodium salt (Cat. No.: P3032-10MU, Sigma–Aldrich) were made according to the manufacturers' instructions. We followed biosafety level 2 procedures as stated in the bioprotocol 15-038 (Institutional Biosafety Committee, Indiana University at Bloomington). Details on construction of deletion mutant strains are in Supplementary Table S1, following previously published procedures (Ramos-Montanez et al. 2008; Tsui et al. 2011, 2014, 2016).
Determining sublethal concentrations of penicillin for S. pneumoniae D39 in vitro
To determine the range of sublethal concentrations of penicillin used for further experiments, we generated survival curves by measuring the efficiency of plating (EOP). These curves were established as follows. Cells were inoculated from frozen glycerol stocks into BHI broth, serially diluted, and incubated 12–15 h statically at 37 ℃ in an atmosphere of 5% CO2. The next day, cultures at OD620 ≈ 0.1–0.4 were diluted to OD620 ≈ 0.005 in BHI broth. They were then plated onto BHI agar plates with gradients of penicillin concentrations: 0, 0.002, 0.004, 0.006, 0.008, 0.01, 0.012, 0.014, 0.016 ng/μL (each group was with four replicates). Plates were maintained at 37 ℃ and 5% CO2 for 24 h, after which colony forming units (CFU) were counted. The EOP was then calculated as m/N, where m is the CFU of the plate containing 0–0.016 ng/μL penicillin, and N is the CFU of the penicillin-free plate.
Mutation accumulation (MA) procedures
To explore the whole-genome mutational features and mutational responses to penicillin treatment, we then performed MA experiments on the four strains (wild-type, ΔmutS, ΔmutX, and ΔmutY). For each strain, to initiate the MA lines, cells from an ancestral colony were plated onto a BHI plate without penicillin. The grown colonies were then streaked onto BHI plates containing 0, 0.002, 0.004, 0.006, and 0.008 ng/μL. Number of MA lines of different strains for each penicillin concentration was not balanced: for the wild-type n = 24; for ΔmutSn = 8; for ΔmutX and ΔmutYn = 16. All MA lines were transferred every 24 h by streaking single colonies.
The MA experiments lasted 45–52 days. The number of cell divisions between transfers was estimated every ~ 3 weeks by cutting single colonies of five randomly-picked lines from agar plates, serially diluted in 1 × PBS to a suitable cell density, plated on penicillin-free plates, and then CFU was determined as outlined above (see Supplementary Table S1). The effective population size (Ne) during MA experiments of each treatment was calculated by taking the harmonic mean of successive doublings from a population size of 1 until the final population size was reached (Wahl and Gerrish 2001).
DNA extraction, library construction, and sequencing
The genomic DNA of the wild-type ancestor strain and all the 320 MA lines were extracted with the Wizard Genomic DNA Purification Kit (Promega). A Nextera DNA Library Preparation Kit (Illumina) was used to construct the genomic DNA libraries with an insert size of 500 bp for Illumina Hiseq 2500 PE250 sequencing; this led to 128 × (SE: 4.13) mean depth of sequencing coverage and 98.85% (SE: 0.23%) of the genome covered with high-quality reads, using the chromosome sequence of the model strain S. pneumoniae D39 (NC_008533.2) as the reference (Table 1; Supplementary Table S2). 18 MA lines with coverage of less than 20 × or cross-contamination were removed (Supplementary Table S3).
Table
1.
MA experimental details
Strain
Penicillin (ng/μL)
N
Transfers
Division
Ne
Depth (×)
Wild-type
0 (WT_A)
23
51
884
9.00
106
Wild-type
0.002 (WT_B)
24
52
884
9.00
110
Wild-type
0.004 (WT_C)
22
52
854
8.50
98
Wild-type
0.006 (WT_D)
22
52
845
8.50
142
Wild-type
0.008 (WT_E)
24
50
829
9.00
133
ΔmutS
0 (mutSA)
7
50
879
9.50
103
ΔmutS
0.002 (mutSB)
7
52
897
9.00
109
ΔmutS
0.004 (mutSC)
8
52
857
8.50
146
ΔmutS
0.006 (mutSD)
8
52
863
9.00
118
ΔmutS
0.008 (mutSE)
8
50
840
9.00
105
ΔmutX
0 (mutXA)
16
47
788
9.00
131
ΔmutX
0.002 (mutXB)
15
47
772
9.00
150
ΔmutX
0.004 (mutXC)
15
48
791
9.00
149
ΔmutX
0.006 (mutXD)
15
47
790
9.00
124
ΔmutX
0.008 (mutXE)
16
45
732
8.50
152
ΔmutY
0 (mutYA)
13
50
840
9.00
129
ΔmutY
0.002 (mutYB)
15
49
756
8.00
130
ΔmutY
0.004 (mutYC)
15
49
811
8.50
163
ΔmutY
0.006 (mutYD)
14
48
817
9.00
142
ΔmutY
0.008 (mutYE)
15
48
742
8.50
122
N, number of replicate MA lines; Transfers, mean number of transfers during the experiment; Division, mean cell divisions in the final analyses; Ne, effective population size of each penicillin-treatment corresponded to a certain concentration; Depth (×), depth of sequencing coverage
After trimming adapters of the raw data by Trimmomatic (v-0.36) (Bolger et al. 2014), the trimmed reads of each MA line were mapped to the reference genome (NCBI Genome accession No.: NC_008533.2) with Burrows-Wheeler Aligner (v-0.7.17) mem (Li and Durbin 2009). The HaplotypeCaller module in Genome Analysis Toolkit (GATK v-4.1.2) was used for calling base-pair substitutions (BPSs) and short indels using GATK's best practices recommendations, and only unique BPSs were considered (DePristo et al. 2011; McKenna et al. 2010). The short sequence reads of MA lines of each group were mapped to the D39 reference genome to identify structural variations in Breseq (v-0.35.1) (Deatherage and Barrick 2014). Validation of BPSs and indels with Integrative Genomics Viewer (IGV v-2.8.2) was also performed (Thorvaldsdóttir et al. 2013). Genome-wide BPS mutation rate μ (per nucleotide site per cell division) was calculated by the following formula:
s∑n1N×T,
where s is the total number of mutations across all MA lines, n is the total number of MA lines, N is the analyzed sites in one MA line, and T is the number of cell divisions passed in the entire mutation accumulation process of one MA line. BPS mutation rate of each gene was also calculated using the above formula, in which s is the total number of mutations carried by a certain gene, n is the total number of MA lines, N is the base number of the gene, and T is the number of cell divisions passed in the entire mutation accumulation process of one MA line.
The repair efficiency was calculated by the following formula:
μmutant−μwildtype μmutant,
where μmutant was the BPS mutation rate of ΔmutS, ΔmutX, or ΔmutY and μwildtype was the BPS mutation rate of the wild-type.
The mutation bias m was calculated by m=μG:C→A:T+G:C→T:A/μA:T→G:C+A:T→C:G, and the transition to transversion ratios (ts/tv; n is the total number of MA lines) with the following formula:
∑n1transitions∑n1transversions.
Calculation of population genetic parameters
To estimate the effective population size (Ne), we downloaded 196 natural strains of S. pneumoniae with different serotypes from the NCBI SRA database (Supplementary Table S4). We used Unicycler (v-0.4.8) (Wick et al. 2017) to assemble the draft genomes of all the natural strains, then blasted their crucial genes related to multiple DNA repair pathways—mutS, mutL, mutM, mutX, and mutY with the gene sequences of S. pneumoniae D39. One strain was then removed due to incomplete crucial genes, as we required all strains to have intact reading frames of these genes, and a total of 195 strains met the conditions. Then we used BWA (v-0.7.12) (Li and Durbin 2009) to align the raw reads of the 195 natural strains to the chromosome of S. pneumoniae D39 (NCBI Genome accession No.: NC_008533.2) and calculated πs, the average pairwise genetic distance at four-fold degenerate sites using the following formula:
πs=nn−1×(1−Σπ2),
where π is the average number of nucleotide differences per site between two DNA sequences in all possible pairs of the strains, n is the number of strains. Ne of S. pneumoniae was calculated using the following formula, where μ is BPS mutation rate per nucleotide site per cell division:
Ne≈πs2μ.
Gene expression analysis based on RNAseq
RNAseq data of S. pneumoniae D39 wild-type strain were retrieved from NCBI SRA with the BioProject Number of PRJNA6952 (SRR13563519–SRR13563522) (Hirschmann et al. 2021). Raw reads of the four parallel samples were trimmed with default parameters of Fastp (v-0.20.1) (Chen et al. 2018), and an index of the genome from the file NC_008533.2 (S. pneumoniae D39) was created with Hisat2 (v-2.2.1) (Kim et al. 2015). Gene expression levels in these samples were analyzed with StringTie (v-2.1.5) (Pertea et al. 2016).
Results
Neutrality tests on the mutation accumulation datasets
Survival curves indicated that penicillin concentrations below 0.008 ng/μL had sublethal effects (Fig. 1A), and mutation accumulation experiments (MA) were performed at this level and below (Table 1; Supplementary Tables S5, S6). We compared the nonsynonymous/synonymous mutation ratio of MA lines of each strain with the random expectation and found no evidence of mutations being biased by selection in the penicillin-free MA lines (P > 0.05, χ2 test; Supplementary Tables S7–S10). Furthermore, the nonsynonymous/synonymous base-pair substitutions (BPSs) ratio in the control and penicillin-treated groups did not differ significantly (χ2 test, P > 0.05 in all cases), indicating that the majority of acquired amino acid-altering BPSs were not selectively promoted by penicillin treatment but simply accumulated in a neutral fashion.
Figure
1.
Efficiency of plating (EOP) and mutation rates. A Efficiency of plating (EOP) of the four strains under nine penicillin concentrations (points are means, error bars denote SE; n = 4). B BPS (base-pair-substitution) and indel (insertion and deletion) mutation rates of the penicillin-free MA lines of the four strains (error bars denote 95% Poisson confidence intervals)
However, we identified three genes in the wild-type control MA lines with more mutations than the random expectation (Supplementary Tables S11, S12): an aminotransferase of the DegT/DnrJ/EryC1/StrS family (gene ID: SPD_RS08630), a transcriptional regulator of the GntR family (SPD_RS00325), and a member of the glycosyltransferase family 1 (SPD_RS01740). No selection was detected in SPD_RS01740 (P = 0.53, exact binomial test for goodness of fit; nonsynonymous/synonymous sites ratio = 3.68, as expected, with the transitions/transversions ratio considered; Supplementary Table S7). However, all 10 and three BPSs, carried by the SPD_RS08630 and SPD_RS00325, respectively, were nonsynonymous. The high proportion of nonsynonymous BPSs in these two genes might indicate strong positive selection acting on them during the MA process (Andreani et al. 2019; Fineran et al. 2005; Hillerich and Westpheling 2006). Therefore, we excluded the BPSs of the two strongly selected genes (SPD_RS08630 and SPD_RS00325) from the mutational feature calculation, although this barely changes any pattern.
The features of S. pneumoniae D39 spontaneous mutations at the whole-genome level
We detected 415 BPSs in the wild-type MA lines without penicillin treatment, resulting in a mean BPS mutation rate of 1.01 × 10–8 (CI: 9.13 × 10–9, 1.11 × 10–8) per nucleotide site per cell division, or ~ 0.02 per genome per cell division (Table 2; Fig. 1B; Supplementary Tables S2, S13, S14). Additionally, we found 35 indels (9 insertions and 26 deletions), with an insertion/deletion ratio of 0.35, yielding a mean small-scale indel rate of 8.50 × 10–10 (CI: 5.92 × 10–10, 1.18 × 10–9) per nucleotide site per cell division, which represented 8.43% of the BPS mutation rate (Table 2; Fig. 1B). The ratio of insertions/deletions indicates a deletion bias, which was consistent with most studied bacteria (Long et al. 2018a). We observed 10 indels within simple sequence repeats (SSR), which were tandem repeats of short DNA nucleotides, such as homopolymer runs and microsatellite repeats. These were usually generated by unequal crossover or polymerase slippage during replication of SSRs (Levinson and Gutman 1987).
Table
2.
Mutation information
Strain
Penicillin (ng/μL)
BPSs
μBPS
ts/tv
AT bias
Indels
μIndel
Ins/Del
Wild type
0 (WT_A)
415
1.01
1.10
7.02
35
0.09
0.35
Wild type
0.002 (WT_B)
388
0.90
1.85
7.30
49
0.11
0.48
Wild type
0.004 (WT_C)
241
0.63
1.70
5.27
24
0.06
0.50
Wild type
0.006 (WT_D)
257
0.68
2.13
4.60
31
0.08
0.41
Wild type
0.008 (WT_E)
209
0.52
1.87
5.44
25
0.06
0.79
ΔmutS
0 (mutSA)
407
3.27
4.48
2.66
72
0.58
0.76
ΔmutS
0.002 (mutSB)
472
3.72
6.46
2.28
90
0.71
0.91
ΔmutS
0.004 (mutSC)
461
3.32
7.23
1.96
77
0.56
1.14
ΔmutS
0.006 (mutSD)
368
2.63
9.49
2.57
73
0.52
0.83
ΔmutS
0.008 (mutSE)
419
3.09
8.84
1.99
73
0.54
0.78
ΔmutX
0 (mutXA)
818
3.21
0.26
0.44
19
0.07
0.46
ΔmutX
0.002 (mutXB)
773
3.30
0.20
0.49
13
0.06
0.44
ΔmutX
0.004 (mutXC)
661
2.76
0.25
0.46
16
0.07
0.60
ΔmutX
0.006 (mutXD)
593
2.48
0.23
0.60
10
0.04
0.25
ΔmutX
0.008 (mutXE)
463
1.96
0.22
0.36
7
0.03
6.00
ΔmutY
0 (mutYA)
831
3.77
0.10
68.21
16
0.07
0.14
ΔmutY
0.002 (mutYB)
1056
4.61
0.12
51.50
13
0.06
0.86
ΔmutY
0.004 (mutYC)
1020
4.15
0.10
79.88
13
0.05
0.30
ΔmutY
0.006 (mutYD)
872
3.78
0.11
43.82
24
0.10
0.50
ΔmutY
0.008 (mutYE)
727
3.23
0.11
38.76
19
0.08
0.27
BPSs, total number of base-pair substitutions in the MA lines; μBPS, base-pair substitution mutation rate; ts/tv, the ratio of transitions/transversions; AT bias, mutation bias in the A/T direction, (see Materials and methods for details); Indels, number of insertions and deletions; μIndel, indel mutation rate; Ins/Del, the ratio of insertions/deletions. All mutation rates are in units of × 10–8 per nucleotide site per cell division
The within-line rates of different mutation types exhibited significant differences (ANOVA, F = 54, P < 0.0001). Specifically, the mutation rates of G: C→A: T transitions and G: C→T: A transversions were significantly higher than other types of mutations (Tukey test, P < 0.0001; Fig. 2A, B; Supplementary Table S15). The ratio of transitions/transversions (ts/tv) was 1.10, resulting in an extreme mutation bias in the A/T direction of 7.02 (mutation bias in the A/T direction was calculated as: m=μG:C→A:T+G:C→T:A/μA:T→G:C+A:T→C:G; Table 2). The mutation spectrum was characterized by a high G: C→T: A transversion rate and a relatively low A: T→G: C transition rate.
Figure
2.
Mutational features of penicillin-free MA lines of the four strains. A Summary of genome-wide distribution of mutations (plotted by TBtools and R package 'circlize', Chen et al. 2020; Gu et al. 2014). Genome coordinates are in kbp. Circles from the outside inward: the gene density per 10-kbp bin (the whole genome is divided into 205 bins; the legend is on the left); the histogram in purple is the gene expression levels in fragments per kilobase million (FPKM) per 10-kbp bin, and the orange dots represent mutation rate per nucleotide site per cell division of each gene of the wild-type (the lowest: 6.43 × 10–9; the highest: 4.01 × 10–7); the next four inner circles with colored tiles represent the genomic location of BPSs of wild-type, ΔmutS, ΔmutX and ΔmutY MA lines respectively (the legend is on the right). B Mutation spectra of penicillin-free MA lines of the four strains. Error bars denote 95% Poisson confidence intervals
Specificity of MMR and oxidative damage repair systems
To quantify the repair specificity of the major DNA repair systems in S. pneumoniae, we compared the mutational features of MMR-deficient ΔmutS and two oxidative-damage-repair deficient strains ΔmutX and ΔmutY with those of the wild-type (Table 1). The genome-wide BPS mutation rate of the ΔmutS strain, estimated from the MA lines, was 3.27 × 10–8 (CI: 2.96 × 10–8, 3.60 × 10–8) per nucleotide site per cell division, which was 3.24 × higher than that of the wild-type. This indicated that MMR repairs 69.17% of genome-wide pre-mutations derived from mismatched nucleotides (Table 2; Fig. 1B; Supplementary Tables S2, S13, S16). In addition, the small-scale indel mutation rate of the ΔmutS strain was 5.78 × 10–9 (CI: 4.52 × 10–9, 7.28 × 10–9) per nucleotide site per cell division, which was a 6.80 × elevation from the wild-type. The insertions/deletions ratio was 0.76 (vs. 0.35 in the wild-type). We then calculated the repair efficiency of MMR on indels, insertions, and deletions, yielding values of 85.3%, 91.2% and 80.8%, respectively (Table 2; Fig. 1B; Supplementary Tables S2, S13, S17). These results indicated a preference for MMR in repairing insertions.
The MA results showed that the genomic BPS mutation rates of the oxidative-damage-repair deficient strains ΔmutX and ΔmutY were 3.21 × 10–8 (CI: 2.99 × 10–8, 3.44 × 10–8) and 3.77 × 10–8 (CI: 3.52 × 10–8, 4.03 × 10–8) per nucleotide site per cell division respectively, which were 3.18 and 3.74 × higher than that of the wild-type. These resulted in the overall repair efficiencies of 68.6% and 73.3% by MutX and MutY, respectively (Table 2; Fig. 1B; Supplementary Tables S2, S13, S16). The small-scale indel mutation rates of the ΔmutX and ΔmutY strains were 7.45 × 10–10 (CI: 4.49 × 10–10, 1.16 × 10–9) and 7.26 × 10–10 (CI: 4.15 × 10–10, 1.18 × 10–9) per nucleotide site per cell division, which were similar to that of the wild-type, reflecting that oxidative-damage-repair systems in S. pneumoniae did not repair indels (Table 2; Fig. 1B; Supplementary Tables S2, S13, S17).
The mutation spectrum of the ΔmutS strain was typical of hypermutator bacteria with dysfunctional MMR, with significantly increased transitions, while transversions other than A: T→T: A were not (Mann Whitney test, G: C→A: T, P < 0.0001; A: T→G: C, P < 0.0001; A: T→T: A, P = 0.0027; Fig. 2A, B; Supplementary Table S15). 73.03% G: C→A: T and 91.98% A: T→G: C transitions were repaired by MMR, and 76.48% for A: T→T: A transversions (Supplementary Table S17). The inflated transition rates led to a transition/transversion ratio of 4.48 ×, higher than that of the wild-type, reflecting that MMR of S. pneumoniae preferentially repairs transitions. The mutation bias of the ΔmutS MA lines in the A/T direction was 2.66, which was much weaker than the 7.02 of the wild-type (Table 2).
Compared to the wild-type, A: T→C: G transversions are significantly increased in the ΔmutX MA lines (ANOVA, F = 171, P < 0.001; Tukey test, P < 0.0001), leading to ~ threefold increase in genome-wide mutation rate, a ts/tv ratio of 0.19, and mutations biased in the G/C direction (m = 0.44; Table 2; Fig. 2A, B). The repair efficiency of MutX for A: T→C: G transversions was 96.7% (Supplementary Table S17). In the ΔmutY MA lines, G: C→T: A transversions were the most significantly elevated mutation type (ANOVA, F = 118, P < 0.001; Tukey test, P < 0.0001), yielding a ts/tv of 0.11 and an extreme A/T mutation bias of ~ 68. The repair efficiency for G: C→T: A transversions was 90.52%, supporting that MutY specifically repairs G: C→T: A transversions (vs. 98.65% of E. coli; Table 2; Fig. 2A, B; Supplementary Table S17).
Penicillin treatment caused various effects on mutational features of different strains
In the wild-type MA lines, linear regression analysis revealed a weak negative correlation between the BPS mutation rates and the penicillin concentration (R2 = 0.19, P < 0.0001; Figs. 3A, B, 4A; Supplementary Fig. S1A), so did the ΔmutX (R2 = 0.20, P < 0.0001; Fig. 4A; Supplementary Figs. S1C, S2A, B) and the ΔmutY MA lines (R2 = 0.06, P = 0.04; Fig. 4A; Supplementary Figs. S1D, S2C, D). By contrast, there was no significant linear relationship between mutation rates and penicillin doses in the ΔmutS MA lines [(R2 = 0.09, P = 0.06; Figs. 3C, D, 4A; Supplementary Fig. S1B]. In addition, in all of the four strains, penicillin treatment did not change the rate of small-indels, the genome-wide distribution, or the mutation spectrum (Fig. 3B, D; Supplementary Figs. S2B, D, S3, S4).
Figure
3.
Mutational responses to penicillin treatment of the wild-type and the ΔmutS MA lines. A BPS and indel mutation rates treated with different penicillin concentrations of the wild-type MA lines. B The distribution of the BPS mutations on the whole genome of the wild-type MA lines. C BPS and indel mutation rates per nucleotide site per cell division treated with different penicillin concentrations of the ΔmutS MA lines. D The distribution of the BPS mutations on the whole genome of the ΔmutS MA lines. Dots and the triangles represent BPS and indel mutation rates of each MA line; error bars denote SE (A, C). Genome coordinates are in kbp; circles with colored tiles from outside to inside represent BPSs for 0, 0.002, 0.004, 0.006 and 0.008 ng/μL-penicillin-treated MA lines (B, D)
Figure
4.
Statistical analyses of the BPS mutation rates (A) and repair efficiencies (B) in response to penicillin does. A Regression analysis of the BPS mutation rates against variable penicillin concentration across the four strains using linear models. Each line represents the linear regression fit for each strain, with shaded areas indicating 95% confidence intervals: y = (− 6.02 × 10−7)x + (9.91 × 10−9), R2 = 0.19, P < 0.0001, for wild-type (WT); y = (− 9.21 × 10−7)x + (3.54 × 10−8), R2 = 0.09, P = 0.06, for ΔmutS; y = (− 1.67 × 10−6)x + (3.40 × 10−8), R2 = 0.20, P < 0.0001, for ΔmutX; y = (− 1.03 × 10−6)x + (4.33 × 10−8), R2 = 0.06, P = 0.04, for ΔmutY. B Repair efficiencies of the three pathways treated with variable penicillin concentration, error bars denote SE of a ratio, which was calculated by equation A1.19b on page 818 (Lynch and Walsh 1998), assuming no covariance between mutation rates of the wild-type and the knockout strains
In addition to analyzing BPSs and indels, we calculated the rates of structure variations (SVs). We detected in total 16 de novo SVs—all large-scale deletions—in all of the wild-type MA lines: 4, 4, 2, 2, 4 SVs for the groups treated with 0–0.008 ng/μL penicillin, using Breseq for SV-analysis and IGV for visual verification (Supplementary Tables S18, S19). No statistically significant linear relationship occurred between the SV rate and penicillin concentration (R2 = 0.06, P = 0.7). This pattern also extended to the other three strains (ΔmutS: R2 = 0.68, P = 0.08; ΔmutX: R2 = 0.0002, P = 0.98; ΔmutY: R2 = 0.17, P = 0.48; Supplementary Table S18).
We further explored whether penicillin treatment affected the efficiency of DNA repair systems. We found that there was no significant correlation observed between repair efficiency and penicillin doses (MMR repair efficiency: R2 = 0.56, P = 0.14; oxidative damage repair efficiency: ΔmutX, R2 = 0.26, P = 0.38; ΔmutY, R2 = 0.63, P = 0.11; Fig. 4B; Supplementary Table S16).
Context-dependence of the wild-type mutation rate
As penicillin treatment did not alter BPS mutation spectra (Supplementary Fig. S4), we combined all 1510 BPSs from the control and penicillin-treated wild-type MA lines. This pooling strategy was used to enhance the statistical power of the analysis on the context-dependence of the mutation rate. The substantial number of mutations observed in the wild-type and those in the ΔmutS MA lines thus provided an opportunity to investigate the context-dependence of the mutation rate before and after the MMR repair.
In the MMR-deficient (ΔmutS) lines, the context-dependent mutation rates of nucleotides flanked by G/C indicated higher mutation rates compared to contexts lacking G/C flanking (Fig. 5A; Supplementary Table S20). Specifically, among all the 64 tri-nucleotide contexts (ranked from high to low), the top 10 contexts with the highest mutation rates were G/C-flanked. While in the wild-type lines, nine out of the top 10 contexts were G/C-flanked, except for 5′-T[G→N]T-3′, which ranked as the second highest. Additionally, the specific order of each context differed from that of the ΔmutS MA lines (where N represents any nucleotide) (Fig. 5B). These findings demonstrated that the context-dependence of mutation rates for focal nucleotides was maintained even after pre-mutations were repaired by MMR. However, the particular pattern was altered due to the differential context effects of MMR (with 79.2% repairing efficiency for contexts containing at least one G/C vs. 66.7% without any G/C; one-sided t test, P = 0.0026). Although flanking G/C nucleotides could elevate both the mutation rate and the repair efficiency, there was no association between the context-dependent mutation rate and repair efficiency of a focal nucleotide before MMR (i.e., in the ΔmutS lines) (R2 = − 0.0033, P = 0.38). Also, MMR preferentially repaired A/T focal nucleotides (with an 84.5% repair efficiency) compared to G/C ones (67.6%) (Fig. 5C; one-sided t test, P = 6.01 × 10−7), thus contributing to the genome-wide A/T mutation bias observed in the wild-type lines.
Figure
5.
Context-dependence of the wild-type and the ΔmutS MA lines. The heatmaps show the mutation rate of each nucleotide context (the bases on the bottom are focal bases, the 5′ flanking nucleotides and the 3′ flanking nucleotides are on the left and right respectively). A The context-dependent mutation rate of the ΔmutS MA lines. B The context-dependent mutation rate of the wild-type MA lines. C The context-dependent MMR repair-efficiency; RE on the scale is the repair efficiency
In this study, we explored the genomic mutational features of the Gram-positive pathogen S. pneumoniae D39. Our MA-WGS analysis revealed that it has the highest mutation rates of BPSs and indels among all studied bacteria with functional DNA repair systems. When cultured on non-stressing rich media and assessed using MA procedures, the mutation rate of S. pneumoniae D39 wild-type MA lines was estimated to be 0.02 per genome per cell division. This rate is orders of magnitude higher than those observed in all other non-mutator bacteria. Among previous studies on bacterial with functional MMR, spontaneous mutation rates have been estimated to range from 0.0003 to 0.0045 per genome per cell division. The highest rate was observed in the cold-adapted marine bacterium Colwellia psychrerythraea, while the lowest was documented in the insect-killing Photorhabdus luminescens (Supplementary Table S14) (Dettman et al. 2016; Dillon et al. 2015, 2017; Lee et al. 2012; Long et al. 2015, 2016, 2018a, b; Pan et al. 2021, 2022; Sung et al. 2012, 2015).
Based on the single nucleotide polymorphism sites (SNPs) of 195 natural strains (four–fold degenerate sites: 227, 481; SNPs at these sites: 37, 047), we calculated πs (nucleotide diversity at silent sites or four-fold degenerate sites) of S. pneumoniae to be 0.023. The high πs is supported, at least partially, by the multiple subpopulations existing in different parts of the host nasal cavity, such as the outer surface and the nasal tissues (Briles et al. 2005; Green et al. 2021). For haploid organisms, in mutation-drift equilibrium at neutrally evolving sites, the effective population size (Ne) ≈ πs/2μ (Lynch et al. 2016; Sung et al. 2012). According to the above formula, we calculated Ne of S. pneumoniae to be 1.17 × 106, one to two orders of magnitude lower than the Ne of other bacteria studied (Pan et al. 2022; Sung et al. 2012). The remarkably high mutation rate and the low Ne align well with the predictions of the drift-barrier hypothesis (DBH), as that a small Ne can be conductive to the evolution of high mutation rate according to DBH (Lynch 2010; Lynch et al. 2016; Sung et al. 2012). Besides the above, the natural competence of S. pneumoniae in transformation could contribute to the high mutation rate, potentially through frequent recombination events during colonization and infection within the host.
Our MA-WGS analysis revealed a distinct mutation spectrum for S. pneumoniae D39, deviating from those observed in most other non-pathogenic prokaryotes. The dominant types of mutations identified were G: C→A: T transitions and G: C→T: A transversions (Fig. 2B; Supplementary Fig. S4) (Dettman et al. 2016; Dillon et al. 2015; Kucukyildirim et al. 2016; Long et al. 2015, 2018a; Sung et al. 2015). G: C→T: A transversions can be predominantly caused by the oxidation of guanines in the DNA template strand, resulting from 8-oxo-G: A mismatches (Michaels et al. 1992). This pattern aligned with the fact that S. pneumoniae secretes a large amount of reactive oxygen species (ROSs) during vegetative growth and is catalase-deficient (Pericone et al. 2002). These distinctive mutational features highlight the significant influence of the idiosyncratic biology of S. pneumoniae on the spectrum of spontaneous mutations. Based on this observation, we hypothesize that guanines in the DNA template strand are more susceptible to oxidation compared to those in the nucleotide substrate pool. However, further exploration is required to support this hypothesis.
Multiple studies conducted in model organisms have established that mutation rates of individual nucleotides in DNA strands are influenced by their surrounding nucleotide context. Specifically, a higher G/C composition correlates with an increased mutation rate (Blake et al. 1992; Hess et al. 1994; Lee et al. 2012; Morton 2003; Pan et al. 2022). However, previous explorations have primarily focused on BPSs of mutator strains deficient in MMR, which have mutation rates orders of magnitude higher than those of the wild-type strains. This is because the number of de novo BPSs in wild-type MA lines was low and lacked the statistical power required for analyzing context-dependent mutation rates. The repair efficiency of MMR is also known to be context dependent, with a higher efficiency at nucleotides flanked by a higher G/C composition (Jones et al. 1987; Long et al. 2018a). As a result, it remains unclear whether the context-dependence of mutation rates persists after pre-mutations are repaired by the context-dependent MMR. In this study, we uncovered that among all the 64 tri-nucleotide contexts of the wild-type MA lines (ranked from high to low), nine out of the top 10 contexts were G/C-flanked, except for 5′-T[G→N]T-3′, which ranks as the second highest. These findings demonstrated that the context-dependence of mutation rates for focal nucleotides was maintained even after pre-mutations are repaired by MMR (Fig. 5A, B; Supplementary Table S20).
Compared with the BPS mutation rate of the wild-type S. pneumoniae, there was only ~ 3 times elevation in the BPS mutation rate of the ΔmutS, viz. MMR repairs 69.17% of pre-mutations at whole genome level. The genome-wide repair efficiency of MMR in S. pneumoniae was close to that observed in the Gram-positive bacterium Deinococcus radiodurans. In D. radiodurans, MMR showed an overall repair efficiency of 72.94%, with 75.33% of G: C→A: T and 87.79% of A: T→G: C transitions repaired at the whole genome level (Table 2; Fig. 1B; Supplementary Tables S2, S13, S16; Long et al. 2018a). The repair efficiency of MMR in S. pneumoniae was the lowest compared to all other bacteria studied with MA experiments. This provided a strong biological basis for the exceptionally high mutation rate in the wild-type strain. The function of MMR in most bacteria has undergone extensive refinement through natural selection, as MMR-deficient strains are known to be transient (Longerich et al. 1995). While natural selection primarily operates at the whole-bacterium level, it is possible that the cumulative fitness effects of mutations resulting from low MMR efficiency could be counteracted by multiple layers of other DNA repair mechanisms (Lynch 2012). For example, these may include highly efficient oxidative damage repair systems, akin to those observed in D. radiodurans (Krisko and Radman 2013; Long et al. 2015).
Given the natural absence of catalase in S. pneumoniae, it is reasonable to assert that pathways linked to MutX and MutY, responsible for fixing ROS-caused oxidative damages, play crucial roles in repairing mutations originating from oxidized guanines present in the cellular nucleotide pool or the DNA strands (Mejean et al. 1994; Michaels et al. 1992; Tajiri et al. 1995). Indeed, repair efficiencies were high in specific mutation categories. MutX demonstrated a robust repair efficiency of 96.7% for A: T→C: G transversions, mutations typically caused by oxidized guanines in the cellular nucleotide pool. Similarly, MutY had a high repair efficiency of 90.52% for G: C→T: A transversions, mutations typically arising from oxidized guanines in the DNA strands. It has been reported that when infecting hosts, S. pneumoniae can utilize catalase from co-existing pathogens to reduce its cellular ROS levels, and secrete ROS to inhibit the surrounding pathogens, providing a competitive advantage for its growth (Bogaert et al. 2004; Dahiya and Speck 1968; Pericone et al. 2000). The high levels of secreted ROSs can also assist in host invasion by destroying specific barriers or tissue cells (Weiser et al. 2018; Yesilkaya et al. 2013), suggesting that the absence of catalase in this pathogen is a double-edged sword rather than being entirely harmful. Therefore, in addition to mutation pressure arising from mismatches during DNA replication, it is plausible that S. pneumoniae may experience additional mutation pressure due to oxidative DNA damages.
Penicillin has been used to fight S. pneumoniae infections for ~ 80 years, but penicillin-resistant strains have now spread globally (Mandell et al. 2007; Watson et al. 1993; Yu et al. 2003). Resistance can arise through spontaneous mutations, including point mutations and horizontally transferred genes, or induced mutations caused by physical or chemical mutagens (Munita and Arias 2016; Zhang et al. 2023). In this study, we found that penicillin treatment did not elevate the mutation rates of S. pneumoniae at any mutation scale. Our previous research demonstrated that treatment with the fluoroquinolone antibiotic norfloxacin can increase genomic mutation rates in E. coli by inducing the SOS stress response. This SOS response is triggered by DNA breaks resulting from norfloxacin inhibiting DNA gyrase, leading to the introduction of more mutations by the low-fidelity DNA polymerases in treated cells compared to untreated ones (Long et al. 2016; Napolitano et al. 2000). As that penicillin does not cause DNA damages directly and S. pneumoniae naturally lacks the SOS response pathway, it supports the stress-induced mutagenesis hypothesis from a new perspective, suggesting that the mutation rate is not elevated by antibiotic stress in the absence of the SOS response pathway (Gasc et al. 1980; Napolitano et al. 2000; Sauvage et al. 2008; Waxman and Strominger 1983). Stress-induced mutagenesis may, therefore, be limited to agents specifically activating the SOS response pathway by inducing DNA damage, rather than being extended to general stress conditions.
In comparison to other model bacteria, such as E. coli, S. pneumoniae demonstrates a greater specialization in habitats and presents higher costs and labor/lab requirement for conducting long-term and large-scale evolution experiments due to its classification as a biosafety level 2 pathogen. Nevertheless, this study paves the way for further exploration of the evolutionary trajectory of S. pneumoniae, which is listed as one of the 12 priority pathogens by the World Health Organization (Asokan et al. 2019). This bacterium is responsible for multiple diseases and demonstrates a propensity for developing multidrug resistance (Weiser et al. 2018). The two orders of magnitude higher genomic mutation rate observed in this MMR-functional pathogen, compared to the mutation rates of most reported bacteria, presents a valuable opportunity for testing hypotheses regarding mutation rate evolution.
This study was financially supported by Laoshan Laboratory (LSKJ202203203), the National Natural Science Foundation of China (31961123002, 32270435), the Fundamental Research Funds for the Central Universities of China (202041001), the Shandong Provincial Natural Science Foundation (ZR2023QC191), the Postdoctoral Fellowship Program of CPSF under Grant Number GZC20232504, the Multidisciplinary University Research Initiative Award from the US Army Research Office (W911NF-09-1-0444), and National Institutes of Health award (R35-GM122566 to ML) and (R35-GM131767 to MEW). We thank Wei Yang and Kun Wu for technical help. All bioinformatic analyses were performed with IEMB-1 computation clusters at OUC.
Author contributions
HL and ML conceived the study. HL, CER, CT and HTT performed experiments. WJ, TL, JP, and YW analyzed data. WJ and HL wrote the manuscript with input from all other co-authors.
Data availability
All sequencing data in this research are available at NCBI SRA with the BioProject number of PRJNA781759.
Declarations
Conflict of interest
The authors declare no competing interests. Author Hongan Long is one of the Editorial Board Members, but he was not involved in the journal's review of, or decision related to, this manuscript.
Animal and human rights statement
Not applicable.
Open Access
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Ames BN, Gold LS (1991) Endogenous mutagens and the causes of aging and cancer. Mutat Res 250: 3–16 doi: 10.1016/0027-5107(91)90157-J
Andersson DI, Hughes D (2014) Microbiological effects of sublethal levels of antibiotics. Nat Rev Microbiol 12: 465–478 doi: 10.1038/nrmicro3270
Andreani NA, Carraro L, Zhang L, Vos M, Cardazzo B (2019) Transposon mutagenesis in Pseudomonas fluorescens reveals genes involved in blue pigment production and antioxidant protection. Food Microbiol 82: 497–503 doi: 10.1016/j.fm.2019.03.028
Asokan GV, Ramadhan T, Ahmed E, Sanad H (2019) WHO global priority pathogens list: a bibliometric analysis of medline-pubmed for knowledge mobilization to infection prevention and control practices in Bahrain. Oman Med J 34: 184–193 doi: 10.5001/omj.2019.37
Baquero F, Martínez J-L, Cantón R (2008) Antibiotics and antibiotic resistance in water environments. Curr Opin Biotechnol 19: 260–265 doi: 10.1016/j.copbio.2008.05.006
Blake R, Hess ST, Nicholson-Tuell J (1992) The influence of nearest neighbors on the rate and pattern of spontaneous point mutations. J Mol Evol 34: 189–200 doi: 10.1007/BF00162968
Bogaert D, de Groot R, Hermans PWM (2004) Streptococcus pneumoniae colonisation: the key to pneumococcal disease. Lancet Infect Dis 4: 144–154 doi: 10.1016/S1473-3099(04)00938-7
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30: 2114–2120 doi: 10.1093/bioinformatics/btu170
Briles DE, Novak L, Hotomi M, van Ginkel FW, King J (2005) Nasal colonization with Streptococcus pneumoniae includes subpopulations of surface and invasive pneumococci. Infect Immun 73: 6945–6951 doi: 10.1128/IAI.73.10.6945-6951.2005
Carmeli Y, Troillet N, Karchmer AW, Samore MH (1999) Health and economic outcomes of antibiotic resistance in Pseudomonas aeruginosa. Arch Intern Med 159: 1127–1132 doi: 10.1001/archinte.159.10.1127
Chaguza C, Senghore M, Bojang E, Gladstone RA, Lo SW, Tientcheu P-E, Bancroft RE, Worwui A, Foster-Nyarko E, Ceesay F (2020) Within-host microevolution of Streptococcus pneumoniae is rapid and adaptive during natural colonisation. Nat Commun 11: 3442 doi: 10.1038/s41467-020-17327-w
Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, Xia R (2020) TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant 13: 1194–1202 doi: 10.1016/j.molp.2020.06.009
Chen S, Zhou Y, Chen Y, Gu J (2018) Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34: i884–i890 doi: 10.1093/bioinformatics/bty560
Dahiya R, Speck M (1968) Hydrogen peroxide formation by lactobacilli and its effect on Staphylococcus aureus. J Dairy Sci 51: 1568–1572 doi: 10.3168/jds.S0022-0302(68)87232-7
Deatherage DE, Barrick JE (2014) Identification of mutations in laboratory-evolved microbes from next-generation sequencing data using breseq. In: Sun L, Shou W (eds) Engineering and analyzing multicellular systems, vol 1151. Humana Press, New York, pp 165–188
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, Del Angel G, Rivas MA, Hanna M (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43: 491–498 doi: 10.1038/ng.806
Dettman JR, Sztepanacz JL, Kassen R (2016) The properties of spontaneous mutations in the opportunistic pathogen Pseudomonas aeruginosa. BMC Genom 17: 27 doi: 10.1186/s12864-015-2244-3
Dillon MM, Sung W, Lynch M, Cooper VS (2015) The rate and molecular spectrum of spontaneous mutations in the GC-rich multichromosome genome of Burkholderia cenocepacia. Genetics 200: 935–946 doi: 10.1534/genetics.115.176834
Dillon MM, Sung W, Sebra R, Lynch M, Cooper VS (2017) Genome-wide biases in the rate and molecular spectrum of spontaneous mutations in Vibrio cholerae and Vibrio fischeri. Mol Biol Evol 34: 93–109 doi: 10.1093/molbev/msw224
Fineran PC, Everson L, Slater H, Salmond GPC (2005) A GntR family transcriptional regulator (PigT) controls gluconate-mediated repression and defines a new, independent pathway for regulation of the tripyrrole antibiotic, prodigiosin, in Serratia. Microbiology 151: 3833–3845 doi: 10.1099/mic.0.28251-0
Gasc A, Sicard N, Claverys J, Sicard A (1980) Lack of SOS repair in Streptococcus pneumoniae. Mutat Res 70: 157–165 doi: 10.1016/0027-5107(80)90155-4
Green AE, Howarth D, Chaguza C, Echlin H, Langendonk RF, Munro C, Barton TE, Hinton JCD, Bentley SD, Rosch JW, Neill DR (2021) Pneumococcal colonization and virulence factors identified via experimental evolution in infection models. Mol Biol Evol 38: 2209–2226 doi: 10.1093/molbev/msab018
Gu Z, Gu L, Eils R, Schlesner M, Brors B (2014) Circlize implements and enhances circular visualization in R. Bioinformatics 30: 2811–2812 doi: 10.1093/bioinformatics/btu393
Hess ST, Blake JD, Blake R (1994) Wide variations in neighbor-dependent substitution rates. J Mol Biol 236: 1022–1033 doi: 10.1016/0022-2836(94)90009-4
Hillerich B, Westpheling J (2006) A new GntR family transcriptional regulator in Streptomyces coelicolor is required for morphogenesis and antibiotic production and controls transcription of an ABC transporter in response to carbon source. J Bacteriol 188: 7477–7487 doi: 10.1128/JB.00898-06
Hirschmann S, Gómez-Mejia A, Mäder U, Karsunke J, Driesch D, Rohde M, Häussler S, Burchhardt G, Hammerschmidt S (2021) The two-component system 09 regulates pneumococcal carbohydrate metabolism and capsule expression. Microorganisms 9: 468 doi: 10.3390/microorganisms9030468
Jacobs MR (1992) Treatment and diagnosis of infections caused by drug-resistant Streptococcus pneumoniae. Clin Infect Dis 15: 119–127 doi: 10.1093/clinids/15.1.119
Jiang L, Hu X, Xu T, Zhang H, Sheng D, Yin D (2013) Prevalence of antibiotic resistance genes and their relationship with antibiotics in the Huangpu River and the drinking water sources, Shanghai, China. Sci Total Environ 458: 267–272 doi: 10.1016/j.scitotenv.2013.04.038
Jones M, Wagner R, Radman M (1987) Repair of a mismatch is influenced by the base composition of the surrounding nucleotide sequence. Genetics 115: 605–610 doi: 10.1093/genetics/115.4.605
Jørgensen KM, Wassermann T, Jensen PØ, Hengzuang W, Molin S, Høiby N, Ciofu O (2013) Sublethal ciprofloxacin treatment leads to rapid development of high-level ciprofloxacin resistance during long-term experimental evolution of Pseudomonas aeruginosa. Antimicrob Agents Chemother 57: 4215–4221 doi: 10.1128/AAC.00493-13
Jun SH, Kim TG, Ban C (2006) DNA mismatch repair system: classical and fresh roles. FEBS J 273: 1609–1619 doi: 10.1111/j.1742-4658.2006.05190.x
Kadioglu A, Weiser JN, Paton JC, Andrew PW (2008) The role of Streptococcus pneumoniae virulence factors in host respiratory colonization and disease. Nat Rev Microbiol 6: 288–301 doi: 10.1038/nrmicro1871
Keightley PD, Halligan DL (2009) Analysis and implications of mutational variation. Genetica 136: 359–369 doi: 10.1007/s10709-008-9304-4
Kim D, Langmead B, Salzberg SL (2015) HISAT: a fast spliced aligner with low memory requirements. Nat Methods 12: 357–360 doi: 10.1038/nmeth.3317
Kolodner RD, Marsischky GT (1999) Eukaryotic DNA mismatch repair. Curr Opin Genet Dev 9: 89–96 doi: 10.1016/S0959-437X(99)80013-6
Krisko A, Radman M (2013) Biology of extreme radiation resistance: the way of Deinococcus radiodurans. Cold Spring Harb Perspect Biol 5: a012765 doi: 10.1101/cshperspect.a012765
Kucukyildirim S, Long H, Sung W, Miller SF, Doak TG, Lynch M (2016) The rate and spectrum of spontaneous mutations in Mycobacterium smegmatis, a bacterium naturally devoid of the postreplicative mismatch repair pathway. G3-Genes Genom Genet 6: 2157–2163 doi: 10.1534/g3.116.030130
Lanie JA, Ng WL, Kazmierczak KM, Andrzejewski TM, Davidsen TM, Wayne KJ, Tettelin H, Glass JI, Winkler ME (2007) Genome sequence of Avery's virulent serotype 2 strain D39 of Streptococcus pneumoniae and comparison with that of unencapsulated laboratory strain R6. J Bacteriol 189: 38–51 doi: 10.1128/JB.01148-06
Lee H, Popodi E, Tang H, Foster PL (2012) Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc Natl Acad Sci USA 109: E2774–E2783 doi: 10.1073/pnas.1210309109
Levinson G, Gutman GA (1987) Slipped-strand mispairing: a major mechanism for DNA sequence evolution. Mol Biol Evol 4: 203–221
Li G-M (2008) Mechanisms and functions of DNA mismatch repair. Cell Res 18: 85–98 doi: 10.1038/cr.2007.115
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760 doi: 10.1093/bioinformatics/btp324
Long H, Kucukyildirim S, Sung W, Williams E, Lee H, Ackerman M, Doak TG, Tang H, Lynch M (2015) Background mutational features of the radiation-resistant bacterium Deinococcus radiodurans. Mol Biol Evol 32: 2383–2392 doi: 10.1093/molbev/msv119
Long H, Miller SF, Strauss C, Zhao C, Cheng L, Ye Z, Griffin K, Te R, Lee H, Chen CC, Lynch M (2016) Antibiotic treatment enhances the genome-wide mutation rate of target cells. Proc Natl Acad Sci USA 113: E2498–E2505 doi: 10.1073/pnas.1601208113
Long H, Miller SF, Williams E, Lynch M (2018a) Specificity of the DNA mismatch repair system (MMR) and mutagenesis bias in bacteria. Mol Biol Evol 35: 2414–2421 doi: 10.1093/molbev/msy134
Long H, Sung W, Kucukyildirim S, Williams E, Miller SF, Guo W, Patterson C, Gregory C, Strauss C, Stone C, Berne C, Kysela D, Shoemaker WR, Muscarella ME, Luo H, Lennon JT, Brun YV, Lynch M (2018b) Evolutionary determinants of genome-wide nucleotide composition. Nat Ecol Evol 2: 237–240 doi: 10.1038/s41559-017-0425-y
Longerich S, Galloway AM, Harris RS, Wong C, Rosenberg SM (1995) Adaptive mutation sequences reproduced by mismatch repair deficiency. Proc Natl Acad Sci USA 92: 12017–12020 doi: 10.1073/pnas.92.26.12017
Lynch M (2010) Evolution of the mutation rate. Trends Genet 26: 345–352 doi: 10.1016/j.tig.2010.05.003
Lynch M (2012) Evolutionary layering and the limits to cellular perfection. Proc Natl Acad Sci USA 109: 18851–18856 doi: 10.1073/pnas.1216130109
Lynch M, Ackerman MS, Gout JF, Long H, Sung W, Thomas WK, Foster PL (2016) Genetic drift, selection and the evolution of the mutation rate. Nat Rev Genet 17: 704–714 doi: 10.1038/nrg.2016.104
Lynch M, Sung W, Morris K, Coffey N, Landry CR, Dopman EB, Dickinson WJ, Okamoto K, Kulkarni S, Hartl DL (2008) A genome-wide view of the spectrum of spontaneous mutations in yeast. Proc Natl Acad Sci USA 105: 9272–9277 doi: 10.1073/pnas.0803466105
Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits, vol 1. Sinauer, Sunderland
Mandell LA, Wunderink RG, Anzueto A, Bartlett JG, Campbell GD, Dean NC, Dowell SF, File TM Jr, Musher DM, Niederman MS, Torres A, Whitney CG (2007) Infectious Diseases Society of America/American Thoracic Society consensus guidelines on the management of community-acquired pneumonia in adults. Clin Infect Dis 44: S27–S72 doi: 10.1086/511159
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M (2010) The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20: 1297–1303 doi: 10.1101/gr.107524.110
Mejean V, Salles C, Bullions LC, Bessman MJ, Claverys JP (1994) Characterization of the mutX gene of Streptococcus pneumoniae as a homologue of Escherichia coli mutT, and tentative definition of a catalytic domain of the dGTP pyrophosphohydroiases. Mol Microbiol 11: 323–330 doi: 10.1111/j.1365-2958.1994.tb00312.x
Michaels ML, Cruz C, Grollman AP, Miller JH (1992) Evidence that MutY and MutM combine to prevent mutations by an oxidatively damaged form of guanine in DNA. Proc Natl Acad Sci USA 89: 7022–7025 doi: 10.1073/pnas.89.15.7022
Modrich P, Lahue R (1996) Mismatch repair in replication fidelity, genetic recombination, and cancer biology. Annu Rev Biochem 65: 101–133 doi: 10.1146/annurev.bi.65.070196.000533
Morton BR (2003) The role of context-dependent mutations in generating compositional and codon usage bias in grass chloroplast DNA. J Mol Evol 56: 616–629 doi: 10.1007/s00239-002-2430-1
Napolitano R, Janel-Bintz R, Wagner J, Fuchs R (2000) All three SOS-inducible DNA polymerases (Pol Ⅱ, Pol Ⅳ and Pol Ⅴ) are involved in induced mutagenesis. EMBO J 19: 6259–6265 doi: 10.1093/emboj/19.22.6259
Nghiem Y, Cabrera M, Cupples C, Miller J (1988) The mutY gene: a mutator locus in Escherichia coli that generates G. C→T. A transversions. Proc Natl Acad Sci USA 85: 2709–2713 doi: 10.1073/pnas.85.8.2709
Pan J, Li W, Ni J, Wu K, Konigsberg I, Rivera CE, Tincher C, Gregory C, Zhou X, Doak TG, Lee H, Wang Y, Gao X, Lynch M, Long H (2022) Rates of mutations and transcript errors in the foodborne pathogen Salmonella enterica subsp. enterica. Mol Biol Evol 39: msac081 doi: 10.1093/molbev/msac081
Pan J, Williams E, Sung W, Lynch M, Long H (2021) The insect-killing bacterium Photorhabdus luminescens has the lowest mutation rate among bacteria. Mar Life Sci Technol 3: 20–27 doi: 10.1007/s42995-020-00060-0
Pericone CD, Overweg K, Hermans PW, Weiser JN (2000) Inhibitory and bactericidal effects of hydrogen peroxide production by Streptococcus pneumoniae on other inhabitants of the upper respiratory tract. Infect Immun 68: 3990–3997 doi: 10.1128/IAI.68.7.3990-3997.2000
Pericone CD, Bae D, Shchepetov M, McCool T, Weiser JN (2002) Short-sequence tandem and nontandem DNA repeats and endogenous hydrogen peroxide production contribute to genetic instability of Streptococcus pneumoniae. J Bacteriol 184: 4392–4399 doi: 10.1128/JB.184.16.4392-4399.2002
Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL (2016) Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc 11: 1650–1667 doi: 10.1038/nprot.2016.095
Ramos-Montanez S, Tsui HC, Wayne KJ, Morris JL, Peters LE, Zhang F, Kazmierczak KM, Sham LT, Winkler ME (2008) Polymorphism and regulation of the spxB (pyruvate oxidase) virulence factor gene by a CBS-HotDog domain protein (SpxR) in serotype 2 Streptococcus pneumoniae. Mol Microbiol 67: 729–746 doi: 10.1111/j.1365-2958.2007.06082.x
Sauvage E, Kerff F, Terrak M, Ayala JA, Charlier P (2008) The penicillin-binding proteins: structure and role in peptidoglycan biosynthesis. FEMS Microbiol Rev 32: 234–258 doi: 10.1111/j.1574-6976.2008.00105.x
Slager J, Aprianto R, Veening JW (2018) Deep genome annotation of the opportunistic human pathogen Streptococcus pneumoniae D39. Nucleic Acids Res 46: 9971–9989
Sung W, Ackerman MS, Gout J-F, Miller SF, Williams E, Foster PL, Lynch M (2015) Asymmetric context-dependent mutation patterns revealed through mutation-accumulation experiments. Mol Biol Evol 32: 1672–1683 doi: 10.1093/molbev/msv055
Sung W, Ackerman MS, Miller SF, Doak TG, Lynch M (2012) Drift-barrier hypothesis and mutation-rate evolution. Proc Natl Acad Sci USA 109: 18488–18492 doi: 10.1073/pnas.1216223109
Tajiri T, Maki H, Sekiguchi M (1995) Functional cooperation of MutT, MutM and MutY proteins in preventing mutations caused by spontaneous oxidation of guanine nucleotide in Escherichia coli. Mutat Res 336: 257–267 doi: 10.1016/0921-8777(94)00062-B
Tsui HT, Boersma MJ, Vella SA, Kocaoglu O, Kuru E, Peceny JK, Carlson EE, VanNieuwenhze MS, Brun YV, Shaw SL, Winkler ME (2014) Pbp2x localizes separately from Pbp2b and other peptidoglycan synthesis proteins during later stages of cell division of Streptococcus pneumoniae D39. Mol Microbiol 94: 21–40 doi: 10.1111/mmi.12745
Tsui HC, Keen SK, Sham LT, Wayne KJ, Winkler ME (2011) Dynamic distribution of the SecA and SecY translocase subunits and septal localization of the HtrA surface chaperone/protease during Streptococcus pneumoniae D39 cell division. mBio 2: e00202–00211 doi: 10.1128/mBio.00202-11
Tsui HC, Zheng JJ, Magallon AN, Ryan JD, Yunck R, Rued BE, Bernhardt TG, Winkler ME (2016) Suppression of a deletion mutation in the gene encoding essential PBP2b reveals a new lytic transglycosylase involved in peripheral peptidoglycan synthesis in Streptococcus pneumoniae D39. Mol Microbiol 100: 1039–1065 doi: 10.1111/mmi.13366
Van de Beek D, de Gans J, Tunkel AR, Wijdicks EF (2006) Community-acquired bacterial meningitis in adults. New Engl J Med 354: 44–53 doi: 10.1056/NEJMra052116
Van der Poll T, Opal SM (2009) Pathogenesis, treatment, and prevention of pneumococcal pneumonia. Lancet 374: 1543–1556 doi: 10.1016/S0140-6736(09)61114-4
Wahl LM, Gerrish PJ (2001) The probability that beneficial mutations are lost in populations with periodic bottlenecks. Evolution 55: 2606–2610
Watson DA, Musher DM, Jacobson JW, Verhoef J (1993) A brief history of the Pneumococcus in biomedical research: a panoply of scientific discovery. Clin Infect Dis 17: 913–924 doi: 10.1093/clinids/17.5.913
Waxman DJ, Strominger JL (1983) Penicillin-binding proteins and the mechanism of action of beta-lactam antibiotics. Annu Rev Biochem 52: 825–869 doi: 10.1146/annurev.bi.52.070183.004141
Wick RR, Judd LM, Gorrie CL, Holt KE (2017) Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comp Biol 13: e1005595 doi: 10.1371/journal.pcbi.1005595
Woodford N, Ellington MJ (2007) The emergence of antibiotic resistance by mutation. Clin Microbiol Infect 13: 5–18 doi: 10.1111/j.1469-0691.2006.01492.x
Yesilkaya H, Andisi VF, Andrew PW, Bijlsma JJ (2013) Streptococcus pneumoniae and reactive oxygen species: an unusual approach to living with radicals. Trends Microbiol 21: 187–195 doi: 10.1016/j.tim.2013.01.004
Yu VL, Chiou CC, Feldman C, Ortqvist A, Rello J, Morris AJ, Baddour LM, Luna CM, Snydman DR, Ip M (2003) An international prospective study of pneumococcal bacteremia: correlation with in vitro resistance, antibiotics administered, and clinical outcome. Clin Infect Dis 37: 230–237 doi: 10.1086/377534
Zhang Y, Zhang C, Huo W, Wang X, Zhang M, Palmer K, Chen M (2023) An expectation–maximization algorithm for estimating proportions of deletions among bacterial populations with application to study antibiotic resistance gene transfer in Enterococcus faecalis. Mar Life Sci Technol 5: 28–43 doi: 10.1007/s42995-022-00144-z
Zurek L, Ghosh A (2014) Insects represent a link between food animal farms and the urban environment for antibiotic resistance traits. Appl Environ Microbiol 80: 3562–3567 doi: 10.1128/AEM.00600-14
N, number of replicate MA lines; Transfers, mean number of transfers during the experiment; Division, mean cell divisions in the final analyses; Ne, effective population size of each penicillin-treatment corresponded to a certain concentration; Depth (×), depth of sequencing coverage
BPSs, total number of base-pair substitutions in the MA lines; μBPS, base-pair substitution mutation rate; ts/tv, the ratio of transitions/transversions; AT bias, mutation bias in the A/T direction, (see Materials and methods for details); Indels, number of insertions and deletions; μIndel, indel mutation rate; Ins/Del, the ratio of insertions/deletions. All mutation rates are in units of × 10–8 per nucleotide site per cell division