Introduction

Acinetobacter baumannii is a notorious opportunistic Gram-negative pathogen that causes hospital-acquired infections, such as bacteraemia, pneumonia, wound infections and urinary tract infections1 that are difficult—and in many cases impossible—to treat due to high levels of resistance to several antimicrobial classes2,3. Indeed, the spread of A. baumannii strains resistant to all available antimicrobials has become a major global concern3,4, and carbapenem-resistant A. baumannii has been flagged by the World Health Organisation as the number one priority for antimicrobial development5. In A. baumannii, antimicrobial resistance (AMR) is known to occur primarily by the horizontal acquisition of AMR genes via mobile genetic elements (MGEs) such as transposons and plasmids3,6,7,8. Notably, A. baumannii plasmids are increasingly recognised as a major source for disseminating AMR genes such as those that confer resistance to carbapenems and colistin9,10,11,12,13.

A. baumannii has a unique repertoire of plasmids that capture and mobilise a wide range of genetic material involved in pathogenesis and AMR (6, 15–18). Recently, we developed a plasmid typing scheme based on the sequence of replication initiation genes from 621 complete A. baumannii plasmids called Acinetobacter Plasmid Typing (APT) scheme13. This first version of the APT scheme includes 80 Rep types belonging to three families; R1 types 1 to 6, R3 types 1 to 69, and RP types 1–513,14. However, the role of each plasmid type in dissemination of AMR genes remains to be established. To date, several studies have examined the role of A. baumannii plasmids in the spread of AMR genes9,10,15,16,17,18 but most of these studies have only reported on individual or a limited number of plasmids and thus, a comprehensive overview of how various plasmid types are involved in the dissemination of AMR genes remains elusive. In this study, we address this gap by examining the distribution of chromosomal sequence types, sampling, geographies, and AMR genes carried on plasmids originally included in the APT database alongside an additional 193 complete plasmids that have since been deposited in GenBank (as of August 18, 2022). We also provide an update to the original APT scheme with the addition of novel rep/Rep types.

Results and discussion

Overview of genome and plasmid dataset

As of August 18, 2022, 450 complete A. baumannii genomes were available in GenBank. Of these 450 complete genomes, 80% (n = 355) had at least one plasmid (Supplementary Table 1) with 236 genomes containing one (n = 1) plasmid, and 113 carrying two plasmids (Table 1 and Supplementary Table 1). Ninety-five (n = 95) genomes lacked a plasmid and were not studied here. To broaden our plasmid dataset, we extended our search to the RefSeq database and captured an additional 92 genomes that contained at least one (n = 1) plasmid. Of these 92 genomes/unique (non-duplicate) strains, 63 were not linked to a genome project and 29 genomes were sourced from WGS (Whole Genome Shotgun), which included draft genomes with complete plasmid entries. Following curation of the dataset (i.e. exclusion of duplicate entries and assembly QC; see methods for more details), our final dataset was comprised of 813 non-redundant plasmid entries corresponding to at least 439 unique isolates (Supplementary Table 1 and Supplementary Table 2; n = 2/813 plasmids unassigned due to absence of BioSample and strain name). Notably, of the 439 unique genomes/isolates, the rep/Rep sequences of 328 had already been analysed in the original APT scheme13. Indeed, in this study, we investigated the prevalence of antimicrobial resistance (AMR) genes within different plasmid types, considering their associated meta-data. A total of 439 unique genomes/strains were analysed, comprising both the 328 genomes previously reported and an additional 111 genomes captured in this study. Over half of the isolates carried one plasmid (n = 228 isolates; 52%) and 27% (n = 120) carried two plasmids (Table 1). Seven genomes contained 6–11 plasmids, indicating that some A. baumannii strains have the capacity to carry a significantly high load of plasmids.

Table 1 Number of plasmids found in 439 unique BioSamples and/or strain names.

Of the 439 unique isolates, ST2 represented the most abundant sequence type (n = 199), followed by ST1 (n = 25), ST25 (n = 14), and ST622 (n = 10). In 2019, we reported that both the geographies and sampling types of publicly available A. baumannii genomes deposited in NCBI were extremely skewed3. This is similarly reflected in this dataset, with the vast majority sequenced in strains recovered from clinical samples (n = 358; 81.5%), and few from non-clinical sources (Fig. 1). This is primarily because of continued significance and attention paid towards clinical strains, and a lessened focus on studying the role of non-clinical reservoirs (e.g. environment, animals) as potential sources for plasmids and AMR genes. Most complete plasmid entries were sequenced from isolates collected in East Asia (primarily China, n = 104/154 isolates), followed by North America (primarily US, n = 77/90) and South Asia (primarily India, n = 44/46). The strains were isolated from forty-three countries, and those with at least ten plasmids plus isolates within the dataset included South Korea (n = 38), Australia (n = 15), France (n = 15), Canada (n = 13), Iraq (n = 12), Mexico (n = 12), and Italy (n = 10; see Fig. 1 and Supplementary Table 1 and Supplementary Table 2).

Fig. 1: Geographical distribution and isolation sources of publicly available A. baumannii strains carrying plasmids.
figure 1

a Geographical distribution of plasmid-containing strains, colour-coded by the number of genomes accessible in GenBank as of August 2022. b Isolation sources of plasmid-carrying strains, depicted with a scale bar provided above.

Novel rep sequences and update to the Acinetobacter Plasmid Typing (APT) scheme

Of the 813 plasmids studied here, 620 (i.e. 71%) were previously used to generate the first version of the APT scheme. An additional 193 complete plasmid entries, corresponding to 92 isolates, had since been released in GenBank between February 2021 and mid-August 2022. Using the criteria we had previously established for assigning rep types (i.e., 95% DNA identity)13,  12 novel rep types were identified (n = 10 R3 types, designated R3-T70 to R3-T78; n = 2 R1 types, R1-T7 and R1-T8; n =  1 RP type, RP-T6), resulting in a total of  77, 7 and 6 R3, R1 and RP types, respectively (Table 2, Supplementary Table 2). In addition to the novel rep/Rep sequences, we also report additional updates to the scheme as follows. R3-T49 has been removed from the updated APT scheme, as the corresponding rep sequence (previously r3-T49_NZ_AYFZ01000080.1_pABUH2a-5.6_c33) has been identified as a R3-T26 variant. Specifically, this variant carries an insert of 84 bp that differentiates this sequence from the other R3-T26 variants. This entry has been subsequently renamed to R3-T26* and R3-T49 retired from the scheme. R1-T3 was also retired due to possible sequencing/assembly errors resulting in shortening the Rep reading frame by ~150 amino acids. Lastly, we highlight two corrections to Fig. 3 and Table 3 published in the original APT paper13: (i) R3-T3 was annotated twice in Fig. 3; the first annotated clade highlighted in orange should be corrected to R3-T8, and (ii) in Table 3, the rows and columns should read R1-T1 to R1-T6 (i.e. not P1-T1 to P1-T6).

Table 2 Number of plasmids in each rep family.
Table 3 Distribution of plasmids corresponding to the 15 most abundant R3 Rep types in major globally distributed sequence types (STs).

Plasmids encoding the Rep_1 family replication protein (Rep_1 or R1 plasmids) do not carry AMR genes

R1-type plasmids (encoding Pfam01446) are often 2–3 kb in length and are typically comprised of a replication initiation protein and only two or three additional open reading frames encoding hypothetical proteins. R1-type plasmids constitute a small fraction of the plasmid dataset (n = 16 plasmids, 13 isolates) and none of these carried AMR genes, suggesting that these plasmids are not yet involved in the acquisition and spread of AMR. Strains belonging to global clone 2 (GC2; largely represented by ST2) constitute over 90% of all A. baumannii genomes in GenBank, but R1 plasmids appear to be mainly associated with strains belonging to global clone 1 (GC1; largely represented by ST1, n = 9/13 isolates) with only one GC2 strain found with R1 plasmids (Fig. 2).

Fig. 2: Phylogenetic relationship and distribution of antimicrobial resistance determinants in R1-type plasmids across major globally distributed sequence types and global clones.
figure 2

Plots show the plasmids linked with a particular plasmid type, with each data point corresponding to a unique plasmid, grouped by chromosomal sequence type/clone, and coloured by geographical region as shown in the Figure key. Empty circles (marked no AMR) indicate the absence of antimicrobial resistance gene.

Rep_3 (R3) plasmids disseminate important AMR genes

R3 plasmids encoding the Rep_3-type plasmid replication proteins (Pfam01051) represent, by far, the most diverse A. baumannii plasmid group. This is due to several reasons, including the Rep/rep sequence divergence combined with their floating genetic structure arising from the presence of pdif modules (examples in Fig. 3). Over half of the plasmids were typed as R3 (n = 478/813 plasmids; 59%), and these were detected in at least 344 unique isolates (note, n = 2 R3 type plasmids were unassigned to an isolate). Variants of R3-T1, T2 and T3 constitute the most abundant types and were collectively detected in 224 plasmids. R3-type plasmids appear to be geographically dispersed, but some types appear to be limited to distinct regions (Fig. 4). For example, some variants of R3-T1 plasmids that carry a carbapenemase appear to be limited to North America (e.g. pAB120 carrying blaOXA-72 from the US; GenBank accession number CP031446.1) or Europe (e.g. p1ABST78 from Italy; GenBank accession number AEOZ01000236.1 (Fig. 4, Supplementary Table 1 and Supplementary Table 2), and may represent local plasmid circulation or expansions.

Fig. 3: Schematic comparison of Rep_3 family (R3-type; Pfam01051) plasmid structures.
figure 3

Horizontal arrows show the length and orientation of genes with rep genes coloured black, resistance genes red, toxin/anti-toxins yellow and mobilisation genes blue. Green boxes indicate insertion sequences with their transposase shown inside the box. Small thick vertical bars marked with “i” indicate iterons. Dotted lines show the boundaries of pdif modules. Other vertical bards marked with “C/D or D/C” indicate the location of pdif sites. Regions with significant DNA identities are shown using shades of grey with % identities also shown using red numbers.

Fig. 4: Phylogenetic relationship and distribution of antimicrobial resistance determinants in R3-type plasmids across major globally distributed sequence types and global clones.
figure 4

The overall phylogenetic tree is depicted in panel (a), and clades 1 and 2 shown in greater resolution in panels (b) and (c), respectively. Nodes that are coloured red correspond to plasmid types where the presence of antimicrobial resistance genes is detected in at least one plasmid. Plots in panels (b) and (c) show the number of plasmids linked with a particular plasmid type; each data point corresponds to a unique plasmid, grouped by chromosomal sequence type/clone, and is coloured by geographical region as shown in the Figure key. Empty circles indicate plasmids with no AMR, triangles indicate plasmids with carbapenemases and filled circles represent plasmids with AMR (no carbapenemase). Data for three variants corresponding to R3-T3, R3-T4 and R3-T1 types are separately shown due to spacing.

The updated R3 phylogeny generated from  145 R3 rep nucleotide variants (n = 77 distinct rep types) revealed two deep-branching clades containing 61 and 84 rep variants, with plasmids carrying AMR genes (including carbapenemases) dispersed across both clades. Approximately half of the R3 types were associated with plasmids without AMR (n = 36/77 R3 types), while  18 were linked only to plasmids that carry AMR, and 24 (including the majority of the top 15 most common R3 types; see Fig. 4) were associated with both AMR+ and AMR plasmids.

A quarter of R3-type plasmids (n = 121/478; n = 42 distinct R3 types collectively) were associated with AMR, and 72 of these plasmids (from at least n = 71 isolates) carried carbapenemases. Various AMR genes were detected on R3 plasmids; however, the blaOXA-58, blaOXA-72, blaOXA-24 carbapenem resistance, tet39 tetracycline resistance, sul2 sulfonamide resistance and msr-mph(E) macrolide resistance genes were amongst the most abundant AMR genes found (Supplementary Table 2). R3-type plasmids with AMR genes are carried by all major global clones including ST10, ST15, ST25, ST79, and ST85 strains, which collectively accounted for  26/344 isolates carrying  46 R3-type plasmids, but were predominantly detected in members of GC2 and GC1, accounting for  150 and 26 isolates with R3-type plasmids, respectively (n = 209/478 R3-type plasmids; see Fig. 4 and Table 3).

Colistin is a last resort within our arsenal of antibiotics that has largely remained effective against multidrug-resistant (MDR) A. baumannii19. Here, the mcr colistin resistance gene was present in only   five isolates, all of which were linked to R3-type plasmids. These included 4 plasmids with the mcr-4.3 gene carried by strains recovered in clinical samples (two strains isolated in each of China and the Czech Republic) on an R3-T22 plasmid (Supplementary Table 2). The remaining plasmid, p8E072658, was recently described in an environmental isolate from recycled fibre pulp in a paper mill in Finland15. This plasmid carries the novel mcr-4.7 colistin resistance gene in a Tn3-family transposon and encodes two novel R3-type Reps (R3-T73 and R3-T74). Acquisition of the mcr plasmids is clinically significant given the importance of colistin in treatment of MDR A. baumannii, and while it currently appears to be uncommon, future monitoring of R3 plasmids with mcr may be warranted.

The RP-type plasmids (encoding RepPriCT_1) disseminate aminoglycoside and carbapenem resistance genes

RP-type plasmids encoding RepPriCT_1 (Pfam03090) Rep have been reported in various A. baumannii strains and contribute to the emergence of MDR strains12,20,21,22,23,24,25. Approximately one-fifth of the dataset (n = 158 plasmids; 19.4%) were identified as RP-type. A significant portion of RP-type plasmids (n = 64/158; 40.5%) carry at least one AMR gene highlighting the importance of this plasmid group in the acquisition and spread of AMR determinants. The majority of RP-type plasmids were typed as RP-T1 followed by RP-T2, accounting for  130 and 22 plasmids (82.3% and 13.9%), respectively. The genetic structure of the predominant RP-T1 plasmid variant (pACICU2; GenBank accession number CP031382.1), is illustrated in Supplementary Figure 1.

In fact, RP-T1 and RP-T2 plasmids accounted for all AMR+ RP-type plasmids. All 49 RP-T1 AMR+ plasmids carried either blaOXA-23 (carbapenemase; n = 31 RP-T1 plasmids) and/or aphA6 (amikacin resistance; n = 30 RP-T1);  12 RP-T1 plasmids carried both. Other AMR genes detected in the RP-T1 plasmids included sul1, dfrA7, aacA4, blaGES-11, strAB, aadA2, cmlA1, aadB, and the blaOXA-58 carbapenemase (Fig. 5 and Supplementary Table 2). Moreover, it appears that variants of RP-T1 plasmids have similarly been acquired by all major globally distributed clones including members of GC1 and GC2, ST10, ST15, ST25, ST79 and ST622, recovered across all continents (Fig. 5).

In contrast to the global distribution of RP-T1 plasmids, all  22 RP-T2 plasmids were sequenced from isolates collected in East Asia (predominantly China, except for one plasmid with no AMR genes sequenced from an isolate in South Korea). Notably,  15/22 plasmids contained a blaOXA-23 copy suggesting that RP-T2 plasmids with blaOXA-23 are circulating in China and have not yet been detected elsewhere. Plasmids corresponding to the remaining RP-types were generally small plasmids ranging in size from 4.5 kb to 6.8 kb (except for RP-T3; 52.5 kb) and carry no AMR genes. Interestingly, phylogenetic analysis of RP-type rep sequences (RepPriCT_1 family) revealed a clear separation of the smaller plasmids that lack AMR genes (RP-T4, RP-T5 and RP-T6) from the larger RP-T1, T2 and T3 plasmids (Fig. 5), suggesting distinct evolutionary trajectories that have likely influenced the accumulation of additional genes including those conferring AMR.

Fig. 5: Phylogenetic relationship and distribution of antimicrobial resistance determinants in RP-type plasmids across major globally distributed sequence types and global clones.
figure 5

Plots show the number of plasmids linked with a particular plasmid type; each data point corresponds to a unique plasmid, grouped by chromosomal sequence type/clone, and is coloured by geographical region as shown in the Figure key. Empty circles indicate plasmids with no AMR, triangles indicate plasmids with carbapenemases and filled circles represent plasmids with AMR (no carbapenemase).

Distribution of AMR genes in plasmids with no identifiable replication gene

We previously reported that a fraction of A. baumannii plasmids do not encode an identifiable replication initiation gene (i.e. 22.9%; n = 142/621 plasmids)13. Such plasmids might therefore use an alternative mechanism that does not involve a Rep to initiate replication or encode a novel Rep that is yet to be discovered. Here,  161/813 plasmids did not encode an identifiable replication initiation gene. This rep-less group constitutes a set of highly diverse plasmids ranging in size from 4 kb to over 200 kb. Almost a third of these (n = 52; 32.3%) appear to carry no AMR genes and range in size from 2.4 to 145.7 kb (Table 4). These plasmids are not discussed further as they lack AMR genes. The remaining  109 plasmids (length range 3.8 kb to >200 kb) carry at least one AMR gene and constitute various plasmid variants. Some variants are associated with the carriage of clinically significant AMR genes, and include those related to pRAY*, large MPFF conjugative plasmids such as pA297-3 (Supplementary Fig. 2), and pNDM-BK01 (n = 28, 31 and 8, respectively; accounting for 41.6% rep-less plasmids). These plasmids are further discussed below.

  1. (i)

    pRAY*—an important small plasmid spreading resistance to aminoglycosides. It has been shown that the small plasmid pRAY* and its variants play a role in the spread of the aadB gene conferring resistance to tobramycin, gentamicin and kanamycin, which are considered clinically significant antimicrobials26. Although some variants did not carry an AMR gene (an example shown in Fig. 6), we observed  28 plasmids that were either identical or closely related to pRAY*, and most (n = 25/28) carried aadB. These plasmids were found in strains assigned to at least 14 STs, including ST1, ST81, ST2, ST25 and ST85 (Supplementary Table 3). The strains were also geographically diverse, indicating global dissemination of pRAY* plasmids. Moreover, all strains with pRAY* were recovered in clinical samples suggesting aminoglycoside selective pressures may play a role in driving their stable maintenance within clinical settings (Supplementary Table 3).

    Fig. 6: Linearised map of pD36-1 (cryptic) compared with pRAY* (pD36-2).
    figure 6

    Central horizontal lines indicate the plasmid backbones. Arrows represent the extent and orientation of genes, and the gene cassette is boxed. The grey shadings indicate regions with significant identity with the % identities indicated in red. The scale bar is shown. Drawn to scale from GenBank accession numbers CP012954 (pRAY*), and CP012953 (pD36-1).

  2. (ii)

    Spread of diverse AMR genes by conjugative plasmids encoding the MPFF transfer system. This group constitutes a diverse set of  31 large plasmids (146.7–236.2 kb in size) known to lack an identifiable rep gene. These plasmids were detected in at least 11 distinct STs with the highest count corresponding to ST622 (n = 10) followed by ST25 (n = 7) and were also present in members of the major global clones (e.g. ST1, ST10, ST25; Supplementary Table 4). The 200.6 kb plasmid, pA297-3 (Supplementary Table 4) is considered the representative as it has the most common backbone type and was one of the earliest described and shown to be conjugative11. It carries sul2 and strAB, conferring resistance to sulphonamide and streptomycin, respectively. Most plasmids in this group (except p40288 and pR32_1; Supplementary Table 4) carry a copy of sul2. Most members also carry strAB (n = 28/31), msr-mph(E) (n = 21), blaPER-7 (n = 18), and armA (n = 20) conferring resistance to streptomycin, macrolides, extended-spectrum β-lactamases (ESBLs), and aminoglycosides, respectively (Supplementary Table 4). The latter, armA, encodes the 16S rRNA methylation protein that confers resistance to all aminoglycosides27. Two plasmids, pPM193665_1 and pPM194122_1 (GenBank accession numbers CP050416 and CP050426, respectively) from strains recovered in India, also contain the blaNDM metallo-ß-lactam carbapenem resistance gene.

  3. (iii)

    Conjugative plasmids encoding MPFT transfer system. Though not very common, blaNDM has now been reported in A. baumannii in several countries3,28,29,30,31. In this study, we found eight plasmids with no identifiable rep gene that encode the MPFT type conjugative transfer system32 and carried the blaNDM metallo-beta-lactam carbapenem resistance gene (Supplementary Table 5). All these plasmids were found to be related to pNDM-BJ01 (GenBank accession number JQ001791.1), which was first reported in Acinetobacter lwoffii and shown to be conjugative at a high frequency33. These plasmids were carried by strains recovered in clinical, environmental (wastewater) and animal samples in different countries including China, Japan, US, Colombia, and Brazil showing their wide geographical distribution. They were found in various sequence types, of which only one (p1AR_0088; GenBank accession number CP027532.1; Supplementary Table 2 and Supplementary Table 5) was in a ST25 strain, which is an important globally distributed ST3,16,34,35,36. Given the potential for accelerated resistance dissemination of resistance to a last-line antimicrobial and hence heightened therapeutic challenges, targeted surveillance of MPFT type plasmids with blaNDM may be warranted.

Table 4 Summary of rep-less plasmids.

Diverse plasmid types facilitating the spread of carbapenem resistance genes

Carbapenemases stand out as important AMR determinants as carbapenems are one of the last resort lines of defence in antimicrobial treatment3. Here, we showed that various R3, RP and rep-less plasmids were associated with the spread of carbapenem resistance genes including blaoXA-23, blaOXA-24, blaOXA-58, and blaNDM. Carbapenemases were observed in 149 plasmids (18.3%), of which blaOXA-type genes were the most common carbapenemase type followed by blaNDM (n = 131, 11 and 7 plasmids with blaOXA only, blaNDM only and blaOXA plus blaNDM, respectively). We detected twelve allelic variants of blaOXA-type genes; blaOXA-23 was the most prevalent (n = 54), followed by blaOXA-58 (n = 32, of which n = 6 also carried blaNDM-1), blaOXA-72 (n = 27), and blaOXA-24 (n = 11). The prevalence of some of these alleles appear to be associated with distinct plasmid types. For example,  46/54 blaOXA-23 plasmids were typed as RP-T1 or RP-T2, while R3-type plasmids appear to play a key role in the dissemination of blaOXA-58 (i.e. n = 27/32 plasmids with blaOXA-58), blaOXA-72 (n = 21/27), and blaOXA-24 (n = 10/11). Notably, except for blaOXA-23, which was abundant in ST2 isolates (n = 33/54 blaOXA-23 plasmids), there was no clear correlation of carbapenemase-carrying plasmids with particular sequence types (ST), indicating widespread distribution of plasmids between various A. baumannii clones.

The blaNDM carbapenem resistance gene is clinically significant in all Gram-negative bacteria, especially Enterobacterales as its rapid spread among different bacterial species worldwide has become a serious threat to public health37. A single allelic variant, blaNDM-1, was observed in this dataset and detected in  13 plasmids including  eight pNDM-BJ01-type variants, two R3-type two pA297-3-type, and a novel plasmid pCCBH31258 (GenBank accession number CP101888). The presence of blaNDM on conjugative plasmids in A. baumannii is significant as it highlights the potential for the rapid transmission of this important carbapenemase via horizontal gene transfer.

Opportunities and limitations

Advances in whole genome sequencing technologies combined with the rapid accumulation of genome data in publicly available databases such as GenBank has provided a valuable opportunity to gain genomic insights towards the circulation of AMR genes in critical pathogens such as A. baumannii and, more importantly, MGEs that disseminate AMR. However, this unique opportunity is associated with important caveats given that publicly available genome sequences are largely geographically skewed for several reasons, including the lack of technology, financial support and expertise in developing countries. Currently, the bulk of genome sequence data in GenBank has been sequenced from isolates collected in the US, China and Australia; these countries accounted for ~50% of the dataset in this study. The geographical skew of genome sequence data makes it difficult to gain comprehensive insights into population structure, MGEs and AMR genes circulating in other parts of the world e.g. Africa and the Middle East. Moreover, genomes of environmental A. baumannii strains are also very limited (Fig. 1), making it difficult to understand the distribution of AMR genes and MGEs including plasmids in the environment. Consequently, the scarcity of environmental genome data also limits the study of how plasmids, particularly those with AMR, circulate between the strains from clinical and environmental origins.

Conclusions

Traditionally, A. baumannii has been characterized as an organism that primarily acquires AMR genes through large chromosomal islands. However, this definition is changing as more plasmids that carry important AMR genes are being characterised. This study also highlights the pivotal role of various plasmid types, particularly certain families in the dissemination of clinically important AMR genes within this pathogen. We showed that many RP-T1, R3-types (e.g. RP-T1 and R3-T2) and rep-less pNDM plasmids can spread various carbapenem resistance genes. These are of particular concern as AMR genes conferring resistance to carbapenems are often considered as the last line of defence in treatment. Furthermore, these plasmids typically carry additional AMR genes conferring resistance to multiple antimicrobials, which further compounds treatment management and the threat posed by A. baumannii.

Although the plasmid repertoire of A. baumannii exhibits remarkable diversity, this investigation highlights the significance of specific plasmid families in harbouring and disseminating AMR genes (e.g. RP-types). The findings from this study provide new insights into which plasmid types are over-represented among those that disseminate AMR and may be flagged as targets for focused AMR surveillance. Finally, this study showed that, in the ongoing battle against antibiotic resistance in A. baumannii, its plasmids play a significant role in exacerbating the crisis. Their ability to transfer AMR genes across different sequence types, coupled with the bacterium’s adaptability, poses a formidable challenge to healthcare systems worldwide.

Methods

Plasmid sequence data

A local database of complete A. baumannii plasmids that were publicly available as of mid-August 2022 was generated. Our local database included plasmid sequences of (i) 354 complete genomes out of 449 complete A. baumannii genomes (i.e. n = 95 entries with no plasmids) sourced from GenBank38; labelled as ‘Complete genome project’ as data source in Supplementary Table 1 and Supplementary Table 2) and (ii) an additional 92 genomes/unique strains (released between February 2021 and mid-August 2022) captured in RefSeq39. The latter included 29 genomes sourced from Whole Genome Shotgun projects (labelled as ‘WGS’ in Table Supplementary 2) and 63 unique strains that were not linked to a genome project (i.e. direct plasmid submission to GenBank; labelled as ‘GenBank non-redundant db’ in Supplementary Table 2). This resulted in the curation of our final dataset consisted of 813 non-redundant plasmid entries corresponding to at least 439 unique isolates (n = 354 isolates from Complete genome projects, n = 63 from GenBank non-redundant database, and n = 29 from WGS). Of the 813 plasmid entries, 620 were those we previously used to develop the Acinetobacter Plasmid Typing scheme13. Note that 621 plasmids had been included in the original scheme, but one plasmid (accession CP059478.1) was excluded from this current study as it was found to be associated with a non-baumannii genome (i.e. A. pittii). All supporting data and protocols have been provided within the article or through supplementary data files. The online version of this article has four supplementary tables and three supplementary figures.

Bioinformatics and sequence analysis

The chromosomal sequences associated with each plasmid were found by exporting the BioSample accession numbers using the RefSeq39 followed by the curation of a list of chromosomal GenBank accession numbers and downloading the sequence data through Entrez Programming Utilities (E-utilities)40. The chromosomal sequences were uploaded to Pathogenwatch41 and the Speciator tool was used to verify that the sequences were A. baumannii. Multi-locus Sequence Types (MLSTs) were assigned using the mlst v.2.0 software42. Standalone BLAST43 was used for plasmid sequence comparisons within the rep-less plasmid group and assign ‘related known plasmid’ variants as labelled in Supplementary Table 2. The SnapGene® (V.6.0.5) software was used to examine the structure of individual plasmids. The plasmids were screened for AMR genes using Abricate v1.0.144 using the ResFinder v.2.1 database45. Data visualisation was performed using the ggplot2 package46 in R (v1.1.456) and Adobe Illustrator (V23.0.3).

Clustering and phylogenetic analysis of the rep/Rep sequence data

rep/Rep sequences were extracted from the novel plasmid entries using the SRST2 software47 followed by manual curation. Clusters comprising rep sequences at >95% nucleotide identity were derived using CD-HIT Suite48, as previously described13. The rep nucleotide sequences were separately aligned for each of the Rep families using MUSCLE v3.8.3149. Phylogenies were generated using the aligned rep sequences as input into RAxML v8.2.9 run five times with the generalised time-reversible (GTR) model and a Gamma distribution. The final trees with the highest likelihoods were selected, visualised in FigTree v1.4.450, and annotated with the plotTree code51 in R v1.1.456.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.