Myxobacteria are a group of gram-negative bacteria classified into the phylum Myxococcota and order Myxococcales. Archangium gephyra is a myxobacterium belonging to the family Cystobacteraceae and order Myxococcales (Reichenbach, 2005). Myxobacteria produce diverse bioactive secondary metabolites (Weissman and Müller, 2010; Herrmann et al., 2017; Hyun and Cho, 2018). Secondary metabolites isolated from A. gephyra strains include gephyronic acid (Sasse et al., 1995), melithiazols (Sasse et al., 1999), tubulysins (Sasse et al., 2000), argyrins (Sasse et al., 2002), archazolids (Sasse et al., 2003), and aurafurons (Kunze et al., 2005).
Archangium gephyra KYC5002 is a naturally occurring dispersed-growth variant of A. gephyra KYC2615, isolated from Gyeongju, Gyeongsangbuk-do, Republic of Korea (Hyun et al., 2021; Yu et al., 2023). Most myxobacteria isolated from nature form cell aggregates when grown in liquid media. In contrast, strain KYC5002 grows as dispersed cells in liquid media, which allows quantitative measurements. Archangium gephyra KYC5002 is also known as strain MEHO_002 (Choi et al., 2021). Archangium gephyra KYC5002 produces argyrins and tubulysins (Yu et al., 2023). Argyrins are octapeptides with immunosuppressive and antitumorigenic activities (Sasse et al., 2002; Nickeleit et al., 2008). Tubulysins are cytotoxic to eukaryotes because they induce the depolymerization of β-tubulin, causing microtubules to disassemble (Sasse et al., 2000; Khalil et al., 2006). We sequenced the genome of A. gephyra KYC5002 because it produces argyrins and tubulysins, unlike DSM 2261T, the type strain of A. gephyra.
Archangium gephyra KYC5002 was cultured in CYS medium (Shin et al., 2013), and genomic DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method (Wilson, 2001). The whole genome of A. gephyra KYC5002 was sequenced using the PacBio Sequel IIe system and the Illumina HiSeq X Ten platform at Macrogen, Inc. In total, 29,174 HiFi reads (221,452,323 bp) were obtained with the PacBio Sequel IIe system, and 13,758,734 short reads (2,077,568,834 bp) were obtained with the Illumina platform. De novo assembly was conducted using the Flye assembler (v2.4.2) (Kolmogorov et al., 2019) with PacBio HiFi reads only, followed by error correction of the contig bases with Illumina reads using Pilon (v1.21) (Walker et al., 2014). The resulting A. gephyra KYC5002 genome is a single circular chromosome of 13,249,988 bp with a G + C content of 68.8%. When calculated using the OrthoANIu algorithm (Yoon et al., 2017), the genome of strain KYC5002 had an average nucleotide identity (ANI) of 92.18% with the genome of strain DSM 2261T (GenBank accession number: CP011509.1), the type strain of A. gephyra, and the 16S rRNA gene sequences of the two strains were 99.11% similar. Annotation using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) revision 6.6 (Tatusova et al., 2016) revealed that the genome comprises 10,298 protein-coding genes, 12 rRNA genes, 95 tRNA genes, 4 ncRNA genes, and 38 pseudogenes (Table 1).
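As a quick sanity check on the sequencing figures above, the mean read length and the nominal sequencing depth of each data set can be derived from the reported read counts, base counts, and genome size. The following is a minimal sketch, not part of the original analysis; the function and variable names are illustrative, and depth is taken simply as total bases divided by genome length, without accounting for unmapped or filtered reads.

```python
# Figures reported in the text; everything else here is illustrative.
GENOME_BP = 13_249_988          # circular chromosome length
HIFI_READS = 29_174             # PacBio HiFi read count
HIFI_BP = 221_452_323           # PacBio HiFi total bases
ILLUMINA_READS = 13_758_734     # Illumina short-read count
ILLUMINA_BP = 2_077_568_834     # Illumina total bases


def mean_read_length(total_bp: int, n_reads: int) -> float:
    """Average read length in bp."""
    return total_bp / n_reads


def nominal_depth(total_bp: int, genome_bp: int) -> float:
    """Nominal fold coverage: sequenced bases per genome base."""
    return total_bp / genome_bp


hifi_mean_len = mean_read_length(HIFI_BP, HIFI_READS)
illumina_mean_len = mean_read_length(ILLUMINA_BP, ILLUMINA_READS)
hifi_depth = nominal_depth(HIFI_BP, GENOME_BP)
illumina_depth = nominal_depth(ILLUMINA_BP, GENOME_BP)

print(f"HiFi:     mean read {hifi_mean_len:.0f} bp, depth {hifi_depth:.1f}x")
print(f"Illumina: mean read {illumina_mean_len:.0f} bp, depth {illumina_depth:.1f}x")
```

Under these assumptions the long reads provide the assembly scaffold at modest depth, while the much deeper short-read data set is what supports base-level polishing.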
Myxobacteria have many secondary metabolite biosynthetic gene clusters (BGCs) in their genomes (Hyun and Cho, 2018). Therefore, we analyzed the secondary metabolite biosynthetic genes present in the A. gephyra KYC5002 genome using the antiSMASH program (Blin et al., 2023). Fifty-two BGCs were detected in 46 regions, and the combined sequence length of these BGCs was 901,678 bp, or 6.81% of the genome. The genome was predicted to contain BGCs for argyrins, carotenoids, DKxanthenes, gephyronic acid, geosmins, microviridins, myxochellins, and tubulysins (Table 2). Genome-wide comparative analysis using the antiSMASH program showed that, of these BGCs, those for the biosynthesis of argyrins, DKxanthenes, gephyronic acid, and tubulysins were present only in the genome of A. gephyra KYC5002 and absent from the genome of the type strain A. gephyra DSM 2261T (Table 2). The whole genome sequence of A. gephyra KYC5002 is expected to be useful for studying the production of bioactive secondary metabolites.
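The share of the genome occupied by BGCs follows directly from the two lengths reported above. A minimal sketch of that arithmetic, with illustrative variable names that are not taken from the original analysis:

```python
# Lengths reported in the text.
GENOME_BP = 13_249_988   # total chromosome length
BGC_BP = 901_678         # combined length of the predicted BGCs

# Percentage of the genome covered by predicted BGCs.
bgc_percent = 100 * BGC_BP / GENOME_BP

print(f"BGCs cover {bgc_percent:.2f}% of the genome")
```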
The complete genome sequence of Archangium gephyra KYC5002 has been deposited in GenBank under the accession number CP137851. The strain was deposited in the Korean Collection for Type Cultures (KCTC) under the accession number KCTC14104BP.
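We analyzed the complete genome sequence of Archangium gephyra KYC5002, a myxobacterium that produces argyrins and tubulysins. The genome of strain KYC5002 was assembled into a circular genome of 13,249,988 bp with a G + C content of 68.8%. It contains 10,298 protein-coding genes, 12 rRNA genes, and 95 tRNA genes. In the genome of strain KYC5002, 53 secondary metabolite biosynthetic gene clusters were detected in 46 regions, and their combined length corresponded to 6.81% of the whole genome. The genome of strain KYC5002 was predicted to contain secondary metabolite biosynthetic gene clusters for the production of argyrins, carotenoids, DKxanthenes, gephyronic acid, geosmins, microviridins, myxochellins, and tubulysins.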
This research was supported by MECOX CureMed Co. and the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (2021R1I1A3044432).
The authors have no conflict of interest to report.