Chromosome-level genome assembly of the pygmy grasshopper Eucriotettix oculatus (Orthoptera: Tetrigoidea)

Li, Ran; Qin, Yingcan; Rong, Wantao; Deng, Wei-an; Li, Xiaodong

doi:10.1038/s41597-024-03276-2

Download PDF

Data Descriptor
Open access
Published: 26 April 2024

Chromosome-level genome assembly of the pygmy grasshopper Eucriotettix oculatus (Orthoptera: Tetrigoidea)

Ran Li ORCID: orcid.org/0000-0001-8567-3527^1,2,
Yingcan Qin³,
Wantao Rong^1,3,
Wei-an Deng^1,4 &
…
Xiaodong Li^1,3

Scientific Data volume 11, Article number: 431 (2024) Cite this article

394 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The pygmy grasshoppers, which belong to the superfamily Tetrigoidea, exhibit remarkable environmental adaptability. However, no study has yet reported a reference genome for this group. In this study, we assembled a high-quality chromosome-scale genome of Eucriotettix oculatus, which survive in the environment heavily polluted by heavy metals, achieved through Illumina and PacBio sequencing, alongside chromosome conformation capture techniques. The resulting genome spans 985.45 Mb across seven chromosomes (range: 71.55 to 266.65 Mb) and features an N50 length of 123.82 Mb. Chr5 is considered to be the single sex chromosome (X). This genome is composed of 46.42% repetitive elements and contains 14,906 predicted protein-coding genes, 91.63% of which are functionally annotated. Decoding the E. oculatus genome not only promotes future studies on environmental adaptation for the pygmy grasshopper, but also provides valuable resources for in-depth investigation on phylogeny, evolution, and behavior of Orthoptera.

Chromosome-scale genome assemblies of Himalopsyche anomala and Eubasilissa splendida (Insecta: Trichoptera)

Article Open access 05 March 2024

The chromosome-level genome assembly of the giant dobsonfly Acanthacorydalis orientalis (McLachlan, 1899)

Article Open access 08 April 2024

Chromosome-level genome assembly of the caenogastropod snail Rapana venosa

Article Open access 16 August 2023

Background & Summary

The pygmy grasshoppers of the superfamily Tetrigoidea (Orthoptera) constitute a single cosmopolitan family, Tetrigidae, which is widely distributed throughout the world and has 287 genera within seven subfamilies^1,2,3. The species generally inhabit moist environments such as mountain streams, small rivers, swamps, grasslands and bushes, feed on humus, mosses, and lichen^1,4. Their distribution is highly dependent on specific natural environments, especially some are very sensitive to microhabitat changes, hence, they are important environmental indicator species^5,6. Meanwhile, the pygmy grasshoppers have complex behaviors and are generally regarded as ideal materials for behavioral researches^1,7. Some tetrigoid species have shown the ability to survive in environments contaminated with heavy metals⁸. However, studies of this family mainly focused on morphology, biology and ecology for the past decades, with a few studies on molecular mechanism of ecological and biological characteristics^9,10,11,12. The lack of genomic information has made it difficult to conduct in-depth investigations of the pygmy grasshoppers.

Eucriotettix oculatus (Bolivar, 1898) is a typical Oriental species belonging to the genus Eucriotettix in the family, widely distributed in the southern provinces of China and South Asia region (Fig. 1)¹³. This species has strong adaptability to different environments, and the population which lives in mining regions around the Diaojiang River (China) has been polluted for hundreds of years¹⁴. Our previous analysis showed the composition and diversity of the intestinal microbial community of E. oculatus was significantly reduced in heavy metal pollution¹⁵. Meanwhile, we also found that heavy metals could change the composition of metabolites in the intestine¹⁶. However, there is limited knowledge on molecular mechanisms that support the environmental adaptation of E. oculatus to heavy metal pollution due to the gaps in genomic information.

In the present study, we reported the first genome of a pygmy grasshopper in the superfamily Tetrigoidea including the determination of the X chromosome. The high-quality genome was de novo assembled using integrated technologies (Illumina sequencing, PacBio sequencing, as well as proximity ligation chromatin conformation capture) to assist in chromosome-level assembly. We successfully annotated the protein-coding genes (PCGs), repetitive elements (REs), and non-coding RNAs (ncRNAs) within the genome. This high-quality genome will be a valuable resource for in-depth studies on basic biological possesses and environmental adaptation of the pygmy grasshopper.

Methods

Animal materials

Specimens of E. oculatus were originally collected from a wild population in Yizhou, Guangxi, China, and subsequently maintained at Hechi University for further study. Only adult speciments were utilized for high-quality genomic DNA and RNA extraction. The female bodies were collected for Illumina and PacBio genome sequencing, and muscle tissues of legs were prepared for transcriptome and Hi-C sequencing.

Genome and transcriptome sequencing

Five female specimens were pooled and total DNA was then extracted using a Blood & Cell Culture DNA Mini Kit (Qiagen). DNA quantity and quality were finally measured by a 2100 Bioanalyzer (Agilent) and a Qubit 3.0 Fluorometer (Invitrogen), with integrity confirmed via 1% agarose gel electrophoresis. Whole-genome shotgun sequencing was performed for five female individuals with a single molecule real-time (SMRT) PacBio system. PacBio Sequel II libraries (insert size of 30 kb) were constructed with SMRTbell^TM Template Prep Kit 2.0. Additionally, two short paired-end libraries were prepared with Truseq DNA PCR-free kit, and short reads were yielded on the Illumina NovaSeq 6000 platform.

Muscle tissues of five female insects were collected for constructing pseudo-chromosomes. The Hi-C library was constructed according to the standard protocols described previously¹⁷. After quality control, 150 bp paired-end reads (PE150) were also generated by the Illumina NovaSeq 6000 platform. RNA of five female and male individuals (three biological replicates) was isolated using TRIzol Total RNA Isolation Kit (Takara). The cDNA library was built using TruSeq RNA Sample Prep Kit v2 (Illumina) and sequenced on the Illumina HiSeq 6000 platform using the paired-end strategy.

Genome size estimation

In order to get a preliminary understanding of the genome size and other genome characteristics, a total of 97.53 Gb Illumina reads of female individuals were firstly produced (Table 1). Quality control was performed using the BBTools v38.82 package¹⁸. The 21-mer distribution was calculated using “khist.sh” (BBTools), and the genome survey analysis was carried out using GenomeScope v2.0^19,20. Based on the k-mer distribution of the cleaned data, the genome size was fell within the range of 936.54–939.87 Mb (Fig. 2, Table 2). The genome size was determined to be 939.87 Mb with the number of unique k-mers peaked at 21.

Table 1 Statistics of the DNA sequence data used for genome assembly.

Full size table

Table 2 The information of genome survey analysis.

Full size table

Genome assembly

A total of 85.76 Gb PacBio long reads (~91.3-fold coverage of the estimated genome size) were obtained after removing adaptors in polymerase reads with default parameters. The mean length and N50 length of PacBio subreads were 14.39 and 25.17 kb, respectively (Table 1). After self-corrected and long read polished, genome initial assembly was performed using the Flye v2.7.1²¹. As a result, we generated a 1.06 Gb genome assembly with the contig N50 of 1.95 Mb (Table 3). The size of the primary assembled genome was significantly larger than the genome size estimated by k-mer analysis. To further improve the quality and accuracy, we corrected the genome by removing haplotigs and contig overlaps from the genome, and short-read polishing with high coverage of Illumina reads using Purge dups v1.0.1 and NextPolish v1.1.0, respectively^22,23. Total size of the draft genome assembly was 993.58 Mb with an N50 length of 2.3 Mb (Table 3). To produce the chromosome-level assembly, 114.23 Gb Hi-C sequencing data (380,773,738 reads) was generated and used to anchor contigs into pseudo-chromosomes with 3D-DNA v180922 pipeline²⁴. Juicebox v1.6.2 was subsequently employed to review and manually curate scaffolding errors²⁵. Finally, a high-quality chromosome-level genome assembly was generated after JBAT review. Approximately 279 million unique mapped reads (73.40%) and 126 million valid reads (33.10%) were produced. 973.09 Mb data on the base level was anchored and orientated onto 7 chromosomes with a mounting rate of up to 98.78%, and the chromosome lengths ranged from 71.55 to 266.65 Mb (Table 4). After scaffolds were clustered, ordered and orientated to restore their relative locations, the heatmap of chromosome crosstalk indicated that the genome assembly was robust and complete (Fig. 3). Finally, the size of this genome was 985.45 Mb, consisting of 248 scaffolds and 1,944 contigs with an N50 length of 123.82 and 2.09 Mb, respectively (Table 5). Results showed that the size of the assembled pygmy grasshopper genome is close to the genome the estimated size, suggesting that the non-redundant genome was appropriate.

Table 3 Summary of each step in construction of the E. oculatus genome assembly.

Full size table

Table 4 Statistics of chromosome-level genome assembly of E. oculatus.

Full size table

Table 5 Genome assembly and annotation statistics of E. oculatus.

Full size table

Sex chromosome determination

To identify the X chromosome of E. oculatus, resequencing for males produced a total of 39.4 Gb high-quality data with a mean Q30 of 93.5% (Table 1). The data was then mapped to 7 chromosomes, and the sequencing depth was used to identify the X chromosome. The results showed that the mean sequencing depth of Chr1-4, 6 and 7 was nearly two-fold greater than that of Chr5. The Chr5 was hence considered to be the X chromosome (Fig. 3, Table 4).

Repeat annotation

Repetitive elements (REs) were detected by two routine approaches, including ab initio and homology prediction. For ab initio prediction, RepeatModeler v2.0.1 was firstly used to identify the REs, and a de novo repeat sequence library was subsequently built using the results²⁶. Finally, a custom library was constructed combining with two databases (Dfam v3.1²⁷ and RepBase v20181026²⁸). For homology prediction, REs were masked by RepeatMasker v4.1.0 on the custom library²⁹. A total of 457.39 Mb REs were identified (constituting 46.42% of E. oculatus genome), including 45.03% transposable elements (TEs), 1.01% simple repeats, 0.16% low-complexity regions, and 0.15% small RNAs, 0.06% satellites (Fig. 4). The predominant 6 categories of TEs were unclassified (19.47%), long interspersed nuclear elements (LINEs, 15.61%), DNA transposon elements (5.78%), rolling-circles (RCs, 2.42%), long terminal repeats (LTRs, 1.06%), and short interspersed nuclear elements (SINEs, 0.69%).

All ncRNAs (rRNAs, snRNAs and miRNAs) were detected by Infernal v1.1.3³⁰ and tRNAscan-SE v2.0.7³¹, yielding 5,514 tRNAs (21 isotypes, Supres lacking), 37 small nuclear RNAs (snRNAs), 32 ribosomal RNAs (rRNAs), 21 micro RNAs (miRNAs), 1 small RNA (sRNA), and 28 other types of ncRNAs. The snRNAs were classified as 30 spliceosomal RNAs (U2, U4 and U6), 1 minor spliceosomal RNA (U6atac), 3 C/D box small nucleolar RNAs (snoRNAs), and 3 H/ACA box snoRNA.

Protein-coding gene annotation

MAKER v3.01.03 was employed with an integration of 3 strategies, including ab initio prediction, transcriptome-based and homology-based annotation³². The ab initio prediction was performed using BRAKER v2.1.5³³, which automatically trained the predictors Augustus v3.3.4³⁴ and GeneMark-ES/ET/EP 4.59_lic³⁵, and made use of the mapped transcriptome data and protein homology information. The transcriptome information in BAM alignments was generated by HISAT2 v2.2.0³⁶, and the protein sequences were extracted from the database OrthoDB10 v1³⁷. For transcriptome-based annotation, RNA-seq data were firstly mapped to our assembly using HISAT2, and the transcriptome information in BAM alignments was produced. BRAKER was then run with the default parameters. With our reference assembly, transcriptome data were further assembled into transcripts using StringTie v2.1.4³⁸. Protein sequences of three model insects (Drosophila melanogaster, Bombyx mori and Tribolium castaneum) and three representative species (Daphnia magna, Apis mellifera and Rhopalosiphum maidis) were downloaded from NCBI. Finally, MAKER was used to integrated the results of these three strategies using EVidenceModeler (EVM) pipeline v1.1.1³⁹, weight 1, 2 and 8 was assigned to ab initio, protein homology and transcriptome, respectively. Overall, 14,906 protein-coding genes were predicted (Table 5), and the average gene length was 15,352.03 bp and the average CDS length was 1,569.20 bp. The average exon number of per gene was 9.45, with average exon length of 288.38 bp and average intron length of 1542.64 bp. On the basis of BUSCO analysis, 95.2% of the BUSCO database (insecta_odb10) genes were identified (single-copy genes: 85.3%, duplicated genes: 9.9%, fragmented genes: 1.0%, missing genes: 3.8%), further underlining the accuracy and completeness of gene predictions (Table 5).

Diamond v0.9.24 was firstly used to search the existing database UniProtKB with the sensitive model to obtain gene functions⁴⁰. InterProScan v5.41–78.0⁴¹ was then used to screen proteins against the synthesis databases [Pfam, SMART, Gene3D, Superfamily, and Conserved Domain Database (CDD)] for predicting the protein domains. And eggNOG v5.0 database⁴² was searched for Gene Ontology (GO), Expression coherence (EC), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, KEGG orthologous groups (KOs), and clusters of orthologous groups (COG) functional category annotation of the predicted protein-coding genes. Out of the protein-coding genes predicted in the pygmy grasshopper genome, 13,659 (91.63%) genes were matched the UniProtKB database (SwissProt + TrEMBL) to be assigned functions. Integrated analysis identified the homology and conserved protein domains for 13,011 (87.29%) genes. 11,178 genes were classified according to GO terms, and 9,968 genes were mapped to the KEGG pathway database. In addition, 8,711 KEGG ko terms, 2,972 Enzyme Codes, 11,393 Reactome pathways and 12,619 COG categories were predicted.

Data Records

The genomic Illumina sequencing data were deposited in the NCBI Sequence Read Archive (SRA) database under accession No. SRR14826261⁴³ and SRR14826262⁴⁴.

The genomic Pacbio sequencing data were deposited in SRA database under accession No. SRR14843516⁴⁵.

The transcriptome Illumina sequencing data were deposited in SRA database under accession No. SRR14825792⁴⁶.

The Hi-C sequencing data were deposited in SRA database under accession No. SRR14827093⁴⁷.

The assembled genome was deposited in the GenBank at NCBI under accession No. JAEMUL000000000⁴⁸.

Genome annotation information of repeated sequences, gene structure and functional prediction is available in the Figshare database⁴⁹.

Technical Validation

The completeness and accuracy of the assembled genome were evaluated using two different strategies. First, BUSCO analysis showed that 97.4% (single-copied gene: 96.5%, duplicated gene: 0.9%) of 1,367 insect single-copy orthologues (in the insect_odb10 database) were successfully identified as complete, 0.6% were fragmented and 2.0% were missing in the assembly. Then, we mapped the sequencing data to the assembled genome for verifying the accuracy. The mapping rates was 94.92%, 93.62% and 96.77% for the Illumina, RNA-seq and PacBio data, respectively. Overall, the assessment results indicated that our E. oculatus genome assembly was complete, accuracy as well as high quality.

Code availability

No specific script was used in this work. The codes and pipelines used in data processing were all executed according to the manual and protocols of the corresponding bioinformatics software.

References

Deng, W. A. Taxonomic study of Tetrigoidea from China. Huazhong Agricultural University (2016).
Deng, W. A., Chen, D. N., Sheng, Q., Zhao, C. L. & Wu, F. P. An annotated catalogue of the pygmy grasshoppers of the genus Criotettix Bolívar, 1887 (Orthoptera: Tetrigidae) with two new Criotettix species from China. Zootaxa 4629, zootaxa-4629.4.2 (2019).
Article PubMed Google Scholar
Cigliano, M. M., Braun, H., Eades, D. C. & Otte, D. Orthoptera Species File http://orthoptera.speciesfile.org (2024).
Wei, S. Z., Xin, L. & Deng, W. A. Pygmy grasshoppers of the genus Paragavialidium Zheng, 1994 (Orthoptera: Tetrigoidea: Scelimeninae). Orient Insects 53, 449–469 (2019).
Article Google Scholar
Tan, M. K., Yeo, H. & Hwang, W. S. Ground dwelling pygmy grasshoppers (Orthoptera: Tetrigidae) in Southeast Asian tropical freshwater swamp forest prefer wet microhabitats. J. Orthoptera Res. 1, 73–80 (2017).
Article Google Scholar
Li, R., Ying, X., Deng, W. A., Rong, W. T. & Li, X. D. Mitochondrial genomes of eight Scelimeninae species (Orthoptera) and their phylogenetic implications within Tetrigoidea. PeerJ 9, e10523 (2021).
Article PubMed PubMed Central Google Scholar
Hochkirch, A. et al. A field study of the escape behaviour of Tetrix subulata (Linnaeus, 1758) and Tetrix tenuicornis (Sahlberg, 1893) (Orthoptera: Tetrigidae). Articulata 17, 19–31 (2002).
Google Scholar
Warchałowska-Śliwa, E., Niklińska, M., Görlich, A., Michailova, P. & Pyza, E. Heavy metal accumulation, heat shock protein expression and cytogenetic changes in Tetrix tenuicornis (L.) (Tetrigidae, Orthoptera) from polluted areas. Environ. Pollut. 133, 373–81 (2005).
Article PubMed Google Scholar
Subedi, M. & Kasalo, N. Aryalidonta itishreea, a new genus and species of Thoradontini (Orthoptera, Tetrigidae) from Nepal honors the Emperor of Laughter. J. Orthoptera Res. 32 (2023).
Gao, G. Z., Liu, P. Y. & Yin, Z. Description of a new species of the genus Tetrix Latreille (Orthoptera: Tetrigoidea: Tetrigidae) from Zhejiang, China. Zootaxa 5138, 347–350 (2022).
Article PubMed Google Scholar
Kasalo, N., Naskrecki, P., Rebrina, F. & Skejo, J. Central American Tetrigidae Rambur, 1838 (Orthoptera): a preliminary catalogue. Zoosystema 45, 177–212 (2023).
Article Google Scholar
Pan, Z. X., Hong, F. & Jiang, G. F. Morphometrics reveal correlation between morphology and bioclimatic factors and population mixture in Tetrix japonica (Orthoptera: Tetrigidae). Acta Zoologica 99, 199–210 (2018).
Article Google Scholar
Zheng, Z. M. Fauna of Tetrigoidea from Western China (Science Press, 2005).
Xiao, S., Cui, P., Li, X. D., Deng, W. A. & Rong, W. T. Life history and biological characteristics of Eucriotettix oculatus. J. Environ. Entomol. 41, 1366–1374 (2019).
Google Scholar
Li, X. D. et al. Effect of heavy metals pollution on the composition and diversity of the intestinal microbial community of a pygmy grasshopper (Eucriotettix oculatus). Ecotox. Environ. Safe 223, 112582 (2021).
Article ADS CAS Google Scholar
Rong, W. T. et al. Effects of combined pollution of heavy metals on the metabolomics of Eucriotettix oculatus. Zcta. Entomol. Sinica 65, 437–450 (2022).
Google Scholar
Belton, J. M. et al. Hi-C: a comprehensive technique to capture the conformation of genomes. Methods 58, 268–276 (2012).
Article CAS PubMed Google Scholar
Bushnell, B. BBMap Download. SourceForge.net https://sourceforge.net/projects/bbmap/ (2014).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
Article CAS PubMed PubMed Central Google Scholar
Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).
Article PubMed PubMed Central Google Scholar
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
Article CAS PubMed Google Scholar
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC bioinformatics 19, 1–10 (2018).
Article Google Scholar
Hu, J. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2020).
Article CAS PubMed Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Article CAS PubMed PubMed Central Google Scholar
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. PNAS 117, 9451–9457 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hubley, R. et al. The Dfam database of repetitive DNA families. Nucleic Acids Res. 44, D81–89 (2016).
Article CAS PubMed Google Scholar
Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA 6, 1–6 (2015).
Article Google Scholar
Smith, A., Hubley, R. & Green, P. RepeatMasker https://www.repeatmasker (2023).
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chan, P. P. & Lowe, T. M. tRNAscan-SE: searching for tRNA genes in genomic sequences. Gene prediction: methods and protocols 1–4 (2019).
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC bioinformatics 12, 1–4 (2011).
Article Google Scholar
Hoff, K. J., Lange, S., Lomsadze, A., Borodovsky, M. & Stanke, M. BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32, 767–769 (2016).
Article CAS PubMed Google Scholar
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 32, W309–312 (2004).
Article CAS PubMed PubMed Central Google Scholar
Brůna, T., Lomsadze, A. & Borodovsky, M. GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genom. Bioinform. 2, lqaa026 (2020).
Article PubMed PubMed Central Google Scholar
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kriventseva, E. V. et al. OrthoDB v10: sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs. Nucleic Acids Res. 47, D807–811 (2019).
Article CAS PubMed Google Scholar
Kovaka, S. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 20, 1–3 (2019).
Article Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, 1–22 (2008).
Article Google Scholar
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).
Article CAS PubMed Google Scholar
Finn, R. D. et al. InterPro in 2017-beyond protein family and domain annotations. Nucleic Acids Res. 45, D190–199 (2017).
Article CAS PubMed Google Scholar
Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 47, D309–314 (2019).
Article CAS PubMed Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR14826261 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR14826262 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR14843516 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR14825792 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR14827093 (2023).
Li, R. & Li, X.-D. Eucriotettix oculatus isolate LXD-2020, whole genome shotgun sequencing project. GenBank https://identifiers.org/nucleotide:JAEMUL000000000 (2023).
Li, R. & Li, X.-D. Chromosome-level genome assembly of the pygmy grasshopper Eucriotettix oculatus (Orthoptera: Tetrigoidea). Figshare https://doi.org/10.6084/m9.figshare.15029535 (2023).

Download references

Acknowledgements

The project was supported by the National Natural Science Foundation of China (Grant No. 32160122, 31702049 and 31960111), the Guangxi Natural Science Foundation (Grant No. 2023GXNSFDA026037), the Starting Project of High-level Talents Scientific Research in Hechi University (2021GCC016) and the high level Innovation team and Outstanding Scholars Program of Guangxi Colleges and Universities.

Author information

Authors and Affiliations

Guangxi Key Laboratory of Sericulture Ecology and Applied Intelligent Technology, Hechi University, Hechi, China
Ran Li, Wantao Rong, Wei-an Deng & Xiaodong Li
College of Life Sciences, Qufu Normal University, Qufu, China
Ran Li
School of Chemistry and Bioengineering, Hechi University, Yizhou, China
Yingcan Qin, Wantao Rong & Xiaodong Li
College of Life Sciences, Guangxi Normal University, Guilin, China
Wei-an Deng

Authors

Ran Li
View author publications
You can also search for this author in PubMed Google Scholar
Yingcan Qin
View author publications
You can also search for this author in PubMed Google Scholar
Wantao Rong
View author publications
You can also search for this author in PubMed Google Scholar
Wei-an Deng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaodong Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Li R., Deng W.A. and Li X.D. conceived and designed the research. Li R., Qing Y.C. and Rong W.T. collected the samples and extracted the genomic DNA. Li R. and Li X.D. conducted the experiments, analyzed the data, and wrote the manuscript. All authors read, revised and approved the final version of the manuscript.

Corresponding authors

Correspondence to Wei-an Deng or Xiaodong Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, R., Qin, Y., Rong, W. et al. Chromosome-level genome assembly of the pygmy grasshopper Eucriotettix oculatus (Orthoptera: Tetrigoidea). Sci Data 11, 431 (2024). https://doi.org/10.1038/s41597-024-03276-2

Download citation

Received: 24 January 2024
Accepted: 17 April 2024
Published: 26 April 2024
DOI: https://doi.org/10.1038/s41597-024-03276-2