婴鲤属的全长转录组学分析
作者:
作者单位:

1.江西农业大学生物科学与工程学院 南昌 330045;2.南昌大学生命科学学院 南昌 330031;3.德国图宾根大学 德国图宾根 72074

作者简介:

叶昀,男,硕士研究生;研究方向:生物信息学;E-mail:yeyun@stu.jxau.edu.cn。

通讯作者:

中图分类号:

基金项目:

江西省杰出青年基金项目(No. 20232ACB215003),江西省自然科学基金面上项目(No. 20224BAB205013);


Full-Length Transcriptomic Analysis of the Genus Paedocypris
Author:
Affiliation:

1.School of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045; 2.School of Life Sciences, Nanchang University, Nanchang 330031, China; 3.University of Tübingen, Tübingen 72074, Germany

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    婴鲤属(Paedocypris)是世界上已知最小的鱼类,也是最小的脊椎动物之一。前期研究通过绘制P. carbunculusP. micromegethes的基因组草图和比较基因组学分析表明,Hox家族基因的丢失和内含子缩短/重复序列减少可能是分别导致婴鲤属体型小型化和基因组简化的潜在原因。然而,其体型小型化在转录表达层面上的分子机制尚未得到探究。本研究对P. carbunculusP. micromegethes样本分别开展全长转录组测序,获得两种婴鲤的全长转录组文库。经校正去冗余后,分别获得38 651和23 165条校正后共识序列,平均长度分别为2 413 bp和2 460 bp,这些序列分别有35 883(92.84%)和20 381(87.98%)条比对到了参考基因组上。对这两个物种全长转录组数据进行分类及特征分析,分别获得19 352和11 139条全长转录本。本研究在P. carbunculusP. micromegethes的全长转录本数据中分别检测到9 404和6 068个可变剪切事件,以A5型可变剪接事件为主。还观察到大多数基因(分别占74.55%和84.75%)只有一个可变多聚腺苷酸化位点。对转录因子进行注释和分类显示,在婴鲤属中数量最多的转录因子家族为zf-C2H2,其次是ZBTB。还在这两个婴鲤属物种中分别预测到1 382和1 649条长链非编码RNA。通过将婴鲤与鲤(Cyprinus carpio)、鲢(Hypophthalmichthys molitrix)、大西洋鲑(Salmo salar)和银大麻哈鱼(Oncorhynchus kisutch)的全长转录本数据进行比较,发现婴鲤Hox家族表达的基因数量远小于其他4种鱼,这表明在基因表达层面上,Hox家族基因的缺失可能也是从功能上导致婴鲤属体型小型化的原因之一。研究还发现婴鲤属表达更多与重要发育通路相关的基因,这些与发育功能相关的基因相比转录组中其他基因拥有更少的基因融合事件。本研究报道了婴鲤属两个物种的全长转录组全景图谱及相关分析结果,为进一步解析婴鲤体型小型化的分子机制提供了新的见解。

    Abstract:

    [Objectives] The genus Paedocypris comprises the smallest known fish species and one of the smallest vertebrates, with adult sizes ranging between 8﹣10 mm. Previous studies have indicated that the loss of Hox gene family members and the reduction of intron length and repetitive sequences might be potential causes for the miniaturization of the Paedocypris species and the simplification of their genomes, as demonstrated through draft genome assembly and comparative genomic analysis of P. carbunculus and P. micromegethes. The objectives of this study include characterizing the full-length transcriptome of the two species, P. carbunculus and P. micromegethes, identifying potential regulatory changes that could contribute to their miniaturized stature, with a particularly focus on Hox family and developmental gene expression patterns and gene structure. [Methods] All samples were purchased from an aquarium market in Guangzhou, comprising seven P. carbunculus and eleven P. micromegethes live specimens (Fig. 1). RNA was extracted from whole fish, and the purity, concentration, and integrity were tested. RNA from the seven P. carbunculus and the eleven P. micromegethes was mixed respectively and used for the construction of a full-length transcriptome library. The sequencing was performed on the PacBio Sequel platform. Quality control of full-length transcriptome sequencing data from the two Paedocypris species were performed. Then, we described the data quality, mapping statistics, annotation statistics and transcript classification of the full-length transcriptome data. By employing different software and pipelines, we annotated the gene structure, transcription factors, alternative splicing sites, long non-coding RNAs (LncRNAs), and gene fusions. To characterize the expression situation of Hox family genes in Paedocypris, we downloaded 81 zebrafish Hox family gene coding sequences belonging to 48 different Hox genes and align to the full-length transcriptome sequences of Paedocypris and other four species using Blastn with a threshold e-value < 1020. To explore the expression of genes in important developmental pathways in the Paedocypris genus, we selected seven key GO developmental pathways encompassing 1 581 non-redundant zebrafish genes and align with the full-length transcriptome data of Paedocypris and the other four species for analysis. To explore whether there are significant structural differences between the developmental-related genes mapped in the Paedocypris genus and other genes on the Paedocypris genome, based on the GO annotation information of the Paedocypris reference genome, we conducted significance tests (t-test) on the developmental-related genes mapped in the Paedocypris genus against all annotated genes on the Paedocypris reference genome for alternative splicing, polyadenylation, exon count, and fusion genes. [Results] In this study, we conducted full-length transcriptome sequencing on whole-body samples of P. carbunculus and P. micromegethes (Tables 1﹣4), obtaining the first full-length transcriptome library for Paedocypris. After correction and deduplication, 38 651 and 23 165 corrected consensus sequences were obtained, with average lengths of 2 413 bp and 2 460 bp, respectively. Among these sequences, 35 883 (92.84%) and 20 381 (87.98%) were mapped to the Paedocypris reference genome (Table 2). The classification and characterization analyses of the full-length transcriptome data for these two species resulted in 19 352 and 11 139 full-length transcripts, respectively (Tables 4 and 5). In the full-length transcriptome data of P. carbunculus and P. micromegethes, 9 404 and 6 068 alternative splicing events were identified, predominantly of the A5 type (Table 6). We also observed that the majority of genes (74.55% and 84.75%, respectively) had only one alternative polyadenylation site. Annotation and classification of transcription factors revealed that the most abundant transcription factor family in Paedocypris was zf-C2H2, followed by ZBTB. Additionally, we predicted 1 382 and 1 649 LncRNAs in the two Paedocypris species, respectively (Fig. 2). Comparative analysis of the full-length transcriptome data between Paedocypris and carp, silver carp, salmon, and rainbow trout revealed that the number of expressed genes in the Hox gene family in Paedocypris was significantly lower than that in the other four fish species (Fig. 3). We also identified that Paedocypris has the most expressed developmental genes belonging to the seven GO terms than the four fish species (Fig. 3). We found that those developmental-related genes have fewer gene fusion events than other genes in the full-length transcriptome of Paedocyrpis (Fig. 4). [Conclusion] We provide a full-length transcriptome landscape of the genus Paedocypris by performing full-length transcriptome sequencing. Data quality, mapping statistics, annotation statistics, and transcript classification of the full-length transcriptome data of Paedocypris are described. The analysis highlights that the loss of Hox genes at the expression level may be one of the functional reasons for the miniaturization of Paedocypris. This study provides new insights into the molecular mechanisms underlying the miniaturization of Paedocypris.

    参考文献
    相似文献
    引证文献
引用本文

叶昀,张素蓉,万柏祺,张亚琪,史文天,杨慧,张万昌.2025.婴鲤属的全长转录组学分析.动物学杂志,60(2):224-238.

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-03-06
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-04-25
  • 出版日期: