盐源山溪鲵转录组组装与分析

[2]

Liu

C C

.

Amphibians of Western China[M]. Chicago: Chicago Natural History Museum, 1950:83-86.

[3]

黄棨通, 龚大洁, 张海军, 等.

我国山溪鲵属分布及保护对策

[J]. 野生动物学报, 2017, 38(4):682-688.

[4]

赵尔宓.

中国濒危动物红皮书——两栖类和爬行类[M]. 北京: 科学出版社, 1998.

[5]

Mohamed

S

, Caird

E R

, Wang

J N

, et al.

Characterization of the rainbow trout transcriptome using sanger and 454-pyrosequencing approach

[J]. BMC Genomics, 2010, 11:564.

Background: Rainbow trout are important fish for aquaculture and recreational fisheries and serves as a model species for research investigations associated with carcinogenesis, comparative immunology, toxicology and evolutionary biology. However, to date there is no genome reference sequence to facilitate the development of molecular technologies that utilize high throughput characterizations of gene expression and genetic variation. Alternatively, transcriptome sequencing is a rapid and efficient means for gene discovery and genetic marker development. Although a large number (258,973) of EST sequences are publicly available, the nature of rainbow trout duplicated genome hinders assembly and complicates annotation. Results: High-throughput deep sequencing of the Swanson rainbow trout doubled-haploid transcriptome using 454-pyrosequencing technology yielded similar to 1.3 million reads with an average length of 344 bp, a total of 447 million bases. De novo assembly of the sequences yielded 151,847 Tentative Consensus (TC) sequences (average length of 662 bp) and 224,391 singletons. A combination assembly of both the 454-pyrosequencing ESTs and the preexisting sequences resulted in 161,818 TCs (average length of 758 bp) and 261,071 singletons. Gene Ontology analysis of the combination assembly showed high similarities to transcriptomes of other fish species with known genome sequences. Conclusion: The 454 library significantly increased the suite of ESTs available for rainbow trout, allowing improved assembly and annotation of the transcriptome. Furthermore, the 454 sequencing enables functional genome research in rainbow trout, providing a wealth of sequence data to serve as a reference transcriptome for future studies including identification of paralogous sequences and/or allelic variation, digital gene expression and proteomic research.

[6]

Han

X F

, Ling

Q F

, Li

C J

, et al.

Characterization of pikeperch (Sander lucioperca) transcriptome and development of SSR markers

[J]. Biochemical Systematics and Ecology, 2016, 66:188-195.

[7]

Cao

S M

, Zhu

L J

, Nie

H T

, et al.

De novo assembly,gene annotation,and marker development using Illumina paired-end transcriptome sequencing in the Crassadoma gigantean

[J]. Gene, 2018, 658:54-62.

[8]

Jia

Z Y

, Wang

Q A

, Wu

K K

, et al.

De novo transcriptome sequencing and comparative analysis to discover genes involved in ovarian maturity in Strongylocentrotus nudus

[J]. Comparative Biochemistry and Physiology Part D:Genomics and Proteomics, 2017, 23:27-38.

[9]

Li

Y

, Zhou

Z

, Tian

M

, et al.

Exploring single nucleotide polymorphism (SNP),microsatellite (SSR) and differentially expressed genes in the jellyfish (Rhopilema esculentum) by transcriptome sequencing

[J]. Marine Genomics, 2017, 34:31-37.

[10]

张亚男, 熊建利, 刘强强, 等.

龙洞山溪鲵的血细胞组成及血红蛋白含量检测

[J]. 动物学杂志, 2018, 53(1):75-81.

[11]

黄敏毅, 张育辉, 王宏元.

北方山溪鲵外周血细胞的组织学观察

[J]. 陕西师范大学学报(自然科学版), 2004, 32(3):87-90.

[12]

张寒珍, 刘绍龙, 赵云, 等.

山溪鲵的骨骼系统

[J]. 四川动物, 2009, 28(3):412-416.

[13]

刘炯宇, 江建平, 何开泽, 等.

山溪鲵皮肤分泌物抗菌活性的初步研究

[J]. 天然产物研究与开发, 2004, 16(5):415-419.

[14]

李亚琳, 张育辉.

雌二醇及其受体在北方山溪鲵精巢中的周期性分布

[J]. 西北农林科技大学学报(自然科学版), 2007, 35(2):58-62.

[15]

李悦, 吴敏, 王秀玲.

小鲵科线粒体16S rRNA基因序列分析及其系统发育

[J]. 动物学报, 2004, 50(3):464-469.

[16]

Dobin

A

, Davis

C A

, Schlesinger

F

, et al.

STAR:ultrafast universal RNA-seq aligner

[J]. Bioinformatics, 2013, 29(1):15-21.

Motivation: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases.

[17]

Pertea

M

, Pertea

G M

, Antonescu

C M

, et al.

StringTie enables improved reconstruction of a transcriptome from RNA-seq reads

[J]. Nature Biotechnology, 2015, 33(3):290-295.

Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.

[18]

Altschul

S F

, Madden

T L

, Schäffer

A A

, et al.

Gapped BLAST and PSI-BLAST:a new generation of protein database search programs

[J]. Nucleic Acids Research, 1997, 25(17):3389-3402.

The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

[19]

Grabherr

M G

, Haas

B J

, Yassour

M

, et al.

Trinity:reconstructing a full-length transcriptome without a genome from RNA-Seq data

[J]. Nature Biotechnology, 2011, 29(7):644-652.

Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

[20]

Finseth

F R

, Harrison

R G

.

A comparison of next generation sequencing technologies for transcriptome assembly and utility for RNA-Seq in a non-model bird

[J]. PLoS One, 2014, 9 (10):e108550.

DOI URL [本文引用: 2]

[21]

Liu

S

, Wang

X

, Sun

F

, et al.

RNA-Seq reveals expression signatures of genes involved in oxygen transport,protein synthesis,folding,and degradation in response to heat stress in catfish

[J]. Physiological Genomics, 2013, 45 (12):462-476.

Temperature is one of the most prominent abiotic factors affecting ectotherms. Most fish species, as ectotherms, have extraordinary ability to deal with a wide range of temperature changes. While the molecular mechanism underlying temperature adaptation has long been of interest, it is still largely unexplored with fish. Understanding of the fundamental mechanisms conferring tolerance to temperature fluctuations is a topic of increasing interest as temperature may continue to rise as a result of global climate change. Catfish have a wide natural habitat and possess great plasticity in dealing with environmental variations in temperature. However, no studies have been conducted at the transcriptomic level to determine heat stress-induced gene expression. In the present study, we conducted an RNA-Seq analysis to identify heat stress-induced genes in catfish at the transcriptome level. Expression analysis identified a total of 2,260 differentially expressed genes with a cutoff of twofold change. qRT-PCR validation suggested the high reliability of the RNA-Seq results. Gene ontology, enrichment, and pathway analyses were conducted to gain insight into physiological and gene pathways. Specifically, genes involved in oxygen transport, protein folding and degradation, and metabolic process were highly induced, while general protein synthesis was dramatically repressed in response to the lethal temperature stress. This is the first RNA-Seq-based expression study in catfish in response to heat stress. The candidate genes identified should be valuable for further targeted studies on heat tolerance, thereby assisting the development of heat-tolerant catfish lines for aquaculture.

[22]

罗辉, 叶华, 肖世俊, 等.

转录组学技术在水产动物研究中的运用

[J]. 水产学报, 2015, 39(4):598-607.

[23]

Yang

Z Z

, Wafula

E K

, Honaas

L A

, et al.

Comparative transcriptome analyses reveal core parasitism genes and suggest gene duplication and repurposing as sources of structural novelty

[J]. Molecular Biology & Evolution, 2015, 32(3):767-790.

[24]

岳华梅, 翟晴, 宋明月, 等.

基于转录组测序的兴国红鲤微卫星标记筛选

[J]. 淡水渔业, 2016, 46(1):24-28.

[25]

Zhou

X X

, Wang

H D

, Cui

J

.

Transcriptome analysis of tube foot and large-scale marker discovery in sea cucumber,Apostichopus japonicas

[J]. Comparative Biochemistry and Physiology Part D:Genomics and Proteomics, 2016, 20:41-49.

[26]

Huang

Y

, Xiong

J L

, Gao

X C

, et al.

Transcriptome analysis of the Chinese giant salamander (Andrias davidianus) using RNA-sequencing

[J]. Genomics Data, 2017, 14:126-131.