The Structure of
Scientific Papers
Pre-reading Task
Put
the following sections in the order that they appear in a scientific paper.
a) Results, b) Materials and Methods, c) Acknowledgements, d) Introduction, e) Abstract, f) Discussion, g) References
i) _____ ii) _____ iii) _____ iv) _____ v)
_____ vi) _____ vii) _____
Task
1:
Below are parts of six sections of a scientific paper. With a
partner, put them in order and label them in the chart below. Add any clues
that led you to your conclusions Remember, only a part of each section has been
reproduced. Also, in this paper the results and discussion section were
combined. The paper was titled
Bioinformatic analysis of exon
repetition, exon scrambling and trans-splicing in
humans.
by Xiang Shao, Valery Shepelev,
and Alexei Fedorov
Taken from Bioinformatics:Vol. 22 no. 6 2006, pages 692–698
Section
A:
In order to detect and to characterize exon scrambling (ExScram) and exon repetition (ExRep) events in
ESTs, we developed a program package, SCRAMBLING, for
comparison of all currently available human EST sequences (5 992 495) with the
entire set of 20 342 human intron-containing genes.
SCRAMBLING begins computations from establishing a correspondence between each
EST sequence and the gene it was produced from. As a result, 3 221 193 ESTs were linked unambiguously with their native genes.
Next, we mapped the beginning and the end of each EST onto the corresponding
mRNA sequence and calculated the length of the mRNA fragment between these two
mapped positions. If the length of this mRNA fragment differed from the length
of EST sequence by >40 nt,
that EST was processed further to test for possible ExScram
and ExRep events. The threshold of 40 nt is high enough to discard all
small variations in length due to inaccuracy of EST sequencing as well as most
of the cases of alternative splicing with alternative usage of nearby 50- or
30-splice sites. At the same time, the 40 nt
threshold is relatively low in comparison with the average human exon length (125 nt). So the vast
majority of known cases of ExScram and ExRep would exceed this threshold. When we obtained several
mapping positions of EST termini onto corresponding mRNA (due to exon duplication or repetitive segments within mRNA), we
considered all possible combinations between mapped EST beginnings and ends.
There were a total of 182 100 EST sequences that passed this initial selection
(Table 1).
Section
B:
We would like to thank Dr Robert Blumenthal,
Medical University of Ohio, for discussion and valuable suggestions on our
manuscript. Support for this work was provided by the Medical University of
Ohio Foundation and the Stranahan Foundation, through
the Program in Bioinformatics and Proteomics/Genomics. Conflict of Interest:
none declared.
Section
C:
Human genomic sequences were obtained from GenBank, Build 35.1. After processing this database with
the Exon-Intron Database (EID) toolkit (Saxonov et al., 2000), a comprehensive collection of human
gene, mRNA, and individual exon and intron sequences was obtained based on annotations in GenBank. This secondary database (human genome EID) is
publicly available from our EID website http://www.meduohio.edu/bioinfo/eid/
(file hs35p1.EID.tar.gz), and is accompanied with a description of the database(see README file at
http://www.meduohio.edu/bioinfo/eid/word/ README_Sept05.DOC). The advantages to
using EID in our project include a convenient representation of exon/intron gene structures and availability of each gene
sequence together with the related sequences of its mRNA, exons
and introns that share a common EID identifier. The
order of genes in EID strictly follows that of GenBank,
and therefore corresponds to the physical order of genes in chromosomes.
Several programs within the EID toolkit check the original input database and
report possible errors and problems with primary sources.
Section
D:
Excision of introns from
pre-mRNA is a complex process in which several types of small nuclear RNAs and several dozens of proteins assemble into a spliceosomal complex (Maniatis
and Reed, 2002). The precise recognition of exon–intron junctions by a spliceosome
is crucial for the production of functional mRNAs. However, there is often
ambiguity in the choice of exon–intron
junctions. This results in a process known as alternative splicing, which
occurs in _50% of mammalian genes and enables production of multiple mRNA isoforms from the same gene, often in a tissue-specific or
development-stage-specific manner (Modrek and Lee,
2002; Stamm, 2002). There are three other types of
splicing-associated peculiarities reported to result in abnormal exon arrangement within mRNA molecules. The first of these
is called exon repetition (ExRep)
and was first characterized by Caudevilla et al.
(1998).
Section E:
Akopian,A.N. et al. (1999)
Trans-splicing of a voltage-gated sodium channel is
regulated
by nerve growth factor. FEBS
Lett., 445, 177–182.
Benson,D.A. et al. (1999)
GenBank. Nucleic Acids Res.,
27, 12–17.
Boguski,M.S. et al. (1993)
dbEST—database for eexpressed sequence tagsf. Nat.
Genet., 4, 332–333.
Bonen,L. (1993) Trans-splicing of pre-mRNA in plants,
animals, and protists.
FASEB
J., 7, 40–46.
Caldas,C. et al. (1998)
Exon scrambling of MLL transcripts occur commonly and
mimic
partial genomic duplication of the gene. Gene, 208, 167–176.
Caudevilla,C. et al. (2001)
Localization of an exonic splicing enhancer
responsible for
mammalian
natural trans-splicing. Nucleic
Acids Res., 29, 3108–3115.
Section
F:
Motivation: Using bioinformatic
approaches we aimed to characterize poorly understood abnormalities in splicing
known as exon scrambling,exon repetition and trans-splicing.
Results: We developed a software package that allows
large-scale comparison of all human expressed sequence tags (EST) sequences to
the entire set of human gene sequences. Among 5 992 495 ESTsequences, 401 cases of exon
repetition and 416 cases of exon scramblingwere
found. The vast majority of identifiedESTs
contain fragments rather than full-length repeated or scrambled exons. Their structures suggest that the scrambled or
repeated exon fragments may have arisen in the
process of cDNA cloning and not from splicing
abnormalities. Nevertheless, we found 11 cases of full-length exon repetition showing that this phenomenon is real yet
very rare. In searching for examples of trans-splicing, we looked only at
reproducible events where at least two independent ESTs
represent the same putative trans-splicing event. We found 15 ESTs representing five types of putative trans-splicing.
However, all 15 cases were derived from human malignant tissues and could have
resulted from genomic rearrangements. Our results provide support for a very
rare but physiological occurrence of exon repetition,
but suggest that apparent exon scrambling and
trans-splicing result, respectively, from in vitro artifact and gene-level
abnormalities.
Order |
Label |
Clues |
1) Section |
|
|
2) Section |
|
|
3) Section |
|
|
4) Section |
|
|
5) Section |
|
|
6) Section |
|
|
Task
2: In groups discuss any features you think are found in each section.
********************************************************************************