The structure of scientific papers

Pre-reading Task:
Put the following sections in the order that they appear in a scientific paper.

a) Results, b) Materials and Methods, c) Acknowledgements, d) Introduction, e) Abstract, f) Discussion, g) References
i) _____ ii) _____ iii) _____ iv) _____ v) _____ vi) _____ vii) _____

Task 1: Below are parts of six sections of a scientific paper. With a partner, put them in order and label them in the chart below. Add any clues that led you to your conclusions Remember, only a part of each section has been reproduced. Also, in this paper the results and discussion section were combined. The paper was titled -
Bioinformatic analysis of exon repetition, exon scrambling and trans-splicing in humans.
by Xiang Shao, Valery Shepelev, and Alexei Fedorov
Taken from Bioinformatics:Vol. 22 no. 6 2006, pages 692–698
Section A:
In order to detect and to characterize exon scrambling (ExScram) and exon repetition (ExRep) events in ESTs, we developed a program package, SCRAMBLING, for comparison of all currently available human EST sequences (5 992 495) with the entire set of 20 342 human intron-containing genes. SCRAMBLING begins computations from establishing a correspondence between each EST sequence and the gene it was produced from. As a result, 3 221 193 ESTs were linked unambiguously with their native genes. Next, we mapped the beginning and the end of each EST onto the corresponding mRNA sequence and calculated the length of the mRNA fragment between these two mapped positions. If the length of this mRNA fragment differed from the length of EST sequence by >40 nt, that EST was processed further to test for possible ExScram and ExRep events. The threshold of 40 nt is high enough to discard all small variations in length due to inaccuracy of EST sequencing as well as most of the cases of alternative splicing with alternative usage of nearby 50- or 30-splice sites. At the same time, the 40 nt threshold is relatively low in comparison with the average human exon length (125 nt). So the vast majority of known cases of ExScram and ExRep would exceed this threshold. When we obtained several mapping positions of EST termini onto corresponding mRNA (due to exon duplication or repetitive segments within mRNA), we considered all possible combinations between mapped EST beginnings and ends. There were a total of 182 100 EST sequences that passed this initial selection (Table 1).

Section B:
We would like to thank Dr Robert Blumenthal, Medical University of Ohio, for discussion and valuable suggestions on our manuscript. Support for this work was provided by the Medical University of Ohio Foundation and the Stranahan Foundation, through the Program in Bioinformatics and Proteomics/Genomics. Conflict of Interest: none declared.

Section C:
Human genomic sequences were obtained from GenBank, Build 35.1. After processing this database with the Exon-Intron Database (EID) toolkit (Saxonov et al., 2000), a comprehensive collection of human gene, mRNA, and individual exon and intron sequences was obtained based on annotations in GenBank. This secondary database (human genome EID) is publicly available from our EID website http://www.meduohio.edu/bioinfo/eid/ (file hs35p1.EID.tar.gz), and is accompanied with a description of the database(see README file at http://www.meduohio.edu/bioinfo/eid/word/ README_Sept05.DOC). The advantages to using EID in our project include a convenient representation of exon/intron gene structures and availability of each gene sequence together with the related sequences of its mRNA, exons and introns that share a common EID identifier. The order of genes in EID strictly follows that of GenBank, and therefore corresponds to the physical order of genes in chromosomes. Several programs within the EID toolkit check the original input database and report possible errors and problems with primary sources.

Section D:
Excision of introns from pre-mRNA is a complex process in which several types of small nuclear RNAs and several dozens of proteins assemble into a spliceosomal complex (Maniatis and Reed, 2002). The precise recognition of exon–intron junctions by a spliceosome is crucial for the production of functional mRNAs. However, there is often ambiguity in the choice of exon–intron junctions. This results in a process known as alternative splicing, which occurs in _50% of mammalian genes and enables production of multiple mRNA isoforms from the same gene, often in a tissue-specific or development-stage-specific manner (Modrek and Lee, 2002; Stamm, 2002). There are three other types of splicing-associated peculiarities reported to result in abnormal exon arrangement within mRNA molecules. The first of these is called exon repetition (ExRep) and was first characterized by Caudevilla et al. (1998).

Section E:
Akopian,A.N. et al. (1999) Trans-splicing of a voltage-gated sodium channel is regulated by nerve growth factor. FEBS Lett., 445, 177–182.
Benson,D.A. et al. (1999) GenBank. Nucleic Acids Res., 27, 12–17.
Boguski,M.S. et al. (1993) dbEST—database for ‘expressed sequence tags’. Nat. Genet., 4, 332–333.
Bonen,L. (1993) Trans-splicing of pre-mRNA in plants, animals, and protists. FASEB J., 7, 40–46.
Caldas,C. et al. (1998) Exon scrambling of MLL transcripts occur commonly and mimic partial genomic duplication of the gene. Gene, 208, 167–176.
Caudevilla,C. et al. (2001) Localization of an exonic splicing enhancer responsible for mammalian natural trans-splicing. Nucleic Acids Res., 29, 3108–3115.

Section F:
Motivation: Using bioinformatic approaches we aimed to characterize poorly understood abnormalities in splicing known as exon scrambling,exon repetition and trans-splicing.

Results: We developed a software package that allows large-scale comparison of all human expressed sequence tags (EST) sequences to the entire set of human gene sequences. Among 5 992 495 ESTsequences, 401 cases of exon repetition and 416 cases of exon scramblingwere found. The vast majority of identifiedESTs contain fragments rather than full-length repeated or scrambled exons. Their structures suggest that the scrambled or repeated exon fragments may have arisen in the process of cDNA cloning and not from splicing abnormalities. Nevertheless, we found 11 cases of full-length exon repetition showing that this phenomenon is real yet very rare. In searching for examples of trans-splicing, we looked only at reproducible events where at least two independent ESTs represent the same putative trans-splicing event. We found 15 ESTs representing five types of putative trans-splicing. However, all 15 cases were derived from human malignant tissues and could have resulted from genomic rearrangements. Our results provide support for a very rare but physiological occurrence of exon repetition, but suggest that apparent exon scrambling and trans-splicing result, respectively, from in vitro artifact and gene-level abnormalities.

 
Order
Label
Clues
1 Section

2 Section

3 Section

4 Section

5 Section

6 Section


Task 2: In groups discuss any features you think are found in each section.