How To Write and Publish An Original Research Article: Education
How To Write and Publish An Original Research Article: Education
org
EDUCATION
How to write and publish an original research article
Anthony M. Vintzileos, MD; Cande V. Ananth, PhD, MPH
FIGURE
Framework of the peer-reviewed article reflecting the ideal flow of the paper and its connecting 4 main parts
Study Objective(s)
Study design
1. Type of study
Conclusion(s) 2. IRB approval
1. Clear statement what the new knowledge is 3. Demographics
2. Comparison with previous studies 4. Inclusion and exclusion criteria
3. Possible explanation(s) 5. Description of procedures and
4. Strengths and weaknesses tests
5. Direction of future research 6. Definition of exposures and
outcomes (primary and secondary)
7. Sample size calculation
8. Types of measurements
Results
Inclusion of all (primary and secondary) outcomes
and measures
Vintzileos. How to write and publish an original research article. Am J Obstet Gynecol 2010.
Many busy journal readers frequently Introduction mentation of institutional review board
read the condensation first even before The introduction is 1 of the most critical (IRB) approval and type of consent ob-
deciding whether to read the abstract or parts of the article because it sets the tone tained, if applicable; (3) demographics of
the paper. for the reader and the reviewer. Ideally, the study cohort, if applicable; (4) inclu-
the introduction should contain 2 com- sion and/or exclusion criteria; (5) clear
Abstract
ponents, preferably expressed in 2 para- description of any procedures or tests;
The abstract is the “mirror” of the full
graphs. The first paragraph should sum- (6) clear definitions of exposures, and
manuscript. There is no doubt that the
marize the background information primary and secondary outcomes; (7)
abstract is the most widely read part of
the article by both those who peruse leading to a rationale for performing the sample size calculation based on primary
through the journal, as well as those who study, justifying the need for the study, outcome; (8) types of measurements
perform electronic literature searches. and clarifying the new information that used; and (9) methods of statistical anal-
The abstract should be structured with the study aims to offer. The first para- ysis and level of significance.
specifically designed headings focusing graph should convince the reader that In our computerized age, there is
the writer (and the reader) to the 4 main the study is not a “fishing expedition” widespread use of statistical software
elements, namely, objective(s), study de- but it is based on a solid background with among young academic physicians and
sign, results, and conclusion(s). The au- respect to plausibility. The second para- scientists. Some of the commonly used
thor has the opportunity to use succinct graph should clearly state the study ob- statistical software programs are SAS
language to summarize the paper by fol- jective (or the hypothesis). (http://www.sas.com/), SPSS (http://
lowing 2 principles: (1) the abstract www.spss.com/), STATA (http://www.
should be clear enough to be understood Materials and Methods stata.com/), R (http://www.r-project.
by a reader who may not read the rest of (or Patients and Methods) org/), SPlus (http://www.splus.com/),
the article, and (2) the abstract should The Materials and Methods (or Patients and MINITAB (http://www.minitab.
not be too long (word limitation varies and Methods) section should include de- com/). Tables 1 and 2 can serve as a quick
from journal to journal). scriptions of (1) study design; (2) docu- reference to the choice of the appropriate
TABLE 1
Appropriate statistical tests for continuous, ordinal, and categorical data
Comparison Continuous data Ordinal data Categorical data
2 groups of different subjects Unpaired t testa or Z-testa Mann-Whitney rank sum testb 2 or Fisher’s exact testb
................................................................................................................................................................................................................................................................................................................................................................................
ⱖ3 groups of different subjects ANOVA a
Kruskal-Wallis test b
or Fisher’s exact test
2 b
................................................................................................................................................................................................................................................................................................................................................................................
Same subjects (before/after treatment) Paired (matched) t test a
Wilcoxon-signed rank test b
McNemar’s test 2 b
................................................................................................................................................................................................................................................................................................................................................................................
a b b
Same subjects (ⱖ3 treatments) Repeated measures ANOVA Friedman test Cochrane Q test
................................................................................................................................................................................................................................................................................................................................................................................
a a b
Association between 2 variables Linear regression/correlation Spearman correlation Contingency coefficient
................................................................................................................................................................................................................................................................................................................................................................................
ANOVA, analysis of variance.
a
Indicates test for parametric data; b indicates test for nonparametric data.
Vintzileos. How to write and publish an original research article. Am J Obstet Gynecol 2010.
statistical test(s) according to the type of to establish the accuracy of a test there influenced by the prevalence of the dis-
the data to be analyzed and compared. should be a phase of observation (natu- ease and that only positive and negative
The descriptive statistics and methods ral history) without an intervention that predictive values are influenced by the
of statistical analysis should be carefully can alter the outcome. If there is an in- prevalence of the disease. However, this
determined based on the type of data to tervention as a result of an abnormal test principle is true only if the test condi-
be analyzed. Data can be continuous (in- result, which possibly can alter the out- tions are “fixed” and the results are re-
terval or linear), ordinal, or categorical. come of interest, then the accuracy of the producible. If the test conditions are not
In descriptive statistics, continuous data test cannot be established. The classical fixed or if the results of the test are sub-
that are assumed to follow a normal dis- example of this scenario is our inability jective, then sensitivity and specificity
tribution are often expressed as “mean to truly establish the accuracy of intra- are definitely influenced by the preva-
and standard deviation,” continuous partum fetal heart rate monitoring be- lence of disease. Here, the classical exam-
data without normal distribution or or- cause in real life, for ethical reasons, an ple is the performance of an obstetric ul-
dinal data are expressed as “median and “abnormal” fetal heart rate pattern will trasound to rule out fetal anomalies. If
interquartiles or ranges” and categorical most likely lead to intervention (ie, in- the a priori risk for a fetal anomaly is high
data are expressed as proportions strumental or cesarean delivery), which (high prevalence of disease), the exam-
(percent). may have prevented the “bad” outcome. iner will most likely pay much higher at-
Unfortunately, peer-reviewed articles Nevertheless, in such cases, the fetal tention and spend more time to visualize
frequently use inappropriate descriptive heart rate monitoring “abnormality” completely all fetal structures. On the
statistics. For instance, it is not infre- may be erroneously perceived by some as contrary, in a routine sonogram on
quent for obstetric articles to use Apgar a “false-positive test.” Another area that women with low a priori risk (low prev-
scores or parity (or gravidity) as contin- authors and readers should be aware of is alence of disease), it is expected that the
uous variables and express them as the effect that “prevalence” of disease has ultrasound examiner may not exercise
“mean” Apgar scores (or mean parity/ on the diagnostic accuracy of a test. The the same degree of scrutiny in the ultra-
gravidity). Examples include a mean traditional teaching is that the sensitivity sound examinaton. Thus, in the first case
1-minute Apgar score of 7.5 or parity 1.5. and specificity of the “disease” are not (high prevalence of disease), the sensitiv-
However, Apgar scores, as well as parity
(or gravidity) are often not normally dis-
tributed. In addition, in real life, Apgar TABLE 2
scores or parity/gravidity are not ex- Appropriate regression analyses according to the type of data
pressed with decimals. Therefore, it is
Data Type of regression analysis
more appropriate to describe such vari-
Dependent (Y) variable (outcome) is continuous Linear regression
ables by using median and ranges. ..............................................................................................................................................................................................................................................
One of the most frequently used statis- Powers of independent (X) variables Polynomial regression
..............................................................................................................................................................................................................................................
tical methods with respect to the accu- ⱖ2 independent (X) variables Multiple regression
..............................................................................................................................................................................................................................................
racy of a diagnostic test is the determina- Selection of best set of independent (X) variables Stepwise regression
tion of its sensitivity, specificity, and ..............................................................................................................................................................................................................................................
Dependent (Y) variable (outcome) has 2 categories Logistic regression
positive and negative predictive values. ..............................................................................................................................................................................................................................................
The thorough and clear understanding Dependent (Y) variable (outcome) has ⱖ3 categories Polytomous logistic regression
of these terms is an absolute necessity for or discriminant analysis
..............................................................................................................................................................................................................................................
both the author, as well as the reader of a Vintzileos. How to write and publish an original research article. Am J Obstet Gynecol 2010.
peer-reviewed article. It is axiomatic that
ity of the (ultrasound) test to identify fe- table or figure, it is a good idea to con- have authors who have completed simi-
tal anomalies is higher as compared with sider that each table or figure is a candi- lar work and therefore, they are consid-
the second case (low prevalence of dis- date for reproduction by another author ered experts.
ease). Thus, it appears that the condi- for another publication. The text and ta- At this point, it should be emphasized
tions of the (ultrasound) test are not bles should not contain raw numbers that for studies requiring IRB submis-
fixed because there is intraobserver vari- without percentages or percentages sion and approval, and almost all studies
ation in the accuracy of the test depend- without the raw numbers. The numbers do, the introduction, materials and
ing on the prevalence of the sought dis- should be internally consistent and in methods, a significant portion of the dis-
ease (a priori risk). In addition, there is agreement between the tables and the cussion, and the references are already
interobserver (subjective) variation in text. Any internal discrepancies in the most likely included in the IRB applica-
the accuracy of ultrasound as a test de- numbers put in jeopardy the credibility tion. Thus, most of the hard work in re-
pending on the skill of the ultrasound of the author(s) and severely compro- gard to the writing of the paper is already
examiner. mise the chances for publication. Confi- done.
Another issue to be considered is that dence intervals provide more accurate
case-control studies can provide esti- indication of the strength of the associa- Tips to consider before submitting
mates of positive and negative likelihood tions and therefore, provide better infor- the manuscript for publication
ratios but not positive or negative pre- mation than P values; thus, in conjunc- Before submitting the paper for publica-
dictive values. In addition, case-control tion with the effect measure, confidence tion, all coauthors should have the op-
studies do not allow determination of intervals should be used liberally. portunity to review the manuscript and
population prevalence or incidence of a The text should include a brief de- provide suggestions. In addition, consid-
disease. scription and analysis of the findings and eration should be given to the following:
Another consideration in this section it should follow the order that tables and (1) ask someone with experience in writ-
pertains to articles in which a relatively figures appear. The important findings ing or reviewing peer-review articles to
new methodology is used where those should be highlighted that may or may review the paper. It would be preferable
details have been previously published in not be statistically significant. Again, the that this senior reviewer is not very fa-
another report. In such instances, it is editorial space should be respected and miliar with the study details, so that he/
preferable to describe, briefly, that par- detailed description or repetition of all she can be a neutral barometer regarding
ticular methodology again rather than the information depicted in the tables or the quality of the paper; (2) the paper
referring the reader to those previously figures should be avoided. should not contain any contradictions;
published reports. This shows respect for (3) the paper should be understood by a
the reader’s time. Comment (or discussion) reader with average knowledge; and (4)
The comment (or discussion) section avoid errors.
Results should include the following: (1) clear As per the authors’ experience as peer
In the results section the author(s) statement of what the principal findings reviewers, the most frequent errors are as
should be prepared to give “results” for were, as well as the new knowledge that follows: (1) inappropriate conclusion(s),
all outcome measures that are described the current study offered; (2) strengths for example, conclusions that are applied
under “Materials and Methods.” The and weaknesses of the study; (3) com- to populations different than the one
converse is also true. The paper should parison of the findings of the current used in the study or conclusions imply-
not contain results that are not men- study with those of previous studies; (4) ing cause-and-effect relationship based
tioned under “Materials and Methods.” clarification regarding the similarities on inappropriate study designs; (2) lack
The results may include text, tables, fig- and differences with the findings of pre- of power analysis; (3) inadequate sample
ures, or any combination of the above. vious studies; (5) possible explanation(s) size; (4) too much confidence in negative
Results should be given for all outcome for the different findings; (6) clear and results from small samples; (5) improper
measures (primary and secondary) that concise conclusion of the meaning of the use of statistics; (6) when multiple com-
are described under “Materials and study as it relates to clinical practice or parisons are made true clinical signifi-
Methods.” Here, the opportunity exists future research; and (7) proposal for fu- cance should not be assumed if 1 or few
for reporting all the raw data. However, ture research. comparisons turn out to be statistically
the editorial space should be respected significant because this can happen by
by the judicious use of tables and/or fig- References chance alone; (7) incorrect use of statis-
ures, which have the purpose of saving The references list may be one of the most tical terminology with the terms “multi-
editorial space and at the same time important parts of the paper with respect variable” vs “multivariate.” Unfortu-
make it easier for the reader to under- to the chances for publication. The rea- nately, these 2 terms are being used
stand or interpret the results. Each table son for this is because editors frequently interchangeably. However, the term
or figure should stand on its own and be use as reviewers those included as au- “multivariable” refers to situations when
self-explanatory. In deciding the exact thors in the reference list. This is only a response or disease status is measured
format and data to be depicted in each natural because some of the references once (ie, as in case-control studies), and
the association between an exposure and will be unwilling to go back and read the be lengthy. If a suggested revision was
an outcome is assessed after adjustment original article again to obtain the com- followed, it is good practice to state in the
for confounders. In contrast, the term plete picture. In other words, the damage cover letter: “the revision was followed”
“multivariate” refers to situations when may be irreparable. There have been in- and indicate its exact location in the re-
the response is measured repeatedly on stances that typographic errors were not vised manuscript. All revisions should be
the same subject, thereby yielding a “vec- caught until months or years later when highlighted in the revised manuscript. If
tor” of responses for each subject, and a particular table or figure was repro- a suggested revision was not followed, it
hence “multivariate”; (8) inappropriate duced for another publication in another is imperative to explain why. If the rebut
reporting of “rates of proportions” with- journal. This is clearly an avoidable is not convincing to the editor and it in-
out any reference to the numerators and situation for which particular attention volves several revisions, then the editor
denominators. It is not good practice to has to be given, especially by busy may send the revised manuscript back to
present (or interpret) relative measures physicians. the reviewer who suggested the particu-
(ie, relative risk) without examining ab- If the paper needs revisions, we sus- lar revisions in the first place and that
solute measures (ie, absolute risk); (9) pect most papers do, the revisions can be may decrease the chances for publication
possible retrospective manipulation of minor or major. It is a good policy to dramatically. Thus, it is of paramount
the study objective according to the follow all minor revisions assuming that importance to put forward an extremely
(positive) findings; and (10) misspelling the clarity or quality of the paper will not logical and concise rebut in your cover
of the authors’ names that are cited in the be compromised. In such cases, the ac- letter to the editor, so he/she does not
references. If an authors’ name is mis- ceptance for publication is almost cer- send your revised manuscript back to the
spelled, and that author is one of the as- tain. However, the situation is quite dif- reviewers.
signed reviewers, the reviewer may be ferent when major revisions are The fourth possibility, which is quite
“turned off.” required. Major revisions can be catego- frequent, is that the paper is rejected
rized in 4 categories (each requiring a from publication. This can be extremely
Tips to consider after receiving different action): (1) those that improve disappointing, even heartbreaking, espe-
the editor’s response the quality of the paper, when revised (it cially for young investigators who may
A few weeks after submission of the pa- is strongly advisable to implement the take the criticisms personally. Fortu-
per, it is expected that the editor’s re- suggested revisions); (2) those that have nately, many editors use language to in-
sponse, along with the reviewers’ com- no effect on the quality of the paper (it is dicate that their decision was very much
ments, should be received. Based on the advisable to implement those changes to influenced by the high volume of papers
editor’s letter and the reviewers’ com- the extent possible); (3) those that may that they receive and that this does not
ments, the presumed disposition of the diminish the quality or clarity of the pa- necessarily mean that the paper has no
paper can be in 1 of 4 categories: (1) ac- per (it is advisable to not follow those value. In case of rejection, it is advisable
ceptance without revisions, as is (ex- suggestions); and (4) those that are im- to try to improve the paper, before sub-
tremely rare); (2) possible acceptance af- possible to do (they cannot be done). mitting it to a second journal, by ad-
ter minor revisions; (3) possible The rule of thumb is that if all or most dressing all the issues raised by the re-
acceptance after major revisions; and (4) suggested major revisions are followed, viewers. This is recommended for 2
rejection. Each 1 of these 4 initial dispo- the acceptance is almost certain, whereas reasons: (1) some journals may ask if this
sitions requires a different course of ac- if several of the suggested revisions are is the first submission. If it is not, they
tion to maximize the chances for rebut the acceptance remains uncertain. may want to see how the criticisms of
publication. The chances are that the paper most previous reviewers have been addressed;
If the paper is accepted without revi- likely will not be accepted if all suggested and (2) quite frequently the same re-
sions, we suspect this is a very rare occur- major revisions are refuted. viewers may be asked by the editor(s) of
rence, the only challenge is to correct the When the paper needs to be revised, the second journal to review that same
galley proofs very carefully. This task especially when major revisions are paper. If none of their criticisms have
should not be taken lightly because needed, the most critical factor that will been addressed it is natural, and quite
“what goes in print, stays in print for- determine the fate of the paper is the justifiable, to feel insulted, it is certain
ever.” The quality of the galley proofs cover letter to the editor. This letter that the paper will be rejected again. The
varies from journal to journal. Some- should address each revision separately. same process should be carried out in the
times, editors or publishers may change Here, the author has the opportunity to event of more submissions to more jour-
the meaning of the article in their at- “communicate” directly with the indi- nals. However, one should be mindful
tempt to improve it by substituting cer- vidual who has the ultimate power in de- that the paper does not lose clarity or fo-
tain words with others. If the error is ciding the fate of his/her paper. Editors cus in its final form after multiple
substantial and it is caught late, it may be read a large number of manuscripts and revisions.
acknowledged as an “erratum” in a sub- cover letters every day. Therefore, the The last piece of advice for those
sequent edition of the journal; but, it is cover letter has to be clear and concise young investigators who are apprehen-
almost certain that most of the audience addressing each criticism and need not sive or fearful about writing and submit-
ting a paper for publication has to do ports of meta-analyses of randomised studies in epidemiology (STROBE) statement:
with the realization that there is a very controlled trials: the QUOROM statement: qual- guidelines for reporting observational studies.
ity of reporting of meta-analyses. Lancet 1999; Lancet 2007;370:1453-7. Available at: http://
large number of obstetrics and gynecol- 354:1896-900. Available at: http://www. strobe-statement.org/index.php?id⫽checklist.
ogy journals, both in the United States consort-statement.org/index.aspx?o⫽1347. Accessed May 7, 2009.
and abroad, which are available and will- Accessed May 7, 2009. 6. Little J, Higgins JP, Ioannidis JP, et al.
ing to consider publishing their work. In 3. Stroup DF, Berlin JA, Morton SC, et al. Meta- Strengthening the reporting of genetic associa-
our view, adherence to the guidelines de- analysis of observational studies in epidemiol- tion studies (STREGA): an extension of the
ogy: a proposal for reporting, meta-analysis of STROBE Statement. Hum Genet 2009;125:
scribed previously will most likely in- observational studies in epidemiology (MOOSE)
crease the chances for publication. f 131-51. Available at: http://hum-molgen.org/
group. JAMA 2000;283:2008-12. Available at:
NewsGen/03-2009/000015.html. Accessed
http://www.consort-statement.org/index.aspx?
May 7, 2009.
o⫽1346. Accessed May 7, 2009.
REFERENCES 7. Vintzileos AM, Beazoglou T. Design, execu-
4. Bossuyt PM, Reitsma JB, Bruns DE, et al.
1. Moher D, Schulz KF, Altman D. The CON- Towards complete and accurate reporting of tion, interpretation, and reporting of economic
SORT statement: revised recommendations for studies of diagnostic accuracy: the STARD Ini- evaluation studies in obstetrics. Am J Obstet
improving the quality of reports of parallel-group tiative. Ann Intern Med 2003;138:40-4. Avail- Gynecol 2004;191:1070-6. Available at:
randomized trials. JAMA 2001;285:1987-91. able at: http://www.stard-statement.org. Ac- http://www.elsevier.com/framework_products/
Available at: http://www.consort-statement. cessed May 7, 2009. promis_misc/ajoghealth.pdf. Accessed May 7,
org. Accessed May 7, 2009. 5. von Elm E, Altman DG, Egger M, Pocock SJ, 2009.
2. Moher D, Cook DJ, Eastwood S, Olkin I, Ren- Gotzsche PC, Vandenbroucke JP. The
nie D, Stroup DF. Improving the quality of re- strengthening the reporting of observational