`DOI 10.1186/s12859-015-0668-z
`
`ED I T O R I A L
`Open Access
`Use and mis-use of supplementary material
`in science publications
`Mihai Pop1* and Steven L. Salzberg2
`
`Abstract
`
`Supplementary material is a ubiquitous feature of scientific articles, particularly in journals that limit the length of
`the articles. While the judicious use of supplementary material can improve the readability of scientific articles, its
`excessive use threatens the scientific review process and by extension the integrity of the scientific literature. In
`many cases supplementary material today is so extensive that it is reviewed superficially or not at all. Furthermore,
`citations buried within supplementary files rob other scientists of recognition of their contribution to the scientific
`record. These issues are exacerbated by the lack of guidance on the use of supplementary information from the
`journals to authors and reviewers. We propose that the removal of artificial length restrictions plus the use of
`interactive features made possible by modern electronic media can help to alleviate these problems. Many journals,
`in fact, have already removed article length limitations (as is the case for BMC Bioinformatics and other BioMed
`Central journals). We hope that the issues raised in our article will encourage publishers and scientists to work
`together towards a better use of supplementary information in scientific publishing.
`
`Introduction
`is ubiquitous in scientific pa-
`Supplementary material
`pers. For example, in the most recent issues of Science
`and Nature, every single paper contains supplementary
`information (data and/or text) that does not appear in
`the print version of the journal. Primarily used to cir-
`cumvent page limits imposed by journals, supplementary
`material can in some instances help improve the presen-
`tation, even in papers not subjected to length limitations.
`For example, a manuscript might present a high-level
`view of the methods employed in the analysis while de-
`tailed technical descriptions of the methods (essential
`for ensuring reproducibility) can be relegated to an on-
`line supplement. As a result, the story presented in the
`main manuscript can be laid out in a more concise and
`clear fashion, while still allowing interested readers to
`drill down into the details of the analysis. When used
`appropriately, supplementary material made available as
`an online companion to a paper provides scientific au-
`thors and publishers the means to achieve a compromise
`between readability and reproducibility.
`
`* Correspondence: mpop@umiacs.umd.edu
`1Department of Computer Science and Center for Bioinformatics and
`Computational Biology, University of Maryland, College Park, MD 20742, USA
`Full list of author information is available at the end of the article
`
`At the same time, the use of supplementary material
`raises several important questions and concerns. What is
`the appropriate balance between the main text and sup-
`plementary information? How is the scientific validity
`and relevance of supplementary material evaluated dur-
`ing the review process? What is the best method to link
`supplementary information to the primary paper?
`
`Why is supplementary material needed? Why is it
`a problem?
`The use of supplementary material is generally more ex-
`tensive in journals that impose page limits. Compare, for
`example, papers published in Bioinformatics (a journal
`that strictly controls manuscript length) and BMC Bio-
`informatics (a journal without page limits). At the same
`time, the extensive use of supplementary material is by
`no means uncommon even in journals that do not im-
`pose manuscript length restrictions. One valid 'excuse' is
`the need for conciseness in the main manuscript; how-
`ever, if the effort to squeeze the main findings into a lim-
`ited space is associated with a lack of attention to the
`supplementary information, the result may ultimately re-
`duce the clarity of the entire presentation. Paradoxically,
`despite or maybe because of the large amount of infor-
`mation often available in a supplement,
`finding and
`extracting specific points from a supplement can be very
`
`© 2015 Pop and Salzberg. This is an Open Access article distributed under the terms of the Creative Commons Attribution
`License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any
`medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://
`creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
`
`AAA, Ex. 2002
`Evergreen Theragnostics, Inc. v. AAA SA
`PGR2021-0002
`Page 1 of 4
`
`
`
`Pop and Salzberg BMC Bioinformatics (2015) 16:237
`
`Page 2 of 4
`
`difficult – particularly when the supplementary material
`is effectively a grab-bag of all the analyses that did not
`make it into the main paper.
`Even a cursory examination of papers published in
`top-tier journals reveals the extent to which supplemen-
`tary material is used in our field (a summary for the top
`10 most highly cited papers in 7 scientific journals is
`provided in Additional File 1). One can easily find ex-
`tremes such as these two articles published in Science:
`the first, a 2010 by Werren et al. [1], is a 6-page article
`accompanied by 165 pages of supplementary material.
`The second, a 2012 paper by Meyer et al. [2], is a 5-page
`article with 144 pages of supplementary material plus a
`spreadsheet with six additional supplementary tables. In
`[1], almost half (71 pages) of the supplementary mate-
`rials contain text supporting (or extending) the informa-
`tion provided in the main manuscript. In addition to the
`main text, the supplementary material included 210 cita-
`tions, or 5 times as many as the citations in the main
`manuscript. In [2], the supplement is organized as 20
`separate “Notes”, each with a separate author list and
`separate first authors and corresponding authors from
`the main paper. 168 citations are included while the
`main paper has just 28. These observations are troubling
`for several reasons; one is that citations within supple-
`mentary material do not get tracked by citation indices
`[3]. The supplementary references generally cite methods
`that were critical to the study being published. As a re-
`sult, an important body of work does not receive ap-
`propriate recognition – a troubling observation given
`the increasing use of quantitative impact measures (cit-
`ation counts,
`impact factors, etc.) in promotion and
`funding decisions. Furthermore, science advances through
`the incremental addition of knowledge to an existing
`body of work, and the proper acknowledgment of the
`previous work is a fundamental
`feature of scientific
`practice. We are not the first to make this observation
`(see [3–5]) yet, to our knowledge, neither publishers
`nor the scientific community have taken any steps to-
`wards remedying the situation. If citations within sup-
`plementary material are to be allowed, they should be
`appropriately tracked by citation indices – an impos-
`sible proposition today given that most journals do not
`provide properly formatted online citations for support-
`ing information.
`In fact, the majority of journals provide little or no
`guidance regarding the use of references within supple-
`mentary material, in many cases because the initial in-
`tent for such material was to enable the addition of data
`(such as more extensive tables, figures, movies, etc.) ra-
`ther than supporting text. Science was the only journal,
`among several that we examined, that clearly discussed
`the issue of references in supplementary material at the
`time when we originally wrote this article. Nature has
`
`since also clarified their policy. Science requires all refer-
`ences within supplementary material to be included in
`the main reference list: “References only cited in the sup-
`plementary material should be include at the end of the
`reference section of the main text, and the reference num-
`bering should continue as if the Supplementary Mate-
`rials was a continuation of the main text” [6]. This
`policy, though apparently useful, is not followed as ex-
`emplified by the articles discussed above [1, 2]. Nature
`currently explicitly discourages the use of references in
`supplementary material: “Please note that we do not en-
`courage deposition of references within SI as they will not
`be live links and will not contribute towards citation
`measures for the papers concerned. Authors who never-
`theless wish to post reference lists should continue the
`numbering from the last reference listed in the print ver-
`sion, rather than repeating the numbering in the print
`version” [7]. In fact, both Science and Nature strictly
`limit the number of references that can appear in print,
`a policy that runs directly counter to the very essence of
`scholarship. Given that references can be provided on-
`line for essentially no cost, these policies need to be
`changed.
`
`Is supplementary material being reviewed?
`Most journals ask reviewers to evaluate supplementary
`material, either to assess whether the information is ne-
`cessary, or to actually review it for scientific accuracy.
`For example, at the journal Science, the instructions to
`authors clearly state: “To be accepted for posting, supple-
`mentary materials must be essential to the scientific in-
`tegrity and excellence of
`the paper. The material
`is
`subject to the same editorial standards and peer-review
`procedures as the print publication” [6]. At the same
`time, many other journals do not provide any guidance
`thereby encouraging ad hoc reviewing
`to reviewers,
`practices that ultimately depend on each reviewer's own
`decisions.
`Despite the instructions provided to reviewers by some
`journals, supplementary material are rarely reviewed, es-
`pecially when the length of the supplementary text far
`exceeds that of the article being published. This fact is well
`evidenced by the manuscripts highlighted above [1, 2],
`which are merely two examples among thousands of man-
`uscripts submitted each year with lengthy supplements.
`Despite the fact that the instructions to authors for the
`journal Science require that all items in the supplementary
`material be appropriately referenced from the main text,
`in Werren et al. [1] only 9 out of the 25 supplementary
`figures, and 17 of the 58 supplementary tables are expli-
`citly mentioned in the main article. The entirety of the 71
`pages of supplementary text are referenced through a sin-
`gle citation from the main text (citation 6 in the article),
`making it difficult for an interested reader or reviewer to
`
`AAA, Ex. 2002
`Page 2 of 4
`
`
`
`Pop and Salzberg BMC Bioinformatics (2015) 16:237
`
`Page 3 of 4
`
`even find the specific section being referenced. The fact
`that this article ignores the journal's own policy strongly
`suggests that neither the reviewers nor the editors care-
`fully evaluated the supplementary material. We do not
`wish to single out this article; in fact, we would argue that
`this example is typical of most high-profile papers pub-
`lished today.
`This is a troubling observation as it suggests the possi-
`bility that fundamental errors in methods or analyses bur-
`ied in supplementary files may go undetected, thereby
`bringing into question the scientific accuracy and validity
`of the published articles.
`
`Is supplementary material easy to use?
`The primary intent of supplementary material is to pro-
`vide additional useful
`information that supports and
`complements the main text. In addition to figures, the
`most common form of supplementary information are
`tables detailing data presented in the main text. These
`tables are often extensive, containing, for example, infor-
`mation about a large set of genes in an organism. Such
`information is most useful to readers in a computer
`readable format (such as tab- or comma-separated plain
`text files, or a common spreadsheet format). In many
`cases, though, supplementary tables are provided only in
`PDF format, thereby significantly hampering the use of
`these data by researchers attempting to reproduce pub-
`lished results.
`the
`Furthermore, as we already highlighted above,
`main text is often not well integrated with the support-
`ing information provided in supplementary material.
`Readers often have to sift through tens or hundreds of
`pages of text to find information simply referenced from
`the main text as 'see Supplementary material'.
`
`A way forward
`The situation outlined above is simply unacceptable in
`today's technologically-advanced world. The limits im-
`posed on the length of articles and their corresponding
`references derive almost entirely from the constraints of
`paper-based publication. While these made sense for
`most of the 20th century, they make no sense at all
`today, and they distort and even imperil the scientific
`process. In the 21st century, fewer and fewer scientists
`peruse paper copies of journals. While one might argue
`that supplementary material can help improve the pres-
`entation of articles, especially in electronic form, the ex-
`cessive and largely unregulated use of supplementary
`material is harmful to science. As we discussed above,
`the scientific quality and validity of supplementary files
`is rarely evaluated during the review process. Further-
`more, cross-referencing prior works is a vital component
`of the scientific endeavor, yet many scientists' contribu-
`tions go unrecognized, buried deeply in supplementary
`
`files and not tracked by citation indices. This situation
`disproportionately affects scientists developing the ana-
`lytical methods that have, in many respects, made the
`current scientific revolution possible. Authors, reviewers,
`and journals alike must ensure the adequate acknow-
`ledgment, within every scientific article, of all prior work
`relevant to the study being published.
`The ubiquitous use of electronic media in modern sci-
`entific publishing provides an opportunity for the better
`integration of supplementary material with the primary
`article. Specifically, we propose that
`supplementary
`items,
`irrespective of format, be directly hyper-linked
`from the text itself. Such references should be to specific
`sections of the supplementary material rather than the
`full supplementary text. Mechanisms for providing such
`links are available in virtually all commonly used word
`processors, as well as in the commonly used display
`media (HTML, PDF, etc.), thereby requiring no add-
`itional infrastructure to be put into place. The availabil-
`ity of
`the supplementary information just 'one click
`away' would not only dramatically improve the utility of
`published scientific articles, but also increase the likeli-
`hood that supplementary material are adequately evalu-
`ated during the review process.
`Some journals have already taken steps towards pro-
`viding a rich interface to their articles, and in many
`cases the supplementary tables, figures, or other media
`are appropriately hyperlinked directly from the manu-
`script. In PNAS, for example, online articles are presented
`in a feature-rich format that includes several useful inter-
`active items: (i) hovering on a citation retrieves the cit-
`ation in a pop-up widget; (ii) figures and table references
`are hyperlinked to the actual display item; (iii) files con-
`taining supplementary tables and other data are directly
`hyperlinked from the manuscript, allowing readers to
`download these items with a single click. In PNAS, these
`features are also preserved in the PDF version of the arti-
`cles, and furthermore the supplementary material is auto-
`matically included within the downloaded PDF. In most
`other journals supplementary material must be down-
`loaded separately.
`In addition, we believe that removing arbitrary article
`size limits, at least for the online versions of articles,
`would have an important impact on removing the artifi-
`cial distinction between supplementary material and the
`main manuscript text. An interesting compromise in this
`direction is exemplified by Nature Methods, where arti-
`cles are accompanied by an Online Methods section that
`appears in both the online version of the article and the
`downloaded PDF.
`In our discussion above we have singled out two man-
`uscripts published in Science, primarily because Science
`is one of the few journals that provides clear instructions
`to authors and reviewers on supplementary material, yet
`
`AAA, Ex. 2002
`Page 3 of 4
`
`
`
`Pop and Salzberg BMC Bioinformatics (2015) 16:237
`
`Page 4 of 4
`
`6.
`
`7.
`
`8.
`
`Science instructions to authors [http://www.sciencemag.org/site/feature/
`contribinfo/prep/prep_online.xhtml]
`Nature instructions to authors [http://www.nature.com/nature/authors/
`submissions/final/suppinfo.html]
`NISO/NFAIS: Recommended practices for online supplemental journal article
`materials: National Information Standards Organization and the National
`Federation of Advanced Information Services.; NISO RP-15-2013 2013.
`
`frequently overuse
`articles published in this journal
`supplements. A more extensive analysis of supplemen-
`tary materials across journals is beyond the scope of
`this editorial, however interested readers can examine
`such an analysis recently done for environmental sci-
`ence journals [5], as well as our own survey of 70
`highly cited genomics papers from 7 different journals
`(Additional file 1: Table S1).
`Given the extensive use of supplementary material,
`and the potential harm it poses to science, it is critical
`that all scientific journals develop clear and consistent
`policies on the use and review of supplementary mater-
`ial. Some initial recommendations on the use of supple-
`mentary material were recently outlined in a report of
`the National Information Standards Organization and
`the National Federation of Advanced Information Ser-
`vices [8], but these recommendations still need to be im-
`plemented and refined to ensure the ethical and
`consistent use of supplementary material in our discip-
`line. We hope our paper will motivate scientists and
`publishers to enact desperately needed changes in the
`way supplementary materials are evaluated and used in
`scientific publishing.
`
`Additional file
`
`Additional file 1: Spreadsheet containing summary statistics about
`supplementary material use in 7 scientific journals. Data is presented
`for 10 genomics papers from each journal selected based on their
`number of citations during 2010–2011.
`
`Acknowledgments
`This paper was inspired by a spirited discussion with Julien Tap. We would
`also like to thank Emmanuelle LeChatelier, Edi Prifti, and Eduardo Rocha for
`comments and suggestions on the manuscript.
`
`Author details
`1Department of Computer Science and Center for Bioinformatics and
`Computational Biology, University of Maryland, College Park, MD 20742, USA.
`2Center for Computational Biology McKusick-Nathans Institute of Genetic
`Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
`21205, USA.
`
`Received: 8 July 2015 Accepted: 9 July 2015
`
`References
`1. Werren JH, Richards S, Desjardins CA, Niehuis O, Gadau J, Colbourne JK,
`et al. Functional and evolutionary insights from the genomes of three
`parasitoid Nasonia species. Science. 2010;327:343–8.
`2. Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, et al. A
`high-coverage genome sequence from an archaic Denisovan individual.
`Science. 2012;338:222–6.
`Seeber F. Citations in supplementary information are invisible. Nature.
`2008;451:887.
`4. Weiss MS, Einspahr H, Baker EN, Dauter Z, Kaysser-Pyzalla AR, Kostorz G,
`et al. Citations in supplementary material. Acta crystallographica Section D,
`Biological crystallography. 2010;66:1269–70.
`Kenyon J, Sprague NR: Trends in the Use of Supplementary Materials in
`Environmental Science Journals. Issues in Science and Technology
`Librarianship 2014.
`
`3.
`
`5.
`
`Submit your next manuscript to BioMed Central
`and take full advantage of:
`
`• Convenient online submission
`
`• Thorough peer review
`
`• No space constraints or color figure charges
`
`• Immediate publication on acceptance
`
`• Inclusion in PubMed, CAS, Scopus and Google Scholar
`
`• Research which is freely available for redistribution
`
`Submit your manuscript at
`www.biomedcentral.com/submit
`
`AAA, Ex. 2002
`Page 4 of 4
`
`