Meeting Report: BioSharing at ISMB 2010

Field, Dawn; Sansone, Susanna; DeLong, Edward F.; Sterk, Peter; Friedberg, Iddo; Gaudet, Pascale; Lewis, Susanna; Kottmann, Renzo; Hirschman, Lynette; Garrity, George; Cochrane, Guy; Wooley, John; Meyer, Folker; Hunter, Sarah; White, Owen; Bramlett, Brian; Gregurick, Susan; Lapp, Hilmar; Orchard, Sandra; Rocca-Serra, Philippe; Ruttenberg, Alan; Shah, Nigam; Taylor, Chris; Thessen, Anne

doi:10.4056/sigs/1403501

Open access
Published: 31 December 2010

Dawn Field¹,
Susanna Sansone²,
Edward F. DeLong³,
Peter Sterk¹,
Iddo Friedberg⁴,⁵,
Pascale Gaudet⁶,
Susanna Lewis⁷,
Renzo Kottmann⁸,
Lynette Hirschman⁹,
George Garrity¹⁰,
Guy Cochrane¹¹,
John Wooley¹²,
Folker Meyer¹³,
Sarah Hunter¹¹,
Owen White¹⁴,
Brian Bramlett¹⁵,
Susan Gregurick¹⁶,
Hilmar Lapp¹⁷,
Sandra Orchard¹¹,
Philippe Rocca-Serra²,
Alan Ruttenberg¹⁸,
Nigam Shah¹⁹,
Chris Taylor¹¹ &
Anne Thessen²⁰

Standards in Genomic Sciences volume 3, pages 254–258 (2010)

Abstract

This report summarizes the proceedings of the one-day BioSharing meeting held at the Intelligent Systems for Molecular Biology (ISMB) 2010 conference in Boston, MA, USA. This inaugural BioSharing event was hosted by the Genomic Standards Consortium as part of its M3 & BioSharing special interest group (SIG) workshop. The BioSharing event included invited talks from a range of community leaders and a panel discussion at the end of the day. The panel session led to a formal agreement among community leaders to join together to promote cross-community knowledge exchange and collaboration. A key focus of the newly formed BioSharing community will be linking up resources to promote real-world data sharing (the virtuous cycle of data) and supporting compliance with data policies through the creation of a one-stop portal of information. Further information about the newly established BioSharing effort can be found at http://biosharing.org.

Introduction

The M3 & BioSharing special interest group (SIG) hosted by the Genomic Standards Consortium (GSC, [1]) at the Intelligent Systems for Molecular Biology (ISMB) 2010 conference explored the latest concepts, informatics resources, and standards that are being developed to cope with the analysis of vast quantities of metagenomic data [2]. As part of the outreach of the GSC to other data-sharing communities, the second day of the SIG served as the inaugural meeting of the BioSharing initiative [3]. During this day-long meeting of communities interested in data sharing, the focus shifted to the wider issue of how to increase engagement between funding agencies and researchers in order to build better data policies that promote real-world data sharing through the use of standards.

An increased focus on ’omics data sharing

Data sharing policies are emerging in response to increased funding for high-throughput approaches in major bioscience domains [4], including genomics and functional genomics. But despite their commonalities, the policies are heterogeneous by nature, given the different types of communities served and the data types they cover. In parallel, an escalating number of community-developed standards (minimal requirements checklists [5], ontologies [6], and file formats) operate to support the harmonization of the reporting process, so that different experiments can be compared or integrated. The proliferation of these standardization efforts is a positive sign of community engagement, but it also brings with it new sociological and technological challenges, namely creating interoperability and avoiding unnecessary overlap and duplication of effort that hamper their wider uptake.

The BioSharing initiative [3] seeks to facilitate a broader dialogue among funders, journals, standards developers, technology developers and researchers on the critical issue of data sharing within the metagenomics community and beyond. To help encourage this dialogue, 14 community leaders were invited to come together to present overviews of their community-level efforts and discuss how to move forward. This report briefly summarizes the presentations and discussions of this BioSharing day.

BioSharing - Towards real-world data sharing

The agenda of the day was designed to focus on the intersections of science, standards, and policy. Dawn Field and Susanna Sansone, founding members of BioSharing, described how the concept of a BioSharing community stemmed from their recent article, ’Omics Data Sharing, written in collaboration with a large number of funders developing and maintaining data sharing policies [4]. The purpose of this BioSharing day was to bring together, for the first time, representatives of a variety of these communities to kick-start cross-community interactions and achieve agreement on how to move forward.

The BioSharing Plenary Talk - Strong Data Policies from Funding Agencies

The day opened with a plenary talk from Susan Gregurick, a co-author of the ’Omics Data Sharing paper [4] and representative of the Department of Energy (DOE, [7]), which maintains a strong data sharing policy within its Genomes to Life (GTL) program. Dr. Gregurick gave an overview of the mission of the DOE and its strong commitment to data sharing. To help set the stage for the discussion at this meeting, she also announced that the National Science Foundation (NSF, [8]) would be implementing a new approach to data stewardship through the ‘Data Management Plan’ requirement in future grants. This approach will likely be rolled out to other federal funding agencies. This will require researchers to be increasingly familiar with existing and planned data sharing solutions for their particular area of research.

Community Introductions

All remaining presentations of the day were dedicated to community introductions by community leaders. Each community representative in turn was asked to state the mission, purpose and current status of work in their community, and to highlight specific activities it might be undertaking at the interface of the many areas covered within BioSharing, such as ontologies, checklists, data formats, enabling technologies, scientific publications, databases, and data policies. A total of 12 formalized community-level projects were described, covering the perspectives of checklists, ontologies, software, databases, journals and collaborative data sharing efforts (Table 1). In addition, Guy Cochrane (EBI) and Lynette Hirschman (MITRE) gave talks representing the general ‘database’ and ‘natural language processing’ (BioNLP) communities, respectively. Combined, this group covered a wide range of expertise and projects. The need to ‘close the virtuous cycle’ through increased collaboration at the intersections of these communities was a common theme. All presentations are available online from the BioSharing website.

Table 1. List of communities, their missions, and community representative, in the order in which they were presented at the first BioSharing workshop.

Panel Discussion: developing a vision for the future

Chaired by Dawn Field and Susanna Sansone, the panel discussion at the end of the day included all speakers. This was the first time many of these community leaders had met in person, and all agreed on the importance of this meeting as the first step in working together. All agreed to move forward as a group to build linkages through a BioSharing effort. There was strong interest in a follow-up meeting. To support further activities, it was agreed that the BioSharing forum will have a combination of targeted and open-attendance meetings, normally as part of larger meetings so as to reach as broad an audience as possible, especially potential grant awardees and therefore future users of standards. The forum will use all possible means to disseminate information (such as RSS feeds, position papers, and presentations). Following this successful meeting, a statement of purpose was formulated. It can be found in full on the BioSharing website [3]. Also, as a result of the meeting, Pascale Gaudet took forward the development of BioDBCore, a minimum information checklist for describing databases, as a BioSharing project led by her community, the International Society for Biocuration (ISB) [14].

BioSharing Forum Statement of Purpose

The BioSharing community will work at the global level to build stable linkages between funders, implementing data sharing policies, and well-constituted standardization efforts in the biosciences domain, to expedite the communication and the production of an integrated standards-based framework for the capture and sharing of high-throughput genomics and functional genomic bioscience data.

This overall objective has several components, each of which can be further decomposed:

Web site to centralize bioscience data policies, reporting standards and links to other related portals

◦ Providing a “one-stop shop” for those seeking data sharing policy documents and information about the standards and technologies that support them.
◦ Exposing core information on well-constituted, community-driven standardization efforts and linking to their reporting standards (checklists, ontologies and file formats), documentation, training material, news and contact points.
◦ Linking to existing portals or new resources (to be developed collaboratively with other groups and initiatives) for those seeking information on systems serving or implementing the standards.

Communication forum for funders and leaders of the standardization efforts to achieve harmonization and mutual support

◦ Lobbying for intra-harmonization within these two groups to promote:

exchange of ideas and policy components among public and private funders, and between funders and funding recipients, to ensure that the differences among the policies (such as the reporting standards that may be supported) ultimately do not impede seamless interoperability of the data.
collaboration among the standardization efforts to create interoperable reporting standards and to avoid unnecessary overlap, duplication of effort and incompatible tools.

◦ Identifying a mutual support system between the two stakeholder groups to ensure:

funding agencies are abreast of the challenges the standardization efforts face and can provide targeted funds to sustain their development and maintenance;
when community-developed standards are mature and appropriate standards-compliant systems become available, these are channeled to the appropriate funding agencies, which in turn endorse them in agency data sharing policies, thus achieving wider harmonization of the data.

References

1. Genomic Standards Consortium. http://gensc.org/gc_wiki/index.php/Main_Page
2. Metagenomics versus Moore’s law. Nat Methods 2009; 6:623. doi:10.1038/nmeth0909-623
3. The BioSharing website. http://biosharing.org/
4. Field D, Sansone SA, Collis A, Booth T, Dukes P, Gregurick SK, Kennedy KL, Kolar P, Kolker E, Maxon M, et al. ’Omics Data Sharing. Science 2009; 326:234–236. doi:10.1126/science.1180598
5. Taylor CF, Field D, Sansone SA, Aerts J, Apweiler R, Ashburner M, Ball CA, Binz PA, Bogue M, Booth T, et al. Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project. Nat Biotechnol 2008; 26:889–896. doi:10.1038/nbt.1411
6. Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, et al. The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol 2007; 25:1251–1255. doi:10.1038/nbt1346
7. Department of Energy. http://www.energy.gov
8. National Science Foundation. http://www.nsf.gov/
9. Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol 2008; 26:541–547. doi:10.1038/nbt1360
10. Taylor CF, Hermjakob H, Julian RK Jr, Garavelli JS, Aebersold R, Apweiler R. The work of the Human Proteome Organisation’s Proteomics Standards Initiative (HUPO PSI). OMICS 2006; 10:145–151. doi:10.1089/omi.2006.10.145
11. Howe D, Costanzo M, Fey P, Gojobori T, Hannick L, Hide W, Hill DP, Kania R, Schaeffer M, St Pierre S, et al. Big data: The future of biocuration. Nature 2008; 455:47–50. doi:10.1038/455047a
12. Garrity GM, Field D, Kyrpides N, Hirschman L, Sansone SA, Angiuoli S, Cole JR, Glockner FO, Kolker E, Kowalchuk G, et al. Toward a standards-compliant genomic and metagenomic publication record. OMICS 2008; 12:157–160. doi:10.1089/omi.2008.A2B2
13. Rocca-Serra P, Brandizi M, Maguire E, Sklyar N, Taylor C, Begley K, Field D, Harris S, Hide W, Hofmann O, et al. ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level. Bioinformatics 2010; 26:2354–2356. doi:10.1093/bioinformatics/btq415
14. Gaudet P, Bairoch A, Field D, Sansone SA, Taylor C, Attwood TK, Bateman A, Blake JA, Bult CJ, Cherry JM, et al. Towards BioDBcore: a community-defined information specification for biological databases. Nucleic Acids Res 2010. doi:10.1093/nar/gkq1173
15. Wooley JC, Godzik A, Friedberg I. A primer on metagenomics. PLOS Comput Biol 2010; 6:e1000667. doi:10.1371/journal.pcbi.1000667
16. Vision TJ. Open Data and the Social Contract of Scientific Publishing. Bioscience 2010; 60:330–331. doi:10.1525/bio.2010.60.5.2

Acknowledgements

Many thanks to the invited and selected speakers and to everyone who participated in this SIG meeting. We gratefully acknowledge the support of the US National Science Foundation (NSF) grant RCN4GSC, DBI-0840989.

Author information

Authors and Affiliations

Centre for Ecology & Hydrology, Maclean Building, Benson Lane, Crowmarsh Gifford, Wallingford, Oxfordshire, OX10 8BB, UK
Dawn Field & Peter Sterk
Oxford e-Research Centre, University of Oxford, Oxford, UK
Susanna Sansone & Philippe Rocca-Serra
Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02138, USA
Edward F. DeLong
Department of Microbiology, Miami University, Oxford, OH, 45056, USA
Iddo Friedberg
Department of Computer Science and Software Engineering, Miami University, Oxford, OH, 45056, USA
Iddo Friedberg
Swiss Institute of Bioinformatics, CMU - 1, rue Michel Servet, CH-1211, Geneva 4, Switzerland
Pascale Gaudet
Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
Susanna Lewis
Microbial Genomics Group, Max Planck Institute for Marine Microbiology & Jacobs University Bremen, D-28359, Bremen, Germany
Renzo Kottmann
Information Technology Center, The MITRE Corporation, 202 Burlington Road, Bedford, MA, 01730, USA
Lynette Hirschman
Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, 48824, USA
George Garrity
European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
Guy Cochrane, Sarah Hunter, Sandra Orchard & Chris Taylor
University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
John Wooley
Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL, 60439, USA
Folker Meyer
Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
Owen White
Lux Bio Group, 121 SW Morrison Street, Suite 1550, Portland, Oregon, 97204, USA
Brian Bramlett
Biological Systems Science Division, Department of Energy, 1000 Independence Ave., Washington, DC, 20585, USA
Susan Gregurick
National Evolutionary Synthesis Center (NESCent), Durham, NC, 27705, USA
Hilmar Lapp
Science Commons, c/o Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory, Building 32-386D, 32 Vassar Street, Cambridge, MA, 02139, USA
Alan Ruttenberg
Stanford Center for Biomedical Informatics Research, Medical School Office Building X-215, 251 Campus Drive, Stanford, CA, 94305, USA
Nigam Shah
Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, 02543, USA
Anne Thessen

Corresponding author

Correspondence to Dawn Field.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Field, D., Sansone, S., DeLong, E.F. et al. Meeting Report: BioSharing at ISMB 2010. Stand in Genomic Sci 3, 254–258 (2010). https://doi.org/10.4056/sigs/1403501

Published: 31 December 2010
Issue Date: November 2010
DOI: https://doi.org/10.4056/sigs/1403501
