22 Oct 2019
Image Credit: Stephanie Li, GA4GH
More than 60 million genomes are expected to be sequenced for healthcare purposes over the next five years. This mass of data has the potential to inform human health and medicine in unprecedented ways, but that promise will only be realized if the data can be shared across disciplines and effectively linked to clinical outcomes.
Most existing formats for describing genotype information do not include a means to share the corresponding phenotypic information (e.g. observable characteristics, signs and symptoms of disease). While some genomic databases have defined their own formats for representing phenotypic information, the lack of uniformity among these databases hinders communication and limits the ability to perform analyses across them.
The GA4GH Steering Committee recently approved Phenopackets, a standard file format for sharing phenotypic information. The Phenopackets standard aims to facilitate communication between the research and clinical genomics communities by creating an ecosystem of interoperable tools and resources that can use phenotypic data with fewer barriers.
A phenopacket file contains a set of mandatory and optional fields for sharing information about a patient’s or participant’s phenotype, such as clinical diagnosis, age of onset, results from lab tests, and disease severity. It can also link to a separate file containing the patient’s genetic sequence, if available. Phenopackets are expected to standardize phenotypic data exchange in medical and scientific settings. This will allow phenotypic data to flow between clinics, databases, clinical labs, journals, and patient registries in ways currently feasible only for more quantifiable data, such as sequence data.
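To make the idea of mandatory and optional fields concrete, here is a minimal sketch of what a phenopacket-like record might look like when written out as JSON. The field names are loosely modelled on the published schema but should be treated as assumptions, as should the ontology identifiers, the file path, and the choice of which fields count as mandatory. The `missing_required` helper is hypothetical and is not part of any GA4GH tool.

```python
import json

# Hypothetical phenopacket-like record. Field names and ontology identifiers
# are illustrative assumptions, not a validated instance of the standard.
phenopacket = {
    "id": "example-phenopacket-1",
    "subject": {"id": "patient-1"},
    "phenotypicFeatures": [
        {
            "type": {"id": "HP:0001250", "label": "Seizure"},
            "severity": {"id": "HP:0012828", "label": "Severe"},
            "onset": {"age": "P6M"},  # ISO 8601 duration: six months
        }
    ],
    # Optional link to a separate file holding the genetic sequence data.
    "htsFiles": [{"uri": "file:///data/patient-1.vcf.gz", "htsFormat": "VCF"}],
}

# Assumed set of mandatory top-level fields, for illustration only.
REQUIRED_FIELDS = ("id", "subject")


def missing_required(record):
    """Return the assumed-mandatory fields that are absent from the record."""
    return [field for field in REQUIRED_FIELDS if field not in record]


if __name__ == "__main__":
    print(json.dumps(phenopacket, indent=2))
    print("Missing required fields:", missing_required(phenopacket))
```

Everything other than the two assumed-mandatory fields could be dropped from this record and it would still pass the check, which mirrors the mix of mandatory and optional fields described above.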
“Phenotype data is, by its nature, complex due to the wide array of modalities used to capture trait information,” said EMBL-EBI Bioinformatician Terry Meehan, who has implemented Phenopackets within the International Mouse Phenotyping Consortium (IMPC). “This complexity leads to challenges in data interoperability as differing languages are used between biomedical databases to describe similar results—a serious bottleneck in translating research for clinicians.”
The standard is of significant relevance to the rare disease and cancer communities, in which clinical data—such as lab test results, physical attributes, or disease progression and severity—are often used to differentiate between conditions that share similar phenotypes.
“Phenopackets will greatly simplify representation and exchange of phenotypic information, opening the door for matching rare disease patients in federated query systems supported by GA4GH,” said Melanie Courtot, Metadata Standards Coordinator at EMBL-EBI, who is leading implementation of the Phenopackets standard within the BioSamples database.
Using Phenopackets, clinicians can search through genetic variants that produce similar phenotypes and determine which one best matches their patient. Overall, such matching supports better and faster diagnosis and treatment, and higher chances of remission. Phenopackets also benefit researchers by opening up opportunities to analyze more data and strengthen our understanding of human health and disease.
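As a rough illustration of phenotype-based matching, the sketch below ranks candidate phenotype profiles by how much they overlap with a patient's recorded terms. It uses simple Jaccard similarity over sets of term identifiers, which is a stand-in chosen for brevity and not the method used by any GA4GH tool. The term identifiers, candidate names, and function names are all hypothetical.

```python
def jaccard(terms_a, terms_b):
    """Jaccard similarity between two collections of phenotype term IDs."""
    a, b = set(terms_a), set(terms_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def best_match(patient_terms, candidates):
    """Return the name of the candidate profile most similar to the patient."""
    return max(candidates, key=lambda name: jaccard(patient_terms, candidates[name]))


if __name__ == "__main__":
    # Placeholder term IDs; real records would carry ontology identifiers.
    patient = {"TERM:1", "TERM:2"}
    candidates = {
        "candidate-A": {"TERM:1", "TERM:2", "TERM:3"},
        "candidate-B": {"TERM:1", "TERM:4"},
    }
    print(best_match(patient, candidates))
```

Real matching systems typically weight terms by specificity and use the ontology hierarchy, so plain set overlap should be read only as the simplest possible version of the idea.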
“Clinicians and researchers with varying degrees of genomics expertise will find the file format useful,” said Melissa Haendel, Principal Investigator for the Monarch Initiative and Lead of the GA4GH Clinical & Phenotypic Data Capture Work Stream. “Phenopackets provide different levels of complexity so that we can exchange both high-level clinical phenotype information as well as in-depth data.” For instance, the standard can be used to describe anything from abnormal fetal movement or decreased white blood cell count to eye color or height.
Most of the fields within the file are optional, giving clinicians and researchers freedom to report only the phenotypic information they choose. If specific lab tests are not administered or a patient’s whole genome is not sequenced, those data do not need to be included in a phenopacket that stores other related information. This flexibility will also allow for the omission of identifiable information, such as date of birth or name, to preserve patient privacy.
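Because identifying fields are optional, a record can be stripped of them before it is shared. The sketch below shows one way this might be done on a phenopacket-like dictionary. The field names (`name`, `dateOfBirth`) and the `deidentify` helper are assumptions made for illustration, and removing these fields alone should not be taken as sufficient for real-world de-identification.

```python
import copy

# Assumed names of identifying fields on the subject; illustrative only.
IDENTIFYING_FIELDS = ("name", "dateOfBirth")


def deidentify(record):
    """Return a copy of the record with assumed identifying fields removed."""
    cleaned = copy.deepcopy(record)  # leave the caller's record untouched
    subject = cleaned.get("subject", {})
    for field in IDENTIFYING_FIELDS:
        subject.pop(field, None)
    return cleaned


if __name__ == "__main__":
    record = {
        "id": "example-phenopacket-2",
        "subject": {
            "id": "patient-2",
            "name": "Jane Doe",
            "dateOfBirth": "1990-01-01",
        },
        "phenotypicFeatures": [],
    }
    print(deidentify(record))
```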
To read a phenopacket file, researchers and clinicians can use existing software, such as Phenotools (for validating phenopackets) and Exomiser (for annotating variants).
Peter Robinson, a computational biologist and pediatric physician at the Jackson Laboratory, leads Phenopackets development. Robinson notes that the team hopes to release a guide soon for implementing phenopackets within electronic health records built on the HL7 FHIR framework (the leading standard for storing electronic health data) in order to drive uptake among the clinical community. The development team is also working with journals to require that phenotype data be submitted in the Phenopackets format, which will encourage research scientists to adopt the standard in practice.
“Phenopackets enable a massive network of genomic data sharing, not only within the research or clinical communities, but also between the two groups,” said Robinson. “Now researchers can use patient phenotype information to further their understanding of human biology, and clinicians can reap the benefits of research findings in healthcare.”