Diversity in Datasets

Shares actionable recommendations to promote global diversity in datasets within genomic research

If we want all people to truly benefit from scientific advancement and the full potential of genomics, we need to use diverse datasets for research and clinical care. But at all stages of genomic research, we see a critical lack of dataset diversity — from research participation and recruitment, to the genomic workforce, to emerging techniques and approaches such as polygenic risk scores and machine learning. In the Diversity in Datasets policy framework, the GA4GH Regulatory & Ethics Work Stream (REWS) explores and defines concepts such as “diversity” and “representation” and shares actionable recommendations for researchers in order to uphold diversity in their research and findings.

Jump to...

Benefits

  • Shares guidance on how to best promote diverse datasets in associated research
  • Promotes an international lens on meaningful diversity in datasets, a topic which is often limited to national discussion

Target users

Researchers

Community resources

Dive deeper into this product! There is well known bias in current genomic datasets with much of the data being from those of caucasian ancestry. This has led to efforts to diversify global datasets however, there have been inconsistencies with the usage of diversity in datasets and how groups may apply this notion. This group seeks to work towards defining this notion and elucidate how researchers can practically implement such notions into their work depending on their contexts and aims.


Don't see your name? Get in touch:

  • Mutiat Afolabi
    Wellcome Sanger Institute (WSI)
  • Shu Hui Chen
    NIH National Heart, Lung, and Blood Institute (NHLBI)
  • Megan Doerr
    Sage Bionetworks
  • Tina Hernandez-Boussard
    Stanford University
  • Jacob Shujui Hsu
    National Taiwan University
  • Sumit Jamuar
    Global Gene Corp
  • Saumya Jamuar
    KK Women's and Children's Hospital
  • Beatrice Kaiser
    McGill University / Université McGill, Centre of Genomics and Policy
  • Anna Lewis
    Harvard University
  • Zane Lombard
    University of the Witwatersrand, National Health Laboratory Service
  • Maxine Mackintosh
    Genomics England
  • Maili Raven-Adams
    The Nuffield Council on Bioethics
  • Alham Saadat
    Broad Institute of MIT and Harvard
  • Sikha Singh
    Association of Public Health Laboratories
  • Diya Uberoi
    McGill University / Université McGill, Centre of Genomics and Policy

News, events, and more

Catch up with all news and articles associated with Diversity in Datasets.

12 Nov 2024
What do we mean by “more diverse” data?: GA4GH’s new product encourages a holistic approach to diversity in datasets
See more
A DNA strand extending across a blue background, filled with molecular structures and more DNA
28 May 2024
GA4GH submits comments on the WHO’s draft principles for human genome access, use, and sharing
See more
25 Mar 2022
OmicsXchange episode 14: genomic surveillance and outbreak response in Africa with Alan Christoffels
See more