Using nothing more than a simple vial of saliva, millions of people have created DNA profiles on genealogy websites.
But this wealth of information is effectively inaccessible to genetics researchers, with the sites painstakingly safeguarding their databases, fearful of a leak that could cost them dearly in terms of credibility.
This problem of access is one that Bonnie Berger, a professor of mathematics at the Massachusetts Institute of Technology, and her colleagues think they can solve, with a new cryptographic system to protect the information.
VCG Photo
"We're currently at a stalemate in sharing all this genomic data," Berger told AFP. "It's really hard for researchers to get any of their data, so they're not really helping science."
"No one can gain access to help them find the link between genetic variations and disease," she said. "But just think what could happen if we could leverage the millions of genomes out there."
The idea of this new cryptographic method, described Thursday in the US journal Science, was developed in connection to finding drug candidates in datasets from pharmaceuticals companies.
In an earlier work, the researchers have shown the concept could be applied to DNA profiles.
Labs are constantly looking to identify links between millions of drug compounds and the tens of thousands of proteins in the human body, to identify good candidates for certain drugs.
But they don't want their competitors to know what they are working on. Often, their drug compounds are patented and secret. So they don't share much.
VCG Photo
Privacy concerns
The intersection of genetics and genealogy has made headlines in the United States. Last week, a new study showed that half of all Americans could be identified from relatives' DNA samples found in GEDmatch, a free website.
This technique has been a boon for US police forces, who have used it to identify suspects in cold cases dating back decades, such as the "Golden State Killer," who is blamed for 12 murders and more than 50 rapes starting in the mid-1970s.
It can also be used by people looking for their biological parents.
But what happens if the data falls into the wrong hands? Hackers could potentially exploit the information to nefarious ends. Or what if insurance companies and others used it to discriminate against customers?
Benjamin Berkman, a bioethics researcher at the National Institutes of Health, told AFP there is "not really evidence of systemic discrimination," but noted that "doesn't mean that it couldn't become a problem."
"People are very worried about genomic privacy. It's something that they cite as a reason why they're not getting genetic testing, or they're not enrolling in research," Berkman said.