BLAIR DNA Project
DNA 103: Grouping Participants
Placing your participants into various groups based on their DNA test results is one of the most important things a Project Administrator can do. I strongly recommend that you start doing this as soon as matches start to appear. As your project grows it becomes much easier to add new matches to an existing group or start a new group.
I group participants primarily based on the strength of their DNA match, but I also consider their paper trail to a lesser degree.
Strength of the DNA Match
The Strength of the DNA Match depends on:
A 36 for 37 marker match with a rare value on one or more of the markers is a stronger match than a 24 for 25 marker match with common values on all the markers.
Markers Tested - Markers Matched
McGee’s Y-DNA Comparison Utility
As groups grow in size it becomes increasingly more difficult to calculate all the marker mismatches between members of a group. Fortunately there is an online utility that will do all the number crunching for you. McGee’s Y-DNA Comparison Utility allows you to copy data from a spreadsheet or other source, paste it into the program, and produce a chart like the one shown here.
This particular chart has been somewhat enhanced but the McGee program gives you all the data in an almost identical format.
By creating this matrix you can see the exact number of mismatches between any two participants.
Note that in addition to the 7 actual participants, I’ve included a hypothetical Anc02.
One of the things I do for each of my groups is create an Ancestral Haplotype for that group. He’s known as Anc01 or Anc02, etc and is the hypothetical "common ancestor" of the participants in the Group.
Although it’s impossible to know his actual DNA results, it is possible to deduce his most likely test results based on the results of his descendants. In its simplest form the ancestral haplotype is simply the most frequent marker values of the participants in the group.
The example above illustrates the ancestral haplotype of 4 factious participants. Note that although each participant mismatches the other participants on 2 markers, they all match the hypothetical ancestor on 24 of 25 markers.
As you add more participants to a group it is possible that the ancestral haplotype will change. If a group contains a large number of participants who share a known common ancestor with a distinct marker value you may have to make adjustments so you do not skew the haplotype.
Unusual Marker Values
Sharing a rare value on one or more of your markers can be a strong indication that participants share a common ancestor, provided the rest of their DNA results support that conclusion. It can be especially valuable in the case of borderline groupings.
In Group 3 of the Blair DNA Project we have 17 participants with a value of 26 on DYS#390 which occurs only about 1% of the time. 14 of these same participants also have values of 12/14 on DYS#385a/b which occurs less than 4% of the time.
Several websites have developed frequency distributions for the various marker values. I’ve included the website address of the sites listed here at the bottom of the page. I used the Sorenson Molecular Genealogy Foundation Website.
One of the major reasons for DNA testing is to either support or refute conventional research. So using conventional research to place someone in a DNA group may seem illogical. Conventional research should ONLY be used as a tie breaker.
No matter how good the paper trails may be, if the DNA results don’t match, I won’t put the participants in the same group.
But what if the DNA results are inconclusive or borderline? Then I look at the conventional research.
Whether I include them in the same group depends on the answers to all of these questions.
DNA References for
McGee’s Y-DNA Comparison Utility - http://www.mymcgee.com/tools/yutility.html
Frequency Distribution of Marker Values
Sorenson Molecular Genealogy Foundation (SMGF) - Y-Chromosome Marker Details - http://www.smgf.org/ychromosome/marker_details.jspx
Y-Base Statistics - http://www.ybase.org/statistics.asp
Leo Little data from FTDNA data and Y-search - http://freepages.genealogy.rootsweb.ancestry.com/~geneticgenealogy/yfreq.htm
This WebPage was last updated 01/17/2013
© January 1, 2010, blairdna.com
and its Allies