The Enthusiastic Genealogist: Color Clustering: Identifying "In Common" Surnames

Wednesday, August 8, 2018

Please see an updated version of this post and more on the Leeds Method of DNA Color Clustering on my new website, www.danaleeds.com

After creating Color Clusters using the new Color Cluster Method (aka Leeds Method), the next step is to identify the surnames associated with these groups. (For creating Color Clusters, please read my original Color Clustering post.)

Note: This method is especially useful for people working with adoptees or other unknown parentage cases where they do not already know what surnames to concentrate on!

COLOR CLUSTERS: Identifying Common Surnames

STEP 1: Create Color Clusters and determine which clusters you need to work with (or work with all of them).

Actual data from an adoptee I worked with, but names changed for privacy.

In this case, the adoptee identified the Blue Cluster as her biological mother's. We were trying to identify her biological father, so we concentrated on the Orange and Yellow Clusters. (The Green column did not have a cluster.)

STEP 2: Determine which matches have trees and which do not, and label each match accordingly.

I look at each match and see if they have a tree, whether or not it is attached to their DNA results! I then label each match "tree" or "no tree."
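If you keep your match list in a script rather than a spreadsheet, the labeling in this step could be sketched in Python roughly as follows. The record layout (the `attached_tree` and `unattached_tree` keys) and the match names are my own assumptions for illustration; they are not fields exported by any testing company.

```python
def label_trees(matches):
    """Return a copy of each match record with a 'label' of "tree" or "no tree"."""
    labeled = []
    for match in matches:
        # A tree counts whether or not it is attached to the DNA test.
        has_tree = match.get("attached_tree", False) or match.get("unattached_tree", False)
        labeled.append({**match, "label": "tree" if has_tree else "no tree"})
    return labeled


# Invented sample records, not real data.
sample = [
    {"name": "Match A", "attached_tree": True},
    {"name": "Match B", "unattached_tree": True},
    {"name": "Match C"},
]

for row in label_trees(sample):
    print(row["name"], "->", row["label"])
```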

STEP 3: List the "4th Gen" (great-grandparents) surnames for each match with a tree. If their tree doesn't reach the 4th generation, use the surnames of their grandparents or even parents.

To find the surnames, open the match's "pedigree and surnames" page and look at the surnames under the "4th Gen" column. If their tree is complete enough, you will see 8 surnames at this level - one for each of the match's great-grandparents. In this example, both Gabby and Jamie have all 8 great-grandparents listed on their tree along with their surnames.
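The fallback rule in this step (great-grandparents first, then grandparents, then parents) could be sketched in Python like this. It is only an illustration: the generation keys and the sample surnames are assumptions of mine, and the surnames still have to be read off each match's tree by hand.

```python
# Generations to try, from most useful to least useful for this method.
LEVELS = ("great_grandparents", "grandparents", "parents")


def surnames_for_match(tree):
    """Return (level_used, surnames) for the deepest generation that has surnames.

    `tree` maps a generation name to the list of surnames found at that level.
    """
    for level in LEVELS:
        seen = []
        for surname in tree.get(level, []):
            cleaned = surname.strip()
            # Skip blanks and keep each surname once, in the order found.
            if cleaned and cleaned not in seen:
                seen.append(cleaned)
        if seen:
            return level, seen
    return None, []


# Invented example: a tree that only goes back to the grandparents.
print(surnames_for_match({"grandparents": ["Gray", "Austin", "Gray"]}))
```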

STEP 4: Identify common surnames, if any, in each Color Cluster.

(I find this step truly amazing!) I have highlighted the shared surnames:

Orange Cluster: Griffin & Bartles
Yellow Cluster: Paulson, Austin, and Gray
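For anyone who would rather not eyeball the lists, here is a rough Python sketch of this comparison: it counts how many matches in each cluster carry a given surname and keeps the surnames found in at least two trees. The match records below are invented for illustration (they reuse the shared surnames above plus filler surnames) and are not the adoptee's actual chart. The function name and the `min_matches` threshold are also my own assumptions.

```python
from collections import Counter, defaultdict


def common_surnames(matches, min_matches=2):
    """Map each cluster to the surnames that appear in at least `min_matches` trees."""
    per_cluster = defaultdict(Counter)
    for match in matches:
        # Count a surname once per match, even if it repeats within one tree.
        unique = {s.strip() for s in match["surnames"] if s.strip()}
        for cluster in match["clusters"]:
            per_cluster[cluster].update(unique)
    # Spelling variants are not merged here; names must match exactly.
    return {
        cluster: sorted(name for name, n in counts.items() if n >= min_matches)
        for cluster, counts in per_cluster.items()
    }


# Invented sample records, not real data.
matches = [
    {"name": "Match A", "clusters": ["Orange"], "surnames": ["Griffin", "Bartles", "Smith"]},
    {"name": "Match B", "clusters": ["Orange"], "surnames": ["Bartles", "Griffin", "Jones"]},
    {"name": "Match C", "clusters": ["Yellow"], "surnames": ["Paulson", "Austin", "Gray"]},
    {"name": "Match D", "clusters": ["Yellow"], "surnames": ["Gray", "Paulson", "Austin", "Brown"]},
]

result = common_surnames(matches)
for cluster, names in result.items():
    print(cluster, "Cluster:", ", ".join(names))
```

A surname shared by two trees is still only a clue, so treat the output as a list of leads to verify rather than as confirmed ancestors.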

STEP 5: Assign potential surnames to the Color Clusters, if identified, and use these clues to further your research!

At this point, you have clues as to what surnames you are looking for in each cluster. Continue your research using these clues!

You might also be able to look at first cousins or other "close family" matches to help label these clusters. (And a big thank you to John Motzi for his help in refining this process!)

Happy Clustering!

25 comments:

Clorinda, August 8, 2018 at 3:25 PM
Thank you! This looks fairly straightforward to do. I have some families that I would like to try it with once DNA testing is done.
Texas Connie Gray, August 8, 2018 at 10:53 PM
Excellent explanation of how to use your system to identify ancestors. This will help so many people! Great work!
Marian B. Wood, August 9, 2018 at 6:02 AM
Dana, I especially like your reminder that other people's trees are *clues* and not *facts*! So many trees are incomplete or downright wrong, which means we have to confirm anything and everything on someone else's tree. I have your first blog post printed out so I can follow along as I try this new method. TY!
Beth Benko, August 9, 2018 at 9:56 AM
This is a fabulous process. I'm finally able to make sense of all those DNA matches. I've been able to make progress on my most troublesome line. Thanks!
laurie dolan, August 10, 2018 at 6:12 PM
I'm working with a distant cousin trying to identify her bio dad. There are only a few matches with trees, and a couple of those have just a few people on their trees. Would it be useful to try to make private mirror trees to possibly fill out the missing 4th gens? Of course just to generate clues to check out, not as facts.
Robin in Short Pump, August 12, 2018 at 8:00 AM
Hi Dana, this is exciting and I'm sure this is going to help me a lot.

But I'd like to point something out, because I am Captain Obvious, and if it helps someone else, yay!

My Mom's very first 2nd cousin match is a test that is managed by someone else (I know who they both are). When I pulled up his tree to grab those surnames, I went what the what? There was one person showing on that tab (the tester), yet it says there are over a thousand people in that tree. I went to view the full tree and the answer is clear. The test manager is the brother-in-law of the tester. The tree is the test manager's family, not the tester's. Since the tester is an in-law, he only got his rightful place in the tree, but none of his own family.

I don't know how common this might be, but thank goodness I already had the tester in my offline database and had enough info on the in-law to figure it out.

So if you're seeing weird things in your matches' trees that are managed by others, this might be one reason.

On to the rest of them!
Catherine, August 13, 2018 at 7:41 PM
Having fun playing with this method, thanks. We don't have more than a couple of 3rd cousins, so I'm using the top 100-200 3rd-4th cousins. One thing I've done is to colour the font for female names red in my list (so many have initials etc.) and add '(mg name...)' if the kit is managed by someone else, as often there are a few managed by the same person.
Susan Shiffman, August 14, 2018 at 7:07 PM
Hi Dana,

I happened across this blog the other day and have spent the last 3 full days color clustering!!

I am adopted and used 23andMe for my DNA. Although a few matches are listed as 2nd cousins, I had MANY 3rd and 4th. Because I have almost no information, I created a 10-column chart, hoping to see a pattern, and so far I have clusters in 9 of them. After reading all of the comments on the blog, I am not sure having 9 columns containing clusters makes sense. I also have a good amount of overlapping. I have surnames for 8/10 of the columns too.

Should I keep plugging along??
pruble, August 16, 2018 at 3:35 AM
Hello. I am working on my husband's DNA line (he was adopted). This tool is great! However, here is what happened for me. I ended up with 7 clusters. The first four were all related to the paternal tree (I know of at least 2 second cousin marriages), and two of the clusters I identified as the maternal clusters. I am pretty sure of that because I do know my husband's biomother, and there were people in those trees that matched that tree. I have one cluster of two people that I am not sure about.

Here is what is odd. My second and third cousins neatly fell in the above clusters, but when I started looking at 4th cousins I started having crossover into the maternal cluster! What would that indicate? I am thinking it might be an earlier marriage between the two lines? But why wouldn't it show up until the 4th cousin check?

Thanks again for this wonderful tool! It has helped me a lot.
Anonymous, October 8, 2019 at 5:16 AM
Hello, I am working on the line of a friend who is adopted, and after sorting with the color columns I have a lot of overlap and also 8 columns. Would it be possible to write a post on how to proceed with this problem? And bravo for your explanations, which are very clear.