2:22 PM

FAN Club Genealogy Research: How-to and Tips

If you're trying to learn what a genealogy FAN club (or the FAN principle) is, read this post, instead.

Today's post gets into a few more nitty-gritty details of how to start FAN club research for genealogy (also called cluster research or cluster genealogy).

IMPORTANT: This is the technique for traditional genealogy research, not "auto-clustering" which is used for DNA.

Before I dive in, why would you use cluster research for specific problems instead of just "normal" research? First, this is "normal" research, you're just researching beyond your direct ancestors and even beyond your ancestor's family (the ancestor's FAN club).

You particularly use FAN club research for genealogy brick walls (i.e. a difficult problem) like finding parents' names, finding a maiden name, or identity problems (example: "is this two men of the same name or one man?").

You can use cluster research for any problem you want. It's great when you run out of direct evidence, to make sure you've done reasonably exhaustive research, and even for things like social history.

It's really useful!

So you've got an idea why you'd use it. This post is going to look at getting started and using a FAN club in general.

Getting Started with FAN Club Research

In my previous post about what the FAN club principle is, I recommended capturing FANs before you knew you needed to use them in your family history research (capture them before you have a problem where you are specifically going to use cluster research). So let's look at how to get started.

To do cluster research, i.e. to use your FAN club, you need a specific problem you want to solve.

Whenever you have a tricky problem, you need to start by writing down the details of the EXACT problem you are working on. Writing it down makes sure all the details are clear in your mind. If you struggle to write a description of your problem, there is some additional research you need to do (or maybe some past research you need to review).

See this article about asking a good specific research question to get an idea of the types of details you want to include.

Think of this first step as a double-check that it's time to employ a more complex technique or strategy like cluster or collateral research.

[Jargon and Semantics: BE AWARE!

There is technically a difference between cluster research and collateral research---collateral research includes extended family, not just direct ancestors. The cluster is family and FANs. That means cluster research involves family and non-family whereas collaterals are only family.

"Cluster research" and "FAN club research" can be used interchangeably. Some people define the FAN club as being "friends, associates, and neighbors" which might not be any family---but you won't know this when you get started so it's easiest to think of the cluster and a FAN club as being equivalent terms.

People often use collateral and cluster interchangeably, too, although they are not technically equivalent. Rarely do the minor differences in the definitions make a difference as long as you understand what you need to do for your specific problem.]

You can collect FANs without fully doing FAN research. The "dividing line," so to speak, is analysis. First, you need to identify FANs (the cluster). Then you do analysis that is appropriate for the question you are trying to answer.

Is that clear?

FANs are defined by their relationship to a person, not a familial relationship but the fact they have some type of interaction with the person. These are people you want to "collect."
Once you have a question you can't answer and want to use cluster research, you analyze what you know about the FANs and that is the actual "research" as opposed to just collection.

You will need to collect the FANs before you can do analysis and you must have a clear question before you do analysis. However, you can start building the cluster before or after defining your question. Without a research question, you are only "collecting," not doing "research." A collection of names (and other information) won't help you. You need to do something with those names to benefit from collecting them.

Does "Analysis" Sound Too Complicated?

You have already been doing analysis in genealogy (doing something beyond just collecting names and information), but it was so basic, you didn't even think about it.

You didn't just write down a name and birth information. You "analyzed" some piece of information that told you that name and birth information was for a person in your tree. If you put a lot of the wrong people in your tree when you got started, your analysis skills weren't developed, yet.

(FYI, if you trusted automated suggestions to build your tree, that could have been the problem. Computers and AI aren't as good at analysis as you are, although they can search much faster. Their speed makes it look like they are better than you, but searching isn't analysis. Automated tools are just an artificial recreation of the analysis humans can learn to do. They also are only as good as the information you give them, as are you, and if you don't have much info, or wrong info, the analysis will be off.)

You did early analysis using something like a document, person, or artifact that said that person was your ancestor's father. But you probably did even better analysis. If you could understand that your ancestor's brother shared the same parents without needing to be explicitly told the brother's parents' names, you were doing analysis.

Analysis isn't complicated. You had to do it to do genealogy and if you're reading this post, you probably went beyond the basic examples I just gave (not everyone does, so good job, you!).

You Can Do Genealogy Analysis

To continue building your family tree, you'll need to learn to do more complex analysis and you'll need different information to analyze. That's the point of FAN club research. It provides additional information which you can analyze and use as clues to the answers you seek.

Analysis using clusters, particularly non-relatives, can be really complex (but doesn't have to be). No matter what, you have to start with collecting information.

I wanted to make sure you understood that "FAN club research" isn't just collection. It's more, but it's something you can do, and improve your skills to do better. Now let's look at how to do the easier work of "collecting." If you can start building your cluster early (and keep track of your fan club research), you are more likely to use it successfully.

Collecting FANs

When we collect FANs, we are gathering the person's information. Just a name isn't enough.

However, this can be a stumbling block to getting started. It sounds easy, but when you try to record a FAN like you would an ancestor or collateral relative, you can find it tricky. Below I've got an easy method you can use to get started and then adapt as you go.

But first...

To make this a little easier to discuss, as far as what information to collect, I'm going to refer to "data points." Data points are just pieces of information. It's fewer characters to type and to me, it is quicker to read. I also find it makes it a little clearer that this isn't information in the sense of a paragraph. We want this to be simple! It's just data points, not a life history.

Normally when I talk about data points in genealogy I mean first name, last name, estimated year of birth, birthplace, etc. But with cluster research, it is:

first name,
last name,
location the name came in contact with your ancestor (or focus person),
date the name came in contact with your ancestor, and
the source where you found the information.

There may be additional data points but usually, those five points are all you hope for from most members of a cluster. When you're getting started you may think the only data point you have is a name. But you always have these five so ALWAYS record them.

I’ve said “name” instead of “person.” When you start, you may not be able to tell if the same names are the same person or ANYTHING about the person of that name other than the data points I’ve just mentioned.

Tip: If you use a spreadsheet or database, make sure first and last name are treated as two separate data points (separate fields or columns). It took me years to realize Dennis Miller was Bud Miller. If I tried matching on "name" they wouldn't match up. But obviously their surnames match. Clusters can be massive so you might not "see" two people with different first names could be the same if you use a single "name" field.

If there are additional data points like the person's age, occupation, religion, role in the community (like Justice of the Peace), or anything that identifies the person, you should also record that. But don't get so excited when you have more information you miss out on the core data points. You don't want to waste time later trying to figure out when or where your person came in contact with the person in the cluster.

How to Record Your FANs

So this is much trickier for me to answer. If all you have are the five data points, it is pretty much impossible to record FANs in an online tree. Online trees can hold this information with some manipulation, but what's important isn't keeping it but using it. It's not usually possible to get the FANs out of an online tree to do analysis.

Online trees are not the same as genealogy software and software may be able to hold your FAN club info and output the information in a way you can use. However, you need to be sure BEFORE you enter all those names that this will work. Software may say it can record FANs but you need to be sure the output is something you can work with (both you can figure out how to get the information out and that it comes out in a format you can use, the latter can be tricky if you don't have a clue how you'd use it).

This post isn't about how to record your FANs because it is complex. Here's my succinct recommendations.

Don't use an online tree
Only use software if you already use that software heavily. Enter a few test names and generate a report (or several types if they are available). You don't have to use software even if you use it for your ancestors.
I recommend a spreadsheet and keep it simple. Use those five data points as your columns, plus a sixth column for the link to your notes taken from that source. (You will create a different, customized spreadsheet, based on this one, which contains any additional data points when it comes time for analysis. The main spreadsheet just needs the six columns).
Spreadsheets not an option for you? Keep it simple with a doc. Include the equivalent of the six spreadsheet columns. You'll have to figure out what will work for you when it comes time to analyze, but you'll at least be able to copy and paste.
Want to work on paper? Use index cards. This is the analog of a spreadsheet. Record the same info as the six columns on the spreadsheet.

An Example of Using Basic Data Points

Let's say you've reached a point in your analysis where you've got two Dred Ledfords in the cluster. Not a really common name. You want to determine if you can combine this into one person. Combining FANs into a confirmed (or nearly confirmed) single person might be pivotal to deciding both the records are for your John Smith, not some other John Smith.

Here's our starting info. One Dred Ledford is a J.P. The other lived on land adjoining a piece John Smith bought. Are they the same man?

I've told you nothing to help you say yes or no.

OK, let's add a data point we should have recorded. The J.P. was encountered in North Carolina, the neighbor in Oklahoma (you should have recorded the most specific location possible from the source, but for the example I'm keeping it simple).

Still not a yes or no.

If you found these records because you know your John Smith was in North Carolina and Oklahoma, Dred Ledford could just as easily have lived in both, too. So let's add more data points.

The J.P. married a John Smith to Mary Jones (in North Carolina) in 1823. The neighbor was living in Oklahoma in 1880. Once again, if it was possible both John Smith's are your man, this could be one Dred Ledford.

Now you start doing cluster RESEARCH instead of just collecting names.

You research Dred Ledford, treating them as two different men for now. You find the J.P. died in North Carolina in 1841. Obviously not the same man. This doesn't mean it isn't one family with such an unusual name, but you wouldn't base any analysis on this single individual because it's not one individual.

But what if INSTEAD your research revealed the neighbor died in 1885 at the age of 93, a native of North Carolina? Hmmm, that might be one man and an important cluster member. It's worth doing more research.

Without those basic data points, particularly where this FAN encountered your research subject, you can't even get started with cluster research. The name of a neighbor in a deed or land record is often just a name. You can determine the date and location from the source you're using.

That's also why it's vital to record the source. You'll have questions when you start using the FANs and in most cases, the first stop is looking at the source again to see if you can glean more based on your defined goal and anything new you've learned since you captured that FAN.

Analysis Takes It Further

You can also estimate the neighbor is an adult, so you could also record a data point of "probably born before [date of record minus 20]" (if you remember from the previous post you should be taking notes so I literally record:

This person should be an adult which would make him born before___. However, I haven't done any legal research or other research that could indicate he is under 21, so this is only a likely estimate.

The note about legal research not indicating he's under 21 is because that is a safe legal age to use but you used 20 to calculate the born-before date. This is an estimate so the extra note is to help make it clear you shouldn't use it as an exact date.

The J.P. is also an adult and likely not just 21, although it's possible. This is why notes are so important. You can record this "likely" information so you use it in your comparisons. Notes are the appropriate place for this type of information, so it is usable but doesn't cause issues in your analysis like it can when entered in a tree, software field, or spreadsheet where it's just a date.

Many members of the cluster will just be names. Those 5 basic data points are so important.

A person's age doesn't help if you don't know WHEN they were that age, and you can't estimate a born-before date without the when data point. Locations are sometimes the best information we have to work with for our cluster. In genealogy you don't get the absolute "best" information, you get the best available---work with what you've got.

[Recap: We want the name and source for every FAN club member. From the source we can determine a location and date where our person and the FAN club member interacted. Those are our key 5 data points (given name is one data point, last name is one data point).

You may also be able to estimate the person is an adult. Make sure you write in your notes that you are estimating their year of birth as being before a certain date and how you are making your estimate. Clusters often have a lot of people with similar names so you will NOT remember "ideas" you have like one FAN club member was likely a young adult whereas another was likely older. Write it down in your notes. This applies to any ideas you have or estimates you make. You need estimates and ideas but you need to know they aren't "facts," too.]

Without analysis, you’ve just collected data points. When you start to analyze all your data points, though, you start to define people.

Check out this post for more about taking genealogy notes.

For more related posts check-out more about research logs or creating a research plan template!