With 26 letters in the alphabet halfway through is between M and N, right?
Not if you're looking at an alphabetic list by surname frequency.
In our research we often find ourselves looking for a particular name in an alphabetically ordered list where there's no index, just a block of pages or images and you have to guess how far in to go to find the name you want.
Here's a table you may find useful showing the cumulative percent of the way through to go to find the start of an initial surname letter.
It varies depending on nationality. Wales loves Jones and Jenkins, Scotland has a lot of names beginning with Mc. Those distributions are calculated from a table at the Surname Studies website. Ottawa figures are based on the 2009 phone book.
For example, if you're looking for Smith in a Scottish database the table shows that the S surnames start 78% of the way into the database.
England | Wales | Scotland | UK | Ottawa | |
---|---|---|---|---|---|
Cum % | Cum % | Cum % | Cum % | Cum % | |
A | 0 | 0 | 0 | 0 | 0 |
B | 3 | 2 | 3 | 3 | 4 |
C | 14 | 8 | 11 | 14 | 14 |
D | 22 | 12 | 19 | 21 | 22 |
E | 26 | 20 | 25 | 26 | 29 |
F | 29 | 25 | 26 | 28 | 30 |
G | 32 | 27 | 30 | 32 | 33 |
H | 37 | 31 | 35 | 36 | 39 |
I | 46 | 39 | 40 | 45 | 44 |
J | 47 | 40 | 41 | 46 | 44 |
K | 49 | 51 | 43 | 49 | 46 |
L | 51 | 52 | 45 | 51 | 49 |
M | 56 | 57 | 49 | 55 | 58 |
N | 63 | 63 | 70 | 64 | 69 |
O | 65 | 64 | 71 | 66 | 70 |
P | 66 | 67 | 72 | 67 | 72 |
Q | 72 | 73 | 76 | 72 | 77 |
R | 72 | 73 | 76 | 72 | 78 |
S | 77 | 80 | 81 | 78 | 83 |
T | 86 | 85 | 90 | 86 | 91 |
U | 90 | 90 | 93 | 91 | 94 |
V | 90 | 90 | 93 | 91 | 94 |
W | 91 | 91 | 94 | 91 | 96 |
X | 100 | 100 | 100 | 100 | 99 |
Y | 100 | 100 | 100 | 100 | 99 |
Z | 100 | 100 | 100 | 100 | 100 |
Often you don't have the whole range to work with. Wouldn't it be nice if there was an app where you could enter page numbers and surname from the beginning and end of the range and get an estimate of the page for a particular surname Hey RootsTech, that would be an innovation.
7 comments:
Can you explain more clearly?
Thank you
Can yo explain the chart more clearly?
Thanks You
I added an example which should help.
Very interesting, John.
My Gillespie surnames will be usually 1/3rd of the way through, while the Pettygrove/Pettigrew surnames are around 2/3rds of the way. Cool. I tend to jump back and forth in such databases or indexes, but this might make my jumps a bit more logical and useful. Thanks for the post.
The table might be a bit clearer if you were to round to the nearest tenth of a percent instead of just to the nearest percent, at least for the last few rows. As the table is now, nearly all columns round to 100% by "X".
A good point Chad, to be weighed against giving a deceptive impression of accuracy.
When searcrhing for a surname ending in W through Z, I've always found the easiest way to find it is to start at the last page and work backwards.
Post a Comment