Statistics

These statistics are provided only as a guide to help you understand the layouts presented here

I did not author the analyzers that produced these stats, and I have made every effort possible to represent all layouts in equal "apples-apples" terms, and report these statistics accurately. Nevertheless, I encourage anyone interested to conduct their own tests; I have provided all the configuration info and files necessary to do so on the individual layout sections and the Download page. If there are any errors in reporting, they are purely accidental, and I will correct them as soon as I can.

A word about Keyboard Layout Analyzers (KLAs) and stats

Click for a wordy ramble about why I think that although these stats are useful, they are not the last word in deciding what layout is right for you.

Just as with layouts…

Isn't there one analyzer that is just better than all the others? 

No, there isn't.

Every Keyboard Layout Analyzer (KLA) is built around an important set of statistics and algorithms that reflect design priorities.  One analyzer may be designed to highlight the always important same-finger bigrams (SFBs), and another may be designed to help reduce the load on specific fingers (ie. pinkies), while still another might emphasize finger or hand alternation. These are often competing priorities. You can get fewer SFB by spreading letters out among all the fingers, but that runs the risk of making the pinkies or ring fingers do too much work. There are myriad issues involved, especially in the scoring algorithms that produce rankings. The rankings for any KLA must reflect those priorities, (with a given sample corpus). 

Then comes the decision you must make regarding those priorities and scores.  Does moving your ring finger from top to bottom row make you squirm as much as it does me? Some layouts put a heavy burden on the ring fingers, and so they may not be right for you, even if they have amazing overall scores. Some layouts put a much heavier burden on the index fingers than others, and might score really well on one analyzer, might score poorly on another, or have trouble integrating with other smart keyboard features (i.e. home row mods). Obviously, simply scoring well on one analyzer doesn't necessarily mean it's the absolute best layout for you. That it is why it is so critical to know what those priorities are (and the corpus) in order to interpret the scoring of any KLA. 

Here's a list of analyzers I've used (or at least consulted their algorithms), in no particular order:

There are a number of analyzers based on Patrick Gillespie's (PatorJ) KLA that produce a linear ranking of several layouts.  PatorJ) KLA puts equal weight on individual finger movement, use of particular fingers, and hand alternation. Ian Douglas' KLA at kla.keyboard-design.com is based on the Den1 fork, with similar priorities to tho PatorJ original but with a different ranking scheme that favors layouts with hand alternation or those that shy away from heavy burden on any one finger—BEAKL, Dvorak, & MTGAP like layouts that favor hand alternation preform better there (i.e. Hands Down). SteveP [SP] fork scoring reflects greater sensitivity to SFB and less to the burden placed on any given finger (or hand alternation), meaning that layouts that may demand more of any one finger (esp. index) will perform well provided the SFBs are low—Colemak like layouts perform better there. It appears that the  (Den 1,2,3) KLA has resurfaced in a new version (Den4)  that addresses more variables, including (inward) rolls and hand balance.  The scoring with Den4 weighs use of specific fingers to penalize layouts that abuse pinkies (or any other finger). You'll find that BEAKL layouts perform exceptionally well there. Ian Douglas' KLA newer KLANext has perhaps the most accurate measurements of the PatorJ derivatives, with appropriate measures for finger distance and same finger use, the two most important metrics. The scoring also seems to me to more accurately reflect an actual typing experience, (if that is even possible!).

While all these KLAs share a common heritage, they have different scoring priorities, and some measure things a bit differently. Currently, all the PatorJ derived KLAs will incorrectly score (inflate the score) of any layout that is missing letter/characters found in the sample corpus, so I have included all the standard characters, and then some, to try to be as accurate as possible. (Drop me a note if you find any errors by clicking on the info icon at the bottom of the page.) I've tried to ensure that other layouts used for comparison also have all the glyphs, and are on similar boards, to reduce the variables involved and not handicap anything.

Other analyzers/optimizers are more complicated to set up (Carpalx, MTGAP, etc. use python, C++, Rust, etc.), but can provide deeper insights. Some can iterate on a set of weightings (simulated annealing) to produce optimized layouts. I'll write more about my experience with some of these later. In the mean time, you may want to check out the Alt Keyboard Layouts (AKL) Discord for a lot of really interesting work being done on layouts and analyzers.

I really do not think any one analyzer is better than all the rest. 

There are many other layout analyzers that don't use JSON templates like these, and each has differing methodologies. The slick Colemak Mod-DH analyzer, for example, is fast, somewhat configurable, and focuses on the very important same-finger bigram problem; but its weighting of SFBs is fixed, and it doesn't handle some scenarios (like letters on thumbs, different corpora). You can easily copy a Hands Down layout from the top of each section and paste it into the analyzer to see its take on SFBs, especially. I think it is important to look at any layout from many points of view, with many sample texts, to see how it might perform under different real-world conditions. 

 I feel that any layout within about ±3% effort are probably comparable in general efficiency,
so what matters is how it feels to the individual, on their own keyboard, with their own texts.  

The above tests are from KLANext, which at this writing (Aug '21) seems to have the most accurate individual measurements of the PatorJ-style analyzers (esp. distance).  I also happen to feel that the weighting used by KLANext  for its ranking is very close to my own typing preferences (rankings are rather subjective). There are several types of KLAs, and all KLAs ranking systems vary, and individual typing preferences (and texts) vary, which is why I don't particularly like linear ranking as a means to say which is better. There are far too many things to consider, and layouts that have very similar scores may feel very different to type on. These scores are thus provided to help you get a relative sense of what these layouts may be like if you were to try them yourself. You may find the weighting used by another KLA is more suited to your preferences, so go get the JSON files and try them out on another compatible KLA (listed below).

Patrick Gillespie's original KLA

"The optimal layout score is based on a weighed calculation that factors in the distance your fingers moved (33%), how often you use particular fingers (33%), and how often you switch fingers and hands while typing (34%)."

SteveP's [SP] fork of PatorJ's

"The layout score is based on a weighed calculation that factors in a distance penalty due to how much your fingers moved (50%), how often you use particular fingers (20%), and how often you switch fingers while typing (30%). See the About page for detailed information."

kla.keyboard-design.com
(Ian's fork of Den1's fork of PatorJ's original)

"The optimal layout score is based on a weighed calculation that factors in the distance your fingers moved (33%), how often you use particular fingers (33%), and how often you switch fingers and hands while typing (34%). Lower scores are better, means less effort."

Oxey's Layout Playground

An extremely handy tool for poking around with variations of a layout's alpha layer to find what may work for you.

Layout Stats

The above stats from SteveP's Colemak-DH analyzer, all using simple matrix layout and default scoring w/o num row.

Any statistic is only a measure of what you're looking for,
and is influenced by how you look for it.

All tests above are all on ergodox type layouts, shift on thumbs, with Alice in Wonderland ch 1, unless otherwise stated.  The JSON  files used were identical on all tests with all analyzers. You should be able to easily reproduce these tests on the various KLAs with the JSON files provided on the download page, and conduct tests using your own sample texts.

It is clear with these comparisons of PatorJ derived KLAs that the analyzers are not at all the same. Each was each designed to score the layouts emphasizing different criteria. There are many other analyzers that require some programming, like those that produced MTGAP, CarpalX, genkey, (and many more, like a200, Oxeylizer,  the Keyboard Genetics that produced Halmak, and the analyzers behind the French Bépo and German AdNW and layouts–all interesting analyzers producing unique insights). Each of these analyzers approach the problem in different ways, and therefore reveal the analyzer designer(s)' different understanding of typing, and what is important to them. Just because a layout scores better than another on one analyzer does not mean it is inherently better. 

An analyzer only reports what it is looking for, and no analyzer to date measures all possible factors that influence typing speed or comfort. Metrics regarding inward and outward rolling, "pinballing" or "redirects", row steps or jump, (and the directions), and the differences on various form factors are just a few of the many concerns. To date, I know of no analyzers that can assess the impact of advanced features such as Home Row mods (which  reduces finger motion), combos (including lingers, which also reduce distance traveled), and especially things like Adaptive Keys, (which have further reduced SFBs on my implementations of Hands Down.)

It is therefore very important to understand the scoring criteria before you can make a useful judgement about which layout might be more suitable for you. A layout that consistently scores well across a variety of scoring systems would likely be more suitable for "general" use. Other layouts that perform very well on one analyzer with a given text, but poorly on another, might be better suited for very specific use cases. 

"Lies, damned lies, and statistics" — Mark Twain

Not tested on endangered pinkies

The test above is from Den's (Den3) KLA, using Alice in Wonderland Chapter 1 as the sample text, in an attempt to simulate real-world typing challenges. And, honestly, because designing a layout feels a bit like falling down a rabbit hole.  Den's KLA, derived from Patrick Gillespie's, considers more factors with different weighting that I feel better approximates the performance of a layout on the physical keyboard. I have selected for comparison some of the better known high performance layouts, many produced by simulated annealing algorithms, and assured that each layout was on similar ergonomic keyboards to which their designs are well suited, and all have shift on thumbs.  (I use the Hands Down Alt variation).

No Cherry Picking Statistics

Hands Down ranked #1, and averages 6% better total effort than the #2 layout, with 13/21 English prose and 6/12 academic and other specialized texts. If not #1, Hands Down always ranked in the top 3, and averaged only 3% worse effort than the #1 layout. Together, Hands Down had the highest average rank of any other layout tested across a wide variety of sample texts, with various KLAs. In a test of 200K words of literary, academic, personal prose, only 1/7 being mine, plus a few typing challenges and word lists to cover special characters and infrequent data entry tasks, Hands Down Reference and Hands Down Alt were statistically even with X8.3 ranking #1. BEAKL Opted4 ranked #2 (+1%) effort, Colemak-DH was #3 (+4%), MTGAP #4 (+6%), Notarise #7 (+13%), Workman #8 (+21%), and Dvorak #9 (+31%) greater effort. I take this to mean only that Hands Down is very well rounded, suitable for everyday use. 

The tests below are from the  Patrick Gillespie's (PatorJ) KLA (first image) and SteveP (second image) fork of the same.  The stats for Hands Down Alt below are from the KLA at keyboard-design.com. Unfortunately, it appears that the Den's KLA (above) has disappeared, or at least moved (mirror here). These tests show that Hands Down performs well with a variety of analyzers that emphasize different attributes of a layout.  (see my note about these KLAs.) No other layout I examined was more consistently at or near the top regardless of KLA used or sample text applied. But don't just take my word for it, get the JSON files and test it for yourself, to see if it might be worth trying yourself.

Disclaimer: Stats were sampled on 2020/10/24. Many of the sites referenced are under active development. Changes to the site or your browser could result in differences from the stats reported here. Some of the sites may no longer be available, and the keyboard definitions or sample texts may have changed since I used them.

More Hands Down Statistics* — the recipe for delicious bigrams

A menu of same-finger bigrams

I've evaluated Hands Down with many different analyzers, and where possible with a huge variety of sample texts. It consistently ranks in the top 5 layouts by some metric or another: frequently #1, sometimes at 4 or 5, but usually in the top #2 or #3 (by layout family)But stats are not the whole story. For the reasons stated above in the design goals for this project, I think that Hands Down has some merit. Here are some detailed stats from the  Colemak-DH(mods) analyzer, that is great for quickly discovering potential problems with same-finger bigrams.


Allergy Warning. The stats here were processed on digital equipment developed by others, and used to develop (or promote) other layouts. These numbers represent only a snapshot in time, based on a weighted list of bigrams from one convenient analyzer, the Colemak-DH Layout Analysis Tool, with default settings. It's a reasonable, fast, configurable tool. Actual stats, like hardcore across the board analytics, are huge, complicated affair, and suggest a variety of conclusions. Use a different test bed, or a different methodology, and you'll likely have very different numbers. I could have tweaked these stats to reflect my own bias regarding index finger fatigue, as mentioned above, to show Hands Down in an even more favorable light (I have, and it does). I am offering these because they are clear and concise, are reasonable approximations of other results, and they are not my own calculations. All of the stats are from others' freely available analyzers, and are all noted throughout. I figure that if Hands Down performs well by someone else's metric, then it might be objectively something of value to others.

Poisson distributions: ratios of good vs bad bigrams

All Hands Down variations deliberately try to distribute finger burden roughly aiming to slightly favor the index and middle fingers while producing a functional balance between the critical same-finger bigrams (SFB), neighbor-finger bigrams, and individual finger usage frequency.  A layout with the lowest SFBs may over burden some fingers, or cause awkward row jumps, or other undesirable traits in practice. Any single metric alone cannot represent the best layout design.

Balanced Hand and Finger Diet

finger 0 6.31% finger 9 8.08%

finger 1 11.56% finger 8 10.06%

finger 2 15.62% finger 7 20.65%

finger 3 16.73% finger 6 10.99%

total L 49.77% total R 50.23%

Although this near perfect balance is exceptional, I feel that anything under about 3~4% feels balanced in practice...Any more than that and I can definitely feel the imbalance after a typing session. Swapping E<->A results in 16.98% for finger 7, and 13.07% for finger 9. See below for Pink E, Roll E, and Thumb E variations that all reduce the burden on finger 7, albeit with somewhat lower overall statistical scores.

Same-Finger Lickin' Good Bigram Frequency

finger 0 0.009% finger 9 0.003%

finger 1 0.319% finger 8 0.021%

finger 2 0.193% finger 7 0.360%

finger 3 0.116% finger 6 0.179%

total 1.309%

Hands Down primary design goal was to reduce same-finger bigrams, especially on the pinkies. It has among the lowest same-finger bigram burden and total bigram penalties in this critical metric. This is the measure that you feel really quickly when typing, perhaps more than any other. I feel that anything over 0.3% on any one finger is noticeable, and annoying with anything over 1%, or anything at all on the pinkies.  But a warning, too: some layouts with really good bigram stats suffer with awkward sequences that are so uncomfortable that a slightly higher bigram number may be worth it.  Hands Down is no exception. There will be awkward sequences

Fat Free Same-Finger Bigrams

finger 1 NC 0.295%
finger 7 E. 0.245%
finger 6 FU 0.095%
finger 2 HR 0.079%
finger 2 RL 0.075%

Hands Down has  among the lowest frequency same-finger and neighbor-finger bigrams, yet does not over-burden the awkward ring and pinky fingers. The added burden is taken up primarily by the strong middle finger. Phonetic analysis helped to isolate bigrams and balance finger usage to avoid co-occurrent phonemes placed on the same finger columns where possible, (QSX, CRM, HNL, and all 6 letters on each index finger). U attracts the fewest bigrams of any vowel so it is exposed to the most letters, and OE is the least frequently occurring vowel bigram, though they are the first and fourth most frequently occurring letters.

Low Carb Neighbour-Finger Bigrams

finger 8–7 BE 0.644%

finger 1–2 CH 0.518%

finger 9-8 AI 0.483%

finger 7-8 E, 0.406%

finger 8-7 IO 0.328%

Exceptionally low frequency neighbour-finger bigrams, similar to the other top ranking layouts. Trigram analysis is much harder to measure, and very sensitive to physiological sensations, and linguistic (word/meaning) influences because we're getting closer to spelling/sounding words out. I couldn't find an analyzer to help. The theory is that if the bigram numbers are low, then the trigrams will require a shift away from a finger or two, and it'll sort itself out. This is true in practice, but some awkward sequences still occur. The C-vowel-M tri-gram sequence is the most annoying to me on Hands Down.

Avocados are good for you

Some of the better performing layouts encourage "good bigrams and trigrams" as a part of a healthy typing diet, rather than avoiding same-hand bigrams altogether. Depending on the pattern, I agree with this. On the consonant hand, the most common consonant bigrams are deliberately placed for comfortable same-hand rolls or rakes, such as TH, and the most common N-bigram group of NG, NT, ND, and WH, LD, and the GH+T bi-gram is very easy one-hand sweep "the blighted ghost is happy." On the vowel hand, the least common vowel bigram, OE, is on the powerful middle finger. So, while finger 7 is heavily used, it is almost never used in quick succession on a different key. The result is that all of the vowel bigrams are easy rolls. OU IO EA EI AE IE. Bonus: LS isn't bad, a common Dvorak complaint. The L-R hand inversion of Dvorak is intentional, as it leaves more keys on the hand they were on QWERTY, which has made the layout easier for me to learn, and may help when necessity arises that you must resort to QWERTY.

Haute Qui-scene

Check out this layout evaluator that compares many layouts using weighted bigram lists from multiple languages. It shows many interesting things, including the reality that there are many good layout options that perform statistically similar to each other, along the lines of what I've discussed and recommended here. It somewhat confirms a key design goal: Hands Down performs very well in English without penalizing use of other languages. Sure, other optimizations will be better in a given language or a given corpus, and different analyzers will argue one layout is marginally better than another based on the stats or weighting schema they use, but in practice I can only use one layout at a time...

A layout is also a matter of taste

Artichokes to Ziti: I'm not trying to sell you on Hands Down, or make claims about the other layouts. I'm just excited that it works for me, and wanted to share it. You decide if it works for you, or doesn't. Sometimes significant variation occurs with different statistical weighting or sample corpus being analyzed. While a good number of layouts are close to Hands Down statistically, they all arrive at those similar-ish results in different ways, which means that typing on statistically similar layouts will likely feel very differentand that feel is very important. As much pun as I have had here with the name, there is, simply, no way to claim that any layout is, Hands Down, the beets, nor would that necessarily mean you should like borscht. Variations are GOOD FOR YOUchange it to suit your needs. Some keystroke patterns will freak someone out, and be no problem for someone else. That's why it's called the Hands Down Reference layout. I expect you to season it to suit your taste.

So always take stats with a dash of salt, knowing that what matters most in any Human Computer Interface is always the particular human. The important thing is that it works for the user, across a variety of texts they are likely to encounter (including languages). All this analytic stuff is just to help develop the layout, then communicate it to those who might want to use itlike a menu. It may look and sound delicious, but all that matters in the end is whether it is delicious to you.

English Prose tests at Keyboard-Design.com

Hands Down (and Hands On) for standard (ansi/iso/jis) keyboards

It is very important to compare layouts on like keyboards. The keyboard itself has a profound influence on the typing efficiency and comfort. These tests all use layouts on a standard ansi keyboard.

You can download the JSON files (above) to see just how good Hands Down is on a run of the mill row staggered keyboard. Or you can download the OS Layout bundles (below) and install Hands Down now to use on the keyboard you already have. Hands Down Reference, Alt, and Extreme work very well on a typical row staggered keyboard. So do Hands On and Notarise. All of these are included in the native OS layout bundle installer.

kla.keyboard-design.com with a 22k word corpus of 1/2 english prose, 1/4 academic prose, and 1/4 javascript code. All layouts are on a standard ansi/iso/jis type keyboard, shift on pinkies, etc.

Hands Down was designed for programmable, split  ergonomic keyboards with shift and other mods on home row (or on thumbs). But even without the split ergo keyboard with thumb keys or home-row mods, Hands Down is still a superior layout.