Talking with Ray today about some Verity things, I was sharing some tidbits about the english locale and performance that I realized should be published more widely.
First, some history. The Verity 5.5 version in CFMX7 is very different than the Verity 2.5 version we shipped in CF5/6/6.1. In particular, we (and Verity) defaulted to the "english" locale. Since then, I believe Verity has bought linguistiX, the company that used to provide their international locales (or at least its technology), and they now default to "englishx", where the "x" stands for linguistiX.
But ColdFusion MX 7 still defaults to english. Why? Two reasons.
1. To ensure compatibility with previous CF releases. Users expect to get the same results from indexing and searching when they upgrade to CFMX7.
2. Performance. The englishx locale is "better". According to Verity it is much better than the old english locale, where better means better search results. Getting those results takes more time when indexing the data, though. Since we were trying to maintain indexing performance, we stayed with the english locale, which indexes faster.
Bottom line: If you want better search results, and indexing performance is not critical, create your collections using English (advanced) in the CF administrator or "englishx" when using the cfcollection tag.
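For the tag route, a minimal sketch might look like the following. The collection name and path here are made up for illustration, so substitute your own; the part that matters is language="englishx".

```cfml
<!--- Hypothetical collection name and path; adjust both for your server. --->
<!--- language="englishx" asks for the newer locale instead of the default "english". --->
<cfcollection action="create"
              collection="mySiteDocs"
              path="C:\CFusionMX7\verity\collections"
              language="englishx">
```

Since the locale is chosen when the collection is created, an existing english collection would presumably need to be recreated and reindexed to pick up the change.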