This is best handled with custom word breakers. Note the international
English word breakers evidently know about *tion =*xion and other British
Spellings.

Signature
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
> This is best handled with custom word breakers. Note the international
> English word breakers evidently know about *tion =*xion and other British
> Spellings.
Are you saying that if I use International English word breakers that it
will stem both *tion and *xion?
Would it stem both colour and color to the same set of words?
Are there third-party Word Breakers / Stemmers available that might do such
things?
Or would I need to write it myself?
Thanks for the clarification,
Brian
Hilary Cotter - 08 Apr 2008 13:10 GMT
This is an answer I got on the question on the difference between US and UK
(International English) from one of the main search developers several years
ago.
Neither XXXX or I are aware of any significant differences between the two.
I think there may be some ISE/IZE differences in the lexicons.
Microsoft purchased the original word breaker product from InfoSoft so they
may not be aware of all the idiosyncrasies of it.
What I think this means is that if you search on exercize (UK English) it
knows to stem it as exercizing, exercized, as opposed to doing matching with
exercise.

Signature
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
>> This is best handled with custom word breakers. Note the international
>> English word breakers evidently know about *tion =*xion and other British
[quoted text clipped - 12 lines]
>
> Brian