Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
DB Engine
SQL ServerMSDESQL Server CE
Services
Analysis (Data Mining)Analysis (OLAP)DTSIntegration ServicesNotification ServicesReporting Services
Programming
CLRConnectivitySQLXML
Other Technologies
ClusteringEnglish QueryFull-Text SearchReplicationService Broker
General
Data WarehousingPerformanceSecuritySetupSQL Server ToolsOther SQL Server Topics
DirectoryUser Groups
Related Topics
MS AccessOther DB ProductsMS Server Products.NET DevelopmentVB DevelopmentJava DevelopmentMore Topics ...

SQL Server Forum / Other Technologies / Full-Text Search / March 2007

Tip: Looking for answers? Try searching our database.

Phonetic Searching and Full Text Indexing For Business and Personal Names an Addresses

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
stabbert@purestrategy.net - 10 Mar 2007 13:36 GMT
I am undertaking a project which requires the ability to perform
phonetic searching across business and personal names and addresses.
This data is relatively uncleansed to date.  The amount of work to
cleanse the data is tremendous based upon existing data analysis.  My
thoughts are that if we can find the right tools we should be able to
perform matching and searching across the data without implementing
heavy duty cleansing prior to the search.  When I talk about
uncleansed data in terms of names I am talking about things like the
following:

Acme, Inc
Acme, Incorp.
Acme Incorporated

**Note variations in abbreviations of Inc. this occurs on other words
such as Company, Association, Limited, DBA and other unknowns.

Smith Plumbing
Smyth Plumbing
Kool Runnings
Cool Runnings

**Note variations in phonetic spelling of words

Ultimately what I am wondering is to what success have people had with
searching fuzzy type searches like this using solely Microsofts Full
Text Indexing.  I would also be interested to know if SQL Server 2005
improves on the capabilities of fuzzy matching.  In addition I found
some information regardng SQL Turbo http://www.quest.com/sql_turbo/
which seems to be an extender or different implementation of FTS which
does support phonetic matche and I would be interested to know if
anyone has used this tool as well.  Any thoughts on how to approach
this matter would be appreciated.
Hilary Cotter - 10 Mar 2007 17:01 GMT
You need to build a thesaurus with all the alternative spellings to
implement this in SQL FTS. You could try to cleanse your data before
indexing it using fuzzy grouping, or by rolling your own levenstein edit
distance. SQL FTS fuzzy search (aka freetext search) will stem your search
phrase for verb forms but will not do the kind of fuzzy searches you are
looking for.

There was a time when SQL Turbo was actually faster than SQL 2000 FTS, but
SQL 2005 is now faster, but I have not compared it against the new version
of SQL Turbo yet.

Signature

Hilary Cotter

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

>I am undertaking a project which requires the ability to perform
> phonetic searching across business and personal names and addresses.
[quoted text clipped - 29 lines]
> anyone has used this tool as well.  Any thoughts on how to approach
> this matter would be appreciated.
Russell Fields - 12 Mar 2007 20:42 GMT
Have you looked at: http://www.codeproject.com/string/dmetaphone6.asp

The double-metaphone algorithm is available and has been implemented as an
extended stored procedure.  It is pretty good for people names, but I don't
know if it would work well for business names.

FWIW - RLF

>I am undertaking a project which requires the ability to perform
> phonetic searching across business and personal names and addresses.
[quoted text clipped - 29 lines]
> anyone has used this tool as well.  Any thoughts on how to approach
> this matter would be appreciated.
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.