Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
DB Engine
SQL ServerMSDESQL Server CE
Services
Analysis (Data Mining)Analysis (OLAP)DTSIntegration ServicesNotification ServicesReporting Services
Programming
CLRConnectivitySQLXML
Other Technologies
ClusteringEnglish QueryFull-Text SearchReplicationService Broker
General
Data WarehousingPerformanceSecuritySetupSQL Server ToolsOther SQL Server Topics
DirectoryUser Groups
Related Topics
MS AccessOther DB ProductsMS Server Products.NET DevelopmentVB DevelopmentJava DevelopmentMore Topics ...

SQL Server Forum / Other Technologies / Full-Text Search / February 2006

Tip: Looking for answers? Try searching our database.

Hidden Ignored Words?

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Jack - 21 Feb 2006 12:51 GMT
We are getting the dreaded "A clause of the query contained only
ignored words." error when a user searches for the following term:
"3.0" (without quotes).
I've checked all the noise word lists and "3.0" is not in there. "3" is
however.

Also, when one searches for "3.1" it's fine - no error.
When one searches for "3.00" it's also fine.

How can one explain this and which other "hidden" noise words should we
add to our noise word filter (in the search form)?
Hilary Cotter - 21 Feb 2006 16:20 GMT
remove the 3 and the 0 from your noise word lists. You will need to rebuild
the catalogs!

Signature

Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

> We are getting the dreaded "A clause of the query contained only
> ignored words." error when a user searches for the following term:
[quoted text clipped - 7 lines]
> How can one explain this and which other "hidden" noise words should we
> add to our noise word filter (in the search form)?
Jack - 21 Feb 2006 17:32 GMT
Aha! It's because of the "." being a word-breaker - is that correct?
Shall I remove it from the noise_eng file? or the enu...

I also noticed that a standalone "#" was being treated as a noise word
and throwing that ugly error. Can you explain that one?

Many thanks
Hilary Cotter - 21 Feb 2006 17:46 GMT
Yes . is a word or token boundary (actually there are some rules about
this). # is not a noise word when prefaced by a c, j or f.

Signature

Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

> Aha! It's because of the "." being a word-breaker - is that correct?
> Shall I remove it from the noise_eng file? or the enu...
[quoted text clipped - 3 lines]
>
> Many thanks
Jack - 22 Feb 2006 07:31 GMT
I actually need to understand where I could see a complete list of
noise words. # is not in the noise word list. Is it a token boundary?
Where is the complete list of these?
Currently we are fighting fires as these errors occur.
Hilary Cotter - 23 Feb 2006 02:09 GMT
The noise words are in the noise word lists. Some characters have special
significance - for example C#, and C++.

Signature

Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

>I actually need to understand where I could see a complete list of
> noise words. # is not in the noise word list. Is it a token boundary?
> Where is the complete list of these?
> Currently we are fighting fires as these errors occur.
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.