Indexing text within pdf content as a text file

benjamin.soulier@gmail.com - 22 Jan 2007 18:24 GMT
Hello everyone,

I have a very strange problem: I've installed SQL Server 2005 and created
a full-text catalog and a full-text index on a table. I've also
installed the Adobe IFilter. Searches work, except for one search term.
Let me explain.

I'm searching files stored in the database for "c#", and some of the PDF
files returned by the search do not contain "c#". When I open these PDF
files in Notepad and search for the string "c#", I do find it in the raw
file content, which would explain why the file is returned by the search...
Is that really the cause or not? (It seems to happen with Word files too...)

Thanks for your help.
Regards,

Ben.
Hilary Cotter - 22 Jan 2007 19:17 GMT
SQL FTS interprets C# as C#, but c# as c. So you need to capitalize the C in
your searches and content for this to work correctly.

Signature

Hilary Cotter

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

benjamin.soulier@gmail.com - 23 Jan 2007 09:00 GMT
Hello Hilary,

Thanks for your quick answer, but the problem still
remains. I was wondering if it has something to do with the full-text
index in the database, as I configured it to be case insensitive.
Do I have to set it back to case sensitive to make it work?
Do I have to change the noise word files (I've already removed the letter "C"
from the neutral and English files)?

benjamin.soulier@gmail.com - 23 Jan 2007 09:31 GMT
I ran a test of my own on this:
I created a new table with a full-text index in case-sensitive mode
and added documents to it.
I still get files that do not contain C# (for all document types).
I even tried CONTAINSTABLE and FREETEXTTABLE, but the problem is the
same...

Any ideas?

Hilary Cotter - 26 Jan 2007 03:51 GMT
I take it you are Swiss and using the German or French (or Italian) word
breakers. In these languages, c# and C# are indexed as c and C#, so a search
on c# or C# will match both c and c# (lower and upper case). English,
unfortunately, is the exception here: c# is indexed as c, and C# is indexed
as C#. A search on C# will therefore match C#, and a search on c#
will match c.
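
The English rule described in this reply can be pictured with a small toy model. This is only a sketch in Python, not the real SQL Server word breaker: the function names, the sample documents, and the exact-match lookup on tokens are all assumptions made for illustration. It shows how a search on c# could return files that contain only a lone "c".

```python
# Toy model of the English rule described above. This is NOT the real
# SQL Server word breaker; names and sample data are hypothetical.

def english_token(term):
    """Assumed rule: 'C#' is kept as 'C#', while 'c#' is reduced to 'c'."""
    if term == "C#":
        return "C#"
    if term == "c#":
        return "c"
    return term


def build_index(docs):
    """Map each token (after the rule above) to the set of documents holding it."""
    index = {}
    for name, text in docs.items():
        for word in text.split():
            index.setdefault(english_token(word), set()).add(name)
    return index


def search(index, term):
    """Apply the same rule to the search term, then look it up exactly."""
    return sorted(index.get(english_token(term), set()))


# Hypothetical documents: only a.pdf and d.doc mention the language at all.
docs = {
    "a.pdf": "learning C# today",
    "b.pdf": "vitamin c tablets",
    "d.doc": "c# basics",
}
index = build_index(docs)

print(search(index, "C#"))  # ['a.pdf']
print(search(index, "c#"))  # ['b.pdf', 'd.doc']
```

In this model the search on c# returns b.pdf, which never mentions c#, because both the search term and the lone letter end up as the token c. Under that assumption, capitalizing the C in both the content and the query is what keeps the # attached.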
 