Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
DB Engine
SQL ServerMSDESQL Server CE
Services
Analysis (Data Mining)Analysis (OLAP)DTSIntegration ServicesNotification ServicesReporting Services
Programming
CLRConnectivitySQLXML
Other Technologies
ClusteringEnglish QueryFull-Text SearchReplicationService Broker
General
Data WarehousingPerformanceSecuritySetupSQL Server ToolsOther SQL Server Topics
DirectoryUser Groups
Related Topics
MS AccessOther DB ProductsMS Server Products.NET DevelopmentVB DevelopmentJava DevelopmentMore Topics ...

SQL Server Forum / Other Technologies / Full-Text Search / August 2007

Tip: Looking for answers? Try searching our database.

An issue with Full Text Search against PDF blobs...

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Camilo - 09 Aug 2007 23:26 GMT
My client is trying to use SQL Server Full Text Search to search PDFs stored
in a varbinary(max) column. Full text search works fine for other Microsoft
Office documents stored in the same table but it does NOT give back any
results from the PDF documents.

Following is a high-level view of what the client did:

1. Installed Adobe PDF IFilter 6.0
2. Ran the stored procedure sp_fulltext_service (as documented)
3. Restarted the server
4. Verified that the filter got properly installed by querying the system
view
   sys.fulltext_document_types
5. Created a full text index on the table with the documents
6. Started a full population of the index
7. Ran a sample query with a string he knows is in the PDF file like the
following:
   select * from documents where freetext(document, ‘Review’) and got no
results
   back
8. Ran the same sample query with a string he knows is in some Word files
like the
   following: select * from documents where freetext(document, ‘SQL’) the
query
   returned several rows back as expected.

Does anybody know what might be happening here?

Thank you!

Camilo Leon
Russell Fields - 10 Aug 2007 17:03 GMT
Camilo,

Are you using 64-bit Windows and SQL Server?  If so, last time I looked the
Adobe PDF IFilter was only 32-bit.

RLF
> My client is trying to use SQL Server Full Text Search to search PDFs
> stored
[quoted text clipped - 29 lines]
>
> Camilo Leon
Camilo - 10 Aug 2007 18:52 GMT
Russell,

No, we are using a 32-bit machine in this case.

I did make FTS work with PDF files by creating a new table, importing the
data, creating a new catalog and populating it again.

Thanks!

Camilo

> Camilo,
>
[quoted text clipped - 35 lines]
> >
> > Camilo Leon
Hilary Cotter - 14 Aug 2007 04:00 GMT
Check the gatherer logs to see what the status of your population was.

Signature

relevantNoise - dedicated to mining blogs for business intelligence.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

> My client is trying to use SQL Server Full Text Search to search PDFs
> stored
[quoted text clipped - 29 lines]
>
> Camilo Leon
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.