Difference between indexing txt or word doc files, help!

Jim Florence - 13 Dec 2005 21:36 GMT
Hi,

I'm putting together a system and one of the requirements is to have a
searchable CV function.

I've got all the code to load the files into the image fields, and I've
indexed them and got it kinda working.

Before I go too far down the road, what is your opinion on holding txt files
instead of doc files in the table being searched? The SQL seems to be more
flexible when searching text rather than binary files, and the index files
themselves are smaller. For example, when I tried a LIKE clause it told me
this would only work against a varchar field (I'm thinking this may be a
schoolboy error, so forgive me).

My main concern is I'd have to do the text conversion automatically, any
pointers on this?

Does anyone have any views on the best way to go about this, or on holding
and full-text searching against Word files?

Many thanks for any help you can give

Jim Florence
Hilary Cotter - 15 Dec 2005 18:08 GMT
Text means faster indexing times, but not by much. With text you can query
the columns and read the contents; you can't do this with binary.

Search SQL is equally flexible with text and binary. You can only do a
LIKE against text or char columns.
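
To make the distinction concrete, here is a rough sketch of the two kinds of query. The table and column names (dbo.CandidateCV, CVDoc, CVDocType, CVText) are hypothetical and not from the original post, and the sketch assumes a full-text index already exists on both the image column (with its document-type column set) and the text column.

```sql
-- Hypothetical table, for illustration only:
--   dbo.CandidateCV(CVId int, CVDoc image, CVDocType varchar(4), CVText text)
-- Assumes CVDoc and CVText are both full-text indexed, and that CVDocType
-- holds the file extension (e.g. '.doc') used to pick the document filter.

-- Full-text search against the binary Word documents.
SELECT CVId
FROM dbo.CandidateCV
WHERE CONTAINS(CVDoc, '"project manager"');

-- The same full-text search against the plain-text copy.
SELECT CVId
FROM dbo.CandidateCV
WHERE CONTAINS(CVText, '"project manager"');

-- LIKE is a plain pattern match, so it only works on the character column.
-- Running it against the image column (CVDoc) raises an error.
SELECT CVId
FROM dbo.CandidateCV
WHERE CVText LIKE '%project manager%';
```

The CONTAINS queries go through the full-text index in both cases, whereas the LIKE query scans the stored text itself, which is why it needs a character column.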

To do the conversion, use filtdump -b (you can get this from the Platform
SDK), or use OLE Automation against the Word documents to extract the text
paragraph by paragraph.

Signature

Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

Jim Florence - 16 Dec 2005 00:05 GMT
Hilary,

Many thanks for that, very, very useful. I've started playing with the
indexing service as well to try and find a best fit.

I'll give this a go

Many thanks for such a quick and informative response.

Regards

Jim

 