SQL Server Forum / Other Technologies / Full-Text Search / August 2007

Dtsearch

gymtrym@hotmail.com - 29 Aug 2007 17:42 GMT
Hello,

I have an issue with Dtsearch - I have very large .pdf documents in
excess of 4000 pages that I am using Dtsearch to index.  It seems that
around the 4000-page mark Dtsearch stops indexing and does not display
any error message.  I have tried adjusting the available memory and
nothing seems to work.  It appears to be the number of pages in the
document, not the file size of the document, that is causing the
problem.  Has anyone seen this before?

Thanks

M
Hilary Cotter - 29 Aug 2007 21:04 GMT
Contact DTsearch about this limitation.

Signature

RelevantNoise.com - dedicated to mining blogs for business intelligence.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

gymtrym@hotmail.com - 29 Aug 2007 22:50 GMT
I have, and they responded that they don't have a limitation.  It
appears that it is the amount of content that is causing the issue.
Hilary Cotter - 30 Aug 2007 00:05 GMT
Last time I checked, Microsoft was not supporting DTSearch. I am not sure why
you are posting here.

Some comments on your problem - by default Microsoft search products only
index up to 256k (IIRC) of text content in any document (doc, txt, pdf).

Also, PDFs consist of text and images. Sometimes what appears as text in a
PDF is actually an image.
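
One way to tell these two causes apart is to look at the text each page yields. The sketch below is only an illustration and is not part of DTSearch or any Microsoft product: it assumes you have already extracted the text of each page with some separate PDF tool (not shown) into a list of strings, and the 256k figure is just the IIRC number mentioned above, used as an assumed default. The function name and parameters are hypothetical.

```python
from typing import List, Optional, Tuple

# Assumed limit for illustration only: the "256k (IIRC)" figure quoted above.
ASSUMED_CHAR_LIMIT = 256 * 1024


def diagnose_pages(
    page_texts: List[str], char_limit: int = ASSUMED_CHAR_LIMIT
) -> Tuple[List[int], Optional[int]]:
    """Return (pages with no extractable text,
    first page where cumulative text exceeds char_limit, or None)."""
    empty_pages: List[int] = []
    limit_page: Optional[int] = None
    total_chars = 0
    for number, text in enumerate(page_texts, start=1):
        # A page whose text is blank is probably a scanned image.
        if not text.strip():
            empty_pages.append(number)
        total_chars += len(text)
        # Remember only the first page that crosses the assumed limit.
        if limit_page is None and total_chars > char_limit:
            limit_page = number
    return empty_pages, limit_page


if __name__ == "__main__":
    # Hypothetical per-page text; in practice this comes from a PDF extractor.
    sample = ["a" * 100, "", "b" * 100, "   "]
    print(diagnose_pages(sample, char_limit=150))
```

If the reported limit page falls close to where indexing stops, a cap on text content is the more likely cause; if the pages after that point show up as empty, they are more likely images rather than text.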

 