Can I see what is inside Full-Text Search Catalog?

Marko Mladenovic - 16 Aug 2005 10:33 GMT
I am having trouble finding out what is inside a full-text catalog, especially
which words are indexed from my tables. I want to improve some search
capabilities, and I need to know the structure of the index (catalog). My
improvements will probably involve approximate string matching by word
distance, so I need the set of indexed words. I work with SQL Server 2005.
Thanks in advance,
Marko Mladenovic
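
For readers unfamiliar with "approximate string matching by word distance", here is a minimal Python sketch of one common interpretation, Levenshtein edit distance, applied to a set of indexed words. It is not taken from the thread: the names `edit_distance` and `close_words` and the sample word list are hypothetical, and the list stands in for whatever set of indexed words can actually be obtained from the catalog.

```python
from typing import Iterable, List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions cost 1."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


def close_words(query: str, indexed_words: Iterable[str],
                max_distance: int = 1) -> List[Tuple[int, str]]:
    """Return (distance, word) pairs within max_distance, nearest first."""
    query = query.lower()
    scored = ((edit_distance(query, word.lower()), word) for word in indexed_words)
    return sorted(pair for pair in scored if pair[0] <= max_distance)


# Hypothetical sample; a real list would come from the catalog contents.
indexed_words = ["server", "service", "search", "catalog", "index"]
print(close_words("serch", indexed_words))
```

A linear scan like this is only practical for small word sets; a larger vocabulary would need some form of indexing, which is outside the scope of this sketch.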
Hilary Cotter - 16 Aug 2005 11:16 GMT
Have a look at cidump. You can find it in
C:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\Binn

John Kane - 16 Aug 2005 16:13 GMT
Marko,
The SQL Server 2005 Beta 2 and Beta 3 Books Online (BOL) have yet to be
updated to cover this most useful SQL FTS utility. In the meantime, you should
check out the cidump utility. Run cidump /? to get the syntax as well as
examples of use. The utility is located at
\MSSQL90\MSSQL.1\MSSQL\Binn:

cidump /?
-- edited output:

Use cidump.exe for:
- Dumping the content of the catalog or a specified index
      cidump -dump <catalog_path> [options]

- Checking the integrity of the catalog or a specified index
      cidump -check <catalog_path> [options]

- Computing statistics on the content of the catalog or a specified index
      cidump -stats <catalog_path> [options]
...

Display global statistics for the index 0001002A:
   cidump -stats c:\catalog -g -i 0x1002A

cidump /??
Advanced options:

  -x [<format>] - Dump the index
       <format> can be also:
                   ks  - statistics per key
                   kw+ - keys and wids and wid metadata (all but occurrences)
                   kp  - list of keys with position in index
                   kwp - list of keys and wids with position in index
                   kph - dump phrases that contain a given key.
                         Requires -phk and -phc options.
                   ph  - dump all phrases in more than a given nr of docs
                         Requires -phc option.
  -dir [<format>] - Dump the index directory
       <format> can be:
                      kbo - keys and BitOffset in index (default)
                      kp - list of keys with position in the directory file
  -phk <key> - Used only with "-x kph" dump format.
               Dump phrases that contain the given <key>. Use also -phc.
  -phc <minWidCount> - Used only with "-x ph" and "-x kph" dump formats.
       Dump all the 2-3 word phrases that occur in at least <minOccCount>
       documents in the indexed corpus.
  -kwc <minWidCount> <maxWidCount> - Display only keys in a widcount range.
  -alr - Display the allocated ranges for the master index.
  -fbs - Force binary search when dumping widsets.
  -rec <index> <type> <maxWid> <R/W> - Dump a standalone index.
                                       Don't use the Index Table.
       <index>  - the index id (e.g.) 0x1001C
       <type>   - 0 - master index; 1 - shadow index.
       <maxWid> - the maximum workid in the index.
       <R/W>    - 0 - complete index; 1 - incomplete index (write mode)

Enjoy!
John
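
If cidump needs to be run repeatedly, a small Python wrapper can assemble the command line. This is a sketch only: it uses just the three modes and the argument order shown in the help text above (`cidump -<mode> <catalog_path> [options]`), the helper names `build_cidump_command` and `run_cidump` are hypothetical, and nothing is assumed about the format of the tool's output.

```python
import subprocess
from typing import List, Sequence

# The three modes listed in the "cidump /?" output quoted above.
CIDUMP_MODES = ("dump", "check", "stats")


def build_cidump_command(mode: str, catalog_path: str,
                         options: Sequence[str] = (),
                         exe: str = "cidump") -> List[str]:
    """Build an argument list of the form: cidump -<mode> <catalog_path> [options]."""
    if mode not in CIDUMP_MODES:
        raise ValueError(f"unknown cidump mode: {mode!r}")
    return [exe, f"-{mode}", catalog_path, *options]


def run_cidump(mode: str, catalog_path: str,
               options: Sequence[str] = (), exe: str = "cidump") -> str:
    """Run cidump and return its raw text output (the format is not interpreted)."""
    command = build_cidump_command(mode, catalog_path, options, exe)
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout


# Reproduces the example from the help text: cidump -stats c:\catalog -g -i 0x1002A
print(build_cidump_command("stats", r"c:\catalog", ["-g", "-i", "0x1002A"]))
```

`run_cidump` assumes that cidump is on the PATH; otherwise pass the full path to the executable in the Binn directory as `exe`.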
Marko Mladenovic - 17 Aug 2005 11:24 GMT
Thank you, Hilary and John; your answers were very helpful. Could you
answer just one more question, please? How can I relate a "wid" from a
catalog to the actual database records, i.e. table, row and column?
Thanks again,
Marko.

Hilary Cotter - 17 Aug 2005 19:20 GMT
You can't; it's a work identifier, and it changes with each population.

 