Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion Groups
DB Engine
SQL ServerMSDESQL Server CE
Services
Analysis (Data Mining)Analysis (OLAP)DTSIntegration ServicesNotification ServicesReporting Services
Programming
CLRConnectivitySQLXML
Other Technologies
ClusteringEnglish QueryFull-Text SearchReplicationService Broker
General
Data WarehousingPerformanceSecuritySetupSQL Server ToolsOther SQL Server Topics
DirectoryUser Groups
Related Topics
MS AccessOther DB ProductsMS Server Products.NET DevelopmentVB DevelopmentJava DevelopmentMore Topics ...

SQL Server Forum / Other Technologies / Full-Text Search / March 2005

Tip: Looking for answers? Try searching our database.

Indexing Service problem: Query returned only ignored words

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
dotNetCoder - 12 Mar 2005 08:17 GMT
Hello.
I'm trying to create a search engine for my Web Site using Indexing
Service. The content is stored in text files in UTF8 Encoding (arabic
text).
The search utility works well on my local server but it throws an
exception online: the query return only ignored words.
Every file contains the following statement in its header:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">.
I will be thankful for your help .
Hilary Cotter - 14 Mar 2005 16:07 GMT
at the top of your asp/aspx code page you have to use session.codepage= and
set it equal to the LCID for the particular form of Arabic you are using.

Signature

Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com

> Hello.
> I'm trying to create a search engine for my Web Site using Indexing
[quoted text clipped - 8 lines]
> *** Sent via Developersdex http://www.developersdex.com ***
> Don't just participate in USENET...get rewarded for it!
John Kane - 14 Mar 2005 17:33 GMT
dotNetCoder,
As you're using UTF8 encoding with Arabic text, you might want to checkout
the new
"Microsoft Arabic Word-Breaker (Arabic Search Engine) - Beta" at
http://www.microsoft.com/middleeast/arabicdev/beta/search/ and download the
Installation Guide and the Microsoft Arabic Word-Breaker.  Under the
"Installation Requirements for SQL Server 2000" section, you will find that
the Arabic Word-Breaker is "...the Full-Text Search service is enabled for
running queries using the new Arabic Word-Breaker."

Please, download it and let this newsgroup know if you find it effective in
resolving your FTS issues with Arabic HTML files, although, you most likely
will have to include the meta name="MS.Locale" content="AR">" (Arabic) html
tags.

You might find the following material useful - In addition to the BOL
documentation, there is now on MSDN - "Arabic
Language Support in Microsoft SQL Server 2000" at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnsql2k/html/sq
l_arabicsupport.asp

that might also be helpful to you.

Additionally, and assuming that the HTML documents that you are storing in
your IMAGE column PageText are in Arabic, could you confirm that all of the
HTML files have the correct language identifiers? Specifically, confirm if
there are any <meta> tags within the <header> tags and that they are
"<head><meta name='ms.locale' content='EN-US'></head>"  vs. "<meta
name="MS.Locale" content="AR">" (the latter is Arabic). For more info see
the "HTML Filter" at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/indexsrv/html/i
xufilt_2uuq.asp

as well as http://www.otal.umd.edu/uupractice/non_english/

Thanks,
John
Signature

SQL Full Text Search Blog
http://spaces.msn.com/members/jtkane/

> Hello.
> I'm trying to create a search engine for my Web Site using Indexing
[quoted text clipped - 8 lines]
> *** Sent via Developersdex http://www.developersdex.com ***
> Don't just participate in USENET...get rewarded for it!
dotNetCoder - 17 Mar 2005 12:05 GMT
Hilary and Jone, I want to thank you for your help.
Actually till now I couldn't resolve the problem.
I think the problem is in the ASP.NET Framework, because in the past I
have created a Search Engine for a classic ASP website and I didn't face
that problem.
The problem is that if I search for an english text (ASCII characters),
the search returns the correct results but if I try any arabic string, I
get the exception "Query returned only ignored words".
The difference between classic ASP page and ASP.NET page is that my .Net
Web pages are UTF8 encoded (I have to do this because my server default
language for non unicode text isn't arabic and obviously I can't ask my
host to set it to arabic). So I think that the problem is in the UTF8
encoding of my pages.

Jone: for the arabic Wordbreaker, I have tried it before and I found it
efficient in my arabic search but it worked only when the default option
in my server "Regional Options" was set to Arabic (otherwise, I noticed
that the neutral language is used not the arabic language).
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.