Page 1 of 1

[LOW:KNOWN ISSUE:1.0.11] seachbot content not accurate??

Posted: Mon Oct 30, 2006 5:21 pm
by emanuel37
Hello to all

On our site http://www.mobilitaetsberatung.ch the search function seems to work not 100% properly.
Search Item: "mobilitätszentrale" (should be found in http://www.mobilitaetsberatung.ch/content/view/40/40/) or
search item: "mobilitätsdurchblick" (should be found in http://www.mobilitaetsberatung.ch/content/view/23/38/).
Even if almost all other search keys are found, this effect seems strange to me.
Does anybody have a clou?

Best regards
Emanuel

Re: seachbot content not accurate??

Posted: Mon Oct 30, 2006 6:17 pm
by Robin
Q&T Note; Status > Under review, Impact Low

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Mon Oct 30, 2006 6:18 pm
by Robin
Hi emanuel,

Can you tell me which version of Joomla! you are using?

Thanks and regards, Robin

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Mon Oct 30, 2006 10:19 pm
by emanuel37
Hi Robin,
Thanks for the quick reply.
I use version Joomla! 1.0.11 Stable [ Sunbow ].
Best regards emanuel

09:25: it seems, that the umlauts (äöü, what's the correct word in english?) in the search key are not converted into proper html (ä for ä) for comparison with the content table.

em

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Wed Nov 01, 2006 11:53 am
by diri
@emanuel37: AFAIK it's called "german Umlauts".

This encoding is the problem:
Search engine should search for both - encoded and not encoded values.

Means:
It should search for "Ä" and "Ä" when keyword is "Ärger" and exact search is requested. When similar search is requested it should search for "Ä", "ä", "Ä" and "ä".

You see the problem?

Sad enough there are not only german Umlauts. You will find other special characters as well ...

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Wed Nov 01, 2006 12:55 pm
by emanuel37
So, does that mean, that it is a known issue in the Joomla! search function?

Could the search function (bot or module or whatever) not take the same algorithm as the online editor to convert the umlauts to named entities for the search?

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Wed Nov 01, 2006 1:17 pm
by diri
It's a known issue with every search function no matter which product you mean.

Oops ... who pressed save?

You will bring yourself in trouble with this solution because UTF-8 translates into other codes. This problem is well known in relation to internationalization. You will encounter it at every level of computing (be it operating system or application).

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Fri Nov 03, 2006 4:27 pm
by emanuel37
I'm not deep into internationalization, but how about the php function "htmlspecialchars()" or "htmlentities()"?
For my understanding they should do it.
For example CMSimple, a very small, but simple & smart cms does it.

Re: [LOW:UNDER REVIEW:1.0.11] seachbot content not accurate??

Posted: Wed Nov 08, 2006 2:35 am
by RobS
Well, from a Q&T perspective this definitely falls into the feature request area.  Internationalization is very poorly supported in 1.0.x but it is much better in 1.5.x.  This is not something that can be addressed in 1.0.x but it may be possible to fix this problem in 1.5.x.  So, for now, there is nothing we can do about it.

Q&T Note; Status> Known Issue; Moving to Known Issues forum.