Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at


  • Rotem Orbach 121 posts 607 karma points
    Aug 29, 2018 @ 13:08
    Rotem Orbach
    0

    Lucene Search Engine results Hebrew

    Hi, I have implemented Lucene Search Engine results in my website, and it works flawlessly, except one distinct issue: When I try to search a Hebrew term that has double quotation marks in it, I get no results. For example: דפ"ר

    this word should return results, but it doesn't.

    Note: wild card works: דפ*

    Any thoughts?

    thanks, Rotem

  • Yakov Lebski 590 posts 2344 karma points
    Sep 10, 2018 @ 16:33
    Yakov Lebski
    0

    You can try to use escape function - please see Lucene documentation https://lucene.apache.org/core/294/queryparsersyntax.html

  • Ismail Mayat 4511 posts 10091 karma points MVP 2x admin c-trib
    Sep 11, 2018 @ 08:43
    Ismail Mayat
    0

    Rotem,

    Are you using hebrew analyser? See https://our.umbraco.com/packages/developer-tools/hebrew-search/ this may get round the problem. Ideally when doing multilingual you should use the language specific analyser if available.

    Regards

    Ismail

  • Yakov Lebski 590 posts 2344 karma points
    Sep 13, 2018 @ 08:31
    Yakov Lebski
    0

    just note that HebMorph is not free for commercial use

    from HebMorph website:

    If you are developing a commercial application, please contact us to discuss a commercial license. This helps us support development of future versions that make Hebrew search even better. Paid licenses are bundled with a proprietary dictionary, which replaces the AGPL3'd hspell used by the open-source version, as well as integration support.

  • Ismail Mayat 4511 posts 10091 karma points MVP 2x admin c-trib
    Sep 13, 2018 @ 09:02
    Ismail Mayat
    0

    ah did not know that good spot.

  • Rotem Orbach 121 posts 607 karma points
    Sep 13, 2018 @ 08:34
    Rotem Orbach
    0

    Hi,

    Thank you for the answers. I tried https://our.umbraco.com/packages/developer-tools/hebrew-search/

    and it does not solve the issue :(

    also, when I debug the search I can clearly see that the word is already escaped (דפ\"ר) Meaning: I can see the slash being added to the string, but still, no value is being returned.

    any other thoughts? :)

    thanks

  • Ismail Mayat 4511 posts 10091 karma points MVP 2x admin c-trib
    Sep 13, 2018 @ 09:02
    Ismail Mayat
    100

    Rotem,

    I would reach out to itamar https://twitter.com/synhershko he ported lucene to lucene.net and put together HebMorph also he is native hebrew speaker.

    Regards

    Ismail

  • Rotem Orbach 121 posts 607 karma points
    Sep 20, 2018 @ 12:16
    Rotem Orbach
    0

    Thanks, I marked your answer as solution, but unfortunately, Itamar did not replay, so I guess i'll have to face this one myself.

    :)

Please Sign in or register to post replies

Write your reply to:

Draft