
  • Damon 217 posts 288 karma points
    Feb 11, 2016 @ 09:00
    0

    Examine config - what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" ?

    Hi,

    I am using an Examine index. When you configure a search provider and add analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net", the index seems to include only nodes under the Content node.

    When you leave out analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net", everything is indexed, including media items, templates, etc.

    The question is, what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net", and what other options are there, so you can decide exactly what you want in the index?

    Thanks a lot!

  • Marc Goodson 2157 posts 14411 karma points MVP 9x c-trib
    Feb 14, 2016 @ 02:07
    2

    Hi Damon

    There's a pretty good article on getting started with Examine here: https://umbraco.com/follow-us/blog-archive/2011/9/16/examining-examine/

    Essentially Examine is built on Lucene.Net; the analyzers are the 'rules' that are used to build your index, and in turn to search your index.

    You can see they come from a Lucene.Net namespace, so you can plug in existing analyzers that people have written for Lucene, for example for different languages. You can also write your own.

    Umbraco comes with the StandardAnalyzer, which is set up to analyze English text: common 'stop words' like 'and', 'it', 'the' etc. are excluded from the index, which reduces the size of the index.

    But some English 'stop words' are meaningful words in other languages, so Umbraco also uses the WhitespaceAnalyzer. This does not remove stop words; it simply uses whitespace to decide where to break the text into words before adding them to the index. This results in larger indexes, but with more flexibility.

    If your site is in Danish, for example, you can download an analyzer that has been written to optimize the indexing and searching of Danish text.

    The other thing to make sure of is that your Indexer and Searcher are using the same Analyzer, so they follow the same rules when searching the content that has been indexed.
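
    As a rough sketch of matching analyzers, assuming the stock Umbraco 7 config/ExamineSettings.config layout and its default ExternalIndexer / ExternalSearcher provider names (your provider names and other attributes may well differ), the analyzer attribute carries the same type on both the indexer and the searcher:

    ```xml
    <!-- Sketch only: provider names and attributes assume a default Umbraco 7 install. -->
    <Examine>
      <ExamineIndexProviders>
        <providers>
          <!-- The indexer: this analyzer decides how text is broken into terms when the index is built. -->
          <add name="ExternalIndexer"
               type="UmbracoExamine.UmbracoContentIndexer, UmbracoExamine"
               supportUnpublished="false"
               supportProtected="false"
               analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" />
        </providers>
      </ExamineIndexProviders>
      <ExamineSearchProviders defaultProvider="ExternalSearcher">
        <providers>
          <!-- The searcher: same analyzer type, so query text is broken into terms the same way. -->
          <add name="ExternalSearcher"
               type="UmbracoExamine.UmbracoExamineSearcher, UmbracoExamine"
               analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" />
        </providers>
      </ExamineSearchProviders>
    </Examine>
    ```

    Changing both attributes to "Lucene.Net.Analysis.WhitespaceAnalyzer, Lucene.Net" should give the whitespace-only behaviour instead. An index that was built with a different analyzer will most likely need to be rebuilt after such a change.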
