Examine config - what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" ?
Hi,
I am using Examine index. When you configure a search provider and add analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" , then the index seems to index only nodes under the Content node.
When you leave analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net", then everything, including media items, and templates, etc.
The question is, what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net",
and what other option are there, so you can decide what exactly you want in the index?
Essentially Examine is built on Lucene.Net; the analyzers are the 'rules' that are used to build your index, and in turn to search your index.
You can see they are from a Lucene.Net namespace, so you can plugin existing analyzer's that people have written for Lucene, for example for different languages. You can also write your own.
Umbraco comes with the StandardAnalyzer which is setup to analyze text using the English language, - common 'stop works' like 'and', 'it', 'I' etc are excluded from the index; this reduces the size of the index.
But some English 'stop words' are actually proper words in other languages; so Umbraco also uses the WhitespaceAnalyzer this does not ignore stop words - and uses whitespace to determine the point at which to break words and add them to the index. So this results in larger indexes but with more flexibility.
But if your site is in Danish, you can download an analyzer that has been written to optimize the indexing and searching of Danish language text.
Other point to make sure is that your Indexer and Searchers are using the same Analyzer so they are following the same rules when trying to search the content that has been indexed...
Examine config - what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" ?
Hi,
I am using Examine index. When you configure a search provider and add analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" , then the index seems to index only nodes under the Content node.
When you leave analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net", then everything, including media items, and templates, etc.
The question is, what is analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net",
and what other option are there, so you can decide what exactly you want in the index?
Thanks a lot!
Hi Damon
Theres a pretty good article on getting started with examine, here; https://umbraco.com/follow-us/blog-archive/2011/9/16/examining-examine/
Essentially Examine is built on Lucene.Net; the analyzers are the 'rules' that are used to build your index, and in turn to search your index.
You can see they are from a Lucene.Net namespace, so you can plugin existing analyzer's that people have written for Lucene, for example for different languages. You can also write your own.
Umbraco comes with the StandardAnalyzer which is setup to analyze text using the English language, - common 'stop works' like 'and', 'it', 'I' etc are excluded from the index; this reduces the size of the index.
But some English 'stop words' are actually proper words in other languages; so Umbraco also uses the WhitespaceAnalyzer this does not ignore stop words - and uses whitespace to determine the point at which to break words and add them to the index. So this results in larger indexes but with more flexibility.
But if your site is in Danish, you can download an analyzer that has been written to optimize the indexing and searching of Danish language text.
Other point to make sure is that your Indexer and Searchers are using the same Analyzer so they are following the same rules when trying to search the content that has been indexed...
is working on a reply...