I was just wondering when Examine reindexes content, and how long does a re-index usually take?
We had some content indexed, and then I added the word 'lemon' to the description to check whether it would be indexed.
I was able to find the page previously by searching for 'Tafe' or 'students'. However, I added the word 'lemon' to the start of the content to try and search for that. I could not find any results for 'lemon', and the page no longer appears in any other search results. It appears to have been removed from the search index.
The examine index looks like:
<?xml version="1.0"?>
<!--
Umbraco examine is an extensible indexer and search engine.
This configuration file can be extended to create your own index sets.
Index/Search providers can be defined in the UmbracoSettings.config
More information and documentation can be found on CodePlex: http://umbracoexamine.codeplex.com
-->
<ExamineLuceneIndexSets>
<!-- The internal index set used by Umbraco back-office - DO NOT REMOVE -->
<IndexSet SetName="Default" IndexPath="~/App_Data/ExamineIndexes/Internal/">
<IndexAttributeFields>
<add Name="id" />
<add Name="nodeName" />
<add Name="updateDate" />
<add Name="writerName" />
<add Name="path" />
<add Name="nodeTypeAlias" />
<add Name="parentID" />
</IndexAttributeFields>
<IndexUserFields />
<IncludeNodeTypes/>
<ExcludeNodeTypes />
</IndexSet>
<IndexSet SetName="All" IndexPath="~/App_Data/ExamineIndexes/All/">
<IndexAttributeFields>
<add Name="id" />
<add Name="nodeName" />
<add Name="updateDate" />
<add Name="writerName" />
<add Name="path" />
<add Name="nodeTypeAlias" />
<add Name="parentID" />
</IndexAttributeFields>
<IndexUserFields>
<add Name="title" />
<add Name="description" />
<add Name="content" />
<add Name="overviewDescription" />
<add Name="leftColumnContent" />
<add Name="rightColumnContent" />
<add Name="scheduleTitle" />
<add Name="scheduleBlurb" />
<add Name="longDescription" />
<add Name="questionText"/>
<add Name="answerText"/>
</IndexUserFields>
<IncludeNodeTypes/>
<ExcludeNodeTypes>
<add Name="externalLink" />
<add Name="Blank" />
<add Name="SitemapXml" />
</ExcludeNodeTypes>
</IndexSet>
</ExamineLuceneIndexSets>
The settings look like so:
<?xml version="1.0"?>
<!--
Umbraco examine is an extensible indexer and search engine.
This configuration file can be extended to add your own search/index providers.
Index sets can be defined in the ExamineIndex.config if you're using the standard provider model.
More information and documentation can be found on CodePlex: http://umbracoexamine.codeplex.com
-->
<Examine>
<ExamineIndexProviders enableDefaultEventHandler="true">
<providers>
<!-- Interval is 30 seconds by default and sets the number of seconds between intervals that the indexing queue will be processed -->
<!-- When debug is true, indexing will happen synchronously and throw visible exceptions if there are errors, otherwise
if runAsync = true, indexing occurs async -->
<add name="InternalIndex" type="UmbracoExamine.LuceneExamineIndexer, UmbracoExamine"
indexSet="Default"
enabled="true"
runAsync="true"
supportUnpublished="true"
interval="10"
analyzer="Lucene.Net.Analysis.WhitespaceAnalyzer, Lucene.Net"/>
<!-- Support protected is false by default but if set to true, the system will index nodes that have public access policies applied to them -->
<add name="OurSiteIndex" type="UmbracoExamine.LuceneExamineIndexer, UmbracoExamine" indexSet="All"
debug="false"
enabled="true"
supportProtected="true"
supportUnpublished="false"
analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net"/>
</providers>
</ExamineIndexProviders>
<ExamineSearchProviders defaultProvider="OurSiteIndex">
<providers>
<add name="InternalSearch" type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
indexSet="Default"
analyzer="Lucene.Net.Analysis.WhitespaceAnalyzer, Lucene.Net"/>
<add name="OurSiteIndex" type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine" indexSet="All"
analyzer="Lucene.Net.Analysis.WhitespaceAnalyzer, Lucene.Net"/>
</providers>
</ExamineSearchProviders>
</Examine>
I was also wondering what the difference is between the standard and whitespace analyzers. I know our implementation is a little old, but we're running custom Umbraco code that needs updating to the new release.
Is the word 'lemon' definitely in a field that you have told Examine to index? Secondly, can you try searching using Luke (http://www.getopt.org/luke/)? Also, if you dig around the forum there is a way to write out the actual search query that is being generated; you can try that query in Luke as well, which will narrow down whether the problem is with the generated query. One more thing: change the OurSiteIndex analyzer to standard, then rebuild the index. You can use http://our.umbraco.org/projects/backoffice-extensions/examine-index-admin to rebuild the index; the other way is to republish all your content.
Hi Ismail, thanks so very much for your answers. Yes, I had checked in Luke and it was in the index. I then changed the analysers to match up and hey presto, it worked perfectly! Thanks for your help! Is the Umbraco add-on for Examine 4.5/4.7 compatible?
Cheers,
Tom
Umbraco Examine Questions/Weirdness
Hi, we've got Examine up and running on our system.
Tom,
With regard to analyzers, see http://wiki.apache.org/lucene-java/LuceneFAQ#Why_is_it_important_to_use_the_same_analyzer_type_during_indexing_and_search.3F
Regards
Ismail
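To illustrate the point about matching analyzers: in the settings posted above, the OurSiteIndex index provider uses StandardAnalyzer while the OurSiteIndex search provider uses WhitespaceAnalyzer. As far as I know, StandardAnalyzer lowercases terms and drops punctuation and common stop words when indexing, whereas WhitespaceAnalyzer only splits on whitespace and leaves case alone, so a search term can fail to match what is actually stored. Below is a minimal sketch of the search providers with the analyzers matched. It reuses only names from the posted config; that this is exactly the change Tom made is an assumption.

```xml
<!-- Sketch only: search providers from the settings above, with the
     OurSiteIndex searcher switched to the same StandardAnalyzer that
     the OurSiteIndex index provider already uses (assumed fix). -->
<ExamineSearchProviders defaultProvider="OurSiteIndex">
  <providers>
    <!-- Unchanged: indexer and searcher for the "Default" set both use WhitespaceAnalyzer -->
    <add name="InternalSearch" type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
         indexSet="Default"
         analyzer="Lucene.Net.Analysis.WhitespaceAnalyzer, Lucene.Net"/>
    <!-- Changed: was WhitespaceAnalyzer, now matches the indexer for the "All" set -->
    <add name="OurSiteIndex" type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
         indexSet="All"
         analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net"/>
  </providers>
</ExamineSearchProviders>
```

Changing only the searcher's analyzer should not need an index rebuild, since the stored terms stay the same. If you change the analyzer on the index provider instead, the index does need rebuilding, as mentioned earlier in the thread.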