I am using CogUmbracoExamineMediaIndexer (https://bitbucket.org/thecogworks/cogumbracoexaminemediaindexer) to search media. I would like to exclude noise words, as well as some specific words, from indexing.
My definition is:
<ExamineLuceneIndexSets>
  <IndexSet SetName="MediaIndexSet" IndexPath="~/App_Data/MediaIndexSet">
    <IndexAttributeFields>
      <add Name="id" />
      <add Name="nodeName" />
      <add Name="updateDate" />
      <add Name="writerName" />
      <add Name="path" />
      <add Name="nodeTypeAlias" />
      <add Name="parentID" />
    </IndexAttributeFields>
    <IncludeNodeTypes>
      <add Name="File" />
    </IncludeNodeTypes>
  </IndexSet>
</ExamineLuceneIndexSets>
and
<ExamineIndexProviders>
  <providers>
    <add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer"
         extensions=".pdf,.docx"
         umbracoFileProperty="umbracoFile"
         youTubeUrlProperty=""/>
  </providers>
</ExamineIndexProviders>
<ExamineSearchProviders>
  <add name="MediaSearcher"
       type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
       indexSet="MediaIndexSet" analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net"/>
</ExamineSearchProviders>
Pav,
Update
<add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer" extensions=".pdf,.docx" umbracoFileProperty="umbracoFile" youTubeUrlProperty=""/>
to
<add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer" extensions=".pdf,.docx" umbracoFileProperty="umbracoFile" youTubeUrlProperty="" analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" />
It should work, but you will need to rebuild the index.
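For reference, here is a rough sketch of how the whole ExamineIndexProviders section might look after the change. The `<providers>` wrapper element is an assumption based on the usual ExamineSettings.config layout (the snippet in the question only shows its closing tag); everything else is copied from the question. The analyzer is the same one already set on MediaSearcher, so text should be tokenized the same way at index time and at search time.

```xml
<ExamineIndexProviders>
  <!-- Assumption: the standard <providers> wrapper; only its closing tag appears in the question -->
  <providers>
    <add name="MediaIndexer"
         type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer"
         extensions=".pdf,.docx"
         umbracoFileProperty="umbracoFile"
         youTubeUrlProperty=""
         analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" />
    <!-- The analyzer attribute is the only addition; it matches the analyzer on MediaSearcher -->
  </providers>
</ExamineIndexProviders>
```

After saving the config, the MediaIndexSet index has to be rebuilt before the new analyzer affects what is stored.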
Regards
Ismail
Can I also add my own words to exclude, alongside the noise words?
How to remove noise words from indexing?