I am using CogUmbracoExamineMediaIndexer (https://bitbucket.org/thecogworks/cogumbracoexaminemediaindexer) to search media. I would like to exclude noise words, as well as some specific words of my own, from the index.
My definition is:
<ExamineLuceneIndexSets>
  <IndexSet SetName="MediaIndexSet" IndexPath="~/App_Data/MediaIndexSet">
    <IndexAttributeFields>
      <add Name="id" />
      <add Name="nodeName" />
      <add Name="updateDate" />
      <add Name="writerName" />
      <add Name="path" />
      <add Name="nodeTypeAlias" />
      <add Name="parentID" />
    </IndexAttributeFields>
    <IncludeNodeTypes>
      <add Name="File" />
    </IncludeNodeTypes>
  </IndexSet>
</ExamineLuceneIndexSets>
and
<ExamineIndexProviders>
  <providers>
    <add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer"
         extensions=".pdf,.docx"
         umbracoFileProperty="umbracoFile"
         youTubeUrlProperty=""/>
  </providers>
</ExamineIndexProviders>
<ExamineSearchProviders>
  <add name="MediaSearcher"
       type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
       indexSet="MediaIndexSet"
       analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net"/>
</ExamineSearchProviders>
Pav,
Update
<add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer" extensions=".pdf,.docx" umbracoFileProperty="umbracoFile" youTubeUrlProperty=""/>
to
<add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer" extensions=".pdf,.docx" umbracoFileProperty="umbracoFile" youTubeUrlProperty="" analyzer="Lucene.Net.Analysis.Standard.StandardAnalyzer, Lucene.Net" />
That should work. You will need to rebuild the index afterwards so the existing entries are re-analyzed.
Regards
Ismail
Can I also exclude noise words?
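A hedged note on this follow-up: Lucene.Net's StandardAnalyzer already drops a built-in set of common English stop words (for example "the", "and", "of") when it tokenizes text, so the analyzer attribute suggested in the answer above should cover ordinary noise words once the index is rebuilt. Excluding additional, site-specific words is not covered by anything shown in this thread. One possible approach is a custom analyzer class compiled into the site and referenced from both the indexer and the searcher, so that indexing and querying tokenize text the same way. In the sketch below, the type `MySite.Search.CustomStopWordAnalyzer` and the assembly `MySite` are hypothetical names, and whether the indexer can instantiate such a class from config is an assumption that would need to be verified.

```xml
<!-- Sketch only. MySite.Search.CustomStopWordAnalyzer is a hypothetical class
     (not part of Examine, Lucene.Net, or CogUmbracoExamineMediaIndexer) that
     would carry its own stop-word list. -->
<ExamineIndexProviders>
  <providers>
    <add name="MediaIndexer" type="CogUmbracoExamineMediaIndexer.MediaIndexer, CogUmbracoExamineMediaIndexer"
         extensions=".pdf,.docx"
         umbracoFileProperty="umbracoFile"
         youTubeUrlProperty=""
         analyzer="MySite.Search.CustomStopWordAnalyzer, MySite" />
  </providers>
</ExamineIndexProviders>

<!-- Use the same analyzer on the searcher so query terms are filtered
     the same way as indexed terms. -->
<ExamineSearchProviders>
  <providers>
    <add name="MediaSearcher"
         type="UmbracoExamine.LuceneExamineSearcher, UmbracoExamine"
         indexSet="MediaIndexSet"
         analyzer="MySite.Search.CustomStopWordAnalyzer, MySite" />
  </providers>
</ExamineSearchProviders>
```

As with the original change, the index would need to be rebuilt after switching analyzers, since entries indexed earlier keep the terms produced by the old one.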
How to remove noise words from indexing?