Umbraco examine PDF
Hi, I was just wondering if Umbraco now has Examine PDF indexing included? I was looking at the Examine settings documentation and saw an example of a PDF index set. If so, in what version was this included?
Best regards /David
David,
It does not have it out of the box; you have to use https://www.nuget.org/packages/UmbracoCms.UmbracoExamine.PDF/
Regards
Ismail
Thanks Ismail, I've got it working with the package you are referring to, I was just curious...
Now that I've got your attention... I have set up a multi-index searcher covering content and PDF. I'm using a raw Lucene query, but the results all have the same score. Any ideas why this is the case?
Thanks! /David
I recall seeing something similar, although that was a while ago, in the context of wildcards. See https://our.umbraco.com/forum/developers/extending-umbraco/13963-Examine-wildcard-search-score
Are you wildcard searching?
Indeed I am. The query looks like this:
David,
Try taking off the wildcard, just to see if that is the issue. If it is, you can try my suggestion in that post if you really need the score.
One thing: your query looks straightforward, so is there any reason why you are using a raw query rather than the fluent API? Also, do you need to search on umbracoExtension?
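A note on why removing the wildcard may change the scores. Lucene is able to rewrite wildcard queries into constant-score queries, where every matching document receives the same score regardless of how well it matches. Whether that rewrite is active for the Examine/Lucene.Net version used here is an assumption, not something confirmed in this thread. The sketch below is a toy Python model of the two behaviours, not Lucene or Examine code; the mini index `DOCS` and both function names are hypothetical.

```python
from fnmatch import fnmatchcase

# Hypothetical mini index: document name -> analyzed terms (not real Umbraco data).
DOCS = {
    "doc-a": ["paging", "in", "xslt", "paging"],
    "doc-b": ["paging", "basics"],
    "doc-c": ["pagination", "tips"],
}


def term_scores(term):
    """Score by raw term frequency, a simple stand-in for relevance scoring."""
    return {
        name: float(terms.count(term))
        for name, terms in DOCS.items()
        if term in terms
    }


def wildcard_scores(pattern):
    """Constant-score model: any document with a matching term scores 1.0."""
    return {
        name: 1.0
        for name, terms in DOCS.items()
        if any(fnmatchcase(t, pattern) for t in terms)
    }


if __name__ == "__main__":
    print(term_scores("paging"))    # scores differ per document
    print(wildcard_scores("pag*"))  # every match has the same score
```

In this model the exact-term search ranks doc-a above doc-b because the term occurs twice, while the wildcard search cannot rank its matches at all. If the real index behaves the same way, identical scores are expected for wildcard queries and are not a sign of a broken index.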
Great, I will try and see what happens.
Regarding the raw query, I was reading the Examine post. As I understood it, I have to use a raw query if I would like to search on multiple words:
nodeName:paging in XSLT
instead of this phrase:
nodeName:"paging in XSLT"
Perhaps I have misunderstood it? I get kind of confused with the search thing.. :)
Hm, the umbracoExtension lets me search for "pdf". This will get me all the pdf files, perhaps not so useful as I can search the content of the pdfs... I'm still experimenting on the search... :)
EDIT: ah, the "umbracoFile" includes the file extension. Then the "umbracoExtension" is of course not so useful... :D
Thanks /David
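A note on the two query forms discussed above, `nodeName:paging in XSLT` and `nodeName:"paging in XSLT"`. In the classic Lucene query syntax, a `field:` prefix applies only to the single term or quoted phrase that directly follows it, so the unquoted form searches `nodeName` for "paging" only and sends the remaining words to the default field. The Python sketch below is a simplified model of that binding rule, not the real Lucene parser; the function `parse` and the default field name `contents` are hypothetical, and the actual default field depends on how the searcher is configured.

```python
import re

# Hypothetical default field; the real one depends on the searcher setup.
DEFAULT_FIELD = "contents"

# An optional `field:` prefix followed by a quoted phrase or a bare term.
_CLAUSE = re.compile(r'(?:(\w+):)?(?:"([^"]*)"|(\S+))')


def parse(query, default_field=DEFAULT_FIELD):
    """Split a query into (field, text, is_phrase) clauses.

    Simplified model: a field prefix binds only to the term or phrase
    immediately after it; every other term falls back to the default field.
    """
    clauses = []
    for match in _CLAUSE.finditer(query):
        field = match.group(1) or default_field
        if match.group(2) is not None:
            clauses.append((field, match.group(2), True))
        else:
            clauses.append((field, match.group(3), False))
    return clauses


if __name__ == "__main__":
    for q in ('nodeName:paging in XSLT', 'nodeName:"paging in XSLT"'):
        print(q, "->", parse(q))
```

Under this model the unquoted query yields one clause on `nodeName` and two on the default field, while the quoted query yields a single phrase clause on `nodeName`. To match several words on `nodeName` without requiring the exact phrase, Lucene's query syntax also supports field grouping, for example `nodeName:(paging XSLT)`.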