umbraco examine pdf

Go to solution

Press Ctrl / CMD + C to copy this to your clipboard.

Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at

David Amri 214 posts 740 karma points

Nov 16, 2018 @ 08:27

0

Umbraco examine PDF

Hi, was just wondering if Umbraco now has examine PDF indexing included? I was looking at the Examine settings documentation and saw an example of the PDF index set. If so, in what version was this included?

Best regards /David

Copy Link
Ismail Mayat 4511 posts 10092 karma points MVP 2x admin c-trib

Nov 16, 2018 @ 09:18

100

David,

It does not have it out of the box you have to use https://www.nuget.org/packages/UmbracoCms.UmbracoExamine.PDF/

Regards

Ismail

Copy Link
David Amri 214 posts 740 karma points

Nov 16, 2018 @ 09:22

0

Thanks Ismael, I've got it working with the package you are referring to, was just curious...

Now that I've got your attention... I have set up a multi index searcher, content and pdf. I'm using a raw lucene qurey but the results all have the same score? Any ideas why this is the case?

Thanks! /David

Copy Link
Ismail Mayat 4511 posts 10092 karma points MVP 2x admin c-trib

Nov 16, 2018 @ 09:26

0

I recall seeing something similar although that was a while ago under the context of wildcards see https://our.umbraco.com/forum/developers/extending-umbraco/13963-Examine-wildcard-search-score

Are you wildcard searching?

Copy Link

David Amri 214 posts 740 karma points

Nov 16, 2018 @ 09:26

Indeed I am. The qurey looks like this:

var luceneSearchTerm = "pageTitleSearch:" + searchTerm + "*";
                        luceneSearchTerm += " aTextSearch:" + searchTerm + "*";
                        luceneSearchTerm += " bTextSearch:" + searchTerm + "*";
                        luceneSearchTerm += " cTextSearch:" + searchTerm + "*";
                        luceneSearchTerm += " dTextSearch:" + searchTerm + "*";
                        luceneSearchTerm += " eTextSearch:" + searchTerm + "*";

                        luceneSearchTerm += " areaOfInterest:" + searchTerm + "*";
                        luceneSearchTerm += " docFilter:" + searchTerm + "*";

                        luceneSearchTerm += " updateDocsPyramidVersion:" + searchTerm + "*";

                        luceneSearchTerm += " FileTextContent:" + searchTerm + "*";
                        luceneSearchTerm += " umbracoFile:" + searchTerm + "*";
                        luceneSearchTerm += " umbracoExtension:" + searchTerm + "*";

Copy Link

Ismail Mayat 4511 posts 10092 karma points MVP 2x admin c-trib

Nov 16, 2018 @ 09:28

0

David,

Try taking off the wildcard just to see if that is the issue? If so then you can try my suggestion in the post if you really need the score.

One thing your query looks straight forward any reason why you are using raw query and the fluent api? Also do you need the search on umbracoExtension?

Copy Link
David Amri 214 posts 740 karma points

Nov 16, 2018 @ 09:40

0

Great, I will try and see what happens.

Regarding the raw qurey, I was reading the Examine examine post. As I understood it I have to use the raw qurey If I would like to make a search on multiple words.

nodeName:paging in XSLT

instead of this phrase:

nodeName:"paging in XSLT"

Perhaps I have misunderstood it? I get kind of confused with the search thing.. :)

Hm, the umbracoExtension lets me search for "pdf". This will get me all the pdf files, perhaps not so useful as I can search the content of the pdfs... I'm still experimenting on the search... :)

EDIT: ah, the "umbracoFile" includes the file extension. Then the "umbracoExtension" is of course not so useful... :D

Thanks /David

Copy Link
is working on a reply...

Please Sign in or register to post replies

Flag this post as spam?