I searched the forums and there are some posts about indexing and searching PDF documents. I could not find any tutorial/how-to, or if it is even possible on 4.5.2.
Current site uses xsltSearch for the content.
Is there any step-by-step guide on how to implement this on 4.5.2 somewhere?
AFAIK that won't be possible with xslt search and you'll have to change to use Examine and look into what version of Umbraco will best support this (I believe PDF indexing is built into v7.2, but the upgrade might not be worth it just to add PDF searching)
I think xslt search will search only on Umbraco content in the xml cache, to index PDF's you would usually usually use a third party tool (free ones are available like https://pdfapi.codeplex.com/) and that would read the PDF file and add it to a Lucene index which you would query with Examine. I'm not sure which packages already exist to do this but hopefully this will point you in the right direction.
Index and Search PDF documents Umbraco 4.5.2?
I searched the forums and there are some posts about indexing and searching PDF documents. I could not find any tutorial/how-to, or if it is even possible on 4.5.2.
Current site uses xsltSearch for the content.
Is there any step-by-step guide on how to implement this on 4.5.2 somewhere?
Hi wschwarte,
Where are you storing your pdfs ? Do you want to search inside pdf files, or only basic info ?
THanks
HI,
they are stored in the media section. I would like to search inside the PDF.
thanks!
AFAIK that won't be possible with xslt search and you'll have to change to use Examine and look into what version of Umbraco will best support this (I believe PDF indexing is built into v7.2, but the upgrade might not be worth it just to add PDF searching)
I think xslt search will search only on Umbraco content in the xml cache, to index PDF's you would usually usually use a third party tool (free ones are available like https://pdfapi.codeplex.com/) and that would read the PDF file and add it to a Lucene index which you would query with Examine. I'm not sure which packages already exist to do this but hopefully this will point you in the right direction.
regards
Tom
is working on a reply...