Getting some weird stuff happening - i have about 20 or so pdfs, as far as i can tell it has searchable text (i can search within reader) . Configs set below, no errors, can run the index in the dashboard - does something for a short while then seems to come back with zero or 4 results randomly. luke screengrab below too.
well, i've now tried the CogUmbracoExamineMediaIndexer - i'm getting results but when i look at the index in luke, the fileTextContent is empty. I've checked the pdfs and can select the text (i.e. they're not just graphic pdfs).
am i to assume that my pdfs are not searchable for some reason?
ok, had a look into further - cant get the original pdf indexer to work, the CogUmbracoExamineMediaIndexer works but doesn't have any value in fileTextContent, so took a stab and downloaded lastest version of itextsharp core dll, popped that in the bin folder, re-ran the indexes and bingo! i now have indexed and searchable pdfs...
Examine & PDFs, not indexing correctly
Getting some weird stuff happening - i have about 20 or so pdfs, as far as i can tell it has searchable text (i can search within reader) . Configs set below, no errors, can run the index in the dashboard - does something for a short while then seems to come back with zero or 4 results randomly. luke screengrab below too.
Any pointers to where the problem is
umbraco 7.1.8 (using hybrid framework)
other indexes on content working fine.
Thanks
well, i've now tried the CogUmbracoExamineMediaIndexer - i'm getting results but when i look at the index in luke, the fileTextContent is empty. I've checked the pdfs and can select the text (i.e. they're not just graphic pdfs).
am i to assume that my pdfs are not searchable for some reason?
ok, had a look into further - cant get the original pdf indexer to work, the CogUmbracoExamineMediaIndexer works but doesn't have any value in fileTextContent, so took a stab and downloaded lastest version of itextsharp core dll, popped that in the bin folder, re-ran the indexes and bingo! i now have indexed and searchable pdfs...
is working on a reply...