Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at


  • Tom 115 posts 204 karma points
    1 week ago
    Tom
    0

    UmbrcoExamie for Office Documents

    I am on Umbraco 7.5.9. I did not find a NuGet package for MS Office documents (Word & Excel). The only one I found was for PDF.

    Does anyone have a C# example of how to index\search office documents?

    Thanks

    Tom

  • Ismail Mayat 4171 posts 8732 karma points MVP admin c-trib
    1 week ago
    Ismail Mayat
    1

    Tom,

    You can use https://our.umbraco.com/packages/backoffice-extensions/examinefileindexer/ it uses apache tika under the hood and will index office documents.

    With regards to search are you looking to do combined search with content and media or just media? When you install the package it will create indexer and searcher for you and you can use that searcher but it will only be on the media.

    If you want combined search you will have to look up how to do multi index search.

    Regards

    Ismail

  • Tom 115 posts 204 karma points
    1 week ago
    Tom
    0

    OK that worked. YOU rock.

    One more question.

    Do you know if there is a way to extract PDF meta data into search engine? I would like to add PDF Title, Author, description, create date to indexer.

    Thanks

    Tom

  • Ismail Mayat 4171 posts 8732 karma points MVP admin c-trib
    1 week ago
    Ismail Mayat
    1

    Tom,

    It should by default extract it, check the index with Luke you should see all associated meta data.

    Regards

    Ismail

Please Sign in or register to post replies

Write your reply to:

Draft