Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at


  • Mike Chambers 636 posts 1253 karma points c-trib
    Oct 14, 2019 @ 16:14
    Mike Chambers
    0

    Examine Lucene Boost() but ignore IDF (inverse document frequency)

    I'd like to boost by a doctype parameter, so that results are artificially ordered (boosted) by that type. However, as lucene also scores by the document frequency, the type that I want to boost the the least ends up at the top of my results as there are less instances of that type.

    { Category: content, LuceneQuery: (hideFromSearch:0 +(__NodeTypeAlias:dtcontenttile) +(tileContentOrigination:external^31.0 tileContentOrigination:partner^32.0 tileContentOrigination:originator^33.0)) }
    

    so because external has only 1 doc compared to 50 originator, external ends up the top result.

    Or maybe this just isn't something that lucene can do with it's tf/idf approach?

  • Simon Dingley 1474 posts 3451 karma points c-trib
    Jan 20, 2020 @ 08:03
    Simon Dingley
    0

    Did you ever find a solution for this?

  • Mike Chambers 636 posts 1253 karma points c-trib
    Jan 20, 2020 @ 13:35
    Mike Chambers
    1

    Unfortunately not... There is a bit of a discussion started here... https://github.com/Shazwazza/Examine/issues/131

  • This forum is in read-only mode while we transition to the new forum.

    You can continue this topic on the new forum by tapping the "Continue discussion" link below.

Please Sign in or register to post replies