  • Mike Chambers 636 posts 1253 karma points c-trib
    Oct 14, 2019 @ 16:14
    Mike Chambers
    0

    Examine Lucene Boost() but ignore IDF (inverse document frequency)

    I'd like to boost by a doctype parameter, so that results are artificially ordered (boosted) by that type. However, as Lucene also scores by document frequency, the type that I want to boost the least ends up at the top of my results, because there are fewer instances of that type.

    { Category: content, LuceneQuery: (hideFromSearch:0 +(__NodeTypeAlias:dtcontenttile) +(tileContentOrigination:external^31.0 tileContentOrigination:partner^32.0 tileContentOrigination:originator^33.0)) }
    

    So, because external has only 1 doc compared to 50 for originator, external ends up as the top result.

    Or maybe this just isn't something that Lucene can do with its tf/idf approach?
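
    To make the effect concrete, here is a rough sketch in Python of how the classic Lucene TF-IDF formula weights each boosted clause. It is a simplification, not Examine's or Lucene's actual code: it assumes idf = 1 + ln(numDocs / (docFreq + 1)), that idf enters the score squared, and that term frequency, field norm and the other query clauses are equal across documents. The counts for external (1) and originator (50) come from the post; the partner count (10) and the index total are made up for illustration.

```python
import math

# Document counts per tileContentOrigination value.
# external and originator are from the post; partner is a hypothetical figure.
DOC_FREQ = {"external": 1, "partner": 10, "originator": 50}

# Boosts exactly as written in the query.
BOOST = {"external": 31.0, "partner": 32.0, "originator": 33.0}

# Assumption: the index holds only these documents.
NUM_DOCS = sum(DOC_FREQ.values())


def classic_idf(doc_freq, num_docs):
    """Inverse document frequency as used by Lucene's classic TF-IDF similarity."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def clause_score(boost, doc_freq, num_docs, use_idf=True):
    """Relative score of one boosted term clause.

    idf is squared because it contributes to both the query weight and the
    field weight; tf and field norm are assumed equal and left out.
    """
    idf = classic_idf(doc_freq, num_docs) if use_idf else 1.0
    return boost * idf * idf


def ranking(use_idf):
    """Origination values ordered from highest to lowest score."""
    scores = {
        name: clause_score(BOOST[name], DOC_FREQ[name], NUM_DOCS, use_idf)
        for name in DOC_FREQ
    }
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    print("with idf:", ranking(use_idf=True))
    print("flat idf:", ranking(use_idf=False))
```

    Under these assumptions the rare value wins comfortably when idf is applied, because boosts of 31 to 33 differ by only a few percent while the squared idf differs by more than a factor of ten. With idf held at a constant 1, the order follows the boosts alone. Whether Examine offers a way to flatten idf like this is not something this thread establishes.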

  • Simon Dingley 1474 posts 3431 karma points c-trib
    Jan 20, 2020 @ 08:03
    Simon Dingley
    0

    Did you ever find a solution for this?

  • Mike Chambers 636 posts 1253 karma points c-trib
    Jan 20, 2020 @ 13:35
    Mike Chambers
    1

    Unfortunately not. There is a bit of a discussion started here: https://github.com/Shazwazza/Examine/issues/131
