  • Louis Ferreira 2 posts 22 karma points
    14 days ago

    Help with Indexes (Internal and External)

    Hello,

    Our site has become very slow to start because of indexing, in particular because of the size of the indexes. Inspecting the indexes, I noticed that some fields hold duplicate data, and these fields are the largest in the record set. See the example screenshot below (I have edited the HTML to reduce the amount of markup shown in the screenshot).

    enter image description here

    This happens in both the internal and external indexes, resulting in an Examine folder that is well over a gigabyte in size.

    Now I can remove some fields in the external index by overriding the ContentValueSetValidator as follows:

    // Remove any '__Raw*' and '*.Row' values to reduce duplicate data being indexed.
    // ToList() snapshots the matches so entries can be removed while looping.
    foreach (var publishField in valueSet.Values.Where(x => x.Key.StartsWith("__Raw") || x.Key.EndsWith(".Row")).ToList())
    {
        valueSet.Values.Remove(publishField.Key);
        isFiltered = true;
    }
    

    This works well, but I cannot find a way to do the same for the internal index. I have scoured the Umbraco.Examine source code and cannot see how to do it.
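
    For anyone who wants to try the key filter from the snippet above outside of Umbraco, here is a minimal, self-contained sketch. It assumes the field values can be treated as a plain `IDictionary<string, List<object>>`; the class name `IndexFieldFilter` and the method name `RemoveDuplicateFields` are hypothetical and not part of Umbraco or Examine.

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.Linq;

    // Hypothetical helper: a stand-in for the filtering done inside the validator.
    public static class IndexFieldFilter
    {
        // Removes every entry whose key starts with "__Raw" or ends with ".Row".
        // Returns true when at least one entry was removed.
        public static bool RemoveDuplicateFields(IDictionary<string, List<object>> values)
        {
            // Collect the matching keys first; removing entries while
            // enumerating the dictionary itself would throw.
            var keys = values.Keys
                .Where(k => k.StartsWith("__Raw", StringComparison.Ordinal)
                         || k.EndsWith(".Row", StringComparison.Ordinal))
                .ToList();

            foreach (var key in keys)
            {
                values.Remove(key);
            }

            return keys.Count > 0;
        }
    }
    ```

    Calling it on a dictionary with made-up keys such as `nodeName`, `__Raw_bodyText` and `grid.Row` should leave only `nodeName` behind and return `true`.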

    Is there a way to apply the same filtering to the internal index, or is there a better way to reduce the size of the indexes?

    Thanks
