how to search content when nodes consist of complex content structure

Press Ctrl / CMD + C to copy this to your clipboard.

Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at

Martin Rud 261 posts 1022 karma points c-trib

Mar 25, 2024 @ 09:27
0

How to search content when nodes consist of complex content structure?
In an Umbraco 13 solution I want to implement a search that searches in nodes (pages in the frontend) that consist of content from these "sources":
1. Normal properties like text, text areas and rich text editors on the nodes themselves
2. Block list elements properties which also have difference document types attached with properties like text, text areas and rich text editors
3. Blocks in rich text editor (and these blocks also have properties like text, text areas and rich text editors)
4. Multi node tree pickers that gets content from other content nodes with normal properties and rich text editor blocks
I guess that bullet #1 is straight forward with the default Umbraco index and search functionality.

But how about bullets #2, #3 and #4? Including handling that a rich text editor can have a block and inside that block there is a rich text property that can also contain a block (and so on recursively).

I get the sense that it might be much easier crawling the site from the frontend and indexing that crawled content by url.

So my questions are these:
1. Should I use Umbracos standard indexing that indexes "from the inside" or should I crawl the site "from the outside"?
2. In case of "from the inside"; how?
3. In case of "from the outside"; how?
Copy Link
Søren Kottal 713 posts 4571 karma points MVP 6x c-trib

Mar 25, 2024 @ 09:47

100

You can use full text search to index and search the frontend rendering https://marketplace.umbraco.com/package/our.umbraco.fulltextsearch

Copy Link
Martin Rud 261 posts 1022 karma points c-trib

Mar 25, 2024 @ 10:17

0

Cool, thanks. I will look into that.

I am also checking out Algolia. It's seems like a quite cool search engine, but I need to make a "mirror" of the frontend in a json structure as I read their docs. So I will have to maintain two structures of content. But I think I will be ok with that.

Copy Link
Martin Rud 261 posts 1022 karma points c-trib

Mar 25, 2024 @ 13:31

0

I have marked Søren Kottals answer as a solution since it answers my question of how it's possible to crawl an Umbraco site from the frontend.

I plan, though, to use Algolio since it has a lot of greate features that I would like to take advantage of.

Copy Link
Farooq Alwi 13 posts 103 karma points

Mar 25, 2024 @ 10:06
0
You can write a search resolver which primarily uses the UmbracoIndexes to search the passed searchParam by iterating the parent node of all pages, this resolver uses the defined aliases of all fields which you want to include for search. You can use the below hint for this:
```
var canGetSearcher = _examineManager.TryGetIndex(UmbracoIndexes.ExternalIndexName, out IIndex index);
if (canGetSearcher)
{
    var searcher = index.Searcher;
    var searchResult = searcher.CreateQuery(IndexTypes.Content)
            .Field("title", searchTerm)
            .Execute();
}
```
Copy Link
Martin Rud 261 posts 1022 karma points c-trib

Mar 25, 2024 @ 10:15

0

But that solution will not crawl the site, right (i.e. "see" it from the frontend)?

Copy Link
Farooq Alwi 13 posts 103 karma points

Mar 25, 2024 @ 10:42

0

No, It will not crawl the site on frontend, you will pass the searchTerm from frontend to Search Resolver. You should create SearchResolverApi for that.

Copy Link
is working on a reply...

This forum is in read-only mode while we transition to the new forum.

You can continue this topic on the new forum by tapping the "Continue discussion" link below.

Please Sign in or register to post replies

Flag this post as spam?

How to search content when nodes consist of complex content structure?