Index files in Virtual Directory in Lucene

Simon 3 posts 73 karma points

Oct 21, 2015 @ 14:11

0

Index files in Virtual Directory in Lucene

I have a problem indexing a bunch of HTML files that live in a virtual directory on an Umbraco site and need to be indexed in Lucene.

The HTML pages are only visible through a single IFrame page. The IFrame doc type has a Url field that references the index.html page in the virtual directory.

Is it possible to index all these individual pages?

Simon 3 posts 73 karma points

Oct 27, 2015 @ 08:32

0

Does anyone have a clue?

Ismail Mayat 4511 posts 10092 karma points MVP 2x admin c-trib

Oct 27, 2015 @ 08:49

1

Simon,

Out of the box, Examine only indexes your Umbraco content. It uses document event handlers, so on publish it takes the content and pushes it into the index. If you have other content you want to push into Examine/Lucene, you need to write your own indexer. This is pretty straightforward; take a look at https://github.com/Shazwazza/Examine/wiki/Indexer and https://github.com/Shazwazza/Examine/blob/master/Projects/Examine.Web.Demo/OrmReaderDataService.cs. The second link is code for an indexer that indexes a database; you could modify it to read your HTML files and put them into a new index. You will then need to do a multi-index search if you want to search both the Umbraco content and the HTML content.
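To make the "read your HTML files" step concrete, here is a rough, language-neutral sketch of the data-gathering part only. It is written in Python with the standard library for brevity; on an Umbraco site the same logic would live in a C# data service, and nothing here uses the Examine API. The names `TextExtractor` and `collect_documents` and the `id`/`title`/`body` record shape are made up for illustration.

```python
import os
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and the <title>, skipping script/style bodies."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self._chunks = []
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._skip_depth:
            return
        if self._in_title:
            self.title += data.strip()
        elif data.strip():
            self._chunks.append(data.strip())

    @property
    def text(self):
        return " ".join(self._chunks)


def collect_documents(root_dir):
    """Yield one record per .html/.htm file found under root_dir."""
    for folder, _dirs, files in os.walk(root_dir):
        for name in sorted(files):
            if not name.lower().endswith((".html", ".htm")):
                continue
            path = os.path.join(folder, name)
            # Assumes UTF-8 files; undecodable bytes are replaced, not fatal.
            with open(path, encoding="utf-8", errors="replace") as handle:
                parser = TextExtractor()
                parser.feed(handle.read())
                parser.close()
            yield {
                # Path relative to the root makes a stable, unique id.
                "id": os.path.relpath(path, root_dir).replace(os.sep, "/"),
                "title": parser.title,
                "body": parser.text,
            }
```

Each yielded record corresponds to one document a custom indexer would add to the new index, with `root_dir` being the physical folder behind the virtual directory.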

The alternative is to handle the GatheringNodeData event: for the doc types that have the IFrame, get the IFrame URL, fetch the HTML content, and put it in a field. That content will then be searchable, and it will be in the context of the Umbraco page. See http://thecogworks.co.uk/blog/posts/2012/november/examiness-hints-and-tips-from-the-trenches-part-2/ for an example of GatheringNodeData.
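The shape of that second approach, sketched in Python rather than C# and without any Examine types: take the fields gathered for a node, look up the IFrame URL, and add the page text as one extra field. The names `enrich_fields`, `load_html`, `url` and `iframeContent` are assumptions for illustration, not Examine or Umbraco names; `load_html` stands in for whatever reads the page (a file read or an HTTP GET).

```python
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Gathers text nodes, ignoring the contents of script and style tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Reduce an HTML string to its visible text, separated by spaces."""
    parser = _VisibleText()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def enrich_fields(fields, load_html, url_field="url", target_field="iframeContent"):
    """Return a copy of `fields` with the IFrame page's text added.

    fields    -- dict of field name -> value gathered for one node
    load_html -- hypothetical callable: takes the URL, returns the HTML string
    """
    enriched = dict(fields)
    url = fields.get(url_field)
    if not url:
        return enriched  # node has no IFrame URL, nothing to add
    try:
        html = load_html(url)
    except OSError:
        # A missing or unreachable page should not block indexing the node.
        return enriched
    enriched[target_field] = html_to_text(html)
    return enriched
```

Note that this only picks up the one page the Url field points at. If every page in the virtual directory should be searchable, the handler would also have to follow links or walk the folder.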

Regards

Ismail

Simon 3 posts 73 karma points

Oct 27, 2015 @ 09:52

0

Thank you for your input! I will dig into the information you posted.
