  • blackhawk 313 posts 1368 karma points
    Oct 18, 2017 @ 14:32
    blackhawk
    0

    Hiding all media assets from Google

    I'm going to have thousands of PDFs and Word documents accessible from content nodes, all stored within the Media section. I have Examine working perfectly for searching these documents on my development site.

    But would there be a way (perhaps through this package) to prevent these same Media assets from being indexed by major search engines like Google and Bing? I want people to come to my site and find the documents through the Examine search, rather than depend on external search engines.

    Thanks for any suggestions or tips on moving forward with this.

  • kevainc 5 posts 97 karma points
    Oct 18, 2017 @ 14:48
    kevainc
    3

    You can use a robots.txt file to disallow crawling of your media folder.

    User-agent: *
    Disallow: /media/
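
One way to sanity-check a rule like this before deploying it is to run it through Python's standard-library robots.txt parser. This is only a sketch: the helper name `is_allowed` and the sample paths are made up for illustration, and the stdlib parser does simple prefix matching, so it shows how a well-behaved crawler should read the rule, not how Google or Bing will actually behave.

```python
from urllib.robotparser import RobotFileParser

# The rules from the post above, as they would appear in /robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /media/
"""


def is_allowed(url_path: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the rules permit user_agent to fetch url_path.

    Hypothetical helper for illustration; it parses the in-memory
    rules above rather than fetching a live robots.txt.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url_path)


if __name__ == "__main__":
    # Sample paths are assumptions, not taken from a real site.
    for path in ("/media/1234/report.pdf", "/about-us/"):
        print(path, "allowed" if is_allowed(path) else "blocked")
```

Anything under `/media/` should come back as blocked, while ordinary content pages stay allowed.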
    
  • Steve Morgan 1346 posts 4453 karma points c-trib
    Oct 18, 2017 @ 14:49
    Steve Morgan
    2

    Hi,

    A quick win is to just add the media directory to your robots.txt - this asks crawl bots nicely to exclude it. You could also try adding rel="nofollow" to all links pointing at those files. Note that the robots.txt rule would also exclude your images, since they live in the same folder.

     User-agent: *
     Disallow: /media/*
    
  • blackhawk 313 posts 1368 karma points
    Oct 18, 2017 @ 16:40
    blackhawk
    0

    Thanks guys! I'll follow up once I get this all set up.
