
  • David 4 posts 95 karma points
    May 18, 2017 @ 13:49
    David
    0

    Umbraco Id Page Crawling/Indexing

    Hi there, I'm not sure this is the right place to ask, but I'm hoping someone here can offer some guidance on a crawling issue I'm having with an Umbraco site.

    We have a number of indexed URLs that use the Umbraco page ID in the URL, i.e. 'https://www.domain.co.uk/1234/', but the actual URL is 'https://www.domain.co.uk/the-is-the-page-url/'.

    To make things worse, the Umbraco ID version is the one that's actually indexed.

    Any thoughts on this would be greatly appreciated. If I have missed anything that would help in understanding the issue, please let me know.

    Many thanks, David

  • Alex Skrypnyk 6132 posts 23951 karma points MVP 7x admin c-trib
    May 19, 2017 @ 21:23
    Alex Skrypnyk
    1

    Hi David

    As far as I know, URLs can be indexed only if there is a link to them on the site.

    So you probably have links somewhere on the site whose href attribute contains the Umbraco page ID instead of the page URL. Can you check which pages have this problem? I think we can fix it.

    And it's not a problem with Umbraco; it looks like a developer or editor made a mistake.
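One way to check a page for such links is to scan its rendered HTML for anchors whose path is just a number. This is a minimal sketch, not an Umbraco feature: the names `IdLinkFinder`, `find_id_links` and `ID_PATH` are made up for illustration, and it assumes an ID-style URL is one whose whole path is a single numeric segment such as `/1234/`.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urlparse

# Assumption: an ID-style URL has a path made of one numeric segment, e.g. /1234/
ID_PATH = re.compile(r"^/\d+/?$")


class IdLinkFinder(HTMLParser):
    """Collects href values of <a> tags that look like page-ID URLs."""

    def __init__(self):
        super().__init__()
        self.id_links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # urlparse handles both relative and absolute hrefs
            if name == "href" and value and ID_PATH.match(urlparse(value).path):
                self.id_links.append(value)


def find_id_links(html):
    """Return every ID-style href found in the given HTML string."""
    finder = IdLinkFinder()
    finder.feed(html)
    return finder.id_links


sample = (
    '<a href="/1234/">A</a> '
    '<a href="/the-is-the-page-url/">B</a> '
    '<a href="https://www.domain.co.uk/5678">C</a>'
)
print(find_id_links(sample))  # ['/1234/', 'https://www.domain.co.uk/5678']
```

Running this over the HTML of each suspect page should point to the template or content field that renders the ID instead of the page URL.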

    Thanks

    Alex

  • Alex Skrypnyk 6132 posts 23951 karma points MVP 7x admin c-trib
    May 22, 2017 @ 21:38
    Alex Skrypnyk
    0

    Hi David

    Did you solve the issue? Please share the outcome with us.

    Alex

  • David 4 posts 95 karma points
    May 23, 2017 @ 07:31
    David
    0

    Hi Alex,

    I have forwarded your thoughts to my developer and will update in due course.

    Many thanks, David

  • Ismail Mayat 4511 posts 10090 karma points MVP 2x admin c-trib
    May 23, 2017 @ 08:09
    Ismail Mayat
    0

    David,

    What is doing the crawling?

    Regards

    Ismail

  • David 4 posts 95 karma points
    May 23, 2017 @ 08:44
    David
    0

    Hi Ismail,

    Google is crawling some of these urls and has indexed them.

    Thanks, David

  • Ismail Mayat 4511 posts 10090 karma points MVP 2x admin c-trib
    May 23, 2017 @ 08:57
    Ismail Mayat
    0

    David,

    Do you have an XML sitemap and a Google Webmaster account? If not, create an XML sitemap, submit it to Google, and use it to drive indexing of the site. This way you can control what Google indexes.

    Regards

    Ismail
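For reference, a sitemap in the sitemaps.org format is a small XML file listing the canonical URLs. The sketch below builds one with Python's standard library; `build_sitemap` is a hypothetical helper, and the URL list is assumed to come from wherever the site's canonical page URLs are stored.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a minimal sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Only the friendly URL is listed, never the page-ID variant
print(build_sitemap(["https://www.domain.co.uk/the-is-the-page-url/"]))
```

Listing only the friendly URLs here is what keeps the page-ID variants out of what gets submitted to Google.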

  • Alex Skrypnyk 6132 posts 23951 karma points MVP 7x admin c-trib
    May 23, 2017 @ 09:09
    Alex Skrypnyk
    0

    Hi David

    I think it's easy to track this problem down: just find which pages contain these links and which part of the code renders them.

    Alex

  • David 4 posts 95 karma points
    May 23, 2017 @ 09:55
    David
    100

    Problem solved.

    These pages were listed in some additional sitemaps but not in the main sitemap. I'm not sure why a previous developer did it this way, or why he didn't check his work.

    I have also been advised that the Umbraco ID URLs are no longer accessible.

    Thanks for all your guidance on this, Alex and Ismail.
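For anyone hitting the same thing, extra sitemaps can be audited for ID-style entries with a few lines of Python. This is a sketch under assumptions: `id_urls_in_sitemap` is an illustrative name, the sitemap is assumed to follow the sitemaps.org schema, and an ID-style URL is again taken to be one whose path is a single numeric segment.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
# Assumption: an ID-style URL has a path made of one numeric segment, e.g. /1234/
ID_PATH = re.compile(r"^/\d+/?$")


def id_urls_in_sitemap(xml_text):
    """Return the <loc> values in a sitemap whose path is just a page ID."""
    root = ET.fromstring(xml_text)
    locs = [el.text.strip() for el in root.findall(".//sm:loc", NS) if el.text]
    return [url for url in locs if ID_PATH.match(urlparse(url).path)]


sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.domain.co.uk/1234/</loc></url>
  <url><loc>https://www.domain.co.uk/the-is-the-page-url/</loc></url>
</urlset>"""

print(id_urls_in_sitemap(sample))  # ['https://www.domain.co.uk/1234/']
```

An empty result for every sitemap submitted to Google suggests the ID URLs are no longer being advertised to the crawler.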

    Regards, David

  • Alex Skrypnyk 6132 posts 23951 karma points MVP 7x admin c-trib
    May 23, 2017 @ 20:41
    Alex Skrypnyk
    0

    You are welcome, David.

    Glad that we helped!

    Alex
