Transliterating cyrillic URLs with umbracoSettings.config
While modern systems (on server and client side) work quite well with non-latin (UTF-8) URLs, they still present some technical risk. Transliteration of cyrillic URLs to their latin representation can be accomplished quite easily with the standard umbraco URL-Replacing in "umbracoSettings.config" (in directory "/config/").
It is not possible to account for some of the finer details, but using a simple replacement table will handle 95%+ of most document URLs. Here's the list to insert below the already existing "char"-Items inside the urlReplacing-Tag:
let me try to post my transliteration piece of umbracoConfig here as a picture, hope the ugly code editor will keep it as is... :-D
As you can see, there are no apostrophes in ascii parts of the pairs at all... Your file also contains some of the chars that we still not added to the config ("i" with double dots above etc.), these symbols maybe all from cyrillic part of unicode set, but not part of the russian alphabet, so we not needed them for the russian sites we made for the moment :-) If these specific chars can possibly appear in the URLs of your projects, you can keep them all with your current transliteration or without apostrophes...
thank you for the feedback. I will remove the apostrophes, never liked such URLs anyways. The non-russian, but nonetheless cyrillic letters I want to keep for a broader reusability.
I guess you had the same trouble with the editor I had with my first post ... ;-)
Could you upload your translit-config and provide a link? That would be great. Thanks!
I made the same as described by Marc, but without success. My version of umbraco is 7.10.3. So I added chars to umbracoSettings.config, then opened web.config and saved it. Could someone explain to me what I'm doing wrong?
Transliterating cyrillic URLs with umbracoSettings.config
While modern systems (on server and client side) work quite well with non-latin (UTF-8) URLs, they still present some technical risk. Transliteration of cyrillic URLs to their latin representation can be accomplished quite easily with the standard umbraco URL-Replacing in "umbracoSettings.config" (in directory "/config/").
It is not possible to account for some of the finer details, but using a simple replacement table will handle 95%+ of most document URLs. Here's the list to insert below the already existing "char"-Items inside the urlReplacing-Tag:
Maybe this is of use for someone. I also hope, the forum editor will handle this characters well ... ;)
Remember to reload the umbraco config (i.e. by touching the web.config in the root) after applying!
Editor waving the flag, edit post not working ("XSLT error"), so here's a link: http://hello.mindrevolution.com/umbraco/cyrillic-url-transliteration.txt
Feedback welcome, specially on the apostrophes. Should they be kept or removed?
Hello Marc,
let me try to post my transliteration piece of umbracoConfig here as a picture, hope the ugly code editor will keep it as is... :-D
As you can see, there are no apostrophes in ascii parts of the pairs at all... Your file also contains some of the chars that we still not added to the config ("i" with double dots above etc.), these symbols maybe all from cyrillic part of unicode set, but not part of the russian alphabet, so we not needed them for the russian sites we made for the moment :-) If these specific chars can possibly appear in the URLs of your projects, you can keep them all with your current transliteration or without apostrophes...
Hope it will help...
WBR
Hi Alexander,
thank you for the feedback. I will remove the apostrophes, never liked such URLs anyways. The non-russian, but nonetheless cyrillic letters I want to keep for a broader reusability.
I guess you had the same trouble with the editor I had with my first post ... ;-)
Could you upload your translit-config and provide a link? That would be great. Thanks!
Marc.
Marc,
I made one change to my post, I hope now the evil code editor will not be able to dig in the image :-)
Perfect. Thank you, Alexander!
Due to some requests, here's the current version we use with several multi-lingual sites: https://gist.github.com/esn303/f181a66bb701be1bd304
Hi guys,
I made the same as described by Marc, but without success. My version of umbraco is 7.10.3. So I added chars to umbracoSettings.config, then opened web.config and saved it. Could someone explain to me what I'm doing wrong?
Thanks a lot!
is working on a reply...