World is now on Opti ID! Learn more

Deane Barker
Sep 3, 2010
  4034
(0 votes)

Correcting All Raw URLs Inside a String of HTML

If you create a link inside an EPiServer page to another EPiServer page, the link that gets embedded in the HTML is not friendly.  It looks something like this:

/Templates/PageTypes/TextPage.aspx?id=66

EPiServer corrects these links very late in the page lifecycle – actually in a response filter after the entire page has been generated.  This way, it’s sure to correct links no matter where on the page they might be.

What I noticed, however, is that these links were only getting corrected for Web forms.  More specifically, they were only getting corrected for anything with a “text/html” content type.

This was usually fine because I could change the response type, but sometimes it was an issue, especially when doing Ajax callbacks.  You often send those back as some plain text format like JSON, and you form the response at the code level, concatening the raw values of properties into a string.  Before long, you notice that your links aren’t corrected in blocks of HTML.

It’s not hard to get the friendly URL from a PageData object:

UrlBuilder url = new UrlBuilder(pd.LinkURL);
UrlRewriteProvider.ConvertToExternal(url, somePageData.PageLink, UTF8Encoding.UTF8);
string theFriendlyURl = url.ToString();

This is great to go from a PageData object to a friendly URL.  What I found to be much tricker, however, was if I had a big string of HTML with a bunch of embedded links and I needed to correct them all before working with the string.

When I was in Sweden for the Partner Summit, I posed this question to Magnus Stråle.  I was awfully grateful when he took the time to dig through EPiServer’s unit tests to find a chunk of code for me.  It’s distilled down to this function:

private string CorrectLinks(string rawHtml)
{
    var toExternal = new FriendlyHtmlRewriteToExternal(UrlBuilder.RebaseKind.ToRootRelative);
    return toExternal.RewriteString(
        new UrlBuilder(HttpContext.Current.Request.Path),
        new UrlBuilder(HttpContext.Current.Request.RawUrl),
        HttpContext.Current.Response.ContentEncoding,
        rawHtml);
}

This will take a string of HTML, correct all the links inside of it, and return another string, ready for insertion into any response, be it text, JSON, or whatever.

Thanks to Magnus for taking the time to help me solve this.

Sep 03, 2010

Comments

Please login to comment.
Latest blogs
Make Global Assets Site- and Language-Aware at Indexing Time

I had a support case the other day with a question around search on global assets on a multisite. This is the result of that investigation. This co...

dada | Jun 26, 2025

The remote server returned an error: (400) Bad Request – when configuring Azure Storage for an older Optimizely CMS site

How to fix a strange issue that occurred when I moved editor-uploaded files for some old Optimizely CMS 11 solutions to Azure Storage.

Tomas Hensrud Gulla | Jun 26, 2025 |

Enable Opal AI for your Optimizely products

Learn how to enable Opal AI, and meet your infinite workforce.

Tomas Hensrud Gulla | Jun 25, 2025 |

Deploying to Optimizely Frontend Hosting: A Practical Guide

Optimizely Frontend Hosting is a cloud-based solution for deploying headless frontend applications - currently supporting only Next.js projects. It...

Szymon Uryga | Jun 25, 2025

World on Opti ID

We're excited to announce that world.optimizely.com is now integrated with Opti ID! What does this mean for you? New Users:  You can now log in wit...

Patrick Lam | Jun 22, 2025

Avoid Scandinavian Letters in File Names in Optimizely CMS

Discover how Scandinavian letters in file names can break media in Optimizely CMS—and learn a simple code fix to automatically sanitize uploads for...

Henning Sjørbotten | Jun 19, 2025 |