Strip HTML Tags in JavaScript | CSS-Tricks

Dango

# August 27, 2010

Your script works great! Cheers!

admire

# October 13, 2010

This is so cool, I like it.

Pushpinder Bagga

# November 7, 2010

function strip(html)
{
var tmp = document.createElement("DIV");
tmp.innerHTML = html;
return tmp.textContent || tmp.innerText;
}

JC

# December 23, 2011

This was even better for my needs. No issues with special characters etc…

Morgan Roderick

# March 28, 2013

That is awful advice!

If for some reason (like malicious intent of users) the html argument contains a script tag, you’ve now opened yourself up to XSS attacks!

Don’t use the DOM for something that doesn’t require it.

Also, the DOM is really slow.

Martin Adámek

# June 1, 2013

This solution is great for showing a paragraph’s inner content in a JS alert window: it strips nbsp and em effectively. Thanks!

Venkat

# September 9, 2015

Pushpinder,
Lovely. Worked great.

Nephi

# February 10, 2021

If you don’t need to support IE6, maybe try using the DOMParser directly, as it won’t download images or execute scripts:
```
function stripHtml(dirtyString) {
  const doc = new DOMParser().parseFromString(dirtyString, 'text/html');
  return doc.body.textContent || '';
}
```
Now if you run something like stripHtml('<img onerror="alert(\'could run arbitrary JS here\')" src="bogus">'); it won’t cause issues while still allowing the browser to do the work.

ashleedawg

# September 13, 2021

One-Liner:

Here’s a one-liner if you happen to be using jQuery anyway:

txt=$(document.createElement("DIV")).html('<b>Hi</b>').text();

derek

# February 10, 2011

hey!!!..this is so ridiculous..

Eugene

# March 9, 2011

Thank you for great example

Sebastian

# August 2, 2011

Thanks, this does exactly what I need (and so concisely, too!)

Ian

# August 18, 2011

Thanks! A quick note about the regexp: the “i” isn’t needed here because there are no characters to be case-insensitive about. However, it does exactly what you want either way.

Porter

# October 14, 2011

Nice, but the parentheses are unnecessary.

.replace(/<[^>]+>/ig,"");

Jainam Shah

# September 10, 2014

Thanks

Florian Ricard

# October 21, 2011

Hi :)

I saw your contact form and I must say I love it!
Do you have a tutorial or something like that? It’s a wonderful one :)
Hope to hear some news of you,

A french reader,

Florian

Truong Duong

# October 22, 2011

Thanks for the script :)

@Ricard: If you want to make a copy of the contact form, just view source or save this page to your local machine ;)

jeu-jeu-jeu.net

# December 16, 2011

Beautiful site, thank you for the great example.

DScout

# December 17, 2011

the /i for case insensitivity is definitely recommended.
When using contenteditable, IE produces upper case tags, mozilla would only create lower case… To strip those you need it case insensitive.

do0g

# May 2, 2014

DScout, this is incorrect. There are no specified alphabetical characters in the regular expression – the case insensitivity modifier therefore affects nothing.
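
A quick way to check the point made in this sub-thread: since the pattern contains no letters, the "i" flag should not change what it matches. The sketch below assumes the regex under discussion is /(<([^>]+)>)/ (as quoted elsewhere in these comments); the function names and the sample string are made up for illustration.

```javascript
// Hypothetical helpers: the same pattern, with and without the "i" flag.
function stripWithI(html) {
  return html.replace(/(<([^>]+)>)/ig, "");
}

function stripWithoutI(html) {
  return html.replace(/(<([^>]+)>)/g, "");
}

// Mixed-case tags, similar to the upper-case markup IE is said to produce.
var sample = "<P>Hello <B>big</B> <i>world</i></P>";

console.log(stripWithI(sample));    // "Hello big world"
console.log(stripWithoutI(sample)); // "Hello big world"
```

Both calls give the same result because [^>] already accepts any character except ">", upper or lower case.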

Sadia

# December 25, 2011

Hi

I have following code:

var text = '[$ ssIncludeXml(docName,"wcm:root/wcm:element[@name='innerpage_content']/text()") $]';
var StrippedString = text.replace(/(<([^>]+)>)/ig,"");

where ‘[$ ssIncludeXml(docName,”wcm:root/wcm:element[@name=’innerpage_content’]/text()”) $]’
is Idoc script that brings a block of HTML from a placeholder. But i am getting “unterminated string literal” Error at first line.

What i want to do is to remove or strip all HTML tags and to get plain text out of that markup.

Kindly let me know if there is any solution.

Thanks

JOhn

# January 6, 2012

Works great but doesn’t strip whitespace…

Valutar BNR

# March 2, 2012

Thank you! It was very useful for me and I think that is useful for everyone.
Thank you again!

Elliott

# March 8, 2012

Yeah, this solution removed all sorts of HTML, paragraph, line breaks, in-line styles etc etc

reena upadhyay

# March 29, 2012

This does not work in IE. Please provide a solution to strip tags in JavaScript that works in all browsers.

Shilpa Agrawal

# April 18, 2012

Thanks for this script.
It works great.

Ammar

# April 19, 2012

i am trying it on

var message;

    firstName = document.getElementById("username").value;

    if (firstName == null || firstName == "" || firstName == NaN || firstName == "First Name") {
        message = "Please Add some name.";
        document.body.insertAdjacentHTML("BeforeEnd", "" + message + "");
    }
    else {
        if (document.getElementById("myMessage")) {
            debugger;
            arguments = document.getElementById("myMessage").value.replace(/(<([^>]+)>)/ig, "");
        }
    }

but it is not working and saying

cannot call method ‘replace’ of undefined

Ryan Mc Closkey

# May 22, 2012

Was wondering how this would be implemented if I only wanted to remove the href tags from a string of text, instead of removing all the tags? I’m trying to retrieve a page of text from a website but I only want the plain text with the formatting tags (p, ul, li).
Hope this makes sense, thanks in advance.

DropTheNerd

# May 24, 2012

This was excellent! Thanks!

totingnya

# July 26, 2012

your “\S” is missing… or not?

/(<\S([^>]+)>)/ig

do0g

# May 2, 2014

\S means not whitespace, and ^> means not greater than, so your modified regex only ensures that single character tags will not be replaced.
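
To see this explanation in action, here is a small sketch comparing the two patterns quoted in this sub-thread. The variable names and the sample string are made up for illustration.

```javascript
// The variant with \S, as quoted above.
var withS = /(<\S([^>]+)>)/ig;
// The pattern without \S.
var plain = /(<([^>]+)>)/ig;

var input = "<b>Hi</b> <em>there</em>";

// \S consumes the only character of "<b>", so [^>]+ has nothing left to match
// and the single-character tag survives.
console.log(input.replace(withS, "")); // "<b>Hi there"
console.log(input.replace(plain, "")); // "Hi there"
```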

javier

# October 19, 2012

Great! Thanks!

Hemant Vaniya

# November 15, 2012

Thanks,
Its working fine.

Emmanuel Sayson

# November 16, 2012

Cool! This is perfectly working…

anonymous

# November 28, 2012

What about <br /> or <hr /> (the self-closing tags)?

Nagarjuna Gottimukkala

# December 20, 2012

Cool……Nice Example.

Jeremy

# March 14, 2013

Looks like “newInput” doesn’t do anything at all? So it’s either extraneous or there’s a problem with the code.

Hardik Sondagar

# May 10, 2013

I have developed the same thing using a JavaScript regular expression.
It strips all HTML tags except those in an exclude list provided by the user.
The source code is also available on GitHub.
Check here: HTML Tag Stripper

Germano

# May 15, 2013

Nice, but it’s not that safe… I’d rather use jQuery:

$("<div/>").text('<img alt="a>b" src="a_b.gif" />').text();

Ahahahaha

# June 4, 2013

document.body.innerText

<a onclick="return a > b"> ~ fail
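
The failure above comes from the pattern stopping at the first ">" it sees, even inside a quoted attribute. One possible workaround, sketched below, is a pattern that skips over quoted runs inside a tag. The function names are hypothetical, and this is still a regex rather than a parser, so comments, malformed markup and the like can trip it up.

```javascript
// The simple pattern: ends the "tag" at the first ">" it finds.
function stripSimple(html) {
  return html.replace(/<[^>]+>/g, "");
}

// Hypothetical quote-aware variant: inside a tag, "..." and '...' runs are
// consumed as a whole, so a ">" inside an attribute value is not a tag end.
function stripQuoteAware(html) {
  return html.replace(/<(?:"[^"]*"|'[^']*'|[^'">])*>/g, "");
}

var link = '<a onclick="return a > b">click</a>';

console.log(stripSimple(link));     // ' b">click'
console.log(stripQuoteAware(link)); // 'click'
```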

SKV

# July 15, 2013

But this code is not working well with HTML table content.

Al

# November 4, 2013

How can strip all tags except anchor and img tags?

Jonas

# November 29, 2013

You can easily leave out the case sensitivity /i and the grouping ():

var noHtml = hasHtml.replace(/<[^>]+>/g, '')

duromir

# April 6, 2014

using jQuery
jQuery(stringWithTags).text()

farzad

# March 10, 2015

jQuery(stringWithTags).text();
It is what I want. Thanks…

Muhammad Navaid

# June 23, 2015

not working with AngularJS.

Mohammad Mustafa Ahmedzai

# September 17, 2015

Probably the simplest solution I found online. Thanks a bunch for it. Worked just fine!

Hamada Abdelaziz

# September 21, 2015

This is the best solution I have found:
http://phpjs.org/functions/strip_tags/
It is equivalent to PHP’s strip_tags function.

cccccccccc

# June 21, 2016

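// Strings are immutable, so each result has to be assigned back.
string = string.replace(/\n/g, "");
string = string.replace(/[\t ]+\</g, "<");
string = string.replace(/\>[\t ]+\</g, "><");
string = string.replace(/\>[\t ]+$/g, ">");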

Samantha

# August 2, 2016

Doesn’t anyone see how this solution greatly affects this text:

Rounded amounts < 3 are way easier for people to use in calculations, since they are so tiny than numbers that are >=3

Becomes: Rounded amounts =3
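
The result above is easy to reproduce. The sketch below runs the regex from this page on that sentence, then tries a slightly tighter pattern that only treats "<" as a tag opener when a letter (or "/" plus a letter) follows directly. The tighter pattern and the function name are assumptions for illustration, not part of the original snippet, and text such as "a<b and c>d" would still be mangled by it.

```javascript
var text = "Rounded amounts < 3 are way easier for people to use in calculations, " +
           "since they are so tiny than numbers that are >=3";

// The pattern from the article removes everything from "<" up to the first ">".
var stripped = text.replace(/(<([^>]+)>)/ig, "");
console.log(stripped); // "Rounded amounts =3"

// Hypothetical tighter variant: "<" must be followed by a letter, or by "/" and a letter.
function stripTagLike(input) {
  return input.replace(/<\/?[a-z][^>]*>/ig, "");
}

console.log(stripTagLike(text) === text); // true, the sentence is left alone
console.log(stripTagLike("<p>Hi</p>"));   // "Hi"
```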

Samantha

# August 2, 2016

This one is better; phpjs.org/functions/strip_tags/

mindfullsilence

# January 9, 2017

Safe way to use the DOM to strip html.

function striptags(content) {
  var frag = document.createDocumentFragment();
  var innerEl = document.createElement('div');
  frag.appendChild(innerEl);
  innerEl.innerHTML = content;
  return frag.firstChild.innerText;
}
striptags('<script>alert("xss attack!")</script>');

Shaun

# January 14, 2018

I chucked together a function that allows some tags to be kept, similar to how the php function works.

As with PHP it comes with the following two caveats:

Because strip_tags() does not actually validate the HTML, partial or broken tags can result in the removal of more text/data than expected.

and

This function does not modify any attributes on the tags that you allow using allowable_tags, including the style and onmouseover attributes that a mischievous user may abuse when posting text that will be shown to other users.

/**
 * Native javascript function to emulate the PHP function strip_tags.
 * 
 * @param {string} str The original HTML string to filter.
 * @param {array|string} allowable_tags A tag name or array of tag
 * names to keep. Integers, objects, and strings that don't follow the
 * standard tag format of a letter followed by numbers and letters will
 * be ignored. This means that invalid tags will also be removed.
 * @return {string} The filtered HTML string.
 */
function strip_tags(str, allowable_tags) {
    allowable_tags = [].concat(allowable_tags);
    var keep = '';
    allowable_tags.forEach(function(tag) {
        if (('' + tag).match(/^[a-z][a-z0-9]*$/i)) // "*" so single-letter tags such as "a" or "p" are accepted
            keep += (keep.length ? '|' : '') + tag;
    } );
    return str.replace(new RegExp(']+>', 'ig'), '');
}

Additional checks have been implemented to prevent invalid tags from being removed where possible, by ensuring that the opening of each tag starts with a potential tag name; it does not account for greater than symbols within attributes. Comments will be retained but can be removed with a similar regex.

var no_comments = strip_tags('This is not a comment. ').replace(//, '');

John Fiala

# July 22, 2020

Hi!

I hate to bother you, but it looks like the last line of your function has been corrupted somehow – that’s not a valid Regex. Any chance you could fix it?
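
Since the original expression did not survive, here is a rough sketch of what a function along those lines could look like. It is a guess based on the description above, not Shaun's actual code: the name stripTagsSketch, the lookahead approach and the exact pattern are all assumptions, and the same caveats about broken tags and ">" inside attributes apply.

```javascript
// Hypothetical reconstruction, NOT the original function from the comment above.
function stripTagsSketch(str, allowableTags) {
  // Accept a single tag name or an array; ignore anything that is not a plausible tag name.
  var keep = [].concat(allowableTags || []).filter(function (tag) {
    return /^[a-z][a-z0-9]*$/i.test("" + tag);
  });
  // Negative lookahead that refuses to match tags whose name is on the keep list.
  var lookahead = keep.length ? "(?!(?:" + keep.join("|") + ")\\b)" : "";
  // Only "<" or "</" followed by a letter counts as a tag, so a stray "<" in text is left alone.
  var pattern = new RegExp("<\\/?" + lookahead + "[a-z][^>]*>", "ig");
  return str.replace(pattern, "");
}

var html = '<div><p>Hi <a href="#">link</a> <b>bold</b></p></div>';
console.log(stripTagsSketch(html, ["p", "a"])); // '<p>Hi <a href="#">link</a> bold</p>'
console.log(stripTagsSketch(html));             // 'Hi link bold'
```

The \b after the tag name keeps "p" from also protecting "pre", and "a" from protecting "address".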

BanZai

# July 26, 2018

Hi guys! I am currently facing a javascript problem with the regex / replace function you mention here.
I would like to strip a text of some of its HTML tags.

For this I use the function:

var regex = /(<([^>]+)>)/ig;
bodyValue = bodyValue.replace(regex, "");

Here all tags are deleted.

But I want to keep the <p> and <br> tags and found these two separate expressions that worked for me:

var regex = /<(?!\s*\/?\s*p\b)[^>]*>/gi; // deletes all HTML except <p>

var regex = /<(?!br\s*\/?)[^>]+>/gi; // deletes all HTML except for <br>

Do you know how to combine the two conditions in one?
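
One way that might work is to put both tag names into a single negative lookahead using an alternation. This is only a sketch based on the two expressions quoted above; the variable names and the sample markup are made up for illustration.

```javascript
// Hypothetical combination: skip any tag whose name is exactly "p" or "br"
// (opening, closing or self-closing), and remove every other tag.
var keepPAndBr = /<(?!\s*\/?\s*(?:p|br)\b)[^>]*>/gi;

var markup = "<div><p>Hello<br/>world</p> <b>bold</b> <pre>x</pre></div>";
var cleaned = markup.replace(keepPAndBr, "");

console.log(cleaned); // "<p>Hello<br/>world</p> bold x"
```

The \b after the alternation is what stops "p" from also protecting tags like "pre".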

Fran

# October 18, 2018

This not only removes the offending characters, but also the rest of the text.

Geoff Graham

# October 19, 2018

What’s the HTML you’re working with?

jass

# August 12, 2020

Why don’t you use Element.textContent?

Shanti Tripathy

# September 4, 2020

Just what I needed…Thanks

kudos

# April 30, 2021

.replace(/(<([^> ]+)>)/ig, "")
added a space after the chevron to allow for things like: “< heey >”
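
For anyone trying this variant, a quick check of what it does in practice may help. As far as I can tell, adding the space to the character class means anything containing a space between "<" and ">" is left untouched, which also covers real tags with attributes. The variable names below are made up for illustration.

```javascript
var original = /(<([^>]+)>)/ig;
var withSpace = /(<([^> ]+)>)/ig;

// The original pattern removes "< heey >" entirely; the variant leaves it in place.
console.log("< heey >".replace(original, ""));  // ""
console.log("< heey >".replace(withSpace, "")); // "< heey >"

// Side effect: tags with attributes contain a space, so they are no longer stripped.
console.log('<a href="#">x</a>'.replace(withSpace, "")); // '<a href="#">x'
```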

Bilelz

# June 29, 2021

Another tip: use the browser’s ability to remove tags:

const fakeDiv = document.createElement("div");
fakeDiv.innerHTML = html;
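// Assign via textContent rather than innerHTML so the stripped text is not parsed as HTML again.
document.getElementById("stripped").textContent = fakeDiv.textContent || fakeDiv.innerText || "";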

Godswill

# August 9, 2021

Hello sir. Please I wish to know if I can get help from you.
I have a frontend submission which users can share their article but will want to remove every link on the form.
Is there a way to do this only for the post submitted by users who are not admin?
Thanks
I already have the frontend post set and it works properly except what I am seeking for help.
