I need to parse some HTML in my project. The HTML is fairly simple and controlled; that is, we are not parsing arbitrary, malformed HTML from the wild.
I was thinking of using a regular expression for this purpose, but I am not (yet) an expert in building regex patterns.
However, I found the following pattern that will match all HTML tags:
Does anyone have feedback on this pattern? Will it really match all HTML tags? Does it have any weaknesses?
As an alternative, I believe I could use the HTML Agility Pack. I know that the Orchard Project uses it internally.
Would the Agility Pack be an appropriate choice for my purposes?