Skip to content Skip to sidebar Skip to footer

Extract Data With Regex From Html

I would like to extract items from this sample html, more specificly, i would like to isolate the following ones: algp1, PRODUCTION 50733 GEN_APPL KANTOOR

Solution 1:

Check out JTidy. It will parse the HTML and give you a DOM interface to iterate over.

I would strongly recommend not using a regexp for all but the simplest cases. HTML isn't regular and has no end of edge cases to trip you up.


Solution 2:

Take a look at regulazy...

It will allow you to create a regexp from an input string with a simple point and click interface.

http://osherove.com/tools/


Post a Comment for "Extract Data With Regex From Html"