Hi all,
For my GSoC project, Radiology Reporting Enhancement, I will need to parse HTML files. I googled the query
best java library for parsing html files
and the first 6 links talked only about jsoup. It has very good documentation, and after playing with it for a while I think it will suit my needs. Still, I would like to know whether the community has a recommendation on what to use.
Also, I would like to know whether I could parse these HTML files using just the HTML Form Entry module, since we will be using it to create the HTML forms that serve as report templates. In particular, can it parse general HTML files that were not created with the module?