Parsing and extracting information from (possibly malformed) HTML/XML documents

Edit Package ghc-tagsoup
http://hackage.haskell.org/package/tagsoup

TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Refresh
Refresh
Source Files
Filename Size Changed
ghc-tagsoup.changes 0000005046 4.93 KB
ghc-tagsoup.spec 0000002526 2.47 KB
tagsoup-0.14.8.tar.gz 0000043894 42.9 KB
Latest Revision
Samu Voutilainen's avatar Samu Voutilainen (Smar) committed (revision 2)
osc copypac from project:openSUSE.org:SUSE:SLE-15-SP5:GA package:ghc-tagsoup revision:1
Comments 0