Java library for working with HTML

Edit Package jsoup

jsoup is a Java library for working with HTML.
It provides an API for extracting and manipulating data,
using DOM, CSS, and jquery-like methods.

jsoup implements the WHATWG HTML5 specification.

- scrapes and parses HTML from a URL, file, or string
- finds and extracts data, using DOM traversal or CSS selectors
- manipulates the HTML elements, attributes, and text
- cleans user-submitted content against a safe white-list,
to prevent XSS attacks
- outputs tidied HTML

jsoup can deal with invalid HTML tag soup.

Refresh
Refresh
Source Files
Filename Size Changed
_service 0000000484 484 Bytes
jsoup-1.14.2.tar.xz 0000227624 222 KB
jsoup-build.xml 0000007968 7.78 KB
jsoup.changes 0000000847 847 Bytes
jsoup.spec 0000002743 2.68 KB
Revision 1 (latest revision is 2)
Samu Voutilainen's avatar Samu Voutilainen (Smar) committed (revision 1)
osc copypac from project:openSUSE.org:SUSE:SLE-15-SP2:Update package:jsoup revision:9f5818fbbbc65b36a463134790fa942e
Comments 0