trafilatura-0.8.0
- improved link discovery and handling
- fixes in metadata extraction, feeds and sitemaps processing
- breaking change: the
extract
function now reads target format fromoutput_format
argument only - new extraction option: preserve links, CLI options re-ordered
- more opportunistic backup extraction