Bump the dependencies group with 8 updates #525

Closed
wants to merge 1 commit into from

Conversation

dependabot[bot]
Contributor

dependabot bot commented on behalf of github Mar 18, 2024

Bumps the dependencies group with 8 updates:

| Package | From | To |
| --- | --- | --- |
| trafilatura | 1.5.0 | 1.7.0 |
| beautifulsoup4 | 4.12.1 | 4.12.3 |
| boilerpy3 | 1.0.6 | 1.0.7 |
| goose3 | 3.1.13 | 3.1.19 |
| html2text | 2020.1.16 | 2024.2.26 |
| inscriptis | 2.3.2 | 2.5.0 |
| news-please | 1.5.22 | 1.5.44 |
| resiliparse | 0.14.3 | 0.14.5 |
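
The update type recorded for each package in this group (major, minor, or patch) can be read off from the first dotted version component that changes. The following is a minimal sketch of that reading, assuming purely numeric dotted versions; `BUMPS`, `parse_version`, and `classify_bump` are hypothetical names used for illustration and are not part of Dependabot.

```python
from typing import Dict, Tuple

# Versions copied from the table above: package -> (from, to).
BUMPS: Dict[str, Tuple[str, str]] = {
    "trafilatura": ("1.5.0", "1.7.0"),
    "beautifulsoup4": ("4.12.1", "4.12.3"),
    "boilerpy3": ("1.0.6", "1.0.7"),
    "goose3": ("3.1.13", "3.1.19"),
    "html2text": ("2020.1.16", "2024.2.26"),
    "inscriptis": ("2.3.2", "2.5.0"),
    "news-please": ("1.5.22", "1.5.44"),
    "resiliparse": ("0.14.3", "0.14.5"),
}


def parse_version(version: str) -> Tuple[int, ...]:
    """Split a purely numeric dotted version such as '1.7.0' into integers."""
    return tuple(int(part) for part in version.split("."))


def classify_bump(old: str, new: str) -> str:
    """Hypothetical helper: name the first dotted component that differs."""
    labels = ("major", "minor", "patch")
    old_parts, new_parts = parse_version(old), parse_version(new)
    for label, before, after in zip(labels, old_parts, new_parts):
        if before != after:
            return label
    return "none"


if __name__ == "__main__":
    for name, (old, new) in BUMPS.items():
        print(f"{name}: {old} -> {new} ({classify_bump(old, new)})")
```

Under this reading, html2text comes out as a major update only because its first component changes from 2020 to 2024; its version numbers are date-based, so the label says little about the size of the change, and the html2text changelog is worth reading before merging.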

Updates trafilatura from 1.5.0 to 1.7.0

Release notes

Sourced from trafilatura's releases.

trafilatura-1.7.0

Extraction:

  • improved html2txt() function (#483)

Downloads:

  • add advanced fetch_response() function → pending deprecation for fetch_url(decode=False)

Maintenance:

trafilatura-1.6.4

Maintenance:

  • MacOS: fix setup, update htmldate and add tests (#460)
  • drop invalid XML element attributes with @​vbarbaresi in #462
  • remove cyclic imports (#458)

Navigation:

  • introduce MAX_REDIRECTS config setting and fix urllib3 redirect handling by @​vbarbaresi in #461
  • improve feed detection (#457)

Documentation:

trafilatura-1.6.3

Extraction:

Metadata:

  • more precise date extraction (see htmldate)
  • new htmldate extensive search parameter in config (#434)
  • changes in URLs: normalization, trackers removed (see courlan)

Navigation:

  • reviewed code for feeds (#443)
  • new config option: external URLs for feeds/sitemaps (#441)

Documentation:

trafilatura-1.6.2

Extraction:

... (truncated)

Changelog

Sourced from trafilatura's changelog.

1.7.0

Extraction:

  • improved html2txt() function

Downloads:

  • add advanced fetch_response() function → pending deprecation for fetch_url(decode=False)

Maintenance:

1.6.4

Maintenance:

  • MacOS: fix setup, update htmldate and add tests (#460)
  • drop invalid XML element attributes with @​vbarbaresi in #462
  • remove cyclic imports (#458)

Navigation:

  • introduce MAX_REDIRECTS config setting and fix urllib3 redirect handling by @​vbarbaresi in #461
  • improve feed detection (#457)

Documentation:

1.6.3

Extraction:

Metadata:

  • more precise date extraction (see htmldate)
  • new htmldate extensive search parameter in config (#434)
  • changes in URLs: normalization, trackers removed (see courlan)

Navigation:

  • reviewed code for feeds (#443)
  • new config option: external URLs for feeds/sitemaps (#441)

Documentation:

1.6.2

... (truncated)

Commits

Updates beautifulsoup4 from 4.12.1 to 4.12.3

Updates boilerpy3 from 1.0.6 to 1.0.7

Release notes

Sourced from boilerpy3's releases.

v1.0.7

Fixes

  • Fixes python_requires field in setup.py (thanks @​jsirois!)

Changelog

Sourced from boilerpy3's changelog.

Changelog

Commits

Updates goose3 from 3.1.13 to 3.1.19

Release notes

Sourced from goose3's releases.

Version 3.1.19

Version 3.1.18

Version 3.1.17

Version 3.1.16

Version 3.1.15

Version 3.1.14

Changelog

Sourced from goose3's changelog.

3.1.19

3.1.18

3.1.17

3.1.16

3.1.15

3.1.14

Commits

Updates html2text from 2020.1.16 to 2024.2.26

Release notes

Sourced from html2text's releases.

2024.2.26

What's Changed

Full Changelog: Alir3z4/html2text@2024.2.25...2024.2.26

2024.2.25

What's Changed

New Contributors

Full Changelog: Alir3z4/html2text@2020.1.16...2024.2.25

Changelog

Sourced from html2text's changelog.

2024.2.26


  • Fixes #409: IndexError on empty strong mark.

2024.2.25


  • Fix #332: Insert at most one space for multiple emphasis
  • Feature #318: Make padded tables more similar to pandoc's pipe_tables.
  • Add support for Python 3.9.
  • Fix extra line breaks inside html link text (between '[' and ']')
  • Fix #344: indent <ul> inside <ol> three spaces instead of two to comply with CommonMark, GFM, etc.
  • Fix #324: unnecessary spaces around <b>, <em>, and strike tags.
  • Don't wrap tables by default and add a --wrap-tables config option.
  • Feature #198: Ignore <p> tags inside table rows.
  • Remove support for Python ≤ 3.5. Now requires Python 3.6+.
  • Support for Python 3.10+.
  • Fix #320 padding empty tables and tables with no </tr> tags.
  • Add ignore_mailto_links config option to ignore mailto: style links.
  • Feature #407: Support the superscript and subscript tags.
  • Fix #373: \n inside text of a Markdown link.
  • Feature #406: Improve support for null attribute values.

Commits
  • 7ae5948 Release 2024.2.26
  • 3a487dd Update AUTHORS.rst file to include new contributors and apply some little cle...
  • b053a5a Fixes #409: IndexError on empty strong mark on version. (#410)
  • ff0db81 Release 2024.2.25
  • 42278c6 Support sup and sub html tags (#408)
  • e375689 Improve support for null atttibute values (#406)
  • 7ba8431 Ignore <p> tags in table rows (#354)
  • 1e7cb73 Merge branch 'mborsetti-py310'
  • 12706e2 Merge branch 'py310' of github.com:mborsetti/html2text into mborsetti-py310
  • 8c15ad2 Added support Python 3.10, 3.11 and 3.12; removed older Python
  • Additional commits viewable in compare view

Updates inscriptis from 2.3.2 to 2.5.0

Release notes

Sourced from inscriptis's releases.

Custom HTML Handling and HTML engine improvements

  • add working support for specifying custom html tags (fixes #81)
  • improved html_engine.py
  • improved typing across all modules
  • added unittests for
    • inscript
    • inscriptis-api
  • documentation update

Fix documentation build and update publish script.

  • fix building documentation on readthedocs.org
  • update publish script

Code cleanup, improved Web service and distribution

  • added official Python 3.12 support
  • Inscriptis command line client
    • renamed inscript.py to inscript and install client via pip
    • added --timeout argument.
  • Inscriptis Web service:
    • migrate the Web service to FastAPI and uvicorn
    • enable install as an extra using pip install inscriptis[web-service]
  • code cleanup
  • migrate to pyproject.toml and poetry for package distribution
  • use black for code formatting
  • improved tox config and code checks

Commits
  • 667b356 Merge pull request #84 from weblyzard/fix/bug-81-custom-html-handling2
  • 504863d Merge branch 'fix/bug-81-custom-html-handling2' of github.com:weblyzard/inscr...
  • db012a4 chg: optimized imports and additional type hints.
  • f138377 chg: code cleanup.
  • 9e31c6c chg: code cleanup.
  • 16ba471 add: unittesting for the inscriptis web service.
  • 1fe8da7 add: annotation profiles required for the unittests.
  • 942c4c7 fix: Exception handling for annotation rule files.
  • 35626dd chg: fully cover the inscript client with unittests.
  • 80c8dd1 add: unittests for the command line client.
  • Additional commits viewable in compare view

Updates news-please from 1.5.22 to 1.5.44

Commits

Updates resiliparse from 0.14.3 to 0.14.5

Commits
  • 0a6d6e6 Add -headerpad_max_install_names to macOS LD_FLAGS
  • a6fdc5b Replace re2::StringPiece with std::string_view
  • ed97829 Set compiler optimisation flags
  • 45c795d Remove explicit abseil dependency
  • 92270ff Use custom vcpkg triplets
  • d262edf Use --overlay-ports
  • ea8d1d3 Build abseil with C++17 ABI
  • 7c16359 Update build image
  • adc93e3 Bump version number
  • e28d3fa Update build image
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore <dependency name> major version will close this group update PR and stop Dependabot creating any more for the specific dependency's major version (unless you unignore this specific dependency's major version or upgrade to it yourself)
  • @dependabot ignore <dependency name> minor version will close this group update PR and stop Dependabot creating any more for the specific dependency's minor version (unless you unignore this specific dependency's minor version or upgrade to it yourself)
  • @dependabot ignore <dependency name> will close this group update PR and stop Dependabot creating any more for the specific dependency (unless you unignore this specific dependency or upgrade to it yourself)
  • @dependabot unignore <dependency name> will remove all of the ignore conditions of the specified dependency
  • @dependabot unignore <dependency name> <ignore condition> will remove the ignore condition of the specified dependency and ignore conditions

Bumps the dependencies group with 8 updates:

| Package | From | To |
| --- | --- | --- |
| [trafilatura](https://github.com/adbar/trafilatura) | `1.5.0` | `1.7.0` |
| [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/) | `4.12.1` | `4.12.3` |
| [boilerpy3](https://github.com/jmriebold/BoilerPy3) | `1.0.6` | `1.0.7` |
| [goose3](https://github.com/goose3/goose3) | `3.1.13` | `3.1.19` |
| [html2text](https://github.com/Alir3z4/html2text) | `2020.1.16` | `2024.2.26` |
| [inscriptis](https://github.com/weblyzard/inscriptis) | `2.3.2` | `2.5.0` |
| [news-please](https://github.com/fhamborg/news-please) | `1.5.22` | `1.5.44` |
| [resiliparse](https://github.com/chatnoir-eu/chatnoir-resiliparse) | `0.14.3` | `0.14.5` |


Updates `trafilatura` from 1.5.0 to 1.7.0
- [Release notes](https://github.com/adbar/trafilatura/releases)
- [Changelog](https://github.com/adbar/trafilatura/blob/master/HISTORY.md)
- [Commits](v1.5.0...v1.7.0)

Updates `beautifulsoup4` from 4.12.1 to 4.12.3

Updates `boilerpy3` from 1.0.6 to 1.0.7
- [Release notes](https://github.com/jmriebold/BoilerPy3/releases)
- [Changelog](https://github.com/jmriebold/BoilerPy3/blob/master/CHANGELOG.md)
- [Commits](jmriebold/BoilerPy3@v1.0.6...v1.0.7)

Updates `goose3` from 3.1.13 to 3.1.19
- [Release notes](https://github.com/goose3/goose3/releases)
- [Changelog](https://github.com/goose3/goose3/blob/master/CHANGELOG.md)
- [Commits](goose3/goose3@v3.1.13...v3.1.19)

Updates `html2text` from 2020.1.16 to 2024.2.26
- [Release notes](https://github.com/Alir3z4/html2text/releases)
- [Changelog](https://github.com/Alir3z4/html2text/blob/master/ChangeLog.rst)
- [Commits](Alir3z4/html2text@2020.1.16...2024.2.26)

Updates `inscriptis` from 2.3.2 to 2.5.0
- [Release notes](https://github.com/weblyzard/inscriptis/releases)
- [Commits](weblyzard/inscriptis@2.3.2...2.5.0)

Updates `news-please` from 1.5.22 to 1.5.44
- [Commits](https://github.com/fhamborg/news-please/commits)

Updates `resiliparse` from 0.14.3 to 0.14.5
- [Commits](chatnoir-eu/chatnoir-resiliparse@v0.14.3...v0.14.5)

---
updated-dependencies:
- dependency-name: trafilatura
  dependency-type: direct:production
  update-type: version-update:semver-minor
  dependency-group: dependencies
- dependency-name: beautifulsoup4
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: dependencies
- dependency-name: boilerpy3
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: dependencies
- dependency-name: goose3
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: dependencies
- dependency-name: html2text
  dependency-type: direct:production
  update-type: version-update:semver-major
  dependency-group: dependencies
- dependency-name: inscriptis
  dependency-type: direct:production
  update-type: version-update:semver-minor
  dependency-group: dependencies
- dependency-name: news-please
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: dependencies
- dependency-name: resiliparse
  dependency-type: direct:production
  update-type: version-update:semver-patch
  dependency-group: dependencies
...

Signed-off-by: dependabot[bot] <[email protected]>
dependabot bot added the dependencies label Mar 18, 2024
adbar closed this Mar 18, 2024
Contributor Author

dependabot bot commented on behalf of github Mar 18, 2024

This pull request was built based on a group rule. Closing it will not ignore any of these versions in future pull requests.

adbar deleted the dependabot/pip/dependencies-7148a7a7ea branch March 18, 2024 14:07