WAT extractor: do not add <meta itemprop="..." > from body as metadata #42

Merged: 2 commits, Dec 10, 2024

sebastian-nagel

(fixes #40)

Do not add metadata that consists of a single key-value pair, such as content = '...'. Require at least two key-value pairs.

Add a unit test to prove that <meta itemprop="..." content="..."> annotations are ignored.
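
For illustration only, here is a minimal Python sketch of the rule described above. It is not the extractor's actual code. It assumes the extractor records only a fixed set of attributes from each meta element (the `RECORDED_ATTRS` set below is hypothetical), so that an itemprop annotation is left with only its content pair and is dropped by the two-pair requirement. The names `MetaCollector` and `extract_metas` are also made up for this example.

```python
from html.parser import HTMLParser

# Hypothetical set of attributes that get recorded; an assumption for this
# sketch, not taken from the pull request.
RECORDED_ATTRS = {"name", "property", "http-equiv", "content"}


class MetaCollector(HTMLParser):
    """Collects <meta> elements that yield at least two recorded key-value pairs."""

    def __init__(self):
        super().__init__()
        self.metas = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Keep only the recorded attributes that carry a value.
        pairs = {k: v for k, v in attrs if k in RECORDED_ATTRS and v is not None}
        # A single pair such as content='...' is not useful metadata: skip it.
        if len(pairs) >= 2:
            self.metas.append(pairs)


def extract_metas(html):
    """Return the list of metadata dicts found in the given HTML string."""
    collector = MetaCollector()
    collector.feed(html)
    collector.close()
    return collector.metas
```

With this sketch, `<meta name="description" content="...">` is kept, while `<meta itemprop="price" content="9.99">` is reduced to a lone content pair and ignored.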

sebastian-nagel force-pushed the 40-ignore-metadata-in-body branch from b4c46a7 to feae6d4 on December 10, 2024 09:00
- rebase to recent head / master
- unit test: merge methods to verify any kind of metadata attributes
sebastian-nagel force-pushed the 40-ignore-metadata-in-body branch from feae6d4 to b474f5d on December 10, 2024 09:43
sebastian-nagel merged commit 48e46d6 into master on Dec 10, 2024
5 checks passed
sebastian-nagel deleted the 40-ignore-metadata-in-body branch on December 10, 2024 15:46
Linked issue: WAT extractor: do not add <meta itemprop="..." > from body as metadata