add lint for duplicate feature under a statement #2573

v1bh475u · 2025-01-24T09:43:25Z

closes #2250

Checklist

No CHANGELOG update needed

No new tests needed

No documentation update needed

Signed-off-by: vibhatsu <[email protected]>

williballenthin

Great start @v1bh475u

From the tracking issue:

We cannot use rule passed as argument for detecting duplicate features as the Rule class when retrieves features from the yaml file, it simply ignores the redundancies. Hence, we have to rely on definition which contains the entire unparsed yaml file as string.

With this background, it makes sense why you parsed the raw string, rather than using the structured data we already have. At the very least, this information should be provided as a comment to explain why the code appears so complex.

Given what it does, the code isn't too hard to follow. In fact, I like your code style quite a bit! Still, it takes some effort to follow along. I'd like either: simpler logic or tests that demonstrate the code working as expected.

For simpler logic, could you use a yaml parser to convert the raw text into a structured representation and avoid all the string splitting and searching?

If you want to stick with the current algorithm, please add some test cases that show the linter working as expected. Since the linter is found in ./scripts and not in the Python module, I don't think we can easily integrate with pytest, so I'd recommend some "self check" code that runs at the start of main everytime the linter is started. Its not very efficient, but the linter isn't run often and the tests shouldn't take long anyways.

Thoughts?

Also, please update the changelog and acknowledge yourself

scripts/lint.py

github-actions

Please add bug fixes, new features, breaking changes and anything else you think is worthwhile mentioning to the master (unreleased) section of CHANGELOG.md. If no CHANGELOG update is needed add the following to the PR description: [x] No CHANGELOG update needed

Signed-off-by: vibhatsu <[email protected]>

CHANGELOG updated or no update needed, thanks! 😄

Signed-off-by: vibhatsu <[email protected]>

v1bh475u · 2025-01-29T14:27:08Z

@williballenthin I used one of the function load of already imported ruamel.yaml which does not discard duplicate features so we can use it and rewritten the entire code. Please review.

v1bh475u · 2025-01-29T14:45:16Z

Also, please explain how I should handle conflicts in CHANGELOG

williballenthin · 2025-01-29T15:02:54Z

Also, please explain how I should handle conflicts in CHANGELOG

use your best judgement to merge the changes. order doesn't really matter, just that the content go into the right sections.

williballenthin · 2025-01-29T15:06:09Z

I used one of the function load of already imported ruamel.yaml which does not discard duplicate features so we can use it and rewritten the entire code.

The new code is so much cleaner! Great work.

Signed-off-by: vibhatsu <[email protected]>

…g of line numbers Signed-off-by: vibhatsu <[email protected]>

v1bh475u · 2025-01-29T17:49:29Z

Done with changes.

v1bh475u added 4 commits January 24, 2025 15:01

add lint for duplicate feature under a statement

921f303

add support for more scopes

cea29b4

fix format for duplicate feature lint

95ef4ee

fix false positives for duplicate features lint

053bb82

v1bh475u mentioned this pull request Jan 26, 2025

add lint to avoid duplicate features #2250

Open

remove unused code and comments

71e8cd0

Signed-off-by: vibhatsu <[email protected]>

williballenthin requested changes Jan 28, 2025

View reviewed changes

scripts/lint.py Outdated Show resolved Hide resolved

Merge branch 'mandiant:master' into feat/lint-duplicate-features

582296a

github-actions bot previously requested changes Jan 29, 2025

View reviewed changes

v1bh475u added 2 commits January 29, 2025 19:49

refactor duplicate feature lint to use yaml parser

2aa7fb4

Signed-off-by: vibhatsu <[email protected]>

update CHANGELOG

b8a7055

Signed-off-by: vibhatsu <[email protected]>

clarify for using rule definition

1bb930b

Signed-off-by: vibhatsu <[email protected]>

v1bh475u and others added 3 commits January 29, 2025 21:38

Merge branch 'master' into feat/lint-duplicate-features

0ad6795

update CHANGELOG

ba4bb96

Signed-off-by: vibhatsu <[email protected]>

refactor duplicate feature lint to improve key generation and trackin…

274fc3f

…g of line numbers Signed-off-by: vibhatsu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add lint for duplicate feature under a statement #2573

add lint for duplicate feature under a statement #2573

v1bh475u commented Jan 24, 2025 •

edited

Loading

williballenthin left a comment •

edited

Loading

github-actions bot left a comment

v1bh475u commented Jan 29, 2025

v1bh475u commented Jan 29, 2025

williballenthin commented Jan 29, 2025

williballenthin commented Jan 29, 2025

v1bh475u commented Jan 29, 2025

add lint for duplicate feature under a statement #2573

Are you sure you want to change the base?

add lint for duplicate feature under a statement #2573

Conversation

v1bh475u commented Jan 24, 2025 • edited Loading

Checklist

williballenthin left a comment • edited Loading

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

v1bh475u commented Jan 29, 2025

v1bh475u commented Jan 29, 2025

williballenthin commented Jan 29, 2025

williballenthin commented Jan 29, 2025

v1bh475u commented Jan 29, 2025

v1bh475u commented Jan 24, 2025 •

edited

Loading

williballenthin left a comment •

edited

Loading