Shell injection detection operator #308

Anilm3 · 2024-06-14T16:03:50Z

This PR introduces a new operator, shi_detector, which detects shell injections given a set of request parameters and a shell command. The heuristic primarily looks for injected executables and redirections, with most of the work done by the tokenizer.

Here the tokenizer does more than find simple tokens: it tries to minimise the number of tokens needed while keeping enough context to track nested commands and their boundaries. It currently consists of two phases: the first generates low- and high-level tokens, while the second finds executables and strips whitespace.
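To make the idea concrete, here is a minimal sketch of the heuristic described above, not the code in this PR. It assumes a heavily simplified shell grammar (no quoting, expansions or subshells), and the names `token_type`, `token`, `tokenize` and `is_injection` are hypothetical. It folds both phases into one pass: words, control operators and redirections are split out, and the first word of each command is marked as an executable. A parameter is then flagged if it fully covers an executable or a redirection token in the command.

```cpp
#include <cstddef>
#include <string_view>
#include <vector>

// Hypothetical, simplified token model (assumption, not the PR's types).
enum class token_type { executable, argument, control, redirection };

struct token {
    token_type type;
    std::size_t index;    // offset of the token within the command
    std::string_view str; // view over the token text
};

// Characters that end a word in this simplified grammar.
inline bool is_word_char(char c)
{
    return std::string_view{" \t;|&<>"}.find(c) == std::string_view::npos;
}

// Splits the command into tokens and skips whitespace. The first word after
// the start of the command or after a control operator is an executable.
inline std::vector<token> tokenize(std::string_view cmd)
{
    std::vector<token> tokens;
    bool expect_executable = true;
    std::size_t i = 0;
    while (i < cmd.size()) {
        const char c = cmd[i];
        if (c == ' ' || c == '\t') {
            ++i;
        } else if (c == ';' || c == '|' || c == '&') {
            tokens.push_back({token_type::control, i, cmd.substr(i, 1)});
            expect_executable = true;
            ++i;
        } else if (c == '<' || c == '>') {
            tokens.push_back({token_type::redirection, i, cmd.substr(i, 1)});
            ++i;
        } else {
            const std::size_t start = i;
            while (i < cmd.size() && is_word_char(cmd[i])) { ++i; }
            const auto type =
                expect_executable ? token_type::executable : token_type::argument;
            tokens.push_back({type, start, cmd.substr(start, i - start)});
            expect_executable = false;
        }
    }
    return tokens;
}

// Returns true if any occurrence of param in cmd fully contains an
// executable or a redirection token.
inline bool is_injection(std::string_view cmd, std::string_view param)
{
    if (param.empty()) { return false; }
    const auto tokens = tokenize(cmd);
    for (auto pos = cmd.find(param); pos != std::string_view::npos;
         pos = cmd.find(param, pos + 1)) {
        const std::size_t end = pos + param.size();
        for (const auto &t : tokens) {
            const bool inside = t.index >= pos && t.index + t.str.size() <= end;
            if (inside && (t.type == token_type::executable ||
                              t.type == token_type::redirection)) {
                return true;
            }
        }
    }
    return false;
}
```

With this sketch, a parameter such as `; cat /etc/passwd` inside `ls -l; cat /etc/passwd` is flagged because it covers the executable `cat`, whereas `/tmp` inside `ls -l /tmp` only covers an argument and is not.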

Some work is left for a subsequent PR:

Supporting a shell command as an array rather than as a string
Expanding the pseudo-tokenizer
Expanding the list of forbidden injected tokens
Benchmark scenario

To use this new operator, a rule such as the following could be used:

  - id: rsp-930-004
    name: SHi Exploit detection
    tags:
      type: shi
      category: exploit_detection
      module: rasp
    conditions:
      - parameters:
          resource:
            - address: server.sys.shell.cmd
          params:
            - address: server.request.query
            - address: server.request.body
            - address: server.request.path_params
            - address: grpc.server.request.message
            - address: graphql.server.all_resolvers
            - address: graphql.server.resolver
        operator: shi_detector

Related Jira: APPSEC-52939

codecov-commenter · 2024-06-14T16:11:17Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 84.28%. Comparing base (38a4b0e) to head (5985929).

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #308      +/-   ##
==========================================
+ Coverage   83.68%   84.28%   +0.60%     
==========================================
  Files         137      141       +4     
  Lines        6086     6554     +468     
  Branches     2882     3023     +141     
==========================================
+ Hits         5093     5524     +431     
- Misses        369      374       +5     
- Partials      624      656      +32

Flag	Coverage Δ
waf_test	`84.28% <ø> (+0.60%)`	⬆️


pr-commenter · 2024-06-14T16:52:12Z

Benchmarks

Benchmark execution time: 2024-07-15 12:35:05

Comparing candidate commit 5985929 in PR branch anilm3/shell-injection with baseline commit 38a4b0e in branch master.

Found 5 performance improvements and 3 performance regressions! Performance is the same for 11 metrics, 0 unstable metrics.

scenario:ip_match_matcher.random

🟩 execution_time [-80.208µs; -76.761µs] or [-4.752%; -4.548%]

scenario:is_xss_matcher.random

🟩 execution_time [-3.867ms; -3.849ms] or [-4.562%; -4.541%]

scenario:phrase_match_matcher.enforce_word_boundary.random

🟩 execution_time [-3.057ms; -3.052ms] or [-29.964%; -29.913%]

scenario:regex_match_matcher.case_insensitive_flag.random

🟥 execution_time [+749.893µs; +754.775µs] or [+14.768%; +14.864%]

scenario:regex_match_matcher.case_insensitive_option.random

🟥 execution_time [+745.048µs; +750.366µs] or [+14.655%; +14.759%]

scenario:regex_match_matcher.lowercase_transformer.random

🟥 execution_time [+692.932µs; +697.938µs] or [+10.586%; +10.663%]

scenario:regex_match_matcher.random

🟩 execution_time [-119.122µs; -114.641µs] or [-4.966%; -4.779%]

scenario:string_equals_matcher.random

🟩 execution_time [-73.449µs; -69.163µs] or [-4.760%; -4.482%]

cataphract · 2024-07-08T15:12:52Z

src/tokenizer/base.hpp

+
+namespace ddwaf {
+
+template <typename T> struct base_token {


It would make the code more readable if you had start(), end(), size(). In particular end(), because token.index + token.str.size() is a bigger expression. Or just an interval() method, as mentioned.

Anilm3 · 2024-07-12

Since this comment and the one below affect other tokenizers as well, I'll try to address them together in a separate PR.
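For illustration, the accessors suggested in this thread could look roughly like the sketch below. Only `index` and `str` are visible in the diff; the `type` member and the exact member layout are assumptions, and `end()` is taken to be one past the last character of the token.

```cpp
#include <cstddef>
#include <string_view>

namespace sketch {

// Hypothetical variant of base_token with the suggested accessors.
template <typename T> struct base_token {
    T type{};             // assumed member, not shown in the diff
    std::string_view str; // token text
    std::size_t index{};  // offset of the token in the tokenized string

    [[nodiscard]] constexpr std::size_t start() const noexcept { return index; }
    [[nodiscard]] constexpr std::size_t size() const noexcept { return str.size(); }
    // One past the last character, i.e. index + str.size().
    [[nodiscard]] constexpr std::size_t end() const noexcept
    {
        return index + str.size();
    }
};

} // namespace sketch
```

With such accessors, `token.index + token.str.size()` at a call site would shorten to `token.end()`.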

cataphract · 2024-07-08T15:27:06Z

src/condition/shi_detector.cpp

+        std::size_t i = 0;
+        for (; i < resource_tokens.size(); ++i) {
+            const auto &token = resource_tokens[i];
+            if (end_index >= token.index && param_index < (token.index + token.str.size())) {


This would be easier to understand if you had an interval abstraction with contains, overlaps, ...

Anilm3 · 2024-07-12

Since this comment and the one above affect other tokenizers as well, I'll try to address them together in a separate PR.
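As a rough idea of what the interval abstraction suggested in this thread might look like, here is a minimal sketch. The `interval` type and its half-open `[begin, end)` convention are assumptions for illustration and are not part of this PR.

```cpp
#include <cstddef>

// Hypothetical half-open interval [begin, end) over string offsets.
struct interval {
    std::size_t begin; // first offset covered
    std::size_t end;   // one past the last offset covered

    // True if the offset lies within the interval.
    [[nodiscard]] constexpr bool contains(std::size_t pos) const noexcept
    {
        return pos >= begin && pos < end;
    }

    // True if the other interval lies entirely within this one.
    [[nodiscard]] constexpr bool contains(const interval &other) const noexcept
    {
        return other.begin >= begin && other.end <= end;
    }

    // True if both intervals share at least one offset.
    [[nodiscard]] constexpr bool overlaps(const interval &other) const noexcept
    {
        return begin < other.end && other.begin < end;
    }
};
```

If `end_index` in the quoted condition is the inclusive offset of the last parameter character (an assumption, since the diff does not show its definition), the check would correspond to `interval{param_index, end_index + 1}.overlaps(interval{token.index, token.index + token.str.size()})`.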

Anilm3 added 8 commits June 11, 2024 21:28

Shell injection detection operator

56c63cf

Small changes

2149387

Add support for more token types, add tests, format

21ce3e7

More tests and parser

e11aa53

Fixes and tests

0ce6c9c

Fixes to resolve executables

fd4b437

Add fuzzer, fixes and more tests

0b8f5f8

Format

56b13c6

Anilm3 added 8 commits June 14, 2024 22:54

Tests and fixes

4ca4f61

Add more tests

b9c8b02

Expand the definition of executable and flatten double quoted strings without interesting tokens

31fc3c2

Minor changes

4ac722d

Merge branch 'master' into anilm3/shell-injection

059ea13

Merge branch 'master' into anilm3/shell-injection

df8295b

Add whitespace token and fix cases in which the space has syntactical value

690c152

Split tokenizer in two phases to better identify executables

21e7b7a

Anilm3 marked this pull request as ready for review June 20, 2024 15:56

Anilm3 requested a review from a team as a code owner June 20, 2024 15:56

Anilm3 added 10 commits June 20, 2024 22:23

Add more tests and fixes

a419e5d

Add corpus files and remove stray cout

1bea7f4

Merge branch 'master' into anilm3/shell-injection

667d786

shi_detector fuzzer and workflow

2411381

Lint

0509045

Tests

44754cf

Merge branch 'master' into anilm3/shell-injection

333e4dc

Merge branch 'master' into anilm3/shell-injection

09aa393

Fix build after merge

4b5b21f

Merge branch 'master' into anilm3/shell-injection

e175364

cataphract reviewed Jul 9, 2024

Anilm3 added 15 commits July 9, 2024 21:48

Minor changes

bbe32ba

Support multidigit file descriptors on redirections

3c3f8e9

Minor changes

34b5a44

Improve support for arithmetic expressions and refactor some code

cd9fb3a

Merge branch 'master' into anilm3/shell-injection

3ed2e83

Support expansions within arrays

0fc7d31

Add one more missing arithmetic expansion case

8ed8906

Improve support for subshell and compound commands

07ee3a6

Refactor expandable scope processing, remove literals

1aaf52f

Remove literals and dead code

e1679a6

Add more tests, improve handling of arrays and file redirections

82d65f8

Fixes and more tests

f7d8b8d

Remove dead code

43a31bb

Remove more dead code and add more tests

b5bc028

Support new line as end of compound command sequence

5985929

cataphract approved these changes Jul 15, 2024

Anilm3 merged commit 09de0e0 into master Jul 15, 2024
46 checks passed

Anilm3 deleted the anilm3/shell-injection branch July 15, 2024 13:13
