LABSCon 2022 Techniques #24

moohax · 2022-12-31T08:06:50Z

New PR for CLA.

Changes:

tf -> tensorflow
np -> numpy

GrosQuildu

I added metadata to a few rules. All rules need similar fields. (We will add a CONTRIBUTING file soon with all requirements for new rules.)

Left a few comments - some rules can be merged; some have invalid tests.

All tests need improvements:

add # ruleid: <name of rule> where semgrep should find the issue
add # ok: <name of rule> where semgrep should not find an issue
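A minimal sketch of what an annotated test file might look like. The rule id `tensorflow-load-op-library` is assumed from the rule's file name, the function names and the `.so` path are hypothetical, and the `ok` case assumes the rule ignores hardcoded string paths (as a later comment in this thread describes). The imports sit inside the functions only so the file can be loaded without TensorFlow installed; semgrep scans the file and never calls the functions.

```python
# Semgrep test file sketch: annotations go on the line directly above the call.


def load_user_supplied_library(path):
    import tensorflow  # lazy import so the module loads without TensorFlow

    # ruleid: tensorflow-load-op-library
    return tensorflow.load_op_library(path)


def load_bundled_library():
    import tensorflow

    # Assumption: a hardcoded literal path is not reported by the rule.
    # ok: tensorflow-load-op-library
    return tensorflow.load_op_library("./ops/bundled_op.so")
```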

I am a bit worried about the number of false positives. Is using these methods always insecure, or can we filter out some common false positives (e.g., calls with hardcoded paths)?

If possible, we can enhance the messages with guidance on how to fix the issues, e.g., by proposing more secure alternatives or input sanitization.

Rules like this can be simplified by replacing:

patterns:
  - pattern-either:
      - pattern: |
          tensorflow.load_op_library($PATH)

with

patterns:
  - pattern: |
    tensorflow.load_op_library($PATH)

Once everything is fixed, run semgrep --test ./
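Before running the test command, it may help to confirm that every test file carries at least one annotation. The following is a rough sketch of such a pre-check, not part of semgrep itself; `count_annotations`, `find_unannotated`, and the regular expression are hypothetical helpers written for illustration.

```python
import re
from pathlib import Path

# Matches test annotations such as "# ruleid: my-rule" or "# ok: my-rule".
ANNOTATION = re.compile(r"#\s*(ruleid|ok):\s*\S+")


def count_annotations(source: str) -> dict:
    """Count ruleid/ok annotations in the text of one test file."""
    counts = {"ruleid": 0, "ok": 0}
    for match in ANNOTATION.finditer(source):
        counts[match.group(1)] += 1
    return counts


def find_unannotated(root: str) -> list:
    """Return Python test files under root that have no ruleid annotation."""
    missing = []
    for path in sorted(Path(root).rglob("*.py")):
        text = path.read_text(encoding="utf-8")
        if count_annotations(text)["ruleid"] == 0:
            missing.append(str(path))
    return missing
```

Calling `find_unannotated("python")` from the repository root would list the test files that still need a `# ruleid:` line, assuming the tests live under `python/` as the paths in this review suggest.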

python/numpy-distutils.py

python/onnx-convert-ort.py

python/onnx-session-options.yml

python/tensorflow-load-op-library.py

python/tensorflow-load-op-library.yml

python/numpy-f2py-compile.yml

python/numpy-in-pytorch-modules.py

Co-authored-by: Paweł Płatek <e2.8a.95@gmail.com>

GrosQuildu · 2023-01-26T12:36:39Z

To limit false positives, I removed detection of hardcoded strings. This is in line with similar official semgrep rules.

Removed:

onnx-convert-ort - semgrep doesn't support Bash yet
pandas-read-* - reading JSON/CSV is not inherently insecure; it depends on where the data comes from. More advanced rules can track data flow and determine whether reading a specific file is safe. Consider extending such a rule with pandas. I guess CodeQL should be a good match here.
pickle-load - already exists as r/python.lang.security.deserialization.pickle.avoid-pickle
tensorflow-load-op-library - merged with tensorflow-load-op-library

GrosQuildu · 2023-01-27T09:19:00Z

Thanks a lot @moohax! Finally merged the rules.

LABSCon 2022 Techniques

d2acf45

GrosQuildu self-requested a review January 3, 2023 10:29

fix metadata

1c21524

GrosQuildu requested changes Jan 3, 2023

moohax commented Jan 4, 2023

python/numpy-f2py-compile.yml (outdated, resolved)

moohax commented Jan 4, 2023

python/numpy-in-pytorch-modules.py (resolved)

moohax and others added 2 commits January 5, 2023 22:47

Update python/tensorflow-load-op-library.py

da9dcc8

Co-authored-by: Paweł Płatek <e2.8a.95@gmail.com>

[python/ml/moohax] remove wrong rules; fix other

f3fbb4f

GrosQuildu approved these changes Jan 26, 2023

GrosQuildu merged commit 0d7345f into trailofbits:main Jan 26, 2023
