-
Notifications
You must be signed in to change notification settings - Fork 106
Update EmTest #1463
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Update EmTest #1463
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -1563,7 +1563,7 @@ class EMConfig { | |
| @PercentageAsProbability(false) | ||
| @Cfg("If using THRESHOLD for AI Classification Repair, specify its value." + | ||
| " All classifications with probability equal or above such threshold value will be accepted.") | ||
| var classificationRepairThreshold = 0.8 | ||
| var classificationRepairThreshold = 0.5 | ||
|
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. are these changes based on latest experiments?
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. ah... i see you wrote it in the description of this PR... :) |
||
|
|
||
| @Experimental | ||
| @Cfg("Specify how the classification of actions's response will be used to execute a possible repair on the action.") | ||
|
|
@@ -1602,7 +1602,7 @@ class EMConfig { | |
| @Experimental | ||
| @Cfg("Minimum confidence threshold required for the AI response classifier to decide" + | ||
| "whether to send a request as-is or attempt a repair.") | ||
| var aIResponseClassifierWeaknessThreshold = 0.4 | ||
| var aIResponseClassifierWeaknessThreshold = 0.8 | ||
|
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. are these changes based on latest experiments? |
||
|
|
||
| @Cfg("Output a JSON file representing statistics of the fuzzing session, written in the WFC Report format." + | ||
| " This also includes a index.html web application to visualize such data.") | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i m unsure about this... we will need to discuss. for example, if the model is always weak, would it mean this test will always pass? that would be against the point of having a E2E. or is guassian not able to reliably solve these simples APIs in these E2Es?