Explore Creating a Bot Filter

The NOAA web sites have been inundated by malicious bots over the past few months. We are concerned that that our data will differ significantly from the DAP data as we do our best to filter out the bots (where possible). We are using the following criteria: 
<meta charset="utf-8"><b style="font-weight:normal;" id="docs-internal-guid-20430373-7fff-4bed-2e17-6c2a4045d3e8"><h3 dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:4pt;"><span style="font-size:13pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:700;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">Evaluation Risk Weighting</span></h3><p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:12pt;"><span style="font-size:11pt;font-family:Arial,sans-serif;color:#000000;background-color:transparent;font-weight:400;font-style:normal;font-variant:normal;text-decoration:none;vertical-align:baseline;white-space:pre;white-space:pre-wrap;">The tool calculates the probability of malicious activity based on five key technical dimensions.&nbsp;</span></p><div dir="ltr" style="margin-left:0pt;" align="left">
Dimension | Threshold | Details | Significance
-- | -- | -- | --
Language | Greater than 45% | Language is not English | Detection of non-standard or mismatching browser language settings.
Source | Greater than 40% | Source and medium are not available | Analysis of traffic origin and referral strings.
Country | Greater than 60% | Country is not United States | High-volume clusters from non-standard geographic regions.
Operating System | Greater than 10% | Outdated Operating System | Use of outdated or uncommon OS versions (e.g., Macintosh Intel 10.15).
Screen Resolution | Greater than 1% | Screen resolution is square | Detection of headless browser signatures (e.g., 1366x1366).

</div></b>

We were wondering if something like this could be implemented as an optional filter for DAP data - at least for NOAA? 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explore Creating a Bot Filter #28

Evaluation Risk Weighting

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Dimension	Threshold	Details	Significance
Language	Greater than 45%	Language is not English	Detection of non-standard or mismatching browser language settings.
Source	Greater than 40%	Source and medium are not available	Analysis of traffic origin and referral strings.
Country	Greater than 60%	Country is not United States	High-volume clusters from non-standard geographic regions.
Operating System	Greater than 10%	Outdated Operating System	Use of outdated or uncommon OS versions (e.g., Macintosh Intel 10.15).
Screen Resolution	Greater than 1%	Screen resolution is square	Detection of headless browser signatures (e.g., 1366x1366).

Explore Creating a Bot Filter #28

Description

Evaluation Risk Weighting

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions