Release v10.17.0 -- Write products to user buckets by jhkennedy · Pull Request #3106 · ASFHyP3/hyp3

jhkennedy · 2026-05-20T23:42:44Z

This release will allow HyP3 to publish products straight to end-user managed S3 buckets, once they apply an appropriate bucket policy.

First, a user would hit the new /bucket-policy/{bucket_name} endpoint to get a bucket policy for their bucket -- let's use jhk-lavas-test for example:
https://hyp3-test-api.asf.alaska.edu/bucket-policy/jhk-lavas-test

And then apply it to their bucket:

which will then allow them to provide the bucket and bucket_prefix parameters to any job they would like to run and the output products will show up in the jhk-lavas-test bucket. See all the examples 👇 .

Importantly, this requires no downstream changes to the plugins! All plugins already are required to accept a bucket and bucket_prefix argument; we just hid this from users as they are fixed to the HyP3 content bucket and job ID, respectively.

Example jobs

First, lets start with a basic INSAR_ISCE_BURST job, to confirm existing behavior:

{
  "validate_only": false,
  "jobs": [
    {
      "job_type": "INSAR_ISCE_BURST",
      "name": "test-user-bucket",
      "job_parameters": {
        "granules": [
          "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
          "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
        ],
        "apply_water_mask": false,
        "looks": "20x4"
      }
    }
  ]
}

Submitting that job results in:
https://hyp3-test-api.asf.alaska.edu/jobs/8a731ede-7468-45c8-b22b-17d1d87dd577

{
  "status_code": "PENDING",
  "user_id": "jhkennedy",
  "credit_cost": 1,
  "job_parameters": {
    "apply_water_mask": false,
    "looks": "20x4",
    "granules": [
      "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
      "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
    ]
  },
  "priority": 8000,
  "bucket": "hyp3-edc-uat-contentbucket-1cesvjwsurrfn",
  "bucket_prefix": "8a731ede-7468-45c8-b22b-17d1d87dd577",
  "job_id": "8a731ede-7468-45c8-b22b-17d1d87dd577",
  "execution_started": false,
  "job_type": "INSAR_ISCE_BURST",
  "name": "test-user-bucket",
  "request_time": "2026-05-21T03:47:44+00:00"
}

Note how the job response now has bucket, which is the HyP3 content bucket for this deployment, and bucket-prefix, which is the job ID, maintaining the existing behavior for HyP3. Also, when this job succeeds (and it did!) the download URLs for products and images will before for the cloudfront distribution, since this is an EDC deployment.

Now, if we add a bucket parameter to the job, products will be written to the jhk-lavas-test bucket. After submitting this:

{
  "validate_only": false,
  "jobs": [
    {
      "job_type": "INSAR_ISCE_BURST",
      "name": "test-user-bucket",
      "bucket": "jhk-lavas-test",
      "job_parameters": {
        "granules": [
          "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
          "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
        ],
        "apply_water_mask": false,
        "looks": "20x4"
      }
    }
  ]
}

We'll get back:
https://hyp3-test-api.asf.alaska.edu/jobs/347a0c73-1604-4510-93e5-5520f31df67d

{
  "status_code": "PENDING",
  "user_id": "jhkennedy",
  "credit_cost": 1,
  "job_parameters": {
    "apply_water_mask": false,
    "looks": "20x4",
    "granules": [
      "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
      "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
    ]
  },
  "priority": 8000,
  "bucket": "jhk-lavas-test",
  "bucket_prefix": "347a0c73-1604-4510-93e5-5520f31df67d",
  "job_id": "347a0c73-1604-4510-93e5-5520f31df67d",
  "execution_started": false,
  "job_type": "INSAR_ISCE_BURST",
  "name": "test-user-bucket",
  "request_time": "2026-05-21T03:47:44+00:00"
}

Note: the bucket_prefix is still the job ID but now bucket is what I specified. Importantly, when this job succeeds (and it did!) it will have S3 download URLs since this is a non-hyp3 bucket and not part of our cloudfront distribution -- these URLs will work if the bucket is public and return a 403 Access Denied response if not.

I can also specify bucket_prefix, but this parameter has some expansions that are possible:

If {job_id} is in the string, it will expand to the job ID
If {name} is in the string, it will expand to the job name

So, for example I could do:

{
  "validate_only": false,
  "jobs": [
    {
      "job_type": "INSAR_ISCE_BURST",
      "name": "test-user-bucket",
      "bucket": "jhk-lavas-test",
      "bucket_prefix": "HyP3/{name}/{job_id}",
      "job_parameters": {
        "granules": [
          "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
          "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
        ],
        "apply_water_mask": false,
        "looks": "20x4"
      }
    }
  ]
}

We'll get back:
https://hyp3-test-api.asf.alaska.edu/jobs/347a0c73-1604-4510-93e5-5520f31df67d

{
  "job_id": "7fc85d8d-088b-439b-977e-9e0e148b8337",
  "user_id": "jhkennedy",
  "status_code": "PENDING",
  "execution_started": false,
  "request_time": "2026-05-21T04:04:05+00:00",
  "priority": 7998,
  "job_type": "INSAR_ISCE_BURST",
  "name": "test-user-bucket",
  "bucket": "jhk-lavas-test",
  "bucket_prefix": "HyP3/test-user-bucket/7fc85d8d-088b-439b-977e-9e0e148b8337",
  "job_parameters": {
    "apply_water_mask": false,
    "looks": "20x4",
    "granules": [
      "S1_136231_IW2_20200604T022312_VV_7C85-BURST",
      "S1_136231_IW2_20200616T022313_VV_5D11-BURST"
    ]
  },
  "credit_cost": 1
}

Note that the expansion happens immediately in the API so the value placed in the DynamoDB is the expanded value.

We do not provide expansion for the bucket. Fortunately, if I try and provide any expansion for the bucket name (e.g., "bucket": "{name},") or a bad expansion in the prefix (e.g,., "bucket_prefix": "{bad_expansion}",) the job will be rejected by the API as { and } not allowable characters for an S3 key. This is enabled by the the OpenAPI spec pattern provided for these parameters -- validation happens after expansion .

And, to see how a failure works, we can submit this AUTORIFT job we know will fail:

{
  "validate_only": false,
  "jobs": [
    {
      "job_type": "AUTORIFT",
      "name": "test-user-bucket",
      "bucket": "jhk-lavas-test",
      "bucket_prefix": "{job_id}",
      "job_parameters": {
        "granules": [
          "S2B_MSIL1C_20260415T005509_N0512_R059_T51DWE_20260415T031050",
          "S2B_MSIL1C_20251116T005509_N0511_R059_T51DWE_20251116T020121"
        ]
      }
    }
  ]
}

and when it fails, the job response will look like:

{
  "browse_images": [],
  "bucket": "jhk-lavas-test",
  "job_parameters": {
    "granules": [
      "S2B_MSIL1C_20260415T005509_N0512_R059_T51DWE_20260415T031050",
      "S2B_MSIL1C_20251116T005509_N0511_R059_T51DWE_20251116T020121"
    ]
  },
  "processing_times": null,
  "thumbnail_images": [],
  "request_time": "2026-05-21T04:30:04+00:00",
  "execution_started": true,
  "bucket_prefix": "0f8d12bf-4834-4001-971e-d8927dae8443",
  "job_type": "AUTORIFT",
  "files": [],
  "status_code": "FAILED",
  "user_id": "jhkennedy",
  "expiration_time": "2120-10-21T00:00:00+00:00",
  "credit_cost": 25,
  "priority": 7997,
  "logs": [
    "https://jhk-lavas-test.s3.us-west-2.amazonaws.com/0f8d12bf-4834-4001-971e-d8927dae8443/0f8d12bf-4834-4001-971e-d8927dae8443.log"
  ],
  "name": "test-user-bucket",
  "job_id": "0f8d12bf-4834-4001-971e-d8927dae8443"
},

Note how the log files are also written to the user-bucket.

Potential concerns

With this implementation, there are a few concerns that are worth mentioning.

The bucket policy grants permissions the entire HyP3 AWS account. That means potentially anyone with access to the HyP3 account can use those permissions.
1. A HyP3 developer could list, get, and put items in user buckets themselves.
2. Users could their products in another users bucket if they knew the other users' bucket name.
The bucket policy grants permissions to the entire bucket.
The best/only way to check that permissions are set up correctly is to run a job and see it succeed. If get-files or upload-log don't have the right permissions, you'll end up with a failed job that has no reported files and no logs (or even log key in the job dict).
We include the download URL for the products browse/thumbnail images, but they won't work when products are placed in an end-user bucket.
Without updates to the SDK, if you want to submit jobs with the bucket/bucket-prefix parameters, you'll need to use build the job dictionaries and use the submit_prepared_jobs method in the SDK or the API directly.

Fist, I think (4) is largely fine since we also provided the S3 info for products in the job response and clients like vertex should be able to gracefully handle an access denied (403) response and will likely just not show the images.

As for the rest, I am planning on effectively soft-launching this feature and using for internal projects (e.g., VolcSARvatory, LAVAs, AK FIRE SAFE) for a period since none of these concerns apply to these projects -- we all have admin access to all the AWS accounts involved and we strictly control EDL accounts with access to the HyP3 deployments. For (5) specifically, we're already largely building job-dictionaries as the custom job types aren't available in HyP3 Basic or HyP3+ yet.

Before make it generally available (e.g., add it to hyp3-docs and update the SDK), we'll wan to address some/most/all of those concerns, so I've opened a number of follow on issues 👇 with more details for these concerns -- please discuss concerns in those issues, if appropriate and possible.

Follow on issues

…gs in step function

Updates to user defined buckets

Co-authored-by: Joseph H Kennedy <me@jhkennedy.org>

User Defined Buckets

github-actions · 2026-05-20T23:42:55Z

jhkennedy · 2026-05-21T04:53:09Z

If the step function code has changed, have you drained the job queue before merging?

Caution

The Step Function code has changed! The job queue should be drained before merging!

Someone from @ASFHyP3/tools will have to do this and then merge the PR since there is no review-gate for EDC deployments.

AndrewPlayer3 added 30 commits May 13, 2026 13:30

add api route for retrieving a bucket policy

32ea660

formatting

3eba802

add boto3 req

c9656f9

add bucket and bucket prefix from api ref

9795e68

pass content bucket to api

a7bf25d

add user provided publish bucket option for all jobs

2d396a2

add bucket and bucket_prefix to batch params

e223823

bucket read permissions

b49e150

cleaner get_files handler

53f77c3

cleaner handler for content bucket

c8c14ab

changed defaults for bucket and prefix

f8ae4cf

add return type for get_current_account_arn

5175a22

add return type for get_bucket_policy

86dfb41

ruff

c0d180d

add return type for _handle_content_bucket

a534722

mypy

de28431

better regex for bucket and bucket prefix

34be2fd

fixed regex escape characters

4c39afb

use enumerate rather than range

1268937

move bucket handling to dynamo, handle nulls, and add env var for tests

18dd47d

update tests for bucket and bucket_prefix handling

800c72c

update test_put_jobs

2f6e8d9

cleaner _handle_content_bucket and fixed test

db70009

add error for attempting to use custom prefix with default bucket

fa0b2ae

add ref to content bucket for api

991eea4

fixed test_put_jobs credit count

7e03b40

updated changelog

90d2e9c

add patch for dynamo.jobs.get_jobs

639e99b

revert --bucket-prefix to --bucket_prefix

425a610

remove todo

e0a6f4f

jhkennedy and others added 23 commits May 13, 2026 19:26

add pydantic to fix pip errors

2b62ad2

try removing openapi decorator

9fc2fa5

fix get_caller_identity

2eafd48

Return dictionaries from bucket-policy handler instead of a big string

bfd2d93

Simplify bucket-policy response

c93f210

Add dynamodb:GetItem permissions to get-files

0042abd

put the logs in the user bucket as well

485d413

actually, use event context instead of dynamo.jobs.get_job for get-files

0e7f7b4

add bucket and bucket_prefix as parameters to get-files and upload-lo…

f0aa3ef

…gs in step function

tweak bucket-policy

d6d412b

Fix get-files: exiration time optional and fix distribution url

84ad852

ruff ruff

d931c74

update get-files and upload-logs tests

ac1756d

remove redundant parameter from upload-logs step in step function

4648b8b

remove prefix == job_id assumption in upload logs

084e04b

Add OpenAPI spec for bucket-policy

73624e8

Add OpenAPI spec for bucket-policy GET endpoint

521887a

Add note to handlers to also update openapi spec

f1bf40b

Fix bucket + bucket_prefix in some job specs

6b015b5

Merge pull request #3102 from ASFHyP3/user-bucket-permissions

1142386

Updates to user defined buckets

Merge branch 'develop' into user-defined-buckets

4919ea6

Drop publish_bucket FIXME reminder comments

9201162

Co-authored-by: Joseph H Kennedy <me@jhkennedy.org>

Merge pull request #3061 from ASFHyP3/user-defined-buckets

6d98205

User Defined Buckets

jhkennedy added the minor Bump the minor version number of this project label May 20, 2026

jhkennedy marked this pull request as ready for review May 21, 2026 04:39

jhkennedy requested review from a team as code owners May 21, 2026 04:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v10.17.0 -- Write products to user buckets#3106

Release v10.17.0 -- Write products to user buckets#3106
jhkennedy wants to merge 56 commits into
mainfrom
develop

jhkennedy commented May 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 20, 2026 •

edited by jhkennedy

Loading

Uh oh!

jhkennedy commented May 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jhkennedy commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Example jobs

Potential concerns

Follow on issues

Uh oh!

github-actions Bot commented May 20, 2026 • edited by jhkennedy Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Developer checklist

Reviewer checklist

Uh oh!

jhkennedy commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jhkennedy commented May 20, 2026 •

edited

Loading

github-actions Bot commented May 20, 2026 •

edited by jhkennedy

Loading

jhkennedy commented May 21, 2026 •

edited

Loading