[ContentUnderstanding] Add ContentRange samples for document, video, and audio. by changjian-wang · Pull Request #45679 · Azure/azure-sdk-for-python

changjian-wang · 2026-03-13T01:08:06Z

This pull request introduces a new ContentRange value type for specifying content ranges in analysis requests, and updates both the SDK and sample code to support and demonstrate its usage. The changes improve flexibility for users to restrict analysis to specific document pages or time ranges in audio/video content, and update documentation and sample code to showcase these features.

ContentRange value type introduction and integration

Added the new ContentRange class in models/_content_range.py, providing methods to construct ranges for document pages and audio/video time intervals, combine multiple ranges, and convert to string. This enables precise specification of content segments to analyze.
Integrated ContentRange into the SDK: updated begin_analyze_binary and its async variant to accept either a string or a ContentRange object for the content_range parameter, converting it to a string as needed. Documentation for these methods was updated to reflect the new parameter type and usage.
Exported ContentRange in models/_patch.py for public access.
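The "string or ContentRange" handling described above can be sketched as a small normalization step. This is only an illustration under assumptions: `RangeLike` and `normalize_content_range` are hypothetical names and are not part of the SDK; the only behavior taken from the description is that a typed range is converted to its string form before the request is sent.

```python
from typing import Optional, Union


class RangeLike:
    """Hypothetical stand-in for a typed range object (not the SDK class)."""

    def __init__(self, value: str) -> None:
        self._value = value

    def __str__(self) -> str:
        return self._value


def normalize_content_range(
    content_range: Union[str, RangeLike, None],
) -> Optional[str]:
    """Return the string form of a content range, or None if none was given."""
    if content_range is None:
        return None
    # str() is a no-op for plain strings and calls __str__ on typed ranges.
    return str(content_range)
```

With a step like this, callers that pass `"1-3"` and callers that pass a typed object end up sending the same value.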

Sample code updates

Updated sample_analyze_binary_async.py to demonstrate analyzing specific pages and combined page ranges using the new ContentRange class, including example output for these scenarios.
Updated sample_analyze_url_async.py to show how to restrict analysis to a single page using ContentRange, and changed the document URL to a more complex sample. Also imported timedelta for potential time range usage.

Asset metadata update

Updated the asset tag in assets.json to reflect the new version, ensuring asset tracking aligns with these changes.

- Implemented ContentRange functionality in sample scripts for analyzing binary documents and URLs.
- Added examples for analyzing specific pages and combined page ranges in `sample_analyze_binary.py`.
- Enhanced `sample_analyze_url.py` with ContentRange examples for documents, videos, and audio, including time-based ranges.
- Created unit tests for ContentRange functionality, covering various scenarios and edge cases.
- Updated existing tests to validate ContentRange behavior in document and media analysis.

Copilot

Pull request overview

This PR introduces a new ContentRange value type for the Azure AI Content Understanding SDK, enabling users to specify content ranges (document pages or audio/video time intervals) when analyzing content. The class provides factory methods for constructing ranges and is integrated into the begin_analyze_binary API.

Changes:

Added ContentRange class with factory methods (page, pages, pages_from, time_range, time_range_from, combine) and exported it in the models namespace.
Updated begin_analyze_binary (sync and async) to accept ContentRange objects in addition to raw strings for the content_range parameter.
Added comprehensive sample code and tests demonstrating ContentRange usage for document, video, and audio analysis scenarios.
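To make the factory methods listed above more concrete, here is a minimal sketch of what such a value type could look like. It is not the class from this PR: the name `ContentRangeSketch`, the validation rules, 1-based page numbers, and millisecond time offsets are all assumptions, chosen so that the output matches the string formats quoted later in this thread (`"1-3"`, `"9-"`, `"0-5000"`, `"1-3,5,9-"`).

```python
from datetime import timedelta


def _to_ms(delta: timedelta) -> int:
    """Convert a timedelta to whole milliseconds (assumed wire unit)."""
    return delta // timedelta(milliseconds=1)


class ContentRangeSketch:
    """Illustrative value type; method names mirror the summary, behavior is assumed."""

    def __init__(self, value: str) -> None:
        self._value = value

    @classmethod
    def page(cls, number: int) -> "ContentRangeSketch":
        if number < 1:
            raise ValueError("page numbers are assumed to start at 1")
        return cls(str(number))

    @classmethod
    def pages(cls, start: int, end: int) -> "ContentRangeSketch":
        if start < 1 or end < start:
            raise ValueError("expected 1 <= start <= end")
        return cls(f"{start}-{end}")

    @classmethod
    def pages_from(cls, start: int) -> "ContentRangeSketch":
        if start < 1:
            raise ValueError("page numbers are assumed to start at 1")
        return cls(f"{start}-")

    @classmethod
    def time_range(cls, start: timedelta, end: timedelta) -> "ContentRangeSketch":
        if end < start:
            raise ValueError("end must not precede start")
        return cls(f"{_to_ms(start)}-{_to_ms(end)}")

    @classmethod
    def time_range_from(cls, start: timedelta) -> "ContentRangeSketch":
        return cls(f"{_to_ms(start)}-")

    @classmethod
    def combine(cls, *ranges: "ContentRangeSketch") -> "ContentRangeSketch":
        # Combined ranges are assumed to be comma-separated.
        return cls(",".join(str(r) for r in ranges))

    def __str__(self) -> str:
        return self._value

    def __eq__(self, other: object) -> bool:
        return isinstance(other, ContentRangeSketch) and self._value == other._value

    def __hash__(self) -> int:
        return hash(self._value)


combined = ContentRangeSketch.combine(
    ContentRangeSketch.pages(1, 3),
    ContentRangeSketch.page(5),
    ContentRangeSketch.pages_from(9),
)
```

Here `str(combined)` yields the same combined form that the later comments pass as a raw string.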

Reviewed changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 4 comments.


| File | Description |
| --- | --- |
| `models/_content_range.py` | New `ContentRange` class with factory methods, equality, hashing, and string conversion |
| `models/_patch.py` | Exports `ContentRange` in `__all__` |
| `_patch.py` | Updates sync `begin_analyze_binary` to accept `ContentRange` and convert to string |
| `aio/_patch.py` | Updates async `begin_analyze_binary` to accept `ContentRange` and convert to string |
| `tests/test_content_range.py` | Unit tests for `ContentRange` construction, validation, equality, and integration with `AnalysisInput` |
| `tests/samples/test_sample_analyze_url.py` | Integration tests for ContentRange with document, video, and audio URL analysis |
| `tests/samples/test_sample_analyze_binary.py` | Integration tests for ContentRange with binary document analysis |
| `samples/sample_analyze_url.py` | Sync sample showing ContentRange usage for URL-based analysis |
| `samples/async_samples/sample_analyze_url_async.py` | Async sample showing ContentRange usage for URL-based analysis |
| `samples/sample_analyze_binary.py` | Sync sample showing ContentRange usage for binary analysis |
| `samples/async_samples/sample_analyze_binary_async.py` | Async sample showing ContentRange usage for async binary analysis |
| `assets.json` | Updated asset tag for new test recordings |

...ntentunderstanding/azure-ai-contentunderstanding/tests/samples/test_sample_analyze_binary.py

...tanding/azure-ai-contentunderstanding/azure/ai/contentunderstanding/models/_content_range.py


github-actions · 2026-03-16T10:59:49Z

API Change Check

APIView identified API level changes in this PR and created the following API reviews

azure-ai-contentunderstanding


changjian-wang · 2026-03-19T01:51:50Z

@copilot
Update — 2026-03-13
For Python, the samples should explicitly demonstrate passing the content range as a raw string directly (not via any helper), covering the same scenarios as the .NET/Java typed ContentRange samples:
"1-3" for a document page range
"9-" for all pages from page 9 onward
"0-5000" for an audio/video time window (milliseconds)
"1-3,5,9-" for combined ranges
This ensures Python/JS samples serve as a direct equivalent reference for the same functionality, just expressed as strings instead of typed factory methods.
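For readers unfamiliar with the raw string form, the following sketch breaks the listed examples into their segments. The parser is a hypothetical helper written for illustration, not something the SDK provides, and the grammar it accepts (comma-separated segments, each either `N`, `N-M`, or the open-ended `N-`) is inferred only from the examples above.

```python
from typing import List, Optional, Tuple

# (start, end) where end is None for an open-ended segment such as "9-".
Segment = Tuple[int, Optional[int]]


def parse_content_range(value: str) -> List[Segment]:
    """Split a raw content range string into (start, end) segments."""
    segments: List[Segment] = []
    for part in value.split(","):
        part = part.strip()
        if not part:
            raise ValueError(f"empty segment in {value!r}")
        if "-" not in part:
            # A bare number such as "5" selects a single page.
            number = int(part)
            segments.append((number, number))
            continue
        start_text, end_text = part.split("-", 1)
        start = int(start_text)
        end = int(end_text) if end_text else None
        if end is not None and end < start:
            raise ValueError(f"segment {part!r} ends before it starts")
        segments.append((start, end))
    return segments


doc_segments = parse_content_range("1-3,5,9-")
media_segments = parse_content_range("0-5000")
```

The same string is what a sample would pass as the content range value; whether the numbers mean pages or milliseconds depends on the kind of content being analyzed.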

Copilot · 2026-03-19T01:51:58Z

@changjian-wang I've opened a new pull request, #45788, to work on those changes. Once the pull request is ready, I'll request review from you.

…s for content range (#45788)

* Initial plan
* Update samples and tests to pass content range as plain strings directly
  Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>
* Remove ContentRange class and use plain strings for content range
  Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>
* Remove duplicate raw content range sample sections
  Now that ContentRange class is removed and all content ranges are plain strings, the [START *_with_raw_content_range] sections duplicated values already shown in the typed content range sections (e.g. "1-3,5,9-" appeared twice in binary samples, "0-5000" twice in video URL samples). Removed the redundant sections from all four sample files.
  Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

changjian-wang · 2026-03-19T07:21:13Z

@copilot check test_sample_analyze_binary.py and test_sample_analyze_url.py, we need more precise assertions.

Copilot · 2026-03-19T07:21:22Z

@changjian-wang I've opened a new pull request, #45791, to work on those changes. Once the pull request is ready, I'll request review from you.

changjian-wang · 2026-03-19T07:51:18Z

@copilot Audio analysis with a content range needs to be implemented and should follow these scenarios:

"0-5000" — first 5 seconds
"10000-" — from 10 seconds onward
"1200-3651" — sub-second precision
"0-3000,30000-" — combined time ranges
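One way a test could validate these time windows is to check that each returned segment's start offset falls inside the requested range. The helper below is a sketch under assumptions: `within_time_range` is a hypothetical name, offsets are taken to be milliseconds, and both bounds are treated as inclusive, which the thread does not confirm.

```python
def within_time_range(offset_ms: int, content_range: str) -> bool:
    """Return True if offset_ms lies in any segment of the raw range string."""
    for part in content_range.split(","):
        start_text, sep, end_text = part.strip().partition("-")
        start = int(start_text)
        if not sep:
            end = start  # bare value: a single point
        elif end_text:
            end = int(end_text)
        else:
            end = None  # open-ended, e.g. "10000-"
        # Bounds are assumed inclusive for this illustration.
        if offset_ms >= start and (end is None or offset_ms <= end):
            return True
    return False


checks = [
    within_time_range(2000, "0-5000"),
    within_time_range(9999, "10000-"),
    within_time_range(1200, "1200-3651"),
    within_time_range(15000, "0-3000,30000-"),
]
```

A test built on this idea would iterate over the returned audio or video segments and assert the check for each start time.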

Copilot · 2026-03-19T07:51:26Z

@changjian-wang I've opened a new pull request, #45792, to work on those changes. Once the pull request is ready, I'll request review from you.

…cenarios (#45792)

* Initial plan
* Update audio content range samples/tests to match video scenarios: 0-5000, 10000-, 1200-3651, 0-3000,30000-
  Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

changjian-wang · 2026-03-19T08:18:02Z

@copilot The assertion in test_sample_analyze_binary.py is not precise enough, as it does not verify the specific pages extracted. We should check the specific page numbers in the extracted results, and use an equality assertion to verify the page count.
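The kind of assertion requested here can be sketched as follows. The `Page` dataclass and its `page_number` field are assumptions made for illustration and do not claim to match the SDK's result model; the point is to compare both the count and the exact page numbers rather than only checking that some pages came back.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Page:
    """Hypothetical stand-in for a page entry in an analysis result."""

    page_number: int


def assert_pages(pages: List[Page], expected_numbers: List[int]) -> None:
    """Assert that exactly the expected pages were returned, in order."""
    actual = [page.page_number for page in pages]
    assert len(actual) == len(expected_numbers), (
        f"expected {len(expected_numbers)} pages, got {len(actual)}"
    )
    assert actual == expected_numbers, (
        f"expected pages {expected_numbers}, got {actual}"
    )


# Simulated result for a request that asked for pages "1-3".
result_pages = [Page(1), Page(2), Page(3)]
assert_pages(result_pages, [1, 2, 3])
```

Compared with a loose check such as "at least one page returned", this fails when the service returns the wrong pages or an unexpected number of them.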

Copilot · 2026-03-19T08:18:13Z

@changjian-wang I've opened a new pull request, #45793, to work on those changes. Once the pull request is ready, I'll request review from you.

…e tests (#45793)

* Initial plan
* Add precise page number assertions to test_sample_analyze_binary_with_content_range
  Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: changjian-wang <15209050+changjian-wang@users.noreply.github.com>

Changjian Wang added 2 commits March 12, 2026 18:36

Update Tag in assets.json to reflect latest version

4115398

changjian-wang requested review from bojunehsu and chienyuanchang as code owners March 13, 2026 01:08

Copilot AI review requested due to automatic review settings March 13, 2026 01:08

changjian-wang requested a review from yungshinlintw as a code owner March 13, 2026 01:08

github-actions bot added the Cognitive - Content Understanding label Mar 13, 2026

Copilot started reviewing on behalf of changjian-wang March 13, 2026 01:09

Copilot AI reviewed Mar 13, 2026


Changjian Wang added 2 commits March 16, 2026 18:25

Add raw ContentRange examples and update test data for document and media analysis

c6e8e98

Enhance ContentRange tests with additional assertions for time range validations in audio and video segments

88dbaed

Changjian Wang and others added 5 commits March 16, 2026 19:12

Add ContentRange sample demonstrations using a multi-page document

7767544

Potential fix for pull request finding

2f33db2

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Potential fix for pull request finding

4398bda

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Potential fix for pull request finding

2791424

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Potential fix for pull request finding

9b7d9b2

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI mentioned this pull request Mar 19, 2026

[ContentUnderstanding] Remove ContentRange class and use plain strings for content range #45788

Merged

Copilot AI mentioned this pull request Mar 19, 2026

[ContentUnderstanding] Tighten test assertions in test_sample_analyze_binary.py and test_sample_analyze_url.py #45791

Open

Copilot AI mentioned this pull request Mar 19, 2026

[ContentUnderstanding] Align audio content range samples with video scenarios #45792

Merged

Copilot AI mentioned this pull request Mar 19, 2026

[ContentUnderstanding] Precise page number assertions in content range tests #45793

Merged

changjian-wang wants to merge 12 commits into main from changjian-wang/add-contentrange-samples-doc-audio-video


3 participants
