AI Web Fetch Tools Silently Drop Content Inside Custom HTML Elements #3000

### Confirm this is an issue with the Python library and not an underlying OpenAI API

- [x] This is an issue with the Python library

### Describe the bug

<h1>Bug Report: AI Web Fetch Tools Silently Drop Content Inside Custom HTML Elements</h1>
<p><strong>Date:</strong> March 2026<br>
<strong>Tools affected:</strong> claude.ai <code>web_fetch</code>, ChatGPT web fetch<br>
<strong>Status:</strong> Confirmed across multiple AI platforms. Not reproducible in Claude Code's WebFetch tool, curl, or Grok.</p>
<hr>
<h2>Summary</h2>
<p>The web fetch tools in claude.ai and ChatGPT both fail to extract text content from pages that use custom HTML elements (web components). Both tools return only the contents of the document's <code>&lt;title&gt;</code> element and drop all other page content. The prerendered HTML is valid and fully accessible, and it is read correctly by curl, Grok, Google's crawler, and Claude Code's WebFetch. Because two separate AI platforms fail in the same way, the bug is likely rooted in a common underlying library or parsing approach.</p>

<hr>
<h2>Expected Behavior</h2>
<p>All text content nested inside custom elements should be extracted and returned. The page at https://mandmkelly.com uses a structure like:</p>
<pre><code class="language-html">&lt;app-shell&gt;
  &lt;site-header&gt;...&lt;/site-header&gt;
  &lt;page-hero&gt;
    &lt;h1&gt;Headline text&lt;/h1&gt;
    &lt;p&gt;Body copy&lt;/p&gt;
  &lt;/page-hero&gt;
&lt;/app-shell&gt;
</code></pre>
<p>Standard <code>&lt;h1&gt;</code>, <code>&lt;h2&gt;</code>, and <code>&lt;p&gt;</code> elements exist inside the custom elements. Any conformant HTML parser should walk the full DOM tree and extract their text content.</p>
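
<p>As a rough illustration of that expectation, the sketch below runs Python's standard-library <code>html.parser</code> over the structure above. The names <code>SAMPLE</code>, <code>TextCollector</code>, and <code>extract_text</code> are made up for this example and are not taken from either fetch tool. The only point is that a walk which ignores tag names reaches the text inside <code>&lt;app-shell&gt;</code> and <code>&lt;page-hero&gt;</code>.</p>

```python
from html.parser import HTMLParser

# The structure from this report, with the "..." placeholder kept as-is.
SAMPLE = """<app-shell>
  <site-header>...</site-header>
  <page-hero>
    <h1>Headline text</h1>
    <p>Body copy</p>
  </page-hero>
</app-shell>"""


class TextCollector(HTMLParser):
    """Collects every non-empty text run, whatever element it sits in."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # html.parser is a tokenizer: it reports text for known and
        # unknown tag names alike, so custom elements need no special case.
        text = data.strip()
        if text:
            self.chunks.append(text)


def extract_text(html):
    parser = TextCollector()
    parser.feed(html)
    parser.close()
    return parser.chunks


print(extract_text(SAMPLE))
```

<p>The heading and body copy both appear in the result, which is the behavior this report expects from the fetch tools.</p>
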
<hr>
<h2>Actual Behavior</h2>
<p>Both tools return only the contents of the <code>&lt;title&gt;</code> element and nothing else. This output is consistent with an HTML-to-markdown converter that skips the entire subtree when it encounters an unknown (custom) element tag.</p>
<hr>
<h2>Verification</h2>

Tool | Reads content correctly?
-- | --
`curl https://mandmkelly.com` | ✅ Returns 17KB of full HTML
Grok | ✅ Reads and summarizes the full page
Claude Code WebFetch | ✅ Reads and summarizes the full page
Google crawler | ✅ Indexes the page
claude.ai web_fetch | ❌ Returns only the `<title>` tag
ChatGPT web fetch | ❌ Returns only the `<title>` tag
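
<p>For anyone repeating this comparison, a small helper along the following lines can list which text from the raw HTML is absent from a tool's output. It is only a sketch: <code>VisibleText</code> and <code>missing_text</code> are hypothetical names, the raw HTML is assumed to have been saved separately (for example from the curl command above), and the substring match is deliberately crude.</p>

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collects text runs, ignoring <title>, <script>, and <style>."""

    IGNORED = {"title", "script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._ignored_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.IGNORED:
            self._ignored_depth += 1

    def handle_endtag(self, tag):
        if tag in self.IGNORED and self._ignored_depth:
            self._ignored_depth -= 1

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse runs of whitespace
        if text and not self._ignored_depth:
            self.chunks.append(text)


def missing_text(raw_html, tool_output):
    """Return text runs from raw_html that do not appear in tool_output."""
    parser = VisibleText()
    parser.feed(raw_html)
    parser.close()
    normalized = " ".join(tool_output.split())
    return [chunk for chunk in parser.chunks if chunk not in normalized]
```

<p>An empty list means every text run was found in the tool output; a list containing every heading and paragraph corresponds to the ❌ rows above.</p>
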


<hr>
<h2>Root Cause (likely)</h2>
<p>The HTML-to-markdown converter used in these fetch pipelines treats unknown element names as opaque blocks and skips their children rather than recursing into them.</p>
<p>The HTML Standard defines custom elements as valid, and its parsing algorithm inserts elements with unrecognized names into the DOM as ordinary elements whose children are parsed normally. The correct behavior is to recurse into their children exactly as a browser would. Skipping the entire subtree does not match that behavior, and it produces silent data loss with no error or warning to the user.</p>
<p>The fact that both claude.ai and ChatGPT exhibit identical behavior suggests a shared upstream dependency, possibly a common open-source HTML-to-markdown or HTML parsing library used by both platforms.</p>
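
<p>To make the hypothesis concrete, here is a minimal model of the suspected failure mode. It is a guess at the mechanism, not code from either platform: <code>KNOWN_TAGS</code>, <code>SkippingExtractor</code>, <code>skipping_extract</code>, and the <code>PAGE</code> sample (including its title text) are all invented for illustration.</p>

```python
from html.parser import HTMLParser

# Assumption: a small allowlist standing in for whatever tag table a
# real converter might use.
KNOWN_TAGS = {"html", "head", "body", "title", "h1", "h2", "p", "div", "span"}
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}  # never have children

PAGE = (
    "<html><head><title>Example page</title></head><body>"
    "<app-shell><page-hero><h1>Headline text</h1><p>Body copy</p>"
    "</page-hero></app-shell></body></html>"
)


class SkippingExtractor(HTMLParser):
    """Models the suspected bug: unknown elements hide their whole subtree."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.skip_depth = 0  # > 0 while inside an unknown element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.skip_depth or tag not in KNOWN_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self.skip_depth:
            self.chunks.append(text)


def skipping_extract(html):
    parser = SkippingExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


print(skipping_extract(PAGE))
```

<p>With <code>&lt;app-shell&gt;</code> wrapping the whole body, only the title text survives, which matches the symptom reported for both tools.</p>
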
<hr>
<h2>Suggested Fix</h2>
<p>Treat unrecognized element names as passthrough containers. Recurse into their children and extract text from any standard elements found within them. This matches browser behavior and the HTML parsing specification.</p>
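
<p>One possible shape for that fix, sketched with Python's standard-library <code>html.parser</code>. The converter here is hypothetical (<code>PassthroughConverter</code>, <code>to_markdown</code>, and the tag tables are not from any real library) and handles only flat headings and paragraphs, so a heading with inline children would be split across lines. What it shows is the rule itself: a tag with no special handling is a transparent wrapper, never a reason to drop text.</p>

```python
from html.parser import HTMLParser

HEADINGS = {"h1": "# ", "h2": "## ", "h3": "### "}
SKIPPED = {"script", "style"}  # text that is dropped on purpose


class PassthroughConverter(HTMLParser):
    """Tiny HTML-to-markdown sketch that recurses through unknown tags."""

    def __init__(self):
        super().__init__()
        self.lines = []
        self.prefix = ""
        self.hidden = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED:
            self.hidden += 1
        elif tag in HEADINGS:
            self.prefix = HEADINGS[tag]
        # Every other tag, including custom elements such as
        # <page-hero>, falls through: its children are still visited.

    def handle_endtag(self, tag):
        if tag in SKIPPED and self.hidden:
            self.hidden -= 1
        elif tag in HEADINGS:
            self.prefix = ""

    def handle_data(self, data):
        text = " ".join(data.split())
        if text and not self.hidden:
            self.lines.append(self.prefix + text)


def to_markdown(html):
    converter = PassthroughConverter()
    converter.feed(html)
    converter.close()
    return "\n\n".join(converter.lines)
```

<p>The design choice is to keep an explicit list of elements to drop and let everything else pass through, rather than an allowlist of elements to keep.</p>
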
<hr>
<h2>Impact</h2>
<p>Any site using web components, Angular, Lit, or other frameworks that place custom element names at the top level of the content hierarchy will be completely unreadable by both claude.ai and ChatGPT fetch tools, even when the content is fully prerendered and accessible to every other crawler and tool. Both platforms will misattribute the failure to the site rather than the tool, which compounds the confusion for users and may incorrectly signal to site owners that their implementation is broken when it is not.</p>

### To Reproduce

- Open claude.ai or chatgpt.com and start a new conversation
- Ask the assistant to fetch or review https://mandmkelly.com, or any other site that serves its content inside custom elements (web components)
- Observe that the response contains only the page title

### Code snippets

```Python

```

### OS

macOS

### Python version

Python 3.x

### Library version

openai v1.0.1
