[WIP] v2.x - Nokogiri Refactor Part 6 - Conversion of SignedDocument and final removal of REXML 🎉 #759

johnnyshields · 2025-03-16T15:11:55Z

This PR resolves the "Parser Differential" exploit vector mentioned below, by migrating to 100% Nokogiri.
https://github.blog/security/sign-in-as-anyone-bypassing-saml-sso-authentication-with-parser-differentials/

… Nokogiri.

…efactor' into signed-document-info-refactor

…nt-info-refactor

pitbulk · 2025-03-20T11:58:04Z

@ahacker1-securesaml, @p- can you help me review this PR? Thanks

ahacker1-securesaml · 2025-03-20T12:03:56Z

If we are refactoring, I just suggest we remove the Response/SignedDocument class, and make it into a function instead. This OOP style is extremely hard to audit.

p- · 2025-03-20T12:18:34Z

@pitbulk Yes I will help 👍 But I think I can't start before next week.

johnnyshields · 2025-03-20T13:13:38Z

Hi, yes I agree I am intending to make the code more functional. This PR was intended firstly to get all tests green on Nokogiri while changing relatively little.

I am thinking something like separating Response class into:

MessageParser base class (shared between Response, LogoutResponse, and SloLogoutRequest
Response (Parser)
Assertion -- immutable object representing the Assertion portion of the XML, after it has been parsed -- still thinking this over...
ResponseValidator -- functional module which handles the validation aspects.

I need to think a bit more about exactly where the SignedDocument logic belongs. The biggest problem with it is not that it is OOP per-se, but that it co-mingles two responsibilities, namely "locating the signed element" and "validating the signature". It might be possible to split these responsibilities into two passes, but this feels much less secure since it re-introduces the possibility of a "parser differential" as in the last CVE.

I may not have time to look at this further for a few weeks however.

ahacker1-securesaml · 2025-03-20T14:23:11Z

ok, ResponseValidator -> change to Assertion Validator, we don't care about the Response, since IdPs don't even sign it. Also don't mix this with the signature verification steps.
To clarify (edited): the AssertionValidator verifies the business logic associated with a SAML Assertion.

Response (Parser) -> Since the individual assertions are signed, this should return a list of Signed Assertions, and not nesscarily the whole (unsigned) SAML Authentication response.

And maybe we should do something like SignedAssertion class?

Would remove the SignedDocument class. Let's replace this with something called SignedXML, which cryptographically verifies some bytes of XML at instantianation.

Response parser can then accept SignedXML only.

Hence, whatever is used for the Response class is always signed.

johnnyshields · 2025-03-20T14:25:26Z

@ahacker1-securesaml I agree with all those points, that sounds along the right lines. I will need more time to get my head around it, the code is really spaghetti today 🍝 and the real trick is getting all the tests to pass (fortunately the test suite is quite robust!!)

ahacker1-securesaml · 2025-03-20T14:58:59Z

lib/ruby_saml/xml/signed_document_info.rb

+      # Get the ID of the signed element
+      # @return [String] The ID of the signed element
+      def subject_id
+        # TODO: The error here is problematic, perhaps it can be checked elsewhere


I would not recommend exposing an the ID of signed Element. It can't be used for security decisions, since the ID is not guaranteed to be unique & is attacker controlled.

ahacker1-securesaml · 2025-03-20T14:59:34Z

lib/ruby_saml/xml/signed_document_info.rb

+      # Get the Reference node
+      # @return [Nokogiri::XML::Element] The Reference node
+      def reference_node
+        signed_info_node.at_xpath('./ds:Reference', { 'ds' => RubySaml::XML::DSIG }) ||


Load this from the canonicalized_signed_info, since only canonicalized_signed_info was authenticated

ahacker1-securesaml · 2025-03-20T15:01:52Z

lib/ruby_saml/xml/signed_document_info.rb

+        return nil unless cert
+
+        fingerprint_alg = RubySaml::XML.hash_algorithm(algorithm).new
+        fingerprint_alg.hexdigest(cert.to_der).gsub(/[^a-zA-Z0-9]/, '').downcase


Authenticated = cert.to_der, should only use cert.to_der in future, and not cert

pitbulk · 2025-03-20T15:05:05Z

The main challenge is to:

First, verify the structure, number of signatures, number of elements, etc

validate_structure
validate_version
validate_num_assertion
validate_signed_elements
validate_signature

And once the signature is validated and we know we can trust the Assertion (because it had a Signature, or its Response has a Signature), use the right XML to keep validating rest of elements and make that the methods that retrieve info (attributes, nameId, etc use the right XML as well).

validate_id
validate_success_status
validate_no_duplicated_attributes
validate_in_response_to
validate_one_conditions
validate_conditions
validate_one_authnstatement
validate_audience
validate_destination
validate_issuer
validate_session_expiration
validate_subject_confirmation
validate_name_id

The main problem is that most of the tests use payloads with invalid signatures and in the current implementation, this is not a problem due the validation order.

As making signature validation optional is something that we better don't even add in the code, I will try to start updating the
invalid payloads, when possible to contain a valid Signature. In cases where this is not possible, we will need to adapt the test cases.

johnnyshields · 2025-03-20T15:07:22Z

@pitbulk making the signatures validatable in the tests would be a huge help to give us flexibility in refactoring.

ahacker1-securesaml · 2025-03-20T15:09:17Z

First step should be verifying signature IMO. After verifying signature, we can get some signed bytes of XML. Then Assertion re-parses the signed bytes of XML, and verifies the assertion data.

That gives us much more flexibility, if we want to test some assertion on some unsigned data, we simply mock the payload as signed.

pitbulk · 2025-03-20T15:12:36Z

@ahacker1-securesaml, If we identify that something is wrong in the XML (xsd validation, number of elements that we want to accept, etc) we save us of checking signatures on XMLs that we gonna consider already invalids.
We can revalidate the signed bytes of XML as well.

ahacker1-securesaml · 2025-03-20T15:14:55Z

I can see the performance benefits there. However I feel strongly that we would be mixing the SAML Assertion processing logic and the Signature logic.

Ideally we want to keep them separate as much as possible. And separating logic would help with security

taylorreis · 2025-03-20T15:17:50Z

@ahacker1-securesaml, would you be able to add some context to the following?

ResponseValidator -> change to Assertion Validator, we don't care about the Response, since IdPs don't even sign it.

Signing the Response alone is something that identity providers can do in practice (Entra ID is an example). The Assertion should inherit the signature.

ahacker1-securesaml · 2025-03-20T15:21:57Z

Three ways SAML can be signed:

Assertions alone: default for most Identity Providers
Assertion & Responses: default for Okta, and maybe some other IdPs
Responses alone: have not seen any IdP do this. Would require specialized configuration.

In the responses alone case, I have seem some SAML libraries only verify signatures on SAML Assertions. So it would break their code.

For the Assertions & Responses case: an attacker can trivially remove the signature on the response, and the SAML payload would still be valid.

For assertions alone case: an attacker can just change whatever they want on the response i.e. statuscode ...

So IMO, there's no point in verifying the properties of a SAML responses i.e. checking status code .... It would mean mixing unsigned data (Response) and the signed data (Assertion). That makes it really hard to analyze in terms of security, and has directly lead to vulnerabilities in libraries such as https://github.com/node-saml/node-saml

My idea is to only verify whatever is in the (signed) Assertion, be it the AudienceRestriction. These are actually important for security

AND

we separate the signature verification logic into it's own module. I don't believe it should be part of verifying the assertion logic.

taylorreis · 2025-03-20T15:43:53Z

Agree with the principles described, but not supporting a configuration that signs the Response alone would be a breaking change, given that it's currently supported in the wild:

ahacker1-securesaml · 2025-03-20T15:46:35Z

We still support signed responses, we separate business logic from signature verification steps

extracted_signature = extract_signature(untrusted_doc)
signed_bytes = get_signed_bytes(extracted_signature)
# if signed_bytes happens to be a Response, then we get the Assertion from ./saml:Assertion 
assertion = ParseAssertion(signed_bytes) 
AssertionValidator.validate(assertion) # business logic
return assertion

we just don't verify the business logic of a Response that the SAML spec supposedly requires. For example, verifying the StatusCode, or even the Version.

johnnyshields · 2025-03-20T15:59:10Z

It would definitely help us to get a library of all the vendor IDP SAML variations in the wild... maybe some other SAML lib already has one?

As it stands the current RubySaml test suite is pretty good, it should protect us from a pretty wide range of possible regressions.

pitbulk · 2025-03-21T09:15:51Z

In terms of security, I agree that if the Response is not signed, and the StatusCode is not "protected" anyone can modify it, but in terms of business, if I receive a StatusCode != Success, I don't need to do anything else, I will simply reject this SAMLResponse and don't grant access to the app, the same applies to malformed XMLs received or XMLs wellformed, but that we want to avoid and consider invalids (multiple Assertions, more than 2 Signatures).

dblessing · 2025-03-25T14:33:31Z

@johnnyshields This PR works fine with omniauth-saml. Anything specific I can help test?

johnnyshields · 2025-03-25T14:56:54Z

Just a heads up here--@pitbulk I'm not going to have the bandwidth for several months to take this PR any further--have to focus on running my company. So the last mile will have to be carried by someone else.

pitbulk · 2025-04-03T22:31:10Z

Understood, thanks for your hard work!

johnnyshields · 2025-04-30T12:34:51Z

@pitbulk I think we should merge this branch into v2.x--or even master--regardless. I realize there are more refactors needed, but perhaps we can track those in a follow-up ticket?

pitbulk · 2025-06-08T00:13:34Z

@johnnyshields, I was able to reorder the validations and adjust tests and payloads:
#764

johnnyshields · 2025-09-22T17:14:45Z

@pitbulk I will release this branch to production tomorrow. I think after it runs stablely for a week we should look at releasing it as v2.x.

It's not "perfect" but it's a heck of a lot better than the current v1.x main release.

taylorreis · 2025-10-08T21:50:15Z

@johnnyshields @pitbulk anything I can do to help with v2.x? Happy to lend a hand here or elsewhere.

johnnyshields · 2025-10-21T17:58:04Z

@pitbulk I've been using this branch extensively in production for > 1 month without issue for 10+ different SAML integrations.

Let's merge and release it--it's infinitely better than the current v1.x branch.

pitbulk · 2025-10-21T20:37:39Z

Ok, I will review next week and merge and release a 2.0.0 with this code.

Is anything else missing?

johnnyshields · 2025-10-22T02:28:08Z

@pitbulk no, nothing is missing, it's ready to release.

pitbulk · 2025-11-14T10:01:11Z

Sorry for the delay!, I'm doing some final test this weekend and gonna be released next week.

pitbulk · 2025-11-20T12:24:19Z

UPGRADING.md

-In `2.0.0`, REXML has been replaced with Nokogiri. This change should be transparent
-to most users, however, see note about Custom Metadata Fields below.
+In `2.0.0`, REXML has been replaced with Nokogiri. As a result, there are minor differences
+in how XML is generated, ncluding SAML requests and SP Metadata: 


johnnyshields · 2025-11-20T13:15:45Z

Wonderful news!

johnnyshields changed the title ~~[WIP] v2.x - Nokogiri Refactor Part X - Conversion of SignedDocument (attempt to SignedDocumentInfo class)~~ [WIP] v2.x - Nokogiri Refactor Part 6 - Conversion of SignedDocument and final removal of REXML 🎉 Mar 16, 2025

Refactor to use SignedDocumentInfo. Remove REXML entirely in favor of…

ae09439

… Nokogiri.

johnnyshields force-pushed the signed-document-info-refactor branch from 06a205c to ae09439 Compare March 17, 2025 03:18

johnnyshields mentioned this pull request Mar 17, 2025

RubySaml 2.0 release -- Final Steps #703

Open

14 tasks

johnnyshields added 4 commits March 20, 2025 00:20

Update README.md

3b16870

Merge remote-tracking branch 'remotes/johnnyshields/v2.x-decryption-r…

2ed7183

…efactor' into signed-document-info-refactor

Merge remote-tracking branch 'remotes/origin/v2.x' into signed-docume…

04a7b8e

…nt-info-refactor

Change deprecations to be removed in RubySaml 3.0.0, for SemVer reasons

591bdaa

ahacker1-securesaml reviewed Mar 20, 2025

View reviewed changes

johnnyshields added 3 commits September 19, 2025 06:56

Merge branch 'v2.x' into signed-document-info-refactor

0b0799b

Update UPGRADING.md for REXML to Nokogiri changes

a7dba95

Update add_extras in readme

ba37e1f

pitbulk reviewed Nov 20, 2025

View reviewed changes

pitbulk merged commit 828be53 into SAML-Toolkits:v2.x Nov 20, 2025
29 checks passed

This was referenced Nov 20, 2025

Parser consolidation follow-ups #775

Open

IMPORTANT: All users must upgrade to ruby-saml 1.18.0 ASAP #753

Open

Uh oh!

[WIP] v2.x - Nokogiri Refactor Part 6 - Conversion of SignedDocument and final removal of REXML 🎉 #759

[WIP] v2.x - Nokogiri Refactor Part 6 - Conversion of SignedDocument and final removal of REXML 🎉 #759

Uh oh!

Conversation

johnnyshields commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pitbulk commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

p- commented Mar 20, 2025

Uh oh!

johnnyshields commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

johnnyshields commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahacker1-securesaml Mar 20, 2025

Choose a reason for hiding this comment

Uh oh!

ahacker1-securesaml Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ahacker1-securesaml Mar 20, 2025

Choose a reason for hiding this comment

Uh oh!

pitbulk commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

johnnyshields commented Mar 20, 2025

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pitbulk commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

taylorreis commented Mar 20, 2025

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

taylorreis commented Mar 20, 2025

Uh oh!

ahacker1-securesaml commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

johnnyshields commented Mar 20, 2025

Uh oh!

pitbulk commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dblessing commented Mar 25, 2025

Uh oh!

johnnyshields commented Mar 25, 2025

Uh oh!

pitbulk commented Apr 3, 2025

Uh oh!

johnnyshields commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pitbulk commented Jun 8, 2025

Uh oh!

johnnyshields commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

taylorreis commented Oct 8, 2025

Uh oh!

johnnyshields commented Mar 16, 2025 •

edited

Loading

pitbulk commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

johnnyshields commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

johnnyshields commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml Mar 20, 2025 •

edited

Loading

pitbulk commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

pitbulk commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

ahacker1-securesaml commented Mar 20, 2025 •

edited

Loading

pitbulk commented Mar 21, 2025 •

edited

Loading

johnnyshields commented Apr 30, 2025 •

edited

Loading

johnnyshields commented Sep 22, 2025 •

edited

Loading

johnnyshields commented Oct 21, 2025 •

edited

Loading