Implicit constraint by andrewlee94 · Pull Request #3870 · Pyomo/pyomo

andrewlee94 · 2026-03-09T21:21:11Z

Fixes None.

Summary/Motivation:

Pyomo currently has no way of explicitly representing the constraints within an ExternalGreyBoxBlock, which means that they cannot be seen by diagnostics tool (such as Incidence Analysis). This PR adds a new optional ExternalGreyBoxConstraint component which duck-types Constraint which can be added to ExternalGreyBoxBlocks to provide a representation of the grey box constraints. The incidence analysis and PyomoNLPWithGreyBoxBlocks code has been updated to recognise and understand these components.

Changes proposed in this PR:

Adds ExternalGreyBoxConstraint objects
Updates incidence analysis tools to look for ExternalGreyBoxConstraints and include these in the incidence analysis
Updates PyomoNLPWithGreyBoxBlocks to include ExternalGreyBoxConstraints in constraint mappings

Legal Acknowledgement

By contributing to this software project, I have read the contribution guide and agree to the following terms and conditions for my contribution:

I agree my contributions are submitted under the BSD license.
I represent I am authorized to make the contributions and grant the license. If my employer has rights to intellectual property that includes these contributions, I represent that I have received permission to make contributions and grant the required license on behalf of that employer.

Robbybp

Thanks for the PR! At first glance, this looks pretty good. It will take a little while to get through this.

Can you post a simple example of calling an Incidence Analysis method on a model with a gray box block
I probably would have tried to do this only with something like your EGBConstraintBody object, i.e., I would not have created ExternalGreyBoxConstraint. Did you consider this? What was the motivation to add an "official" Component object?

andrewlee94 · 2026-03-10T01:58:35Z

@Robbybp

There are some tests which show how the Incidence Analysis tools work with ExternalGreyBoxBlocks, which are probably the best examples I have: https://github.com/Pyomo/pyomo/pull/3870/changes#diff-97563c1c5b8bf2ed2f37b88015b2fdb3004548d7c4b640dc7a68906a4f48b8d7. The big thing to note in the examples is the use of build_implicit_constraint_objects=True so that the ExternalGreyBoxConstraint objects are constructed - from there usage is identical and the results are what I expected them to look like.
The reason for a full component was to make it easy to integrate with things like the IDAES DiagnosticsToolbox. Having a component lets us use component_data_objects(ExternalGreyBoxConstraint) so it was easy to update the diagnostics tools to include these. It also lets us duck-type the implicit constraint objects so they look and behave much like constraints to these tools; the main difference being that they do not have an expr which raises an exception.

Robbybp

Some initial comments. I haven't had time to think through the implications of ExternalGreyBoxConstraint yet.

This is reminding me that our grey-box infrastructure could use some better documentation.

Robbybp · 2026-03-15T21:58:22Z

+        external_grey_box_model,
+        inputs=None,
+        outputs=None,
+        build_implicit_constraint_objects=False,


Any reason this is optional?

Only to ensure there is backward compatibility. I decided to make sure there was still a way to guarantee the model would behave the same as before.

Robbybp · 2026-03-15T22:31:35Z

+            for input_name in ext_model.input_names():
+                incident_variables.append(self._parent_model.inputs[input_name])


We can't assume that inputs are indexed by input_names(). If inputs are provided to set_external_model, they will be a reference-to-list of VarData. Here's an example that fails with the current implementation:

inputs-as-reference.py

import numpy as np import pyomo.environ as pyo from scipy.sparse import coo_matrix from pyomo.contrib.pynumero.interfaces.external_grey_box import ( ExternalGreyBoxBlock, ExternalGreyBoxModel, ) from pyomo.contrib.incidence_analysis import IncidenceGraphInterface class MyGreyBox(ExternalGreyBoxModel): def n_inputs(self): return 4 def input_names(self): return [f"x[{i}]" for i in range(1, self.n_inputs() + 1)] def n_equality_constraints(self): return 3 def equality_constraint_names(self): return [f"eq[{i}]" for i in range(1, self.n_equality_constraints() + 1)] def set_input_values(self, input_values): if len(input_values) != self.n_inputs(): raise ValueError( f"Expected {self.n_inputs()} inputs, got {len(input_values)}." ) self._inputs = np.asarray(input_values, dtype=float) def evaluate_equality_constraints(self): x1, x2, x3, x4 = self._inputs return np.array( [ x2 * x3**1.5 * x4 - 5.0, x1 - x2 + x4 - 4.0, x1 * np.exp(-x3) - 1.0, ], dtype=float, ) def evaluate_jacobian_equality_constraints(self): x1, x2, x3, x4 = self._inputs # Rows correspond to eq[1], eq[2], eq[3] # Cols correspond to x[1], x[2], x[3], x[4] rows = np.array([0, 0, 0, 1, 1, 1, 2, 2], dtype=int) cols = np.array([1, 2, 3, 0, 1, 3, 0, 2], dtype=int) data = np.array( [ x3**1.5 * x4, # d(eq1)/d(x2) 1.5 * x2 * x3**0.5 * x4, # d(eq1)/d(x3) x2 * x3**1.5, # d(eq1)/d(x4) 1.0, # d(eq2)/d(x1) -1.0, # d(eq2)/d(x2) 1.0, # d(eq2)/d(x4) np.exp(-x3), # d(eq3)/d(x1) -x1 * np.exp(-x3), # d(eq3)/d(x3) ], dtype=float, ) return coo_matrix( (data, (rows, cols)), shape=(self.n_equality_constraints(), self.n_inputs()), ) m = pyo.ConcreteModel() m.x = pyo.Var([1, 2, 3, 4], bounds=(0.0, None), initialize=1.0) m.objective = pyo.Objective( expr=m.x[1] ** 2 + 2 * m.x[2] ** 2 + 3 * m.x[3] ** 2 + 4 * m.x[4] ** 2 ) m.grey_box = ExternalGreyBoxBlock() m.grey_box.set_external_model( MyGreyBox(), inputs=[m.x[i] for i in range(1, 5)], build_implicit_constraint_objects=True, ) igraph = IncidenceGraphInterface(m) matching = igraph.maximum_matching()

I was not even aware you could do that (hence your comment about documentation I guess). What is the canonical way to defining inputs and names then?

I'm not sure if there is a "canonical" way. Here's what I remember:

Carl's original implementation created new variables on the ExternalGreyBoxBlock. These variables are indexed by the names defined on the ExternalGreyBoxModel. The names are also used under the hood in PyomoNLPWithGreyBoxBlocks (or somewhere) as keys (or something).

I did not want to duplicate variable for every input, so I added the option to provide input (and output) variables if they already exist on the model. In this case, we create a Reference to the user-provided variables. Names are still used under the hood somewhere.

You can always assume the names exist, although personally I would prefer if we did not rely on them. inputs is a Var whose values are the VarData to be treated as inputs to the grey-box, in order.

So, inputs.keys() or [v.local_name for v in inputs] then.

inputs.keys():

If inputs were provided, these are ints

If inputs were not provided, these are names

[v.local_name for v in inputs.values()]

If inputs were provided these are the original var names

If inputs were not provided, these are, e.g., inputs[input_name]

What is the canonical way to defining inputs and names then?

Names can always be accessed by ExternalGreyBoxModel.input_names. Input variables can always be accessed by ExternalGreyBoxBlockData.inputs.values. These are the canonical ways to access both of these data.

Robbybp · 2026-03-15T22:32:30Z

+                jacobian_entry = jac[con_idx, var_idx]
+
+                if abs(jacobian_entry) >= jac_tolerance:
+                    incident_variables.append(self._parent_model.inputs[input_name])


As above, we can't assume that inputs are indexed by names.

Robbybp · 2026-03-16T01:10:45Z

+    def get_incident_variables(
+        self, use_jacobian=False, jac_tolerance=JAC_ZERO_TOLERANCE
+    ):
+        """
+        Get the variables that are incident on this implicit constraint.
+
+        Parameters
+        ----------
+        use_jacobian : bool, optional
+            If True, only include variables with non-zero Jacobian entries.
+        jac_tolerance : float, optional
+            The tolerance below which Jacobian entries are considered zero.
+
+        Returns
+        -------
+        list of VarData
+            List containing the variables that participate in the expression
+
+        """
+        # There are two ways incident variables could be defined for an implicit constraint:
+        # 1) We consider all inputs to the external model to be incident on the implicit constraint
+        # 2) We consider only the inputs that have a non-zero Jacobian entry for the implicit constraint
+        # to be incident on the implicit constraint
+        # Both have their uses, so we will support both.


My preference would be to always use the Jacobian and to determine incidence using only the Jacobian's structure. I don't like using numeric values of the Jacobian as that implicitly depends on the input values we're evaluating.

In general, I'm open to other methods of determining variable-constraint incidence. Can you provide an example use-case for considering the variable-constraint incidence to be dense even if the Jacobian is sparse? Does this have to do with decomposability when solving strongly connected components?

I think there are a few things going on here, and I am not sure there is a single correct answer. Right now, I think there are three possible paths (up from the two I documented in the code), and all three likely have cases where they would be "more correct".

Firstly, a concrete case I can think of where we would want to assume all variables are incident on all constraints: the block-triangularization solver. As the ExternalGreyBoxModel can only be called as a monolithic block, from the perspective of the block-triangularization solver it cannot be decomposed which effectively means all variables are incident on all constraints.

Beyond this however, things get a bit murkier and I'll start with an example.

The main use case I have for ExternalGreyBoxModels is to wrap a procedural thermodynamic flash algorithm which handles four phases (aqueous, organic, vapor and solids). Incidentally, the procedural code returns hard-zero values for non-physical phases (which results in numerical issues I need to deal with, but that is a separate issue). For the ExternalGreyBoxModel however, this raises an issue that numerically, the structure of the model is effectively changing as phases appear and disappear. When I initially implemented this I tried to filter out all those zero values in the wrapper for efficiency, but that mean I ran into issues in the ExternalGreyBoxModel because the shape of the Jacobian was changing from iteration to iteration. My solution was to make the Jacobian fully specified, and to keep all the zero values. However, this means that we cannot deduce the numerical structure of the problem from the Jacobian shape, and hence the idea to use the value of the Jacobian to get the local strucutre.

This would also be needed/useful for detecting local singularities, such as cases where a multiplier variable goes to zero and effectively makes a constraint under/overspecified.

TLDR: I see three possible approaches, each with their own value:

Assume full incidence: useful for the block-triangularization solver, or users of a grey box who don't want to know about what is inside it.

Incidence based on Jacobian structure: cheap and unambiguous, but requires that the Jacobian shape reflect the numerical structure of the problem. This is feasible for many problems but starts to fail in cases like mine, and requires the developer of the ExternalGreyBoxModel to implement the Jacobian correctly.

Incidence based on Jacobian value: requires some inference of what Jacobian value to consider a variable incident on a constraint (and thus scaling dependent), but can handle more complex cases where the numerical structure of the problem is state-dependent.

In that case, the "gray-box way" to do this would be to have a Jacobian that captures every variable-constraint pair that can possibly be nonzero. But you're saying that that is not so useful for debugging? I'll think about what I'd like to do here.

Incidence analysis was not designed for cases where the incidence is state-dependent, so if we support this here, I'd like to think about supporting this for regular constraints as well. We could generate the entire incidence graph then filter edges (maybe with a new method) if they correspond to a low value?

I think there are two parts to that.

The first part if whether we can (and should) expect the grey box Jacobian to properly capture "every variable-constraint pair that can possibly be nonzero", as this depends on the developer of the grey box and how they implement the Jacobian. The best answer there might just be to make sure things are documented clearly.

The second part is whether incidence analysis should look for state-dependent structure, which as you note it was not designed to do (although I think it will at least exclude any expression branches with a fixed zero). Your suggestion about having a separate method for that (if we want to do it) is probably the cleanest approach for that.

However, I am not sure that addresses the point about the block-triangularization solver and the distinction between numerically decomposable and functionally decomposable.

For your use case, do you need the block-triangularization solver to support GreyBox?

I would need to think about it more, but you have a better understanding of how the everything works and what might be possible. I agree that is it definitely possible in some cases, but I can see edge cases that we would need to take care of.

For example, say we had a grey box that was fundamentally two separate, block-triangularizable sub-systems, A and B (obviously poor design by the developer, but possible). If we ran it through the BT tools and solver, it is possible that it would identify that sub-block A could be solved for before the inputs for sub-block B were known (numerically at least), meaning that if we were to call the grey box then we could have undefined behaviour when the grey box solves sub-block B (e.g. what happens if the inputs have value of None? What if the current values are infeasible for sub-block B?).

However, as I am not expecting the use the block-triangularization solver myself, I would be fine with deferring this if we think it needs more thought or effort.

In this case, B would need to be initialized with well-defined inputs. (I don't think this is a restriction. I think A's inputs need to be valid values at the start of the solve as well.) It doesn't matter if the initial values are infeasible, we just ignore B's residual while solving A. If A's outputs end up making B infeasible, that's a general problem that can happen in any BT algorithm (and in my opinion is a useful feature of the algorithm).

But this doesn't need to happen in this PR. We can always raise NotImplementedError for now. I'm just looking for ways to break this apart and limit the functionality I promise to support moving forward.

Agreed - should I add a check for ExternalGreyBoxBlocks in the BT solver and raise an exception then?

Leave it for now. First I want to (a) see what is required to support user-provided inputs and outputs and (b) decide what to do about state-dependent Jacobian structure in the GB.

I think state-dependent structure is probably something for a separate PR too - as you noted the current tools do not consider state dependence on a pure Pyomo model, so grey box models should behave the same way. If we go with that, then all we need for incidence analysis is to look at the shape of the Jacobian that is returned which should be easy.

User inputs should also just be a matter of finding the right way to get the index-name-object mappings.

andrewlee94 · 2026-03-23T08:17:11Z

@Robbybp I implemented an approach to use the Jacobian shape to determine incidence which turned out to be fairly simple, and also addressed the issue with input names. However, using your test case revealed two issues:

First and most importantly, to be able to use the Jacobian shape, it means we need to be able to evaluate the Jacobian. This means the input variables have to be initialized at least (but not converged). However, the incidence analysis tools are structural in nature, and thus are meant to be applied before a model is initialized and/or solved. Thus, there is a chance (depending on how the Grey Box is implemented), that we do not have values for the inputs and we will run into failures when we try to get the Jacobian. I ran into this with your test case, which did not initialize the variable values.
Secondly, and simpler, I assume output names can be assigned the same way, meaning we need a similar fix there as well?

andrewlee94 · 2026-03-23T08:31:50Z

Digging a little deeper, it is actually a bit more complicated. The main error is coming from the fact that the method to get the Jacobain requires the ExternalGreyBoxModel to have the _inputs attribute. However, this does not get created until the set_input_values method has been called.

This means the issue really lies in the code for the grey box, however all the examples are written this way.

andrewlee94 · 2026-03-23T09:22:03Z

@Robbybp I just pushed my latest version with the fixes for using Jacobian shape and allowing external input and output variables.

blnicho · 2026-05-14T15:07:45Z

@andrewlee94 @Robbybp is this ready for final reviews?

Robbybp · 2026-05-14T17:54:57Z

is this ready for final reviews?

I still want to go through the ImplicitConstraint implementation in more detail, but I don't think that should stop anybody else from reviewing.

blnicho · 2026-05-21T20:30:57Z

is this ready for final reviews?

I still want to go through the ImplicitConstraint implementation in more detail, but I don't think that should stop anybody else from reviewing.

@Robbybp will you have time to look at this PR this week. We would like to get it into the Pyomo May release (which has already been pushed back by a week). The dev team is starting to review it also.

jsiirola

Overall, this looks reasonable. Several questions, though...

jsiirola · 2026-05-22T05:49:25Z

+    if constraint.ctype is ExternalGreyBoxConstraint:
+        return constraint.body.get_incident_variables()
+    else:
+        return get_incident_variables(constraint.body, **kwds)


Why does this need a special override (couldn't this be handled in get_incident_variables)?

I thought it was more natural to have a function that would handle "any" constraint rather than adding more logic to get_incident_variables, which is documented to operate on expressions.

My thought process was to make the EGBBody look more like an expression so that the special case handling was unnecessary.

For example, if EGBBody had an args property that returned the input / output variables, then the existing code should be able to handle it, right?

jsiirola · 2026-05-22T06:00:56Z

+JAC_ZERO_TOLERANCE = 1e-8
+
+
+class EGBConstraintBody:


Should this use either NumericValue (or more likely NumericExpression) as a base class? This is effectively representing an anonymous named expression....

What would this look like in practice? I have not looked into NumericExpression before so I am not familiar with what it looks like and what would need to be implemented.

NumericExpression is the base class for all Pyomo expression objects (operators, named expressions, etc). Among other things, this would let the "expression" that defines the row in the EGB to be treated like any other expression.

My first thought is that this sounds a bit dangerous. The only benefit I see is access to identify_variables, and there is some potentially confusing incompatibility that would come up. What would happen if you try to use this in another constraint or expression?

It depends on the writer. For all current writers, you would end up with an error from an unsupported operator in the constraint expression, which I think is what we would want.

What benefits would we get from doing this then (besides identify_variables), and what would the "expression" actually look like - we can get a value and partial derivatives of the expression, but the form of numerical expression is not defined.

The biggest benefit is consistency. My basic idea is that a row from an EGB should look / act like a constraint ExternalFunctionExpression == rhs (where ExternalFunctionExpression is the body and rhs is either the output variable or 0). The fact that several of them are all evaluated at a single time is irrelevant to the "user experience" -- I really think that an EGB should look / act like an indexed Constraint.

jsiirola · 2026-05-22T06:44:54Z

+        implicit_constraint_id
+            The identifier for this implicit constraint or output variable


Why is this part of the container and not the Data?

This is the equivalent of the rule for EGBConstraints, where we provide the list of IDs we need to create. I changed the name to implicit_constraint_ids to be slightly more specific, and updated the doc string to explain what it is and why it is needed.

…onstraint

Robbybp · 2026-05-23T01:15:40Z

        self.assertIn('d scaling provided', solver_trace)
        # x_order: ['egb.inputs[F]', 'egb.inputs[P1]', 'egb.inputs[P3]', 'egb.inputs[Pin]', 'egb.inputs[c]', 'egb.outputs[P2]', 'egb.outputs[Pout]', 'mu']
-        # c_order: ['ccon', 'pcon', 'pincon', 'egb.pdrop1', 'egb.pdrop3', 'egb.output_constraints[P2]', 'egb.output_constraints[Pout]']
+        # c_order: ['ccon', 'pcon', 'pincon', 'egb.eq_constraints[pdrop1]', 'egb.eq_constraints[pdrop3]', 'egb.output_constraints[P2]', 'egb.output_constraints[Pout]']


Can you revert this just to clean up the diff slightly.

Robbybp · 2026-05-23T02:11:15Z

+    # Compatibility API for PyomoNLP - this is only a partial implementation
+    def get_pyomo_variables(self):
+        return self._pyomo_model_var_datas
+
+    def get_pyomo_constraints(self):
+        return list(self._pyomo_model_constraint_names_to_datas.values())
+
+    def get_pyomo_equality_constraints(self):
+        return [c for c in self.get_pyomo_constraints() if c.equality]
+
+    def get_pyomo_inequality_constraints(self):
+        return [c for c in self.get_pyomo_constraints() if not c.equality]
+
+    def get_primal_indices(self, var):
+        # get the name of the variable
+        var_name = var.getname(fully_qualified=True)
+        # get the index of the variable in the primals
+        try:
+            return self._primals_names.index(var_name)
+        except ValueError:
+            raise ValueError(f'Variable {var_name} not found in primals.')


I assume these are added for compatibility with IncidenceGraphInterface? Or is there another reason beyond this?

It may also have been needed for the IDAES diagnostics tools as they use PyomoNLP interface, and assumed the presence of some of these methods.

Robbybp · 2026-05-23T02:44:01Z

+)
+
+
+class TestExternalGreyBoxAsNLP(unittest.TestCase):


I'm not sure I understand the purpose of any of the tests in this file. The name suggests that we are testing PyomoNLPWithGreyBoxBlocks in the case where the underlying ExternalGreyBoxBlock has constructed ExternalGreyBoxConstraints.

This PR makes only minor changes to PyomoNLPWithGreyBoxBlocks, so I don't see the need for extensive testing of its variables, constraints, Jacobian, etc.

Since we now always construct the constraints, the existing tests for PyomoNLPWithGreyBoxBlocks test this already, right?

If anything, I think this file should just contain unit tests for the methods added to ducktype PyomoNLP (or these tests can go in the existing PyomoNLPWithGreyBoxNLP test file).

Did you address this in the PR you opened? I believe this was originally added when EGBConstraint were optional - the old tests covered the previous case where EGBConstraints were absent, and these tests covered when they were present.

No. I only touched test_external_grey_box_model_with_constraints.py.

I recommend removing this file.

If they don't already exist, can you add tests for the new PyomoNLPWithGreyBoxBlocks methods to the existing test_pyomo_grey_box_nlp.py file? (If they do exist, can you point them out?)

Robbybp · 2026-05-23T03:03:43Z

+    @property
+    def lb(self):
+        """float : the value of the lower bound of a ExternalGreyBoxConstraint expression.
+
+        Implicit constraints always have a lower bound of 0.
+        """
+        return 0.0
+
+    @property
+    def ub(self):
+        """float : the value of the upper bound of a ExternalGreyBoxConstraint expression.
+
+        Implicit constraints always have an upper bound of 0.
+        """
+        return 0.0
+
+    @property
+    def equality(self):
+        """bool : True. ExternalGreyBoxConstraints are always equalities."""
+        return True


Can some of these properties be removed (for now)? I think most of them are not necessary for the functionality this PR is adding. My understanding is that subclassing Component (instead of ducktyping) has the following advantages:

Access to .pprint

.name behaves like a normal Pyomo name

Compatibility with component_data_objects

Everything else, to me, has marginal value for right now. My suggestion is to remove as many properties and methods as possible from this file. My rationale is that it will be relatively easy to add these later if we have a good reason to, but relatively annoying to change if we add them now.

I am not sure if these are all strictly necessary, but they were added so that EGBConstraints would behave like normal Constraints in the IDAES diagnostics and statistics tools.

Can you show me the usage patterns where you would like this to behave like a regular constraint?

In general, I'm okay with branching on con.ctype, e.g.

[con.lower if con.ctype is Constraint else 0 for con in constraints]

instead of making this consistently ducktyped. (This is not a great example, as lower is one of the attributes I might actually keep.)

I actually prefer to shift the burden onto the user like this instead of adding more functionality that we need to support, especially when I'm not 100% sure the functionality is well-motivated.

I think at least the following attributes/methods are not necessary: strict_lower/upper, has_lb/ub, get/set_value, (l/u)slack, active, (de)activate, and one of ub/lb or upper/lower.

I was going in the opposite direction here, especially seeing that as these are equalities these values are well defined. The problem with branching on constraint type is that it does require the user (or middle developer) to be aware of the difference, and I am not that confident in users of IDAES. Specifically, in IDAES there are a decent number of statistics and diagnostics functions that rely on bounds for making determinations (including determining if a constraint is an equality, as they were written before the equality property was added/documented). My goal was to make EGBConstraints fit seamlessly with that so that i) I did not have to rewrite all those functions, and ii) to prevent issues in future from devleopers overlooking EGBConstraints as a possibility.

Robbybp · 2026-05-23T04:18:43Z

+        m.F.fix(1)
+
+        # Check Dulmage-Mendelsohn partitioning of the incidence graph
+        igraph = IncidenceGraphInterface(m, include_inequality=False)


How do these tests differ from those in the Incidence Analysis tests? I would lean towards putting all of the Incidence Analysis-Grey box integration tests in Incidence Analysis.

Robbybp · 2026-05-24T17:41:04Z

I just opened a PR into this branch, andrewlee94#2, where I remove many of the tests in test_external_grey_box_model_with_constraints.py.

Refactor EGBB constraint tests

…pyomo into implicit_constraint

Robbybp · 2026-05-25T14:14:56Z

+    implicit_constraint_id : str, optional
+        The identifier for this implicit constraint or output variable in
+        the external model.


What is the purpose of implicit_constraint_id?

This is the "rule" for the constraint - it tells us which output or equality constraint key this constraint related to. This gets passed on to the EGBConstraintBody object so that it can look up the correct values from the EGBModel.

Okay, I see. What happens if an equality constraint and an output have the same name? Do we need additional data to tells us which "type" of implicit constraint we have?

I had not considered that - my first question there is whether we should support that in general as that ambiguity might extend further than just this (i.e., you can't have two Python objects with the same name, so should we allow implicit constraints to have the same name)?

It would be possible to fully specify the constraint id by including both type and name/index, possibly just breaking the current argument into two (constraint_id and output_id) and allowing only one to be set.

Why not just use an integer to indicate the position in the list of constraints / output variables (where output variables are offset by the number of constraints)?

Why not just use an integer to indicate the position in the list of constraints / output variables

I would either go with this approach or use a tuple of (enum, int). I don't love that names are a mandatory part of ExternalGreyBoxModel and would like to rely on them as little as possible.

I agree that we should avoid strings, and the more I think about it, the more I think the id should be an integer.

jsiirola

I am somewhat torn by this PR. This implementation is pretty convoluted (I still haven't convinced myself why we need all this complexity). However, I think this is more a reflection of the design of ExternalGreyBox than it is this PR, per se. To fix it would mean a complete rewrite of the ExternalGreyBox functionality, which it probably beyond the scope of this PR (and outside the scope / timing we are trying to keep for the Pyomo / IDEAS releases).

@blnicho / @Robbybp / @andrewlee94: Is the answer that we clean this up as best we can and merge it under the umbrella of "it's going into contrib" and then defer the cleanup of this functionality until some point in the future?

jsiirola · 2026-05-28T20:13:40Z

+                f"the external model."
+            )
+
+    def _dereference_egb(self):


I don't think that this level of defensive programming in necessary, especially if the EGB keeps a dictionary that has hard references to the EGB bodies (see your comment in __init__ on L52).

jsiirola · 2026-05-28T21:19:49Z

+JAC_ZERO_TOLERANCE = 1e-8
+
+
+class EGBConstraintBody:


The biggest benefit is consistency. My basic idea is that a row from an EGB should look / act like a constraint ExternalFunctionExpression == rhs (where ExternalFunctionExpression is the body and rhs is either the output variable or 0). The fact that several of them are all evaluated at a single time is irrelevant to the "user experience" -- I really think that an EGB should look / act like an indexed Constraint.

jsiirola · 2026-05-29T02:42:07Z

+        self._constraint_id = constraint_id
+
+        self._output_idx = None
+        self._eq_cons_idx = None


Why do you need all three of these? Surely you only need one identifier (since output and eq_cons are mutually exclusive)?

We probably only need two of these - I think having three stems from the original implementation being based on providing the name of the constraint, and even then we probably did not need to store the name, just the index.

jsiirola · 2026-05-29T02:42:58Z

+    implicit_constraint_id : str, optional
+        The identifier for this implicit constraint or output variable in
+        the external model.


I agree that we should avoid strings, and the more I think about it, the more I think the id should be an integer.

jsiirola · 2026-05-29T02:44:09Z

+            )
+        elif self._constraint_id in ext_model.output_names():
+            self._output_idx = ext_model.output_names().index(self._constraint_id)
+        else:


What happens if there is a name in both the constraint names and the output names?

@Robbybp brought that up as well, and from the point of view of this PR at least maybe we don't care; if we use the separate the arguments into output and constraint index instead of name we can just use that directly.

jsiirola · 2026-05-29T02:50:58Z

+        3. Only inputs with a numerically non-zero Jacobian entry are
+           considered incident.
+
+        Option 2 is used here as it is the most general.


Should this be configurable? Also, I would think that option 1 would be the most general (or at least the least likely to incorrectly omit an incident variable)?

I requested that the sparsity pattern of the Jacobian be used. This matches the convention in Incidence Analysis and is what we want for structural singularity analysis.

Originally I did have it as configurable, but that added more complexity, both here and downstream as we now needed a way for the user to tell us which approach to use. I did also use option 1 originally myself - option 2 is the most powerful, but does rely on the writer of the grey box doing things a specific way.

jsiirola · 2026-05-29T02:59:23Z

+logger = logging.getLogger('pyomo.contrib.pynumero')
+
+
+JAC_ZERO_TOLERANCE = 1e-8


This appears to not be used?

jsiirola · 2026-05-29T03:03:11Z

+        # EGB variables cannot be fixed — the solver interface will raise a
+        # cryptic error at solve time if any are.  Check here to surface the
+        # problem early with an actionable message.


This seems like a strange place for this test. The normal solver workflow does not trigger this method, so the "sensible message" won't be seen.

Also, why are fixed variables not allowed? That seems like a bug.

Firstly - it is a known limitation of the EGB that it does not support directly fixing any of the Var's that it links to. @Robbybp might be able to give a deeper explanation, but I suspect it is becasue allowing fixed variables would mean the logic in the EGB would have to determine if a Var should be included in the matricies it emits.

This check is then a result of this - I do agree it is a weird spot for it and I hesitated to add it. I added it here due to the possibility of including the include_fixed argument; there is no reason we cannot do that here, but it would result in an issue later so this was added as an early warning.

jsiirola · 2026-05-29T03:05:41Z

+        # Placeholder for body
+        self._body = None
+
+    def _validate_implicit_constraint_id(self):


This does not seem like a useful method? If the id isn't in the maps, the __init__ should just trigger a KeyError?

Robbybp · 2026-05-29T04:13:09Z

Is the answer that we clean this up as best we can and merge it under the umbrella of "it's going into contrib" and then defer the cleanup of this functionality until some point in the future?

I agree that this PR is bigger and more complicated than it needs to be. I don't want to see it merged until it is, at a minimum, significantly fewer lines of code.

I don't think we need to "fix EGB" in this PR, but I also don't want to defer cleanup of the new code because it's "just contrib." I plan to keep reviewing and working on this bit-by-bit.

andrewlee94 · 2026-05-29T19:40:03Z

@Robbybp are there any specific bits you think should be removed at this point?

@jsiirola Do you think it is worth making the EGBConstraintBody use NumericExpression? I am slightly concerned that that could add a little more complexity (although it might end up reducing it too).

andrewlee94 added 17 commits February 5, 2026 08:03

First pass at ExternalGreyBoxConstraint

0b1fe30

Initial testing

92265d7

More testing

cc4b693

First pass at incidence analysis integration

f67e0c4

More testing

0a8f340

Formatting and linting

952d2ff

Residual calculation for output variables

85253e9

More integration testing

e9e7007

Fix bug in CondensedSparseSummation

e2628d6

Fixing bug with external vars

8970a57

Fix bug with component list with EGB

1cb696c

Minor bug fix

e77845b

Adding copyright and acknowledgement

8f3cce2

Fixing conflicts and updating headers

e694391

Running correct version of black

01818ca

Fixing formatting issues

66eab7f

Adding missing import

2a3fd7a

Robbybp reviewed Mar 10, 2026

View reviewed changes

blnicho requested review from blnicho and jsiirola March 10, 2026 18:55

andrewlee94 added 3 commits March 13, 2026 10:27

Adding import guards to tests

f3bb8fd

Formatting

1fa3ce6

Moving numpy check before import

55b6434

Robbybp reviewed Mar 16, 2026

View reviewed changes

Revising get_incident_variables

01df76b

Fixing missed test

48a4a5d

andrewlee94 marked this pull request as ready for review May 14, 2026 19:01

mrmundt requested changes May 21, 2026

View reviewed changes

jsiirola reviewed May 22, 2026

View reviewed changes

Merge branch 'main' of https://github.com/Pyomo/pyomo into implicit_c…

41183b5

…onstraint

Robbybp suggested changes May 23, 2026

View reviewed changes

Robbybp reviewed May 23, 2026

View reviewed changes

Robbybp added 6 commits May 24, 2026 09:48

compress tests for EGBConstraints constructed by the EGBB

9d10972

test constraints when existing inputs and outputs are used

cfcacd8

remove superfluous tests

3438da7

remove unused MyGreyBox

a0e6e54

remove PyomoGreyBoxNLP tests

d6fd5cc

move EGBConstraint tests to test_external_grey_box_model

17408a7

andrewlee94 and others added 8 commits May 25, 2026 08:21

Merge pull request #2 from Robbybp/rp/implicit-constraint

d3c8804

Refactor EGBB constraint tests

Merge branch 'implicit_constraint' of https://github.com/andrewlee94/…

8d538c4

…pyomo into implicit_constraint

mrmundt's comments

024ab02

jsiirola's easy comments

78b7097

Wekref and construction args

a797b71

Addressing comments about EGB arguments

f2d04ed

Formatting

e20add7

Catch some edge cases

12519c5

Robbybp reviewed May 25, 2026

View reviewed changes

andrewlee94 and others added 2 commits May 26, 2026 05:33

Removing constraint_datas method

a273f6c

Merge branch 'main' into implicit_constraint

6d61e09

jsiirola reviewed May 29, 2026

View reviewed changes

		for input_name in ext_model.input_names():
		incident_variables.append(self._parent_model.inputs[input_name])

		implicit_constraint_id
		The identifier for this implicit constraint or output variable

		logger = logging.getLogger('pyomo.contrib.pynumero')


		JAC_ZERO_TOLERANCE = 1e-8

Conversation

andrewlee94 commented Mar 9, 2026

Fixes None.

Summary/Motivation:

Changes proposed in this PR:

Legal Acknowledgement

Uh oh!

Robbybp left a comment

Choose a reason for hiding this comment

Uh oh!

andrewlee94 commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Robbybp left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andrewlee94 Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andrewlee94 commented Mar 23, 2026

Uh oh!

andrewlee94 commented Mar 23, 2026

Uh oh!

andrewlee94 commented Mar 23, 2026

Uh oh!

blnicho commented May 14, 2026

Uh oh!

Robbybp commented May 14, 2026

Uh oh!

blnicho commented May 21, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jsiirola left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andrewlee94 commented Mar 10, 2026 •

edited

Loading

andrewlee94 Mar 16, 2026 •

edited

Loading