Separate `State` and `StoppingCriterionState` initialization #9

lkdvos · 2025-12-18T01:26:04Z

This is a slight refactor of the initialization procedure, where I did the following

Separate out the initialization of the state and stopping state, such you don't need to remember to call initialize_state! on the stopping state.
Update the docs to include a custom stopping implementation with and without stateful stopping criterion
Update the docs and tests to no longer reset the iterate in the initialize_state! calls, and only generate a random starting point in initialize_state as this led to confusion in solve! without initialize_state! #8. The idea is that initialize_state! really means making sure everything is set up to correctly start running the algorithm, which does not necessarily include a reset of the iterate but rather means setup like caches, counters, auxiliary memory etc.

lkdvos · 2025-12-18T01:26:48Z

@mtfishman, does this align slightly better with what you had in mind?

codecov · 2025-12-18T01:27:52Z

Codecov Report

❌ Patch coverage is 78.26087% with 5 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@c0ad8e3). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
src/stopping_criterion.jl	70.00%	3 Missing ⚠️
src/interface/stopping.jl	77.77%	2 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main       #9   +/-   ##
=======================================
  Coverage        ?   57.70%           
=======================================
  Files           ?        6           
  Lines           ?      227           
  Branches        ?        0           
=======================================
  Hits            ?      131           
  Misses          ?       96           
  Partials        ?        0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

lkdvos · 2025-12-18T01:37:05Z

https://juliamanifolds.github.io/AlgorithmsInterface.jl/previews/PR9

kellertuer · 2025-12-18T05:57:19Z

Oh, this is quiiiiiite some rework. I do not get understand what changed in practice though. It seems like now it's random whether initialise_state has an actual (algorithm) state or a stopping criterion state as third parameter? Is that the new part?

Or phrased differently: one now has to pass a sc-state to the init, but where is that from / initialised before?

lkdvos · 2025-12-18T12:10:41Z

Apologies, I could have explained this a whole lot better.

The main goal was to separate out state initialization from stopping state initialization, since I think it feels slightly awkward that you have to remember that when you are implementing an algorithm, you have to remember to also call initialize_state(!) for the stopping criterion.
Since this is something that always has to happen anyways, I tried separating that out into initialize_stopping_state! and initialize_state! in the solve! function, such that this is handled automatically.

Then I wanted to do the same thing for initialize_state and initialize_stopping_state, but since we have to attach the stopping_criterion_state into the generate state, I still have to pass that as an argument to the initialize_state, which then takes the stopping_criterion_state and generates a state from that.

Walking through the flow of solve(::Problem, ::Algorithm), we therefore have:

initialize_stopping_state(problem, algorithm, stopping_criterion = algorithm.stopping_criterion) -> stopping_criterion_state
initialize_state(problem, algorithm, stopping_criterion_state) -> state
enter solve!
initialize_stopping_state!(problem, algorithm, state, stopping_criterion = algorithm.stopping_criterion, stopping_criterion_state = state.stopping_criterion_state)
initialize_state!(problem, algorithm, state)
remainder of the algorithm

I can see the confusion with the third parameter being a state or stopping_criterion_state, but this is mostly because the signatures of the ! methods don't line up nicely with their non-mutating counterparts, and I still wanted to pass on all the parameters to the different functions in case any implementation requires access to that.

Finally, I wanted to not disallow workflows where the initialization of the stopping criterion state and of the regular state have to be intertwined and cannot be cleanly separated out.
To achieve this, I am only calling initialize_state(problem, algorithm) in the solve call, and have a default implementation that dispatches to initialize_state(problem, algorithm, initialize_stopping_state(problem, algorithm)).

docs/src/interface.md

Co-authored-by: Matt Fishman <mtfishman@users.noreply.github.com>

mtfishman · 2025-12-18T15:18:42Z

docs/src/interface.md

+function AlgorithmsInterface.initialize_state(
+        problem::SqrtProblem, algorithm::HeronAlgorithm,
+        stopping_criterion_state::StoppingCriterionState;
+        kwargs...
+    )
+    x0 = rand()


It's a small change, but in practice what I've been doing is writing the definition of initialize_state more like this:

function AlgorithmsInterface.initialize_state( problem::SqrtProblem, algorithm::HeronAlgorithm, stopping_criterion_state::StoppingCriterionState; x0 = rand() )

Maybe I was being dense, but it took me some time to realize that was a "valid" way to define it and still have control over the parts of the state initialization that I wanted to control, since that allows running solve as:

solve(SqrtProblem(16.0), HeronAlgorithm(StopAfterIteration(10)); x0 = 1.0)

I think seeing the example written that way would have helped me understand how I should define initialize_state.

However, I realized a slightly awkward thing about defining initialize_state in that way and then calling solve as solve(SqrtProblem(16.0), HeronAlgorithm(StopAfterIteration(10)); x0 = 1.0) is that then x0 gets passed to both initialize_state and initialize_state!. Maybe that's not a problem in practice but I found it to be a bit strange (i.e. should initialize_state! use x0 or not?), and also made me confused about the roles of intialize_state vs. initialize_state!. I think the suggestion in this PR of just having initialize_state! reset iteration helps clarify that issue a bit.

The comment about the kwargs is a really good one, I was looking at this a bit before and it's somewhat difficult to come up with a way to split arbitrary keywords generically, but it could even make sense to just not pass the keyword arguments to initialize_state! anymore if they have already been passed to initialize_state?

I was thinking about that, indeed maybe it makes sense to not pass the keyword arguments to initialize_state!. That could make the distinction between initialize_state and initialize_state! clearer, since then initialize_state handles external inputs (i.e. initial guesses for the iterate) while initialize_state! just handles resetting the "internal" state, such as the iteration number and stopping criterion state. Calling solve! means that you made the state already, so it seems like anything handled through keyword arguments to initialize_state! could instead be handled by just modifying the state directly (i.e. before calling solve!/initialize_state!).

Hm, I disagree here, for me functions of same name should do the same and accept the same keywords. Or to be more precise

initialize_state should create a state from new memory but also
initialize_state! applied to the same memory state should set it to the same situation as well.

kellertuer · 2025-12-19T06:40:11Z

I think for me all this is getting a bit too much.

My original goal was to maybe integrate Manopt a bit better into the Julia world so people could use it. That failed with JuMP and it also failed with Optimization.
This package here was the idea to integrate it differently, but by now I noticed, the form we ended up with would mean to basically rewrite Manopt from scratch. That is some 10k lines of code.
I did rewrite LieGroups.jl last winter and I am currently rewriting the Manifolds.jl tests. Afterwards everything is a bit nicer, but it takes a huuuuge effort of time.
I will not do that further I think. So Manopt will probably not switch to this. You seem to have super good ideas and use cases already, so should we then maybe move it to some TensorKit namespace and you continue this overall?

I do not mean this in any negative way, I think it was nice to ponder about these ideas a bit, I just notice that I do not have the capacity to continue here - neither in motivation nor time. Nor does it look like I will be able to afford any future JuliaCons anyways (celebrating 10 years without grants by now).

Let me know what you think. Without me you can also easily follow the ideas above of doing something completely different in the mutating versus non mutating variants of functions and such.

lkdvos · 2026-01-02T08:45:08Z

I completely understand what you mean, that really is a significant amount of dev time that at least for the case of Manopt has a smaller return on investment as you already have most of the features working.
If you think this is better, I would happily "adopt" this repository, we could for example move it to the QuantumKitHub organization and we'll see where to go from there.
In any case I am grateful for the discussions and your input, I know that that, in combination with the inspiration from the Manopt ecosystem has at least greatly helped me get a grasp on what I would want from this and how to achieve it.

As you may have noticed, for me as well this is something that I am working on slightly sporadically, but at the very least I am not too happy with the current setup in a large part of our ecosystems so this is less of a refactor and more of a feature improvement, so there is slightly more motivation for that.

kellertuer · 2026-01-02T11:05:22Z

Just to be clear here:

I still think this is a neat idea and nice to have
I still think the design we ended up with is quite nice and flexible

Just compared to my initial idea (“extract” the main types from Manopt and make it a package), the current design would really mean a major major rework of Manopt.jl.
I do not mind reworks, but I feel I mainly did that recently.

I also value our discussions, I think we got to something really nice. Just that I lack a bit of motivation. If you are fine with that, we can also keep it here in the JuliaManifold ecosystem, just that for now I do not see that Manopt will be rewritten to that; maybe unless there pops up some large argument that I find the motivation suddenly; but its a refactor of (I think) about 70% of the code lines.
So I do not want to give a wrong impression, when the package lives here. But if it is fine with you we can also just keep it here :)

I am also happy to help in discussions – just for now I am not sure what else I could (find the motivation to) contribute.

lkdvos · 2026-01-02T23:56:35Z

Ah, I misunderstood then, I thought you had wanted to migrate the package. I'm totally fine with keeping it here, I guess it just wasn't totally clear whether you wanted to not have to spend any time on this, or simply be kept up to date, or feel like chiming in on the design as well. Obviously I prefer to have you on board, I just don't want to force you to spend time you don't have 😉.

kellertuer · 2026-01-03T11:44:57Z

Ok. You are right I was thinking for a moment about moving the repository, since as of now it will probably have no relation to the org it is in. But well, that is also not too bad.
I think I currently miss the motivation to contribute code here.

For now, I can help in discussions though not here, I do not follow the changes and what they mean for how the interface is used. What I do not like is that there are now two functions, initialize_state and initialize_state!that- as far as I follow along here, do something completely different, though they have the same name? I think that should not be the case.

Having the same name for me means: They should do exactly the same things, just that the first creates a state and does the things, while the second does the same things on an already existing state. That should include all arguments and keywords and everything the functions do.

mtfishman · 2026-01-03T16:39:42Z

@kellertuer I agree in general that x = f(args...; kwargs...) and f!(x, args...; kwargs...) should result in the same x. I think what threw me off about the current design is that when you call solve, both initialize_state and initialize_state! get called, which doesn't seem right to me, especially if they do some non-trivial work. My initial thought was that maybe initialize_state! could just not be called from solve! (after all, users can call it manually before calling solve! if they want to), but then @lkdvos pointed out to me that it is important to reset the stopping criterion state, so that led to the idea of having a separate initialize_stopping_state!. Maybe we could go with a design where only initialize_stopping_state! gets called from solve!?

kellertuer · 2026-01-04T05:43:10Z

Hm, and that is where – as I wrote above – I by now feel lost in the current design. In my head (and what Manopt.jl does)

solve would either call initialize_stateor if it gets a state from somewhere else initialise_state!. I would prefer the first. It would never call both
solve! definitely has a state so it would call initialize_state

In my head/design idea this initialisation of a state would also care for initialising a stopping criterion – so it would either care for that implicitly or call your new function idea. I personally would not have gone for that new function, but that depends on whether such a reset is needed in other places.

That is all I can say because somewhere along the way I lost how the code is intended to work and no longer understand, why you claim that both are called (that should not be the case) and where that is happening.
This is one of the origins (besides me lacking motivation) why I started the discussion of me potentially leaving here.

mtfishman · 2026-01-05T14:17:06Z

In the code on the main branch, solve calls initialize_state:

AlgorithmsInterface.jl/src/interface/interface.jl

Line 33 in c0ad8e3

state = initialize_state(problem, algorithm; kwargs...)

and then calls solve!, which calls initialize_state!:

AlgorithmsInterface.jl/src/interface/interface.jl

Line 52 in c0ad8e3

initialize_state!(problem, algorithm, state; kwargs...)

. So if you call solve, both initialize_state and initialize_state! get called.

kellertuer · 2026-01-05T14:25:17Z

Thanks for the links. Hm, yeah, the first one is more like some allocate_state, but for the case where you call solve, this mainly really just results in “resetting” the state twice. Is that that bad?

Then we should do a allocate_state function that is used by

solve (before calling solve!)
initialize_state before calling initialise_state

I think that is a clean way to distinguish “getting” (allocating) a state (where the actual content of the variables is not necessarily deterministic even) and “(re)setting” a state to start the algorithm.

edit: I personally feel this is maybe a bit over-engineering, since just resetting twice should not be that bad, but if you want to be super precise and exact, sure...

mtfishman · 2026-01-05T14:56:00Z

I personally found it to be confusing when I was trying to understand the interface, and it made me feel like I should somehow be defining initialize_state! and initialize_state in different ways from each other. Also it doesn't seem great if initialize_state[!] involves some non-trivial work, though I guess in general it should be a subleading cost in the algorithm. An alternative could be:

function solve(problem::Problem, algorithm::Algorithm; kwargs...)
    state = initialize_state(problem, algorithm; kwargs...)
    return solve_initialized!(problem, algorithm, state)
end

function solve!(problem::Problem, algorithm::Algorithm, state::State; kwargs...)
    initialize_state!(problem, algorithm, state; kwargs...)
    return solve_initialized!(problem, algorithm, state)
end

# Like `solve!` but without calling `initialize_state!`.
function solve_initialized!(problem::Problem, algorithm::Algorithm, state::State)
    logger = algorithm_logger()
    emit_message(logger, problem, algorithm, state, :Start)
    while !is_finished!(problem, algorithm, state)
        emit_message(logger, problem, algorithm, state, :PreStep)
        increment!(state)
        step!(problem, algorithm, state)
        emit_message(logger, problem, algorithm, state, :PostStep)
    end
    emit_message(logger, problem, algorithm, state, :Stop)
    return state
end

I would find that useful anyway since I could see cases where I know my state is initialized in a specific way already, so I might prefer to call solve_initialized! directly.

lkdvos · 2026-01-05T15:04:37Z

I think @mtfishman's suggestion makes sense, and this is mostly why I wanted to try out splitting the initialization of the stopping state from the initialization of the state itself, as I would argue that solve_initialized might still have to call the initialization on the stopping state.
To make things concrete, let me summarize some of my thoughts.

I fully agree that we should make sure that functions that have the same name (possibly up to a !) should do the same thing, so we should just make sure that this is the case. In particular, I quite like Matt's suggestion about this to make sure that we don't call this function twice, since typically I would think that you'd want to implement initialize_state by allocating and then calling initialize_state!.
In that scenario, we might not need this splitting of the initialization of the stopping criterion, but I still feel like it is slightly clunky that we really require you to always call initialize_state on the stopping criterion state from within the implementation of initialize_state on the regular state. I think however that this might be a reasonable trade-off to just deal with and make sure this is properly documented, since I also agree that this PR might just be overengineering things and make things harder to follow
At the risk of misinterpreting, I think @mtfishman's main point/use-case is the idea that we want to have the option of not initializing/resetting the starting iterate, based on the fact that we might already have a good initial guess, and need some way of passing that to the solver. In principle this can already be solved without interface changes by passing it as a kwarg to initialize_state, and we could just recommend that as the way to achieve this. The alternative would be to add a function to the interface that does exactly that, which is basically the suggested code above (where the only thing I'm a bit scared about is the fact that this doesn't initialize the stopping criterion state)

kellertuer · 2026-01-05T16:12:04Z

Nirgs.

Both the ideas of the initialize_state[!]s doing something completely different as well as the even more technical and complicated name solve_initialized to me sound very strange. We have to convince a user to then implement a so-technically-called function named solve_initialized while they are probably used to implementing solve! (e.g. in Manopt, or most of the SciML-iverse).

So for now I feel, I see that there is a problem, but I see that all solutions proposed until now (besides my idea of allocation, but that is actually something we do a lot n Manifolds.jl) both overcomplicate the complexity for us as developers as well as (and that is even worse) users of the interface later.

I slowly feel, I am just speaking against a wall here anyways, so maybe continue here with a solution that no longer includes me then.

lkdvos · 2026-01-05T16:26:03Z

I think I might have not gotten my message across all that well, since I think we all kind of agree here. The current PR just isn't really the solution we are looking for, so I will close this, and just try again in a different way, taking into account the feedback from everyone.

kellertuer · 2026-01-05T16:28:08Z

Oh, sorry, to me it felt like we are turning in circles and only over-engineer and over-complicated things here. So yes – maybe a fresh start is the best idea :)

lkdvos added 3 commits December 17, 2025 20:19

update stopping interface

8d7e228

update tests

54fa9c7

update docs

ae2c1fd

lkdvos requested a review from kellertuer December 18, 2025 01:26

mtfishman reviewed Dec 18, 2025

View reviewed changes

docs/src/interface.md Outdated Show resolved Hide resolved

Update docs/src/interface.md

aa8f9f5

Co-authored-by: Matt Fishman <mtfishman@users.noreply.github.com>

mtfishman reviewed Dec 18, 2025

View reviewed changes

lkdvos closed this Jan 5, 2026

Separate State and StoppingCriterionState initialization #9

Separate State and StoppingCriterionState initialization #9

Uh oh!

Conversation

lkdvos commented Dec 18, 2025

Uh oh!

lkdvos commented Dec 18, 2025

Uh oh!

codecov bot commented Dec 18, 2025

Codecov Report

Uh oh!

lkdvos commented Dec 18, 2025

Uh oh!

kellertuer commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lkdvos commented Dec 18, 2025

Uh oh!

Uh oh!

mtfishman Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

lkdvos Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

mtfishman Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

kellertuer Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

kellertuer commented Dec 19, 2025

Uh oh!

lkdvos commented Jan 2, 2026

Uh oh!

kellertuer commented Jan 2, 2026

Uh oh!

lkdvos commented Jan 2, 2026

Uh oh!

kellertuer commented Jan 3, 2026

Uh oh!

mtfishman commented Jan 3, 2026

Uh oh!

kellertuer commented Jan 4, 2026

Uh oh!

mtfishman commented Jan 5, 2026

Uh oh!

kellertuer commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mtfishman commented Jan 5, 2026

Uh oh!

lkdvos commented Jan 5, 2026

Uh oh!

kellertuer commented Jan 5, 2026

Uh oh!

lkdvos commented Jan 5, 2026

Uh oh!

kellertuer commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Separate `State` and `StoppingCriterionState` initialization #9

Separate `State` and `StoppingCriterionState` initialization #9

kellertuer commented Dec 18, 2025 •

edited

Loading

kellertuer commented Jan 5, 2026 •

edited

Loading