add ChainRulesCore rules by mileslucas · Pull Request #3 · JuliaAstro/PSFModels.jl

mileslucas · 2021-09-11T00:22:03Z

This PR adds analytical gradients using ChainRulesCore.jl.

codecov · 2021-09-11T00:23:56Z

Codecov Report

Merging #3 (eb7de41) into main (1399529) will not change coverage.
The diff coverage is n/a.

❗ Current head eb7de41 differs from pull request most recent head d46340a. Consider uploading reports for the commit d46340a to get more accurate results

@@           Coverage Diff           @@
##             main       #3   +/-   ##
=======================================
  Coverage   98.80%   98.80%           
=======================================
  Files           6        6           
  Lines          84       84           
=======================================
  Hits           83       83           
  Misses          1        1

Last update 1399529...d46340a.

mileslucas · 2021-09-15T21:51:18Z

I don't understand why the chain rule tests are failing. Let's look at the isotropic Gaussian PSF as an example.

Here is the definition of the gradient:

PSFModels.jl/src/gaussian.jl

Lines 67 to 77 in 1b026c5

    
    # isotropic
    function fgrad(g::Gaussian, point::AbstractVector)
        f = g(point)
        xdiff = first(point) - first(g.pos)
        ydiff = last(point) - last(g.pos)
        dfdpos = -2 * GAUSS_PRE * f / g.fwhm^2 .* SA[xdiff, ydiff]
        dfdfwhm = -2 * GAUSS_PRE * f * (xdiff^2 + ydiff^2) / g.fwhm^3
        dfdamp = f / g.amp
        return f, dfdpos, dfdfwhm, dfdamp
    end

I wrote this out by hand, and it can be verified against this derivation: http://umdberg.pbworks.com/w/page/88516931/Example%3A%20Gradient%20of%20a%20Gaussian
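For reference, the partials in fgrad can be reproduced in a few lines. This is only a sketch under an assumed model form: the excerpt does not show how the PSF itself is evaluated, so the expression for f below, and the symbols A (amp), w (fwhm), x_0 (pos), and c (standing in for GAUSS_PRE), are assumptions.

```latex
\begin{aligned}
f(\mathbf{x}) &= A \exp\!\left(\frac{c\,\lVert \mathbf{x} - \mathbf{x}_0 \rVert^2}{w^2}\right) \\
\frac{\partial f}{\partial \mathbf{x}_0} &= -\frac{2 c f}{w^2}\,(\mathbf{x} - \mathbf{x}_0) \\
\frac{\partial f}{\partial w} &= -\frac{2 c f\,\lVert \mathbf{x} - \mathbf{x}_0 \rVert^2}{w^3} \\
\frac{\partial f}{\partial A} &= \frac{f}{A} \\
\frac{\partial f}{\partial \mathbf{x}} &= \frac{2 c f}{w^2}\,(\mathbf{x} - \mathbf{x}_0) = -\frac{\partial f}{\partial \mathbf{x}_0}
\end{aligned}
```

Under that assumption the first three lines after the definition correspond to dfdpos, dfdfwhm, and dfdamp, and the last line is why the derivative with respect to the evaluation point is dfdpos with the sign flipped.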

Here are the chain rules:

PSFModels.jl/src/gaussian.jl

Lines 94 to 111 in 1b026c5

    
    function frule((Δpsf, Δp), g::Gaussian, point::AbstractVector)
        f, dfdpos, dfdfwhm, dfda = fgrad(g, point)
        Δf = dot(dfdpos, Δpsf.pos) + dot(dfdfwhm, Δpsf.fwhm) + dfda * Δpsf.amp
        Δf -= dot(dfdpos, Δp)
        return f, Δf
    end

    function rrule(g::G, point::AbstractVector) where {G<:Gaussian}
        f, dfdpos, dfdfwhm, dfda = fgrad(g, point)
        function Gaussian_pullback(Δf)
            ∂pos = dfdpos .* Δf
            ∂fwhm = dfdfwhm .* Δf
            ∂g = Tangent{G}(pos=∂pos, fwhm=∂fwhm, amp=dfda * Δf, indices=ZeroTangent())
            ∂pos = dfdpos .* -Δf
            return ∂g, ∂pos
        end
        return f, Gaussian_pullback
    end

Using them directly works as intended:

using ChainRulesCore, PSFModels
psf = PSFModels.Gaussian(fwhm=10)
point = [1, 2]
f, pullback = rrule(psf, point)
Δpsf, Δpoint = pullback(1.0)
f2, Δf = frule((Δpsf, Δpoint), psf, point)

# output
(0.8705505632961241, 0.7817442466933209)
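These two numbers can be cross-checked without PSFModels or ChainRulesCore. The following is a minimal sketch, not the package's code: it assumes the model is amp * exp(GAUSS_PRE * r^2 / fwhm^2) with GAUSS_PRE = -4 * log(2), and that amp = 1 and pos = (0, 0) are the defaults (assumptions, though they do reproduce the primal value quoted above). The helpers gauss, gauss_grad, and central_diff are hypothetical names introduced for illustration.

```julia
using LinearAlgebra: dot

# Assumed prefactor; its definition is not shown in the excerpts above
const GAUSS_PRE = -4 * log(2)

# Hypothetical stand-in for the isotropic Gaussian PSF, with plain numeric parameters
gauss(pos, fwhm, amp, point) = amp * exp(GAUSS_PRE * sum(abs2, point .- pos) / fwhm^2)

# Analytic partials, mirroring the expressions in fgrad
function gauss_grad(pos, fwhm, amp, point)
    f = gauss(pos, fwhm, amp, point)
    diff = point .- pos
    dfdpos = -2 * GAUSS_PRE * f / fwhm^2 .* diff
    dfdfwhm = -2 * GAUSS_PRE * f * sum(abs2, diff) / fwhm^3
    dfdamp = f / amp
    return f, dfdpos, dfdfwhm, dfdamp
end

# Central finite difference of a scalar function of one variable
central_diff(h, x; ε=1e-6) = (h(x + ε) - h(x - ε)) / (2ε)

pos, fwhm, amp, point = [0.0, 0.0], 10.0, 1.0, [1.0, 2.0]
f, dfdpos, dfdfwhm, dfdamp = gauss_grad(pos, fwhm, amp, point)

# Numerical partials, one coordinate direction at a time
basis = ([1.0, 0.0], [0.0, 1.0])
num_dfdpos = [central_diff(t -> gauss(pos .+ t .* e, fwhm, amp, point), 0.0) for e in basis]
num_dfdpoint = [central_diff(t -> gauss(pos, fwhm, amp, point .+ t .* e), 0.0) for e in basis]
num_dfdfwhm = central_diff(w -> gauss(pos, w, amp, point), fwhm)
num_dfdamp = central_diff(a -> gauss(pos, fwhm, a, point), amp)

# Mimic pullback(1.0) followed by frule: the tangents are the partials themselves,
# with the sign flipped for the evaluation point
Δpos, Δfwhm, Δamp, Δpoint = dfdpos, dfdfwhm, dfdamp, -dfdpos
Δf = dot(dfdpos, Δpos) + dfdfwhm * Δfwhm + dfdamp * Δamp - dot(dfdpos, Δpoint)

println((f, Δf))
println(maximum(abs, dfdpos .- num_dfdpos))
println(abs(dfdfwhm - num_dfdfwhm))
```

If the assumptions hold, the printed tuple should agree with the output quoted above and the finite-difference discrepancies should be tiny, which would suggest the hand-written partials are not the source of the test failures.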

But using test_frule and test_rrule consistently fails:

PSFModels.jl/test/runtests.jl

Lines 99 to 111 in 1b026c5

    
    @testset "gradients" begin
        # have to make sure PSFs are all floating point so tangents don't have type issues
        psf_iso = Gaussian(fwhm=10.0, pos=zeros(2))
        psf_tang = Tangent{Gaussian}(fwhm=rand(rng), pos=rand(rng, 2), amp=rand(rng), indices=ZeroTangent())
        point = Float64[1, 2]
        test_frule(psf_iso ⊢ psf_tang, point)
        test_rrule(psf_iso ⊢ psf_tang, point)
        psf_diag = Gaussian(fwhm=Float64[10, 8], pos=zeros(2))
        psf_tang = Tangent{Gaussian}(fwhm=rand(rng, 2), pos=rand(rng, 2), amp=rand(rng), indices=ZeroTangent())
        test_frule(psf_diag ⊢ psf_tang, point)
        test_rrule(psf_diag ⊢ psf_tang, point)
    end

abhro · 2026-01-10T03:52:59Z

The merge commits mostly tried to make the code runnable, but I don't think it still works with ChainRulesCore's newer API. The code needs to be reworked to comply with it.

add chainrulescore and chainrulestestutils

eb7de41

mileslucas added 7 commits September 15, 2021 11:04

write up gradients for Gaussian PSF

8e2db5e

add compat

7ea628d

add chainrules testing packages

1a2546c

add printing and tests for all models

6cca536

reorganize tests

1e5489a

fix project mistype

367fd35

add debug mode

1b026c5

mileslucas and others added 3 commits September 16, 2021 13:08

add finitedifferences snippet for frule tests

d46340a

Merge remote-tracking branch 'origin/main' into ml/grads

ba4e739

Merge remote-tracking branch 'origin/main' into ml/grads

537cf3c

Add compat bounds for LinearAlgebra

fe2c50e

mileslucas wants to merge 12 commits into main from ml/grads
