This is a followup to the meeting with CMS last week and the meeting with CMS yesterday https://indico.cern.ch/event/1373473/
@choij1589 presented results where Drell Yan plus 4 jets shows a speedup in cudacpp vs fortran, but DY+3 does not
We should understand why CMS sees a speedup in DY+4jets but not DY+3 jets
