Releases · sdpython/onnx-diagnostic · GitHub

23 Jan 16:28

sdpython

0.8.11 Latest

#396: fix serialization for DynamicCache with different layer classes
#394: add function make_model_with_local_functions to partition a model into local functions

16 Jan 16:43

0.8.10

#384: add weights_only=False when using torch.load
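
For context on #384: since PyTorch 2.6, torch.load defaults to weights_only=True, which restricts unpickling to an allowlist of types, so checkpoints holding arbitrary Python objects need weights_only=False. The sketch below mimics that idea with the standard library only. It is not PyTorch's implementation; load_sketch, RestrictedUnpickler and ALLOWED_GLOBALS are hypothetical names chosen for illustration.

```python
import io
import pickle
from fractions import Fraction

# Hypothetical allowlist: only these (module, name) pairs may be unpickled
# in restricted mode.
ALLOWED_GLOBALS = frozenset({("collections", "OrderedDict")})


class RestrictedUnpickler(pickle.Unpickler):
    """Refuses to resolve any class or function outside the allowlist."""

    def find_class(self, module, name):
        if (module, name) in ALLOWED_GLOBALS:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"{module}.{name} is not allowed")


def load_sketch(data: bytes, weights_only: bool = True):
    """Hypothetical loader: restricted by default, unrestricted on request."""
    if weights_only:
        return RestrictedUnpickler(io.BytesIO(data)).load()
    # Unrestricted unpickling can run arbitrary code: trusted files only.
    return pickle.loads(data)


plain = pickle.dumps({"w": [1.0, 2.0]})   # plain containers and floats
custom = pickle.dumps(Fraction(1, 2))     # needs a class outside the allowlist

restored = load_sketch(plain)
try:
    load_sketch(custom)
    rejected = False
except pickle.UnpicklingError:
    rejected = True
unrestricted = load_sketch(custom, weights_only=False)
```

Plain containers load in restricted mode, while the object that needs an unlisted class only loads once the restriction is lifted. Lifting it should be reserved for files from a trusted source.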

12 Jan 11:02

0.8.9

#383: removed bool, int, float, None as input dummies for the exporter in method_to_onnx
#382: make the ordering of the inferred dynamic shapes more robust
#381: add parameter expand_batch_for to method_to_onnx
#378: implements the computation of discrepancies in method_to_onnx
#379: update the handling of caches after the removal of HybridCache and SlidingWindowCache in transformers>=5

07 Jan 17:32

0.8.8

#375: export a method to onnx so that a model can be exported through its method generate
#376: fix patched lazy_initialization for transformers>=5
#372: fix patch on rotary embedding
#371: fix make_fake_with_dynamic_dimensions

18 Dec 16:36

0.8.7

#366: add command line to optimize a model
#363: patch for DynamicDimConstraintPrinter
#360, #364: preliminary work for phi4

12 Dec 23:00

0.8.6

#357: complete simple_loop_for, an easier way to rewrite loops
#356: include qwen embedding part
#355: better command line to export models
#353, #354: add command line to compare two onnx models

09 Dec 18:36

0.8.5

#349: fixes function max_diff (parameter hist)
#348: add formats dot and shape to command line print
#346: fix patch for sdpa_mask_recent_torch even if it was removed in transformers>=5.0
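
To illustrate the kind of discrepancy measurement that #349 touches, here is a minimal sketch of a maximum absolute and relative difference between expected and obtained values. It is not the library's max_diff: the name max_diff_sketch, its signature, the returned keys and the epsilon are assumptions for illustration, and the hist parameter mentioned above is not modelled.

```python
from typing import Dict, Sequence


def max_diff_sketch(
    expected: Sequence[float], got: Sequence[float], eps: float = 1e-5
) -> Dict[str, float]:
    """Hypothetical helper: largest absolute and relative discrepancies."""
    if len(expected) != len(got):
        raise ValueError("both sequences must have the same length")
    max_abs = 0.0
    max_rel = 0.0
    for e, g in zip(expected, got):
        diff = abs(e - g)
        max_abs = max(max_abs, diff)
        # eps avoids a division by zero when the expected value is 0.
        max_rel = max(max_rel, diff / (abs(e) + eps))
    return {"abs": max_abs, "rel": max_rel, "n": float(len(expected))}


result = max_diff_sketch([1.0, 2.0, 4.0], [1.0, 2.5, 4.0])
```

Here the only mismatch is 2.0 against 2.5, so the absolute discrepancy is 0.5 and the relative one is close to 0.25.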

05 Dec 17:51

0.8.4

#341: preliminary support for exporting submodules
#340: supports devices in onnx plugs
#338: fixes ReplayConfiguration.dump, adds a function to select part of a model
#337: fixes extract_subset_of_nodes
#336: implements versioned onnx plugs

01 Dec 16:28

0.8.3

#331: adds a helper to convert an onnx model into dot
#330: fixes access rope_parameters for transformers>=5
#329: supports lists with OnnxruntimeEvaluator
#326: use ConcatFromSequence in LoopMHA with the loop
#325: adds plug for LoopMHA, extends the unit tests to measure the discrepancies
#324: supports FunctionProto with arguments in OnnxruntimeEvaluator
#323: drops torch 2.8 on CI
#322: support rerunning onnx kernels with torch intermediate results in side-by-side
#314: fix the modelbuilder download, needed after the change in microsoft/onnxruntime-genai#1862
#311: use custom and local functions to call PackedMultiHeadAttention from onnxruntime
#310: splits patches into multiple files
#308: add option --save_ep to dump the exported program as well as torch input
#304, #306, #316, #317, #318, #319: improves side-by-side comparison, creates command line sbs
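
As a rough illustration of what converting a model graph into dot (#331) involves, the sketch below renders a list of nodes as Graphviz DOT text. It does not use the library's helper, whose name and signature are not given here. Nodes are plain (op_type, inputs, outputs) tuples rather than ONNX protos, and graph_to_dot is a hypothetical function.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical node description: (op_type, input names, output names).
Node = Tuple[str, Sequence[str], Sequence[str]]


def graph_to_dot(nodes: List[Node]) -> str:
    """Renders nodes as DOT: one box per node, one edge per consumed tensor."""
    lines = ["digraph model {"]
    producer: Dict[str, int] = {}
    # First pass: declare the nodes and remember which node produces each tensor.
    for index, (op_type, _inputs, outputs) in enumerate(nodes):
        lines.append(f'  n{index} [label="{op_type}", shape=box];')
        for name in outputs:
            producer[name] = index
    # Second pass: connect every input to its producer when there is one.
    for index, (_op_type, inputs, _outputs) in enumerate(nodes):
        for name in inputs:
            if name in producer:
                lines.append(f'  n{producer[name]} -> n{index} [label="{name}"];')
            else:
                # No producer: the tensor is a graph input or an initializer.
                lines.append(f'  "{name}" -> n{index};')
    lines.append("}")
    return "\n".join(lines)


dot_text = graph_to_dot(
    [
        ("MatMul", ["X", "W"], ["XW"]),
        ("Add", ["XW", "B"], ["Y"]),
    ]
)
```

The resulting text can be saved to a file and rendered with Graphviz.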

14 Nov 17:28

0.8.2

#303: fix inputs for summarization, feature extraction tasks
#302: adds helpers to analyse onnxruntime profiling
#297: experiment around a higher-order op loop_for
#292, #293, #294, #295: first version of new patches for Qwen models
