Skip to content

Releases: sdpython/onnx-diagnostic

0.8.11

23 Jan 16:28
37ab9b8

Choose a tag to compare

  • #396: fix serialization for DynamicCache with different layer classes
  • #394: add function make_model_with_local_functions to partition a model into local functions

0.8.10

16 Jan 16:43
cb87e85

Choose a tag to compare

  • #384: add weights_only=False when using torch.load

0.8.9

12 Jan 11:02
7a28903

Choose a tag to compare

  • #383: removed bool, int, float, None as input dummies for the exporter in method_to_onnx
  • #382: make the ordering of the inferred dynamic shapes more robust
  • #381: add parameter expand_batch_for to method_to_onnx
  • #378: implements the computation of discrepancies in method_to_onnx
  • #379: update the handling of cache after the removal of HybridCache, SlidingWindowCache in transformers>=5,

0.8.8

07 Jan 17:32
f4fbaa8

Choose a tag to compare

  • #375: export a method to onnx in order to export using method generate
  • #376: fix patched lazy_initialization for transformers>=5
  • #372: fix patch on rotary embedding
  • #371: fix make_fake_with_dynamic_dimensions

0.8.7

18 Dec 16:36
4c04e26

Choose a tag to compare

  • #366: add command line to optimize a model
  • #363: patch for DynamicDimConstraintPrinter
  • #360, #364: preliminary work for phi4

0.8.6

12 Dec 23:00
ddea098

Choose a tag to compare

  • #357: complete simple_loop_for, an easier to rewrite loops
  • #356: include qwen embedding part
  • #355: better command line to export models
  • #353, #354: add command line to compare two onnx models

0.8.5

09 Dec 18:36
4d706e0

Choose a tag to compare

  • #349: fixes function max_diff (parameter hist)
  • #348: add format dot, shape to command line print
  • #346: fix patch for sdpa_mask_recent_torch even if it was removed in transformers>=5.0

0.8.4

05 Dec 17:51
f677edb

Choose a tag to compare

  • #341: preliminary support to export submodule
  • #340: supports devices in onnx plugs
  • #338: fixes ReplayConfiguration.dump, add function to select of part of a model
  • #337: fixes extract_subset_of_nodes
  • #336: implements versioned onnx plugs

0.8.3

01 Dec 16:28
cccd7cd

Choose a tag to compare

  • #331: adds a helper to convert an onnx model into dot
  • #330: fixes access rope_parameters for transformers>=5
  • #329: supports lists with OnnxruntimeEvaluator
  • #326: use ConcatFromSequence in LoopMHA with the loop
  • #325: adds plug for LoopMHA, extends the unit tests to measure the discrepancies
  • #324: supports FunctionProto with arguments in OnnxruntimeEvaluator
  • #323: drops torch 2.8 on CI
  • #322: support rerunning onnx kernels with torch intermediate results in side-by-side
  • #314: fix modelbuilder download needed after this change microsoft/onnxruntime-genai#1862
  • #311: use custom and local function to use PackedMultiHeadAttention from onnxruntime
  • #310: splits patches into multiple files
  • #308: add option --save_ep to dump the exported program as well as torch input
  • #304, #306, #316, #317, #318, #319: improves side-by-side comparison, creates command line sbs

0.8.2

14 Nov 17:28
a682d15

Choose a tag to compare

  • #303: fix inputs for summarization, feature extraction tasks
  • #302: adds helpers to analyse onnxruntime profiling
  • #297: experiment around a higher ops loop_for
  • #292, #293, #294, #295: first version of new patches for Qwen models