Added column-major storage of weights and scales in INT4 quantization for model load time improvement in TRT-RTX #811
+215
−3
Loading