model_qint8_arm64.onnx is not working for aarch64 architecture. #6666
narendra9079 asked this question in Q&A (unanswered).
I am trying to run the quantized model https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2/blob/main/onnx/model_qint8_arm64.onnx on the aarch64 architecture, but it is unable to generate embeddings: it fails when I call `session.run()`. The avx2 ONNX variant of the same model runs fine on x86_64 Linux. I am using these versions of the ONNX Runtime bindings:

`ort = { version = "=2.0.0-rc.0", default-features = false, features = ["ndarray"], optional = true }`
`ort-sys = { version = "=2.0.0-rc.0", optional = true }`

Can you please advise?
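For context, a minimal repro of the failing path looks roughly like the sketch below. This is a sketch only: the builder/method names (`Session::builder`, `commit_from_file`, the `inputs!` macro) are taken from the ort v2 documentation and may differ between release candidates, and the input tensor names (`input_ids`, `attention_mask`) are assumptions based on the usual sentence-transformers ONNX export. It also assumes `ndarray` is a direct dependency alongside `ort`.

```rust
// Minimal repro sketch for the failing session.run() call.
// Assumptions (not confirmed against this exact model):
//  - the model's inputs are named "input_ids" and "attention_mask"
//    (some exports also require "token_type_ids");
//  - ort = "2.0.0-rc.x" with the "ndarray" feature, plus ndarray itself.
use ndarray::Array2;
use ort::{inputs, Session};

fn main() -> ort::Result<()> {
    // Loading the qint8 arm64 model; this is the step after which
    // run() fails on aarch64 in my setup.
    let session = Session::builder()?
        .commit_from_file("model_qint8_arm64.onnx")?;

    // Dummy tokenized input for a single short sequence (batch=1, len=8).
    let input_ids = Array2::<i64>::zeros((1, 8));
    let attention_mask = Array2::<i64>::ones((1, 8));

    let _outputs = session.run(inputs![
        "input_ids" => input_ids.view(),
        "attention_mask" => attention_mask.view(),
    ]?)?;

    println!("session.run() succeeded");
    Ok(())
}
```

Swapping the model path for the avx2 variant, the same code runs fine on x86_64; only the qint8 arm64 model fails on aarch64.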