At NeurIPS, Michael Hall took the stage in the Exhibit Hall to showcase “Arm AI Acceleration in Action: ExecuTorch + ONNX on Mobile and Edge.” The session walked through how Arm optimizations translate directly into real performance gains for AI workloads, running side-by-side LLM inference and computer vision models on SME2-enabled Arm CPUs and Android devices.

Key takeaways:
➡️ Arm-optimized ExecuTorch and XNNPACK significantly boost performance on mobile and edge devices
➡️ A clear optimization pipeline researchers can replicate (a sketch of the export flow is below)
➡️ Practical, reproducible workflows for running LLMs and CV models efficiently on Arm-based devices

A huge thank you to everyone who joined the session. If you missed it, stop by booth #622 to see our demos in action!
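For anyone who wants to reproduce the pipeline, here is a minimal sketch of the standard ExecuTorch export flow with the XNNPACK backend (the delegate that provides Arm-optimized CPU kernels). The model choice (torchvision’s mobilenet_v2) and output file name are just placeholders, not from the session, and exact API details can vary across ExecuTorch releases, so check the docs for your version:

```python
import torch
import torchvision.models as models

from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner
from executorch.exir import to_edge_transform_and_lower

# Placeholder CV model; swap in your own eval-mode nn.Module.
model = models.mobilenet_v2(weights=models.MobileNet_V2_Weights.DEFAULT).eval()
example_inputs = (torch.randn(1, 3, 224, 224),)

# Capture the model graph with torch.export, then lower the supported ops
# to the XNNPACK delegate, which dispatches to Arm-optimized kernels.
exported = torch.export.export(model, example_inputs)
et_program = to_edge_transform_and_lower(
    exported,
    partitioner=[XnnpackPartitioner()],
).to_executorch()

# Serialize to a .pte file that the ExecuTorch runtime loads on-device.
with open("mv2_xnnpack.pte", "wb") as f:
    f.write(et_program.buffer)
```

On the device side, the generated .pte file is loaded by the ExecuTorch runtime, e.g. via the Android package or the C++ Module API.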
A very good tutorial on deploying deep learning networks on Android edge devices: https://www.jkuse.com/dltrain/deploy-dl-networks/edge-native-service/j7-app
Is this for training or inference?
Thanks for the post. Could you please share some more links on AI at Arm, especially for people who are already familiar with PyTorch?