NPU Performance Analysis Engineer

Cambridgeshire, England


Competitive base, hybrid working, bonus scheme


NPU Performance Analysis Engineer

Cambridge, UK (Hybrid)

IC Resources is seeking an NPU Performance Analysis Engineer to join our client's team in Cambridge. The successful candidate will be instrumental in identifying and addressing performance bottlenecks in AI accelerator designs, collaborating with multi-site hardware and software teams to enhance system features and make data-driven decisions for next-generation AI accelerators.

Primary Responsibilities:
  • Performance Analysis: Investigate and identify performance bottlenecks in the Neural Processing Unit (NPU) system using performance simulation tools, and make recommendations to software and hardware teams for improvements.
  • Collaboration: Work closely with engineering teams across multiple locations to analyse and resolve current engineering challenges, providing data-driven insights and recommendations.
  • Benchmark Development: Create synthetic benchmarks that target specific architectural features, facilitating the evaluation and optimisation of next-generation NPU designs.
  • Design Space Exploration: Partner with hardware design engineers to explore and refine design options for future NPUs, leveraging performance data to inform architectural enhancements.
  • Compiler Optimisation: Collaborate with compiler engineers to offer optimisation suggestions, enhancing the overall performance of the NPU system.
  • Infrastructure Development: Lead the performance team in developing infrastructure that enables thorough inspection and root-cause analysis of performance issues.
Essential Experience:
  • System Performance Analysis: Over 7 years of experience in system performance analysis, with a proven track record of identifying bottlenecks and making software and/or hardware optimisation recommendations.
  • ML Model Application: Experience in applying and optimising machine learning models, particularly in areas such as Vision, Language, or Speech.
  • Hardware Knowledge: Strong understanding of performance or functional modeling for NPUs, CPUs, or GPUs.
  • Programming Skills: Proficiency in machine learning frameworks like PyTorch or TensorFlow and programming languages such as C++ and Python.
  • Communication Skills: Excellent written and verbal communication skills, with the ability to convey complex technical information effectively.
Desired Experience:
  • Advanced Optimisation: Hands-on experience with optimising ML models for performance improvements in various application domains.
  • Collaborative Projects: Proven experience working collaboratively with cross-functional engineering teams to solve complex performance issues.
  • Infrastructure Leadership: Experience in leading teams to develop robust performance analysis and optimisation infrastructure.
What’s On Offer:
  • Competitive Salary: A base salary that reflects your expertise and experience in the field of performance analysis.
  • Professional Development: Opportunities for continuous learning and professional growth within a dynamic and innovative environment.
  • Collaborative Work Environment: A chance to work in a team that values creativity, data-driven decision-making, and cross-functional collaboration.
  • Cutting-Edge Projects: Involvement in the design and development of next-generation AI accelerators, with a significant impact on future technology.
If you are interested in this position, please apply now for immediate consideration. This is an excellent opportunity for an NPU Performance Analysis Engineer eager to work on innovative AI technology projects and make a significant impact on next-generation AI architectures.
Apply now