CUDA Jobs
Overview
Looking for top CUDA jobs? Explore the latest CUDA developer opportunities on Haystack, your go-to IT job board for high-performance computing roles. Whether you're a CUDA programmer, GPU computing expert, or parallel processing specialist, find the perfect job that matches your skills and advance your career in cutting-edge technology today.
Senior PyTorch Engineer
Advanced Micro Devices
King's Lynn
Hybrid
Senior
Private salary

Company description

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

Job description

THE ROLE:

AMD is looking for a specialized software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.

THE PERSON:

Our team is seeking an expert-level, motivated senior developer with a strong technical foundation in C++/Python AI development in a Linux environment. The ideal candidate combines deep technical expertise with strong analytical abilities, excelling both as a collaborative team member and as an independent contributor capable of defining goals, scoping work and leading development efforts end to end.

KEY RESPONSIBILITIES:

  • Model Development and Optimization: Lead the development and optimization of Deep Learning models on AMD GPUs, ensuring high performance and scalability across training and inference workloads.
  • Framework Enablement: Lead efforts to enhance PyTorch and its ecosystems on AMD GPUs, contributing performance-critical improvements to upstream open-source repositories.
  • Open-Source Integration: Engage directly with PyTorch and other open-source maintainers to align on requirements, propose enhancements and ensure AMD contributions are successfully adopted and maintained upstream.
  • Community Feature Enablement: Collaborate with PyTorch community and ecosystem partners to design, enable and upstream new features, ensuring AMD GPUs remain first class for emerging AI workloads.
  • Cross-Functional Collaboration: Partner closely with internal GPU Libraries, compilers, and runtime teams to identify performance bottlenecks and deliver optimized solutions for large-scale AI workloads.
  • Scalable Systems Optimization: Develop innovative solutions for both scale-up (multi-GPU) and scale-out (multi-node) distributed training environments to optimize performance on AMD GPUs.
  • Engineering Excellence: Apply rigorous software engineering best practices, including code review, testing, CI/CD integration and debugging, to deliver reliable and maintainable solutions at scale.

Required profile

PREFERRED EXPERIENCE:

  • Software Engineering: Expert skills in Python and C++, with experience in debugging, performance tuning, and test design to ensure high-quality, maintainable software solutions.
  • AI Framework & Deep Learning: Strong understanding of AI frameworks such as PyTorch, Triton and vLLM, with applied knowledge across domains such as Natural Language Processing, Vision, Audio and Recommendation Systems.
  • GPU Computing: Strong experience with GPU Programming models (CUDA, HIP).
  • Development Processes & Tools: Expert with modern software practices and tooling, including debuggers, profilers, and version control systems (e.g., GitHub).
  • Collaboration & Problem Solving: Effective and efficient communicator with strong analytical and problem-solving skills, capable of working across diverse technical teams.

ACADEMIC CREDENTIALS:

  • Master's or PhD-level degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
  • Senior Professional Experience in technical software development, with a focus on GPU optimization, performance engineering, and framework development.

HPC Engineer
Pearson Whiffin IT & Digital
Derby
Hybrid
Mid - Senior
£130,446/day

We are seeking an experienced High Performance Computing (HPC) Engineer to design, maintain, and optimise large-scale computing environments that support data-intensive and compute-heavy workloads. You will work closely with researchers, developers, and infrastructure teams to ensure high availability, performance, and scalability of HPC systems.

Key Responsibilities

  • Design, deploy, and manage HPC clusters (on-prem, cloud, or hybrid)
  • Install, configure, and optimise job schedulers (e.g. Slurm, PBS, LSF)
  • Tune system performance for CPU, GPU, memory, storage, and network workloads
  • Support users with application optimisation and parallelisation
  • Automate system administration using scripting and configuration management tools
  • Monitor system health, capacity, and performance
  • Troubleshoot hardware, software, and performance issues
  • Collaborate on future architecture planning and upgrades
  • Maintain documentation and best practices

Required Skills & Experience

  • Strong Linux system administration experience
  • Hands-on experience with HPC environments and parallel computing
  • Knowledge of MPI, OpenMP, and/or CUDA
  • Experience with job schedulers (Slurm preferred)
  • Familiarity with high-speed interconnects (InfiniBand, Omni-Path)
  • Experience with scripting languages (Bash, Python)
  • Understanding of performance profiling and optimisation techniques

Desirable Skills

  • Experience with GPUs and accelerator-based systems
  • Knowledge of cloud HPC (AWS, Azure, GCP)
  • Experience with containers (Singularity/Apptainer, Docker)
  • Configuration management tools (Ansible, Puppet, Chef)
  • Experience supporting scientific or research workloads
Software Engineer (Digital Signal Processing)
Expleo UK LTD
Manchester
In office
Mid - Senior
Private salary

Overview

Expleo is a trusted partner for end-to-end, integrated engineering, quality services and management consulting for digital transformation. We help businesses harness unrelenting technological change to successfully deliver innovations that will help them gain a competitive advantage and improve the everyday lives of people around the globe.

A highly capable Software Engineer (Digital Signal Processing) is required to support the development, integration, and testing of complex digital signal processing (DSP) algorithms within real-time software systems. This role is suited to professionals with strong C++ expertise, a deep understanding of computational mathematics, and experience working in model-driven development environments.

The position involves working within multidisciplinary engineering teams to translate mathematical models into robust, high-performance software implementations. The ideal candidate will be comfortable working in Linux-based environments, collaborating across modelling, software, systems, and verification functions.

Responsibilities

  • Design, develop, and optimise real-time DSP algorithms in C++ within high-performance computing environments.
  • Integrate both auto-generated and hand-crafted model-based code into larger software systems.
  • Conduct modelling, simulation, and validation of algorithms using MATLAB, Simulink, or similar environments.
  • Execute structured testing, benchmarking, and verification of DSP components to ensure precision, performance, and model fidelity.
  • Develop scripts (Python/Bash) for modelling support, data preparation, analysis, and automated testing workflows.
  • Perform Linux-target development activities including debugging, optimisation, and deployment.
  • Collaborate with engineering teams to ensure software aligns with system requirements, performance targets, and architectural constraints.
  • Apply software engineering best practices including static analysis, unit testing, and clean code principles.
  • Contribute to configuration management, issue tracking, and continuous integration processes using industry-standard tools.
  • Participate in Agile delivery frameworks, contributing to planning, reporting, and technical reviews.
  • Ensure the accuracy, consistency, and quality of algorithmic outputs and their software implementations.

Qualifications

  • Degree in Electrical Engineering, Computer Science, Mathematics, Physics, or a related discipline.
  • Strong academic or professional focus on signal processing, computational mathematics, or algorithm development.
  • Solid grounding in real-time software engineering and Linux-based development environments.

Essential skills

  • Advanced C++ programming for real-time DSP applications.
  • Strong mathematical capability, particularly in digital signal processing theory and algorithm development.
  • Proficiency with MATLAB, Simulink, or similar tools for modelling and simulation.
  • Experience with integrating autogenerated model code alongside custom software implementations.
  • Linux development expertise, including debugging, profiling, performance optimisation, and command-line workflows.
  • Python and/or Bash scripting for automation, prototyping, data manipulation, and verification tasks.
  • Familiarity with GPU acceleration techniques (e.g., CUDA, OpenCL).
  • Understanding of networking, Linux infrastructure, and system patching.

Desired skills

  • Knowledge of high-performance computing techniques and parallelisation strategies.
  • Familiarity with container technologies such as Docker or Podman.
  • Experience working in multidisciplinary engineering teams including systems, hardware, and modelling specialists.
  • Understanding of end-to-end DSP pipelines, model fidelity constraints, and real-time performance requirements.
  • Strong awareness of software quality assurance practices and continuous integration concepts.

Experience

  • Demonstrated experience implementing, validating, and benchmarking DSP algorithms.
  • Working with filters, detection algorithms, beamforming, or similar DSP domains.
  • Applying model-driven development practices across the full software lifecycle.
  • Conducting structured test and verification activities for mathematical or algorithmic components.
  • Using version control, repository management, and software lifecycle tools (e.g., Git, Bitbucket, Artifactory, Jira).
  • Working within Agile software environments or following well-defined software delivery processes.
  • Experience with containerised runtime environments or modern DevOps workflows (desirable).

What do I need before I apply

  • Eligibility for Security Clearance (SC); in many cases this may require sole UK nationality and long-term UK residency.
  • Ability and willingness to work fully onsite (location will depend on the employer/project).
  • Strong communication skills and the ability to collaborate effectively with cross-functional technical teams.
  • A proactive approach to problem solving, ownership of deliverables, and commitment to high software and mathematical accuracy.

Benefits

  • Collaborative working environment - we stand shoulder to shoulder with our clients and our peers through good times and challenges

  • We empower all passionate technology loving professionals by allowing them to expand their skills and take part in inspiring projects

  • Expleo Academy - enables you to acquire and develop the right skills by delivering a suite of accredited training courses

  • Competitive company benefits

  • Always working as one team, our people are not afraid to think big and challenge the status quo

  • As a Disability Confident Committed Employer we have committed to:

  • Ensuring our recruitment process is inclusive and accessible
  • Communicating and promoting vacancies
  • Offering an interview to disabled people who meet the minimum criteria for the job
  • Anticipating and providing reasonable adjustments as required
  • Supporting any existing employee who acquires a disability or long-term health condition, enabling them to stay in work
  • At least one activity that will make a difference for disabled people

“We are an equal opportunities employer and welcome applications from all suitably qualified persons regardless of their race, sex, disability, religion/belief, sexual orientation or age.”

We treat everyone fairly and equitably across the organisation, including providing any additional support and adjustments needed for everyone to thrive.

Software Engineer C++
Recruit 12
Oxford
Remote or hybrid
Mid - Senior
Private salary

Software Engineer C++ GPU Acceleration

Overview

We are seeking a skilled C++ Engineer with strong GPU acceleration expertise to work on cutting-edge, high-performance systems used across entertainment, engineering, and scientific applications. This role focuses on maximising GPU-based processing performance, including real-time data handling, image processing, and machine learning workloads.

You will join a collaborative software engineering team and work closely with machine learning and research specialists in an environment that values technical excellence, innovation, and a healthy work-life balance.

Key Responsibilities

  • Design and implement high-performance algorithms using CUDA
  • Manage host-device interactions, including memory management, data transfer optimisation, and multi-GPU support
  • Deploy and optimise machine learning models using TensorRT within C++ applications
  • Profile and optimise GPU workloads using NVIDIA Nsight Systems and Nsight Compute
  • Configure GPU hardware and software stacks to maximise runtime performance
  • Evaluate and recommend appropriate GPU hardware for specific workloads
  • Clearly communicate GPU-related opportunities and constraints to non-technical stakeholders

Required Skills, Knowledge & Experience

  • Strong modern C++ development skills
  • Proven experience with CUDA and CUDA libraries
  • Solid understanding of software optimisation and performance tuning
  • Experience developing and profiling GPU-accelerated applications
  • Confidence working in performance-critical, real-time systems

Desirable (Not Essential)

  • Knowledge of networking, streaming, or video compression
  • Experience with real-time data pipelines or image processing systems

Working Environment

  • Collaborative, cross-disciplinary engineering culture
  • Close interaction with machine learning and research teams
  • Informal and supportive workplace with an emphasis on sustainable workloads
Frequently asked questions
Our job board features a variety of CUDA-related positions, including GPU computing engineers, CUDA developers, parallel computing specialists, AI and machine learning engineers utilizing CUDA, and research roles focused on GPU acceleration.

It's important to showcase your proficiency in CUDA programming, parallel computing concepts, GPU architecture, and related tools like NVIDIA Nsight. Experience with C/C++, Python, and frameworks such as TensorFlow or PyTorch that integrate CUDA is also valuable.

Yes, our job board lists both onsite and remote CUDA positions. You can filter your job search by location to find fully remote, hybrid, or onsite opportunities that suit your preferences.

CUDA expertise is in demand across multiple industries, including gaming, automotive (self-driving technologies), scientific research, finance for high-frequency trading algorithms, AI and machine learning, and data analytics.

Focus on understanding parallel computing principles, optimizing GPU performance, and writing efficient CUDA kernels. Be ready to solve coding challenges related to memory management, thread synchronization, and debugging GPU code. Familiarity with common CUDA libraries and tools can also be helpful.