Software Algorithm Engineer
Software engineer job at OmniVision Technologies
Design and optimize raw image compression algorithms (e.g., MIPI MPC, JPEG XS) for automotive image sensors, balancing bandwidth efficiency and visual quality.
Develop cybersecurity monitoring systems compliant with automotive standards (ISO 21434), including threat analysis (TARA) and vulnerability scoring (CVSS 4.0).
Prototype and model algorithms using Python/C++, and validate hardware implementation via Siemens Catapult for power/area optimization.
Collaborate with hardware engineers to integrate algorithms into sensor systems, ensuring compliance with MIPI CSE security framework.
Document algorithm design, verification results, and cybersecurity protocols for reproducibility and audit purposes.
Requirements: Master's degree or foreign equivalent degree in Computer Science, Computer Engineering, or related fields. Require advanced courses in computer vision, computer network, and computer security. Required skills:
Attack vectors including SQL injection, Cross-Site Scripting (XSS), and defense mechanisms like TLS encryption.
Design secure communication protocols and analyze vulnerabilities in virtual servers.
Image filtering (Gaussian, Sobel, Laplacian) for noise reduction, edge detection and sharpening.
Template matching for object localization and feature tracking.
Multi-scale analysis using pyramid decomposition for hierarchical processing.
Develop algorithms to reconstruct 3D depth maps from 2D stereo image pairs using epipolar geometry and disparity mapping.
Annual base salary for this role in California, US is expected to be between $129,667 - $135,000. Actual pay will be determined on a number of factors such as relevant skills and experience, and the pay of employees in the similar role.
Auto-ApplyStaff Software Algorithm Engineer
Software engineer job at OmniVision Technologies
Description OmniVision Technologies, Inc. - Santa Clara is currently looking for Staff Algorithm Engineer to support ISP algorithm development.Responsibilities:
Research new image and color processing or computer vision techniques and technologies for digital camera systems
Design and develop new algorithms for image and color processing or computer vision
Expand our ISP or CV solution to new levels of functionality, quality, and performance
Assist the hardware design team with algorithm implementation and validation
Image quality tuning and measure
Algorithm documentation
Qualifications:
MS/PhD in Electrical Engineering or Computer Science with 6+ yrs of experience.
Familiar with camera ISP algorithm and pipeline or computer vision algorithms
A solid working understanding of colorimetry and image sensing technology
Hands on experience with mobile cameras a plus
Strong C/C++ and MATLAB programming skills
Organizational skills, computer skills, writing and researching skills
Annual base salary for this role in California, US is expected to be between $150,000 - $180,000. Actual pay will be determined on a number of factors such as relevant skills and experience, and the pay of employees in the similar role. EOE/Minorities/Females/Vet/Disability
Auto-ApplySoftware Algorithm Engineer
Software engineer job at OmniVision Technologies
ISP algorithm development and ISP tuning.
Subjective/objective image quality evaluation.
Auto white balance, color correction and color management.
Design, modeling, test and analysis of the imaging pipeline.
Machine learning solution development for real-time image processing, with low latency and low power consumption.
Train different models with various sensor patterns, such as 4C pattern and Bayer Pattern.
Improve algorithm to gain higher quality on color images.
Evaluate and tune existing image processing models and verify performance.
Write algorithms documentation.
Job Requirements: Master's degree in Electrical Engineering, Computer Science, or related fields with course work of computer vision and deep learning.
Must possess the demonstrated knowledge or skills of:
Convolution filters.
Neural networks, such as CNN model and GAN model.
CNN in object recognition.
Multi-class classification and image deionizing.
Deep learning frameworks.
Data augmentation.
Fine-tuning pre-trained models.
RNN, LSTM and transformers.
Optimization.
PyTorch.
Annual base salary for this role in California, US is expected to be between $126,942 - $134,000. Actual pay will be determined on a number of factors such as relevant skills and experience, and the pay of employees in the similar role.
Auto-ApplyNeural 3D Graphics Software Engineer (SMTS)
Santa Clara, CA jobs
What you do at AMD changes everything
At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies - building blocks for gaming, immersive platforms, and the data center.
Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
The Role:
We are looking for exceptional software engineers with a passion and demonstrated ability to rapidly learn new methods (from open-source code, academic papers, etc.), develop prototypes, invent new algorithms and help put the best ones in production.
To be successful in this role, you need to be highly motivated and passionate about programming. You must love solving technical problems, think outside the box, and be versatile enough to go where the challenges take you.
The Person:
Proven track-record programming complex software in languages such as Python 3.x, Julia, C++ or CUDA
Very strong aptitude for problem solving
Works effectively both independently and in a team environment
Configuring and troubleshooting your own Linux or Windows PC
Effective written and verbal communication skills
Ability to organize and manage priorities
Key Responsibilities:
This team contributes to multiple projects that combine real-time 3D graphics and machine learning.
Depending on your level of experience and preference, you will share responsibilities such as:
R&D for a new high-performance Neural 3D Rendering project, prototyped in Julia, with many opportunities to contribute to open-source repositories.
Prototype and integrate state-of-the-art algorithms, from papers to integration into game engines, e.g. using Pytorch, JAX or Julia. Depending on the project, this may also include profiling the workload, doing in-depth optimization of the training and/or inference, interfacing with MLIR/ONNX, etc.
R&D and ML Operations (MLOps). That project uses Pytorch, python, fastai, nbdev, Azure, etc.
Collaborate and share insights to improve our GPUs with other teams, e.g. GPU hardware and software architects
All projects offer many opportunities to learn, implement, integrate, optimize or invent new state-of-the ML and 3D algorithms
Preferred Experience:
ML fundamentals and state-of-the-art algorithms, theory and practice, e.g.: CNNs and other common neural network architectures, super-resolution, transformers, diffusion models, PyTorch, JAX, ROCM/CUDA, computer vision, efficient architectures, etc.
3D graphics algorithms, APIs and game development, e.g.: DirectX, Vulkan, HLSL, SPIR-V, ray tracing, Unreal Engine, etc. In particular, experience creating 3D tools or integrating new 3D graphics features in the Unreal Engine 4/5 (using C++) would be highly appreciated.
Neural Rendering algorithms, e.g. Neural Radiance Fields (NeRF), Neural Light Fields, differentiable rasterization, etc.
Portfolio of original, technically-challenging personal programming projects, or significant contribution to open-source projects
MLOps or DevOps experience, e.g. cloud (Azure, AWS), Docker, Linux networking, system-wide profiling/debugging
Experience with complementary technologies and practices, e.g. LLVM/MLIR, in-depth performance profiling, GPU architecture, 3D Content creation, etc.
Credentials:
University degree in Computer Engineering, Computer Science or equivalent
Location:
Proximity to an AMD development office, e.g.: Markham, Canada; Santa Clara, Boston, Orlando, San Diego, Austin, etc.; Munich, Germany (flexible)
The team works in Europe or North-American time zones
#LI-SW2
Requisition Number: 182587
Country: United States State: California City: Santa Clara
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
C++ Software Engineer, AI Open-Source Software
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
Are you interested in joining a world-class team that enables software for leading datacenters and the most powerful supercomputers? AMD is seeking talented, highly motivated Software Engineers to join our development team to enable and optimize the software ecosystem and push the boundaries of efficiency and performance for the next generation of GPU computational accelerators. Our team has an unparalleled perspective of the AI landscape and works closely with the industry's most sophisticated clients to help them leverage the latest hardware capabilities for AI workloads. As a member of our team, you will be among the first to combine new hardware with the latest applications, libraries, frameworks, and SDKs to drive innovation and solve complex challenges.
THE PERSON:
We are looking for a highly motivated and skilled Software Engineer to join our team. You will collaborate with other engineers to enable models, libraries, and applications for Instinct GPUs in both on-premises and cloud environments. Candidates should have strong C/C++ skills; Python experience is a plus. Experience analyzing and optimizing software performance is required. You must be self-motivated and able to work effectively in a team environment.
KEY RESPONSIBILITIES:
* Collaborate with AMD's architecture specialists to influence and improve future products
* Apply a data-driven approach to prioritize and target optimization efforts
* Stay informed of software and hardware trends and innovations, especially those related to algorithms and architecture
* Design and develop new groundbreaking AMD technologies
* Debug and resolve existing issues and research more efficient approaches to achieve objectives
* Build and maintain technical relationships with internal peers and external partners
PREFERRED EXPERIENCE:
* Experience with both training and inferencing machine learning models
* Experience with GPU kernel and parallel programming: ROCm, HIP, or CUDA
* Experience in performance analysis and optimization
* Highly proficient in C/C++, object-oriented programming, and tools such as debuggers and profilers
* Experience with open-source software development, engaging with upstream reviewers, and reviewing PRs.
* Experience with source code control systems such as GitHub
* Experience with Linux and containerized environments (e.g., Docker)
* Experience with software development processes such as Agile, Scrum, or Sprint
* Demonstrated ability to write high-quality code with strong attention to detail
* Effective communication and problem-solving skills
* Motivating leader with good interpersonal skills
ACADEMIC CREDENTIALS:
BS or MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
#LI-JG1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Prinicpal Software Engineer
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
PRINICPAL SOFTWARE ENGINEER
THE ROLE:
Great opportunity for a Principal Software Development Engineer career professional to join an elite team to develop ongoing solutions in the Adaptive & Embedded Computing Group (AECG) division of AMD semiconductor. AMD's new hardware programmable SoC FPGAs and Adaptive Compute Acceleration Platform (ACAP) deliver most dynamic processor technology and are achieving record performances in Data Center, Wireless/5G, Automotive/ADAS and Emulation applications.
These new applications, and heterogeneous computing architecture introduces new challenges in compilation, particularly in placement and routing. AMD's Software Implementation Tools team addresses these challenges. This position within the Vivado Software group of AECG focuses on research and development of novel multi-threaded and multi-process algorithms and Machine Learning (ML) techniques.
THE PERSON:
AECG's Vivado Software team is seeking a Principal Software Engineer with proven expertise in EDA Algorithms and C++ programming skills to work on a critical next generation state of the art Vivado Backend tool. The person will be part of FPGA Software Implementation Tools team within AECG having responsibility to innovate and develop novel Placer Algorithms and Machine learning techniques with aim of improving Quality of Results (QoR), Compile Time, and Memory Usage by orders of magnitude.
You will also be interacting with customers, helping them converge on their critical design requirements. This person will work with Tech Marketing and Applications Engineering to understand customer needs, and accordingly architect new features with ongoing engagement with Architecture team to design next gen FPGA and ACAP solutions.
KEY RESPONSIBILITIES:
Innovate and develop novel Placer algorithms to get multi-fold performance improvements (QoR, Compile Time, Memory Usage)
Interact with Customers, helping them converge on their critical design needs
Collaborate with Tech Marketing and Applications Engineering to understand customer needs and accordingly architect new features
Partner with Architecture team to design next gen FPGA and ACAP solutions
Evaluating new FPGA architectures and its impact on existing EDA tools
PREFERRED EXPERIENCE:
Proven expertise in EDA Algorithms
Programming expertise with C++
Machine Learning (ML) or Python
Knowledge of Digital Design and Field Programmable Gate Array (FPGA) is a huge plus
ACADEMIC CREDENTIALS:
Bachelor or Master's Degree in Computer Science, Computer Engineering, Electrical Engineering, or related equivalent, PhD desired, but not required
Location:
San Jose, CA
#LI-JT1
Requisition Number: 185945
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
Software Development Engineer - Kernel Development
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and finetuning inference performance across multi-GPU and multi-node systems through open-source popular frameworks like vllm/SGLang and internal inferencing platforms. You will engage with both internal framework teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge technologies and advanced engineering principles to drive continuous improvement.
THE PERSON:
Skilled engineer with strong technical and analytical expertise in Python development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch / vllm / SGLang for AMD GPUs in open-source repositories.
* Design and scale: multi‑GPU inference strategies (TP/PP/EP hybrid).
Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance.
* Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs.
* Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream.
* Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems.
* Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance.
* Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers.
* Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions.
PREFERRED EXPERIENCE:
* Kernel & Inference Frameworks: Strong background in GPU kernel development and LLM inference frameworks.
* Inference Stack Knowledge: Hands-on understanding of SGLang internals or similar stacks such as vLLM and FasterTransformer.
* Distributed & Open-Source Execution: Solid experienced with distributed inference scaling and proven contributor to upstream open-source projects
* Deep Learning Integration: Strong and significant experience in integrating optimized GPU performance into machine learning frameworks (e.g., TensorFlow, PyTorch) to accelerate model training and inference, with a focus on scaling and throughput.
* Software Engineering: Expert in Python and C++, with experience in debugging, performance tuning, and test design to ensure high-quality, maintainable software solutions.
* High-Performance Computing: Solid experienced in running large-scale workloads on heterogeneous compute clusters, optimizing for efficiency and scalability.
* Compiler Optimization: Foundational understanding of compiler theory and tools like LLVM and ROCm for kernel and system performance optimization.
ACADEMIC CREDENTIALS:
* Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
#LI-JG1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Staff Software Development Engineer, Computer Vision And AI software
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
We are looking for a talented engineer to join our team: developing heterogeneous software for computer vision, image/video processing as part of the AMD Radeon Open Ecosystem (ROCm). The ideal candidate thrives in a fast-paced environment and brings strong communication, problem-solving, and prioritization skills to the table. We value individuals who are critical thinkers, detail oriented and passionate about building high-performance software.
THE PERSON:
You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and collaboration are paramount. You possess excellent written and verbal communication skills, strong attention to detail, and the ability to express your work in a clear, cohesive fashion. Most importantly, you are constantly thinking of ways to make world-class software with reliability and performance.
KEY RESPONSIBILITIES:
* Design, develop, and optimize GPU software for computer vision, image and video processing workloads.
* Collaborate with cross-functional teams to deliver robust software extracting best performance from AMD hardware
* Debug and resolve complex software issues across multiple layers of the stack.
* Deliver high-quality code and documentation following best practices for open-source software development
* Work with key technical experts across AMD and with our partners and customers to improve ROCm applications, libraries, and tools, as well as AMD GPU hardware
* Apply software engineering methodologies to ensure robust, maintainable code
* Develop technical relationships with peers and partners
PREFERRED EXPERIENCE:
* Proficiency in C/C++ and Python.
* Experience in video codecs, image processing and machine learning frameworks
* Familiarity with computer vision libraries like OpenCV, FFMpeg, OpenVX etc.
* Experience with GPU programming (e.g., HIP, OpenCL, CUDA).
* Familiarity with ROCm or similar heterogeneous computing platforms.
* Background in computer vision, image processing, or video processing.
* Experience with open-source development and Git and feature tracking software (e.g., Jira)
* In-depth knowledge of best-practices in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
ACADEMIC CREDENTIALS:
* B.Sc. or B.Eng. degree in Computer Science, Software Engineering, Electrical Engineering, or equivalent
Advanced degrees, such as M.Sc., M.Eng., Ph.D. are preferred
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - Profiling Tools (Pre/Post - Silicon)
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE
AMD is seeking a Software Development Engineer to join the AGS Developer Tools team. In this role, you will design and develop advanced GPU profiling tools that enable developers to analyze and optimize performance for HPC, ML, and AI workloads. You will contribute to the ROCm ecosystem by building robust, scalable profiling, debugging solutions that empower developers to maximize AMD GPU capabilities.
THE PERSON
You are passionate about software engineering and performance optimization. You have a strong foundation in C++ and computer architecture, and you thrive in collaborative environments. You are detail-oriented, proactive in solving complex technical challenges, and able to communicate effectively across teams.
KEY RESPONSIBILITIES
* Design, develop, and maintain ROCprofiler-SDK components for GPU performance profiling
* Collaborate with architecture, driver, and runtime teams to enable profiling for next-generation AMD GPUs involved in Pre Silicon and Post Silicon activities.
* Implement new features and APIs to enhance profiling capabilities for AI and HPC workloads
* Optimize profiling tools for accuracy, scalability, and minimal overhead
* Debug and resolve issues in profiling workflows and improve tool reliability
* Participate in hardware bring-up and ensure profiling support for new ASICs
* Stay current with GPU architecture advancements and integrate them into ROCm profiling tools
* Contribute to documentation and developer resources for ROCprofiler-SDK
PREFERRED EXPERIENCE
* Strong proficiency in C++ and object-oriented programming
* Experience with performance analysis tools or profiling frameworks
* Familiarity with GPU programming models (HIP, OpenCL, or CUDA)
* Understanding of GPU architecture and system-level performance concepts
* Experience with multithreading and concurrency in modern C++
* Knowledge of Linux development environments; Windows experience is a plus
* Familiarity with ROCm ecosystem and tools is highly desirable
* Experience with Git-based workflows and debugging tools
* Strong problem-solving skills and ability to work independently and in a team
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
LOCATION: Santa Clara, CA
#LI-CJ3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Software Development Engineer-Networking
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
PMTS SOFTWARE DEVELOPMENT ENGINEER
THE ROLE:
AMD is looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Work with AMD's architecture specialists to improve future products
* Apply a data minded approach to target optimization efforts
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture
* Design and develop new groundbreaking AMD technologies
* Participating in new ASIC and hardware bring ups
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work
* Develop technical relationships with peers and partners
* Guide the design, development, and optimization of RCCL features, focusing on performance, scalability, and ease of use across various AMD platforms and interconnects (PCIe, Infinity Fabric,RoCEv2)
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred
* Ability to write high quality code with a keen attention to detail
* Experience with modern concurrent programming and threading APIs
* Experience with Windows, Linux and/or Android operating system development
* Experience with software development processes and tools such as debuggers, source code control systems (GitHub) and profilers is a plus
* Effective communication and problem-solving skills
* Motivating leader with good interpersonal skills
* Deep expertise in high-performance computing, Deep Learning, and parallel programming models.
* Strong understanding of GPU architectures, and communication libraries like RCCL, roc SHMEM, and MPI.
* Experience with various interconnect technologies (PCIe, Infinity Fabric, Infiniband/RoCEv2) and network protocols (RDMA)
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
#LI-RL1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Software Development Engineer, AI Open-Source Software
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
Do you want to help build the AI software that runs on top of the world's fastest supercomputers and most advanced data centers?
At AMD, we are looking for passionate and talented Software Engineers to join our team. You will help develop cutting-edge technology that pushes the boundaries of performance and efficiency for the next generation of GPU accelerators. This is an opportunity to contribute to open-source AI software from AMD and the broader community, driving innovation and enhancing AI performance across data center GPUs.
THE PERSON:
You are a talented engineer who is passionate about advancing the frontiers of AI. You thrive in open-source environments, enjoy tackling complex technical challenges, and take pride in writing efficient, maintainable, and scalable software. You are collaborative, curious, and excited to contribute to the open-source repositories that power the next generation of AI workloads.
KEY RESPONSIBILITIES:
* Contribute to RAG, Ray, ROCm, Coding Agent, DGL, llama.cpp, verl, MegaBlocks, FlashInfer, Triton Inference Server, Taichi, and other merging open-source projects driving AI innovation.
* Collaborate with leading partners and open-source communities to enable AI workloads and improve performance on data center GPUs.
PREFERRED EXPERIENCE:
* Experience with C++, Python, or similar programming languages.
* Knowledge of AI training and inference.
* Familiarity with GPU programming (CUDA, HIP, or OpenCL) and performance optimization techniques.
ACADEMIC CREDENTIALS:
Bachelor's or master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - SGLang
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your work will be instrumental in enhancing GPU kernel performance, accelerating deep learning models, and enabling training and inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU software teams and engage with open-source communities to integrate cutting-edge compiler technologies and drive upstream contributions that benefit AMD's AI software ecosystem.
THE PERSON:
Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories.
* Develop and Optimize Deep Learning Models: Profile and tune large-scale training and inference models for optimal performance on AMD hardware.
* GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton, or other relevant tools for AI operator efficiency.
* Collaborate with GPU Library and Compiler Teams: Work closely with internal compiler and GPU math library teams to integrate and align kernel-level optimizations with full-stack performance goals.
* Contribute to SGLang Development: Support optimization, feature development, and scaling of the SGLang LLM framework across AMD GPU platforms.
* Distributed System Optimization: Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism and collective communication strategies.
* Graph Compiler Integration: Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines.
* Open-Source Collaboration: Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively.
* Apply Engineering Best Practices: Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI integration.
PREFERRED EXPERIENCE:
* Strong Programming Skills: Proficient in C++ and/or Python, with demonstrated ability to debug, profile, and optimize performance-critical code.
* SGLang and LLM Optimization: Hands-on experience with SGLang or similar LLM inference frameworks is highly preferred.
* Compiler and GPU Architecture Knowledge: Background in compiler design or familiarity with technologies like LLVM, MLIR, or ROCm is a plus.
* Heterogeneous System Workloads: Experience running and scaling workloads on large-scale, heterogeneous clusters (CPU + GPU) using distributed training or inference strategies.
* AI Framework Integration: Experience contributing to or integrating optimizations into deep learning frameworks such as PyTorch or TensorFlow.
* GPU Computing: Working knowledge of HIP, CUDA, or other GPU programming models; experience with GCN/CDNA architecture preferred.
ACADEMIC CREDENTIALS:
* Bachelor's and/or Master's Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
#LI-JG1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
MTS Software Development Engineer - Netlist Data Model
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
MTS SOFTWARE DEVELOPMENT ENGINEER
THE ROLE:
At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies ' building blocks for gaming, immersive platforms, and the data center.
THE PERSON:
Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the 'extra mile' to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
KEY RESPONSIBILITES:
Work within and across teams to significantly improve runtime performance and memory usage for current and next-gen architecture
Assess new hardware architecture features in our device pipeline, drive recommendations for solutions in the tool chain
Work on EDA infra-structure projects in the domain of netlist, constraints, security, and licensing
Ensure on-time delivery of high-quality product that meets business and technical requirements
Engage with the application/field engineers to address critical customer designs issues
PREFERRED SKILLS:
Proven track record developing commercial software solutions
Expert software architecture, data structures/algorithm and C++ skills with emphasis on memory, runtime, quality, and scalability
VHDL, Verilog, or EDIF
C++ Code Debugger experience
Experience with FPGA or ASIC design flows is a plus
Experience with scripting languages (Python, Tcl)
ACADEMIC CREDENTIALS:
BS OR MS OR PhD in ELECTRICAL ENGINEERING / COMPUTER ENGINEERING / COMPUTER SCIENCE
LOCATION:
San Jose, Ca.
#LI-JT1
Requisition Number: 185141
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
SMTS Software Development Eng.
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
SOFTWARE DEVELOPMENT ENGINEER
THE ROLE
Developing and maintaining best in class FPGA software implementation tool including logic and physical optimization algorithms and flows.
Researching, implementing and enhancing AMD FPGA specific optimizations and flows.
Interacting with placement, routing, timing analysis, design verification, software applications, and other software tools teams.
Performing design analysis and addressing customer issues.
Taking initiative on software tool quality and performance improvements.
PREFERED EXPERIENCES:
Expertise in C++ and object oriented programming
Knowledge of Data structures, Graph Theory and Algorithms.
Proven experience in developing state of the art in one or more of the following areas: logic optimization, physical optimization, timing analysis, place & route.
Experience in developing and supporting large-scale software, including understanding usage model, writing functional specification, implementing code, testing, documentation, and providing customer support.
ACADEMIC CREDENTIALS:
BS with 8+ years of experience or MS 5+ years of experience or PhD with 3+ years of experience in Computer Science, Computer Engineering, Electrical Engineering or related equivalent
#LI-JY1
Requisition Number: 184841
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
Staff Software Development Engineer - AI/ML Frameworks, C/C++/Python
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
This role is not eligible for visa sponsorship.
THE ROLE:
AMD is looking for a world class AI frameworks engineer who can provide technical leadership in the development of various AI frameworks in the AMD ecosystem. You will play a pivotal role in developing and optimizing deep learning frameworks for AMD GPUs. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.
THE PERSON:
If you are passionate about AI/ML frameworks, software architecture, and/or compilers this is your opportunity. You will be working in one of the core areas, such as AI/ML frameworks (e.g. PyTorch, vLLM, SGLang), AI runtime components, and/or optimization tooling to accelerate AI/ML workloads on AMD GPUs. You will collaborate closely with AI researchers to drive the development of framework components to efficiently map AI models to run on latest AMD GPUs. You should be someone who can work in a dynamic development environment, with excellent leadership and collaboration skills. You will work with multiple engineering teams that are geographically dispersed. You will work on next generation framework software, guiding other senior developers and domain experts.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch, vLLM, SGLang for AMD GPUs in open-source repositories.
* Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations.
* Develop & Optimize Models: Design and optimize deep learning models using quantization specifically for AMD GPU performance.
* Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs.
* Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream.
* Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions.
PREFERRED EXPERIENCE:
* GPU Kernel Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming to maximize performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform performance.
* Experience with AI software framework, such as PyTorch, vLLM, SGLang, benchmarking and profiling.
* Experience using profiling and benchmark tooling for large models.
* Experience with model optimization, such as low-precision quantization (MXFP4, FP8, INT4), sparsity.
* Solid understanding of model architectures, LLMs, MoE, diffusion.
* Proficient in C++ programming.
* Experience developing and debugging in Python.
* Team player and ready to work with a geographically distributed team.
ACADEMIC CREDENTIALS:
BS, MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related technical fields.
#LI-TC1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Software Development Engineer, ML Training and Performance
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
We are looking for a Principal Machine Learning Engineer to join our Models and Applications team. If you are excited by the challenge of distributed training of large models on a large number of GPUs, and if you are passionate about improving training efficiency while innovating and generating new ideas, then this role is for you. You will be part of a world class team focused on addressing the challenge of training generative AI at scale.
THE PERSON:
The ideal candidate should have experience with distributed training pipelines, be knowledgeable in distributed training algorithms (Data Parallel, Tensor Parallel, Pipeline Parallel, ZeRO), and be familiar with training large models at scale.
KEY RESPONSIBILITIES:
* Train large models to convergence on AMD GPUs at scale.
* Improve the end-to-end training pipeline performance.
* Optimize the distributed training pipeline and algorithm to scale out.
* Contribute your changes to open source.
* Stay up-to-date with the latest training algorithms.
* Influence the direction of AMD AI platform.
* Collaborate across teams with various groups and stakeholders.
PREFERRED EXPERIENCE:
* Experience with ML frameworks such as PyTorch, JAX, or TensorFlow.
* Experience with distributed training and distributed training frameworks, such as Megatron-LM, DeepSpeed.
* Experience with LLMs or computer vision, especially large models, is a plus.
* Excellent Python or C++ programming skills, including debugging, profiling, and performance analysis at scale.
* Experience with ML infra at kernel, framework, or system level
* Strong communication and problem-solving skills.
ACADEMIC CREDENTIALS:
* A master's degree or PhD degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
LOCATION:
* San Jose, CA or Bellevue, WA preferred. May consider other US markets within proximity of US AMD offices.
#LI-MV1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - AI group
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
A senior technical contributor that drives end-to-end delivery of software solutions, directly contributing to, and coordinating implementation and optimization across multiple teams for inference and training of machine learning models. The position will involve interfacing with software and hardware engineering teams and AMD partners to plan, develop and optimize use cases. This is an exciting opportunity to work on the cutting edge of GPU Computing for Machine Learning.
THE PERSON:
You are a subject matter expert and strong technical contributor with machine learning and GPU programming experience. You excel as part of a team where communication and team skills are highly valued.
KEY RESPONSIBILITIES:
* Work within and coordinate with a small team to analyze, implement, and optimize DirectML-TensorFlow and PyTorch for machine learning models
* Collaborate with ISV, library, compiler, driver, and hardware engineers to influence strategic decisions to achieve the highest performance for DirectML
* Innovate new algorithmic improvements that exploit the strengths of the hardware architecture to deliver the best possible machine learning performance
PREFERRED EXPERIENCE:
* Relevant experience in Machine Learning and/or GPU programming
* Experience in deep learning frameworks (e.g. TensorFlow, Keras, PyTorch, Caffe, ONNX, etc) and familiarity with CNN/LSTM model architectures
* Knowledge of CPU and GPU architecture, and experience in GPGPU programming technologies
* Experience advocating for technical solutions in a collaborative team environment
* Excellent communication and collaboration skills
ACADEMIC CREDENTIALS:
Bachelor's or Master's degree in related discipline preferred
#LI-CJ3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Fellow Software Development Eng (MI-450 Fleet Management)
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a Fellow in Infrastructure Management who is passionate about complex AI solutions, Infrastructure management software that manages GPU clusters supporting large foundation model training, high-performance inference services, multi-tenant GPU sharing and scheduling. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering, system design, infrastructure management and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
Lead the technical vision, strategy, and architectural direction of our Infrastructure management software that manages our GPU clusters supporting large foundation model training, high-performance inference services, multi-tenant GPU sharing and scheduling.
Architect and implement GPU node orchestration, failure detection, auto-remediation, auto-scaling of clusters. Design infrastructure software that can support different distributed training frameworks PyTorch, Megatron, JAX, Tensorflow and different distributed inference frameworks like SGLang, VLLM, Ray. Design and deliver software that can manage different scale-up and scale out transport protocols and deliver the best network performance.
Architect telemetry, observability, and profiling systems (Prometheus/Thanos, Open Telemetry, Mimir) to measure GPU health and cluster efficiency. Architect monitoring systems that can investigate network congestion, latency spikes, scheduling inefficiencies, system bottlenecks.
Architect our Infrastructure management software for scale, efficiency and deliver industry leading GPU cluster utilization that is highly reliable and self-healing. Deliver infrastructure services that reduce job latencies for Slurm and Kubernetes clusters, improve scheduling efficiencies, and reduce operational cost.
Design and deliver AI Agents that can troubleshoot complex infrastructure problems without any human intervention and reduce OPEX cost and MTTR. Design and deliver AI Agents that can proactively identify nodes that are bound to fail before they fail.
Define our long-term infrastructure management roadmap, drive cross-team initiatives, and deliver. Mentor Principal Engineers, Technical staff across teams. Work with external partners and vendors to develop and deliver the most comprehensive infrastructure solution for AMD.
PREFERRED ACADEMIC CREDENTIALS:
Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
#LI-G11
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Staff Software Development Engineer - AI/ML frameworks, C/C++/Python
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a specialized software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Work with AMD's architecture specialists to improve future products
* Apply a data minded approach to target optimization efforts
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture
* Design and develop new groundbreaking AMD technologies
* Participating in new ASIC and hardware bring ups
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work
* Develop technical relationships with peers and partners
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred
* Ability to write high quality code with a keen attention to detail
* Experience with modern concurrent programming and threading APIs
* Experience with Windows, Linux and/or Android operating system development
* Experience with software development processes and tools such as debuggers, source code control systems (GitHub) and profilers is a plus
* Effective communication and problem-solving skills
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
#LI-TC1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a specialized software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Responsible for the design, implementation, analysis, and test of FPGA physical implementation tools which include optimization, placement, timing, and routing.
* Perform research and development of novel algorithms and optimization of existing algorithms to improve quality of results (QOR), runtime, and memory.
* Conduct research and development of novel placement/routing and optimization techniques for next-generation FPGAs.
* Apply a data minded approach to target optimization efforts.
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture.
* Design and develop new groundbreaking AMD technologies.
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work.
* Develop technical relationships with peers and partners.
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred.
* Strong knowledge of Data structures, Graph Theory and Algorithms.
* Ability to write high quality code with a keen attention to detail.
* Experience with modern concurrent programming and threading APIs.
* Experience with software development processes and tools such as debuggers, source code control systems like GitHub, Perforce and profilers is a plus.
* Effective communication and problem-solving skills.
* Knowledge of EDA Algorithms and Distributed Computing is a plus.
* Knowledge of Digital Design and FPGA is a plus.
* Knowledge of optimization techniques like ILP is a plus.
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
#LI-SC3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.