Principal Platform Software Engineer - OpenBMC Platform Architect
Senior software development engineer job at NVIDIA
NVIDIA's invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company, and form teams with the most inquisitive people in the world. Join us at the forefront of technological advancement.
Are you ready to change the next generation of computing? Join us at the forefront of technological advancement. We are looking for a principal platform software architect who can lead next generation data center server product platform architecture, bring up and drive a solution to production.
What you'll be doing:
Platform architecture and hardware bring up of NVIDIA HGX GPU baseboards. Software architecture and design for various firmware, understanding embedded system limitations, Linux kernel internals to ensure performance, scalability and resiliency requirements for firmware running on embedded devices.
Working closely with hardware teams to influence hardware design and review HW architecture & schematics.
Work with internal and external team members to narrow down on performance and resiliency requirements for firmware running on Nvidia data center products. Hands on coding, code review, and BMC firmware development including various manageability features for NVIDIA's Server platforms
Actively engaged in designing and developing CI/CD framework to ensure best quality for firmware. Writing and reviewing design documents, reviewing QA test plan and working closely with all collaborators to achieve consensus for design and testability as per product requirements.
Designs solutions for errors, stats & configuration appropriate to CPU, GPU, DIMM, SSDs, NICs, IB, PSU, BMC, FPGA, CPLD etc. for enterprise readiness of NVIDIA Server platforms.
Actively work with whole org to Instruments code to ensure maximum code coverage, writing and automating unit tests for each implemented module and maintaining detailed unit test case reports.
Mentor team for best practices on writing efficient and bug free code. Works with internal and external partners to drive design architecture to real products.
Works with the security team to ensure developed code is in line with product security goals, and with hardware teams to influence hardware design and review HW architecture & schematics.
What we need to see:
Bachelor of Science Degree (or higher) or equivalent experience in Electrical or Computer Engineering or Computer Science.
15+ overall years of active development using C / C++ as primary programming language using Linux as OS.
8+ experience in technically leading a good size of team in terms of delivering large firmware or software projects. 5+ experience in working across internal and external stakeholders to narrow down on requirements and converting those requirements in architecture and drive with a team to deliver it with quality
Proven track record of delivering solutions to customers. Deep understanding of deployments at scale
Domain expertise in Data Center Firmware/software development on X86 or ARM Platforms including BMC-BIOS communication, thermal management, power management, firmware update, device monitoring, firmware security, etc.
Board Bring-up expertise with hands-on experience in Device drivers like I2C/I3C, SPI, PCIe, SMBus, Mail-box etc. as well as the device trees for uboot and Linux kernel.
Understanding on REST architecture style especially JSON over HTTPs with OAuth.
Strong programming in C/C++ in Linux operating environment, strong understanding of Linux kernel internals, strong code review skills.
You should possess excellent written and oral communication skills, good work ethics, high sense of team-work, love to produce quality work and commitment to finish your tasks every single day. You are a self-starter who loves to find creative solutions to complicated problems.
Ways to stand out from the crowd:
Consistent track record in delivering 100,000+ lines of code for a single project.
Proven record in technically leading org of 30+ engineers.
Expertise in system software and platform security for x86/ARM based Rack/Blade server systems.
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 425,500 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until December 10, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Auto-ApplyNVIDIA 2026 Internships: Software Engineering - US
Senior software development engineer job at NVIDIA
By submitting your resume, you're expressing interest in one of our 2026 Software Engineering Internships. We'll review resumes on an ongoing basis, and a recruiter may reach out if your experience fits one of our many internship opportunities.
NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society - from gaming to robotics, self-driving cars to life-saving healthcare, climate change to virtual worlds where we can all connect and create.
Our internships offer an excellent opportunity to expand your career and get hands on experience with one of our industry leading Software teams. We're seeking strategic, ambitious, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve.
Throughout the 12-week minimum full-time internship, students will work on projects that have a measurable impact on our business. We're looking for students pursuing Bachelor's, Master's, or PhD degree within a relevant or related field.
Potential Internships in this field include:
Development Tools
Debugging complex system-level issues using Jenkins
Course or internship experience related to the following areas could be required: Relational Databases, Linear Algebra & Numerical Methods, Operating Systems (memory/resource management), Scheduling and Process Control, Hardware Virtualization
Cloud
Supporting overall architecture and design of our cloud storage infrastructure
Implementing and troubleshooting storage and data platform tools, automating storage infrastructure end-to-end
Course or internship experience related to the following areas could be required: Distributed Systems, Data Structures & Algorithms, Virtualization, Automation/Scripting, Container & Cluster Management, Debugging
Tools Infrastructure
Building industry leading technology by proving workflows and infrastructure, alongside a team of experts in production software development and chip design methodologies
Enabling success for content running on the chip from application tracing and analysis to modeling, diagnostics, performance tuning, and debugging
Course or internship experience related to the following areas and technologies could be required: Unix/Shell Scripting, Linux, Java, JavaScript (including Node, React, Vue), C++, CUDA, OOP, Go, Python, Git, GitLab, Perforce, Kubernetes and Microservices, Schedulers (LSF, SLURM), Containers (Docker), Configuration Automation (Ansible)
Machine Learning Operations
Deep Learning, GPU Computing, Accelerated Computing
Validation Frameworks for Deep Learning, Deep Learning Frameworks and Libraries (NumPy, SciPy, cu BLAS, cu DNN)
Data Preprocessing, Training Acceleration (CUDA, cu DNN, NCCL), Convolution Operations (cu DNN), Real-Time Inference (TensorRT)
Building Infrastructure for Back-End Analytics
What we need to see:
Must be actively enrolled in a university pursuing a Bachelor's, Master's, or PhD degree in Electrical Engineering, Computer Engineering, or a related field, for the entire duration of the internship.
Depending on the internship role, prior experience or knowledge requirements
could
include the following programming skills and technologies:
Java, JavaScript, (including Node, React, Vue), SQL, C, C++, CUDA, OOP, Go, Python, Git, Perforce, Kubernetes and Microservices, Schedulers (LSF, SLURM), Containers (Docker), Configuration Automation (Ansible)
Click here to learn more about NVIDIA, our early talent programs, benefits offered to students and other helpful student resources related to our latest technologies and endeavors.
Our internship hourly rates are a standard pay based on the position, your location, year in school, degree, and experience. The hourly rate for our interns is 20 USD - 71 USD.
You will also be eligible for Intern benefits.
Applications are accepted on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Auto-ApplySr. Software Development Engineer (C/C++ Required)
San Jose, CA jobs
NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer. No matter the data type, workload or environment, we help our customers identify and realize new business possibilities. And it all starts with our people.
If this sounds like something you want to be part of, NetApp is the place for you. You can help bring new ideas to life, approaching each challenge with fresh eyes. Of course, you won't be doing it alone. At NetApp, we're all about asking for help when we need it, collaborating with others, and partnering across the organization - and beyond.
About Us
Join our innovative and dynamic Cloud Engineering team at NetApp, where we are dedicated to pushing the boundaries of cloud technology. We are looking for a passionate and experienced Software Development Engineer to join us in developing industry leading data management solutions for NetApp's customers in all major Clouds. If you have hands-on experience with AWS, OCI or GCP, we want to hear from you!
Job Summary
As a Software Development Engineer in NetApp's Cloud Platforms organization, you will play a critical role in the team responsible for designing, developing, and maintaining scalable, secure and robust operating systems and capabilities that run NetApp data management applications in the Cloud.
As part of the Research and Development function, the overall focus of the group is on competitive market and customer requirements, technology advances and innovation, product quality, product cost and time-to-market. To achieve these goals, software engineers in this organization leverage their technical strengths and teamwork skills via collaboration with Cloud partners and internal Engineering partners.
Key Responsibilities
Develop a strong technical understanding of NetApp's existing data management and storage solutions in the Cloud, their interdependencies, and limitations.
Participate in all phases of the product development cycle including product design and development, test automation, debugging and support, and supporting resolution of impactful customer issues.
Participate in technical discussions within the immediate group and across functions to evaluate design and execute development plans for product features
Handle unstructured tasks and conducting research to convert vague/undefined requirements into clearly articulated ones.
Demonstrate accountability and ownership of a complex technical area of the product as a subject matter expert, while operating under limited supervision and direction, drive results and set priorities independently.
Engage in a peer-to-peer collaboration and learning model, fostering a culture of continuous learning and improvement.
Minimum Qualifications
A minimum of 8-10 years of experience in software/systems engineering is required, with programming skills in C, C++ and Python
Knowledge: Strong understanding of computer architecture, data structures, and programming practices.
Demonstrated experience as a Lead or technical mentor to junior engineers in a team.
Excellent problem-solving skills and attention to detail.
Strong verbal and written communication skills, and collaboration skills are a must.
Ability to work with limited supervision and operate and drive results independently.
Preferred Qualifications
Technical Expertise: Deep understanding of cloud technologies and orchestration frameworks with hands-on experience in AWS, Azure, Oracle or GCP.
Experience in software development of distributed data management systems (E.g.: NetApp ONTAP or similar systems).
Advanced certifications in AWS, GCP, or OCI.
Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
Experience with Cloud monitoring and logging tools.
Education
Bachelor's degree in Computer Science, Engineering, or a related field.
Compensation:
The target salary range for this position is 159,800 - 237,600 USD. The salary offered will be determined by the candidate's location, qualifications, experience, and education and may be outside of this range. Final compensation packages are competitive and in line with industry standards, reflecting a variety of factors, and include a comprehensive benefits package. This may cover Health Insurance, Life Insurance, Retirement or Pension Plans, Paid Time Off (PTO), various Leave options, Performance-Based Incentives, employee stock purchase plan, and/or restricted stocks (RSU's), with all offerings subject to regional variations and governed by local laws, regulations, and company policies. Benefits may vary by country and region, and further details will be provided as part of the recruitment process.
At NetApp, we embrace a hybrid working environment designed to strengthen connection, collaboration, and culture for all employees. This means that most roles will have some level of in-office and/or in-person expectations, which will be shared during the recruitment process.
Equal Opportunity Employer:
NetApp is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all federal, state and local laws that prohibit employment discrimination based on age, race, color, gender, sexual orientation, gender identity, national origin, religion, disability or genetic information, pregnancy, protected veteran status, and any other protected classification.
Why NetApp?
We are all about helping customers turn challenges into business opportunity. It starts with bringing new thinking to age-old problems, like how to use data most effectively to run better - but also to innovate. We tailor our approach to the customer's unique needs with a combination of fresh thinking and proven approaches.
We enable a healthy work-life balance. Our volunteer time off program is best in class, offering employees 40 hours of paid time per year to volunteer with their favorite organizations. We provide comprehensive medical, dental, wellness, and vision plans for you and your family. We offer educational assistance, legal services, and access to discounts. Finally, we provide financial savings programs to help you plan for your future.
If you want to help us build knowledge and solve big problems, let's talk.
Software Developer Intern 2026: SVL
San Jose, CA jobs
**Introduction** Launch your software career with IBM!As a Software Developer Intern, you'll work with cutting-edge technologies and learn from industry experts while contributing to products that reach millions of users worldwide. This isn't "shadow and learn" - you'll take ownership, deliver value and see your work impact real clients. From your first week, you'll be part of a real Agile team, coding, testing, and deploying features that matter.
Why you'll love this internship:
* Real-world projects with measurable impact
* Mentorship from experienced IBM engineers
* Networking, intern events, and career development opportunities
* A clear path to future full-time roles
At IBM you'll be part of a diverse and inclusive community, have access to premium learning resources, join intern hackathons and networking events, and set the stage for a potential full-time role after your internship.
**Your role and responsibilities**
During your internship, you will:
* Be an integral part of an Agile development team - designing, coding, testing, and deploying software that makes a real impact.
* Turn design concepts and wireframes into beautiful, functional web applications.
* Learn directly from experienced engineers through code reviews, pair programming, and mentorship.
* Work hands-on with APIs, databases, and cloud platforms to bring ideas to life.
* Apply software engineering best practices in a collaborative and inclusive environment.
Who You Are
You're curious, creative, and driven to turn ideas into code. You thrive in a collaborative environment, love solving problems, and aren't afraid to experiment, fail fast, and learn quickly. Whether you've built side projects, contributed to open source, or are exploring new frameworks just for fun - you're excited to grow with a global team of innovators at IBM.
**Required technical and professional expertise**
* Familiarity with front-end technologies (HTML, CSS, JavaScript, and frameworks like React, Angular, or Vue).
* Basic understanding of back-end technologies (Node.js, Java, Python, SQL).
* Experience using development tools (Git, IDEs) and version control.
* Curiosity to learn about cloud platforms, automation, and container technologies.
**Preferred technical and professional experience**
* Exposure to cloud platforms (IBM Cloud, AWS, or Azure).
* Experience with APIs, RESTful services, and databases (Postgres, MySQL, or similar).
* Familiarity with Agile tools, CI/CD, and automation frameworks (Jenkins, Cypress, Selenium, etc.).
* Contributions to open-source or personal coding projects.
* Passion for innovation and continuous learning
* Graduate in Dec 2026 or 2027.
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
Software Developer Intern 2026: SVL
San Jose, CA jobs
Introduction Launch your software career with IBM!As a Software Developer Intern, you'll work with cutting-edge technologies and learn from industry experts while contributing to products that reach millions of users worldwide. This isn't "shadow and learn" - you'll take ownership, deliver value and see your work impact real clients. From your first week, you'll be part of a real Agile team, coding, testing, and deploying features that matter.
Why you'll love this internship:
* Real-world projects with measurable impact
* Mentorship from experienced IBM engineers
* Networking, intern events, and career development opportunities
* A clear path to future full-time roles
At IBM you'll be part of a diverse and inclusive community, have access to premium learning resources, join intern hackathons and networking events, and set the stage for a potential full-time role after your internship.
Your role and responsibilities
During your internship, you will:
* Be an integral part of an Agile development team - designing, coding, testing, and deploying software that makes a real impact.
* Turn design concepts and wireframes into beautiful, functional web applications.
* Learn directly from experienced engineers through code reviews, pair programming, and mentorship.
* Work hands-on with APIs, databases, and cloud platforms to bring ideas to life.
* Apply software engineering best practices in a collaborative and inclusive environment.
Who You Are
You're curious, creative, and driven to turn ideas into code. You thrive in a collaborative environment, love solving problems, and aren't afraid to experiment, fail fast, and learn quickly. Whether you've built side projects, contributed to open source, or are exploring new frameworks just for fun - you're excited to grow with a global team of innovators at IBM.
Required education
High School Diploma/GED
Preferred education
Bachelor's Degree
Required technical and professional expertise
* Familiarity with front-end technologies (HTML, CSS, JavaScript, and frameworks like React, Angular, or Vue).
* Basic understanding of back-end technologies (Node.js, Java, Python, SQL).
* Experience using development tools (Git, IDEs) and version control.
* Curiosity to learn about cloud platforms, automation, and container technologies.
Preferred technical and professional experience
* Exposure to cloud platforms (IBM Cloud, AWS, or Azure).
* Experience with APIs, RESTful services, and databases (Postgres, MySQL, or similar).
* Familiarity with Agile tools, CI/CD, and automation frameworks (Jenkins, Cypress, Selenium, etc.).
* Contributions to open-source or personal coding projects.
* Passion for innovation and continuous learning
* Graduate in Dec 2026 or 2027.
ABOUT BUSINESS UNIT
IBM Software infuses core business operations with intelligence-from machine learning to generative AI-to help make organizations more responsive, productive, and resilient. IBM Software helps clients put AI into action now to create real value with trust, speed, and confidence across digital labor, IT automation, application modernization, security, and sustainability. Critical to this is the ability to make use of all data, because AI is only as good as the data that fuels it. In most organizations data is spread across multiple clouds, on premises, in private datacenters, and at the edge. IBM's AI and data platform scales and accelerates the impact of AI with trusted data, and provides leading capabilities to train, tune and deploy AI across business. IBM's hybrid cloud platform is one of the most comprehensive and consistent approach to development, security, and operations across hybrid environments-a flexible foundation for leveraging data, wherever it resides, to extend AI deep into a business.
YOUR LIFE @ IBM
In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.
Being an IBMer means you'll be able to learn and develop yourself and your career, you'll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.
Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.
Are you ready to be an IBMer?
ABOUT IBM
IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.
Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 500 companies relying on the IBM Cloud to run their business.
At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.
IBM is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, neurodivergence, age, or other characteristics protected by the applicable law. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
OTHER RELEVANT JOB DETAILS
IBM will not be providing visa sponsorship for this position now or in the future. Therefore, in order to be considered for this position, you must have the ability to work without a need for current or future visa sponsorship.
The compensation range and benefits for this position are based on a full-time schedule for a full calendar year. The salary will vary depending on your job-related skills, experience and location. Pay increment and frequency of pay will be in accordance with employment classification and applicable laws. For part time roles, your compensation and benefits will be adjusted to reflect your hours. Benefits may be pro-rated for those who start working during the calendar year.
Staff Software Development Engineer - AI/ML Frameworks, C/C++/Python
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
This role is not eligible for visa sponsorship.
THE ROLE:
AMD is looking for a world class AI frameworks engineer who can provide technical leadership in the development of various AI frameworks in the AMD ecosystem. You will play a pivotal role in developing and optimizing deep learning frameworks for AMD GPUs. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.
THE PERSON:
If you are passionate about AI/ML frameworks, software architecture, and/or compilers this is your opportunity. You will be working in one of the core areas, such as AI/ML frameworks (e.g. PyTorch, vLLM, SGLang), AI runtime components, and/or optimization tooling to accelerate AI/ML workloads on AMD GPUs. You will collaborate closely with AI researchers to drive the development of framework components to efficiently map AI models to run on latest AMD GPUs. You should be someone who can work in a dynamic development environment, with excellent leadership and collaboration skills. You will work with multiple engineering teams that are geographically dispersed. You will work on next generation framework software, guiding other senior developers and domain experts.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch, vLLM, SGLang for AMD GPUs in open-source repositories.
* Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations.
* Develop & Optimize Models: Design and optimize deep learning models using quantization specifically for AMD GPU performance.
* Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs.
* Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream.
* Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions.
PREFERRED EXPERIENCE:
* GPU Kernel Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming to maximize performance for AI operations, leveraging tools like Compute Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform performance.
* Experience with AI software framework, such as PyTorch, vLLM, SGLang, benchmarking and profiling.
* Experience using profiling and benchmark tooling for large models.
* Experience with model optimization, such as low-precision quantization (MXFP4, FP8, INT4), sparsity.
* Solid understanding of model architectures, LLMs, MoE, diffusion.
* Proficient in C++ programming.
* Experience developing and debugging in Python.
* Team player and ready to work with a geographically distributed team.
ACADEMIC CREDENTIALS:
BS, MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related technical fields.
#LI-TC1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - AI group
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
A senior technical contributor that drives end-to-end delivery of software solutions, directly contributing to, and coordinating implementation and optimization across multiple teams for inference and training of machine learning models. The position will involve interfacing with software and hardware engineering teams and AMD partners to plan, develop and optimize use cases. This is an exciting opportunity to work on the cutting edge of GPU Computing for Machine Learning.
THE PERSON:
You are a subject matter expert and strong technical contributor with machine learning and GPU programming experience. You excel as part of a team where communication and team skills are highly valued.
KEY RESPONSIBILITIES:
* Work within and coordinate with a small team to analyze, implement, and optimize DirectML-TensorFlow and PyTorch for machine learning models
* Collaborate with ISV, library, compiler, driver, and hardware engineers to influence strategic decisions to achieve the highest performance for DirectML
* Innovate new algorithmic improvements that exploit the strengths of the hardware architecture to deliver the best possible machine learning performance
PREFERRED EXPERIENCE:
* Relevant experience in Machine Learning and/or GPU programming
* Experience in deep learning frameworks (e.g. TensorFlow, Keras, PyTorch, Caffe, ONNX, etc) and familiarity with CNN/LSTM model architectures
* Knowledge of CPU and GPU architecture, and experience in GPGPU programming technologies
* Experience advocating for technical solutions in a collaborative team environment
* Excellent communication and collaboration skills
ACADEMIC CREDENTIALS:
Bachelor's or Master's degree in related discipline preferred
#LI-CJ3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Fellow Software Development Eng (MI-450 Fleet Management)
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a Fellow in Infrastructure Management who is passionate about complex AI solutions, Infrastructure management software that manages GPU clusters supporting large foundation model training, high-performance inference services, multi-tenant GPU sharing and scheduling. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering, system design, infrastructure management and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
Lead the technical vision, strategy, and architectural direction of our Infrastructure management software that manages our GPU clusters supporting large foundation model training, high-performance inference services, multi-tenant GPU sharing and scheduling.
Architect and implement GPU node orchestration, failure detection, auto-remediation, auto-scaling of clusters. Design infrastructure software that can support different distributed training frameworks PyTorch, Megatron, JAX, Tensorflow and different distributed inference frameworks like SGLang, VLLM, Ray. Design and deliver software that can manage different scale-up and scale out transport protocols and deliver the best network performance.
Architect telemetry, observability, and profiling systems (Prometheus/Thanos, Open Telemetry, Mimir) to measure GPU health and cluster efficiency. Architect monitoring systems that can investigate network congestion, latency spikes, scheduling inefficiencies, system bottlenecks.
Architect our Infrastructure management software for scale, efficiency and deliver industry leading GPU cluster utilization that is highly reliable and self-healing. Deliver infrastructure services that reduce job latencies for Slurm and Kubernetes clusters, improve scheduling efficiencies, and reduce operational cost.
Design and deliver AI Agents that can troubleshoot complex infrastructure problems without any human intervention and reduce OPEX cost and MTTR. Design and deliver AI Agents that can proactively identify nodes that are bound to fail before they fail.
Define our long-term infrastructure management roadmap, drive cross-team initiatives, and deliver. Mentor Principal Engineers, Technical staff across teams. Work with external partners and vendors to develop and deliver the most comprehensive infrastructure solution for AMD.
PREFERRED ACADEMIC CREDENTIALS:
Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
#LI-G11
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
MTS Software Development Engineer - Netlist Data Model
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
MTS SOFTWARE DEVELOPMENT ENGINEER
THE ROLE:
At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies ' building blocks for gaming, immersive platforms, and the data center.
THE PERSON:
Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the 'extra mile' to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
KEY RESPONSIBILITES:
Work within and across teams to significantly improve runtime performance and memory usage for current and next-gen architecture
Assess new hardware architecture features in our device pipeline, drive recommendations for solutions in the tool chain
Work on EDA infra-structure projects in the domain of netlist, constraints, security, and licensing
Ensure on-time delivery of high-quality product that meets business and technical requirements
Engage with the application/field engineers to address critical customer designs issues
PREFERRED SKILLS:
Proven track record developing commercial software solutions
Expert software architecture, data structures/algorithm and C++ skills with emphasis on memory, runtime, quality, and scalability
VHDL, Verilog, or EDIF
C++ Code Debugger experience
Experience with FPGA or ASIC design flows is a plus
Experience with scripting languages (Python, Tcl)
ACADEMIC CREDENTIALS:
BS OR MS OR PhD in ELECTRICAL ENGINEERING / COMPUTER ENGINEERING / COMPUTER SCIENCE
LOCATION:
San Jose, Ca.
#LI-JT1
Requisition Number: 185141
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
SMTS Software Development Eng.
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
SOFTWARE DEVELOPMENT ENGINEER
THE ROLE
Developing and maintaining best in class FPGA software implementation tool including logic and physical optimization algorithms and flows.
Researching, implementing and enhancing AMD FPGA specific optimizations and flows.
Interacting with placement, routing, timing analysis, design verification, software applications, and other software tools teams.
Performing design analysis and addressing customer issues.
Taking initiative on software tool quality and performance improvements.
PREFERED EXPERIENCES:
Expertise in C++ and object oriented programming
Knowledge of Data structures, Graph Theory and Algorithms.
Proven experience in developing state of the art in one or more of the following areas: logic optimization, physical optimization, timing analysis, place & route.
Experience in developing and supporting large-scale software, including understanding usage model, writing functional specification, implementing code, testing, documentation, and providing customer support.
ACADEMIC CREDENTIALS:
BS with 8+ years of experience or MS 5+ years of experience or PhD with 3+ years of experience in Computer Science, Computer Engineering, Electrical Engineering or related equivalent
#LI-JY1
Requisition Number: 184841
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
Staff Software Development Engineer - AI/ML frameworks, C/C++/Python
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a specialized software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Work with AMD's architecture specialists to improve future products
* Apply a data minded approach to target optimization efforts
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture
* Design and develop new groundbreaking AMD technologies
* Participating in new ASIC and hardware bring ups
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work
* Develop technical relationships with peers and partners
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred
* Ability to write high quality code with a keen attention to detail
* Experience with modern concurrent programming and threading APIs
* Experience with Windows, Linux and/or Android operating system development
* Experience with software development processes and tools such as debuggers, source code control systems (GitHub) and profilers is a plus
* Effective communication and problem-solving skills
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
#LI-TC1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
AMD is looking for a specialized software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Responsible for the design, implementation, analysis, and test of FPGA physical implementation tools which include optimization, placement, timing, and routing.
* Perform research and development of novel algorithms and optimization of existing algorithms to improve quality of results (QOR), runtime, and memory.
* Conduct research and development of novel placement/routing and optimization techniques for next-generation FPGAs.
* Apply a data minded approach to target optimization efforts.
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture.
* Design and develop new groundbreaking AMD technologies.
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work.
* Develop technical relationships with peers and partners.
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred.
* Strong knowledge of Data structures, Graph Theory and Algorithms.
* Ability to write high quality code with a keen attention to detail.
* Experience with modern concurrent programming and threading APIs.
* Experience with software development processes and tools such as debuggers, source code control systems like GitHub, Perforce and profilers is a plus.
* Effective communication and problem-solving skills.
* Knowledge of EDA Algorithms and Distributed Computing is a plus.
* Knowledge of Digital Design and FPGA is a plus.
* Knowledge of optimization techniques like ILP is a plus.
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
#LI-SC3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - Kernel Development
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and finetuning inference performance across multi-GPU and multi-node systems through open-source popular frameworks like vllm/SGLang and internal inferencing platforms. You will engage with both internal framework teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge technologies and advanced engineering principles to drive continuous improvement.
THE PERSON:
Skilled engineer with strong technical and analytical expertise in Python development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance and optimize frameworks like PyTorch / vllm / SGLang for AMD GPUs in open-source repositories.
* Design and scale: multi‑GPU inference strategies (TP/PP/EP hybrid).
Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance.
* Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs.
* Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream.
* Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems.
* Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance.
* Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers.
* Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions.
PREFERRED EXPERIENCE:
* Kernel & Inference Frameworks: Strong background in GPU kernel development and LLM inference frameworks.
* Inference Stack Knowledge: Hands-on understanding of SGLang internals or similar stacks such as vLLM and FasterTransformer.
* Distributed & Open-Source Execution: Solid experienced with distributed inference scaling and proven contributor to upstream open-source projects
* Deep Learning Integration: Strong and significant experience in integrating optimized GPU performance into machine learning frameworks (e.g., TensorFlow, PyTorch) to accelerate model training and inference, with a focus on scaling and throughput.
* Software Engineering: Expert in Python and C++, with experience in debugging, performance tuning, and test design to ensure high-quality, maintainable software solutions.
* High-Performance Computing: Solid experienced in running large-scale workloads on heterogeneous compute clusters, optimizing for efficiency and scalability.
* Compiler Optimization: Foundational understanding of compiler theory and tools like LLVM and ROCm for kernel and system performance optimization.
ACADEMIC CREDENTIALS:
* Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
#LI-JG1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Staff Software Development Engineer, Computer Vision And AI software
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
We are looking for a talented engineer to join our team: developing heterogeneous software for computer vision, image/video processing as part of the AMD Radeon Open Ecosystem (ROCm). The ideal candidate thrives in a fast-paced environment and brings strong communication, problem-solving, and prioritization skills to the table. We value individuals who are critical thinkers, detail oriented and passionate about building high-performance software.
THE PERSON:
You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and collaboration are paramount. You possess excellent written and verbal communication skills, strong attention to detail, and the ability to express your work in a clear, cohesive fashion. Most importantly, you are constantly thinking of ways to make world-class software with reliability and performance.
KEY RESPONSIBILITIES:
* Design, develop, and optimize GPU software for computer vision, image and video processing workloads.
* Collaborate with cross-functional teams to deliver robust software extracting best performance from AMD hardware
* Debug and resolve complex software issues across multiple layers of the stack.
* Deliver high-quality code and documentation following best practices for open-source software development
* Work with key technical experts across AMD and with our partners and customers to improve ROCm applications, libraries, and tools, as well as AMD GPU hardware
* Apply software engineering methodologies to ensure robust, maintainable code
* Develop technical relationships with peers and partners
PREFERRED EXPERIENCE:
* Proficiency in C/C++ and Python.
* Experience in video codecs, image processing and machine learning frameworks
* Familiarity with computer vision libraries like OpenCV, FFMpeg, OpenVX etc.
* Experience with GPU programming (e.g., HIP, OpenCL, CUDA).
* Familiarity with ROCm or similar heterogeneous computing platforms.
* Background in computer vision, image processing, or video processing.
* Experience with open-source development and Git and feature tracking software (e.g., Jira)
* In-depth knowledge of best-practices in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
ACADEMIC CREDENTIALS:
* B.Sc. or B.Eng. degree in Computer Science, Software Engineering, Electrical Engineering, or equivalent
Advanced degrees, such as M.Sc., M.Eng., Ph.D. are preferred
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - Profiling Tools (Pre/Post - Silicon)
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE
AMD is seeking a Software Development Engineer to join the AGS Developer Tools team. In this role, you will design and develop advanced GPU profiling tools that enable developers to analyze and optimize performance for HPC, ML, and AI workloads. You will contribute to the ROCm ecosystem by building robust, scalable profiling, debugging solutions that empower developers to maximize AMD GPU capabilities.
THE PERSON
You are passionate about software engineering and performance optimization. You have a strong foundation in C++ and computer architecture, and you thrive in collaborative environments. You are detail-oriented, proactive in solving complex technical challenges, and able to communicate effectively across teams.
KEY RESPONSIBILITIES
* Design, develop, and maintain ROCprofiler-SDK components for GPU performance profiling
* Collaborate with architecture, driver, and runtime teams to enable profiling for next-generation AMD GPUs involved in Pre Silicon and Post Silicon activities.
* Implement new features and APIs to enhance profiling capabilities for AI and HPC workloads
* Optimize profiling tools for accuracy, scalability, and minimal overhead
* Debug and resolve issues in profiling workflows and improve tool reliability
* Participate in hardware bring-up and ensure profiling support for new ASICs
* Stay current with GPU architecture advancements and integrate them into ROCm profiling tools
* Contribute to documentation and developer resources for ROCprofiler-SDK
PREFERRED EXPERIENCE
* Strong proficiency in C++ and object-oriented programming
* Experience with performance analysis tools or profiling frameworks
* Familiarity with GPU programming models (HIP, OpenCL, or CUDA)
* Understanding of GPU architecture and system-level performance concepts
* Experience with multithreading and concurrency in modern C++
* Knowledge of Linux development environments; Windows experience is a plus
* Familiarity with ROCm ecosystem and tools is highly desirable
* Experience with Git-based workflows and debugging tools
* Strong problem-solving skills and ability to work independently and in a team
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
LOCATION: Santa Clara, CA
#LI-CJ3
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Software Development Engineer-Networking
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
PMTS SOFTWARE DEVELOPMENT ENGINEER
THE ROLE:
AMD is looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.
THE PERSON:
The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.
KEY RESPONSIBILITIES:
* Work with AMD's architecture specialists to improve future products
* Apply a data minded approach to target optimization efforts
* Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture
* Design and develop new groundbreaking AMD technologies
* Participating in new ASIC and hardware bring ups
* Debugging/fix existing issues and research alternative, more efficient ways to accomplish the same work
* Develop technical relationships with peers and partners
* Guide the design, development, and optimization of RCCL features, focusing on performance, scalability, and ease of use across various AMD platforms and interconnects (PCIe, Infinity Fabric,RoCEv2)
PREFERRED EXPERIENCE:
* Strong object-oriented programming background, C/C++ preferred
* Ability to write high quality code with a keen attention to detail
* Experience with modern concurrent programming and threading APIs
* Experience with Windows, Linux and/or Android operating system development
* Experience with software development processes and tools such as debuggers, source code control systems (GitHub) and profilers is a plus
* Effective communication and problem-solving skills
* Motivating leader with good interpersonal skills
* Deep expertise in high-performance computing, Deep Learning, and parallel programming models.
* Strong understanding of GPU architectures, and communication libraries like RCCL, roc SHMEM, and MPI.
* Experience with various interconnect technologies (PCIe, Infinity Fabric, Infiniband/RoCEv2) and network protocols (RDMA)
ACADEMIC CREDENTIALS:
* Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
#LI-RL1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Software Development Engineer, AI Open-Source Software
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
Do you want to help build the AI software that runs on top of the world's fastest supercomputers and most advanced data centers?
At AMD, we are looking for passionate and talented Software Engineers to join our team. You will help develop cutting-edge technology that pushes the boundaries of performance and efficiency for the next generation of GPU accelerators. This is an opportunity to contribute to open-source AI software from AMD and the broader community, driving innovation and enhancing AI performance across data center GPUs.
THE PERSON:
You are a talented engineer who is passionate about advancing the frontiers of AI. You thrive in open-source environments, enjoy tackling complex technical challenges, and take pride in writing efficient, maintainable, and scalable software. You are collaborative, curious, and excited to contribute to the open-source repositories that power the next generation of AI workloads.
KEY RESPONSIBILITIES:
* Contribute to RAG, Ray, ROCm, Coding Agent, DGL, llama.cpp, verl, MegaBlocks, FlashInfer, Triton Inference Server, Taichi, and other merging open-source projects driving AI innovation.
* Collaborate with leading partners and open-source communities to enable AI workloads and improve performance on data center GPUs.
PREFERRED EXPERIENCE:
* Experience with C++, Python, or similar programming languages.
* Knowledge of AI training and inference.
* Familiarity with GPU programming (CUDA, HIP, or OpenCL) and performance optimization techniques.
ACADEMIC CREDENTIALS:
Bachelor's or master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Software Development Engineer - SGLang
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your work will be instrumental in enhancing GPU kernel performance, accelerating deep learning models, and enabling training and inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU software teams and engage with open-source communities to integrate cutting-edge compiler technologies and drive upstream contributions that benefit AMD's AI software ecosystem.
THE PERSON:
Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential.
KEY RESPONSIBILITIES:
* Optimize Deep Learning Frameworks: Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories.
* Develop and Optimize Deep Learning Models: Profile and tune large-scale training and inference models for optimal performance on AMD hardware.
* GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton, or other relevant tools for AI operator efficiency.
* Collaborate with GPU Library and Compiler Teams: Work closely with internal compiler and GPU math library teams to integrate and align kernel-level optimizations with full-stack performance goals.
* Contribute to SGLang Development: Support optimization, feature development, and scaling of the SGLang LLM framework across AMD GPU platforms.
* Distributed System Optimization: Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism and collective communication strategies.
* Graph Compiler Integration: Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines.
* Open-Source Collaboration: Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively.
* Apply Engineering Best Practices: Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI integration.
PREFERRED EXPERIENCE:
* Strong Programming Skills: Proficient in C++ and/or Python, with demonstrated ability to debug, profile, and optimize performance-critical code.
* SGLang and LLM Optimization: Hands-on experience with SGLang or similar LLM inference frameworks is highly preferred.
* Compiler and GPU Architecture Knowledge: Background in compiler design or familiarity with technologies like LLVM, MLIR, or ROCm is a plus.
* Heterogeneous System Workloads: Experience running and scaling workloads on large-scale, heterogeneous clusters (CPU + GPU) using distributed training or inference strategies.
* AI Framework Integration: Experience contributing to or integrating optimizations into deep learning frameworks such as PyTorch or TensorFlow.
* GPU Computing: Working knowledge of HIP, CUDA, or other GPU programming models; experience with GCN/CDNA architecture preferred.
ACADEMIC CREDENTIALS:
* Bachelor's and/or Master's Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
#LI-JG1
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Neural 3D Graphics Software Engineer (SMTS)
Santa Clara, CA jobs
What you do at AMD changes everything
At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies - building blocks for gaming, immersive platforms, and the data center.
Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
The Role:
We are looking for exceptional software engineers with a passion and demonstrated ability to rapidly learn new methods (from open-source code, academic papers, etc.), develop prototypes, invent new algorithms and help put the best ones in production.
To be successful in this role, you need to be highly motivated and passionate about programming. You must love solving technical problems, think outside the box, and be versatile enough to go where the challenges take you.
The Person:
Proven track-record programming complex software in languages such as Python 3.x, Julia, C++ or CUDA
Very strong aptitude for problem solving
Works effectively both independently and in a team environment
Configuring and troubleshooting your own Linux or Windows PC
Effective written and verbal communication skills
Ability to organize and manage priorities
Key Responsibilities:
This team contributes to multiple projects that combine real-time 3D graphics and machine learning.
Depending on your level of experience and preference, you will share responsibilities such as:
R&D for a new high-performance Neural 3D Rendering project, prototyped in Julia, with many opportunities to contribute to open-source repositories.
Prototype and integrate state-of-the-art algorithms, from papers to integration into game engines, e.g. using Pytorch, JAX or Julia. Depending on the project, this may also include profiling the workload, doing in-depth optimization of the training and/or inference, interfacing with MLIR/ONNX, etc.
R&D and ML Operations (MLOps). That project uses Pytorch, python, fastai, nbdev, Azure, etc.
Collaborate and share insights to improve our GPUs with other teams, e.g. GPU hardware and software architects
All projects offer many opportunities to learn, implement, integrate, optimize or invent new state-of-the ML and 3D algorithms
Preferred Experience:
ML fundamentals and state-of-the-art algorithms, theory and practice, e.g.: CNNs and other common neural network architectures, super-resolution, transformers, diffusion models, PyTorch, JAX, ROCM/CUDA, computer vision, efficient architectures, etc.
3D graphics algorithms, APIs and game development, e.g.: DirectX, Vulkan, HLSL, SPIR-V, ray tracing, Unreal Engine, etc. In particular, experience creating 3D tools or integrating new 3D graphics features in the Unreal Engine 4/5 (using C++) would be highly appreciated.
Neural Rendering algorithms, e.g. Neural Radiance Fields (NeRF), Neural Light Fields, differentiable rasterization, etc.
Portfolio of original, technically-challenging personal programming projects, or significant contribution to open-source projects
MLOps or DevOps experience, e.g. cloud (Azure, AWS), Docker, Linux networking, system-wide profiling/debugging
Experience with complementary technologies and practices, e.g. LLVM/MLIR, in-depth performance profiling, GPU architecture, 3D Content creation, etc.
Credentials:
University degree in Computer Engineering, Computer Science or equivalent
Location:
Proximity to an AMD development office, e.g.: Markham, Canada; Santa Clara, Boston, Orlando, San Diego, Austin, etc.; Munich, Germany (flexible)
The team works in Europe or North-American time zones
#LI-SW2
Requisition Number: 182587
Country: United States State: California City: Santa Clara
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.
Prinicpal Software Engineer
San Jose, CA jobs
What you do at AMD changes everything
We care deeply about transforming lives with AMD technology to enrich our industry, our communities and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence, while being direct, humble, collaborative and inclusive of diverse perspectives. This is who we are at our best. One Company. One Team.
AMD together we advance_
PRINICPAL SOFTWARE ENGINEER
THE ROLE:
Great opportunity for a Principal Software Development Engineer career professional to join an elite team to develop ongoing solutions in the Adaptive & Embedded Computing Group (AECG) division of AMD semiconductor. AMD's new hardware programmable SoC FPGAs and Adaptive Compute Acceleration Platform (ACAP) deliver most dynamic processor technology and are achieving record performances in Data Center, Wireless/5G, Automotive/ADAS and Emulation applications.
These new applications, and heterogeneous computing architecture introduces new challenges in compilation, particularly in placement and routing. AMD's Software Implementation Tools team addresses these challenges. This position within the Vivado Software group of AECG focuses on research and development of novel multi-threaded and multi-process algorithms and Machine Learning (ML) techniques.
THE PERSON:
AECG's Vivado Software team is seeking a Principal Software Engineer with proven expertise in EDA Algorithms and C++ programming skills to work on a critical next generation state of the art Vivado Backend tool. The person will be part of FPGA Software Implementation Tools team within AECG having responsibility to innovate and develop novel Placer Algorithms and Machine learning techniques with aim of improving Quality of Results (QoR), Compile Time, and Memory Usage by orders of magnitude.
You will also be interacting with customers, helping them converge on their critical design requirements. This person will work with Tech Marketing and Applications Engineering to understand customer needs, and accordingly architect new features with ongoing engagement with Architecture team to design next gen FPGA and ACAP solutions.
KEY RESPONSIBILITIES:
Innovate and develop novel Placer algorithms to get multi-fold performance improvements (QoR, Compile Time, Memory Usage)
Interact with Customers, helping them converge on their critical design needs
Collaborate with Tech Marketing and Applications Engineering to understand customer needs and accordingly architect new features
Partner with Architecture team to design next gen FPGA and ACAP solutions
Evaluating new FPGA architectures and its impact on existing EDA tools
PREFERRED EXPERIENCE:
Proven expertise in EDA Algorithms
Programming expertise with C++
Machine Learning (ML) or Python
Knowledge of Digital Design and Field Programmable Gate Array (FPGA) is a huge plus
ACADEMIC CREDENTIALS:
Bachelor or Master's Degree in Computer Science, Computer Engineering, Electrical Engineering, or related equivalent, PhD desired, but not required
Location:
San Jose, CA
#LI-JT1
Requisition Number: 185945
Country: United States State: California City: San Jose
Job Function: Design
Benefits offered are described here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.