Custom Solutions Architect
Solutions architect job at Intel
This role serves as a primary technical interface between CEG, Go-To-Market, and our customers, translating complex system-level requirements into winning custom silicon and ASIC solutions. Will work closely with key custom silicon and ASIC customers to develop and drive end-to-end product solutions that both delight the customer and drive business growth. Includes providing proof of concept solutions for proposing design alternatives that meet performance, power, area, timing, and thermal requirements. Review, challenge, and influence cross functional teams, roadmaps, and technology. Will drive technology benchmarking and key metrics to drive pre-silicon modeling for accuracy and best in class proposals that incorporate unit cost and PPA optimizations. Combines deep technical expertise with customer relationship management to drive revenue growth across wireless, networking, data center, and AI market segments.
Key Responsibilities
+ Serve as primary technical point of contact for strategic customers
+ Conduct technical discovery sessions to understand customer system-level requirements
+ Analyze customer use cases across AI, wireless, networking, and data center applications
+ Translate business requirements into technical specifications and solution architectures
+ Design and propose comprehensive silicon solutions that meet customer performance, power, and cost targets
+ Package integrated solutions spanning AI accelerators to radio frequency applications
+ Develop compelling technical proposals and presentations for customer engagements
+ Collaborate with sales teams to create winning competitive positioning
+ Provide expertise in System-on-Chip (SoC) design methodologies and best practices
+ Guide technology node selection based on performance, power, and cost optimization
+ Conduct technology node benchmarking and competitive analysis
+ Lead cross-functional design teams through solution development lifecycle
+ Drive project deliverables, timelines, and milestone tracking
+ Coordinate IP selection and integration requirements with internal and external teams
+ Ensure alignment between customer expectations and engineering capabilities
+ Drive design reviews and technical risk mitigation strategies
Required Skills/Experience
+ Experience in multiple domains of SoC design - backend, frontend, test, and packaging.
+ Experience in leading edge technology nodes
+ Strong understanding of design trade-offs - PPA, Cost, Technology Node, IP requirements
+ Excellent communication and presentation skills
Preferred Skills/Experience
+ Experience with AI/ML accelerator architectures and requirements
+ Experience leading high performance teams
+ Project management experience
+ Previous experience in foundry and/or ASIC vendor environments
**Qualifications:**
+ Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or related field
+ 10+ years of relevant experience in semiconductor SoC design
**Job Type:**
Experienced Hire
**Shift:**
Shift 1 (United States of America)
**Primary Location:**
US, Texas, Austin
**Additional Locations:**
US, Arizona, Phoenix; US, California, Folsom; US, California, Santa Clara; US, Oregon, Hillsboro
**Business group:**
Intel makes possible the most amazing experiences of the future. You may know us for our processors. But we do so much more. Intel invents at the boundaries of technology to make amazing experiences possible for business and society, and for every person on Earth. Harnessing the capability of the cloud, the ubiquity of the Internet of Things, the latest advances in memory and programmable solutions, and the promise of always-on 5G connectivity, Intel is disrupting industries and solving global challenges. Leading on policy, diversity, inclusion, education and sustainability, we create value for our stockholders, customers, and society.
**Posting Statement:**
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
**Position of Trust**
This role is a Position of Trust. Should you accept this position, you must consent to and pass an extended Background Investigation, which includes (subject to country law) extended education, SEC sanctions, and additional criminal and civil checks. For internals, this investigation may or may not be completed prior to starting the position. For additional questions, please contact your Recruiter.
**Benefits:**
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, and bonuses, as well as benefit programs that include health, retirement, and vacation. Find more information about all of our Amazing Benefits here:
**********************************************************************************
Annual Salary Range for jobs which could be performed in the US: $247,810.00-349,850.00 USD
The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.
**Work Model for this Role**
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.
Senior AI SoC Architect
Solutions architect job at Intel
Job Details:
Job Description:
At Intel, we are committed to creating world-changing technology that enriches the lives of every person on earth. Our AI SoC Architecture Team is at the forefront of innovation, responsible for architecting world-leading datacenter AI SoCs and delivering the next generation of rack-scale AI servers. Join us in our mission to shape the future of technology as part of Intel's highly regarded AI SoC Engineering Group, headquartered in Santa Clara, CA, with additional sites in Folsom, CA, and Bangalore, IN.
We are seeking an outstanding Senior Computer/SoC Architect to join our AI Architecture team. In this role, you will research, develop, and lead Intel architecture as we re-imagine how to build AI SoCs at Intel and in the semiconductor industry. This is an excellent opportunity to influence architectural direction, define the next generation of Intel AI SoC Architecture, and collaborate across disciplines to drive innovation.
Key Responsibilities
Define and lead the development of next-generation Intel AI SoC Architecture.
Inform and influence architectural direction from upper management and business group leaders.
Direct architecture for the entire product lifecycle, from early pathfinding through execution to silicon debug and beyond.
Collaborate across disciplines to identify technical solutions and assess key design trade-offs.
Define system/subsystem architecture for correct integration and optimized power/performance.
Author architecture specifications and facilitate correct implementation by development teams.
Work through ambiguity to make critical decisions, proactively mitigate problems, and ensure excellent technical work to achieve business results.
Actively engage with colleagues to achieve a common goal and mentor across the division to grow future technical contributors.
As a Successful Candidate, You Must Possess
Strong leadership skills and the willingness to mentor junior architects.
Excellent verbal and written communication skills.
A self-motivated attitude with strong problem-solving skills, able to deal with ambiguity.
Willingness to lead cross-discipline teams and collaborate in a high-paced atmosphere.
If you are ready to make an impact and take your career to the next level, we invite you to join our team and contribute to Intel's mission of creating world-changing technology. Apply now to become a part of our innovative AI SoC Architecture Team and help shape the future of technology.
Qualifications:
You must possess the below minimum qualifications to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates. This position is not eligible for Intel immigration sponsorship.
Minimum Qualifications:
Bachelor's degree or higher in Computer Engineering, Computer Science, Electrical Engineering, or similar field of study.
10+ years of diverse system-on-chip architecture, design, verification, firmware, and implementation experience.
Preferred Qualifications:
Master's degree.
10+ years of PCIe expertise
Job Type:
Experienced Hire
Shift:
Shift 1 (United States of America)
Primary Location:
US, California, Folsom
Additional Locations:
US, California, San Francisco; US, California, San Jose; US, California, Santa Clara
Business group:
Intel makes possible the most amazing experiences of the future. You may know us for our processors. But we do so much more. Intel invents at the boundaries of technology to make amazing experiences possible for business and society, and for every person on Earth. Harnessing the capability of the cloud, the ubiquity of the Internet of Things, the latest advances in memory and programmable solutions, and the promise of always-on 5G connectivity, Intel is disrupting industries and solving global challenges. Leading on policy, diversity, inclusion, education and sustainability, we create value for our stockholders, customers, and society.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Benefits:
We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, and bonuses, as well as benefit programs that include health, retirement, and vacation. Find more information about all of our Amazing Benefits here:
**********************************************************************************
Annual Salary Range for jobs which could be performed in the US: $211,820.00-335,680.00 USD
The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.
Solutions Architect, Generative AI
Santa Clara, CA jobs
NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on ecosystem partner enablement for Generative AI. In this role, you will lead by example, acting as both a strategic technical expert and a hands-on developer. You will directly build innovative proof-of-concept solutions and reference architectures for AI agents, demonstrating the full power of the NVIDIA full-stack accelerated Generative AI platforms. By developing these foundational solutions, you will provide partners with the technical blueprints and expert guidance needed to architect and deploy their own transformative applications using the NVIDIA full AI stack, from GPU systems and CUDA to NeMo and Nemotron.
The Generative AI Partners Enablement Solutions Architect team is committed to leveraging advanced technologies to address and expedite the deployment of solutions for customers' real-world challenges. We act as trusted technical advisors and partners to our ecosystem. As a member of the NPN Generative AI Solution Architecture team, you will be immersed in a diverse, supportive environment where everyone is inspired to do their life's work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing and AI to build category-defining systems and production-grade AI solutions at scale.
What you will be doing:
* Build end-to-end agentic AI applications that solve real-world enterprise problems across various industries.
* Serve as the primary technical domain expert for pre- and post-sale for partners, embedding deeply with them to design and deploy Generative AI solutions at scale. Maintain strong relationships with leadership and technical teams to drive adoption and successful utilization of NVIDIA GenAI platforms.
* Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to production.
* Establish the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring alignment to standardized and reproducible GPU-accelerated workflows.
* Enable strategic partners to build their own Professional Services, platforms and products by integrating and accelerating using NVIDIA technologies for high-impact customer workloads. You will proactively find opportunities to drive deeper adoption and utilization of NVIDIA's Generative AI products.
* Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
What we need to see:
* MS or PhD degree in Computer Science/Engineering, Machine Learning, Data Science, Electrical Engineering or a closely related field (or equivalent experience).
* 5+ years of meaningful work experience in deploying AI models at scale as a Software Engineer or Deep Learning engineer.
* Consistent track record of building enterprise-grade agentic AI systems using open-source models and a solid foundation in deep learning, with a particular emphasis on LLM and VLM.
* Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen) and evaluation and observability platforms. Comfortable building prototypes or proofs of concept.
* Strong software development skills and proficiency in Python, C++, and deep learning frameworks (PyTorch or TensorFlow).
* Excellent communication and presentation skills to effectively collaborate with internal executives, partners, and customers.
Ways to stand out from the crowd:
* Demonstrated expertise in building applications and systems using NeMo Framework, Nemotron, Dynamo, TensorRT-LLM, NIMs, and AI Blueprints, along with active contributions to the open-source community.
* Take end-to-end ownership of projects, proactively acquiring new skills or knowledge as needed to drive success.
* Excel in fast-paced environments, adeptly managing multiple workstreams and prioritizing for the highest customer impact.
* Understanding of different advanced agent architectures and emerging communication protocols (MCP, OpenAI Agentic SDK, or Google A2A).
* Familiarity with NVIDIA GPUs and system software stacks (e.g. NCCL, CUDA), as well as HPC technologies such as InfiniBand, MPI, NVLink and others.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until November 13, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, Inference Deployments
Santa Clara, CA jobs
We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you'll collaborate closely with our engineering, DevOps, and customer success teams to foster enterprise AI adoption. Together, we'll introduce generative AI to production!
What you'll be doing:
* Help customers craft, deploy, and maintain scalable, GPU-accelerated inference pipelines on Kubernetes for large language models (LLMs) and generative AI workloads.
* Enhance performance tuning using TensorRT/TensorRT-LLM, NVIDIA NIM, and Triton Inference Server to improve GPU utilization and model efficiency.
* Collaborate with multi-functional teams (engineering, product) and offer technical mentorship to customers implementing AI at scale.
* Architect zero-downtime deployments, autoscaling (e.g., HPA with custom metrics, or equivalent), and integration with cloud-native tools (e.g., OpenTelemetry, Prometheus, Grafana).
What we need to see:
* 5+ Years in Solutions Architecture with a proven track record of moving AI inference from POC to production on Kubernetes.
* Experience architecting GPU allocation using NVIDIA GPU Operator and NVIDIA NIM Operator. Troubleshoot sophisticated GPU orchestration, optimize with Multi-Instance GPU (MIG), and ensure efficient utilization in Kubernetes environments.
* Proficiency with TensorRT-LLM, Triton, and TensorRT for model optimization and serving.
* Success stories optimizing LLMs for low-latency inference in enterprise environments.
* BS or equivalent experience in CS/Engineering.
Ways to stand out from the crowd:
* Prior experience deploying NVIDIA NIM microservices for multi-model inference.
* Serverless Inference, knowledge of FaaS patterns (e.g., Google Cloud Run, AWS Lambda, NVCF) with NVIDIA GPUs.
* NVIDIA Certified AI Engineer or similar.
* Active contributions to Kubernetes SIGs or AI inference projects (e.g., KServe, Dynamo, SGLang or similar).
* Familiarity with networking concepts which support multi-node inference such as MPI, LWS or similar.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until November 25, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, AI Hyperscalers
Santa Clara, CA jobs
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by extraordinary technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing.
NVIDIA is searching for an AI/ML Solutions Architect focusing on Hyperscale customers and Cloud Service Providers. Your primary responsibilities will be to lead software customer technical engagement for AI training, inference and infrastructure being deployed at vast scale. You will work across multiple organizations within NVIDIA as well as at the customer to ensure successful and trouble-free deployments. If you would like to partner with a large company to build automation and management for a robust, large-scale artificial intelligence infrastructure, and are interested in the optimization and characterization of customer-specific AI models and pipelines, you should apply!
What you'll be doing:
* As a key technical member of a focused account team, you will serve as the main point of contact for NVIDIA products, enabling internet giants and cloud providers to have an innovative AI/ML software infrastructure.
* Work directly with best-in-class engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout their lifecycle.
* Become a trusted advisor to your customer by understanding their environment, constraints, and long-term strategy. Translate these insights into product requirements and innovative solutions.
* Help your customer enhance the value of NVIDIA technology, and provide feedback to NVIDIA for future product improvements.
* Facilitate the resolution of customer issues, offering timely and proactive communications to mitigate risks.
* Lead workshops, demos, and proof-of-concepts to showcase NVIDIA's AI/ML capabilities.
* Guide customers on standard processes for scalable AI model deployment and inference optimization.
What we need to see:
* Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
* 4+ years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.
* Proven understanding of Linux, including troubleshooting, optimization, and customization for AI/ML workloads.
* Strong understanding of data science and machine learning infrastructure, both software and hardware.
* Professional-level communication skills, including the ability to tailor messages for varying technical audiences and maintain composure in high-pressure situations.
* Excellent follow-up and interpersonal skills, with a true passion for problem-solving.
* Proficient in Python, with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.
* Shown eagerness to learn and apply new technologies.
Ways to stand out from the crowd:
* Experience with Chatbots, RAG pipelines, vector databases, and distributed training or inference workloads.
* Experience or background in HPC (High Performance Computing) environments for AI or ML applications.
* Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads.
* Experience developing in cloud and/or virtualized environments and with containerized solutions, including knowledge of Docker and Kubernetes.
* Background with common deep learning frameworks such as PyTorch or JAX.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until October 11, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, Worldwide Field Operations - New College Graduate 2026
Santa Clara, CA jobs
NVIDIA is the world leader in GPU-accelerated computing and AI, and we need Solutions Architects to help our customers adopt GPU Deep Learning, accelerated data analytics, and other GPU-accelerated technologies. As a new college graduate with a technical degree, our rotation program will team you up with experienced Solutions Architects for on-the-job training as you work as part of an NVIDIA account team supporting customers building solutions with our software and hardware platforms. A Solutions Architect is the first line of technical expertise between NVIDIA and our partners and customers. You will dynamically engage with developers, scientific researchers, and data scientists to help them identify their critical workloads and integrate our libraries into their platforms.
Spend 18 months rotating quarterly through customer-facing teams supporting different industries such as hyperscale cloud service providers, retail, healthcare, automotive, robotics and AI Factory. In the process, you will have the opportunity to become a specialist in our enterprise products as well as our developer software platforms such as NVIDIA AI Enterprise, CUDA-X and Omniverse. At the end of 18 months, after becoming a domain specialist on at least 1 cloud managed service and NVIDIA technology, you will join a team as a Solutions Architect in an area of your choosing.
What you'll be doing:
* Be responsible for the setup of experiments, tests, equipment, and otherwise facilitate evaluations that help solve customer problems using NVIDIA technologies
* Partner with Sales Account Managers or Developer Relations Managers to secure design wins, assist in technical conversations and support the product through customer proof-of-concept evaluations
* Establish close technical ties to the customer account, establishing personal relationships to facilitate rapid resolution of customer issues
* Work closely and collaborate with the NVIDIA customer account team, other Solutions Architects, and/or product engineering teams during quarterly rotation assignments
* Raise and provide timely advance warning of critical customer issues that require additional attention
* Present platform solutions to customers, partners, community, etc.
* Some rotation assignments might require up to 15% travel
What we need to see:
* Bachelor's or Master's degree in Computer Science, Math, Physics or related technical field or equivalent experience
* Data Sciences, Deep Learning, or Machine Learning coursework
* Experience with at least one scripting language (e.g., Python)
* Programming skills in 1 or more high-level languages (C, C++, Java, etc)
* Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills
* Strong verbal and written communication skills, including presentation skills to engage any audience
* Strong collaboration and interpersonal skills
* Passion for continuous learning and knowledge transfer
* Enjoy working in a constantly evolving environment without losing focus
Ways to stand out from the crowd:
* Experience working on AI Deep Learning and Machine Learning Applications, AI Model Training/Inferencing or other GPU related technologies
* Experience using TensorFlow, PyTorch or other DL framework
* Experience working with Docker containers and Kubernetes
* CUDA programming experience
* Experience working with both on-prem and cloud-based infrastructure
* System-level experience with both hardware and software
* Large-scale systems management experience
* Exposure to cloud service platforms such as AWS, Azure, or GCP through coursework or through certification programs
With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 104,000 USD - 172,500 USD for Level 1, and 120,000 USD - 189,750 USD for Level 2.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until October 11, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, AI and ML
Santa Clara, CA jobs
NVIDIA is building the world's leading AI company, and we are looking for an experienced Cloud Solution Architect to assist customers with the adoption of GPU hardware and software, as well as with building and deploying Machine Learning (ML), Deep Learning (DL), and data analytics solutions on various Cloud Computing Platforms. As part of the Solutions Architecture team, we work with some of the most exciting computing hardware and software technologies including the latest breakthroughs in machine learning and data science. A Solutions Architect is the first line of technical expertise between NVIDIA and our customers, so you will engage directly with developers, researchers, and data scientists at some of NVIDIA's most strategic technology customers as well as work directly with business and engineering teams on product strategy. We are looking for a Solutions Architect to help drive end-to-end technology solutions applying NVIDIA's full set of technologies based on business needs of customers. Join us in this exciting endeavor!
What you will be doing:
* Working with Cloud Service Providers to develop and demonstrate solutions based on NVIDIA's ML/DL and data science software and hardware technologies
* Build and deploy AI/ML solutions at scale using NVIDIA's AI software on cloud-based GPU platforms.
* Build custom PoCs for solutions that address customers' critical business needs by applying NVIDIA hardware and software technology
* Partner with Sales Account Managers or Developer Relations Managers to identify and secure new business opportunities for NVIDIA products and solutions for ML/DL and other software solutions
* Prepare and deliver technical content to customers including presentations about purpose-built solutions, workshops about NVIDIA products and solutions, etc.
* Conduct regular technical customer meetings for project/product roadmap, feature discussions, and intro to new technologies. Establish close technical ties to the customer to facilitate rapid resolution of customer issues
What we need to see:
* 3+ years of Solutions Engineering (or similar Sales Engineering roles) or equivalent experience
* 3+ years of work-related experience in Deep Learning and Machine Learning, including deep learning frameworks such as TensorFlow or PyTorch; GPU and CUDA experience is extremely helpful.
* BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Statistics, Physics, or other Engineering fields or equivalent experience.
* Established track record of deploying solutions in cloud computing environments including AWS, GCP, or Azure
* Knowledge of DevOps/ML Ops technologies such as Docker/containers, Kubernetes, data center deployments
* Ability to use at least one scripting language (e.g., Python)
* Good programming and debugging skills
* Ability to communicate your ideas/code clearly through documents, presentations, etc.
Ways to stand out from the crowd:
* AWS, GCP or Azure Professional Solution Architect Certification.
* Hands-on experience with NVIDIA GPUs and SDKs (e.g. CUDA, RAPIDS, Triton etc.)
* System-level experience, specifically with GPU-based systems
* Experience with Deep Learning at scale
* Familiarity with parallel programming and distributed computing platforms
We make extensive use of conferencing tools, but occasional travel is required for local on-site visits to customers and industry events. NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD for Level 2, and 148,000 USD - 235,750 USD for Level 3.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until December 21, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, Generative AI
Santa Clara, CA jobs
NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on partner enablement for Generative AI. In this role, you will lead by example, acting as both a strategic technical expert and a hands-on developer. You will directly build innovative proof-of-concept solutions and reference architectures for innovative AI applications, demonstrating the full power of the NVIDIA accelerated Generative AI platforms. By developing these foundational solutions, you will provide partners with the technical blueprints and expert guidance needed to architect and deploy their own transformative applications using NVIDIA's full AI stack, from GPU systems and CUDA to NeMo and Triton.
The Generative AI Partners Enablement SA team is dedicated to applying next-generation technologies to solve customer problems. We act as trusted advisors and technical partners to our ecosystem. As a member of the NPN Generative AI Solution Architecture team, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing and AI to build category-defining systems and production-grade AI solutions at scale.
What you will be doing:
* Serve as the primary technical domain expert for partners, both pre- and post-sale, embedding deeply with them to design and deploy Generative AI solutions. Maintain strong relationships with leadership and technical teams to drive adoption and successful utilization of NVIDIA GenAI platforms.
* Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to production.
* Define the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring they are built on standardized and reproducible GPU-accelerated workflows.
* Enable strategic partners to launch their own Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact customer workloads. You will proactively find opportunities to drive deeper adoption and utilization of NVIDIA's Generative AI products.
* Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
What we need to see:
* MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, Machine Learning, or related fields (or equivalent experience).
* 5+ years of relevant work experience in developing and deploying AI models at scale as a Software Engineer or deep learning engineer.
* Consistent track record of building enterprise-grade agentic AI systems using open-source models, and a solid foundation in deep learning with a particular emphasis on generative models.
* Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen) and with evaluation and observability platforms. Comfortable building prototypes or proofs of concept.
* Strong software development skills and proficiency in Python, C++, and deep learning frameworks (PyTorch or TensorFlow).
* Excellent communication and presentation skills to effectively collaborate with internal executives, partners, and customers.
Ways to stand out from the crowd:
* Demonstrate expertise and hands-on experience with NVIDIA AI platforms.
* Understanding of different advanced agent architectures and emerging communication protocols (MCP or Google A2A).
* Excellent practical knowledge of Generative AI and LLM development. Ability to train GPT and Megatron Models.
* Understanding of MLOps life cycle management and experience with LLMOps workflows.
* Experience with CUDA programming and with benchmarking and analyzing the performance of foundation models.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until August 14, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect - Cloud Providers and Hyperscale
Santa Clara, CA jobs
We are now looking for a Solutions Architect! NVIDIA is searching for a Solutions Architect with expertise in AI, Machine Learning, and HPC, with a focus on Hyperscale and Cloud Providers. Primary responsibilities will be to lead technical engagements with customers as they integrate, optimize, and apply NVIDIA's hardware and software technologies.
Would you like to collaborate with some of the biggest companies developing brand new AI solutions by applying both NVIDIA and cloud technologies? Interested in broadening your skills in building robust AI pipelines and deployment of large-scale AI models in the cloud? Then read on!
What you'll be doing:
* Lead on NVIDIA software products within a focused account team, assisting large customers and cloud providers in developing new workflows using NVIDIA technologies (hardware and software).
* Work closely with outstanding engineering and product teams to tackle tough problems and bring NVIDIA solutions to market in customer products and workflows
* Become a trusted advisor for the customer by understanding their environment, constraints, and business models, then translating those into product requirements and solutions that solve their problems using NVIDIA technologies.
* Conduct regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, and debugging sessions
* Work with customers to build PoCs for solutions to address critical business needs
* Prepare and deliver technical content to customers including presentations, workshops, etc.
What we need to see:
* BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
* Motivation and skills to help drive technical pre-sales activities.
* 5+ years of Solutions Engineering (or similar Sales Engineering roles) experience.
* Familiarity (work experience) with Python, scripting, etc.
* Familiarity with AI frameworks
* Effective time management and the ability to balance multiple tasks.
* Ability to communicate ideas clearly through documents, presentations, etc.
Ways to stand out from the crowd:
* External customer facing skill-set and background
* Hands-on experience developing AI applications using NVIDIA technologies (GPUs and/or software)
* Cloud experience (applying cloud concepts like Database, etc. in developing workflows for customer use cases)
* Hands-on experience with GPU systems in general including but not limited to AI workflow development, performance development, AI benchmarking, etc.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until October 13, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect - Accelerated Computing
Santa Clara, CA jobs
We are looking for an Accelerated Computing Solutions Architect! NVIDIA is searching for a Solutions Architect with expertise in AI, Machine Learning, and HPC including performance optimization, application development and networking technologies. Primary responsibilities will be to lead technical engagements with customers as they investigate performance optimization and improvements for their large scale AI and HPC application deployments. Would you like to collaborate with some of the biggest companies developing brand-new AI solutions by applying both NVIDIA and cloud technologies and optimizing them for outstanding performance? Interested in broadening your skills in building robust AI pipelines and deployment of large-scale AI models in the cloud? Then read on!
What you'll be doing:
* Work with NVIDIA Cloud Service Providers and NVIDIA Cloud Partners on accelerated computing and performance optimization problems
* Guide customers through their journey from cluster and application deployment, through optimization, to large-scale AI training, inference, and HPC
* Build custom product demonstrations and PoCs for solutions addressing critical customer business needs, and analyze, debug, and optimize customer workloads
* Conduct regular technical customer meetings for project/product details, feature discussions, and introductions to new technologies
* Prepare and deliver technical content to customers at workshops, conferences, etc.
What we need to see:
* BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering/Science or equivalent experience.
* Motivation and skills to help drive technical pre-sales activities
* 5+ years of Solutions Engineering, Solutions Architecture or similar Sales Engineering experience
* Practical knowledge of data center scale optimization - data center topologies, networking software optimization
* Knowledge of software development - specifically Python, scripting, etc.
* Ability to communicate ideas clearly through documentation, presentations, customer meetings, etc.
Ways to stand out from the crowd:
* External customer facing skill-set and background
* Hands-on experience developing AI applications using NVIDIA technologies (GPUs and/or software) and optimizing/debugging performance with NCCL, etc.
* Cloud experience (applying cloud concepts like Database, etc. in developing workflows for customer use cases)
* Hands-on experience with GPU systems in general including but not limited to AI workflow development, performance development, AI benchmarking, etc.
* Effective time management and capability to balance multiple tasks and multiple customers at once while thinking creatively to debug and solve problems
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until October 13, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect - NVIDIA Cloud Partners
Santa Clara, CA jobs
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are looking for an experienced Solutions Architect to help bridge the gap between design and deployment of large-scale AI and HPC GPU infrastructure. Do you want to be part of the team that brings GenAI, AI, ML, etc. hardware and software technologies to production in the field? We are looking for a Solutions Architect to join the NVIDIA team focused on supporting customers as they build the next-generation GPU infrastructure for their customers. As a part of the NVIDIA solutions architecture team, you will be driving end-to-end technology solution integration with some of NVIDIA's most strategic customers as well as offering recommendations for business and engineering teams based on customer feedback on product strategy.
What you'll be doing:
* Collaborate with NVIDIA Cloud Partners to create, implement, and put into operation NVIDIA's innovative hardware and software solutions.
* Partner with Sales Account Managers and other business leads to identify and secure business opportunities for NVIDIA products and solutions.
* Act as the primary technical support for customers during the development, construction, and production of extensive GPU cloud infrastructure throughout the whole customer lifecycle.
* Conduct regular technical customer meetings for project/product details, feature discussions, intro to new technologies, and debugging sessions.
* Work with customers to build PoCs for solutions to address critical business needs by building out networking and compute infrastructure.
* Prepare and deliver technical content to customers including presentations, workshops, etc.
* Analyze and develop joint solutions for customer performance and scaling issues.
What we need to see:
* BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
* Motivation and skills to own and drive technical engagements with customers throughout the full customer lifecycle.
* 7+ years of Solution Engineering (or similar Sales Engineering, Cloud Engineering) experience working directly with partners and customers.
* Experience crafting and deploying large-scale cluster environments.
* Practical expertise in data center design, development and execution for AI and HPC.
* Effective time management and the ability to balance multiple tasks. Ability to communicate ideas clearly through documents, presentations, etc.
Ways to stand out from the crowd:
* Practical familiarity with NVIDIA hardware (such as GPUs, ETH/IB networking components, storage, etc.) within extensive AI and HPC cluster settings.
* Practical knowledge of NVIDIA systems technology such as NCCL, DCGM, UFM, Mission Control, Base Command Manager, etc.
* Background with at-scale GPU systems in general, encompassing performance testing, AI benchmarking, and more.
* Practical involvement in cluster administration and coordination (SLURM, K8s, etc.).
We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until December 19, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Solutions Architect, Generative AI Specialist
Santa Clara, CA jobs
Are you a creative data scientist who loves solving problems with AI? Do you enjoy developing case studies and teaching others how to use new AI technology? We are looking for a passionate Generative AI Specialist to join NVIDIA as a Solution Architect (SA) in Santa Clara, CA, USA. As a member of our partner enablement team, you will have the unique opportunity to use NVIDIA's innovative AI platform to create sample solutions that our customers and partners can adapt to real-world applications. We have assembled a world-class team of experts whose work is accelerating enterprise adoption of AI.
What you'll be doing:
One of the perks of joining NVIDIA is that you'll enjoy access to incredible emerging technology. You will use our amazing platform to demonstrate end-to-end deployment of AI solutions in exciting areas such as healthcare, retail, and financial services.
* Develop reference architectures showing the application of AI in industry-specific contexts.
* Build sample applications that bring these AI workflows to life.
* Lead live workshops and trainings to ensure partner and customer success.
* Scale your knowledge by sharing your lessons with other NVIDIA teams around the globe.
* Foster learning and improvement by providing valuable feedback to sales and product specialists.
What we need to see:
* 5+ years of relevant work experience developing and deploying scalable LLMs and enterprise applications using PyTorch or TensorFlow, with a Software Engineer or ML Engineer background.
* Proven track record of building enterprise RAG-based systems using open-source frameworks such as LlamaIndex, LangChain, Milvus, Haystack, etc.
* Excellent practical knowledge of Generative AI and LLMs. Ability to train and fine-tune GPT-based models such as Llama-2, Megatron, and GPT-3.
* Robust foundational expertise: MS or advanced degree in a field related to AI or Computer Science, or equivalent experience.
* Passion for knowledge sharing and track record of building educational content in the field of AI or Cloud.
* Clear love of technology and AI, as illustrated by knowledge of emerging trends and tools.
* Strong communication skills and experience working in external-facing roles. Comfortable working in a highly collaborative environment.
Ways to stand out from the crowd:
While not required, the following experiences and skills will set you apart:
* Hands-on experience with NVIDIA AI Enterprise software, Base Command Manager, NeMo, NVIDIA Inference Microservices (NIMs), and RAPIDS.
* Excellent communication and presentation skills to effectively collaborate with both internal and external customers.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until November 8, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Senior Principal Consultant or Architect
San Francisco, CA jobs
**Introduction** A career in IBM Consulting is rooted in long-term relationships and close collaboration with clients across the globe. You'll work with visionaries across multiple industries to improve the hybrid cloud and AI journey for the most innovative and valuable companies in the world. Your ability to accelerate impact and make meaningful change for your clients is enabled by our strategic partner ecosystem and our robust technology platforms across the IBM portfolio.
**Your role and responsibilities**
Currently, we are looking for a highly experienced, team-oriented Oracle Cloud Payroll Functional Lead to join our talented consulting team. This is a US based, full-time position, with travel to customer sites as needed.
What You'll Do:
* Consult on best practices for Oracle Cloud Payroll policies
* Be an expert in the configuration and management of the Oracle Cloud ERP Payroll applications
* Provide best-practice guidance on payroll business processes and implementation
* Support the definition and validation of various payroll-related conversion activities
**Required technical and professional expertise**
* Bachelor's degree (or equivalent experience)
* Minimum 5 years of experience as an Oracle Cloud Payroll Lead with 2-4 years of experience in implementing Oracle Cloud
* Experience with public sector clients like state governments, counties, and cities is considered a plus
* Applicants with hands-on experience with Oracle HCM Cloud Tools such as HCM Extract, HDL, and PBL are preferred
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
Director Solutions Architect - GPU
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
The AMD Datacenter GPU team is seeking an experienced Director Solutions Architect to join our team focused on enabling large clusters for AI & HPC workloads.
THE PERSON:
The candidate will be a technical expert in datacenter infrastructure with deep knowledge of datacenter design, strong knowledge of compute (CPUs/GPUs), networking, and storage solutions, and experience partnering with customers to support RFP development. This role offers the opportunity to work at the cutting edge of AI & HPC infrastructure, solving complex technical challenges and helping customers implement transformative datacenter solutions at-scale.
KEY RESPONSIBILITIES:
* Lead customer technical discovery with data/ML, platform, and infrastructure stakeholders; map business goals to AI & HPC workloads and success metrics.
* Assess current system state (GPUs/accelerators, storage, fabric, security), identify gaps and risks, and define required POCs.
* Shape reference architectures for large AI & HPC clusters (rack design, GPU topology, RoCE/InfiniBand, NVMe/parallel FS) aligned to customer constraints (power, cooling, space).
* Create high-level design.
* Partner with the business development and product teams to build ROI/TCO models (CapEx/OpEx, $/token, $/inference) and craft the value story.
* Support drafting of technical sections of RFIs/RFPs; produce architecture diagrams, deployment plans, and implementation timelines.
* Partner with program & engineering teams to define POC success criteria, test plans, and exit reports.
* Collaborate with product management to foster product roadmap improvements.
* Network design for high-throughput GPU clusters (scale-up / scale-out / OOB), cabling.
* Storage architectures optimized for AI data pipelines.
* Datacenter layout strategies / power / cooling.
* Rack power delivery / mechanicals.
PREFERRED EXPERIENCE:
* Substantial experience designing and implementing large-scale infrastructure solutions.
* Strong understanding of datacenter networking and storage architectures.
* Experience with GPU-accelerated computing environments.
* Proven track record of creating technical documentation and reference architectures.
* Excellent communication skills with the ability to explain complex technical concepts.
* Experience working directly with customer technical teams.
ACADEMIC CREDENTIALS:
* Bachelor's degree or higher in Computer Science, Electrical Engineering or closely related field.
LOCATION:
* San Jose, CA
#LI-EV1
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal Power Architect
Santa Clara, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
Join our Data Center Platform Architecture Engineering team as a Lead Principal Power Architect, where you will influence the next generation of GPU platform power delivery. In this high-impact position, you will define end-to-end power architecture features with a focus on efficiency, scalability, and system-level performance. You will work closely with cross-functional partners across silicon, system, mechanical, and thermal engineering to bring innovative power architectures from concept to production. This team prioritizes continuous technical innovation, collaboration, and career development while delivering industry-leading technologies to market.
THE PERSON:
You are a strategic thinker with deep power electronics engineering expertise and a passion for solving complex architectural challenges. You excel in fast-paced, highly visible environments and bring a balanced mix of technical depth, analytical rigor, and cross-functional communication. You are proactive, detail-oriented, and skilled at driving clarity across ambiguous or complex technical problems. Your ability to model, prototype, debug, and influence system architecture makes you a key contributor to the platform's success.
KEY RESPONSIBILITIES:
* Architectural Definition & Modeling: Define and develop GPU end-to-end platform power delivery architecture features with emphasis on power efficiency.
* Build advanced power and performance models to evaluate architectural and system-level trade-offs.
* Technical Innovation: Drive innovation across architecture development, methodology enhancements, and cross-functional technical initiatives that advance platform capabilities.
* Prototyping & Validation: Support the development of prototype proof-of-concepts. Analyze and resolve complex issues throughout the architectural and product development lifecycle.
* System Debug: Lead debugging efforts during bring-up, validation, and production phases of SOC programs.
* Cross-Functional Collaboration: Partner closely with System Architects, Silicon/ASIC Design, PCB Design, Mechanical Engineering, Thermal Engineering, and Validation teams to ensure alignment on power architecture goals and seamless integration.
* Specification & Documentation: Develop clear system specifications, architecture documents, and implementation guidelines for engineering teams.
PREFERRED EXPERIENCE:
* Experience in GPU/CPU architecture with strong emphasis on power delivery and energy efficiency
* Proficiency with high-power GPU/CPU architecture topologies (Buck, Boost, Buck-Boost, LLC, Full Bridge, Half-Bridge, etc.)
* Strong understanding of modeling tools and methodologies (Simplis, Cadence, HSPICE)
* Expertise in debug techniques and methodologies
* Proficient with common lab equipment and hands-on board/platform-level debug, including delivery, sequencing, analysis, and optimization
* Deep knowledge of system architecture, technical debug, and validation strategies
* Experience with end-to-end power architecture definition in large-scale platforms
* Familiarity with system-level co-design considerations (thermal management, mechanical constraints, signal integrity)
* Strong analytical and problem-solving skills with exceptional attention to detail
* Self-starter with the ability to independently drive initiatives to completion
ACADEMIC CREDENTIALS:
* Master's or Ph.D. in Electrical Engineering, with a focus on power electronics or power integrity (preferred)
Location could also be in Austin, TX, Secaucus, NJ, Seattle or Bellevue, WA, or Longmont, CO
This role is not eligible for visa sponsorship
#LI-TL1
#HYBRID
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Principal SoC Architect, Discrete GPU SoCs - 113062
Santa Clara, CA jobs
What you do at AMD changes everything
At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies - building blocks for gaming, immersive platforms, and the data center.
Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
The Role:
AMD is looking for an outstanding technical contributor to help optimize performance of next-generation Discrete GPU SoCs. As a passionate and dedicated Performance Architect in the SoC Architecture Team, you will optimize SoC performance by identifying opportunities across all parts of the system architecture, spanning the application and related frameworks, driver, GPU core, and memory and I/O subsystems. You will use performance models for microarchitecture trade-offs, collect and analyze performance profiling data and traces to isolate performance bottlenecks, and share the insights to help drive features into the next generation of GPUs. The ideal candidate is well informed on the latest trends in GPU and CPU systems and can quickly ascertain, in a data-driven manner, how next-generation GPUs and platforms should be engineered to support those needs.
Key Responsibilities:
SoC-level performance analysis for multiple SoCs in the Discrete GPU product line.
Technical time horizon is generally 6 to 18 months.
Explore and evaluate architectural design choices in Fabrics, Caches, and IPs in the SoC.
Work closely with other HW and SW architects to understand the architecture and the workloads, and to propose solutions that improve performance for the given SoC/IP and markets.
Architectural modeling (as needed), performance triage, deeper analysis, identifying bottlenecks and solutions for performance improvement.
Proposing, communicating, and implementing solutions to issues.
Participate in new workload definition and/or workload optimizations.
Engage deeply with the Performance Modeling team on new feature development: develop the performance analysis plan in the Architecture phase, analyze results, and correlate models.
Work closely with Design and Performance Verification teams to set performance KPIs, review performance verification test plans, and triage the performance issues they identify.
Work closely with post-silicon teams to correlate pre-silicon models and assumptions, and support them on performance validation issues.
Position Requirements
Exceptional foundation in systems architecture, cutting across CPU or GPU, memory, storage and I/O subsystems
Experience in using Performance Modeling and Simulation Tools for CPUs, GPUs or SoC components at different abstraction levels (Functional, TLM or Cycle Accurate)
Experience analyzing CPU, GPU or System-level Micro-Architectural features to identify performance bottlenecks within different workloads
Experience with Performance monitoring tools such as Hardware performance counters and Visualization tools
Excellent communication skills (verbal, written, and presentation)
Exposure to GPU performance benchmarks and workloads is a plus
Proficiency in Unix, C++, and Scripting languages such as Python is a plus
More than 8 years relevant industry experience
ACADEMIC CREDENTIALS:
Bachelor's, Master's, or PhD degree with emphasis in Electrical or Computer Engineering, Computer Architecture, or Computer Science, with SoC/IP performance studies.
Requisition Number: 113062
Country: United States State: California City: Santa Clara
Job Function: Design
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status.
Senior Solution Engineer, AI Factory Triage
Santa Clara, CA jobs
NVIDIA is looking for an engineer who wants the excitement of direct customer interaction, and the reward of contributing to software and products, to join our team of Solution Engineers supporting NVIDIA's GPU-accelerated platforms in AI Factories! You will work directly with customers to deliver solutions on the latest NVIDIA platforms, including the GB200. We are looking for an experienced engineer to triage customers' hardware platform issues and AI/ML workloads in huge datacenters of rack-scale platforms, solve customer problems, and contribute to products and software tooling. You must have excellent problem-solving abilities and communication skills, and be able to work on multiple projects and tasks. You must be technically strong in Linux, have solid programming skills, and have experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus.
What you'll be doing:
Provide direct support to our NVIDIA Enterprise customers and work to answer questions, reproduce, resolve, or advance customer issues.
Work with engineering teams on customer issues, providing logs, reproduction information, and other triage information.
Create/update product and/or support tools.
Take ownership and drive customer issues from inception to resolution.
Document customer interactions and enhance our knowledge base.
Develop features and tools as part of solution engineering efforts to support NVIDIA technologies
Occasional work on weekends and holidays to support customers
What we need to see:
Minimum of a BS in Computer Engineering, Electrical Engineering, or equivalent experience.
At least 5 years of engineering experience with multi-GPU platforms
Strong system software (firmware, BIOS, kernel, driver, operating system) expertise
Solid understanding of Linux and the ability to analyze, optimize, and customize Linux environments for AI/ML workloads.
Containerized solutions experience with Docker, Kubernetes, Slurm
Professional-level communication skills, including adjusting communication to the technical level of the audience, and staying calm and focused in negative situations.
Excellent follow-up and organizational skills, with a passion for solving problems.
Proficient in C/C++ programming of platform OS, firmware, BIOS, kernel, drivers
Proficient in Python programming with the ability to build custom tools
Ways to stand out from the crowd:
Background with parallel programming or GPU acceleration (e.g., CUDA)
Experience developing in GPU accelerated / cloud / virtualized environments
Experience analyzing software performance of distributed workloads
Clustering or HPC data center technologies including Upper Layer Protocols (NCCL, MPI)
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 212,750 USD for Level 3, and 168,000 USD - 264,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until August 5, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Senior Cloud Architect Engineer
San Jose, CA jobs
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
The Cloud Architect will be responsible for providing technical support to Engineering and Corporate organizations at AMD. This position will be required to support AMD's global cloud infrastructure in a dynamic, fast-paced environment. Furthermore, this person will be collaborating globally on efforts for various IT activities related to the AMD Engineering AI/GPU - Compute Environment, in accordance with AMD Worldwide IT strategies and objectives.
THE PERSON:
You're a highly motivated team player with a strong development background, a problem-solving mentality, excellent communication skills, the ability to prioritize tasks, and a willingness to learn and adapt. You have excellent teamwork skills and are capable of working independently.
KEY RESPONSIBILITIES:
* Design, develop, deploy, monitor, maintain, and evolve cloud-native resources, tools, services, reusable modules (infrastructure-as-code practices), and frameworks to secure and automate provisioning of cloud infrastructure that empowers our users across Azure, AWS, and GCP.
* Provide customers with standards and best practices on how to deploy and consume cloud-based services.
* Proactively seek opportunities to improve operational efficiency of teams and usage of cloud services.
* Contribute to a strong team-culture and an atmosphere of cross-functional teamwork.
* Work with internal customers in managing incident tickets to achieve operational excellence.
* Work with global teams to provide support and complete IT projects.
* Create secure hybrid deployments of virtual machines and PaaS solutions in Azure, AWS, and GCP.
* Work with project teams to understand and accommodate application architecture and each application's specific requirements for Azure, AWS, and GCP.
* Collaborate with other engineers and stakeholders to share knowledge and build expertise for IaaS, PaaS, and SaaS deployment.
* Collaborate with onshore and offshore resources.
* Implement and automate security controls, governance processes, and compliance validation, partnering closely with the Security Team to incorporate their requirements and best practices and keep our cloud environment safe and secure.
* Apply experience in migrating on-premises applications and workloads to Azure, AWS, and GCP using cloud technologies, and provide support.
* Drive identity (IAM), access, and configuration management for cloud-native tools.
* Own the recovery and continuity process for cloud environments.
PREFERRED EXPERIENCE:
General cloud systems engineering experience with the fundamentals of various CSPs, including:
* Terraform, YAML, Jenkins, GitHub Actions.
* Python, Golang, Shell, Java/J2EE, NodeJS, ReactJS, HTML5, PyTorch
* Able to build and support a full CI/CD pipeline to support consistent code deployment.
* Understanding of AI frameworks preferred
* Managing GPU clusters and optimizing GPU-based services/tools/software
* Experience with Container technologies (GKE, EKS, ECS, Docker, Kubernetes) is desirable.
* Understanding of change management and release processes
* Strong analytical and problem-solving skills.
* Strong understanding of Agile/Scrum methodologies.
* Strong written and verbal communication skills. Ability to effectively communicate technical issues and solutions to peers and external vendors.
* Strong active listening and consensus-building skills and passionate about learning and sharing knowledge with others.
* Infrastructure automation tools such as Ansible, Terraform, CloudFormation, Deployment Manager, and Resource Manager.
* Designing, developing, and implementing solutions that improve efficiency and reduce costs through Kubernetes/containers, virtualization, functions, and automation.
* Experience building and managing complex cloud environments in Azure, AWS, and GCP, including security measures for encryption, authorization, and protocols.
* Monitoring system performance, conducting capacity planning, identifying trends, and providing recommendations to improve service levels via automation.
* Working closely with software development teams to troubleshoot and resolve issues.
* Understanding of cloud networking (VPCs), load balancers, WAFs, and CDNs
* Experience deploying, managing, administering, and migrating Infrastructure platforms in a Hybrid environment.
* Strong understanding of different deployment resource types and when to deploy each type (IaaS, PaaS, SaaS).
* Experience with cloud-native monitoring tools as well as Nagios, the ELK stack, Kibana, and Prometheus.
* Proactive and empathetic mindset - you love to roll up your sleeves to fix problems for our customers.
* Ability to juggle multiple projects and priorities and re-prioritize as necessary to align with current business needs.
* Strong organizational ability.
ACADEMIC CREDENTIALS:
* Bachelor's degree in Computer Science, Engineering, or a related field.
LOCATION:
San Jose, CA
#LI-MF2
#LI-HYBRID
Benefits offered are described: AMD benefits at a glance.
Radeon Power and Performance POWER ARCHITECT/LEAD
Santa Clara, CA jobs
THE ROLE:
You will be part of a small, but dedicated team driving discrete GPU products' performance attainment solutions across hardware, software and the platform. The main tasks will be performance analysis of gaming workloads and building/maintaining a framework of tools that facilitate performance analysis.
THE PERSON:
You have good communication, analytical, and problem-solving skills, with an interest in constantly making things stronger, better, faster.
You are a fast learner and willing to take on new challenges.
You are a self-starter who can work independently while working toward making the overall product successful.
KEY RESPONSIBILITIES:
You will use your academic and/or work experience in CPU/GPU architecture, programming, or data analysis to tackle tasks ranging from creating new analysis tools and improving existing ones to doing performance analysis on important workloads.
Build upon current analysis flows to debug/address end user problems
Work alongside team members to implement new functionality
Generate/analyze performance data on new GPU hardware
PREFERRED EXPERIENCE:
Previous experience with performance analysis and optimization
Exposure to CPU/GPU architectures
Previous experience visualizing or analyzing data is desirable.
Good programming skills: preference for Python. Database/data visualization skills are a bonus
ACADEMIC CREDENTIALS:
A BS degree in EE, CS, CE, or ECE is required.
LOCATION:
Austin, TX; Santa Clara, CA; Markham, Canada
#LI-SW2
Requisition Number: 176228
Country: United States State: California City: Santa Clara
Job Function: Design
Benefits offered are described here.
Lead GPU Machine Learning & HPC Architect
Folsom, CA jobs
THE ROLE:
The RTG (Radeon Technologies Group) Architecture team in Folsom, CA is passionate about developing next-generation GPU solutions. As a Lead GPU Machine Learning & HPC Architect, you will collaborate with a strong architecture and design team on developing next-generation products for data centers and supercomputers. You will engage in architecture exploration, modeling, and analysis of ML/HPC workloads. Through your experiments and analysis, you will provide valuable insight into new and emerging hardware and software technologies and the MI line of products.
THE PERSON:
You have excellent analytical and problem-solving skills, along with attention to detail. You are an effective team player who focuses on collaboration, team building, mentoring, and furthering team success. You have strong communication, time management, and presentation skills.
KEY RESPONSIBILITIES:
* Communicate and collaborate with a network of experienced architects and designers around the world.
* Identify complex technical problems, break them down, and summarize possible solutions
* Work with architects to propose innovative solutions that can be implemented in HW, validated by developing various models/simulators
* Collect/summarize data or simulation results for consumption by architects and design teams
PREFERRED EXPERIENCE:
* Knowledge in GPU architectures, basic knowledge of CPU architecture
* Background in Network-on-Chip (NoC) design and interconnect systems
* Experience with machine learning (ML) networks and frameworks, including TensorFlow and PyTorch
* Understanding of graphics and compute APIs such as CUDA, OpenCL, and Vulkan
* Experience with operating systems (OS) and device driver development is a plus
* Strong programming foundation in C, C++, and scripting languages (Python, etc.)
* Experience in hardware modeling and design using RTL or SystemC
ACADEMIC CREDENTIALS:
* Undergraduate degree required. Bachelor of Science, Master's, or PhD degree with emphasis in Electrical Engineering, Computer Architecture, or Computer Science with relevant experience preferred
This role is not eligible for Visa sponsorship.
#LI-BM1
#LI-Hybrid
Benefits offered are described: AMD benefits at a glance.