Senior Quality Assurance Engineer jobs at Sunrun - 368 jobs
QA/QC Commissioning Associate II
CPG 4.9
Ashburn, VA jobs
Position: QA/QC Commissioning Associate II Location: Remote Job Id: 698 # of Openings: 1 TITLE: QA/QC Commissioning Associate II LOCATION: REMOTE - with 75% travel POSITION SUMMMARY: The QA/QC Commissioning Associate II assists in quality control and quality assurance of data center critical systems preparing for the commissioning process. The QA/QC Commissioning Associate assists the QA/QC Engineer to ensure that the correct equipment has been purchased and that installation is in accordance with industry standards and equipment specifications. This role will develop skills and industry knowledge to perform increasingly more complex commissioning tasks.
ESSENTIAL DUTIES AND RESPONSIBILITIES: To perform this job successfully, an individual must be able to perform the following satisfactorily; other duties may be assigned. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Develop QA/QC documents of the complete project including certificates, calibration, test results, inspection requests, non-compliance reports and site instruction/observations, permanent materials delivered, and other important QA/QC documents
Follow all standards to perform inspection and tests on all procedures and oversee all testing methods and maintain high standards of quality for all processes
Review the quality of all materials at the site and ensure compliance with all project specifications and quality and collaborate with the department for all material procurement and maintain a quality of materials
Support the effective implementation of all test and inspection schedules and ensure adherence to all procedures and coordinate with various teams to perform quality audits on processes
Assist employees to ensure knowledge of all quality standards and ensure compliance to all quality manuals and procedures and collaborate with contractors and suppliers to maintain the quality of all systems
Manage to lift all types of equipment and handle the efficient storage of all hazardous materials and perform quality audits as per the required schedule
Understand all products and non-conformance processes and evaluate all documents to ensure the maintenance of optimal quality and prepare monthly reports to evaluate performance
Monitor an efficient system and record for all project activities and analyze all processes to ensure all work according to quality requirements
Understand all work methods and maintain knowledge on all quality assurance standards and monitor continuous application for all quality assurance processes and recommend corrective actions for all processes
Support and follow a method statement for the activity including risk assessment and job safety environmental analysis and Inspection Test Plan and Checklist based on specifications of the project
Liaise the Technical Engineer for submission of material submittals to Consultant
Develop and maintain inspection reports
Ensure compliance to federal and state laws, as well as company standards and specifications
Maintain calibration of quality testing equipment
Perform inspections across all stages of production
Advising on procedures to improve production efficiency
Prepare and maintain test data for review
Evaluate data and draft reports, noting any relevant deviations from existing standards
Identify areas for quality control improvement and implement new methods accordingly
Communicate quality or compliance concerns with urgency
QUALIFICATIONS: To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Education/Experience (Desired):
Technical Military MOS, trade school and/or degree
Experience and/or education and internship in complex facilities or mission critical projects is preferred
Any civilian or military technical certifications is a plus
Experience with writing and enforcing standard operating procedures (SOPs)
Solid understanding of test equipment & software
Minimum of 2-4 years of inspection and/or production experience
Strong working knowledge of various mathematical concepts including fractions, ratios, and proportions
Demonstrated ability to work independently with minimal supervision
Excellent organizational skills
Demonstrated ability to analyze and interpret information
Must be a US Citizen
Must be willing to travel 75%
Computer Skills:
Advanced Excel skills preferred
Experience using Microsoft Office Suite, Word and Microsoft Project
Basic knowledge of systems design for various projects
Certificates and Licenses:
No certificates or licenses required
Supervisory Responsibilities:
No supervisory responsibilities for this position.
Physical Demands: The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Occasionally lift and or move objects 10 to 50; Frequently required to stand, walk, stoop, kneel, crouch or crawl. The employee is occasionally required to sit and climb or balance. Specific vision abilities for this job include close vision, distance vision, color vision, peripheral vision, depth perception and the ability to adjust and focus. Noise Level can be moderate to high.
The above job description is not intended to be an all-inclusive list of duties and standards of the position. Incumbents will follow any other instructions, and perform any other related duties, as assigned by their supervisor.
CPG is an equal opportunity employer. We will consider all employment applicants without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.
CPG Participates in E-Verify
#:LI-TG1
Pay Range: $65,013 - $97,580 per year Apply for this Position
$65k-97.6k yearly 6d ago
Looking for a job?
Let Zippia find it for you.
QA/QC Commissioning Associate III
CPG 4.9
Chicago, IL jobs
Position: QA/QC Commissioning Associate III Location: Remote Job Id: 825 # of Openings: 1 TITLE: QA/QC Commissioning Associate III LOCATION: Remote - preferably someone that lives within 90 miles of Chicago, IL POSITION SUMMMARY: The QA/QC Commissioning Associate III assists in quality control and quality assurance of data center critical systems preparing for the commissioning process. The QA/QC Commissioning Associate assists the QA/QC Engineer to ensure that the correct equipment has been purchased and that installation is in accordance with industry standards and equipment specifications. This role will develop skills and industry knowledge to perform increasingly more complex commissioning tasks.
ESSENTIAL DUTIES AND RESPONSIBILITIES: To perform this job successfully, an individual must be able to perform the following satisfactorily; other duties may be assigned. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Develop QA/QC documents of the complete project including certificates, calibration, test results, inspection requests, non-compliance reports and site instruction/observations, permanent materials delivered, and other important QA/QC documents
Follow all standards to perform inspection and tests on all procedures and oversee all testing methods and maintain high standards of quality for all processes
Review the quality of all materials at the site and ensure compliance with all project specifications and quality and collaborate with the department for all material procurement and maintain a quality of materials
Support the effective implementation of all test and inspection schedules and ensure adherence to all procedures and coordinate with various teams to perform quality audits on processes
Assist employees to ensure knowledge of all quality standards and ensure compliance to all quality manuals and procedures and collaborate with contractors and suppliers to maintain the quality of all systems
Manage to lift all types of equipment and handle the efficient storage of all hazardous materials and perform quality audits as per the required schedule
Understand all products and non-conformance processes and evaluate all documents to ensure the maintenance of optimal quality and prepare monthly reports to evaluate performance
Monitor an efficient system and record for all project activities and analyze all processes to ensure all work according to quality requirements
Understand all work methods and maintain knowledge on all quality assurance standards and monitor continuous application for all quality assurance processes and recommend corrective actions for all processes
Support and follow a method statement for the activity including risk assessment and job safety environmental analysis and Inspection Test Plan and Checklist based on specifications of the project
Liaise the Technical Engineer for submission of material submittals to Consultant
Develop and maintain inspection reports
Ensure compliance to federal and state laws, as well as company standards and specifications
Maintain calibration of quality testing equipment
Perform inspections across all stages of production
Advising on procedures to improve production efficiency
Prepare and maintain test data for review
Evaluate data and draft reports, noting any relevant deviations from existing standards
Identify areas for quality control improvement and implement new methods accordingly
Communicate quality or compliance concerns with urgency
QUALIFICATIONS: To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Education/Experience (Desired):
Technical Military MOS, trade school and/or degree
Experience and/or education and internship in complex facilities or mission critical projects is preferred
Any civilian or military technical certifications is a plus
Experience with writing and enforcing standard operating procedures (SOPs)
Solid understanding of test equipment & software
Minimum of 5-9 years of inspection and/or production experience
Strong working knowledge of various mathematical concepts including fractions, ratios, and proportions
Demonstrated ability to work independently with minimal supervision
Excellent organizational skills
Demonstrated ability to analyze and interpret information
Must be a US citizen
Must be able to travel 70%
Must live reasonably close to Chicago. Illinois
Computer Skills:
Advanced Excel skills preferred
Experience using Microsoft Office Suite, Word and Microsoft Project
Basic knowledge of systems design for various projects
Certificates and Licenses:
No certificates or licenses required
Supervisory Responsibilities:
No supervisory responsibilities for this position.
Physical Demands: The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Occasionally lift and or move objects 10 to 50; Frequently required to stand, walk, stoop, kneel, crouch or crawl. The employee is occasionally required to sit and climb or balance. Specific vision abilities for this job include close vision, distance vision, color vision, peripheral vision, depth perception and the ability to adjust and focus. Noise Level can be moderate to high.
The above job description is not intended to be an all-inclusive list of duties and standards of the position. Incumbents will follow any other instructions, and perform any other related duties, as assigned by their supervisor.
CPG is an equal opportunity employer. We will consider all employment applicants without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.
CPG Participates in E-Verify
#:LI-TG1
Pay Range: $72,671 - $108,954 per year Apply for this Position
$72.7k-109k yearly 5d ago
Principal Software Engineer, Managed AI
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
As a Principal Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling mechanisms optimized for cost and performance. This role gives you the opportunity to build and scale infrastructure capable of handling millions of API requests per second across thousands of customers.
From day one, you'll own critical subsystems for managed AI inference, helping to serve large language models (LLMs) to a global audience. As part of a dynamic, fast-growing team, you'll collaborate cross-functionally, influence the long-term vision of the platform, and contribute to cutting-edge AI technologies. This is a unique opportunity to build a high-performance AI product that will be central to Crusoe's business growth.
What You'll Be Working On:
Design and Development:
Lead the design and implementation of core AI services, including:
Resilient fault-tolerant queues for efficient task distribution.
Model catalogs for managing and versioning AI models.
Scheduling mechanisms optimized for cost and performance.
High-performance APIs for serving AI models to customers.
Scalability and Performance:
Build and scale infrastructure to handle millions of API requests per second.
Optimize AI inference performance on GPU-based systems.
Implement robust monitoring and alerting to ensure system health and availability.
Collaboration and Innovation:
Collaborate closely with product management, business strategy, and other engineering teams.
Influence the long-term vision and architectural decisions of the AI platform.
Contribute to open-source AI frameworks and participate in the AI community.
Prototype and iterate on new features and technologies.
What You'll Bring to the Team:
Strong Engineering Fundamentals:
Advanced degree in Computer Science, Engineering, or a related field.
Demonstrable experience in distributed systems design and implementation.
Proven track record of delivering early-stage projects under tight deadlines.
Expertise in using cloud-based services, such as, elastic compute, object storage, virtual private networks, managed database, etc.
AI/ML Expertise:
Experience in Generative AI (Large Language Models, Multimodal).
Familiarity with AI infrastructure, including training, inference, and ETL pipelines.
Software Engineering Skills:
Experience with container runtimes (e.g., Kubernetes) and microservices architectures.
Experience using REST APIs and common communication protocols, such as gRPC.
Demonstrated experience in the software development cycle and familiarity with CI/CD tools.
Preferred Qualifications:
Proficiency in Golang or Python for large-scale, production-level services.
Contributions to open-source AI projects such as VLLM or similar frameworks.
Performance optimizations on GPU systems and inference frameworks.
Personal Attributes:
Proactive and collaborative approach with the ability to work autonomously.
Strong communication and interpersonal skills.
Passion for building cutting-edge AI products and solving challenging technical problems.
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid Commuter FSA benefit of $200 per month
Compensation:
Compensation will be paid in the range of $256,000 - $320,000 a year + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$256k-320k yearly 4d ago
Senior Software Engineer, Developer Experience
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
About the Role
Crusoe Developer Experience is a new team charged with fueling the future of Crusoe's developer muscle. We set the tone for how engineering is done, ensuring that as Crusoe grows, their developer systems and code delivery capacity grow as well. We have a very broad charter that spans the software life cycle from design through coding, testing, and deployment. Our mission is to empower Crusoe engineers to deliver positive change to customers quickly, safely and efficiently. As an early member of the Developer Experience team, you will play a key part in unlocking the potential of Crusoe's coding ecosystem. At Crusoe, you will be challenged with deep problems such as kernel upgrades as well as broad problems such as establishing first principles around unit testing, code reviews and rollouts.
As a member of this team, you will take a key role in contributing to technical direction, communicating a strategic vision, and establishing a company culture of engineering excellence. You will work alongside stakeholders and customers to ensure that our work is aligned with long term company objectives - informing those objectives along the way. You'll also lead efforts to define the future state of Crusoe's evolving and rapidly growing software development efforts.
A Day In The Life
Partner with the broader engineering organization to build and define standard practices for how services are operated and observed throughout their lifecycle.
Establish an opinionated, flexible and cost-effective toolchain for delivering customer value at scale.
Fully integrate Crusoe's developer systems to keep our developers in the flow state, ruthlessly eliminating toil that slows us down.
Immerse yourself in the developer experience so that you can identify and eliminate pain points.
Create libraries, tools, and pre-production environments for vetting service APIs and interactions between microservices.
Unify both internal tooling and vendor services to automate, build efficiency, and optimize security.
Innovate across the development lifecycle from source code, editors, build, CI, CD, platform runtime environments, telemetry, optimizations, monitoring and alerting.
Establish a culture of continuous quality delivery that scales as Crusoe scales.
Work diligently to build quality, efficient systems and processes to increase the impact of engineers around you.
You Will Thrive In This Role If You Have
You have expertise in understanding technical decisions, evaluating tradeoffs, and how these decisions impact individuals who will use what you build to optimize their work.
Are knowledgeable on how to leverage Gitlab to work with multiple repositories or Github.
You have fluent knowledge of industry-standard build tooling, containerization, and open source development tools, libraries, and frameworks.
Have experience working with modern build systems like Buck, Bazel, or others.
Have professional experience working with Kubernetes clusters or Kubernetes in general.
You have an understanding of testing infrastructure.
Experience with DevOps, Site Reliability, Release Engineering, or similar.
Bonus points if you have hands on working experience with Linux image construction - not just kernel but package building.
You have experience in relevant and modern open source programming languages (we use Golang).
You have empathy for building developer and operator workflows and productivity.
You have a passion for staying current on recent industry practices, open source advancements and an efficient developer community.
You like solving complex problems and then automating the solutions.
You enjoy working side-by-side with other top-notch engineers.
You hold either a BS or MS Degree in an Engineering or Analytical field (e.g., Computer Science, Engineering, Mathematics, Statistics, Operations Research, Management Science) or equivalent experience.
Benefits
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation Range
Compensation will be paid in the range of $166,000 - $200,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$142k-187k yearly est. 5d ago
Principal Software Engineer, SDN Networking
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
We are seeking a Principal Software Engineer - Software Defined Networking lead the development and execution of our Software Defined Networking strategy. This role will be instrumental in driving innovation and performance improvements within our network infrastructure by leveraging cutting‑edge technologies like XDP/EBPF, DPDK, SmartNICs, and DPU/IPUs.
What You'll Be Working On:
Develop and execute the roadmap for the Software Defined Networking strategy at Crusoe Cloud.
Guide the engineering team through architecture decisions, design processes, design reviews, code reviews, and implementation tasks.
Collaborate with the network infrastructure organization to develop industry‑leading networking infrastructure.
Lead Linux Kernel and driver development, system architecture design, production support, and cross‑functional collaboration.
What You'll Bring to the Team:
10+ years of related experience building and operating at scale in a production environment.
Proven experience in system programming with C, C++ and/or Rust.
Extensive knowledge of Linux Systems Internals and computer architecture.
Expertise in Network Programming and Packet Processing pipelines.
Hands‑on experience with kernel bypass technologies such as XDP/EBPF, AF_XDP, and DPDK.
In‑depth understanding of TCP/IP and network accelerators like Mellanox/Nvidia SmartNIC (ConnectX6/7), DPU Bluefield3, and Intel IPU.
Familiarity with SR‑IOV, vDPA, and scalable functions.
Strong background in kernel or embedded development, particularly with the Linux kernel.
Experience with Open vSwitch, Openflow, and Open Virtual Networking.
Knowledge of professional software engineering practices and best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Demonstrated track record of contributions to the open source community (e.g., Open vSwitch/OVS, Open Virtual Networking/OVN, Multus, Cilium).
Bonus Points:
Advanced degree in Computer Science, Engineering, or a related field.
Proven leadership experience in a technical role.
Experience with cloud networking platforms (AWS, Azure, GCP) and virtualization technologies (VMware, KVM).
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well‑funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short‑term and long‑term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation:
Compensation will be paid in the range of $238,000 - $298,000 a year + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About the Role:
We are seeking Staff/Senior Staff Software Engineers to architect, design, and develop Cloud Infrastructure management systems and platforms; and deliver E2E use cases and workflows for a vertically integrated AI-First Crusoe Cloud. You will play a crucial role in building systems and platforms to efficiently plan, monitor, deploy and operate Crusoe Cloud and deliver on key business revenue metrics.
You'll be instrumental in evaluating and hands‑on implementing and building platforms, tools, and frameworks, focusing on reliability, scalability, operational efficiency, and ease of use. Your expertise will be vital in streamlining our infrastructure planning and management processes and workflows, driving efficiency, and enhancing our cloud platform's overall performance and reliability as we dramatically scale out our Crusoe Cloud products and services by 10X+.
As a part of your responsibilities, you'll develop and refine technical designs and architecture, mentor fellow engineers, and actively contribute to team growth in collaboration with engineering managers.
What You'll Be Working On:
Collaborate extensively across teams to architect, design, implement physical infrastructure management software systems, availability platforms, and frameworks to meet E2E use cases of our customers we host on our AI Infrastructure and provide best customer experience.
Champion the reliability, scalability, and security of our systems and platforms - you'll be the guardian of our infrastructure!
Develop workflows to drive efficiency and meet key business objectives and metrics.
Design and implement high‑performing, highly available cloud architectures optimizing for both performance and cost‑effectiveness.
Streamline cloud deployment, configuration management, and operations by developing and maintaining effective platforms, interfaces, and automation tooling.
Actively contribute to the evolution of our platform, collaborating closely with cross‑functional development teams to ensure smooth integration and deployment.
What You'll Bring to the Team:
A Bachelor's degree in Computer Science or Software Engineering, and 10+ years of relevant experience.
10+ years of experience building and operating distributed systems at scale.
Proven experience with building reliable, scalable, efficient, and secure cloud platforms and systems and effectively running them in production environments.
Fluency in programming languages such as Go, Rust, Java or C++.
A collaborative approach (platform mindset) to working with development and operations teams to build and maintain a robust platform and effectively drive adoption.
Understanding of cloud security best practices and the ability to implement secure configurations.
Excellent troubleshooting and problem‑solving skills to tackle complex infrastructure issues.
Excellent communication skills
Embody the Company values.
Bonus Points:
Hands‑on experience deploying, managing, and troubleshooting Kubernetes clusters.
Experience working in a fast‑paced, startup environment
A passion for building an energy‑first scalable AI Infrastructure.
A passion for sustainability and innovation - Crusoe Energy is revolutionizing clean energy production, and we're looking for individuals who share our enthusiasm!
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well‑funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short‑term and long‑term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation:
Compensation will be paid in the range of $209,000 - $253,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$209k-253k yearly 2d ago
Staff Software Engineer, Slurm
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About the Role:
We are actively seeking an exceptional Staff Software Engineer to join our cloud software team, focusing specifically on building and operating Slurm as a fully managed cloud service within Crusoe Cloud. This role is crucial for delivering next-generation orchestration capabilities to power GPU-accelerated and high-performance computing (HPC) at scale.
Your expertise will be instrumental in designing and scaling our carbon-reducing operating model, and advancing our AI training clusters to lead the industry in reliability and performance. You will shape the technical direction of systems that allow customers to run advanced workloads across CPUs, NVIDIA and AMD GPUs, and high-performance networking environments.
You will be involved in writing and reviewing code, contributing to proposals, and drafting architecture documents. You will evaluate tools and frameworks, considering their impact on reliability, scalability, operational costs, and ease of adoption.
What You'll Be Working On:
Lead the development and engineering of our managed Slurm offering, providing a seamless experience for AI/ML and HPC customers who rely on robust Slurm job scheduling.
Contribute to the development of scalable and robust software solutions, closely aligning with the strategic objectives outlined in the Crusoe Cloud roadmap.
Design, build, and maintain Kubernetes operators and controllers dedicated to managing the lifecycle, configuration, and state of large-scale Slurm clusters.
Drive the integration of GPU acceleration in the Slurm environment, including device plugin architecture, GPU operators, accelerator-aware scheduling, and resource allocation.
Ensure that high-performance networking technologies, such as InfiniBand and RoCE, are correctly leveraged for distributed GPU workloads running through Slurm.
Implement and manage features such as multi-tenancy, cluster lifecycle management, auto-scaling, and high availability for the managed Slurm control plane services.
Develop scalable systems to compete with leading managed services.
Support the development of your peers by sharing knowledge and providing guidance in technical discussions.
What You'll Bring to the Team:
You have 7+ years of experience working in software engineering, with strong experience in Systems Engineering. Experience in distributed systems, cloud, or HPC environments is a must
You possess 2+ years of programming experience in GoLang. Strong proficiency in other systems languages (Rust, C++, Python for HPC tooling) is also beneficial.
You have extensive experience with Kubernetes and Linux Engineering and debugging.
You possess deep knowledge of Slurm (Simple Linux Utility for Resource Management) administration and the architecture required for managing compute jobs in high-performance environments.
You are skilled in infrastructure as code and familiar with systems-level challenges, ideally with experience utilizing Terraform.
You understand Argo, CI/CD, and Automated Testing pipelines. You can design system architecture, taking ownership of system architecture, including CI/CD pipelines, while ensuring adherence to security standards.
Strong knowledge of container networking (CNI plugins, service meshes) and Linux networking fundamentals.
Familiarity with GPU integration in Kubernetes, including device plugins and GPU operators.
You have excellent communication skills, both verbal and written.
Compensation Range
Compensation will be paid in the range of $185,000 - $224,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$185k-224k yearly 4d ago
Staff Software Engineer, SDN Networking
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Cruose's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
Crusoe Cloud seeks a highly skilled and experienced Staff Software Engineer to lead the development and execution of our cutting-edge Software Defined Networking strategy. You will play a pivotal role in driving innovation and performance improvements within our network infrastructure by leveraging advanced technologies such as XDP/EBPF, DPDK, SmartNICs, and DPUs/IPUs.
What You'll Be Working On:
Develop and execute the roadmap for Crusoe Cloud's Software Defined Networking strategy.
Lead the engineering team through all phases of the software development lifecycle, including architecture decisions, design processes, design reviews, code reviews, and implementation tasks.
Collaborate closely with the network infrastructure organization to develop and deploy industry-leading networking solutions.
Lead the design, development, and support of Linux Kernel and driver components, focusing on system architecture and optimization.
Drive the adoption and integration of kernel bypass technologies such as XDP/EBPF, AF_XDP, and DPDK.
Deeply understand and leverage network accelerators such as Mellanox/Nvidia SmartNICs (ConnectX6/7), DPU Bluefield3, and Intel IPU.
Collaborate with cross-functional teams across the organization to ensure successful project delivery and operational excellence.
What You'll Bring to the Team:
6+ years of proven experience in building and operating high-performance networking systems in a production environment.
Strong proficiency in system programming languages such as C, C++, and/or Rust.
Deep expertise in Linux Systems Internals, including kernel architecture, memory management, and device drivers.
In-depth knowledge of network programming principles and packet processing pipelines.
Hands-on experience with kernel bypass technologies like XDP/EBPF, AF_XDP, and DPDK.
Proven understanding of TCP/IP and networking accelerators such as Mellanox/Nvidia SmartNICs, DPU Bluefield3, and Intel IPU.
Familiarity with technologies like SR-IOV, vDPA, and scalable functions.
Strong background in kernel or embedded development, with a focus on the Linux kernel.
Experience with Open vSwitch, Openflow, and Open Virtual Networking technologies.
Proven ability to effectively communicate and collaborate with both technical and non-technical stakeholders.
Demonstrated commitment to professional software engineering best practices, including coding standards, code reviews, source control management, testing, and operations.
A strong track record of contributions to the open-source community (e.g., Open vSwitch/OVS, Open Virtual Networking/OVN, Multus, Cilium).
Bonus Points:
Advanced degree in Computer Science, Engineering, or a related field.
Proven leadership experience in a technical role.
Strong analytical and problem-solving skills.
Experience with cloud networking platforms (AWS, Azure, GCP) and virtualization technologies (VMware, KVM).
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid Commuter FSA benefit of $200 per month
Compensation:
Compensation will be paid in the range of $185,000 - $224,000 per year + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined based on the applicant's knowledge, education, experience, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
The Crusoe Cloud Software Development team is seeking a passionate and experienced Senior Staff Software Engineer specializing in Hypervisor Virtualization and Virtualization Research. This pivotal role is critical in the design, development, and optimization of our virtualization technologies, specifically tailored for an all‑AI cloud infrastructure. A deep understanding of hypervisor internals, CPU and memory virtualization, I/O virtualization, and performance optimization is essential for developing reliable, high‑performance, and secure virtualized environments that power our cutting‑edge AI products. This is a full‑time position.
What You'll Be Working On:
Hypervisor Development & Optimization: Design, develop, and optimize core hypervisor components (e.g., KVM, QEMU, or custom solutions) to achieve maximum performance and efficiency for AI workloads. This includes focusing on CPU, memory, and I/O virtualization techniques.
Virtualization Research & Innovation: Conduct in‑depth research into advanced virtualization technologies, exploring novel approaches for isolating and accelerating AI compute, storage, and networking resources. Identify and prototype new virtualization features and enhancements to improve density, throughput, and latency.
Virtual Hardware & Device Emulation: Develop and enhance virtual hardware components and device emulation, ensuring optimal performance and compatibility for specialized AI accelerators (e.g., GPUs, DPUs) within the virtualized environment.
Performance Analysis & Tuning: Analyze and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a specific focus on optimizing for AI/ML workloads. This includes profiling, bottleneck identification, and implementing low‑level optimizations.
System‑Level Troubleshooting: Diagnose and resolve complex system issues within the virtualization layer. Work closely with hardware and guest OS teams to debug and resolve integration challenges.
Code Review and Quality Assurance: Conduct thorough code reviews to ensure the highest level of software quality, reliability, and security within the hypervisor and virtualization components.
Cross‑Functional Collaboration: Collaborate with other engineering teams, including hardware design, OS development, and AI/ML application teams to ensure cohesive and integrated product development.
Technical Leadership: Provide technical guidance and mentorship to junior engineers, fostering a culture of technical excellence and collaborative problem‑solving within the virtualization team.
What You'll Bring to the Team:
Hypervisor Expertise: Proven deep knowledge of hypervisor internals (e.g., KVM, QEMU, Xen, or other bare‑metal hypervisors), including CPU virtualization (VT‑x/AMD‑V), memory virtualization (EPT/NPT, MMU), and I/O virtualization (SR‑IOV, virtio).
Virtualization Concepts: Strong understanding of virtual machine lifecycle, live migration, snapshotting, and fault tolerance mechanisms.
Linux Kernel Familiarity: Experience with Linux kernel internals as they pertain to virtualization, including device drivers, memory management, and scheduling within a virtualized context.
Hardware Understanding: Familiarity with hardware architectures relevant to virtualization, including CPUs (x86, ARM), GPUs, and Smart NICs/DPUs. Experience with hardware offloads and acceleration for virtualization.
Performance Optimization: Demonstrated ability to identify and resolve performance bottlenecks in complex virtualized systems. Experience with profiling tools and techniques.
Debugging & Troubleshooting: Strong debugging skills in complex, distributed systems at the hypervisor and kernel levels.
Bonus Points:
Experience with virtualization specifically for AI/ML workloads, including GPU virtualization or direct pass‑through.
Familiarity with container runtimes and their interaction with hypervisors.
Contributions to open‑source virtualization projects.
Experience with security hardening of hypervisors and virtual machines.
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well‑funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short‑term and long‑term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation:
Compensation will be paid in the range of $204,000 - $247,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$146k-203k yearly est. 5d ago
Senior Staff Software Engineer, Storage
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
As a Senior Staff Software Engineer on the Cloud Storage team, you will lead the development and execution of our storage strategy. You will work extensively with cloud storage primitives, utilizing advanced storage engineering concepts to drive innovation and performance improvements.
What You'll Be Working On:
Lead Storage Strategy Development and Execution: Define and execute the roadmap for the Crusoe Cloud storage strategy, aligning with overall business objectives.
Lead Engineering Team: Serve as the engineering lead for the Cloud Storage team, collaborating with technology and engineering leadership to define and implement long-term strategic goals.
Guide Engineering Practices: Provide technical leadership and guidance to the engineering team throughout the entire software development lifecycle, including architecture decisions, design reviews, code reviews, implementation tasks, and production support.
Develop and Optimize Storage Infrastructure: Collaborate closely with the infrastructure organization to design, develop, and optimize industry-leading storage infrastructure solutions.
Lead File System Development: Lead the development and maintenance of high-performance and reliable file systems, ensuring optimal performance and data integrity.
Storage Architecture Design: Design and implement robust and scalable storage architectures, considering factors such as performance, reliability, availability, and cost-effectiveness.
Cross-functional Collaboration: Foster strong collaboration with other teams across the organization, including infrastructure, software engineering, and product development.
What You'll Bring to the Team:
System Programming Expertise: Proven experience in system programming with languages such as C, C++, and/or Rust.
Linux Systems Knowledge: Extensive knowledge of Linux Systems Internals and computer architecture.
Cloud Storage Design & Development: Ability to design, develop, and deploy highly scalable and distributed cloud storage solutions.
Storage Engineering Fundamentals: Strong understanding of storage engineering concepts, including data protection mechanisms (e.g., redundancy, replication, encryption), fault tolerance, and storage technologies (e.g., NVMe, SSDs).
Storage Technologies: In-depth understanding of at least one of the following: block storage, object storage, and/or file storage.
Storage Protocols: Familiarity with industry-standard storage protocols such as NFS, SMB, iSCSI, and NVMe‑oF.
Software Engineering Best Practices: Expertise in professional software engineering practices, including coding standards, code reviews, source control management, build processes, testing, and operations.
Open Source Contributions: Demonstrated track record of contributions to the open source community (e.g., Ceph, GlusterFS, OpenEBS).
Communication & Collaboration: Excellent communication and collaboration skills, with the ability to effectively communicate technical concepts to both technical and non-technical audiences.
Bonus Points:
Networking Fundamentals: Strong understanding of physical and software-defined networking concepts.
Kernel/Embedded Development: Background in kernel or embedded development, particularly with the Linux kernel.
Kubernetes & Cloud-Native Storage: Experience with Kubernetes CSI and cloud-native storage solutions.
Infrastructure as Code: Exposure to Infrastructure as Code tooling (e.g., Ansible, Chef, Puppet, Terraform).
Programming Languages: Programming experience in Java or Go.
Leadership Experience: Proven leadership experience in a technical role.
Analytical & Problem-Solving Skills: Strong analytical and problem-solving skills.
Advanced Degree: Advanced degree in Computer Science, Engineering, or a related field.
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation Range:
Compensation will be paid in the range of $245,000 - $290,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$146k-203k yearly est. 5d ago
Senior Software Engineer, Managed Cloud Services
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
A leading technology firm in San Francisco is seeking a Software Engineer to design, build, and scale customer-facing platforms. Responsibilities include creating scalable infrastructure, collaborating with teams, and driving innovation. Ideal candidates will have cloud expertise and experience in modern programming languages. The position offers competitive salary and benefits including stock options and health insurance.
#J-18808-Ljbffr
$131k-185k yearly est. 1d ago
Senior Software Engineer - Managed Kubernetes & Cloud Infra
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
A leading technology firm in California is seeking a Senior Software Engineer to drive innovation in cloud software solutions. This role involves designing scalable systems, collaborating on development, and managing Kubernetes environments. Ideal candidates have 5-7 years in software engineering, strong GoLang skills, and a solid understanding of cloud technologies, including Kubernetes and Terraform. The company offers competitive pay, RSUs, and comprehensive benefits including health insurance and generous paid time off.
#J-18808-Ljbffr
$131k-185k yearly est. 5d ago
Senior+ Software Engineer, Storage
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role
The Cloud Storage team at Crusoe seeks a Staff Software Engineer to lead the development and execution of our storage strategy. This role will be instrumental in driving innovation and performance improvements within our cloud storage infrastructure. You will work extensively with cloud storage primitives, utilizing advanced storage engineering concepts to build and operate high-performance, scalable, and reliable storage solutions.
What You'll Be Working On
Lead Engineering Efforts: Lead engineering efforts on cloud storage features by collaborating with product and engineering to define and execute features on the roadmap.
Software Development: Write and review code, generate and review design documentation. Participate in qualifications and rollouts of software across the stack journeying from bare metal to user-facing APIs.
Technical Leadership: Guide the engineering team through architecture decisions, design processes, design reviews, code reviews, and implementation tasks.
Team Mentorship: Mentor and grow engineers on your team, fostering an environment of teamwork and continuous learning.
Cross-Team Collaboration: Champion and lead initiatives across the engineering organization such as tech talks, open source development, and book clubs.
Performance Optimization: Benchmark, analyze, and improve scale, performance, and resiliency issues.
What You'll Bring to the Team
Cloud Storage Expertise: Hands-on experience building and operating large scale, complex distributed cloud computing infrastructure products. Preferably, experience building redundant and fault tolerant storage solutions with backups, replication, encryption, and data protection mechanisms.
Software Engineering Fundamentals: Knowledge of professional software engineering practices and best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Technical Proficiency: Strong experience with at least one application programming language like Java or Go. Exposure to Infrastructure as Code tooling with any of Ansible, Chef, Puppet, and/or Terraform. Knowledge of Linux Systems Internals and computer architecture.
Communication & Collaboration: Strong communication and collaboration skills.
Safety and Compliance: Must be able to pass a background check.
Bonus Points
Storage Technologies: Hands‑on experience with storage technologies such as NVMe, SSDs, and distributed storage systems.
Storage Protocols: In-depth understanding in at least one of block storage, object storage, and/or file storage. Familiarity with storage protocols like NFS, SMB, iSCSI, and NVMe-oF.
Open Source Contributions: Demonstrated track record of contributions to the open source community (e.g., Ceph, GlusterFS, OpenEBS).
System Programming: Proven experience in system programming with C, C++, and/or Rust.
Networking: An understanding of physical and software‑defined networking concepts.
Education: Advanced degree in Computer Science, Engineering, or a related field.
Benefits
Industry competitive pay
Restricted Stock Units in a fast growing, well‑funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short‑term and long‑term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation Range
Compensation Range:
Compensation will be paid in the range of $155‑250k a year + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About the Role:
We are actively seeking an exceptional Senior Software Engineer for our cloud software team who will contribute to the operations of our cutting‑edge infrastructure. Your expertise will be instrumental in designing and scaling our carbon‑reducing operating model, as well as managing critical hardware, software, and network components.
In this role, you will be involved in writing and reviewing code, contributing to proposals and architecture documents. You will evaluate tools and frameworks, carefully considering their impact on reliability, scalability, operational costs, and ease of adoption. Your expertise in orchestration and optimization will be instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they lead the industry in reliability and performance.
What You'll Be Working On:
Contribute to the development of scalable and robust software solutions, closely aligning with the strategic objectives outlined in the Crusoe Cloud roadmap
Work collaboratively with tech leads and engineers to create a dynamic environment where creativity and technical excellence are encouraged, leading to the development of cutting‑edge cloud solutions
Continuously stay abreast of the latest trends and techniques in cloud software, incorporating these insights to keep Crusoe's offerings innovative
While you won't have formal management responsibilities, you will support the development of your peers by sharing knowledge and providing guidance in technical discussions
What You'll Bring to the Team:
You have 5-7 years of experience working in software engineering, with strong experience in Systems Engineering
You possess 2+ years of programming experience in GoLang
You have experience with Kubernetes and Linux Engineering and debugging
You are skilled in infrastructure as code and familiar with systems‑level challenges
You have experience with Terraform and GCP (preferred)
You understand Argo, CI/CD, and Automated Testing pipelines
You can build and manage Kubernetes operators and controllers, developing and maintaining essential components that ensure the reliability and efficiency of the Kubernetes environment
You can develop scalable systems to compete with leading services like Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS)
You can oversee critical projects with broad impact, leading initiatives focused on networking, quality control, and automation to ensure optimal performance and reliability
You can design system architecture, taking ownership of system architecture, including CI/CD pipelines, while ensuring adherence to security standards
You have excellent communication skills, both verbal and written
Benefits:
Industry competitive pay
Restricted Stock Units in a fast growing, well‑funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short‑term and long‑term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation Range:
Compensation will be paid in the range of $180,000 - $210,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
About the Role
We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe's next-generation observability stack, enabling engineers to understand the internal state of distributed systems through metrics, logs, and traces. Your work will ensure reliability, performance, and actionable insights across Crusoe's global infrastructure and cloud platform.
What You'll Be Working On
Designing and operating scalable observability systems (metrics, logging, tracing) across multi-datacenter Kubernetes environments
Architecting end-to-end telemetry pipelines, including ingestion, storage, querying, and visualization
Extending monitoring and alerting with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry
Building scalable log collection and processing pipelines with Fluent Bit, Vector, Loki, or ELK/Opensearch stacks
Implementing distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrating with service meshes, load balancers, and APIs
Defining and driving adoption of SLOs, SLIs, and error budgets across services and teams
Automating provisioning and scaling of observability infrastructure with Kubernetes, Terraform, and custom tooling (Go, Python)
Ensuring reliability and cost efficiency of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure)
Embedding security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls
Partnering with engineering teams to embed observability into applications, services, and infrastructure
Mentoring engineers and shaping Crusoe's observability strategy and technical roadmap
What You'll Bring to the Team
7+ years of experience in infrastructure or platform engineering, with a focus on observability and monitoring systems
Deep expertise with metrics systems (Prometheus, Thanos, Mimir, Cortex), logging pipelines (Fluent Bit, Vector, Loki, ELK/Opensearch), and tracing platforms (Jaeger, Tempo, OpenTelemetry)
Strong programming skills in Go or Python for automation, operators, and custom integrations
Experience running observability platforms on Kubernetes and operating them at scale across multi-datacenter environments
Proven ability to design, optimize, and scale telemetry pipelines handling high cardinality and high throughput data
Solid understanding of distributed systems, performance engineering, and debugging complex workloads
Familiarity with service meshes, networking, and workload instrumentation (Envoy, Istio, OpenTelemetry SDKs)
Strong collaboration skills and the ability to influence engineering teams to adopt observability best practices
Bonus Points
Contributions to open source observability projects (Prometheus, OpenTelemetry, Grafana, Loki, etc.)
Experience supporting AI/ML or GPU-heavy environments with high observability demands
Knowledge of event-driven or streaming systems (Kafka, NATS, Pulsar) used in telemetry pipelines
Experience implementing cost optimization strategies for large-scale observability platforms
Background in incident response, chaos engineering, and reliability practices
Benefits
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation
Compensation will be paid in the range of $166,000 - $201,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$131k-185k yearly est. 2d ago
Senior Software Engineer, Managed Services
Crusoe Energy Systems LLC 4.1
San Francisco, CA jobs
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
About This Role:
We are seeking talented Software Engineers to design, build, and scale Crusoe Cloud's customer-facing platforms and managed services. Your work will focus on delivering a world-class customer experience, empowering users to unlock the full potential of Crusoe Cloud through innovative and reliable cloud products. This is a full-time position.
What You'll Be Working On:
Building Foundational Infrastructure: Build and scale core infrastructure services that manage critical resources within our cloud platform. This involves designing, developing, and deploying robust and reliable systems from the ground up.
Scalable Design: Design highly scalable, durable, and reliable platform services that prioritize ease of use.
Cross Functional Collaboration: Lead projects that require collaborating with engineering, cloud support, site reliability, and product teams to assess tools, frameworks, and solutions that align with both customer and operational needs.
Innovation: Implement features that differentiate Crusoe Cloud, focusing on operational efficiency, low-touch adoption, turn-key AI services, and scalability.
What You'll Bring to the Team:
Cloud Expertise: Proven ability to design and scale fault-tolerant distributed systems and develop managed cloud services.
Technical Proficiency: Strong fundamentals in microservices and infrastructure technologies like Docker, Kubernetes, Terraform, and CI/CD systems. Experience with observability principles and technologies, e.g., time-series databases, log aggregation, distributed tracing
Customer-Centric Mindset: A passion for creating intuitive, high-quality solutions that directly impact customer success and satisfaction.
Collaboration Skills: Ability to work with cross-functional teams to align priorities and deliver customer-first solutions.
Communication Skills: Exceptional ability to articulate complex ideas and align technical solutions with customer needs.
Team Leadership: Mentor engineers, enhance hiring practices, and contribute to building a strong, inclusive engineering culture.
Professional Experience: 3-5 years of software development experience, including programming with modern compiled languages such as Go, Rust, Java, or C++.
Benefits
Industry competitive pay
Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement
Subscription to the Calm app
MetLife Legal
Company paid commuter benefit; $300 per month
Compensation:
Compensation will be paid in the range of $112,000 - $161,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
#J-18808-Ljbffr
$112k-161k yearly 1d ago
QA/QC Commissioning Associate III
CPG 4.9
Houston, TX jobs
Position: QA/QC Commissioning Associate III Location: Houston, TX Job Id: 796 # of Openings: 1 TITLE: QA/QC Commissioning Associate III LOCATION: Houston, TX POSITION SUMMMARY: The QA/QC Commissioning Associate III assists in quality control and quality assurance of data center critical systems preparing for the commissioning process. The QA/QC Commissioning Associate assists the QA/QC Engineer to ensure that the correct equipment has been purchased and that installation is in accordance with industry standards and equipment specifications. This role will develop skills and industry knowledge to perform increasingly more complex commissioning tasks.
ESSENTIAL DUTIES AND RESPONSIBILITIES: To perform this job successfully, an individual must be able to perform the following satisfactorily; other duties may be assigned. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Develop QA/QC documents of the complete project including certificates, calibration, test results, inspection requests, non-compliance reports and site instruction/observations, permanent materials delivered, and other important QA/QC documents
Follow all standards to perform inspection and tests on all procedures and oversee all testing methods and maintain high standards of quality for all processes
Review the quality of all materials at the site and ensure compliance with all project specifications and quality and collaborate with the department for all material procurement and maintain a quality of materials
Support the effective implementation of all test and inspection schedules and ensure adherence to all procedures and coordinate with various teams to perform quality audits on processes
Assist employees to ensure knowledge of all quality standards and ensure compliance to all quality manuals and procedures and collaborate with contractors and suppliers to maintain the quality of all systems
Manage to lift all types of equipment and handle the efficient storage of all hazardous materials and perform quality audits as per the required schedule
Understand all products and non-conformance processes and evaluate all documents to ensure the maintenance of optimal quality and prepare monthly reports to evaluate performance
Monitor an efficient system and record for all project activities and analyze all processes to ensure all work according to quality requirements
Understand all work methods and maintain knowledge on all quality assurance standards and monitor continuous application for all quality assurance processes and recommend corrective actions for all processes
Support and follow a method statement for the activity including risk assessment and job safety environmental analysis and Inspection Test Plan and Checklist based on specifications of the project
Liaise the Technical Engineer for submission of material submittals to Consultant
Develop and maintain inspection reports
Ensure compliance to federal and state laws, as well as company standards and specifications
Maintain calibration of quality testing equipment
Perform inspections across all stages of production
Advising on procedures to improve production efficiency
Prepare and maintain test data for review
Evaluate data and draft reports, noting any relevant deviations from existing standards
Identify areas for quality control improvement and implement new methods accordingly
Communicate quality or compliance concerns with urgency
QUALIFICATIONS: To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Education/Experience (Desired):
Technical Military MOS, trade school and/or degree
Experience and/or education and internship in complex facilities or mission critical projects is preferred
Any civilian or military technical certifications is a plus
Experience with writing and enforcing standard operating procedures (SOPs)
Solid understanding of test equipment & software
Minimum of 5-9 years of inspection and/or production experience
Strong working knowledge of various mathematical concepts including fractions, ratios, and proportions
Demonstrated ability to work independently with minimal supervision
Excellent organizational skills
Demonstrated ability to analyze and interpret information
Must be a US citizen
Must be able to travel 70%
Computer Skills:
Advanced Excel skills preferred
Experience using Microsoft Office Suite, Word and Microsoft Project
Basic knowledge of systems design for various projects
Certificates and Licenses:
No certificates or licenses required
Supervisory Responsibilities:
No supervisory responsibilities for this position.
Physical Demands: The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Occasionally lift and or move objects 10 to 50; Frequently required to stand, walk, stoop, kneel, crouch or crawl. The employee is occasionally required to sit and climb or balance. Specific vision abilities for this job include close vision, distance vision, color vision, peripheral vision, depth perception and the ability to adjust and focus. Noise Level can be moderate to high.
The above job description is not intended to be an all-inclusive list of duties and standards of the position. Incumbents will follow any other instructions, and perform any other related duties, as assigned by their supervisor.
CPG is an equal opportunity employer. We will consider all employment applicants without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.
CPG Participates in E-Verify
#:LI-TG1
Pay Range: $74,851 - $112,222 per year Apply for this Position
$74.9k-112.2k yearly 6d ago
Software Engineer, Backend
Unit 4.8
San Francisco, CA jobs
Unit is an AI video company. We're building GTV, a consumer product that combines creativity and automation to deliver a next-generation video experience. Our team includes senior people from Instagram, TikTok, and NVIDIA. We're based in San Francisco and backed by Khosla Ventures, Air Street Capital, and other leaders in AI.
Role Overview
We're looking for an experienced Backend Software Engineer to join the GTV team. You'll work closely with researchers, engineers, and designers to build our core systems, including video generation workflows, reinforcement learning infrastructure, real-time data pipelines, and more. This role is for someone who loves building high-performance systems and wants to reimagine the future of media.
Responsibilities
Design and develop scalable backend systems that power the creative engine behind GTV
Partner with researchers, engineers, and designers to develop new product features and APIs
Build core infrastructure for custom models, agentic workflows, and reinforcement learning loops
Drive reliability and efficiency of distributed systems with automated testing, performance benchmarking, and end-to-end observability
Requirements
5+ years of software engineering experience with proficiency in Python
Industry experience with AWS (S3, Lambda, RDS, SQS, ECS), FastAPI, Docker, CI/CD pipelines, and other backend services
Deep understanding of generative models, distributed systems, information retrieval, large-scale data processing, and asynchronous workflows
Proven ability to debug complex systems, navigate ambiguity, and take ownership of challenging problems
Excellent communication and collaboration with research, engineering, and design teams
Nice To Have
Prior work experience at early-stage or high-growth startups
Familiarity with video generation, encoding, playback, and distribution
Background in generative AI and consumer-facing products
Compensation
$200K-$250K base salary
Competitive equity grant
Top-tier benefits (medical, dental, and vision)
Daily dinner stipend
Relocation support
#J-18808-Ljbffr
$200k-250k yearly 1d ago
Software Engineer, Full Stack
Unit 4.8
San Francisco, CA jobs
Unit is an AI video company. We're building GTV, a consumer product that combines creativity and automation to deliver a next-generation video experience. Our team includes senior people from Instagram, TikTok, and NVIDIA. We're based in San Francisco and backed by Khosla Ventures, Air Street Capital, and other leaders in AI.
Role Overview
We're looking for an experienced Full Stack Software Engineer to join the GTV team. You'll work closely with researchers, engineers, and designers to build our core products - taking ideas from prototype to launch. This role is for someone who loves building zero-to-one products and wants to reimagine the future of media.
Responsibilities
Design and develop full-stack products for GTV, including beautiful user interfaces, robust APIs, and scalable infrastructure
Partner with researchers, engineers, and designers to integrate generative models and unlock new consumer use-cases
Build high-quality product experiences with a focus on latency, usability, and visual craftsmanship
Work with customers and creatives to understand their needs and translate them into effective product solutions
Requirements
3+ years of software engineering experience with proficiency in frontend (NextJS, Tailwind) and backend (NodeJS, Python)
Industry experience with Vercel, AWS (S3, Lambda, RDS, SQS, ECS), FastAPI, Docker, CI/CD pipelines, and other cloud services
Deep understanding of UX design principles, real-time applications, API design patterns, generative models, and distributed systems
Proven ability to debug complex systems, navigate ambiguity, and take ownership of challenging problems
Excellent communication and collaboration with research, engineering, and design teams
Bonus
Prior work experience at early-stage or high-growth startups
Familiarity with video generation, encoding, playback, and distribution
Background in generative AI and consumer-facing products
Compensation
$175K-$250K base salary
Competitive equity grant
Top-tier benefits (medical, dental, and vision)
Daily dinner stipend
Relocation support
#J-18808-Ljbffr
$175k-250k yearly 3d ago
Software QA Automation Engineer III
Aerovironment 4.6
Melbourne, FL jobs
The Software QA Automation Engineer III designs automated solutions to perform applicable software validations. In this role, one regularly collaborates with our development team but also operates with a large degree of autonomy.
Position Responsibilities
Participates in the design, expansion and maintenance of automated testing suite
Defines and plan scope, resource needs, benchmarks and goals of manual & automation work
Defines and implement QA practices, procedures, standards and reporting
Identifies project risks, quantify risk/benefit relationships and provide alternative solutions as well as risk mitigation
Designs test plans, scenarios and cases to exercise new functionality & identify breaking issues
Analyzes, designs, programs, debugs, and modifies software enhancements and/or new products used in local, networked, cloud-based or Internet-related computer programs
Partners with resources as needed to validate software with project timeline
Mentors less experienced team members on QA/QC concepts, methodologies and best practices
Works on problems of diverse scope where analysis of data requires evaluation of identifiable factors
Other duties as assigned
Basic Qualifications (Required Skills & Experience)
Bachelor's degree in related discipline is required or equivalent combination of education, training, and experience
Minimum 5 - 8 years of relevant experience
Experienced in Software QA automation
Experienced in building and optimizing automation frameworks
Experienced performing code reviews and mentoring team members on automation concepts and best practices
Other Qualifications & Desired Competencies
Champions quality by forging influential relationships across QA, Development, Product, and DevOps
Demonstrates strong debugging / problem resolution skills, and competency in multitasking and handling multiple time critical issues / projects simultaneously
Demonstrates passion to continuously improve and execute tests for a faster and higher quality result
Is an experienced professional with a full understanding of area of specialization; resolves a wide range of issues in creative ways
Demonstrates good judgment in selecting methods and techniques for obtaining solutions
Able to excel in a fast-paced, deadline-driven environment, where small teams share a broad variety of duties
Displays strong initiative and drive to accomplish goals and meet company objectives
Takes ownership and responsibility for current and past work products
Is committed to learning from mistakes and driven to improve and enhance performance of oneself, others, and the company
Has effective interpersonal and communication skills
Focuses on teamwork, collaboration and puts the success of the team above one's own interests
Physical Demands
Ability to work in an office and R&D environment (Constant)
Required to sit and stand for long periods, talk, hear, and use hands and fingers to operate a computer and telephone keyboard
Occasionally may be required to travel within the Continental U.S. (20%)
The salary range for this role is:
$81,481 - $115,500
AeroVironment considers several factors when extending an offer, including but not limited to, the location, the role and associated responsibilities, a candidate's work experience, education/training, and key skills.
ITAR Requirement:
T
his position requires access to information that is subject to compliance with the International Traffic Arms Regulations (“ITAR”) and/or the Export Administration Regulations (“EAR”). In order to comply with the requirements of the ITAR and/or the EAR, applicants must qualify as a U.S. person under the ITAR and the EAR, or a person to be approved for an export license by the governing agency whose technology comes under its jurisdiction. Please understand that any job offer that requires approval of an export license will be conditional on AeroVironment's determination that it will be able to obtain an export license in a time frame consistent with AeroVironment's business requirements. A “U.S. person” according to the ITAR definition is a U.S. citizen, U.S. lawful permanent resident (green card holder), or protected individual such as a refugee or asylee. See 22 CFR § 120.15. Some positions will require current U.S. Citizenship due to contract requirements.
Benefits: AV offers an excellent benefits package including medical, dental vision, 401K with company matching, a 9/80 work schedule and a paid holiday shutdown. For more information about our company benefit offerings please visit: **********************************
We also encourage you to review our company website at ******************** to learn more about us.
Principals only need apply. NO agencies please.
Who We Are
Based in California, AeroVironment (AVAV) is a global leader in unmanned aircraft systems (UAS) and tactical missile systems. Founded in 1971 by celebrated physicist and engineer, Dr. Paul MacCready, we've been at the leading edge of technical innovation for more than 45 years. Be a part of the team that developed the world's most widely used military drones and created the first submarine-launched reconnaissance drone, and has seven innovative vehicles that are part of the Smithsonian Institution's permanent collection in Washington, DC.
Join us today in developing the next generation of small UAS and tactical missile systems that will deliver more actionable intelligence to our customers so they can proceed with certainty - and succeed.
What We Do
Building on a history of technological innovation, AeroVironment designs, develops, produces, and supports an advanced portfolio of unmanned aircraft systems (UAS) and tactical missile systems. Agencies of the U.S. Department of Defense and allied military services use the company's hand-launched UAS to provide situational awareness to tactical operating units through real-time, airborne reconnaissance, surveillance, and target acquisition.
We are proud to be an EEO/AA Equal Opportunity Employer, including disability/veterans. AeroVironment, Inc. is an Equal Employment Opportunity (EEO) employer and welcomes all qualified applicants. Qualified applicants will receive fair and impartial consideration without regard to race, sex, color, religion, national origin, age, disability, protected veteran status, genetic data, sexual orientation, gender identity or other legally protected status.
ITAR
U.S. Citizen, U.S. Permanent Resident (Green Card holder), asylee/refugee status as defined by 8 U.S.C. 1324b(a)(3) or a person approved for an export license from the appropriate governing agency.