A prominent insurance group is seeking a mid-level DevOps Engineer in San Francisco, California. This position involves managing cloud infrastructure and collaborating with technical teams to ensure optimal software delivery and system performance. Ideal candidates will have extensive experience in DevOps practices and tools, including Azure, Terraform, Docker, and Kubernetes. Competitive salary benefits are provided.
#J-18808-Ljbffr
$120k-150k yearly est. 2d ago
Looking for a job?
Let Zippia find it for you.
Senior Networking AI Engineer (Remote) - Design & Scale AI
Nvidia Corporation 4.9
Remote job
A leading technology company is seeking a Senior Software Engineer focused on Networking to provide expertise in AI networking systems. The ideal candidate will have experience with embedded systems and networking protocols, and will work closely with customers to develop solutions. This role offers a competitive salary range of $148,000-$235,750 (Level 3) and $184,000-$287,500 (Level 4) depending on experience.
#J-18808-Ljbffr
A tech company specializing in AI systems for business is seeking a Senior DevOps Engineer to enhance its cloud and deployment platform. You will have broad ownership across infrastructure, CI/CD, and Kubernetes, focusing on reliability, security, and developer productivity. Ideal candidates should have strong experience with AWS, Terraform, and Kubernetes, and a deep understanding of developer challenges. This role offers competitive compensation between $190K and $240K depending on experience, alongside a generous pre-IPO equity package and benefits including unlimited PTO.
#J-18808-Ljbffr
$190k-240k yearly 3d ago
Staff Infrastructure Engineer SF, NYC, or Remote (USA)
Hex 3.9
Remote job
Hex is changing the way people work with data. Our platform makes analytics workflows more powerful, collaborative, and shareable. Hex solves key pain points with today's data and analytics tooling, and is loved by thousands of users all over the world for the beautiful UI, new superpowers, and boundless flexibility.
We are a tight-knit crew of engineers, designers, and data aficionados. Our roadmap is full of big ideas and little details, and we would love your help bringing them to life.
Hex has raised over $100m from great VCs and angels, giving us many years of runway and the ability to pay competitive salaries, offer great benefits, and provide meaningful equity.
We're seeking an experienced infrastructure engineer to join us as a technical leader who will shape the future of our platform architecture! You'll work directly with our engineering leadership to drive infrastructure strategy, mentor our growing team, and build systems that scale with our ambitious growth plans. We recently raised a Series C and are experiencing rapid growth not just in the number of customers and users, but also in the kinds of data workflows we can support with our kernel compute backend.
This isn't a hands-off leadership role - you'll be deeply technical while providing strategic direction. We need someone who has strong opinions backed by experience and isn't afraid to make the hard decisions that come with rapid scaling.
What you will do Strategic Leadership
Define and execute our infrastructure roadmap across our multi-tenant and single-tenant stacks
Establish engineering standards, practices, and tooling across the infrastructure team
Collaborate with product and engineering teams to align infrastructure investments with business objectives
Lead deep database performance optimization and scaling strategies
Lead infrastructure cost optimization and capacity planning initiatives
Technical Ownership
Architect and implement scalable solutions on our AWS/Kubernetes/PostgreSQL/Redis stack
Design container orchestration strategies with Kubernetes patterns and resource optimization
Design and build robust CI/CD pipelines and deployment strategies
Drive reliability engineering practices including monitoring, alerting, and incident response
Evaluate and integrate new technologies that enhance our platform capabilities
Team Development
Mentor engineers and help grow their technical skills
Participate in hiring and building out the infrastructure team
Foster a culture of technical excellence and continuous learning
Lead technical design reviews and architecture discussions
About You Technical Expertise
7+ years of infrastructure engineering experience with 3+ years in technical leadership roles
Deep expertise with AWS services (EC2, RDS, EKS, networking, security)
Production experience with Kubernetes orchestration and container management
Experience with database performance engineering - query optimization, execution plan analysis, and datastore selection for different workload patterns
Proficiency with infrastructure as code (Terraform, CloudFormation, or similar)
Solid understanding of application deployment and scaling
Knowledge of security best practices and compliance frameworks
Leadership Qualities
Track record of leading technical initiatives in fast-growing companies
Strong opinions on engineering best practices with the flexibility to adapt
Excellent communication skills and ability to influence across organizations
Comfortable with ambiguity and rapid decision-making in a startup environment
Startup Experience
Understanding of the unique challenges of scaling infrastructure during hypergrowth
Ability to balance technical debt with feature velocity
Experience with resource constraints and scrappy problem-solving
Bonus Points
Advanced Kubernetes operators development and custom resource definitions
Background with observability tools (Datadog, New Relic, Prometheus/Grafana)
Contributions to open source infrastructure projects
Experience with multi-region deployments and disaster recovery planning
Our stack
Our product is a web-based notebook and app authoring platform. Our frontend is built with Typescript and React, using a combination of Apollo GraphQL and Redux for managing application state and data. On the backend, we also use Typescript to power an Express/Apollo GraphQL server that interacts with Postgres, Redis, and Kubernetes to manage our database and Python kernels. Our backend is tightly integrated with our infrastructure and CI/CD, where we use a combination of Terraform, Helm, and AWS to deploy and maintain our stack.
In addition to our unique culture, Hex proudly offers a competitive total rewards package, including but not limited to, market-benched salary & equity, comprehensive health benefits, and flexible paid time off.
The salary range for this role is: $215,000 - $270,000
The salary range shown may be a reflection of additional factors such as geographical location and skill ranges/levels we're open to. Placement in the salary range will be decided upon completion of the interview process, taking into account factors like leaving room for growth, internal fairness & parity, your demonstrated skills, and the depth of your experience. Our Recruiting team will be able to provide more details during the interview process.
#J-18808-Ljbffr
$215k-270k yearly 1d ago
Staff DevOps Engineer - Splunk (Remote)
Cisco Systems 4.8
Remote job
This role can be performed anywhere in the United States.
Role
Cisco's Web Engineering team is a dynamic, best-in-class group partnering with digital marketing functions to define, execute, and govern strategies that maximize value from Adobe Experience Manager (AEM) implementations. As a DevOps champion, you will maintain the web stack across all environments, collaborate with vendors to manage production, and focus on continuous integration and continuous deployment (CI/CD). You share responsibility for the quality and reliability of all deliverables-if you build it, you run it.
You will work closely with internal web engineering, IT, and Security Operations teams to build and sustain public-facing websites and applications. Leveraging AI-augmented observability and assurance tools, you will proactively monitor and optimize web infrastructure performance. Developing AI literacy and prompt engineering skills to effectively interact with AI assistants and tools integrated into the DevOps workflow is essential.
You are highly motivated, detail-oriented, reliable, and thrive in a fast-paced environment.
Requirements
Bachelor's or Master's degree in Computer Science or related field, or equivalent experience with a strong computer science foundation; 6+ years as a DevOps or Site Reliability Engineer.
Preferred Technical Skills
Designing and building systems for websites, applications, and web properties.
Hands-on experience with Content Management Systems such as Adobe Experience Manager (AEM) and WordPress.
Proficient with AWS cloud services and tools (EC2, ALB, NLB, VPC, EKS, ECS, Route53, S3, EBS, EFS) and related products like Packer, Ansible, Linux, Docker.
Familiarity with version control systems (GitLab/GitHub, Bitbucket, SVN) and build tools.
Experience with web servers such as Apache and Nginx.
Knowledge of networking concepts including TCP/IP, HTTP/HTTPS, firewalls, and iptables within AWS and Linux environments.
Configuration management expertise, preferably with Ansible (also Chef, Puppet).
Strong understanding and practical experience with CI/CD pipelines and tools such as Jenkins, Selenium, Serenity, Cucumber, Sonar, and Gherkin.
Familiarity with Content Delivery Networks (CDNs) like Akamai or Amazon CloudFront.
Security and compliance knowledge, including IAM and cloud auditing/monitoring tools.
AI and Observability
Experience with AI-powered observability platforms such as Splunk Observability Cloud and ThousandEyes, including AI/ML-driven event correlation and root cause analysis.
Basic understanding of AI and machine learning concepts, prompt engineering, and responsible AI usage.
Experience with AI-driven automation in CI/CD pipelines and infrastructure management.
Soft Skills
Ability to quickly learn new skills and tools.
Strong troubleshooting skills for performance and stability issues.
Clear communication, thorough documentation, and collaborative peer review.
Why Cisco?
At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.
Message to applicants applying to work in the U.S. and/or Canada
The starting salary range posted for this position is $183,800.00 to $263,600.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.
Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.
U.S. employees are offered benefits, subject to Cisco's plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.
U.S. employees are eligible for paid time away as described below, subject to Cisco's policies:
10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
1 paid day off for employee's birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco
Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees
Exempt employees participate in Cisco's flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)
80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next
Additional paid time away may be requested to deal with critical or emergency issues for family members
Optional 10 paid days per full calendar year to volunteer
For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco's policies.
Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:
.75% of incentive target for each 1% of revenue attainment up to 50% of quota;
1.5% of incentive target for each 1% of attainment between 50% and 75%;
1% of incentive target for each 1% of attainment between 75% and 100%; and
Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.
For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.
The applicable full salary ranges for this position, by specific state, are listed below:
New York City Metro Area:
$183,800.00 - $303,100.00
Non-Metro New York state & Washington state:
$163,600.00 - $269,800.00
For quota-based sales roles on Cisco's sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.
** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.
Cisco is an Affinitive Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.
Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.
#J-18808-Ljbffr
A leading software company in San Francisco is looking for an experienced software engineer specializing in cloud infrastructure. This role involves driving cloud strategies, building self-service platforms, and ensuring reliability standards across services. The ideal candidate will thrive in a fast-paced environment and have a strong technical background. The position offers competitive salary ranging from $130,000 to $250,000 based on experience.
#J-18808-Ljbffr
$130k-250k yearly 5d ago
Remote DevOps & IaC Engineer for AI Pipelines
Labelbox 4.3
Remote job
A leading AI data platform is seeking a DevOps/IaC Engineer on a contract basis. This remote role focuses on designing and maintaining cloud infrastructures for AI pipelines, automating processes with tools like Terraform, and improving system performance. Ideal candidates should have substantial expertise in DevOps, a deep interest in AI infrastructure, and proficiency in major cloud providers. Join a team making a significant impact in the AI industry while enjoying flexible work hours.
#J-18808-Ljbffr
$119k-162k yearly est. 5d ago
Cloud DevOps Engineer - Latin America - Remote
Azumo, LLC
Remote job
Azumo is currently looking for a highly motivated Cloud DevOps Engineer to develop and maintain cloud infrastructure for next-generation web, mobile, and IoT Applications. The position is FULLY REMOTE based in Latin America.
IT Infrastructure Management:
● Provisioning of infrastructure components for supporting business services
● Developing, configuring, and deploying tools to be used by SRE/DevOps teams
● Developing and maintenance of system documentation
● Change Management: Determining and validating new features and updates for the
managed infrastructure Site Availability, Reliability and Serviceability
● Handling support escalation issues and incident reviews
● Availability and Performance Monitoring
● Service Recovery and Emergency Response
● Capacity Planning
● Automated Infrastructure Provisioning (Infrastructure as Code) DevOps
● Creation and maintenance of CI/CD pipelines
● Creation and maintenance of Docker images
● Creation and maintenance of automation scripts
● Support to developers and QA teams during the software lifecycle process
● Deploy applications to production and non-production environments
The Cloud DevOps Engineer will be based in Latin America. Compensation is commensurate with experience and candidate potential.
At Azumo we strive for excellence and strongly believe in professional and personal growth. We want each individual to be successful and pledge to help each achieve their goals while at Azumo and beyond. Challenging ourselves and learning new technologies is at the core of what we do.
Based in San Francisco, California, Azumo is an innovative nearshore software development firm helping organizations build, deploy, and maintain modern web, mobile, data and cloud applications.
If you are based in Latin America, qualified for the opportunity and looking for a challenge, please apply online at ************************** or connect with us at ***************
● You have worked as an SRE/DevOps/SysAdmin for more than 5 years
● Required Technical Skills You can demonstrate working experience on the following
technology stack. Certifications are a plus
● Linux Systems Administration (RedHat Enterprise , Ubuntu, Debian): +5 years
● Kubernetes: 3+ years
● Linux Containers: 3+ years
● Helm, Kustomize or equivalent: 1+ year
● Istio or equivalent: 1+ year
● Terraform: 2+ years
● Bash: 5+ years
● AWS EKS or GCP GKE, AKS : 2+ years
● GitLab Pipelines or GitHub Actions: 2+ years
● GIT or equivalent: 3+ years
● AWS, GCP, Azure: 2+ years
● Python, Go or equivalent: 2+ years
● MongoDB, PostgreSQL, AWS RDS, AWS DynamoDB or equivalent: 2+ years
● Security hardening of Linux OS, Kubernetes, Istio, Containers: 2+ years
● Datadog, AWS Cloudwatch or equivalent: 2+ years
Company benefits include:
Paid time off
English classes
U.S. Holidays
Training
Udemy free Premium access
Profit Sharing
Mentored career development
$US Remuneration
#J-18808-Ljbffr
$112k-154k yearly est. 5d ago
Senior DevOps Engineer - Remote AI Healthcare Infra
Akasa
Remote job
A healthcare AI company is seeking a Sr. Infrastructure Engineer to enhance and scale their systems. Responsibilities include managing infrastructure with Terraform and Kubernetes, creating monitoring solutions, and troubleshooting. Ideal candidates will have 5+ years in Python, strong observability skills, and a collaborative mindset. The role is based in South San Francisco with expectations of attending in-person co-working days.
#J-18808-Ljbffr
$112k-154k yearly est. 5d ago
Senior DevOps Engineer - AI Cloud, Kubernetes & CI/CD Lead
Stack Ai, Inc.
Remote job
A cutting-edge AI infrastructure company is seeking a Senior DevOps Engineer to design, build, and manage scalable systems. Responsibilities include owning cloud infrastructure, leading Kubernetes operations, and streamlining CI/CD pipelines. The ideal candidate will have over 5 years of experience and expertise in DevOps tools and cloud services. This role offers hybrid flexibility, allowing collaboration in San Francisco or fully remote work within an innovative team that significantly influences the company's impact in the AI sector.
#J-18808-Ljbffr
A leading technology firm in San Francisco is seeking a DevOps Engineer to streamline the software development lifecycle. You will design CI/CD pipelines, manage cloud infrastructure, and automate deployments. Ideal candidates should have a Bachelor's degree, 2+ years of DevOps experience, and proficiency with CI/CD tools and cloud platforms. The position offers competitive salary and flexible working options.
#J-18808-Ljbffr
$112k-154k yearly est. 1d ago
DevOps Engineer - AI-Powered DevTools (Hybrid)
Coderabbit
Remote job
A leading R&D company in San Francisco seeks a DevOps Engineer to scale and secure infrastructure for AI-enabled tools. You will design CI/CD pipelines, improve system reliability, and collaborate with various teams for optimal performance. Ideal candidates have 3-5 years in a fast-paced tech environment, with expertise in cloud services and CI/CD practices. The role offers a competitive salary, equity, and a hybrid work culture.
#J-18808-Ljbffr
A technology company is seeking a DevOps Engineer to design and maintain scalable infrastructure solutions, collaborating closely with software engineers. This role requires at least 3 years of experience in distributed systems and cloud-based environments such as AWS or Azure, along with experience in containerization technologies like Docker and Kubernetes. The company values a collaborative approach and offers a salary range of $150,000 - $200,000 per year plus equity and benefits.
#J-18808-Ljbffr
$150k-200k yearly 5d ago
Lead Platform DevOps Engineer - Cloud & Containers (Remote)
Booz Allen Hamilton 4.9
Remote job
A leading consulting firm is seeking a DevOps engineer in Washington, DC, to build and manage cloud-based container platforms. The role requires extensive experience in developing tools for DevOps processes and troubleshooting pipeline issues. Ideal candidates should have a Bachelor's degree or equivalent experience, with proficiency in Kubernetes, Docker, and cloud solutions. The firm values a collaborative culture, providing comprehensive benefits and opportunities for professional growth.
#J-18808-Ljbffr
$95k-125k yearly est. 4d ago
Infrastructure Engineer, AI & LLM Platform (Hybrid)
Ivo
Remote job
A forward-thinking tech company in San Francisco is seeking an Infrastructure Engineer to design and manage complex distributed systems. As part of the engineering team, you will own the future of the infrastructure, manage customer deployments, and enhance performance monitoring. The ideal candidate is passionate about LLMs and eager to push boundaries in a hybrid work environment. Competitive compensation ranges from $225K to $485K annually, depending on experience.
#J-18808-Ljbffr
$115k-175k yearly est. 3d ago
Infrastructure Platform Engineer
Fieldguide
Remote job
About Us
Fieldguide is establishing a new state of trust for global commerce and capital markets through automating and streamlining the work of assurance and audit practitioners specifically within cybersecurity, privacy, and financial audit. Put simply, we build software for the people who enable trust between businesses.
We're based in San Francisco, CA, but built as a remote‑first company that enables you to do your best work from anywhere. We're backed by top investors including Bessemer Venture Partners, 8VC, Floodgate, Y Combinator, DNX Ventures, Global Founders Capital, Justin Kan, Elad Gil, and more.
We value diversity - in backgrounds and in experiences. We need people from all backgrounds and walks of life to help build the future of audit and advisory. Fieldguide's team is inclusive, driven, humble and supportive. We are deliberate and self‑reflective about the kind of team and culture that we are building, seeking teammates that are not only strong in their own aptitudes but care deeply about supporting each other's growth.
As an early stage start‑up employee, you'll have the opportunity to build out the future of business trust. We make audit practitioners' lives easier by eliminating up to 50% of their work and giving them better work‑life balance. If you share our values and enthusiasm for building a great culture and product, you will find a home at Fieldguide.
About the Role
As an Infrastructure Platform Engineer at Fieldguide, you will play a pivotal role in our mission to design, build, and deliver a cutting‑edge foundation for our product teams in a dynamic, cloud‑based environment. You'll collaborate closely with our talented product developers to craft and maintain the optimal cloud infrastructure, finding the balance between performance, resilience, and cost‑effectiveness. You'll be at the forefront of emerging technologies, overseeing our production environment by monitoring availability and taking a holistic approach to system health to ensure top‑notch quality, security, and reliability. Your role will also encompass providing primary operational support and engineering expertise for our distributed software applications across the entire Fieldguide environment.
What You'll Do
Design, build, and maintain core platform infrastructure to support scalable, reliable, and secure services across engineering teams.
Develop and manage Infrastructure as Code (IaC) using tools like Terraform to ensure consistent, reliable environments.
Collaborate with software and data engineering teams to provide self‑service infrastructure and platform tooling that accelerates development workflows.
Monitor and improve system reliability, performance, and cost efficiency through metrics, logging, and alerting frameworks (e.g., Datadog, CloudWatch).
Ensure infrastructure security and compliance by implementing best practices for identity management, network segmentation, secrets handling, and vulnerability management.
Contribute to incident response and postmortem processes, driving root cause analysis and long‑term improvements.
Mentor and collaborate with other engineers, fostering a culture of reliability, automation, and continuous improvement.
Support disaster recovery and business continuity planning, ensuring high availability and resilience of critical systems.
Document infrastructure design, architecture decisions, and operational procedures for transparency and team enablement.
Who You Are
You have extensive hands‑on experience in constructing complex cloud solutions using multiple AWS services.
You are skilled in provisioning and configuring cloud services using Terraform and the AWS CLI / API.
You have proficiency in designing effective monitoring/alerting and log aggregation solutions using tools like Datadog and AWS CloudWatch (New Relic, Prometheus/Grafana, etc.).
You have a solid understanding of data systems, including both SQL and NoSQL.
You have experience in developing and maintaining software in security and regulatory compliance environments (SOC 2, PCI‑DSS, HIPAA, etc.).
You are comfortable in participating in on‑call support to ensure 24/7 availability of services.
You have a passion for mentoring and coaching other engineers.
You have excellent communication and organizational skills, capable of managing multiple competing priorities in a rapidly evolving environment.
Bonus Points
You have experience with GraphQL as a database front‑end API.
You have experience with database system architecture (e.g., Postgres) and observability, to help us increase our overall database performance.
You have experience both working with AI, and with providing it as a tool for engineers and our internal applications to utilize.
You have experience working through and designing for security audits (e.g., SOC2, PCI, etc.).
More about Fieldguide
Fieldguide is a values‑based company. Our values are:
Fearless - Inspire & break down seemingly impossible walls.
Fast - Launch fast with excellence, iterate to perfection.
Lovable - Deliver happiness & 11 star experiences.
Owners - Execute & run the business with ownership.
Win‑win - Create mutual value & earn trust for life.
Inclusive - Scale the best ideas with inclusive teams.
Some of our benefits include
Competitive compensation packages with meaningful ownership
Flexible PTO
401k
Wellness benefits, including a bundle of free therapy sessions
Technology & Work from Home reimbursement
Flexible work schedules
#J-18808-Ljbffr
$115k-175k yearly est. 2d ago
Hybrid Cloud & Infrastructure Engineer
State Bar of California 3.7
Remote job
A state legal authority in California is seeking an Infrastructure and Cloud Engineer to manage its hybrid cloud and on-premises infrastructure. This role involves optimizing performance across enterprise platforms like Microsoft Azure and SQL Server while supporting a collaborative environment. The ideal candidate will have a Bachelor's degree and two years of relevant experience. This position allows for remote work up to four days a week, reflecting a commitment to work-life balance and modern workplace practices.
#J-18808-Ljbffr
$109k-149k yearly est. 2d ago
Network Engineer, Operations & Reliability
Fluidstack
Remote job
About the Role
Fluidstack is seeking a Network Operations Engineer to serve as a Regional Site Lead for one of our datacenter campuses. This is a hybrid role that combines hands‑on Tier 2/3 network operations with site leadership responsibilities. You'll be the boots‑on‑the‑ground expert for your assigned datacenter/campus, ensuring network reliability through incident response, break‑fix coordination, and operational excellence. You'll work remotely when workload allows but be onsite as needed for deployments, complex troubleshooting, and critical incidents.
This role is ideal for experienced network operators who want ownership of a datacenter campus while being part of a broader operations organization. You'll partner closely with the Operations & Reliability pillar lead, centralized NOC for Tier 1 escalations, and cross‑functional teams including Deployment, Hardware, and DC Operations. Success means maintaining high availability for your region, building strong relationships with onsite teams, and growing into regional operations leadership as the team scales.
Focus
Regional Operations Ownership: Serve as the primary network operations contact for your assigned datacenter campus. Own network health, respond to incidents escalated from NOC, and ensure fabrics run reliably. Build deep knowledge of your region's network topology, common failure modes, and operational characteristics.
Tier 2+ Incident Response: Handle network incidents escalated from Tier 1 NOC during your coverage window. Troubleshoot complex issues across physical and logical layers, coordinate with other engineers for follow‑the‑sun coverage, and drive incidents to resolution. Lead incident response when you're the subject matter expert on the ground.
Break‑Fix Coordination: Coordinate hardware break‑fix activities with onsite DC Operations technicians. Manage linecard swaps, optic replacements, device troubleshooting, and RMA processes. Ensure physical infrastructure issues are resolved quickly and don't impact production workloads.
Deployment Support: Provide operational support during new datacenter deployments and expansions in your region. Partner with Deployment teams on turn‑up activities, validate production readiness, and ensure smooth handovers from deployment to operations. Be the person who ensures new pods integrate seamlessly into operational workflows.
Runbook Execution & Improvement: Execute operational runbooks for common failure scenarios and maintenance procedures. Identify gaps in runbooks, document lessons learned, and provide feedback to the Operations pillar lead on runbook improvements. Build the operational knowledge base for your region.
Cross‑Team Collaboration: Build strong relationships with onsite DC Operations teams, structured cabling vendors, and hardware logistics partners. Serve as the network engineering liaison for your datacenter campus. Communicate clearly about network status, planned maintenance, and operational issues.
Regional Mentorship: As the regional team scales, mentor junior operations engineers assigned to your datacenter. Share operational knowledge, provide guidance during incidents, and help build regional operations capacity.
About You
Strong Operations Background: 5-8 years in network engineering with significant hands‑on operational experience. You've run production networks, responded to incidents at all hours, and debugged complex failures under pressure. You understand the difference between "working" and "production‑ready."
Datacenter Fabric Expertise: Deep experience operating modern datacenter networks including EVPN/VXLAN, BGP, CLOS topologies, and high‑radix switching. You're comfortable troubleshooting Layer 2/3 issues, BGP routing problems, fabric misconfigurations, and physical layer failures.
Incident Response Excellence: Proven ability to lead incident response, perform systematic troubleshooting, and drive issues to resolution. You remain calm during outages, communicate clearly with stakeholders, and know when to elevate versus dig deeper. You've been the person others call when things break.
Site Leadership Capability: You've been the go‑to network person for a site, datacenter, or region before. You understand how to build relationships with onsite teams, coordinate physical infrastructure work, and represent network engineering in a field environment. You know how to get things done in operational settings.
Operational Pragmatism: You balance perfection with progress. You can troubleshoot with imperfect information, make pragmatic decisions under time pressure, and prioritize based on business impact. You document as you go and continuously improve operational processes.
Hybrid Work Comfort: You're productive working remotely but understand that datacenter operations sometimes require hands‑on presence. You're comfortable with flexible schedules that adapt to operational needs-sometimes remote, sometimes onsite for days or weeks during critical periods.
Nice to Haves
AI/HPC Fabric Operations: Experience operating AI/ML or HPC fabrics with RDMA (RoCEv2), lossless Ethernet (PFC, ECN), or high‑performance networking. You understand the operational precision required when network performance directly impacts workload completion.
Regional/Campus Operations Leadership: You've been a site lead, campus engineer, or regional operations lead before. You know how to coordinate across teams in a specific geographic location while reporting into a centralized organization.
Hardware Break‑Fix Experience: Hands‑on experience coordinating hardware repairs, RMAs, and physical infrastructure work. You understand datacenter logistics, vendor escalation processes, and how to work effectively with onsite technicians.
Observability & Monitoring: Familiarity with network monitoring platforms, alerting systems, and telemetry collection. You've used monitoring tools to diagnose issues proactively and tune alerting to reduce noise.
Automation Exposure: Basic scripting or automation experience (Python, Ansible) for operational tasks. You may not be writing complex automation but you understand how to leverage tools to improve operational efficiency.
Follow‑the‑Sun Experience: Experience working in distributed operations teams with follow‑the‑sun coverage models. You understand how to hand off incidents cleanly, communicate operational status across time zones, and coordinate with global teams.
Salary & Benefits
Competitive total compensation package (salary + equity).
Retirement or pension plan, in line with local norms.
Health, dental, and vision insurance.
Generous PTO policy, in line with local norms.
The base salary range for this position is $150,000 - $250,000 per year, depending on experience, skills, qualifications, and location. This range represents our good faith estimate of the compensation for this role at the time of posting. Total compensation may also include equity in the form of stock options.
We are committed to pay equity and transparency.
Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
#J-18808-Ljbffr
$150k-250k yearly 1d ago
Infrastructure and Cloud Engineer
New River Community College 3.7
Remote job
Office of Information Technology
Annual Salary Range: $95,784 - $127,713
FLSA Exempt / Union Represented
allows for up to four days of remote work per week.
About the Office
The Office of Information Technology (IT) is responsible for enabling State Bar's internal and external stakeholders by the management, implementation, and maintenance of technology that supports the State Bar's mission and goals. The office's primary goals are to build and maintain functional capabilities, support innovation, and ensure that all systems are running smoothly, efficiently, and securely to meet the needs of the organization and its stakeholders.
Job Overview
The Office of Information Technology is seeking an Infrastructure and Cloud Engineer to administer, support, and optimize the State Bar's hybrid cloud and on-premises infrastructure. This role supports core enterprise platforms including Microsoft Azure, Microsoft 365, Windows Server, Active Directory and Entra ID, virtualization technologies, SQL Server, enterprise storage, and network and telecommunications systems.
The engineer plays a key role in infrastructure modernization and cloud transformation, improving operational efficiency and service reliability across a multi‑site enterprise environment. The position works across cloud, server, identity, networking, storage, and endpoint management domains and collaborates closely with cybersecurity, application teams, and vendor partners to support secure and resilient enterprise operations.
Ideal Candidate
The ideal candidate has a solid foundation in cloud and infrastructure technologies and has expertise across areas such as Azure, Microsoft 365, Windows Server, identity services, networking, storage, and endpoint management in a hybrid environment. They are curious, willing to learn, and able to apply their skills across a variety of technical tasks.
They work independently, bring a growth mindset, and collaborate well with others. They communicate clearly, stay organized, and approach problem‑solving in a steady and thoughtful way. They are dependable, take ownership of their work, and are motivated to contribute to meaningful projects as part of a collaborative, service‑oriented team at the State Bar.
Distinguishing Characteristics
IT Analyst I - Entry‑level class; performs less than full range of duties; less complex matters; under more direct supervision.
IT Analyst II - Journey‑level class; performs full range of duties; more complex matters; under less direct supervision.
Examples of Essential Duties
Evaluates customer technical needs and recommends solutions; plans, determines requirements, designs, builds, customizes, tests, implements, maintains and/or enhances hardware and software systems.
Provides professional customer support for system‑related software/hardware issues, interacts with clients to analyze requirements and recommend technology solutions.
Develops cost‑benefit analyses, evaluates risk options, ensures project compliance with procedures, budgets, and resource utilization.
Coordinates project scopes, budgets, resources; interfaces with clients; designs and implements testing and QA processes.
Coordinates IT activities of departments/vendors; resolves obstacles; manages delivery and installation.
Prepares technical documentation, procedural plans, reports; participates in committees, task forces; attends trainings.
Builds positive relationships with employees, vendors, and the public; exercises technical supervision; provides after‑hours support.
Job Specific Examples of Essential Duties
Manage and optimize cloud infrastructure across IaaS, PaaS, and hybrid environments.
Administer Microsoft 365 services (Exchange Online, Teams, SharePoint, OneDrive) and related identity, security, compliance configurations.
Monitor and optimize performance across server, network, storage, cloud, and database systems.
Administer Windows Servers and Azure VMware Solution, including configuration, maintenance, upgrades, patching, restoration.
Design, configure, install, and maintain enterprise network infrastructure.
Troubleshoot and resolve network and system connectivity issues.
Develop and maintain network access, security, and change‑control procedures.
Analyze business needs and prepare technical design specifications for network solutions.
Design, implement, and maintain telecommunications systems.
Administer and maintain SQL Server environments, including tuning, indexing, optimization, backup, recovery.
Implement and test backup, recovery, restoration procedures for storage systems.
Prepare documentation and operational procedures for storage management and recovery.
Lead and coordinate technical infrastructure projects.
Provide customer support and deliver user and technical training.
Coordinate procurement activities and vendor partnerships.
Support identity lifecycle operations in Active Directory and Entra ID.
Administer Microsoft Intune for device provisioning, compliance, application deployment.
Administer ManageEngine AD Manager Plus and M365 Manager Plus for reporting and provisioning workflows.
Provide infrastructure data and system insights to assist cybersecurity teams.
Desired Knowledge
Azure infrastructure operations, optimization practices, Azure VMware Solution.
Microsoft 365 administration (Exchange Online, Teams, SharePoint, OneDrive).
PowerShell or VBScript for automation and system management.
Monitoring, logs, alerts, system health across infrastructure.
Windows Server and Active Directory administration (Group Policy, DNS, identity security).
Network routing, switching, wireless technologies, networking security.
Firewalls, routers, switches, Cisco technologies.
Telephone and audio‑visual technologies.
SQL Server administration, hybrid database environments, high availability.
Storage technologies (SAN, fiber channel).
Backup, recovery, disaster recovery (snapshots, mirroring, failover).
Entra ID directory services, identity lifecycle operations.
Microsoft Intune device and endpoint management concepts.
ManageEngine AD Manager Plus and M365 Manager Plus administration.
Desired Ability
Gather, analyze and evaluate data for logical reasoning and recommendations.
Research, design, implement, and maintain hardware and software solutions.
Communicate technical information to varied audiences.
Interpret and explain policies and procedures.
Plan, organize, prioritize work to meet deadlines.
Utilize specialized terminology; interpret technical information.
Adapt quickly to changes.
Communicate effectively in writing and orally.
Maintain effective working relationships within and outside the department.
Prepare documentation for procedures, processes, tables.
Identify and resolve performance and security issues.
Lead and coordinate technical projects; manage tasks; support long‑term planning.
Use monitoring and analytics tools for system performance.
Install, configure, secure, optimize server platforms.
Administer and troubleshoot Microsoft 365 services and security compliance.
Plan, design, install, document network infrastructure.
Monitor network performance and security.
Administer SQL Server environments including high‑availability.
Maintain and support backup/recovery and storage solutions.
Collaborate with cybersecurity teams during audits and incident response.
Minimum Qualifications
Education: Bachelor's degree in a related field or equivalent academic achievement.
Experience: Two (2) years of full‑time, progressively responsible experience in analyzing and troubleshooting computer applications and operations.
Licenses/Certificates: Possession of approved IT certificates and/or completion of other approved technology training may substitute for some or all of the required education. Certification hours equal one (1) year of education.
About the State Bar
The State Bar of California's mission is to protect the public and includes the primary functions of licensing, regulation, discipline of attorneys; the advancement of ethical and competent practice of law; and support of efforts for greater access to and inclusion in the legal system.
Our Values
Clarity | Investing in Our People | Excellence | Respect | Growth Mindset
Learn more about our values.
DEI Statement
We are a diverse, equitable, and inclusive workplace where all of our employees and prospective employees experience fairness, dignity, and respect.
Learn more about our commitment to DEI.
#J-18808-Ljbffr
$95.8k-127.7k yearly 1d ago
Payments Systems Engineer - Global Checkout, Hybrid (SF)
Openai 4.2
Remote job
A pioneering technology firm in San Francisco is seeking a Software Engineer to join their Payments Team. This role involves architecting and scaling core payment infrastructure, collaborating with cross-functional teams, and ensuring reliable financial systems. Candidates should have over 5 years of experience in software engineering with a focus on payments or financial systems. This position offers a hybrid work model and the opportunity to contribute to the sustainability of AGI.
#J-18808-Ljbffr
Nowadays, it seems that many people would prefer to work from home over going into the office every day. With remote work becoming a more viable option, especially for linux engineers, we decided to look into what the best options are based on salary and industry. In addition, we scoured over millions of job listings to find all the best remote jobs for a linux engineer so that you can skip the commute and stay home with Fido.
We also looked into what type of skills might be useful for you to have in order to get that job offer. We found that linux engineer remote jobs require these skills:
Python
Troubleshoot
Bash
Cloud
Unix
We didn't just stop at finding the best skills. We also found the best remote employers that you're going to want to apply to. The best remote employers for a linux engineer include:
Lockheed Martin
IEHP
System One
Since you're already searching for a remote job, you might as well find jobs that pay well because you should never have to settle. We found the industries that will pay you the most as a linux engineer:
Health care
Finance
Automotive
Top companies hiring linux engineers for remote work