Hardware Support Engineer
Infrastructure engineer job in Palo Alto, CA
Cognizant is a leading provider IT and BPO services, providing critical initiatives to a variety of global clients. The Hardware Operations team is a part of a high profile client project that provides interactive panoramas from positions along many streets in the world. Hardware Operations is responsible for building, testing, deploying, and maintaining imagery hardware and sensors used on different platforms.
The Hardware Support Engineer needs to have the ability to test and debug advanced, client-designed equipment like high-resolution cameras, mechanical frames, cables, and geolocation hardware, as well as completing on demand engineering tests and producing technical solutions. Take on project management tasks, analyze common hardware failures, and help create and maintain technical documentation. Act as a subject matter expert for global hardware troubleshooting and repair procedures. identifying and addressing technical knowledge gaps within the team. Work closely with regional tech leads to improve processes and troubleshoot intermittent faults. This role is based in Palo Alto, CA. Only local candidates will be considered.
Role Responsibilities
Responsible in assisting engineering teams that are testing advanced client-designed equipment such as high resolution cameras, mechanical frames, cables and overall geolocation equipment when required and agreed by the
team lead
Manage field support technical projects.
Analyze common HW failures by testing and debugging specific client designed electronics equipment
Responsible for supporting the development of the documentation of test procedures and reports for new and existing hardware/components
Audit and maintain knowledge resources with accurate and useful information
Perform the role of a SME in the review process of hardware support troubleshooting and repair procedures at a global scale. This includes:
Monitoring the technical standards of the field support team globally
Identifying technical knowledge gaps and work with the leads to address those needs
Identifying patterns of service affecting failures to develop technical or process improvements.
Being fully aligned with the Global and Regional Leads on current field support workflows, project requirements in order to participate in process improvements.
Identifying common intermittent faults and analyzing cause and repair
Desired Skills & Experience
At least 3 years of relevant work experience
Must have the ability to travel, often on short notice.
Must have a Valid passport or the ability to obtain one within
A clear driving record (MVR) and the ability to operate motor vehicles.
Expert knowledge of and experience with Hand tools, light power tools, Types and application of hardware, Measuring tools and techniques, Basic automotive repair, Understanding of wiring diagrams, Electronics repairs. Intermediate knowledge of Linux
Working knowledge of Ticketing systems, Gmail and Google Calendar, Basic spreadsheet tasks (Google Sheets, preferred), Moderate to advanced automotive repair.
Familiarity with various testing techniques and great hardware troubleshooting skills
Hardware troubleshooting, installation and warehouse management
Familiarity with vehicle parts, computer parts , camera parts, PCB etc
Excellent interpersonal and communication skills with the ability to operate and communicate effectively with people at all levels of the business.
Comfortable with a rapidly-changing environment
Strong problem-solving skills and excellent attention to detail
Able to work independently, motivated, proactive attitude with a passion for learning and creative problem solving
Hourly Rate and Other Compensation:
The annual salary for this position is between $72,000 - $93,000 depending on experience and other qualifications of the successful candidate.
This position is also eligible for Cognizant's discretionary annual incentive program, based on performance and subject to the terms of Cognizant's applicable plans.
Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:
Medical/Dental/Vision/Life Insurance
Paid holidays plus Paid Time Off
401(k) plan and contributions
Long-term/Short-term Disability
Paid Parental Leave
Employee Stock Purchase Plan
Disclaimer: The hourly rate, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.
LA County (only): Qualified applicants with arrest and/or conviction records will be considered for employment.
Cognizant will only consider applicants for this position who are legally authorized to work in the United States without requiring company sponsorship now or at any time in the future.
Server Administrator
Infrastructure engineer job in San Francisco, CA
Candidates ONLY "No 3rd Party Candidates"
The Server Administrator will work with a broad range of customers, partners, and key stakeholders in administrative and academic units to provide best-in-class server administration services.
Required Qualifications
Bachelor's degree, or equivalent combination of experience/training, in one or more of the following fields: computer science, engineering, computer information systems, etc.
3+ years of experience in one or more of the following fields: server administration, information technology, etc.
Prior experience installing, configuring, modifying, and supporting Windows and Linux operating systems, hypervisor, and other virtualization technologies.
Prior experience in information technology, platform services, or server administration.
Experience with monitoring, auditing, tuning, analysis and optimization of system performance, security and capacity planning, patching, and upgrades.
Prior experience with Unix and PowerShell scripting and scripting with Perl, Python, or other modern languages.
Proficiency in key Infrastructure as Code (IaC) methodologies and principles.
Strong customer service skills.
Ability to triage and escalate to supervisors and/or other teams for resolution.
Strong written and verbal communication skills and ability to communicate technical information and ideas to a diverse community of colleagues and stakeholders.
Ability to establish and advance positive working relationships and strong rapport with team members, stakeholders, and customers.
Strong organizational skills and ability to balance competing priorities and support concurrent projects.
Demonstrated problem-solving skills; scopes solutions based on knowledge of available resources and timelines.
Must have Windows sys admin, Linux, RHEL, Ubuntu, Citrix XenServer, Vmware, Ansible
Nice to have: EPIC, Bigfix, ServiceNOW, Morpheus, iDRAC, Netscaler
Staff ML Infrastructure Engineer
Infrastructure engineer job in Fremont, CA
Staff / Lead ML Infrastructure Engineer
San Francisco, CA - Onsite
Salary - Over market average + equity
We are building one of the world's leading generative video and multimodal AI platforms, and we're looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.
What You'll Own
Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
High-Throughput Compute Systems: Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
Production Reliability for Generative Models: Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.
What You've Done
Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
Strong engineering fundamentals in Python, Go, or equivalent languages.
Previous exposure to ML training pipelines-especially systems that handle heavy video, multimodal, or high-dimensional data.
Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.
Nice to Have
Experience with video processing systems, large-scale media pipelines, or streaming architectures.
Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
Background working in high-growth AI startups or research-focused environments.
Security and compliance considerations for models that generate or process user content.
Why Join
Shape the underlying platform powering one of the most advanced generative video systems in the world.
Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.
Security and Infrastructure Engineer
Infrastructure engineer job in San Francisco, CA
Role: Security & Infrastructure Engineer (Consultant)
Client: Pharma Client
We are looking for a strong Security and Infrastructure Engineer with hands-on experience in infrastructure management and configuration assessments across SaaS platforms, cloud environments, and on-prem systems.
Must Haves:
4-year Technical Degree
8+ years in IT Infrastructure & Security Engineering
5+ years scripting using Python & PowerShell
3+ years working with compliance frameworks (CIS Benchmarks, NIST, ISO)
Selection Priorities:
Strong Infrastructure & Security experience
Pharma/BioTech/Healthcare/Life Sciences background
Local to SF, CA
Senior Lead IT / Systems Administrator
Infrastructure engineer job in San Ramon, CA
About the Company
At the Pac-12, we are passionate about sports and technology! As part of our team, you will play a key role in driving the engineering operations and technology initiatives that power our business.
About the Role
We are seeking a highly skilled and motivated Sr. Lead IT/Systems Administrator to oversee and optimize our company's IT infrastructure, ensuring it meets the needs of our growing business. This role is perfect for an experienced, hands-on IT leader who thrives on technical challenges, strategic planning, and leading teams to success.
You will be responsible for designing, implementing, and maintaining complex IT systems, ensuring stability, security, and performance. The Sr. Lead IT/Systems Administrator will work closely with senior leadership to align IT strategies with business objectives while fostering a collaborative and innovative environment within the IT team.
The ideal candidate will be well versed in multiple operating systems, including Windows, Mac, and Linux, and have strong expertise in SQL and Proxmox virtualized environments. They will be a hands-on leader with a collaborative, “let's do it together” attitude, committed to elevating the IT and Systems team. Experience with SentinelOne, NinjaOne, and JAMF is a strong plus.
RESPONSIBILITIES
Leadership & Team Management:
Lead and mentor a team of IT support technicians and systems administrators, ensuring they have the resources, guidance, and training to grow and excel.
Champion a collaborative and high-performance culture within the IT department, encouraging knowledge sharing, innovation, and growth.
IT Infrastructure & Operations Management:
Oversee the design, implementation, and maintenance of the company's IT infrastructure, ensuring network availability, reliability, and scalability.
Manage system and server administration (Windows/Linux), SQL databases, cloud services, virtualized environments (e.g., VMware, Proxmox), and Mac environments.
Ensure a secure, stable, and efficient IT environment by proactively identifying potential issues and implementing solutions.
Maintain oversight of all IT assets, including hardware, software, and cloud services.
Develop and maintain disaster recovery and business continuity plans to safeguard business operations.
Strategic Planning & Project Management:
Collaborate with senior leadership to develop and implement IT strategies that align with the company's overall business objectives.
Oversee IT projects from initiation to completion, ensuring they are delivered on time, within scope, and within budget.
Develop long-term technology roadmaps, recommending improvements and upgrades to ensure the IT environment is future-proof and scalable.
Manage and optimize IT budget, ensuring efficient allocation of resources and cost-effective solutions.
Security & Compliance:
Ensure that IT systems are secure, compliant with industry standards, and adhere to data protection regulations.
Develop and enforce IT security policies, implementing robust security measures such as OS firewalls, encryption, and intrusion detection.
Conduct regular security audits and risk assessments to identify vulnerabilities and mitigate risks.
Stay current with industry trends and evolving cybersecurity threats, ensuring the organization is always protected.
Vendor & Stakeholder Management:
Manage relationships with external vendors, service providers, and contractors, ensuring service level agreements (SLAs) are met and costs are controlled.
Lead negotiations and procurement for IT services, equipment, and software.
Collaborate with other departments and business units to understand and support their technical needs.
User Support & Training:
Oversee the development and delivery of IT training programs to enhance user knowledge and improve system efficiency.
Ensure the IT helpdesk provides high-quality, timely support for all staff across various technical issues.
Ensure documentation and knowledge bases are continuously updated to support end-user training and troubleshooting.
QUALIFICATIONS
Education: Bachelor's degree in Information Technology, Computer Science, or related field.
Experience: 8+ years of experience in IT operations, with a minimum of 2-4 years in a senior technical role overseeing infrastructure and systems management.
Proven expertise in managing large-scale IT infrastructure, including networks, servers (Windows/Linux), virtualization platforms (VMware, Proxmox), cloud technologies (AWS, Google Cloud), and database management.
Demonstrated ability to design, implement, and support highly available, secure, and scalable IT systems.
Strong background in IT security and risk management, including experience with firewalls, VPNs, intrusion detection systems, and encryption technologies.
Proven success in managing IT budgets, resources, and complex projects.
Technical Skills
Solid understanding of networking protocols (TCP/IP, DNS, DHCP)
Expertise in systems administration, including Windows Server, Linux, Mac and cloud environments.
Experience with database management (MSSQL Server).
Proficiency with cybersecurity tools, practices, and frameworks (e.g., firewalls, endpoint protection, SIEM, SentinelOne).
Soft Skills:
Exceptional leadership and team management skills, with the ability to motivate and develop high-performing teams.
Strong problem-solving, analytical, and troubleshooting abilities.
Excellent verbal and written communication skills, able to translate technical concepts to non-technical audiences.
Strong project management skills, able to prioritize and manage multiple projects simultaneously.
Results-oriented with a proactive, “can-do” attitude.
Preferred Qualifications:
Relevant certifications such as CompTIA Network+, Security+, Microsoft Certified: Windows Administrator, AWS Certified Solutions Architect, PMP, or ITIL.
Experience with broadcast technologies (e.g., playout automation, video servers, MAM, streaming protocols) is a plus.
WORKING CONDITIONS:
Primarily office-based in San Ramon, CA with occasional remote work flexibility.
Evening or weekend work for system maintenance, upgrades, or emergency support.
Evening or weekend work to provide IT and systems support for scheduled live productions.
Limited travel to other company locations may be required.
COMPENSATION
The exact salary will depend on the successful candidate's, relevant skills, experience, and qualifications.
PAC-12 OVERVIEW
The Pac-12 stands at a defining moment in its history. Founded in 1915, the league's rich legacy of athletic and academic excellence spans over 100 years. Supported by world-class service and empowerment, Pac-12 student-athletes have earned more than 500 NCAA team championships. Now with a renewed and bold vision for its future, the Pac-12 has undergone significant transformation on its journey to launching a new collegiate athletics conference, custom-built for both the modern-day student-athlete and an evolving college sports landscape.
Under the leadership of Commissioner Teresa Gould, the Pac-12 embarks on creating a new legacy, composed of nine member universities, a one-of-its-kind and state-of-the-art broadcast production facility in San Ramon, CA., and a reimagined commercial enterprise that is uniquely positioned to drive strategic partnerships, brand enhancement, revenue generation and other growth opportunities to unlock new and lasting value for both its member universities and its student-athletes. Currently composed of Oregon State University and Washington State University, the league will welcome seven new full members beginning with the 2026-27 season, including Boise State University, Colorado State University, California State University, Fresno, Gonzaga University, San Diego State University, Texas State University and Utah State University.
```
System Engineer
Infrastructure engineer job in Santa Rosa, CA
Systems Engineer - Video Intelligence Infrastructure - San Francisco
About the Company
A Series A Funded start-up who already have millions in recurring revenue are building next-generation AI infrastructure for video intelligence are looking for a Systems Engineer to join their team.
What You'll Be Doing:
Design and engineer systems that handle compute, scheduling, and orchestration of complex ML + ETL pipelines
Optimize hyper-fast distributed systems running at the scale of thousands of GPUs
Build systems that process video data quickly, reliably, and cost-effectively at internet scale
Develop robust internal tooling and CI/CD pipelines for rapid ML team iteration
Focus on system uptime and performance optimization for mission-critical infrastructure
What We're Looking For:
3+ years building foundational data infrastructure
Experience designing and maintaining pipelines that process petabytes of data
Strong background developing CI/CD pipelines for ML-focused teams
Excellent coding skills in Go and Python
Independent contributor who leads by example
What's In It For You
Competitive salary up to $250k
Small, high-impact team with significant growth trajectory
Opportunity to work with top AI video labs and collect world-class datasets
Apply now for immediate consideration!
Network Infrastructure Engineer
Infrastructure engineer job in San Francisco, CA
Akkodis is seeking a - “Infrastructure Network Engineer” for a Contract position with a client located San Francisco, CA.
Pay Range: $60-$64/hr. (The rate/salary may be negotiable based on experience, education, geographic location, and other factors.).
Job Overview:
An Infrastructure Network Engineer to join our Engineering team, providing network administration and troubleshooting, network architecture and design guidance and testing pre-release products within our corporate network, throughout the development lifecycle. You are highly analytical and able to use your networking skills as part of our Engineering team to help craft a resilient, sophisticated, and creative network that serves as a showcase for all that Meraki can do. Excellent communication skills, problem-solving abilities, and project management abilities are essential. Those with a strong sense of team, desire to help others, and a passion for IT are encouraged to apply!
Key Responsibilities:
• Work with the team to design and maintain the “best” example of a Meraki/Cisco network for us to demo to our customers.
• Day-to-day management of Cisco/Meraki's cloud network infrastructure, including design, build and troubleshooting.
• Test pre-release hardware in our environment, providing feedback to Product teams on performance and functionality.
• Configure and set up new and existing Cisco Meraki MX security appliances, Catalyst/MS switches, and Catalyst/MR wireless access points.
• Capture and log traffic to define patterns and optimize the performance of our network.
• Lead the planning, design, and deployment of complex enterprise wireless networks, including Wi-Fi and IoT solutions.
• Conduct wireless site surveys (active, passive, and predictive) to determine optimal access point placement, coverage, and capacity.
• Perform in-depth RF spectrum analysis to identify and mitigate sources of interference that may impact wireless network performance.
• Install, configure, and manage wireless network hardware, including access points.
Qualifications:
• Minimum 5 to 7 years of network engineering experience.
• You are a great team player who is comfortable working in a dynamic environment.
• Strong ability to configure, implement, test, and maintain all LAN/WAN components and connections.
• Advanced experience designing, building and troubleshooting Meraki/Cisco cloud networks.
• Experience implementing voice in a real-world multi-site environment.
• Must be organized, with excellent verbal and written communication skills.
• A drive to contribute to the mission of yourself and your teammates, through the work you contribute.
• Ability to work autonomously and with a team in a fast-paced environment.
Nice-to-Have:
• ECMS, CCNA or higher Cisco certifications
Equal Opportunity Employer/Veterans/Disabled
Benefit offerings available for our associates include medical, dental, vision, life insurance, short-term disability, additional voluntary benefits, an EAP program, commuter benefits, and a 401K plan. Our benefit offerings provide employees the flexibility to choose the type of coverage that meets their individual needs. In addition, our associates may be eligible for paid leave including Paid Sick Leave or any other paid leave required by Federal, State, or local law, as well as Holiday pay where applicable. Disclaimer: These benefit offerings do not apply to client-recruited jobs and jobs that are direct hires to a client.
To read our Candidate Privacy Information Statement, which explains how we will use your information, please visit *****************************************
The Company will consider qualified applicants with arrest and conviction records in accordance with federal, state, and local laws and/or security clearance requirements, including, as applicable:
· The California Fair Chance Act
· Los Angeles City Fair Chance Ordinance
· Los Angeles County Fair Chance Ordinance for Employers
· San Francisco Fair Chance Ordinance
(Oracle ERP/PLSQL/SCM) IT Engineer
Infrastructure engineer job in San Jose, CA
Role: (Oracle ERP/PLSQL/SCM) IT Engineer
Type: W2 Contract
At least 9 years of experience with Information Technology
Domain experience in supply chain, order management, shipping, inventory management
Prior Cisco-exp would be a great advantage
Understanding of Oracle PL/SQL, SQL, ERP (OM, Inventory, Shipping, Receiving), Java and various integration technologies and approaches, to be able to comprehend existing as well as design new solutions
Sound data analysis skills
Expertise in grasping the complexity of current state application design, analyze new requirements to design new solution options and develop functional specification & author technical user story for developers and QA team members
Experienced in test case preparation/reviews, supporting QA exercise and issue resolutions
Perform validations of the capabilities once developed to ensure compliance with the business requirements
Perform demos to stakeholders
Ability to work in teams within a diverse/multi-stakeholder environment
Ability to interact effectively across cross-functional teams to iron out integration needs
Experience and desire to work in a Global delivery environment
Strong analytical abilities
Good communication skills
IT Engineer - Oracle ERP & PL/SQL
Infrastructure engineer job in San Jose, CA
IT Engineer - Oracle ERP & PL/SQL (6-Month Contract | On-site, San Jose, CA)
We are seeking a highly experienced and technically skilled IT Engineer with deep expertise in Oracle E-Business Suite (EBS) and PL/SQL development. This is a critical 6-month contract role, requiring candidates to work On-site in San Jose, CA, with the possibility of extension.
Role Details:
Job Title: IT Engineer - Oracle ERP & PL/SQL
Location: San Jose, CA (On-site Required)
Duration: 6 months (with potential for extension)
Implementation Partner: Infosys
End Client: To be disclosed (Major Enterprise Client)
IMPORTANT: Eligibility Requirements
Due to client mandates, only candidates who are legally authorized to work in the US without sponsorship can be considered:
US Citizens (USC) & Green Card Holders (GC) ONLY
Job Description & Technical Requirements
The ideal candidate will have a minimum of 8 years of experience in Oracle EBS technical development and support. You will be responsible for designing, developing, and optimizing solutions within the Oracle database environment.
Core Technical Expertise:
Oracle EBS & PL/SQL Development: Minimum 8 years of hands-on technical development and support experience in Oracle E-Business Suite (EBS) and Oracle PL/SQL.
Database Programming: Expertise in Oracle PL/SQL, SQL, including designing, developing, and maintaining stored procedures, packages, functions, and triggers to support and enhance EBS applications.
Module Understanding: Comprehensive technical understanding of key Oracle EBS modules, specifically Order Management, Shipping, and Inventory.
Customization (RICEW): Proven ability to develop and customize Oracle EBS applications, including Reports, Interfaces, Conversions, Extensions, and Workflows (RICEW objects), using Oracle development tools (Oracle Forms, Oracle Reports, Oracle Workflow, and BI Publisher).
Performance Tuning: Strong skills in optimizing SQL queries and PL/SQL code for maximum performance and efficiency within the Oracle database environment.
Architecture: Strong understanding of Oracle EBS architecture and data models.
Key Responsibilities:
Technical Support: Provide technical support and troubleshooting for Oracle EBS modules, resolving complex issues related to performance, data integrity, and system errors.
Collaboration: Collaborate effectively with functional teams, business analysts, and end-users to translate business requirements into technical specifications and solutions.
Implementation: Implement and configure Oracle E-Business Suite (EBS) applications, ensuring alignment with business processes.
SDLC Participation: Actively participate in all phases of the Software Development Lifecycle (SDLC).
Documentation: Create and maintain comprehensive technical documentation and user guides.
📩 How to Apply:
You can share resumes at ******************** OR Call us on *****************
Distributed Systems Engineer / AI Workloads
Infrastructure engineer job in San Mateo, CA
We are actively searching for a Distributed Systems Engineer to join our team on a permanent basis. In this founding engineer role you will focus on building next-generation data infrastructure for our AI platform. If you have a passion for distributed systems, unified storage, orchestration, and retrieval for AI workloads we would love to speak with you. Our office is located in downtown SF and we collaborate two days a week onsite.
Your Rhythm:
Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security
Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient
Tackle complex challenges in distributed systems, databases, and AI infrastructure
Collaborate with technical leadership to define and refine the product roadmap
Write high-quality, well-tested, and maintainable code
Contribute to the open-source community and engage with developers in the space
Your Vibe:
3+ years of professional distributed database systems experience
Expertise in building and operating scalable, reliable and secure database infrastructure systems
Strong knowledge around distributed compute, data orchestration, distributed storage, streaming infrastructure
Strong knowledge of SQL and NoSQL databases, such as MySQL, Postgres, and MongoDB.
Programming skills in Python
Passion for building developer tools and scalable infrastructure
Available to collaborate onsite 2 days a week
Our Vibe:
Relaxed work environment
100% paid top of the line health care benefits
Full ownership, no micro management
Strong equity package
401K
Unlimited vacation
An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
Distributed Systems Engineer
Infrastructure engineer job in San Francisco, CA
San Francisco, CA (Onsite)
About the Company
A fast-moving AI research group is building the core video data infrastructure used by leading AI labs and major tech companies. The team is small at around fifteen people, nearly all engineers, and recently pivoted to focus exclusively on high-quality video data at massive scale.
The shift has driven significant revenue growth, and they are now planning to expand the team steadily over the next few months.
The culture is straightforward: engineering led, product focused, low ego, and built around people who enjoy ownership. They work in person five days a week in their San Francisco office, moving quickly, solving hard problems, and avoiding micromanagement.
The Role
This position focuses on designing and scaling distributed systems that support huge ML and ETL workloads across petabytes of video. You will own core infrastructure: compute scheduling, orchestration, throughput, reliability, cost efficiency, and the internal tooling that keeps the entire engineering group moving at pace.
The company is beginning to scale its infrastructure footprint aggressively, and this role will become central to that growth. It is a hands-on IC position suited to someone who has operated critical systems before and wants to shape the foundation of a rapidly expanding platform.
What You'll Work On
• Architect and scale distributed systems for large-scale ML and ETL workloads
• Build compute orchestration and scheduling across thousands of GPUs
• Improve uptime, resilience, and execution speed of high-volume data pipelines
• Design pipelines capable of handling petabyte-level video datasets
• Lead the development of CI/CD and internal tooling for fast iteration
• Partner closely with research engineers delivering new video models and algorithms
• Operate in a high-trust environment with strong autonomy and clear ownership
Requirements
• 3+ years building foundational distributed systems or data infrastructure
• Experience running critical systems at significant scale
• Proficient across cloud architectures
• Strong coding experience with Go (preferred) and Python
• Background building or maintaining large-scale pipelines
• Experience with ML-focused CI/CD and automation
• Video domain experience is not required
• Operates as a strong IC who leads through action
• Fully onsite in San Francisco, Monday to Friday
Culture Fit
• Enjoys ambiguity, problem discovery, and self-direction
• Communicates clearly and concisely
• Shows strong intellectual curiosity
• Low ego, collaborative mindset
• Motivated by building core systems in a small, high-caliber team
Red flags include weak communication, low curiosity, or unclear motivation for the domain.
Interview Process
Intro call focused on culture, curiosity, and communication
Technical discussion on background and complexity of past work
Problem-solving session with a research engineer
Onsite research problem and collaboration exercise
MEP Systems Engineer
Infrastructure engineer job in Redwood City, CA
Ready to play a key role in building the future of living? Join Samara in tackling California's housing shortage and enabling people to attain sustainable housing without compromising design or quality. Our flagship product, Backyard, is a fully turnkey, premium accessory dwelling unit (ADU) designed for homeowners and real estate developers. As we expand our offerings and scale our in-house development initiatives, we're at a pivotal moment, redefining homeownership through high-quality, attainable infill housing. Backed by top-tier investors, including Airbnb, Thrive Capital, and 8VC, Samara is positioned for significant growth and market impact.
To support our next phase of growth, we're hiring product-focused engineers to advance and scale the technical foundation of our modular system. These roles go beyond traditional design work-they refine system standards, improve factory repeatability, and ensure our units are code-compliant, manufacturable, and built to the highest standards of quality and performance.
The MEP Systems Engineer will be responsible for the detailed design and implementation of mechanical, electrical, plumbing, and PV systems tailored for modular construction building systems. This role requires a deep understanding of MEP systems combined with practical experience in modular construction. You will collaborate closely with leadership, crossfunctional design and engineering teams to integrate all technical and user experience requirements into our designs to ensure optimal functionality, sustainability, and compliance with all regulations.
What You'll Do
Design and develop integrated MEP systems for our new and existing designs including solar energy systems, including PV and ESS, optimized for prefabricated modular construction
Ensure that solar and energy storage designs align with overall MEP system functionality and building energy requirements
Lead the creation of comprehensive design documents, schematics, component material selections and system layouts, preferably using CAD and BIM software
Provide technical leadership during the installation and commissioning phases to ensure systems meet design specifications and performance standards
Conduct system testing and validation to ensure functionality, efficiency, and safety of both MEP and PV installations
Collaborate closely with installation teams to facilitate seamless and efficient factory and onsite implementation of design
Engage in research and application of the latest technologies and practices in renewable energy and modular construction
Work with program managers and other engineering disciplines to ensure holistic integration of all systems within Samara modular units
What We're Looking For
Modular construction experience in factory builds, multi-mod, stackable and/or other hands on related experience.
Licensed Electrician or Mechanical Contractor -and/or- Bachelor's degree in Mechanical, Electrical, or Energy Systems Engineering, or a related field
Professional Engineering (PE) license preferred
Minimum of 7 years of experience in one of the following: Mechanical, Electrical, Solar and/or Plumbing System design
Comprehensive knowledge of building codes, safety regulations, and sustainability practices relevant to MEP and renewable energy systems
Proficiency in design software such as Onshape, Revit, and/or other BIM methodologies preferred
Excellent problem-solving skills and the ability to adapt designs to changing technological and regulatory landscapes
Strong communication and leadership skills, capable of driving project decisions and managing complex stakeholder relationships
Ability to travel to our factory in Mexico up to 25-40%.
What We Offer
Salary range of $120-160K and performance-based bonuses.
Hybrid work schedule with 3 days each week in our Redwood City office.
Snacks and Lunch on in-office days
Early stage employee equity.
Exceptional health, dental, and vision insurance.
401k eligibility after 6 months.
Flexible PTO policy.
How to Apply
If you're excited to support Samara's mission and have the skills to match, we'd love to hear from you. Please submit your resume and a brief letter of introduction to our team.
Let's build something extraordinary-together.
Lab / Systems Engineer
Infrastructure engineer job in Santa Clara, CA
W2 Contract
Salary Range: $93,600 - $114,400 per year
Duties and Responsibilities:
Monitor, install, and upgrade the software and hardware that form the test lab infrastructure.
Oversee and troubleshoot system performance.
Extend monitoring tools to cover additional use cases.
Expand existing documentation.
Research and diagnose failures.
Requirements and Qualifications:
BS in computer science, IT, or related field. Alternatively, 5 years of experience in a similar role.
Knowledge of Linux System Administration, installation, configuration, and troubleshooting
Knowledge of Bash scripting or similar command-line tools
Knowledge of Python or GoLang
Experience setting up x86 server hardware, BIOS/UEFI, and other server/rack configuration tasks
Nice to Have
Knowledge of IPMI/DRAC/Redfish or similar out-of-band management tools
Familiarity with Infrastructure-as-Code or similar concepts
Experience with configuration management systems like Ansible/Chef/Puppet,/salt
Knowledge of mac OS system Administration, installation, configuration, and troubleshooting
Knowledge of networking
Knowledge of TCP or OSI layers
Knowledge of NAT
Experience with IPv6
Basic understanding of VLANs/Firewalls/etc
Desired Skills and Experience
Linux System Administration, Bash scripting, Python, GoLang, x86 server hardware configuration, BIOS/UEFI configuration, server rack configuration, hardware installation, software installation, system monitoring, system upgrades, performance troubleshooting, monitoring tools development, technical documentation, failure diagnosis, IPMI, DRAC, Redfish, out-of-band management, Infrastructure-as-Code, Ansible, Chef, Puppet, Salt, configuration management, mac OS system administration, networking, TCP/IP, OSI model, NAT, IPv6, VLANs, firewalls, infrastructure management, command-line tools, system configuration, troubleshooting
Bayside Solutions, Inc. is not able to sponsor any candidates at this time. Additionally, candidates for this position must qualify as a W2 candidate.
Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside Solutions, Inc.'s CCPA Privacy Policy at *************************
Network Engineer
Infrastructure engineer job in Fremont, CA
Experience Needed:
CCNP/JNCIP preferable
Experience in new PoP or Datacenter builds.
Understanding of outside plant construction requirements with fiber and/or copper or coax experience preferred.
Solid understanding of fiber-optic technology including cable types, connector types, optic types, patch panels, and optical transport technologies.
Rack and stack; Installation of new racks and devices.
Experience in device installation and testing, software/firmware upgrade, re-bootstrapping and decommission.
Experience in network optimization (re-stripes, migrations, upgrades, swaps, capacity upgrades).
Knowledge of DCN and experience in DCN installation, migrations, and upgrades.
Experience in break fix support
Knowledge of Juniper, Ciena, Infinera, Cisco, Arista and ZPE systems (Network Deployment)
Strong attention to detail with excellent time management and organization skills.
Excellent experience in maintaining documentation.
Demonstrated ability to analyze complex situations and utilize troubleshooting skills, systems and tools, and creative problem-solving abilities under pressure.
Programming and scripting capabilities are strongly desired.
Excellent communication skills.
Ability to work within a global team in a fast-paced and dynamic environment with limited supervision.
Duties:
Deploy, configure, and support a large-scale production and corporate network and server infrastructure in data centers, Point of Presence (POP), edge, backbone, datacenter, and content delivery network (CDN) infrastructure.
Provide onsite network support and expertise on local data center campus and remote support for POP sites while working with local vendor support staff
Schedule and perform network maintenance, repair, and upgrade tasks as needed while limiting the impact on the production network
Calculate and document equipment power requirements and work with Engineering, Facilities Operations, and/or collocation vendors to meet these requirements.
Work closely with Network Engineering, Logistics, and equipment vendors as new equipment and technologies are integrated into the production network.
Work with network provisioning engineers to turn up new circuits and engage and escalate with vendors to troubleshoot out-of-service or faulty circuits.
Queue Management for tasks and incidents, participate in ongoing DC/POP deployment projects
Use internal tools and scripts to configure, monitor, and repair servers and network equipment
Develop internal documentation and conduct training as needed or appropriate
Proactively contribute to documentation, automation and processes as they evolve
Follow, improve, and implement data center and POP best practices
Participate in on call activities and follow escalation process to support the infrastructure 24/7
Staff ML Infrastructure Engineer
Infrastructure engineer job in San Francisco, CA
Staff / Lead ML Infrastructure Engineer
San Francisco, CA - Onsite
Salary - Over market average + equity
We are building one of the world's leading generative video and multimodal AI platforms, and we're looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.
What You'll Own
Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
High-Throughput Compute Systems: Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
Production Reliability for Generative Models: Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.
What You've Done
Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
Strong engineering fundamentals in Python, Go, or equivalent languages.
Previous exposure to ML training pipelines-especially systems that handle heavy video, multimodal, or high-dimensional data.
Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.
Nice to Have
Experience with video processing systems, large-scale media pipelines, or streaming architectures.
Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
Background working in high-growth AI startups or research-focused environments.
Security and compliance considerations for models that generate or process user content.
Why Join
Shape the underlying platform powering one of the most advanced generative video systems in the world.
Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.
System Engineer
Infrastructure engineer job in San Mateo, CA
Systems Engineer - Video Intelligence Infrastructure - San Francisco
About the Company
A Series A Funded start-up who already have millions in recurring revenue are building next-generation AI infrastructure for video intelligence are looking for a Systems Engineer to join their team.
What You'll Be Doing:
Design and engineer systems that handle compute, scheduling, and orchestration of complex ML + ETL pipelines
Optimize hyper-fast distributed systems running at the scale of thousands of GPUs
Build systems that process video data quickly, reliably, and cost-effectively at internet scale
Develop robust internal tooling and CI/CD pipelines for rapid ML team iteration
Focus on system uptime and performance optimization for mission-critical infrastructure
What We're Looking For:
3+ years building foundational data infrastructure
Experience designing and maintaining pipelines that process petabytes of data
Strong background developing CI/CD pipelines for ML-focused teams
Excellent coding skills in Go and Python
Independent contributor who leads by example
What's In It For You
Competitive salary up to $250k
Small, high-impact team with significant growth trajectory
Opportunity to work with top AI video labs and collect world-class datasets
Apply now for immediate consideration!
Distributed Systems Engineer / AI Workloads
Infrastructure engineer job in Fremont, CA
We are actively searching for a Distributed Systems Engineer to join our team on a permanent basis. In this founding engineer role you will focus on building next-generation data infrastructure for our AI platform. If you have a passion for distributed systems, unified storage, orchestration, and retrieval for AI workloads we would love to speak with you. Our office is located in downtown SF and we collaborate two days a week onsite.
Your Rhythm:
Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security
Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient
Tackle complex challenges in distributed systems, databases, and AI infrastructure
Collaborate with technical leadership to define and refine the product roadmap
Write high-quality, well-tested, and maintainable code
Contribute to the open-source community and engage with developers in the space
Your Vibe:
3+ years of professional distributed database systems experience
Expertise in building and operating scalable, reliable and secure database infrastructure systems
Strong knowledge around distributed compute, data orchestration, distributed storage, streaming infrastructure
Strong knowledge of SQL and NoSQL databases, such as MySQL, Postgres, and MongoDB.
Programming skills in Python
Passion for building developer tools and scalable infrastructure
Available to collaborate onsite 2 days a week
Our Vibe:
Relaxed work environment
100% paid top of the line health care benefits
Full ownership, no micro management
Strong equity package
401K
Unlimited vacation
An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
Distributed Systems Engineer
Infrastructure engineer job in Fremont, CA
San Francisco, CA (Onsite)
About the Company
A fast-moving AI research group is building the core video data infrastructure used by leading AI labs and major tech companies. The team is small at around fifteen people, nearly all engineers, and recently pivoted to focus exclusively on high-quality video data at massive scale.
The shift has driven significant revenue growth, and they are now planning to expand the team steadily over the next few months.
The culture is straightforward: engineering led, product focused, low ego, and built around people who enjoy ownership. They work in person five days a week in their San Francisco office, moving quickly, solving hard problems, and avoiding micromanagement.
The Role
This position focuses on designing and scaling distributed systems that support huge ML and ETL workloads across petabytes of video. You will own core infrastructure: compute scheduling, orchestration, throughput, reliability, cost efficiency, and the internal tooling that keeps the entire engineering group moving at pace.
The company is beginning to scale its infrastructure footprint aggressively, and this role will become central to that growth. It is a hands-on IC position suited to someone who has operated critical systems before and wants to shape the foundation of a rapidly expanding platform.
What You'll Work On
• Architect and scale distributed systems for large-scale ML and ETL workloads
• Build compute orchestration and scheduling across thousands of GPUs
• Improve uptime, resilience, and execution speed of high-volume data pipelines
• Design pipelines capable of handling petabyte-level video datasets
• Lead the development of CI/CD and internal tooling for fast iteration
• Partner closely with research engineers delivering new video models and algorithms
• Operate in a high-trust environment with strong autonomy and clear ownership
Requirements
• 3+ years building foundational distributed systems or data infrastructure
• Experience running critical systems at significant scale
• Proficient across cloud architectures
• Strong coding experience with Go (preferred) and Python
• Background building or maintaining large-scale pipelines
• Experience with ML-focused CI/CD and automation
• Video domain experience is not required
• Operates as a strong IC who leads through action
• Fully onsite in San Francisco, Monday to Friday
Culture Fit
• Enjoys ambiguity, problem discovery, and self-direction
• Communicates clearly and concisely
• Shows strong intellectual curiosity
• Low ego, collaborative mindset
• Motivated by building core systems in a small, high-caliber team
Red flags include weak communication, low curiosity, or unclear motivation for the domain.
Interview Process
Intro call focused on culture, curiosity, and communication
Technical discussion on background and complexity of past work
Problem-solving session with a research engineer
Onsite research problem and collaboration exercise
Staff ML Infrastructure Engineer
Infrastructure engineer job in Santa Rosa, CA
Staff / Lead ML Infrastructure Engineer
San Francisco, CA - Onsite
Salary - Over market average + equity
We are building one of the world's leading generative video and multimodal AI platforms, and we're looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.
What You'll Own
Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
High-Throughput Compute Systems: Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
Production Reliability for Generative Models: Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.
What You've Done
Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
Strong engineering fundamentals in Python, Go, or equivalent languages.
Previous exposure to ML training pipelines-especially systems that handle heavy video, multimodal, or high-dimensional data.
Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.
Nice to Have
Experience with video processing systems, large-scale media pipelines, or streaming architectures.
Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
Background working in high-growth AI startups or research-focused environments.
Security and compliance considerations for models that generate or process user content.
Why Join
Shape the underlying platform powering one of the most advanced generative video systems in the world.
Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.
System Engineer
Infrastructure engineer job in Fremont, CA
Systems Engineer - Video Intelligence Infrastructure - San Francisco
About the Company
A Series A Funded start-up who already have millions in recurring revenue are building next-generation AI infrastructure for video intelligence are looking for a Systems Engineer to join their team.
What You'll Be Doing:
Design and engineer systems that handle compute, scheduling, and orchestration of complex ML + ETL pipelines
Optimize hyper-fast distributed systems running at the scale of thousands of GPUs
Build systems that process video data quickly, reliably, and cost-effectively at internet scale
Develop robust internal tooling and CI/CD pipelines for rapid ML team iteration
Focus on system uptime and performance optimization for mission-critical infrastructure
What We're Looking For:
3+ years building foundational data infrastructure
Experience designing and maintaining pipelines that process petabytes of data
Strong background developing CI/CD pipelines for ML-focused teams
Excellent coding skills in Go and Python
Independent contributor who leads by example
What's In It For You
Competitive salary up to $250k
Small, high-impact team with significant growth trajectory
Opportunity to work with top AI video labs and collect world-class datasets
Apply now for immediate consideration!