Data engineer jobs in Pleasanton, CA - 12,263 jobs
Staff Data Scientist
Insight Global
Data engineer job in Sunnyvale, CA
JOB TITLE: Staff Data Scientist
DURATION: Full-time
Key Responsibilities:
Lead the design, development, testing, and global deployment of large-scale time series forecasting models (including regression models and state-of-the-art time-series-specific models such as N-BEATS and PatchTST) to support complex retail and e-commerce hierarchies. Introduce causal modeling approaches to conduct impact analysis for future forecasts.
Continuously enhance forecasting strategies by incorporating advanced machine learning architectures, including RNNs (sequence modeling), CNNs (temporal feature extraction), and Attention-based mechanisms to improve accuracy, scalability, and robustness in time series forecasting.
Advance causal modeling frameworks to quantify event impacts and integrate causal insights into forward-looking forecasts.
Build and maintain experimentation pipelines (A/B testing, quasi-experiments, multi-armed bandits) for evaluating causal impacts of interventions.
Mentor junior scientists, review research and production code, and ensure reproducibility and scalability in pipelines.
Collaborate with engineering to implement forecasting + optimization systems in production (Airflow, Astronomer, Spark/Ray).
Act as technical lead on multiple projects, balancing research rigor with business delivery.
Must Have Requirements:
Strong foundation in Causal Inference, Statistical Analysis, and advanced Machine Learning methods
Hands-on experience with a wide range of ML techniques, with a deep understanding of their advantages and limitations across different scenarios.
Ability to integrate statistical expertise with machine learning methods to maximize the value and interpretability of ML solutions.
Proficiency in Python, SQL, PyTorch, Spark/Ray, and stats/econometrics libraries.
Experience deploying ML systems at scale on cloud platforms (GCP/Azure).
Plusses:
Publications and open-source contributions spanning the full spectrum of modern Machine Learning, from Statistical Learning (e.g., Bayesian modeling, causal inference, high-dimensional statistical methods) to Deep Learning (e.g., convolutional and transformer-based architectures) and Reinforcement Learning (e.g., dynamic programming, policy gradient methods).
Exposure to ML observability: drift detection, retraining triggers, and causality-informed monitoring.
Time series experience: the role involves time series work, but it can be learned on the job
Job Description:
Chance to work on financial data for complex problems/challenges.
Use LLMs and GenAI architectures to build and deploy state-of-the-art GenAI systems.
Our team collaborates closely with Finance teams to enhance financial planning and strategic decision-making through cutting-edge data-driven solutions.
We specialize in a range of initiatives that provide actionable insights into trends and patterns and leverage Generative AI (GenAI) to produce concise, insightful summaries that empower decision-makers.
By integrating these innovative approaches, we strive to drive efficiency, accuracy, and impactful outcomes in financial operations.
About Team:
Our team works closely with our US stores and eCommerce business to better serve customers by empowering team members, stores, and merchants with technological innovation. From groceries and entertainment to sporting goods and crafts, we offer an extensive selection that our customers value, whether they shop online, through one of our mobile apps, or in-store. Focus areas include customers, stores and employees, in-store service, merchant tools, merchant data science, and search and personalization.
$107k-155k yearly est. 2d ago
Data Scientist - All Levels (Manager, Principal, Staff)
Holistic Partners, Inc.
Data engineer job in Sunnyvale, CA
Job Title: Data Scientist - All Levels (Manager, Principal, Staff)
Bentonville, AR (relocation package available only for Bentonville)
Duration: Full Time
Interview Process:
Pre-screening
Three virtual client rounds (plus an additional round for Manager level)
Multiple interviews can be scheduled in one day
Positions Available (Exclusive):
1 Manager of Data Science
1 Principal Engineer of Data Science
3 Staff Engineers of Data Science
Core Must-Have Skills Across All Levels
Generative AI (Gen AI)
Large Language Models (LLM, GPT, BERT, RAG)
Time Series Modeling / Forecasting
Prompting & Fine-Tuning
Managerial/leadership experience per level (Manager → mid-size team, Principal → top of team, Staff → lead-level experience)
Role-Specific Job Description
Manager of Data Science
Responsibilities:
Lead a team of data scientists and ML engineers
Build and deploy Gen AI and traditional ML applications for Walmart Finance
Drive cross-functional projects in NLP, LLM, timeseries forecasting, and recommendation systems
Maintain technical and business knowledge, mentor team members
Requirements:
Experience managing mid-size internal team of DS/MLEs
Experience in NLP, LLMs, timeseries, traditional ML for retail/e-commerce
Strong organizational, interpersonal, and cross-functional collaboration skills
Principal Data Scientist
Responsibilities:
Develop LLM-powered intelligent systems (Q&A, recommendations, autonomous agents)
Collaborate cross-functionally with DS, MLEs, UX, and product teams
Deploy scalable AI/ML solutions, mentor junior DS
Contribute to internal/external AI/ML research
Requirements:
Proven deployment of high-risk NLP applications in production
Strong ML foundations (statistics, optimization, DL)
Advanced proficiency in Python, ML/DS libraries, deep learning frameworks
Familiarity with ML infra (Kubeflow, MLflow, Airflow)
Bonus Skills:
Text-to-SQL/Text-to-Cypher, recommender systems, fine-tuning LLMs, publications in top-tier ML/NLP venues
Staff Engineer of Data Science
Responsibilities:
Lead data-driven projects for Finance using LLMs/Gen AI
Build, deploy, and productionize AI/ML solutions for large-scale applications
Collaborate cross-functionally with stakeholders and business owners
Requirements:
Master's/PhD in Statistics, Analytics, CS, or related field with 5+ years experience
Hands-on experience with LLMs, Gen AI ecosystems (GPT, LLaMA, Mistral, Claude, Gemini, AWS Sonnet)
RAG, AI agent development, LangChain, LangGraph
Strong solution architecture mindset and problem-solving skills
Nice-to-Have:
Big Data/Spark experience, Cloud ML (GCP, Azure), GPU DL training
Behavioral Qualifications:
Adaptable, problem-solver, technically strong, collaborative, able to manage multiple priorities
$107k-155k yearly est. 4d ago
Data Scientist
US Tech Solutions 4.4
Data engineer job in Sunnyvale, CA
Chance to work on financial data for complex problems/challenges. Use LLMs and GenAI architectures to build and deploy state-of-the-art GenAI systems.
Our team collaborates closely with Finance teams to enhance financial planning and strategic decision-making through cutting-edge data-driven solutions. We specialize in a range of initiatives that provide actionable insights into trends and patterns and leverage Generative AI (GenAI) to produce concise, insightful summaries that empower decision-makers. By integrating these innovative approaches, we strive to drive efficiency, accuracy, and impactful outcomes in financial operations.
About Team:
Our team works closely with our US stores and eCommerce business to better serve customers by empowering team members, stores, and merchants with technological innovation. From groceries and entertainment to sporting goods and crafts, we offer an extensive selection that our customers value, whether they shop online, through one of our mobile apps, or in-store. Focus areas include customers, stores and employees, in-store service, merchant tools, merchant data science, and search and personalization.
What you'll do
Lead the design, development, testing, and global deployment of large-scale time series forecasting models (including regression models and state-of-the-art time-series-specific models such as N-BEATS and PatchTST) to support complex retail and e-commerce hierarchies. Introduce causal modeling approaches to conduct impact analysis for future forecasts.
Continuously enhance forecasting strategies by incorporating advanced machine learning architectures, including RNNs (sequence modeling), CNNs (temporal feature extraction), and Attention-based mechanisms to improve accuracy, scalability, and robustness in time series forecasting.
Advance causal modeling frameworks to quantify event impacts and integrate causal insights into forward-looking forecasts.
Build and maintain experimentation pipelines (A/B testing, quasi-experiments, multi-armed bandits) for evaluating causal impacts of interventions.
Mentor junior scientists, review research and production code, and ensure reproducibility and scalability in pipelines.
Collaborate with engineering to implement forecasting + optimization systems in production (Airflow, Astronomer, Spark/Ray).
Act as technical lead on multiple projects, balancing research rigor with business delivery.
What you'll bring
Strong foundation in Time Series Forecasting, Causal Inference, Statistical Analysis, and advanced Machine Learning methods
Hands-on experience with a wide range of ML techniques, with a deep understanding of their advantages and limitations across different scenarios.
Ability to integrate statistical expertise with machine learning methods to maximize the value and interpretability of ML solutions.
Proficiency in Python, SQL, PyTorch, Spark/Ray, and stats/econometrics libraries.
Experience deploying ML systems at scale on cloud platforms (GCP/Azure).
Great to have
Publications and open-source contributions spanning the full spectrum of modern Machine Learning, from Statistical Learning (e.g., Bayesian modeling, causal inference, high-dimensional statistical methods) to Deep Learning (e.g., convolutional and transformer-based architectures) and Reinforcement Learning (e.g., dynamic programming, policy gradient methods).
Exposure to ML observability: drift detection, retraining triggers, and causality-informed monitoring.
Background in retail, e-commerce, or operations analytics.
About US Tech Solutions:
US Tech Solutions is a global staff augmentation firm providing a wide range of talent on-demand and total workforce solutions. To know more about US Tech Solutions, please visit ************************
US Tech Solutions is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Recruiter Details:
Recruiter name: Ajeet Kumar
Recruiter's email id : *****************************
JobDiva ID :: JobDiva # 25-53396
$102k-149k yearly est. 5d ago
Principal Data Scientist
Tarana Wireless Inc. 4.1
Data engineer job in Milpitas, CA
Join the Team That's Redefining Wireless Technology
At Tarana, we're more than just a fast-growing tech company; we're a team of bold innovators on a mission to revolutionize broadband. Our groundbreaking Fixed Wireless Access technology is delivering fiber-class internet speeds worldwide, bridging the digital divide in ways previously thought impossible.
We're looking for an exceptional Principal Data Scientist to join our team and drive innovation through advanced analytics and machine learning. In this lead role, you will shape our data science strategy, mentor talented team members, and deliver high-impact solutions that transform how we leverage data to achieve business objectives.
What You'll Do:
Lead end-to-end data science projects from problem formulation through model development, deployment, and monitoring in production environments.
Design and implement advanced machine learning algorithms, statistical models, and AI solutions that drive business value.
Provide technical leadership and mentorship to junior data scientists and data engineers, fostering a culture of excellence and continuous learning.
Partner with engineering teams to architect scalable data pipelines and ML infrastructure.
Establish best practices for experimentation, model training, model evaluation, and monitoring.
Stay current with the latest advancements in data science, machine learning, and artificial intelligence.
Collaborate with cross‑functional teams, including product, engineering, and business stakeholders, to identify and address key business challenges.
Present findings and recommendations to senior leadership and other stakeholders.
What You'll Need:
Master's or Ph.D. in Computer Science, Statistics, Mathematics, or a related quantitative field.
10+ years of experience in data science or a related field (at least the last 5 years in data science), with a proven track record of leading impactful projects that achieve clear customer outcomes through product delivery.
Deep expertise in machine learning techniques, including supervised and unsupervised learning, deep learning, NLP, computer vision, or reinforcement learning.
Strong programming skills in Python or R, with experience in ML frameworks (TensorFlow, PyTorch, scikit‑learn).
Proficiency in SQL and experience working with large‑scale datasets and distributed computing frameworks (Spark, Dask).
Demonstrated experience deploying models to production and monitoring model performance.
Demonstrated cross‑functional experience with diverse teams across the organization.
Excellent communication skills with the ability to explain complex technical concepts to non‑technical audiences.
Strong business acumen and ability to connect data science work to organizational objectives.
Experience mentoring and developing junior team members.
Bonus Points For:
Experience in cloud platforms (AWS, GCP, Azure) and MLOps practices.
Track record of publishing research or contributing to open‑source projects.
Experience with A/B testing and causal inference methodologies.
Knowledge of data engineering principles and ETL processes.
Leadership experience in managing or leading data science teams.
What we offer:
We don't just build next‑gen wireless technology - we build people.
The salary range for this position is: $200,000 to $260,000
Compensation will be determined based on several factors including, but not limited to: skill set, years of experience and the employee's geographic location.
Tarana provides competitive benefits to employees in this role, including medical, dental, and vision benefits, 401(k) match, flexible time off, and stock options.
Join Tarana and help shape the future of wireless connectivity.
About Us
Tarana's mission is to accelerate the deployment of fast, affordable, and reliable internet access around the world. Through a decade of R&D and over $400M of investment, the Tarana team has created and continues to enhance a suite of next‑generation fixed wireless access (ng FWA) technologies. Its unique ng FWA platform delivers game‑changing advances in broadband economics in mainstream and underserved markets, using both licensed and unlicensed spectrum. Tarana's ng FWA technology has been embraced by more than 300 service providers in 24 countries. Tarana is headquartered in Milpitas, California, with additional research and development in Pune, India. Learn more at ***********************
$200k-260k yearly 4d ago
Data Scientist - Product
Pantera Capital
Data engineer job in San Francisco, CA
As an early member of the Data Science team, you will play a pivotal role in shaping the direction of our product and company. This is an opportunity to join a fast-growing startup and help define the data science function. You will have the chance to contribute to our mission to be the world's most knowledge-centric company by reshaping the future of search and technology.
Responsibilities
Develop data-driven insights from user behavior to inform our product roadmap and accelerate adoption
Come up with hypotheses and validate them by designing, running, and analyzing A/B tests
Determine the right metrics and visualizations to track, from feature-level to company-wide, and implement them in dashboards
Work closely with other functions, such as engineering, product, growth, GTM, design, and user research.
Build tables to make analysis more efficient and make data more accessible
Qualifications
4+ years of experience working as a data scientist or related role
Experience at a fast-growing company working on consumer and/or growth
SQL expertise
Extensive A/B testing experience
Experience building dashboards using BI tools (ex: Omni, Mode, Hex, Looker or similar tools)
Self-starter who takes ownership of their work end-to-end
Comfortable with open-ended problems
Experience with data modeling
Nice to have
Experience as one of the first data scientists at an early/growth stage company
Data engineering and/or analytics engineering experience (ideally with dbt)
Experience with one or more areas: ads, enterprise sales, search, large language models
Experience working with user research to blend qualitative and quantitative insights
Experience with Snowflake (especially as an admin)
Python experience
ML experience
The cash compensation range for this role is $200,000 - $230,000.
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.
To support our rapid expansion, we've raised significant funding from some of the most respected investors in technology. In January 2024, we raised $73.6 million in a Series B round led by IVP, with participation from NVIDIA, Jeff Bezos' investment fund, NEA, Databricks, and other prominent firms. We followed that up with a $62.7 million Series B1 round in April 2024 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.
Equity: In addition to the base salary, equity is part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.
$200k-230k yearly 2d ago
Staff Data Scientist
Cervin
Data engineer job in San Francisco, CA
Arkestro's Predictive Procurement accelerates enterprise spend transformation, using AI and game theory to unlock trapped savings and reduce risk, enabling teams to influence significantly more spend. By combining AI with deep Negotiation Science, Supplier Science, and Process Science, procurement teams can improve win-rates while strengthening supply chain agility.
As a fast-growing tech company, we're looking for builders and innovators: people who thrive in the face of ambiguity and who have a selfless dedication to do whatever it takes to make Arkestro and our customers successful. We believe in egoless execution and we are looking for people who will work together to solve hard problems. If you're excited to help shape our future, contribute to our company culture, and help drive our business forward, there is a tremendous opportunity for you here at Arkestro! See Arkestro in action at arkestro.com.
About this Role
We are looking for an experienced data scientist to join our growing Data Team and help drive the development of cutting-edge data, machine learning, and AI solutions that optimize procurement for our enterprise customers. The Staff Data Scientist will work closely with data engineers, senior stakeholders, and other data scientists to design and implement scalable data pipelines, data algorithms, and ML models. You will take ownership of major data science initiatives such as entity resolution, pricing algorithms, demand forecasting, recommendations, and automating workflow decisions. The role requires strong technical skills, business acumen, and the ability to effectively communicate with both technical and non-technical stakeholders across the organization.
Responsibilities
Leading
Lead cross-team initiatives requiring data science expertise, driving alignment between Engineering, Product, Customer Success, and Data teams
Participate actively in discussions with Product Management, shaping what we build and why based on data insights and technical feasibility
Build and maintain trust with Customer Organization through clear communication of DS capabilities, limitations, and roadmap
Present complex technical work to executive leadership and non-technical stakeholders in ways that drive strategic decisions
Own the data science roadmap and its evolution in partnership with VP Data, DS Leads, and Product leadership
Set clear expectations and clearly communicate project statuses to stakeholders throughout the company
Foster a culture of continuous learning through mentorship of junior data scientists and sharing of best practices
Building
Design, prototype, and productionize machine learning models to optimize procurement processes
Build and deploy AI systems into the procurement workflow, including robust eval frameworks
Collaborate with Data Engineers on ML infrastructure, pipelines, and operations to support model training and deployment
Define and evolve Data Science processes and practices
Keep a startup mindset: ship early and collect feedback sooner rather than later
Stay up-to-date (and keep the team up to date) on latest techniques and tools in ML & AI
Necessary Qualifications
6+ years experience in data science or related quantitative field
Strong expertise in machine learning, statistics, and data modeling
Proficiency (6+ years experience) in Python and SQL
Experience with AWS, Snowflake, dbt or similar cloud data stack
Proven ability to communicate complex technical concepts to non-technical stakeholders and drive consensus across teams with competing priorities
Track record of leading cross-functional initiatives with measurable organizational impact beyond a single team
BS/MS in Computer Science, Statistics, Applied Math or related field
Ideal Qualifications
Demonstrated ability to mentor and elevate technical talent across teams, not just direct reports
PhD in quantitative field
Experience applying ML to procurement/supply chain data
Knowledge of game theory and mechanism design
Technologies and tools we use
Data Motion: Snowflake, dbt, Pinecone, OneSchema
ML and DS: Python, pandas, SQLAlchemy, Pydantic
Workflow and Deployment: GitHub, AWS, Datadog, Code Climate
Documentation and coordination: Jira, Confluence, Slack, Fellow, Lattice, Google Workspace
Pay Range
$190,000 - $220,000 USD
Benefits
Competitive salary and startup equity
Medical, Dental, Vision insurance premiums covered up to 100% (employee only)
401K discretionary matching
Unlimited PTO
A remote-first team with regular opportunities to get together in person for team building, design sprints, and customer visits
Annual budget of $1,000 for learning and professional development
Diverse, inclusive, highly collaborative, and vibrant culture
Equal Opportunity Employer
Arkestro is an equal opportunity employer that is committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.
Disclaimer
Please note this job description may not be inclusive of all assigned duties, responsibilities, or aspects of the job described and that additional tasks may be assigned to the employee from time to time; or the scope of the job may change as necessitated by business demands. Arkestro reserves the right to change duties, responsibilities and activities at any time with or without notice.
$190k-220k yearly 4d ago
Data Scientist - Customer Growth & Experience Analytics
Rippling
Data engineer job in San Francisco, CA
A technology company based in San Francisco is seeking a Data Scientist to enhance customer experience and retention. The ideal candidate will work with cross-functional teams, employing data analysis to influence decision-making and improve customer satisfaction. Applicants should hold a Master's degree and have strong skills in SQL and Python, along with experience in business intelligence tools. This position offers a competitive salary range from $108,000 to $189,000 annually.
$108k-189k yearly 4d ago
Data Scientist
Arcade 4.6
Data engineer job in San Francisco, CA
Arcade is building the world's first AI physical product creation platform, where imagination becomes reality. Our platform lets anyone design, purchase, and sell custom, manufacturable products using natural language and generative AI. We believe everyone should have the power to create physical goods as easily as they post online, and we're building the infrastructure to make that real.
We've raised $42M from a world-class group of investors, including Reid Hoffman, Forerunner Ventures (Kirsten Green), Canaan Partners (Laura Chau), Adverb Ventures (April Underwood), Factorial Funds (Sol Bier), Offline Ventures (Brit Morin), Sound Ventures (Ashton Kutcher), Inspired Capital (Alexa von Tobel), and Torch Capital (Jonathan Keidan). Our angel investors include Elad Gil, Ev Williams, Marissa Mayer, Sara Beykpour, Kayvon Beykpour, Anna Veronika Dorogush, Eugenia Kuyda, David Luan, Sharon Zhou, Kelly Wearstler, Karlie Kloss, Colin Kaepernick, Christy Turlington Burns, and Jeff Wilke.
Arcade is headquartered in San Francisco's Presidio and led by serial entrepreneur Mariam Naficy (Minted, Eve), and a founding team with deep experience in generative AI, design systems, and supply chain. We're pioneering a new category at the intersection of AI, personal expression, and on-demand manufacturing, and we're building fast.
Role Summary
We are seeking an analytical, entrepreneurial Data Scientist to lead Arcade's analytics, experimentation, and data science capabilities. This is a highly cross‑functional role that combines business analytics, product experimentation, and data platform leadership.
The ideal candidate is passionate about data‑driven decision‑making, hands‑on analysis, and building tools that help guide business and product strategy. You will work directly with Arcade's leadership team, including Product, Go‑To‑Market, and Operations, to translate strategic objectives into measurable insights, models, and dashboards.
You will own the company's data stack, ensuring the reliability and scalability of our analytics infrastructure. This role is based at Arcade's headquarters in the Presidio, San Francisco (4-5 days per week on‑site).
Responsibilities
Manage all aspects of Arcade's analytics, testing, and business intelligence initiatives
Partner with business leadership to design and maintain KPIs that govern company performance and resource allocation
Lead the design, implementation, and interpretation of A/B tests to inform product and marketing decisions
Develop and automate dashboards and reporting systems; train business users to self‑serve insights
Build scalable analysis frameworks to support Arcade's AI‑driven product design and commerce platform
Collaborate with Product Management, Go‑To‑Market and Operations teams to develop tools and models that guide business operations
Conduct ad‑hoc strategic and financial analyses to support the CEO and Chief of Staff in company‑wide planning and decision‑making
Ensure best practices in data quality, experimentation methodology, and statistical rigor
Experience
2-3 years of experience in a data science, analytics, or business intelligence role
Proficiency in SQL and Python; strong experience using Jupyter notebooks for analysis and presentation
Demonstrated experience designing and analyzing A/B tests and other causal inference methods
Hands‑on familiarity with Hex, Tableau, or Looker; must have experience building user‑facing dashboards
Experience with BigQuery and the Google Cloud data ecosystem preferred; experience with any major cloud analytics platform required
Strong understanding of data modeling, ETL pipelines, and data warehouse architecture
Experience building predictive or statistical models that directly inform product, marketing, or business outcomes
Excellent communication skills, with the ability to synthesize complex quantitative findings into clear, actionable recommendations for non‑technical stakeholders
Nice to Have
Experience in e‑commerce, marketplaces, or AI‑driven product platforms
Familiarity with dbt, Retool, or modern data stack tools
Experience integrating analytics into workflow tools and operational decision‑making
Strong business intuition and ability to connect analytical insights to strategic priorities
Qualifications
Bachelor's degree in Statistics, Computer Science, Mathematics, Economics, Engineering, or a related quantitative field
Strong analytical and problem‑solving skills with high attention to detail
Demonstrated ability to operate independently and thrive in a fast‑paced startup environment
Proven ability to work cross‑functionally and influence through data and insight
Arcade is an Equal Opportunity Employer committed to inclusion and diversity. We welcome people of different backgrounds, experiences, abilities, and perspectives and will consider all qualified applicants for employment in accordance with all state, local, and federal laws.
$123k-171k yearly est. 3d ago
Data Scientist - TV Ad Effectiveness & Modeling
EDO (Entertainment Data Oracle), Inc.
Data engineer job in San Francisco, CA
A leading data analytics firm in Los Angeles is seeking a Data Scientist to enhance their AdEngage platform and measure ad effectiveness. The ideal candidate has over 3 years of experience in coding with Python, model tuning, and SQL proficiency. Offering a competitive salary and flexible time off, this role supports a hybrid working environment with a focus on innovation in advertising analytics.
$108k-155k yearly est. 6d ago
Principal Data Scientist, Healthcare AI & ML
Tend
Data engineer job in San Francisco, CA
A healthcare technology company in San Francisco is seeking a Principal Data Scientist to develop predictive models aimed at optimizing patient outcomes and operational efficiency. This role requires over 7 years of experience in data analysis, with hands-on expertise in Python and machine learning. The ideal candidate will work with emerging AI technologies to improve healthcare delivery. Competitive compensation is offered, along with substantial health benefits and a positive work culture.
$108k-155k yearly est. 6d ago
Applied Product Data Scientist - Growth & Experiments
OpenAI 4.2
Data engineer job in San Francisco, CA
A leading AI research firm in California seeks a Data Scientist to enhance product development through data-driven insights. The role involves defining metrics, executing A/B tests, and collaborating with teams to answer data questions. Ideal candidates will have strong SQL and Python skills and a track record of navigating ambiguous environments. The position offers a hybrid work model with relocation assistance.
$124k-169k yearly est. 4d ago
Principal Data Scientist, AI & Anomaly Detection
Cisco Systems 4.8
Data engineer job in San Francisco, CA
A leading global technology company in San Francisco seeks a senior software engineer with expertise in anomaly detection and machine learning. You will lead projects to develop scalable cloud-based solutions, collaborating with teams and mentoring others in advanced AI techniques. A PhD or equivalent experience is essential, along with proven leadership in cloud architecture. The role offers a competitive salary and extensive benefits including paid time off and flexible working arrangements.
$128k-161k yearly est. 3d ago
Data Scientist
Dyneti Technologies
Data engineer job in San Mateo, CA
About Us
At Dyneti, we believe digital payments should be seamless and secure. That's why we built DyScan, a software library that allows digital merchants to prevent fraud and improve conversion by taking a photo of a credit card.
Dyneti was founded by a fraud prevention expert from Uber, and has raised funding from an exceptional lineup of investors, including Y Combinator. We've processed hundreds of millions of credit card scans around the world, and our customers include Fortune 100 companies and some of the fastest growing tech unicorns.
Job Overview
Join Dyneti as a founding member of the technical team. You will have the opportunity to work closely with the CEO and engineering to track emerging fraud trends, translate insights into product improvements, and build and launch your ideas. This role sits at the intersection of data, engineering, and real-world fraud strategy and is critical to shaping how DyScan evolves to stay ahead of increasingly sophisticated attackers.
We're Looking for Someone Who
Enjoys significant ownership over user-facing product
Can work independently
Is comfortable turning research papers into product
Has 2+ years experience with data science or has a PhD in a STEM field
Has an interest in software engineering
Builds with a product-first approach. You move quickly from analysis to action and care about measurable outcomes
Dyneti is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
$108k-155k yearly est. 3d ago
AI Data Scientist
Peppermill 4.4
Data engineer job in San Francisco, CA
Job Type: Full-time
Department: Software Development/Engineering
Reports To: CTO
About the Role:
We are looking for a talented and driven AI Data Scientist to join our dynamic team working to make AI accessible to all. In this role, you will help us make better business decisions based on our data. The ideal candidate will have a working knowledge of statistics, mathematics, and data science programming languages (e.g., SQL, R, Python). Your primary responsibilities will be performing statistical analyses, running custom SQL queries, and identifying patterns and trends that can improve our products' and services' efficiency and usability. You will also be expected to maintain and improve our data infrastructure and tools (design, develop, and maintain web applications and platforms), working across both front-end and back-end systems. You will collaborate with cross-functional teams to deliver scalable and innovative solutions that are challenging to build but have a large impact.
This is an exciting opportunity to work in a fast-paced environment and contribute to building innovative applications that make a meaningful impact.
Key Responsibilities:
Work with stakeholders to help define the product roadmap and ML/AI toolsets
Work with developers to build, deploy, and maintain data management systems and back-end data infrastructure for our business intelligence pipeline
Help define and create tools for customers to use data visualizations, reports, dashboards, and data audits
Proactively participate in developing scalable Business Intelligence (BI) tools and predictive analytics solutions using a variety of techniques ranging from data aggregation to data mining.
Prepare presentations and reports of statistical concepts and research results related to efficiency initiatives to be shared with a non-statistical audience and senior stakeholders.
Understand customer needs of a variety of industries and help build and design toolsets for customers.
Perform other duties as assigned.
Qualifications and Skills:
Required:
Bachelor's degree in data science, data analytics, or related field
6-10 years of experience in data science with expert-level Python, SQL, and statistical modelling experience
Proven experience as an AI Data Scientist or similar role
Proficiency in programming languages (Python, R)
Proficiency in machine learning (TensorFlow)
Knowledge of data science toolkits - NumPy
Working knowledge of statistical models and business intelligence
Familiarity with cloud-based infrastructure
Experience in developing LLM systems, RAG pipelines, and ML pipelines
Proficiency in Git/GitHub, and Docker
Proficiency in database design and management (SQL/NoSQL)
Analytical thinking for dissecting complex data and extracting valuable insights
Problem-solving for systematically approaching problems and devising practical solutions
Must reside in the San Francisco Bay Area
Preferred:
Master's degree or Ph.D. in a quantitative field, such as statistics, computer science, data science, mathematics, or engineering, or a related field - or equivalent experience or education
Solid understanding of statistical modeling, machine learning algorithms, and data mining techniques
Familiarity with real-time data processing frameworks
Experience with fundamental AWS services and concepts
Experience with Queueing systems (RabbitMQ / Kafka / etc.)
Understands and supports MLOps practices
Proficient with Cloud Based Data Science Tools and Processes
Create data visualizations, reports, dashboards, and data audits
Leverage predictive models to optimize customer experiences
Understanding of automated anomaly detection processes/methods
Work closely with product managers, designers, and other developers & engineers to ensure project goals are met
Work across time zones to interact with remote teams
Participate in code reviews, brainstorming sessions, and team stand-ups
Comfortable interacting with Management and Engineering staff to use Data Science to bring ideas and analysis into production products
Analytical skills with an ability to independently evaluate and develop innovative solutions to complex situations
Assist with conducting needs assessment and requirements gathering to design and assist in deploying data analytics solutions.
Perform data manipulation and analytics and translate insights into actionable recommendations to management and customers
Prepare presentations and reports of statistical concepts and research results related to efficiency initiatives to be shared with a non-statistical audience and senior stakeholders
Written and verbal communication skills and presentation skills. Ability to communicate with internal and external customers on issues of moderate to considerable importance, up to and including senior management
Demonstrated ability to foster and maintain relationships with an ability to work as part of a cross functional team
Communication - both verbal and non-verbal - for clearly conveying complex data insights to non-technical audiences
Stay up to date with the latest industry trends and technologies
Propose and implement improvements to the development process
Curiosity for exploring new data processing methodologies and tools
Adaptability for adopting new data science technologies and methodologies
We are interested in every qualified candidate who is eligible to work in the United States. However, we are not able to sponsor visas.
Ready to Join Us?
If you're passionate about software development, thrive in dynamic environments, and want to work on impactful projects, we would love to hear from you! Apply today to be part of our team and help shape the future of technology.
How to Apply (resume required - must reside in the San Francisco Bay Area)
$100k-137k yearly est. 6d ago
Data Scientist - AI, Experiments, & Equity
Sierra 4.4
Data engineer job in San Francisco, CA
A leading AI-focused company in San Francisco is seeking an experienced data scientist. The role involves driving data strategy to enhance user engagement and product experience. Ideal candidates have extensive experience in data science, strong technical proficiency in Python and SQL, and a collaborative mindset. This position offers flexible paid time off, medical benefits, and an inclusive work environment.
$108k-147k yearly est. 6d ago
Data Scientist (MMM)
Data Freelance Hub 4.5
Data engineer job in San Bruno, CA
This role is for a Data Scientist (MMM) on a 6‑month contract, paying $50+/hour, remote in the U.S. Requires a Ph.D., 3+ years in data analytics, and experience with MMM, digital marketing, SQL, Python, and cloud technologies.
Responsibilities
Perform hands‑on coding to retrieve and analyze large datasets using Python and SQL
Integrate disparate data sources and leverage state‑of‑the‑art analytics best practices to deliver integrated, actionable insights to partners and stakeholders
Manage and streamline the data extraction process with great attention to detail
Assess the potential usefulness, validity, and rigor of new data sources
Work with a cross‑functional team to ensure that the quality of the data is of the highest standard
Help with media mix models to connect the impact of marketing tactics and business short‑term and long‑term outcomes
Qualifications
Ph.D. degree in statistics/mathematics, engineering, computer science, economics, or a related field
3+ years of industry experience in a data, analytics, and data science role
Experience with advertising, measurement, and/or digital marketing analytics
Experience with advertising technology platforms, Ad servers, DSPs, DMPs, etc.
Proficient coding skills (SQL/Python/R)
Deep knowledge of relational database capabilities and experience with big data technologies (Hive/Hadoop)
Proficient BI/BA data visualization tools (Tableau, Power BI, ThoughtSpot, Looker, etc.)
Experience working with Marketing Mix Modeling (MMM) and/or Multi‑Touch Attribution Models (MTA)
Experience with modeling and machine learning
Experience with cloud technologies such as GCP, AWS, and Azure
Experience in integrating, structuring, and analyzing large amounts of data from diverse sources
Strong project management skills, attention to detail, and documentation skills
Strong written and verbal communication with the ability to communicate complex technical topics clearly to a range of audiences and tell a story that provides insight into the business
Passion for working in a fast‑paced agile environment
A collaborative mindset and sense of curiosity
Experience with predictive modeling algorithms and optimization techniques
Experience applying statistics to business problems
About FocusKPI
FocusKPI is a data science and technology firm specializing in predictive analytics practice and methodologies. Founded in 2010, FocusKPI, Inc. (FocusKPI) is a U.S. company headquartered in Silicon Valley, California with an East Coast office in Boston, Massachusetts.
85 Great Portland Street, London, England, W1W 7LT
$50 hourly 3d ago
Staff Machine Learning Data Engineer
Backflip 3.7
Data engineer job in San Francisco, CA
Mechanical design, the work done in CAD, is the rate-limiter for progress in the physical world. However, there are only 2-4 million people on Earth who know how to CAD. But what if hundreds of millions could? What if creating something in the real world were as easy as imagining the use case, or sketching it on paper?
Backflip is building a foundation model for mechanical design: unifying the world's scattered engineering knowledge into an intelligent, end-to-end design environment. Our goal is to enable anyone to imagine a solution and hit “print.”
Founded by a second-time CEO in the same space (first company: Markforged), Backflip combines deep industry insight with breakthrough AI research. Backed by a16z and NEA, we raised a $30M Series A and built a deeply technical, mission-driven team.
We're building the AI foundation that tomorrow's space elevators, nanobots, and spaceships will be built in.
If you're excited to define the next generation of hard tech, come build it with us.
The Role
We're looking for a Staff Machine Learning Data Engineer to lead and build the data pipelines powering Backflip's foundation model for manufacturing and CAD.
You'll design the systems, tools, and strategies that turn the world's engineering knowledge - text, geometry, and design intent - into high-quality training data.
This is a core leadership role within the AI team, driving the data architecture, augmentation, and evaluation that underpin our model's performance and evolution.
You'll collaborate with Machine Learning Engineers to run data-driven experiments, analyze results, and deliver AI products that shape the future of the physical world.
What You'll Do
Architect and own Backflip's ML data pipeline, from ingestion to processing to evaluation.
Define data strategy: establish best practices for data augmentation, filtering, and sampling at scale.
Design scalable data systems for multimodal training (text, geometry, CAD, and more).
Develop and automate data collection, curation, and validation workflows.
Collaborate with MLEs to design and execute experiments that measure and improve model performance.
Build tools and metrics for dataset analysis, monitoring, and quality assurance.
Contribute to model development through insights grounded in data, shaping what, how, and when we train.
Who You Are
You've built and maintained ML data pipelines at scale, ideally for foundation or generative models, that shipped into production in the real world.
You have deep experience with data engineering for ML, including distributed systems, data extraction, transformation, and loading, and large-scale data processing (e.g. PySpark, Beam, Ray, or similar).
You're fluent in Python and experienced with ML frameworks and data formats (Parquet, TFRecord, HuggingFace datasets, etc.).
You've developed data augmentation, sampling, or curation strategies that improved model performance.
You think like both an engineer and an experimentalist: curious, analytical, and grounded in evidence.
You collaborate well across AI development, infra, and product, and enjoy building the data systems that make great models possible.
You care deeply about data quality, reproducibility, and scalability.
You're excited to help shape the future of AI for physical design.
Bonus points if:
You are comfortable working with a variety of complex data formats, e.g. for 3D geometry kernels or rendering engines.
You have an interest in math, geometry, topology, rendering, or computational geometry.
You've worked in 3D printing, CAD, or computer graphics domains.
Why Backflip
This is a rare opportunity to own the data backbone of a frontier foundation model, and help define how AI learns to design the physical world.
You'll join a world-class, mission-driven team operating at the intersection of research, engineering, and deep product sense, building systems that let people design the physical world as easily as they imagine it.
Your work will directly shape the performance, capability, and impact of Backflip's foundation model, the core of how the world will build in the future.
Let's build the tools the future will be made in.
A leading gaming technology company in San Francisco is seeking a Software Developer to join its innovative team. The ideal candidate will have 6 months to 2 years of experience in Golang and/or Java, with a passion for developing services that power games. Key responsibilities include articulating technical solutions and driving improvements in operational excellence. A Bachelor's degree in Computer Science or equivalent experience is preferred. Join an inclusive team dedicated to redefining gaming!
$123k-178k yearly est. 4d ago
Principal Data Scientist
US Tech Solutions 4.4
Data engineer job in Sunnyvale, CA
What you'll do...
Join Client as a Principal Data Scientist to lead the development and deployment of advanced analytical models that drive strategic business decisions.
This role requires expertise in machine learning, statistical analysis, and coding to design scalable solutions addressing complex business challenges.
The position involves collaborating across functions to translate data insights into actionable strategies, mentoring team members, and ensuring high-quality data sourcing and validation.
The ideal candidate will apply innovative techniques to enhance data-driven decision-making and contribute to Client's continued growth and operational excellence.
About the team:
The Data Science team employs advanced analytical and modeling techniques to address complex business challenges.
They specialize in model assessment, validation, and deployment using tools such as Python, SQL, and machine learning frameworks.
Collaborating closely with business stakeholders, the team translates requirements into actionable insights and data-driven solutions.
Emphasizing rigorous statistical methods and data visualization, they ensure clear communication of findings.
The team promotes continuous learning and innovation, guiding members in best practices and emerging technologies to deliver impactful results aligned with business goals.
What you'll do:
Develop and implement advanced machine learning models to address complex business challenges and drive data-informed decisions.
Lead exploratory data analysis, feature engineering, and statistical modeling to extract actionable insights from data sources.
Collaborate with cross-functional teams to translate business requirements into scalable data science solutions aligned with strategic objectives.
Oversee model validation, tuning, deployment, and lifecycle management to ensure accuracy, robustness, and sustainability.
Mentor and guide team members on analytical techniques, coding best practices, and data visualization to enhance overall team capability.
Communicate findings effectively to stakeholders, supporting data-driven decision-making and continuous improvement initiatives.
What you'll bring:
Extensive experience in developing and deploying machine learning models using Python and related frameworks.
Proficiency in advanced statistical analysis and data visualization techniques to derive actionable business insights.
Strong knowledge of data sourcing, quality assessment, and management within complex business environments.
Expertise in model validation, tuning, and lifecycle management to ensure robust and scalable solutions.
Ability to translate business requirements into technical strategies and lead cross-functional teams effectively.
Familiarity with cloud platforms such as AWS SageMaker for scalable model development and deployment.
Demonstrated leadership in mentoring associates and driving continuous improvement in data science practices.
Minimum Qualifications...
Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.
Option 1: Bachelor's degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 5 years' experience in an analytics-related field.
Option 2: Master's degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 3 years' experience in an analytics-related field.
Option 3: 7 years' experience in an analytics or related field
Preferred Qualifications...
Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.
Data science, machine learning, optimization models
PhD in Machine Learning, Computer Science, Information Technology, Operations Research, Statistics, Applied Mathematics, Econometrics
Publications or active peer reviewer in related journals or conferences
Successful completion of one or more assessments in Python, Spark, Scala, or R
Using open source frameworks (for example, scikit-learn, TensorFlow, torch)
We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly.
The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Client's accessibility standards and guidelines for supporting an inclusive culture.
About US Tech Solutions:
US Tech Solutions is a global staff augmentation firm providing a wide range of talent on-demand and total workforce solutions. To know more about US Tech Solutions, please visit ************************
US Tech Solutions is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Ensures effective and efficient operations through conducting operations analyses (i.e. operational effectiveness and capacity utilization), and recommends improvements.
Details
Job ID-26-00519
$102k-149k yearly est. 3d ago
Lead Data Scientist, Generative AI & Anomaly Detection
Cisco Systems 4.8
Data engineer job in San Jose, CA
A leading technology company is seeking an experienced engineer to join their team focused on building intelligent ML systems for observability. The ideal candidate will have a PhD in Computer Science or a related field, along with significant experience in cloud-based architectures like AWS or Azure. Responsibilities include leading the development of machine learning algorithms and enhancing AI features across platforms. This position offers an attractive salary and extensive benefits, including flexible vacation policies and a comprehensive insurance package.
How much does a data engineer earn in Pleasanton, CA?
The average data engineer in Pleasanton, CA earns between $93,000 and $183,000 annually. This compares to the national average data engineer range of $80,000 to $149,000.
Average data engineer salary in Pleasanton, CA
$131,000
What are the biggest employers of Data Engineers in Pleasanton, CA?
The biggest employers of Data Engineers in Pleasanton, CA are: